Overview

Mistral released Voxtral Transcribe 2, a family of two new audio transcription models including an open-source version. The models demonstrate near-instantaneous transcription capability with accurate handling of technical jargon and real-time performance.

Key Facts

  • Open-source model available with Apache-2.0 license - developers can now self-host high-quality transcription without API dependencies
  • Real-time transcription with technical accuracy - handles specialized jargon like Django and WebAssembly instantly
  • Includes speaker diarization functionality - automatically identifies who is speaking when in multi-person recordings
  • Browser-based live demo available - test transcription quality immediately without setup
  • API version with context bias feature - improves accuracy for domain-specific terminology
  • Export options include SRT, JSON, and text formats - seamlessly integrates with video editing and development workflows

Why It Matters

This matters because it democratizes high-quality speech-to-text technology - developers can now access enterprise-level transcription capabilities either through open-source self-hosting or affordable API access, potentially transforming how audio content is processed and made accessible.