ViqusViqus
Navigate
Company
Blog
About Us
Contact
System Status
Enter Viqus Hub

Cohere Launches Open-Source Voice Model – Transcribe

Automatic Speech Recognition AI Model Voice AI Transcribe Cohere Open Source Hugging Face Leaderboard
March 26, 2026
Source: TechCrunch AI
Viqus Verdict Logo Viqus Verdict Logo 5
Steady Progression
Media Hype 6/10
Real Impact 5/10

Article Summary

Cohere unveiled Transcribe, its first voice model, on Thursday. This open-source Automatic Speech Recognition (ASR) model is targeted at users with consumer-grade GPUs, allowing for self-hosting. Transcribe supports 14 languages, including English, French, German, and Spanish, and achieves a WER of 5.42 on the Hugging Face Open ASR leaderboard, outperforming models like Zoom Scribe and Qwen3-ASR-1.7B. Cohere claims a 61% win rate in accuracy tests against other models. Despite competitive performance, the model showed some weaknesses in Portuguese, German and Spanish transcription. The company intends to integrate Transcribe into its North agent orchestration platform and offer it via API and its managed inference platform, Model Valut. This release aligns with the growing popularity of speech recognition models for applications like note-taking and dictation, mirroring the rise of tools like Granola and Wispr Flow. Cohere’s financial growth, with reported $240M ARR in 2025, adds another layer to the significance of this launch.

Key Points

  • Cohere launched Transcribe, its first open-source voice model.
  • Transcribe supports 14 languages and achieves a competitive WER on the Hugging Face Open ASR leaderboard.
  • The model will be integrated into Cohere’s North platform and offered via API and its managed inference platform.

Why It Matters

This launch is moderately significant. While the model’s performance is impressive, it primarily represents an incremental development within the rapidly evolving speech recognition space. The open-source nature and the availability of a self-hosted option could attract developers and smaller businesses looking for cost-effective ASR solutions. However, the overall impact is limited to the developer community and those already invested in similar technology solutions. It doesn’t represent a fundamental shift in how speech is processed.

You might also be interested in