Arabic Speech-to-Text Comparison

Mistral Voxtral MinivsElevenLabs Scribe v2

Head-to-head comparison based on real production benchmarks with Gulf Arabic callers.

Overview

Mistral Voxtral Mini

Non-functional

Mistral's speech model — completely non-functional for Arabic.

production testedvoxtral-mini-latest

ElevenLabs Scribe v2

Not Recommended

ElevenLabs' realtime STT offering — poor quality and slow for Arabic.

production testedscribe_v2_realtime

Latency

Mistral Voxtral Mini

Avg EOU Delay
N/A
Best Case
N/A
Worst Case
N/A

ElevenLabs Scribe v2

Avg EOU Delay2000ms–2500ms
Best Case2000ms
Worst Case2500ms

Quality

Mistral Voxtral Mini

Non-functional

Produced zero transcriptions for Arabic audio. Tested with and without explicit language parameter.

ElevenLabs Scribe v2

Poor

Described as 'shit quality' in production testing. Not viable for Arabic.

Saudi Arabic

Features

FeatureMistral Voxtral MiniElevenLabs Scribe v2
Multilingual speech recognition (claimed)
Audio understanding
Real-time streaming transcription
Multiple language support
LiveKit inference integration

Pricing

Mistral Voxtral Mini

Free tier
APIMistral API pricing
Usage-basedper request

ElevenLabs Scribe v2

Free tier
StarterIncludes STT credits
$5per month

Streaming & Integration

CapabilityMistral Voxtral MiniElevenLabs Scribe v2
Streaming support
LiveKit plugin
Self-hostable
API styleRESTWebSocket streaming
SDKsPython, Node.jsPython, Node.js

Verdict

Non-functional

Mistral Voxtral Mini

Does not work for Arabic at all. Zero transcriptions produced in testing despite claiming multilingual support.

Choose Mistral Voxtral Mini if you need:

    Pros
    • +Part of Mistral ecosystem
    Cons
    • -Completely non-functional for Arabic
    • -Zero output despite audio processing
    • -Misleading multilingual claims
    Not Recommended

    ElevenLabs Scribe v2

    Poor quality and poor latency for Arabic. Not recommended for any Arabic STT use case.

    Choose ElevenLabs Scribe v2 if you need:

      Pros
      • +LiveKit plugin available
      • +Part of ElevenLabs ecosystem (TTS bundle)
      Cons
      • -Poor Arabic transcription quality
      • -High latency (2-2.5s EOU)
      • -No advantage over better alternatives

      Frequently Asked Questions

      Which has better Arabic transcription quality, Mistral Voxtral Mini or ElevenLabs Scribe v2?

      Mistral Voxtral Mini has a quality rating of 1/5 (Non-functional). Produced zero transcriptions for Arabic audio. Tested with and without explicit language parameter.

      Is Mistral Voxtral Mini or ElevenLabs Scribe v2 better for production voice agents?

      Both providers are viable options. Mistral Voxtral Mini: Does not work for Arabic at all. Zero transcriptions produced in testing despite claiming multilingual support. ElevenLabs Scribe v2: Poor quality and poor latency for Arabic. Not recommended for any Arabic STT use case.

      How does Mistral Voxtral Mini pricing compare to ElevenLabs Scribe v2?

      Mistral Voxtral Mini starts at Usage-based per request (Mistral API pricing). ElevenLabs Scribe v2 starts at $5 per month (Includes STT credits).