Transcription in 98 languages

EKHOS AI runs on top of OpenAI Whisper, one of the most advanced transcription AI models available today. The Whisper model supports 98 languages. The table below lists its accuracy based on the large v3 model, which is used by Expert mode in EKHOS AI.

Note: accuracy ≈ 100 − WER/CER (%). CER (character error rate) values were treated the same for conversion.

The accuracy table was reference form Open AI Whisper

Common Voice 15 — large-v3 (Expert Accuracy) (%)

Language Accuracy (%)
Dutch95.7
Spanish95.3
Korean94.8
Italian94.5
German94.3
Thai94.2
Russian94.2
Portuguese94.1
Polish94.0
Indonesian92.8
Mandarin (TW)91.8
Swedish91.7
Czech91.0
English90.7
Japanese89.7
French89.2
Romanian89.2
Cantonese (CN)89.1
Turkish88.0
Mandarin (CN)87.2
Catalan86.7
Hungarian86.6
Ukrainian86.4
Greek86.3
Bulgarian85.7
Arabic84.9
Serbian84.3
Macedonian84.2
Cantonese (HK)84.1
Latvian83.3
Slovenian83.2
Hindi83.1
Galician82.0
Danish81.9
Urdu79.6
Slovak76.6
Hebrew76.5
Finnish75.4
Azerbaijani75.2
Lithuanian72.4
Estonian70.1
Nynorsk69.3
Welsh68.0
Punjabi65.3
Afrikaans64.0
Persian62.6
Basque61.1
Vietnamese60.1
Bengali59.7
Nepali59.6
Marathi58.8
Belarusian56.3
Kazakh52.4
Armenian51.9
Swahili48.8
Tamil48.5
Albanian44.3

FLEURS — large-v3 (Expert Accuracy) (%)

Language Accuracy (%)
Spanish97.2
Italian97.0
Korean96.9
Portuguese95.9
English95.9
Polish95.4
Catalan95.2
Japanese95.1
German95.1
Russian95.0
Dutch94.8
French94.7
Indonesian93.9
Ukrainian93.6
Turkish93.3
Malay92.7
Swedish92.4
Mandarin92.3
Finnish92.3
Norwegian92.2
Romanian91.8
Thai91.6
Vietnamese91.4
Slovak90.8
Arabic90.4
Czech89.9
Croatian89.2
Greek89.1
Serbian88.4
Danish88.0
Bulgarian87.5
Hungarian87.1
Filipino87.0
Bosnian87.0
Galician86.9
Macedonian85.3
Hindi83.0
Estonian81.9
Slovenian81.7
Tamil81.7
Latvian80.6
Azerbaijani80.3
Urdu79.4
Lithuanian76.3
Hebrew73.9
Welsh71.4
Persian70.6
Icelandic69.6
Kazakh67.6
Afrikaans67.6
Kannada67.0
Marathi65.9
Swahili65.9
Telugu60.7
Maori60.2
Nepali59.8
Armenian57.8
Belarusian57.5
Gujarati52.6
Punjabi51.5
Bengali50.0

Source: WER/CER values visualized for Whisper large-v3 vs large-v2 on Common Voice 15 and FLEURS. Accuracy listed is 100 − reported error rate (WER or CER).

Have suggestions or feedback for EKHOS AI?

We’d love to hear from you! Please email us at feedback@ekhos.ai with your suggestions or ideas for improvement. Your input helps us refine our services and deliver the best experience possible.