Language and voice support for the Speech service
The following tables summarize language support for speech to text, text to speech, pronunciation assessment, speech translation, speaker recognition, and other Speech service features.
You can also get a list of locales and voices supported for each specific region or endpoint via:
Supported languages
Language support varies by Speech service functionality.
Note
See Speech Containers and Embedded Speech separately for their supported languages.
The table in this section summarizes the locales supported for Speech to text. See the table footnotes for more details.
Additional remarks on Speech to text locales are in the Custom speech section of this article.
Tip
Try out the real-time speech to text tool without having to use any code.
Locale (BCP-47) | Language | Custom speech support |
---|---|---|
af-ZA | Afrikaans (South Africa) | Plain text |
am-ET | Amharic (Ethiopia) | Plain text |
ar-AE | Arabic (United Arab Emirates) | Plain text |
ar-BH | Arabic (Bahrain) | Audio + human-labeled transcript, Plain text |
ar-DZ | Arabic (Algeria) | Audio + human-labeled transcript, Plain text |
ar-EG | Arabic (Egypt) | Audio + human-labeled transcript, Plain text |
ar-IL | Arabic (Israel) | Plain text |
ar-IQ | Arabic (Iraq) | Plain text |
ar-JO | Arabic (Jordan) | Plain text |
ar-KW | Arabic (Kuwait) | Plain text |
ar-LB | Arabic (Lebanon) | Plain text |
ar-LY | Arabic (Libya) | Plain text |
ar-MA | Arabic (Morocco) | Audio + human-labeled transcript, Plain text |
ar-OM | Arabic (Oman) | Plain text |
ar-PS | Arabic (Palestinian Authority) | Plain text |
ar-QA | Arabic (Qatar) | Plain text |
ar-SA | Arabic (Saudi Arabia) | Audio + human-labeled transcript, Plain text, Phrase list |
ar-SY | Arabic (Syria) | Plain text |
ar-TN | Arabic (Tunisia) | Audio + human-labeled transcript, Plain text |
ar-YE | Arabic (Yemen) | Audio + human-labeled transcript, Plain text |
az-AZ | Azerbaijani (Latin, Azerbaijan) | Plain text |
bg-BG | Bulgarian (Bulgaria) | Plain text |
bn-IN | Bengali (India) | Plain text |
bs-BA | Bosnian (Bosnia and Herzegovina) | Plain text |
ca-ES | Catalan | Plain text, Pronunciation |
cs-CZ | Czech (Czechia) | Audio + human-labeled transcript, Plain text, Pronunciation |
cy-GB | Welsh (United Kingdom) | Plain text |
da-DK | Danish (Denmark) | Audio + human-labeled transcript, Plain text, Output format, Pronunciation |
de-AT | German (Austria) | Audio + human-labeled transcript, Plain text, Structured text, Pronunciation |
de-CH | German (Switzerland) | Audio + human-labeled transcript, Plain text, Pronunciation, Phrase list |
de-DE | German (Germany) | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
el-GR | Greek (Greece) | Plain text |
en-AU | English (Australia) | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list |
en-CA | English (Canada) | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list |
en-GB | English (United Kingdom) | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list |
en-GH | English (Ghana) | Audio + human-labeled transcript, Audio, Plain text, Structured text, Pronunciation |
en-HK | English (Hong Kong SAR) | Audio + human-labeled transcript, Audio, Plain text, Output format, Pronunciation |
en-IE | English (Ireland) | Audio + human-labeled transcript, Audio, Plain text, Output format, Pronunciation, Phrase list |
en-IN | English (India) | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
en-KE | English (Kenya) | Audio + human-labeled transcript, Audio, Plain text, Structured text, Pronunciation |
en-NG | English (Nigeria) | Audio + human-labeled transcript, Audio, Plain text, Output format, Pronunciation |
en-NZ | English (New Zealand) | Audio + human-labeled transcript, Audio, Plain text, Output format, Pronunciation |
en-PH | English (Philippines) | Audio + human-labeled transcript, Audio, Plain text, Output format, Pronunciation |
en-SG | English (Singapore) | Audio + human-labeled transcript, Audio, Plain text, Output format, Pronunciation |
en-TZ | English (Tanzania) | Audio + human-labeled transcript, Audio, Plain text, Structured text, Pronunciation |
en-US | English (United States) | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list |
en-ZA | English (South Africa) | Audio + human-labeled transcript, Audio, Plain text, Pronunciation, Phrase list |
es-AR | Spanish (Argentina) | Plain text, Structured text, Pronunciation |
es-BO | Spanish (Bolivia) | Plain text, Structured text, Pronunciation |
es-CL | Spanish (Chile) | Plain text, Structured text, Pronunciation |
es-CO | Spanish (Colombia) | Plain text, Structured text, Pronunciation |
es-CR | Spanish (Costa Rica) | Plain text, Structured text, Pronunciation |
es-CU | Spanish (Cuba) | Plain text, Structured text, Pronunciation |
es-DO | Spanish (Dominican Republic) | Plain text, Structured text, Pronunciation |
es-EC | Spanish (Ecuador) | Plain text, Structured text, Pronunciation |
es-ES | Spanish (Spain) | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
es-GQ | Spanish (Equatorial Guinea) | Plain text, Structured text |
es-GT | Spanish (Guatemala) | Plain text, Structured text, Pronunciation |
es-HN | Spanish (Honduras) | Plain text, Structured text, Pronunciation |
es-MX | Spanish (Mexico) | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
es-NI | Spanish (Nicaragua) | Plain text, Structured text, Pronunciation |
es-PA | Spanish (Panama) | Plain text, Structured text, Pronunciation |
es-PE | Spanish (Peru) | Plain text, Structured text, Pronunciation |
es-PR | Spanish (Puerto Rico) | Plain text, Structured text, Pronunciation |
es-PY | Spanish (Paraguay) | Plain text, Structured text, Pronunciation |
es-SV | Spanish (El Salvador) | Plain text, Pronunciation |
es-US | Spanish (United States)¹ | Plain text, Structured text, Pronunciation, Phrase list |
es-UY | Spanish (Uruguay) | Plain text, Structured text, Pronunciation |
es-VE | Spanish (Venezuela) | Plain text, Structured text, Pronunciation |
et-EE | Estonian (Estonia) | Plain text, Pronunciation |
eu-ES | Basque | Plain text |
fa-IR | Persian (Iran) | Plain text |
fi-FI | Finnish (Finland) | Plain text, Output format, Pronunciation |
fil-PH | Filipino (Philippines) | Audio + human-labeled transcript, Plain text, Pronunciation |
fr-BE | French (Belgium) | Audio + human-labeled transcript, Plain text |
fr-CA | French (Canada)¹ | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
fr-CH | French (Switzerland) | Plain text, Pronunciation |
fr-FR | French (France) | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
ga-IE | Irish (Ireland) | Plain text, Pronunciation |
gl-ES | Galician | Plain text |
gu-IN | Gujarati (India) | Plain text |
he-IL | Hebrew (Israel) | Audio + human-labeled transcript, Plain text |
hi-IN | Hindi (India) | Audio + human-labeled transcript, Plain text, Output format, Phrase list |
hr-HR | Croatian (Croatia) | Plain text, Pronunciation |
hu-HU | Hungarian (Hungary) | Audio + human-labeled transcript, Plain text, Pronunciation |
hy-AM | Armenian (Armenia) | Plain text |
id-ID | Indonesian (Indonesia) | Plain text, Pronunciation, Phrase list |
is-IS | Icelandic (Iceland) | Plain text |
it-CH | Italian (Switzerland) | Audio + human-labeled transcript, Plain text |
it-IT | Italian (Italy) | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
ja-JP | Japanese (Japan) | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list |
jv-ID | Javanese (Latin, Indonesia) | Plain text |
ka-GE | Georgian (Georgia) | Plain text |
kk-KZ | Kazakh (Kazakhstan) | Plain text |
km-KH | Khmer (Cambodia) | Plain text |
kn-IN | Kannada (India) | Plain text |
ko-KR | Korean (Korea) | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list |
lo-LA | Lao (Laos) | Plain text |
lt-LT | Lithuanian (Lithuania) | Plain text, Pronunciation |
lv-LV | Latvian (Latvia) | Plain text, Pronunciation |
mk-MK | Macedonian (North Macedonia) | Plain text |
ml-IN | Malayalam (India) | Plain text |
mn-MN | Mongolian (Mongolia) | Plain text |
mr-IN | Marathi (India) | Plain text |
ms-MY | Malay (Malaysia) | Audio + human-labeled transcript, Plain text |
mt-MT | Maltese (Malta) | Plain text |
my-MM | Burmese (Myanmar) | Plain text |
nb-NO | Norwegian Bokmål (Norway) | Audio + human-labeled transcript, Plain text, Output format |
ne-NP | Nepali (Nepal) | Plain text |
nl-BE | Dutch (Belgium) | Plain text |
nl-NL | Dutch (Netherlands) | Audio + human-labeled transcript, Plain text, Output format, Pronunciation, Phrase list |
pa-IN | Punjabi (India) | Audio + human-labeled transcript |
pl-PL | Polish (Poland) | Audio + human-labeled transcript, Plain text, Output format, Pronunciation, Phrase list |
ps-AF | Pashto (Afghanistan) | Plain text |
pt-BR | Portuguese (Brazil) | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list |
pt-PT | Portuguese (Portugal) | Audio + human-labeled transcript, Plain text, Output format, Pronunciation, Phrase list |
ro-RO | Romanian (Romania) | Plain text, Pronunciation |
ru-RU | Russian (Russia) | Audio + human-labeled transcript, Plain text, Phrase list |
si-LK | Sinhala (Sri Lanka) | Plain text |
sk-SK | Slovak (Slovakia) | Plain text, Pronunciation |
sl-SI | Slovenian (Slovenia) | Plain text, Pronunciation |
so-SO | Somali (Somalia) | Plain text |
sq-AL | Albanian (Albania) | Plain text |
sr-RS | Serbian (Cyrillic, Serbia) | Plain text |
sv-SE | Swedish (Sweden) | Audio + human-labeled transcript, Plain text, Output format, Pronunciation, Phrase list |
sw-KE | Swahili (Kenya) | Audio + human-labeled transcript, Plain text |
sw-TZ | Swahili (Tanzania) | Audio + human-labeled transcript, Plain text |
ta-IN | Tamil (India) | Plain text |
te-IN | Telugu (India) | Plain text |
th-TH | Thai (Thailand) | Audio + human-labeled transcript, Plain text, Phrase list |
tr-TR | Turkish (Türkiye) | Audio + human-labeled transcript, Plain text, Structured text, Output format |
uk-UA | Ukrainian (Ukraine) | Plain text |
ur-IN | Urdu (India) | Audio + human-labeled transcript |
uz-UZ | Uzbek (Latin, Uzbekistan) | Plain text |
vi-VN | Vietnamese (Vietnam) | Audio + human-labeled transcript, Plain text, Phrase list |
wuu-CN | Chinese (Wu, Simplified) | Audio + human-labeled transcript, Plain text |
yue-CN | Chinese (Cantonese, Simplified) | Audio + human-labeled transcript, Plain text |
zh-CN | Chinese (Mandarin, Simplified) | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list |
zh-CN-shandong | Chinese (Jilu Mandarin, Simplified) | Audio + human-labeled transcript, Plain text |
zh-CN-sichuan | Chinese (Southwestern Mandarin, Simplified) | Audio + human-labeled transcript, Plain text |
zh-HK | Chinese (Cantonese, Traditional) | Audio + human-labeled transcript, Plain text, Output format, Phrase list |
zh-TW | Chinese (Taiwanese Mandarin, Traditional) | Audio + human-labeled transcript, Plain text, Phrase list |
zu-ZA | Zulu (South Africa) | Plain text |
¹ The model is bilingual and also supports English.
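If you need to consult this table from code, one option is to keep the rows you care about in a small lookup structure. The following is a minimal sketch, not part of the Speech SDK: the names `CUSTOM_SPEECH_SUPPORT`, `supported_features`, and `locales_supporting` are hypothetical, and the data is only a five-row excerpt copied from the table above.

```python
# Hypothetical lookup built from a small excerpt of the Speech to text table.
# Extend the dictionary with the rows you need; it is not an official data source.
CUSTOM_SPEECH_SUPPORT = {
    "af-ZA": ["Plain text"],
    "de-CH": ["Audio + human-labeled transcript", "Plain text",
              "Pronunciation", "Phrase list"],
    "es-US": ["Plain text", "Structured text", "Pronunciation", "Phrase list"],
    "ja-JP": ["Audio + human-labeled transcript", "Plain text",
              "Structured text", "Output format", "Phrase list"],
    "pa-IN": ["Audio + human-labeled transcript"],
}


def _normalize(locale: str) -> str:
    """Compare locale tags case-insensitively (an assumption of this sketch)."""
    return locale.strip().lower()


# Index keyed by the normalized tag so that "JA-jp" and "ja-JP" match.
_INDEX = {_normalize(tag): frozenset(features)
          for tag, features in CUSTOM_SPEECH_SUPPORT.items()}


def supported_features(locale: str) -> frozenset:
    """Return the customization features listed for a locale, or an empty set."""
    return _INDEX.get(_normalize(locale), frozenset())


def locales_supporting(feature: str) -> list:
    """Return the locales in the excerpt whose row lists the given feature."""
    return sorted(tag for tag, features in CUSTOM_SPEECH_SUPPORT.items()
                  if feature in features)


if __name__ == "__main__":
    print(locales_supporting("Phrase list"))
    print(sorted(supported_features("ja-JP")))
```

An unknown locale returns an empty set rather than raising an error, which keeps callers simple; whether that is the right behavior depends on how strictly you want to validate input.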
Custom speech
To improve Speech to text recognition accuracy, customization is available for some languages and base models. Depending on the locale, you can upload audio + human-labeled transcripts, plain text, structured text, and pronunciation data. By default, plain text customization is supported for all available base models. To learn more about customization, see custom speech.
These are the locales that support the display text format feature: da-DK, de-DE, en-AU, en-CA, en-GB, en-HK, en-IE, en-IN, en-NG, en-NZ, en-PH, en-SG, en-US, es-ES, es-MX, fi-FI, fr-CA, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, nb-NO, nl-NL, pl-PL, pt-BR, pt-PT, sv-SE, tr-TR, zh-CN, zh-HK.
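The locale list above can be checked in code before you rely on the display text format feature. The sketch below is illustrative only: the function names are hypothetical, and suggesting other locales of the same language is a convenience heuristic of this example, not a behavior of the Speech service.

```python
# Locales listed above as supporting the display text format feature.
DISPLAY_TEXT_FORMAT_LOCALES = frozenset("""
da-DK de-DE en-AU en-CA en-GB en-HK en-IE en-IN en-NG en-NZ en-PH en-SG en-US
es-ES es-MX fi-FI fr-CA fr-FR hi-IN it-IT ja-JP ko-KR nb-NO nl-NL pl-PL pt-BR
pt-PT sv-SE tr-TR zh-CN zh-HK
""".split())


def supports_display_text_format(locale: str) -> bool:
    """Return True if the locale appears in the list above."""
    return locale in DISPLAY_TEXT_FORMAT_LOCALES


def same_language_alternatives(locale: str) -> list:
    """Hypothetical helper: list other supported locales that share the same
    primary language subtag (the part before the first hyphen)."""
    language = locale.split("-")[0]
    return sorted(tag for tag in DISPLAY_TEXT_FORMAT_LOCALES
                  if tag.split("-")[0] == language and tag != locale)


if __name__ == "__main__":
    for tag in ("fi-FI", "es-AR"):
        if supports_display_text_format(tag):
            print(f"{tag}: supported")
        else:
            print(f"{tag}: not listed; same-language locales that are: "
                  f"{same_language_alternatives(tag)}")
```

A different regional locale can change recognition behavior, so treat any suggested alternative as a starting point for evaluation rather than a drop-in replacement.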