What's new in Azure AI Translator?
Bookmark this page to stay up to date with release notes, feature enhancements, and our newest documentation.
Translator is a language service that enables users to translate text and documents, helps entities expand their global outreach, and supports preservation of at-risk and endangered languages.
Translator service supports language translation for more than 100 languages. If your language community is interested in partnering with Microsoft to add your language to Translator, contact us via the Translator community partner onboarding form.
May 2024
A single API is now available for both asynchronous batch and synchronous single document translation operations.
February 2024
The Document translation API now supports two translation operations:
Asynchronous Batch document translation supports asynchronous processing of multiple documents and files. The batch translation process requires an Azure Blob storage account with containers for your source and translated documents.
Synchronous document translation supports synchronous processing of single file translations. The file translation process doesn't require an Azure Blob storage account. The final response contains the translated document and is returned directly to the calling client.
September 2023
- Translator service has text, document translation, and container language support for the following 18 languages:
Language | Code | Cloud – Text Translation and Document Translation | Containers – Text Translation | Description |
---|---|---|---|---|
chiShona | sn |
✔ | ✔ | The official language of Zimbabwe with more than 8 million native speakers. |
Hausa | ha |
✔ | ✔ | The most widely used language in West Africa with more than 150 million speakers worldwide. |
Igbo | ig |
✔ | ✔ | The principal native language of the Igbo people of Nigeria with more than 44 million speakers. |
Kinyarwanda | rw |
✔ | ✔ | The national language of Rwanda with more than 12 million speakers primarily in East and Central Africa. |
Lingala | ln |
✔ | ✔ | One of four official languages of the Democratic Republic of the Congo with more than 60 million speakers. |
Luganda | lug |
✔ | ✔ | A major language of Uganda with more than 5 million speakers. |
Nyanja | nya |
✔ | ✔ | Nynaja, also known as Chewa, is spoken mainly in Malawi and has more than 2 million native speakers. |
Rundi | run |
✔ | ✔ | Rundi, also known as Kirundi, is the national language of Burundi and has more than 6 million native speakers. |
Sesotho | st |
✔ | ✔ | Sesotho, also know as Sotho, is the national and official language of Lesotho, one of 12 official languages of South Africa, and one of 16 official languages of Zimbabwe. It has more than 5.6 native speakers. |
Sesotho sa Leboa | nso |
✔ | ✔ | Sesotho, also known as Northern Sotho, is the native language of more than 4.6 million people in South Africa. |
Setswana | tn |
✔ | ✔ | Setswana, also known as Tswana, is an official language of Botswana and South Africa and has more than 5 million speakers. |
Xhosa | xh |
✔ | ✔ | An official language of South Africa and Zimbabwe, Xhosa has more than 20 million speakers. |
Yoruba | yo |
✔ | ✔ | The principal native language of the Yoruba people of West Africa, it has more than 50 million speakers. |
Konkani | gom |
✔ | ✔ | The official language of the Indian state of Goa with more than 7 million speakers worldwide. |
Maithili | mai |
✔ | ✔ | One of the 22 officially recognized languages of India and the second most spoken language in Nepal. It has more than 20 million speakers. |
Sindhi | sd |
✔ | ✔ | Sindhi is an official language of the Sindh province of Pakistan and the Rajasthan state in India. It has more than 33 million speakers worldwide. |
Sinhala | si |
✔ | ✔ | One of the official and national languages of Sri Lanka, Sinhala has more than 16 million native speakers. |
Lower Sorbian | dsb |
✔ | Currently, not supported in containers | A West Slavic language spoken primarily in eastern Germany. It has approximately 7,000 speakers. |
July 2023
Note
As of July 2023, Azure AI services encompass all of what were previously known as Cognitive Services and Azure Applied AI Services. There are no changes to pricing. The names Cognitive Services and Azure Applied AI continue to be used in Azure billing, cost analysis, price list, and price APIs. There are no breaking changes to application programming interfaces (APIs) or SDKs.
- Document Translation REST API v1.1 is now Generally Available (GA).
June 2023
Documentation updates
- The Document Translation SDK overview is now available to provide guidance and resources for the .NET/C# and Python
SDK
s. - The Document Translation SDK quickstart is now available for the C# and Python programming languages.
May 2023
Announcing new releases for Build 2023
Text Translation SDK (preview)
The Text translation SDK
s are now available in public preview for C#/.NET, Java, JavaScript/TypeScript, and Python programming languages.
- To learn more, see Text translation SDK overview.
- To get started, try a Text Translation SDK quickstart using a programming language of your choice.
Microsoft Translator V3 Connector (preview)
The Translator V3 Connector is now available in public preview. The connector creates a connection between your Translator Service instance and Microsoft Power Automate enabling you to use one or more prebuilt operations as steps in your apps and workflows. To learn more, see the following documentation:
February 2023
Document Translation in Language Studio is now available for Public Preview. The feature provides a no-code user interface to interactively translate documents from local or Azure Blob Storage.
November 2022
Custom Translator stable GA v2.0 release
Custom Translator version v2.0 is generally available and ready for use in your production applications!
June 2022
Document Translation stable GA 1.0.0 release
Document Translation .NET and Python client-library SDK
s are now generally available and ready for use in production applications!
Version 1.0.0 (GA)
2022-06-07
README
Changelog/Release History
Package (NuGet)
SDK reference documentation
May 2022
Document Translation support for scanned PDF documents
- Document Translation uses optical character recognition (OCR) technology to extract and translate text in scanned PDF document while retaining the original layout.
April 2022
Text and document translation support for Faroese
- Translator service has text and document translation language support for Faroese, a Germanic language originating on the Faroe Islands. The Faroe Islands are a self-governing region within the Kingdom of Denmark located between Norway and Iceland. Faroese is descended from Old West Norse spoken by Vikings in the Middle Ages.
Text and document translation support for Basque and Galician
- Translator service has text and document translation language support for Basque and Galician. Basque is a language isolate, meaning it isn't related to any other modern language and is spoken in parts of northern Spain and southern France. Galician is spoken in northern Portugal and western Spain. Both Basque and Galician are official languages of Spain.
March 2022
Text and document translation support for Somali and Zulu languages
- Translator service has text and document translation language support for Somali and Zulu. The Somali language, spoken throughout Africa, has more than 21 million speakers and is in the Cushitic branch of the Afroasiatic language family. The Zulu language has 12 million speakers and is recognized as one of South Africa's 11 official languages.
February 2022
Text and document translation support for Upper Sorbian,
- Translator service has text and document translation language support for Upper Sorbian. The Translator team works tirelessly to preserve indigenous and endangered languages around the world. Language data provided by the Upper Sorbian language community was instrumental in introducing this language to Translator.
Text and document translation support for Inuinnaqtun and Romanized Inuktitut
- Translator service has text and document translation language support for Inuinnaqtun and Romanized Inuktitut. Both are indigenous languages that are essential and treasured foundations of Canadian culture and society.
January 2022
Custom Translator portal (v2.0) public preview
The Custom Translator portal (v2.0) is now in public preview and includes significant changes that makes it easier to create your custom translation systems.
To learn more, see our Custom Translator documentation and try our quickstart for step-by-step instructions.
October 2021
Text and document support for more than 100 languages
- Translator service adds text and document language support for the following languages:
- Bashkir. A Turkic language spoken by approximately 1.4 million native speakers. It has three regional language groups: Southern, Eastern, and Northwestern.
- Dhivehi. Also known as Maldivian, it's an Indo-Iranian language primarily spoken in the island nation of Maldives.
- Georgian. A Kartvelian language that is the official language of Georgia. It has approximately 4 million speakers.
- Kyrgyz. A Turkic language that is the official language of Kyrgyzstan.
- Macedonian (Cyrillic). An Eastern South Slavic language that is the official language of North Macedonia. It has approximately 2 million people.
- Mongolian (Traditional). Traditional Mongolian script is the first writing system created specifically for the Mongolian language. Mongolian is the official language of Mongolia.
- Tatar. A Turkic language used by speakers in modern Tatarstan closely related to Crimean Tatar and Siberian Tatar but each belongs to different subgroups.
- Tibetan. It has nearly 6 million speakers and can be found in many Tibetan Buddhist publications.
- Turkmen. The official language of Turkmenistan. It's similar to Turkish and Azerbaijani.
- Uyghur. A Turkic language with nearly 15 million speakers spoken primarily in Western China.
- Uzbek (Latin). A Turkic language that is the official language of Uzbekistan. It has 34 million native speakers.
These additions bring the total number of languages supported in Translator to 103.
August 2021
Text and document translation support for literary Chinese
- Azure AI Translator has text and document language support for literary Chinese. Classical or literary Chinese is a traditional style of written Chinese used by traditional Chinese poets and in ancient Chinese poetry.
June 2021
Document Translation client libraries for C#/.NET and Python—now available in prerelease
May 2021
Document Translation ― now generally available
- Feature release: Translator's Asynchronous batch translation feature is generally available. Document Translation is designed to translate large files and batch documents with rich content while preserving original structure and format. You can also use custom glossaries and custom models built with Custom Translator to ensure your documents are translated quickly and accurately.
Translator service available in containers
- New release: Translator service is available in containers as a gated preview. Submit an online request for approval to get started. Containers enable you to run several Translator service features in your own environment and are great for specific security and data governance requirements. For more information, See Install and run Translator containers (preview)
February 2021
Document Translation public preview
- New release: Asynchronous batch translation is available as a preview feature of the Translator Service. Preview features are still in development and aren't meant for production use. They're made available on a "preview" basis so customers can get early access and provide feedback. Document Translation enables you to translate large documents and process batch files while still preserving the original structure and format. See Microsoft Translator blog: Introducing Document Translation
Text and document translation support for nine added languages
Translator service has text and document translation language support for the following languages:
- Albanian. An isolate language unrelated to any other and spoken by nearly 8 million people.
- Amharic. An official language of Ethiopia spoken by approximately 32 million people. It's also the liturgical language of the Ethiopian Orthodox church.
- Armenian. The official language of Armenia with 5-7 million speakers.
- Azerbaijani. A Turkic language spoken by approximately 23 million people.
- Khmer. The official language of Cambodia with approximately 16 million speakers.
- Lao. The official language of Laos with 30 million native speakers.
- Myanmar. The official language of Myanmar, spoken as a first language by approximately 33 million people.
- Nepali. The official language of Nepal with approximately 16 million native speakers.
- Tigrinya. A language spoken in Eritrea and northern Ethiopia with nearly 11 million speakers.
January 2021
Text and document translation support for Inuktitut
- Translator service has text and document translation language support for Inuktitut, one of the principal Inuit languages of Canada. Inuktitut is one of eight official Aboriginal languages in the Northwest Territories.
November 2020
Custom Translator V2 is generally available
- New release: Custom Translator V2 upgrade is fully available to the generally available (GA). The V2 platform enables you to build custom models with all document types (training, testing, tuning, phrase dictionary, and sentence dictionary). See Microsoft Translator blog: Custom Translator pushes the translation quality bar closer to human parity.
October 2020
Text and document translation support for Canadian French
- Translator service has text and document translation language support for Canadian French. Canadian French and European French are similar to one another and are mutually understandable. However, there can be significant differences in vocabulary, grammar, writing, and pronunciation. Over 7 million Canadians (20 percent of the population) speak French as their first language.
September 2020
Text and document translation support for Assamese and Axomiya
- Translator service has text and document translation language support for Assamese also knows as Axomiya. Assamese / Axomiya is primarily spoken in Eastern India by approximately 14 million people.
August 2020
Introducing virtual networks and private links for translator
- New release: Virtual network capabilities and Azure private links for Translator are generally available (GA). Azure private links allow you to access Translator and your Azure hosted services over a private endpoint in your virtual network. You can use private endpoints for Translator to allow clients on a virtual network to securely access data over a private link. See Microsoft Translator blog: Virtual Networks and Private Links for Translator are generally available
Custom Translator upgrade to v2
- New release: Custom Translator V2 phase 1 is available. The newest version of Custom Translator rolls out in two phases to provide quicker translation and quality improvements, and allow you to keep your training data in the region of your choice. See Microsoft Translator blog: Custom Translator: Introducing higher quality translations and regional data residency
Text and document translation support for two Kurdish regional languages
- Northern (Kurmanji) Kurdish (15 million native speakers) and Central (Sorani) Kurdish (7 million native speakers). Most Kurdish texts are written in Kurmanji and Sorani.
Text and document translation support for two Afghan languages
- Dari (20 million native speakers) and Pashto (40 - 60 million speakers). The two official languages of Afghanistan.
Text and document translation support for Odia
- Odia is a classical language spoken by 35 million people in India and across the world. It joins Bangla, Gujarati, Hindi, Kannada, Malayalam, Marathi, Punjabi, Tamil, Telugu, Urdu, and English as the 12th most used language of India supported by Microsoft Translator.
Feedback
https://aka.ms/ContentUserFeedback.
Coming soon: Throughout 2024 we will be phasing out GitHub Issues as the feedback mechanism for content and replacing it with a new feedback system. For more information see:Submit and view feedback for