Google Translate
Google Translate
Machine translation | |
Available in | 105 languages, seebelow |
Owner | |
Website | |
Commercial | Yes |
Registration | Optional |
Users | Over 200 million people daily |
Launched | April 28, 2006 (asstatistical machine translation)[1]November 15, 2016 (asneural machine translation)[2] |
Current status | Active |
Developer(s) | |
Stable release(s) | |
Platform | |
Size | 26.59MB(Android) 76.7 MB (iOS) |
Type | Machine translation |
Website |
Google Translate is a free multilingual machine translation service developed by Google, to translate text. It offers a website interface, mobile apps for Android and iOS, and an API that helps developers build browser extensions and software applications. Google Translate supports over 100 languages at various levels and as of May 2017, serves over 500 million people daily.
Launched in April 2006 as a statistical machine translation service, it used United Nations and European Parliament transcripts to gather linguistic data. Rather than translating languages directly, it first translates text to English and then to the target language. During a translation, it looks for patterns in millions of documents to help decide on the best translation. Its accuracy has been criticized and ridiculed on several occasions. In November 2016, Google announced that Google Translate would switch to a neural machine translation engine - Google Neural Machine Translation (GNMT) - which translates "whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar". Originally only enabled for a few languages in 2016, GNMT is currently available in 105 languages as of 2019.
Machine translation | |
Available in | 105 languages, seebelow |
Owner | |
Website | |
Commercial | Yes |
Registration | Optional |
Users | Over 200 million people daily |
Launched | April 28, 2006 (asstatistical machine translation)[1]November 15, 2016 (asneural machine translation)[2] |
Current status | Active |
Developer(s) | |
Stable release(s) | |
Platform | |
Size | 26.59MB(Android) 76.7 MB (iOS) |
Type | Machine translation |
Website |
History
Google Translation is a complimentary translation service developed by Google in April 2006.[3] It translates multiple forms of texts and media such as words, phrases and webpages.
Originally Google Translate was released as a Statistical Machine Translation (SMT).[3] Translating the required text into English before translating into the selected language was a mandatory step that it had to take.[3] Since SMT uses predictive algorithms to translate text, it had poor grammatical accuracy. However, Google initially did not hire experts to resolve this limitation due to ever-evolving nature of language.[3]
In January 2010, Google has introduced Android app and iOS version in February 2011 to serve as a portable personal interpreter.[3] As of February 2010, it was integrated into browsers such as Chrome and was able to pronounce the text, automatically recognize words in the picture and spot unfamiliar text and languages.[3]
In May 2014, Google acquired Word Lens to improve the quality of visual and voice translation.[4] It is able to scan text or picture with one's device and have it translated instantly. Moreover, the system automatically identifies foreign languages and translates speech without requiring individuals to tap the mic button whenever speech translation is needed.[4]
In November 2016, Google has transitioned its translating method to a system called “Neural Machine Translation.”[5] It uses Deep Learning techniques to translate the whole sentences at a time and ensures more adequate accuracy of the context.[3]
As of 2018, it translates more than 100 billion words a day.[5]
Functions
Google Translate can translate multiple forms of text and media, which includes text, speech, images, and videos.
Specifically, its functions include:
Written Words Translation
A function that translates written words or text to a foreign language.[6]
Website Translation
A function that translates a whole webpage to selected languages[7]
Document Translation
A function that translates a document uploaded by the users to selected languages.
The documents should be in the form of:.doc,.docx,.odf,.pdf,.ppt,.pptx,.ps,.rtf,.txt,.xls,.xlsx.[7]
Speech Translation
A function that instantly translates spoken language into the selected foreign language.[8]
Mobile App Translation
In 2018, Google Translate has introduced its new feature called “Tap to Translate,” which made instant translation accessible inside any apps without exiting or switching it.[9]
Image Translation
A function that identifies text in a picture taken by the users and translates text on the screen instantly by images.[10]
Handwritten Translation
A function that translates language that are hand written on the phone screen or drew on a virtual keyboard without the support of keyboard.[11]
For most of its features, Google Translate provides the pronunciation, dictionary, and listen to translation.
Additionally, Google Translate has introduced its own Translate app, so translation is available with mobile phone in offline mode.[9]
Features
Google Translate can translate multiple forms of text and media, including text, speech, images, sites, or real-time video, from one language to another.[9][13] It supports over 100 languages at various levels[14] and as of May 2017, serves over 500 million people daily.[15] For some languages, Google Translate can pronounce translated text,[16] highlight corresponding words and phrases in the source and target text, and act as a simple dictionary for single-word input. If "Detect language" is selected, text in an unknown language can be automatically identified. If a user enters a URL in the source text, Google Translate will produce a hyperlink to a machine translation of the website.[17] Users can save translations in a "phrasebook" for later use.[18] For some languages, text can be entered via an on-screen keyboard, through handwriting recognition, or speech recognition.[19][20]
Browser integration
Google Translate is available in some web browsers as an optional downloadable extension that can run the translation engine.[21] In February 2010, Google Translate was integrated into the Google Chrome browser by default, for optional automatic webpage translation.[22]
Mobile apps
The Google Translate app for Android and iOS supports more than 100 languages and can translate 37 languages via photo, 32 via voice in "conversation mode", and 27 via real-time video in "augmented reality mode".[25]
The Android app was released in January 2010, and for iOS on February 8, 2011.[26]
In January 2015, the apps gained the ability to translate physical signs in real time using the device's camera, as a result of Google's acquisition of the Word Lens app.[29][4] The original January launch only supported seven languages, but a July update added support for 20 new languages, and also enhanced the speed of Conversation Mode translations.[31][32]
API
In May 2011, Google announced that the Google Translate API for software developers had been deprecated and would cease functioning.[33] The Translate API page stated the reason as "substantial economic burden caused by extensive abuse" with an end date set for December 1, 2011.[34] In response to public pressure, Google announced in June 2011 that the API would continue to be available as a paid service.[35]
Google Assistant
Google Translate also provides translations for Google Assistant and the devices that Google Assistant runs on such as Google Home and Google Pixel Buds.
Supported languages
The following languages are supported in Google Translate.[14]
Albanian
Amharic
Azerbaijani
Belarusian
Bosnian
Burmese
Catalan
Cebuano
Chichewa
Chinese
Chinese
Corsican
Croatian
Estonian
Filipino
Galician
Georgian
Gujarati
Haitian Creole
Hausa
Hmong
Icelandic
Igbo
Indonesian
Javanese
Kazakh
Khmer
Kurdish (Kurmanji)
Kyrgyz
Lao
Latvian
Lithuanian
Luxembourgish
Macedonian
Malagasy
Malay
Maltese
Maori
Marathi
Mongolian
Nepali
Norwegian (Bokmål)
Pashto
Samoan
Shona
Sindhi
Sinhala
Slovak
Somali
Spanish
Sundanese
Swahili
Tajik
Tongan
Thai
Uzbek
Wolof
Frisian
Xhosa
Yoruba
Zulu
1st stage English to and from German English to and from Spanish English to and from French
2nd stage English to and from Portuguese
3rd stage English to and from Italian
4th stage English to and from Chinese (Simplified) English to and from Japanese English to and from Korean
5th stage (launched April 28, 2006)[1] English to and from Arabic
6th stage (launched December 16, 2006) English to and from Russian
7th stage (launched February 9, 2007) English to and from Chinese (Traditional) Chinese (Simplified to and from Traditional)
8th stage (all 25 language pairs use Google's machine translation system) (launched October 22, 2007) English to and from Dutch English to and from Greek
9th stage English to and from Hindi
10th stage (as of this stage, translation can be done between any two languages, using English as an intermediate step, if needed) (launched May 8, 2008) Bulgarian Croatian Czech Danish Finnish Norwegian Polish Romanian Swedish
11th stage (launched September 25, 2008) Catalan Filipino Hebrew Indonesian Latvian Lithuanian Serbian Slovak Slovene Ukrainian Vietnamese
12th stage (launched January 30, 2009) Albanian Estonian Galician Hungarian Maltese Thai Turkish
13th stage (launched June 19, 2009) Persian
14th stage (launched August 24, 2009) Afrikaans Belarusian Icelandic Irish Macedonian Malay Swahili Welsh Yiddish
15th stage (launched November 19, 2009) The Beta stage is finished.
Users can now choose to have the romanization written for Chinese, Japanese, Korean, Russian, Ukrainian, Belarusian, Bulgarian, Greek, Hindi and Thai. For translations from Arabic, Persian and Hindi, the user can enter a Latin transliteration of the text and the text will be transliterated to the native script for these languages as the user is typing. The text can now be read by a text-to-speech program in English, Italian, French and German.
16th stage (launched January 30, 2010) Haitian Creole
17th stage (launched April 2010) Speech program launched in Hindi and Spanish.
18th stage (launched May 5, 2010) Speech program launched in Afrikaans, Albanian, Catalan, Chinese (Mandarin), Croatian, Czech, Danish, Dutch, Finnish, Greek, Hungarian, Icelandic, Indonesian, Latvian, Macedonian, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Swahili, Swedish, Turkish, Vietnamese and Welsh (based on eSpeak)[39]
19th stage (launched May 13, 2010)[40] Armenian Azerbaijani Basque Georgian Urdu
20th stage (launched June 2010) Provides romanization for Arabic.
21st stage (launched September 2010) Allows phonetic typing for Arabic, Greek, Hindi, Persian, Russian, Serbian and Urdu. Latin[41]
22nd stage (launched December 2010) Romanization of Arabic removed.
Spell check added.
For some languages, Google replaced text-to-speech synthesizers from eSpeak's robot voice to native speaker's nature voice technologies made by SVOX[42] (Chinese, Czech, Danish, Dutch, Finnish, Greek, Hungarian, Norwegian, Polish, Portuguese, Russian, Swedish, Turkish). Also the old versions of French, German, Italian and Spanish. Latin uses the same synthesizer as Italian. Speech program launched in Arabic, Japanese and Korean.
23rd stage (launched January 2011) Choice of different translations for a word.
24th stage (launched June 2011) 5 new Indic languages (in alpha) and a transliterated input method:[43] Bengali Gujarati Kannada Tamil Telugu
25th stage (launched July 2011) Translation rating introduced.
26th stage (launched January 2012) Dutch male voice synthesizer replaced with female.
Elena by SVOX replaced the Slovak eSpeak voice.
Transliteration of Yiddish added.
27th stage (launched February 2012) Speech program launched in Thai. Esperanto[44]
28th stage (launched September 2012) Lao
29th stage (launched October 2012) Transliteration of Lao added.
30th stage (launched October 2012) New speech program launched in English.
31st stage (launched November 2012) New speech program in French, Spanish, Italian and German.
32nd stage (launched March 2013) Phrasebook added.
33rd stage (launched April 2013) Khmer
34th stage (launched May 2013) Bosnian Cebuano Hmong Javanese Marathi
35th stage (launched May 2013) 16 additional languages can be used with camera-input: Bulgarian, Catalan, Croatian, Danish, Estonian, Finnish, Hungarian, Indonesian, Icelandic, Latvian, Lithuanian, Norwegian, Romanian, Slovak, Slovenian and Swedish.
36th stage (launched December 2013) Hausa Igbo Maori Mongolian Nepali Punjabi Somali Yoruba Zulu
37th stage (launched June 2014) Definition of words added.
38th stage (launched December 2014) Burmese Chewa Kazakh Malagasy Malayalam Sinhalese Sotho Sundanese Tajik Uzbek
39th stage (launched October 2015) Transliteration of Arabic restored.
40th stage (launched November 2015) Aurebesh
41st stage (launched February 2016) Aurebesh removed.
Speech program launched in Bengali. Amharic Corsican Hawaiian Kurdish (Kurmanji) Kyrgyz Luxembourgish Pashto Samoan Scottish Gaelic Shona Sindhi[47] West Frisian Xhosa[48][49][50][51]
42nd stage (launched September 2016) Speech program launched in Ukrainian.
43rd stage (launched December 2016) Speech program launched in Khmer and Sinhala.
44th stage (launched June 2018) Speech program launched in Malayalam, Telugu, Marathi, and Myanmar (Burmese).
45th stage (launched September 2019) Speech program launched in Urdu, Kannada, and Gujarati.
46th stage (launched December 2019) Speech program launched in Uzbek, Ahmaric, Somali, Slovenian, Azerbaijani, Assamese, Basque, Belarusian, Bulgarian, Corsican, Cebuano, Dari, Bhutanese (Dzongkha), Egyptian, Frisian, Galician, Georgian, Guarani, Haitian Creole, Hausa, Hawaiian, Hebrew, Hmong, Irish, Igbo, Kazakh, Kurdish (Kurmanji), Kurdish (Sorani), Kinyarwanda, Kyrgyz, Lao, Lithuanian, Luxembourgish, Michif, Malagasy, Malay, Maori, Mongolian, Nyanja, Odia, Oriya, Pashto, Polabian, Persian, Punjabi, Romansch, Samoan, Scottish Gaelic, Shanghainese, Southern Sotho, Shona, Saka, Sicilian, Sindhi, Slovenian, Somali, Tajik, Tamazight, Tatar, Tongan, Tibetan, Turkmen, Uighur, Udmurt, Wolof, Xhosa, Yiddish, Yoruba, and Zulu.
47th stage (launched January 2020) Hakka added.
Method of translation
In April 2006, Google Translate launched with a statistical machine translation engine.[1]
Google Translate does not apply grammatical rules, since its algorithms are based on statistical analysis rather than traditional rule-based analysis. The system's original creator, Franz Josef Och, has criticized the effectiveness of rule-based algorithms in favor of statistical approaches.[52] It is based on a method called statistical machine translation, and more specifically, on research by Och who won the DARPA contest for speed machine translation in 2003. Och was the head of Google's machine translation group until leaving to join Human Longevity, Inc. in July 2014.[53]
According to Och, a solid base for developing a usable statistical machine translation system for a new pair of languages from scratch would consist of a bilingual text corpus (or parallel collection) of more than 150-200 million words, and two monolingual corpora each of more than a billion words.[52] Statistical models from these data are then used to translate between those languages.
To acquire this huge amount of linguistic data, Google used United Nations and European Parliament transcripts.[54][55]
Google Translate does not translate from one language to another (L1 → L2).
Instead, it often translates first to English and then to the target language (L1 → EN → L2).[56]
When Google Translate generates a translation, it looks for patterns in hundreds of millions of documents to help decide on the best translation.
By detecting patterns in documents that have already been translated by human translators, Google Translate makes intelligent guesses as to what an appropriate translation should be.[57]
Before October 2007, for languages other than Arabic, Chinese and Russian, Google Translate was based on SYSTRAN, a software engine which is still used by several other online translation services such as Babel Fish (now defunct). Since October 2007, Google Translate has used proprietary, in-house technology based on statistical machine translation instead.[58][59]
Google Translate Community
Google constantly seeks for volunteers to be a part of its “Translate Community” to translate languages people speak and help improve Google Translate's accuracy.[60] There are two ways to contribute.
First, Google will show a phrase then one should type in the translated version.[60] Second, Google will show all possible translation for a phrase for individuals to click the right translation.[60]
Statistical Machine Translation
Although, Google deployed a new system called “Neural Machine Translation” for better quality translation, there are languages that still use the traditional translation method called “Statistical Machine Translation.”
It is a “rule-based” translation method that utilizes predictive algorithms to guess ways to translate texts in foreign languages.
It aims to translate whole phrases rather than single words then gather overlapping phrases for translation.
Moreover, it also analyzes bilingual text corpora to generate statistical model that translates texts from one language to another.[61]
Google Neural Machine Translation
In September 2016, a research team at Google led by the software engineer Harold Gilchrist announced the development of the Google Neural Machine Translation system (GNMT) to increase fluency and accuracy in Google Translate[2][62] and in November announced that Google Translate would switch to GNMT.
Google Translate's neural machine translation system uses a large end-to-end artificial neural network capable of deep learning,[2][63] in particular, long short-term memory networks.[64][65][66][67] GNMT improves the quality of translation because it uses an example-based machine translation (EBMT) method in which the system "learns from millions of examples."[63] It translates "whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar".[2] GNMT's "proposed architecture" of "system learning" was first tested on over a hundred languages supported by Google Translate.[63] With the end-to-end framework, "the system learns over time to create better, more natural translations."[2] The GNMT network is capable of interlingual machine translation, which encodes the "semantics of the sentence rather than simply memorizing phrase-to-phrase translations",[63][56] and the system did not invent its own universal language, but uses "the commonality found inbetween many languages".[69] GNMT was first enabled for eight languages: to and from English and Chinese, French, German, Japanese, Korean, Portuguese, Spanish and Turkish.[2][62] In March 2017, it was enabled for Hindi, Russian and Vietnamese languages,[70] followed by Indonesian, Bengali, Gujarati, Kannada, Malayalam, Marathi, Punjabi, Tamil and Telugu languages in April.[71]
GNMT has the capability to translate directly from one language to another (L1 → L2), which improves upon the previous versions of Google Translate which first translated to English and then to the target language (L1 → EN → L2).[56] The GNMT system is also capable of Zero-Shot Translation - translating between a language pair (for example, Japanese to Korean) which the "system has never explicitly seen before."[63]
Crowdsourcing
Accuracy
Although Google Translate is not as reliable as human translation, it can provide relatively accurate translation and a gist of foreign language text.
It can translate text with the general use of words and phrases in a consistent manner.[5] Research conducted in 2011 showed that Google Translate got a slightly higher score than the UCLA minimum score for the English Proficiency Exam.[77] Due to its identical choice of words without considering the flexibility of choosing alternative words or expressions, it produces a relatively similar translation to human translation from the perspective of formality, referential cohesion, and conceptual cohesion.[78] Moreover, a number of languages are translated into a sentence structure and sentence length similar to a human translation.[78] Furthermore, Google carried out a test that required native speakers of each language to rate the translation on a scale between 0 and 6, and Google Translate scored 5.43 on average.[5]
Limitations
Due to the differences between languages in complexity and nature, the accuracy varies greatly between languages.[5] Some languages produce better results than others.
Typically, western languages such as English and Spanish are generally accurate, but the accuracy of African languages is often the poorest, followed by Asian and European languages.[80] Moreover, Google Translate performs well especially when English is the target language and the source language is from the European Union, due to the prominence of translated EU Parliament notes. A 2010 analysis indicated that French to English translation is relatively accurate.[81]
However, if the source text is shorter, rule-based machine translations often perform better; this effect is particularly evident in Chinese to English translations.
While edits of translations may be submitted, in Chinese specifically one cannot edit sentences as a whole.
Instead, one must edit sometimes arbitrary sets of characters, leading to incorrect edits.[81] A good example is Russian-to-English.
Formerly one would use Google Translate to make a draft and then use a dictionary and common sense to correct the numerous mistakes.
As of early 2018 Translate is sufficiently accurate to make the Russian Wikipedia accessible to those who can read English.
The quality of Translate can be checked by adding it as an extension to Chrome or Firefox and applying it to the left language links of any Wikipedia article.
It can be used as a dictionary by typing in words.
One can translate from a book by using a scanner and an OCR like Google Drive, but this takes about five minutes per page.
Once Google Translate has deployed a new technology called Neural Machine Translation to translate whole sentences or text block in the context at a time, alternative translations for a word or phrase are no longer available.[6] Moreover, in its Written Words Translation function, there is a word limit on the amount of text that can be translated at once.[6] Therefore, long text should be transferred to a document form and translated through its Document Translate function.[6]
Moreover, machine translation often does not identify the double meanings of a word.[5] A word in a foreign language might have two different meanings in the translated language.
This might lead to mistranslations.
Additionally, grammatical errors remain a major limitation to the accuracy of Google Translate.
Reviews
Shortly after launching the translation service for the first time, Google won an international competition for English–Arabic and English–Chinese machine translation.[83]
Translation mistakes and oddities
Since Google Translate used statistical matching to translate, translated text can often include apparently nonsensical and obvious errors,[84] sometimes swapping common terms for similar but nonequivalent common terms in the other language,[85] or inverting sentence meaning. Novelty websites like Bad Translator and Translation Party have utilized the service to produce humorous text by translating back and forth between multiple languages, similar to the children's game telephone.[86]
Court usage
In 2017, the system Google Translate was used during a court hearing when court officials at Teesside Magistrates' Court failed to book an interpreter for a Chinese defendant.[87]
See also
Apertium
Babel Fish (discontinued; redirects to main Yahoo! site)
Comparison of machine translation applications
Google Dictionary
Google Text-to-Speech
Google Translator Toolkit
Jollo (discontinued)
List of Google products
Microsoft Translator
Reverso
SmartCAT
SYSTRAN
Word Lens (discontinued; merged into Google Translate app)
Yandex.Translate