Tag Archives: Google Translate

Lost in Translation no more with Word Lens in Japanese

If you don’t speak Japanese, Tokyo can be a confusing and sometimes daunting place to visit. Even if you make it through the complex subway system, you’ll be faced by street signs, menus or products on supermarket shelves that are only in Japanese. 

With Word Lens now available in Japanese, you’ll never have to worry about taking a wrong turn on a busy Shibuya street or ordering something you wouldn't normally eat. 

The Google Translate app already lets you snap a photo of Japanese text and get a translation for it in English. But it’s a whole lot more convenient if you can just point your camera and instantly translate text on the go. With Word Lens, you just need to fire up the Translate app, point your camera at the Japanese text, and the English translations will appear overlaid on your screen—even if you don't have an Internet or data connection. It’s every savvy traveller’s dream! 

Google Translate: Cash only

To turn your smartphone into a powerful instant translation tool for English to Japanese (and vice versa), all you need to do is download the Google Translate app, either on Android or iOS

Source: Translate


Lost in Translation no more with Word Lens in Japanese

If you don’t speak Japanese, Tokyo can be a confusing and sometimes daunting place to visit. Even if you make it through the complex subway system, you’ll be faced by street signs, menus or products on supermarket shelves that are only in Japanese. 

With Word Lens now available in Japanese, you’ll never have to worry about taking a wrong turn on a busy Shibuya street or ordering something you wouldn't normally eat. 

The Google Translate app already lets you snap a photo of Japanese text and get a translation for it in English. But it’s a whole lot more convenient if you can just point your camera and instantly translate text on the go. With Word Lens, you just need to fire up the Translate app, point your camera at the Japanese text, and the English translations will appear overlaid on your screen—even if you don't have an Internet or data connection. It’s every savvy traveller’s dream! 

Google Translate: Cash only

To turn your smartphone into a powerful instant translation tool for English to Japanese (and vice versa), all you need to do is download the Google Translate app, either on Android or iOS

Source: Translate


Zero-Shot Translation with Google’s Multilingual Neural Machine Translation System



In the last 10 years, Google Translate has grown from supporting just a few languages to 103, translating over 140 billion words every day. To make this possible, we needed to build and maintain many different systems in order to translate between any two languages, incurring significant computational cost. With neural networks reforming many fields, we were convinced we could raise the translation quality further, but doing so would mean rethinking the technology behind Google Translate.

In September, we announced that Google Translate is switching to a new system called Google Neural Machine Translation (GNMT), an end-to-end learning framework that learns from millions of examples, and provided significant improvements in translation quality. However, while switching to GNMT improved the quality for the languages we tested it on, scaling up to all the 103 supported languages presented a significant challenge.

In “Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation”, we address this challenge by extending our previous GNMT system, allowing for a single system to translate between multiple languages. Our proposed architecture requires no change in the base GNMT system, but instead uses an additional “token” at the beginning of the input sentence to specify the required target language to translate to. In addition to improving translation quality, our method also enables “Zero-Shot Translation” — translation between language pairs never seen explicitly by the system.
Here’s how it works. Let’s say we train a multilingual system with Japanese⇄English and Korean⇄English examples, shown by the solid blue lines in the animation. Our multilingual system, with the same size as a single GNMT system, shares its parameters to translate between these four different language pairs. This sharing enables the system to transfer the “translation knowledge” from one language pair to the others. This transfer learning and the need to translate between multiple languages forces the system to better use its modeling power.

This inspired us to ask the following question: Can we translate between a language pair which the system has never seen before? An example of this would be translations between Korean and Japanese where Korean⇄Japanese examples were not shown to the system. Impressively, the answer is yes — it can generate reasonable Korean⇄Japanese translations, even though it has never been taught to do so. We call this “zero-shot” translation, shown by the yellow dotted lines in the animation. To the best of our knowledge, this is the first time this type of transfer learning has worked in Machine Translation.

The success of the zero-shot translation raises another important question: Is the system learning a common representation in which sentences with the same meaning are represented in similar ways regardless of language — i.e. an “interlingua”? Using a 3-dimensional representation of internal network data, we were able to take a peek into the system as it translates a set of sentences between all possible pairs of the Japanese, Korean, and English languages.

Part (a) from the figure above shows an overall geometry of these translations. The points in this view are colored by the meaning; a sentence translated from English to Korean with the same meaning as a sentence translated from Japanese to English share the same color. From this view we can see distinct groupings of points, each with their own color. Part (b) zooms in to one of the groups, and part (c) colors by the source language. Within a single group, we see a sentence with the same meaning but from three different languages. This means the network must be encoding something about the semantics of the sentence rather than simply memorizing phrase-to-phrase translations. We interpret this as a sign of existence of an interlingua in the network.

We show many more results and analyses in our paper, and hope that its findings are not only interesting for machine learning or machine translation researchers but also to linguists and others who are interested in how multiple languages can be processed by machines using a single system.

Finally, the described Multilingual Google Neural Machine Translation system is running in production today for all Google Translate users. Multilingual systems are currently used to serve 10 of the recently launched 16 language pairs, resulting in improved quality and a simplified production architecture.

Found in translation: More accurate, fluent sentences in Google Translate

In 10 years, Google Translate has gone from supporting just a few languages to 103, connecting strangers, reaching across language barriers and even helping people find love. At the start, we pioneered large-scale statistical machine translation, which uses statistical models to translate text. Today, we’re introducing the next step in making Google Translate even better: Neural Machine Translation.  

Neural Machine Translation has been generating exciting research results for a few years and in September, our researchers announced Google's version of this technique. At a high level, the Neural system translates whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar. Since it’s easier to understand each sentence, translated paragraphs and articles are a lot smoother and easier to read. And this is all possible because of end-to-end learning system built on Neural Machine Translation, which basically means that the system learns over time to create better, more natural translations.

Today we’re putting Neural Machine Translation into action with a total of eight language pairs to and from English and French, German, Spanish, Portuguese, Chinese, Japanese, Korean and Turkish. These represent the native languages of around one-third of the world's population, covering more than 35% of all Google Translate queries!


NeuralLearning_Translate_Blog_hires.jpg


With this update, Google Translate is improving more in a single leap than we’ve seen in the last ten years combined. But this is just the beginning. While we’re starting with eight language pairs within Google Search  the Google Translate app, and website; our goal is to eventually roll Neural Machine Translation out to all 103 languages and surfaces where you can access Google Translate.

And there’s more coming today too -- Google Cloud Platform, our public cloud service, offers Machine Learning APIs that make it easy for anyone to use our machine learning technology. Today, Google Cloud Platform is also making the system behind Neural Machine Translation available for all businesses through Google Cloud Translation API. You can learn more about it here.

Today’s step towards Neural Machine Translation is a significant milestone for Google Translate, but there’s always more work to do and we’ll continue to learn over time. We’ll also continue to rely on  Translate Community, where language loving multilingual speakers can help share their language by contributing and reviewing translations. We can’t wait for you to start translating and understanding the world just a little bit better.

Source: Translate


Found in translation: More accurate, fluent sentences in Google Translate

In 10 years, Google Translate has gone from supporting just a few languages to 103, connecting strangers, reaching across language barriers and even helping people find love. At the start, we pioneered large-scale statistical machine translation, which uses statistical models to translate text. Today, we’re introducing the next step in making Google Translate even better: Neural Machine Translation.  

Neural Machine Translation has been generating exciting research results for a few years and in September, our researchers announced Google's version of this technique. At a high level, the Neural system translates whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar. Since it’s easier to understand each sentence, translated paragraphs and articles are a lot smoother and easier to read. And this is all possible because of end-to-end learning system built on Neural Machine Translation, which basically means that the system learns over time to create better, more natural translations.

Today we’re putting Neural Machine Translation into action with a total of eight language pairs to and from English and French, German, Spanish, Portuguese, Chinese, Japanese, Korean and Turkish. These represent the native languages of around one-third of the world's population, covering more than 35% of all Google Translate queries!


NeuralLearning_Translate_Blog_hires.jpg


With this update, Google Translate is improving more in a single leap than we’ve seen in the last ten years combined. But this is just the beginning. While we’re starting with eight language pairs within Google Search  the Google Translate app, and website; our goal is to eventually roll Neural Machine Translation out to all 103 languages and surfaces where you can access Google Translate.

And there’s more coming today too -- Google Cloud Platform, our public cloud service, offers Machine Learning APIs that make it easy for anyone to use our machine learning technology. Today, Google Cloud Platform is also making the system behind Neural Machine Translation available for all businesses through Google Cloud Translation API. You can learn more about it here.

Today’s step towards Neural Machine Translation is a significant milestone for Google Translate, but there’s always more work to do and we’ll continue to learn over time. We’ll also continue to rely on  Translate Community, where language loving multilingual speakers can help share their language by contributing and reviewing translations. We can’t wait for you to start translating and understanding the world just a little bit better.

Source: Translate


Found in translation: More accurate, fluent sentences in Google Translate

In 10 years, Google Translate has gone from supporting just a few languages to 103, connecting strangers, reaching across language barriers and even helping people find love. At the start, we pioneered large-scale statistical machine translation, which uses statistical models to translate text. Today, we’re introducing the next step in making Google Translate even better: Neural Machine Translation.  

Neural Machine Translation has been generating exciting research results for a few years and in September, our researchers announced Google's version of this technique. At a high level, the Neural system translates whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar. Since it’s easier to understand each sentence, translated paragraphs and articles are a lot smoother and easier to read. And this is all possible because of end-to-end learning system built on Neural Machine Translation, which basically means that the system learns over time to create better, more natural translations.

Today we’re putting Neural Machine Translation into action with a total of eight language pairs to and from English and French, German, Spanish, Portuguese, Chinese, Japanese, Korean and Turkish. These represent the native languages of around one-third of the world's population, covering more than 35% of all Google Translate queries!


NeuralLearning_Translate_Blog_hires.jpg


With this update, Google Translate is improving more in a single leap than we’ve seen in the last ten years combined. But this is just the beginning. While we’re starting with eight language pairs within Google Search  the Google Translate app, and website; our goal is to eventually roll Neural Machine Translation out to all 103 languages and surfaces where you can access Google Translate.

And there’s more coming today too -- Google Cloud Platform, our public cloud service, offers Machine Learning APIs that make it easy for anyone to use our machine learning technology. Today, Google Cloud Platform is also making the system behind Neural Machine Translation available for all businesses through Google Cloud Translation API. You can learn more about it here.

Today’s step towards Neural Machine Translation is a significant milestone for Google Translate, but there’s always more work to do and we’ll continue to learn over time. We’ll also continue to rely on  Translate Community, where language loving multilingual speakers can help share their language by contributing and reviewing translations. We can’t wait for you to start translating and understanding the world just a little bit better.

Source: Translate


Found in translation: More accurate, fluent sentences in Google Translate

In 10 years, Google Translate has gone from supporting just a few languages to 103, connecting strangers, reaching across language barriers and even helping people find love. At the start, we pioneered large-scale statistical machine translation, which uses statistical models to translate text. Today, we’re introducing the next step in making Google Translate even better: Neural Machine Translation.  

Neural Machine Translation has been generating exciting research results for a few years and in September, our researchers announced Google's version of this technique. At a high level, the Neural system translates whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar. Since it’s easier to understand each sentence, translated paragraphs and articles are a lot smoother and easier to read. And this is all possible because of end-to-end learning system built on Neural Machine Translation, which basically means that the system learns over time to create better, more natural translations.

Today we’re putting Neural Machine Translation into action with a total of eight language pairs to and from English and French, German, Spanish, Portuguese, Chinese, Japanese, Korean and Turkish. These represent the native languages of around one-third of the world's population, covering more than 35% of all Google Translate queries!


NeuralLearning_Translate_Blog_hires.jpg


With this update, Google Translate is improving more in a single leap than we’ve seen in the last ten years combined. But this is just the beginning. While we’re starting with eight language pairs within Google Search  the Google Translate app, and website; our goal is to eventually roll Neural Machine Translation out to all 103 languages and surfaces where you can access Google Translate.

And there’s more coming today too -- Google Cloud Platform, our public cloud service, offers Machine Learning APIs that make it easy for anyone to use our machine learning technology. Today, Google Cloud Platform is also making the system behind Neural Machine Translation available for all businesses through Google Cloud Translation API. You can learn more about it here.

Today’s step towards Neural Machine Translation is a significant milestone for Google Translate, but there’s always more work to do and we’ll continue to learn over time. We’ll also continue to rely on  Translate Community, where language loving multilingual speakers can help share their language by contributing and reviewing translations. We can’t wait for you to start translating and understanding the world just a little bit better.

Source: Translate


A Neural Network for Machine Translation, at Production Scale



Ten years ago, we announced the launch of Google Translate, together with the use of Phrase-Based Machine Translation as the key algorithm behind this service. Since then, rapid advances in machine intelligence have improved our speech recognition and image recognition capabilities, but improving machine translation remains a challenging goal.

Today we announce the Google Neural Machine Translation system (GNMT), which utilizes state-of-the-art training techniques to achieve the largest improvements to date for machine translation quality. Our full research results are described in a new technical report we are releasing today: “Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation” [1].

A few years ago we started using Recurrent Neural Networks (RNNs) to directly learn the mapping between an input sequence (e.g. a sentence in one language) to an output sequence (that same sentence in another language) [2]. Whereas Phrase-Based Machine Translation (PBMT) breaks an input sentence into words and phrases to be translated largely independently, Neural Machine Translation (NMT) considers the entire input sentence as a unit for translation.The advantage of this approach is that it requires fewer engineering design choices than previous Phrase-Based translation systems. When it first came out, NMT showed equivalent accuracy with existing Phrase-Based translation systems on modest-sized public benchmark data sets.

Since then, researchers have proposed many techniques to improve NMT, including work on handling rare words by mimicking an external alignment model [3], using attention to align input words and output words [4] and breaking words into smaller units to cope with rare words [5,6]. Despite these improvements, NMT wasn't fast or accurate enough to be used in a production system, such as Google Translate. Our new paper [1] describes how we overcame the many challenges to make NMT work on very large data sets and built a system that is sufficiently fast and accurate enough to provide better translations for Google’s users and services.
Data from side-by-side evaluations, where human raters compare the quality of translations for a given source sentence. Scores range from 0 to 6, with 0 meaning “completely nonsense translation”, and 6 meaning “perfect translation."
The following visualization shows the progression of GNMT as it translates a Chinese sentence to English. First, the network encodes the Chinese words as a list of vectors, where each vector represents the meaning of all words read so far (“Encoder”). Once the entire sentence is read, the decoder begins, generating the English sentence one word at a time (“Decoder”). To generate the translated word at each step, the decoder pays attention to a weighted distribution over the encoded Chinese vectors most relevant to generate the English word (“Attention”; the blue link transparency represents how much the decoder pays attention to an encoded word).
Using human-rated side-by-side comparison as a metric, the GNMT system produces translations that are vastly improved compared to the previous phrase-based production system. GNMT reduces translation errors by more than 55%-85% on several major language pairs measured on sampled sentences from Wikipedia and news websites with the help of bilingual human raters.
An example of a translation produced by our system for an input sentence sampled from a news site. Go here for more examples of translations for input sentences sampled randomly from news sites and books.
In addition to releasing this research paper today, we are announcing the launch of GNMT in production on a notoriously difficult language pair: Chinese to English. The Google Translate mobile and web apps are now using GNMT for 100% of machine translations from Chinese to English—about 18 million translations per day. The production deployment of GNMT was made possible by use of our publicly available machine learning toolkit TensorFlow and our Tensor Processing Units (TPUs), which provide sufficient computational power to deploy these powerful GNMT models while meeting the stringent latency requirements of the Google Translate product. Translating from Chinese to English is one of the more than 10,000 language pairs supported by Google Translate, and we will be working to roll out GNMT to many more of these over the coming months.

Machine translation is by no means solved. GNMT can still make significant errors that a human translator would never make, like dropping words and mistranslating proper names or rare terms, and translating sentences in isolation rather than considering the context of the paragraph or page. There is still a lot of work we can do to serve our users better. However, GNMT represents a significant milestone. We would like to celebrate it with the many researchers and engineers—both within Google and the wider community—who have contributed to this direction of research in the past few years.

Acknowledgements:
We thank members of the Google Brain team and the Google Translate team for the help with the project. We thank Nikhil Thorat and the Big Picture team for the visualization.

References:
[1] Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation, Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, Łukasz Kaiser, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa, Keith Stevens, George Kurian, Nishant Patil, Wei Wang, Cliff Young, Jason Smith, Jason Riesa, Alex Rudnick, Oriol Vinyals, Greg Corrado, Macduff Hughes, Jeffrey Dean. Technical Report, 2016.
[2] Sequence to Sequence Learning with Neural Networks, Ilya Sutskever, Oriol Vinyals, Quoc V. Le. Advances in Neural Information Processing Systems, 2014.
[3] Addressing the rare word problem in neural machine translation, Minh-Thang Luong, Ilya Sutskever, Quoc V. Le, Oriol Vinyals, and Wojciech Zaremba. Proceedings of the 53th Annual Meeting of the Association for Computational Linguistics, 2015.
[4] Neural Machine Translation by Jointly Learning to Align and Translate, Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio. International Conference on Learning Representations, 2015.
[5] Japanese and Korean voice search, Mike Schuster, and Kaisuke Nakajima. IEEE International Conference on Acoustics, Speech and Signal Processing, 2012.
[6] Neural Machine Translation of Rare Words with Subword Units, Rico Sennrich, Barry Haddow, Alexandra Birch. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016.

Celebrating languages in the European Parliament

Ten years ago when we launched Google Translate, our goal was to break language barriers and to make the world more accessible. Languages shape our identities, culture, how we relate to others and how we communicate. They’re an additional source of cultural wealth, worthy of celebration. 

To mark this important milestone, and thanks to the commitment of MEP Catherine Stihler, we organized a reception in the European Parliament earlier this month. To demonstrate how some of the tools of Google Translate work, artists Donnie Munro and Trail West performed a beautiful and melancholic love song in Scots Gaelic, which was translated into English on Google Translate for the audience of MEPs and their staff, and translators working in the European Parliament.
europarliament.JPG
Donnie Munro singing in Scots Gaelic, with a translation to English on the screen

MEP Stihler stressed the importance of minority languages for local communities at the event, a sentiment shared by her colleague Jordi Sebastia (Co-Chair of the Languages Intergroup), when he said that Europe means diversity.

Google Translate in the European Parliament

As our policy director Lie Junius explained, Google Translate cannot replace the essential work done by the professional translators in the European Parliament. But we do think it can be a tool that can help people understand each other, also in the most difficult of times, such as demonstrated by stories of British families opening up their homes to refugees, using Translate to start their conversations with them.

In the last decade we’ve grown from supporting two languages to 103, and from hundreds of users to more than 500 million people. And we’ll continue to improve Translate.  In February 2016 we announced that we’re adding 13 new languages to Google Translate, including Scots Gaelic, Luxembourgish, and Corsican - covering every single one of the EU member states' official national languages. 

Source: Translate


Celebrating languages in the European Parliament

Ten years ago when we launched Google Translate, our goal was to break language barriers and to make the world more accessible. Languages shape our identities, culture, how we relate to others and how we communicate. They’re an additional source of cultural wealth, worthy of celebration. 

To mark this important milestone, and thanks to the commitment of MEP Catherine Stihler, we organized a reception in the European Parliament earlier this month. To demonstrate how some of the tools of Google Translate work, artists Donnie Munro and Trail West performed a beautiful and melancholic love song in Scots Gaelic, which was translated into English on Google Translate for the audience of MEPs and their staff, and translators working in the European Parliament.

MEP Stihler stressed the importance of minority languages for local communities at the event, a sentiment shared by her colleague Jordi Sebastia (Co-Chair of the Languages Intergroup), when he said that Europe means diversity.

Google Translate in the European Parliament

As our policy director Lie Junius explained, Google Translate cannot replace the essential work done by the professional translators in the European Parliament. But we do think it can be a tool that can help people understand each other, also in the most difficult of times, such as demonstrated by stories of British families opening up their homes to refugees, using Translate to start their conversations with them.

In the last decade we’ve grown from supporting two languages to 103, and from hundreds of users to more than 500 million people. And we’ll continue to improve Translate.  In February 2016 we announced that we’re adding 13 new languages to Google Translate, including Scots Gaelic, Luxembourgish, and Corsican - covering every single one of the EU member states' official national languages. 

Source: Translate