March 18, 2020 report

Google introduces real-time extended voice translation

by Peter Grad , Tech Xplore

Google has announced a new real-time transcription feature for its free Translate app for Android phones. An IOS version is planned for the future, the company says.

The feature will allow users to obtain instantaneous text translations of ongoing speeches, lectures or monologues into any of eight languages, including English.

Currently, Translate allows conversions of only relatively short snippets of speech.

The only requirements are having only one speaker talking at a time in a quiet room (other voices or noises will diminish accuracy) and an Internet connection, necessary for interaction with Google's cloud-based Tensor Processing Units.

The rollout begins today (March 18) and should be available to all users by the end of the week at Google's Play Store.

In conversation mode, the app permits users to have a back-and-forth conversation with someone speaking a different language.

In addition to English, translations are available in French, German, Hindi, Portuguese, Russian, Spanish and Thai.

The app will also work with playbacks of prerecorded audio. But Google says direct digital translation from uploaded audio files is not yet available.

This week's announcement is a reminder of just how far we have come since the earliest days of digital voice recognition. Bell Laboratories debuted its futuristic "Audrey" system in 1952 that recognized the spoken digits 0-9. A giant step was made a decade later when IBM displayed the "Shoebox" at the 1962 World's Fair—it could recognize a whopping 16 words.

For five years in the 1970s, voice recognition got a huge boost from America's military. The Department of Defense underwrote massive research projects into speech recognition, including Carnegie-Mellon's "Harpy" Speech Understanding Research (SUR) initiative, which built a recognition vocabulary of more than 1,011 words. That program notably introduced the concept of pronunciation patterns and probability for the first time, greatly enhancing the ability to recognize distinct modes of speech.

The 1980s brought ever greater advances in word detection, with researchers applying probability theory to unknown sounds. Tech giant IBM's program expanded recognition to 5,000 words. But the decade may be best remembered for the introduction of the world's first talking doll, "Julie," that understood speech. An ad campaign stated: "Finally, the doll that understands you."

Dragon brought voice recognition to the masses in the 1990s, with its first largely accurate though still buggy consumer product priced at "only" $9,000. By the end of the decade, the vastly improved Dragon NaturallySpeaking program, which for the first time did not require pauses between each spoken word, was available to consumers for about $700.

Today we have Siri and Alexa and other free and low-cost mobile apps that let us request driving directions, order food, buy household items and type out spoken text in emails and word processing documents, all of which have expanded speech recognition to points unimaginable not too many years ago.

With the latest advances available to millions of users with handheld devices, Harpy, Audrey, Julie would likely be left speechless.

More information: www.blog.google/products/trans … e/transcribe-speech/

Citation: Google introduces real-time extended voice translation (2020, March 18) retrieved 9 November 2024 from https://techxplore.com/news/2020-03-google-real-time-voice.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Hey, Google, be my Spanish translator

1046 shares

Feedback to editors

First practical application of viscous electron flow realizes terahertz photoconductivity in graphene

9 hours ago

Washbasin-cleaning robot can imitate human motions and adapt its knowledge flexibly to different situations

Nov 8, 2024

Artificial magnetic muscles can support tensile stresses up to 1,000 times their own weight

Nov 8, 2024

Making jet engines fit for the hydrogen age

Nov 8, 2024

Creating AI that's fair and accurate: Framework moves beyond binary decisions to offer a more nuanced approach

Nov 8, 2024

Material with increased band gap design could make electronics faster and more efficient

Nov 8, 2024

One-step electrochemical regeneration of CO₂ from (bi)carbonates enhances carbon capture efficiency

Nov 8, 2024

First artwork by humanoid robot sells for over $1.0 million

Nov 8, 2024

Up to 30% of the power used to train AI is wasted: A software tool could help fix that

Nov 7, 2024

Crowdsourcing system aims to map wildfires in seconds

Nov 7, 2024

Load comments (0)

Google introduces real-time extended voice translation

First practical application of viscous electron flow realizes terahertz photoconductivity in graphene

Washbasin-cleaning robot can imitate human motions and adapt its knowledge flexibly to different situations

Artificial magnetic muscles can support tensile stresses up to 1,000 times their own weight

Making jet engines fit for the hydrogen age

Creating AI that's fair and accurate: Framework moves beyond binary decisions to offer a more nuanced approach

Material with increased band gap design could make electronics faster and more efficient

One-step electrochemical regeneration of CO₂ from (bi)carbonates enhances carbon capture efficiency

First artwork by humanoid robot sells for over $1.0 million

Up to 30% of the power used to train AI is wasted: A software tool could help fix that

Crowdsourcing system aims to map wildfires in seconds

Hey, Google, be my Spanish translator

Mozilla releases transcription model and huge voice dataset

Google Assistant to read web pages aloud on some devices

Hey Google, do you really record everything I say? Yes.

Google to update translation app for phones

Google Brain posse takes neural network approach to translation

Creating AI that's fair and accurate: Framework moves beyond binary decisions to offer a more nuanced approach

Washbasin-cleaning robot can imitate human motions and adapt its knowledge flexibly to different situations

Unique memristor design with analog switching shows promise for high-efficiency neuromorphic computing

Up to 30% of the power used to train AI is wasted: A software tool could help fix that

Advance in 4-inch heterostructure fabrication enhances AI semiconductors

Aquatic robot's self-learning optimization enhances underwater object manipulation skills

Phys.org

Medical Xpress

Science X

Google introduces real-time extended voice translation

First practical application of viscous electron flow realizes terahertz photoconductivity in graphene

Washbasin-cleaning robot can imitate human motions and adapt its knowledge flexibly to different situations

Artificial magnetic muscles can support tensile stresses up to 1,000 times their own weight

Making jet engines fit for the hydrogen age

Creating AI that's fair and accurate: Framework moves beyond binary decisions to offer a more nuanced approach

Material with increased band gap design could make electronics faster and more efficient

One-step electrochemical regeneration of CO₂ from (bi)carbonates enhances carbon capture efficiency

First artwork by humanoid robot sells for over $1.0 million

Up to 30% of the power used to train AI is wasted: A software tool could help fix that

Crowdsourcing system aims to map wildfires in seconds

Related Stories

Hey, Google, be my Spanish translator

Mozilla releases transcription model and huge voice dataset

Google Assistant to read web pages aloud on some devices

Hey Google, do you really record everything I say? Yes.

Google to update translation app for phones

Google Brain posse takes neural network approach to translation

Recommended for you

Creating AI that's fair and accurate: Framework moves beyond binary decisions to offer a more nuanced approach

Washbasin-cleaning robot can imitate human motions and adapt its knowledge flexibly to different situations

Unique memristor design with analog switching shows promise for high-efficiency neuromorphic computing

Up to 30% of the power used to train AI is wasted: A software tool could help fix that

Advance in 4-inch heterostructure fabrication enhances AI semiconductors

Aquatic robot's self-learning optimization enhances underwater object manipulation skills

Your Privacy