Comments on Google Operating System: Voice Search for Google Chrome

Also, try to speak clearly for best speech recogni...

2016-03-05T21:55:49.196-08:00

Also, try to speak clearly for best speech recognition results," suggests the author.speech recognition program

i use google chrome but my grey speaker is not sho...

2011-12-14T00:32:10.719-08:00

i use google chrome but my grey speaker is not showing since two days please help me?

does not work on mine chrome

2011-01-06T11:16:50.978-08:00

does not work on mine chrome

@Alex. Cool, thanks for the additional explanatio...

2011-01-05T05:29:31.528-08:00

@Alex.

Cool, thanks for the additional explanations.

I'm sure things will improve soon enough, especially after Google purchased that voice recognition firm from (I believe) the UK.

I found a better explanation from Google: "C...

2011-01-05T03:12:38.798-08:00

I found a better explanation from Google:

"Creating a general voice input service had different requirements and technical challenges compared to voice search. While voice search was optimized to give the user the correct web page, voice input was optimized to minimize (Hangul) character error rate. Voice inputs are usually longer than searches (short full sentences or parts of sentences), and the system had to be trained differently for this type of data. The current system's language model was trained on millions of Korean sentences that are similar to those we expect to be spoken. In addition to the queries we used for training voice search, we also used parts of web pages, selected blogs, news articles and more. Because the system expects spoken data similar to what it was trained on, it will generally work well on normal spoken sentences, but may yet have difficulty on random or rare word sequences -- we will work to keep improving on those."

Google's approach to voice recognition is simi...

2011-01-05T03:01:04.051-08:00

Google's approach to voice recognition is similar to the one used for Google Translate, so the two services probably share a lot of data. Google needs to build a language model using large amounts of data, then use a lot of audio samples to build a voice model and then develop a voice recognition system that tries to connect the two models and produce some useful results.

Google has to find text sources (probably from the Web) and audio sources (these are more difficult to find). It's much easier to build a recognition system for voice search because Google already has a lot of queries that could be used as text sources and the system can self-improve by using the audio samples collected using Voice Search. It's a lot more difficult to build a speech-to-text system for voicemail or YouTube videos because the input is more complex and less predictable.

Thanks for the response, Alex. I'm personally...

2011-01-04T16:41:27.589-08:00

Thanks for the response, Alex.

I'm personally unfamiliar with "training data."

I'm very comfortable with your explanations.

Do you think it's the same software/machinery that performs the transcriptions?

@Cougar: Some possible explanations: less trainin...

2011-01-04T14:26:42.884-08:00

@Cougar:

Some possible explanations: less training data, less feedback, more background noise, more complex phrases.

On a related note, I have a question for everyone ...

2011-01-04T12:39:00.867-08:00

On a related note, I have a question for everyone that may have a simple answer:

If Android phones can do Google searches for "chubby bunny," with near perfection, while the speaker has five marshmallows in his mouth, why is Google Voice's transcription service often far from accurate?

This is cool! I don't know if I want to shouti...

2011-01-04T12:32:29.485-08:00

This is cool! I don't know if I want to shouting at my monitor to search, but it is cool.