November 19, 2009

YouTube Audio Transcription

YouTube added a feature that generates video captions. "We've combined Google's automatic speech recognition (ASR) technology with the YouTube caption system to offer automatic captions, or auto-caps for short. Auto-caps use the same voice recognition algorithms in Google Voice to automatically generate captions for video."

The feature only works for English and it's been enabled for a small number of channels that usually feature talks and interviews: UC Berkeley, Stanford, MIT, Yale, UCLA, Duke,UCTV, Columbia, PBS, National Geographic.


Another new feature is auto-timing, which lets you upload the transcription of a video and it automatically generates the time codes. "All you need to do is create a simple text file with all the words in the video and we'll use Google's ASR technology to figure out when the words are spoken and create captions for your video."

Since Google's speech recognition technology is not perfect, it would be useful to generate the captions and then to manually edit them to correct the mistakes.

Automatic captions make YouTube videos more accessible: you can watch videos with the sound off and you can translate the captions into another language using Google Translate.

24 comments:

  1. I think this is great, but I wish they would go further.

    Yesterday, I wanted a reasonably accurate (google voice like) transcript from the Diane Rehm Show. The DR Show charges $19 for a transcript -- I'm not sure why.

    I'd like to see Google offer services to websites like the DR Show where they can get a not perfect, google voice like transcript of their show made to accompany their mp3.

    ReplyDelete
    Replies
    1. You don't want a google voice transcription, as it's awful. I once used phonetag and their transcription rocks compared to google voice's engine. Once google buys phonetag it'll be better.

      Delete
  2. That's amazing, now by using this, there may be no frustration to think over for the great caption... its awesome...things goes so easy and without fear.. its great..!!1

    ReplyDelete
  3. What would be really great is to be able to download the captions file seperately, once the timings have been automatically applied. Also, do auto-timing captions have an option to translate them or is it just the automatic speech-text ones that do?

    ReplyDelete
  4. This is truly beneficial for deaf or people who are hearing-impaired and the language translation option is also great for the international community. But it seems to have no SEO value. Or does it ?

    ReplyDelete
  5. Great! Now youtube is international!

    ReplyDelete
  6. I always loved Google that they think of users and do only useful services.

    ReplyDelete
  7. very nice. hopefully they would come up with other languages soon as well.

    ReplyDelete
  8. I like it that's Really nice future ...But that text Include in Google Search ranking ?

    ReplyDelete
  9. nice feature, especially the translation bit..

    ReplyDelete
  10. There is no doubt that YouTube Video rock and its best.

    ReplyDelete
  11. It's absolutely awful.

    It seems to be completely deaf as to what people are actually saying and offers WAY off the mark captions.

    ReplyDelete
  12. LOL even though most of it doesnt make sense its great entertainment- once on a Blendtec video it came up as 'Don't Breathe Air'

    ReplyDelete
  13. "Not perfect" is a bit of an understatement.

    And when you say "The feature only works for English" surely what you in fact mean is: "the feature doesn't fail quite so spectactularly if you speak English in a clear American accent."

    Any actual English people may as well be speaking Portuguese in my experience.

    Let's be honest, it shouldn't be released from Beta status until it works better. At present its main function is as a comedy generator.

    See how it hilariously renders classic works of literature:

    http://www.youtube.com/watch?v=syFXOH3t4CY

    ReplyDelete
  14. Transcribing your audios is now easy with Secure Transcription, email to support@securetranscription.com or dial 18664108787 to our toll free number.

    ReplyDelete
  15. So far I've used Google Trascribe Audio on two music videos on YouTube, and the results have been amazingly laughable. Nothing in either video translation resulted in a rational thought. Using it made me dumber. I award Google no points, and may God have mercy on its soul.

    ReplyDelete
  16. You can't really rely on Google softwares to transcribe your videos.

    It's wise to check out a transcription service provider like www.transcriptionpro.net, which makes outsourcing videos for transcription easy and secure with their online transcription software.

    ReplyDelete
  17. yes that is nice functionality, one day it will work perfectly

    ReplyDelete
  18. Is there a tool that accurately does this?

    ReplyDelete
  19. THis is great! Thanks to both companies for their attention to accessibility issues. This is a great step in the right direction.

    Thanks again.

    ReplyDelete
  20. Google's voice recognition technology is not still matured to produce accurate transcription for non-American accents, or with people speaking quickly or multiple speakers audio files. If you have more than one voice it is almost impossible to get a good transcript.Better go for a manual transcription services.

    ReplyDelete
  21. Automatic speech recognition (ASR) technology is very cool. I Think it is very helpful Audio transcription technology if YouTube caption system to offer all those thinks.

    ReplyDelete
  22. The Voicera tool is now available to transcribe meetings and calls and costs less than Google or other transcription services, plus offers note taking through an AI. Try that out if you have a moment at www.voicera.com

    ReplyDelete

Note: Only a member of this blog may post a comment.