Google Docs OCR

September 29, 2009

Google Docs OCR

Google Docs API tests a new feature that lets you perform OCR (optical character recognition) on an image. There's a live demo that illustrates this feature: you can upload a high-resolution JPG, GIF, or PNG image that has less than 10 MB and Google Docs extracts the text and converts it into a new document. Google mentions that "the operation can currently take up to 40 seconds" and a small test showed that the service is not yet reliable: it's slow and it frequently returns errors.

The results are far from perfect and you'll find many errors, but the service is free and it's constantly improving. Here's the result of the OCR for this scanned document:

There aren't many free OCR services available, so an OCR service provided by Google would be very popular. ABBYY FineReader Online is one of the best online OCR services, but the free version is limited to 10 pages a day.

Google sponsors the development of an open-source OCR software called OCRopus, but it's not clear if the online service provided by Google Docs uses OCRopus.

25 comments:

Baby BSeptember 29, 2009 at 11:29 AM
This fits in very will with their acquisition of reCAPTCHA....
ReplyDelete
Replies
JurgiSeptember 29, 2009 at 12:04 PM
@ebboodah — yup, then can send hard words to users od reCaptcha to get amazing accuracy. And soon after that, cybercriminals will invent API to send reCaptcha images to Google Docs OCR for great accuracy of spamming. Perpetuum mobile. :D
ReplyDelete
Replies
AnonymousSeptember 29, 2009 at 12:06 PM
Great idea ! But you're right : this doesn't work very well. I 'm excpecting a much more performant version :).
ReplyDelete
Replies
AnonymousSeptember 29, 2009 at 12:31 PM
Is this English only for now? Currently we're very pleased with the Acrobat tools performance, but that installation is not available to all users. Web based would be great.
ReplyDelete
Replies
gregSeptember 29, 2009 at 6:05 PM
If you are going to criticize Google for not having perfect OCR translations, at least get the spelling right in your post.....

"excpecting a much more performant version :)"
ReplyDelete
Replies
allan stuwartSeptember 29, 2009 at 9:33 PM
Is this like stegnography?
ReplyDelete
Replies
FetardSeptember 30, 2009 at 12:45 AM
I just tested it on some pictures. In some cases, I need to resize *2 the image, to be recognized, but it's awesome to be able to use OCR online !
Don't forget OCR is very complex technology, so, be patient !
ReplyDelete
Replies
UnknownSeptember 30, 2009 at 2:06 AM
Very poor quality

The best online ocr service is :
http://www.finereaderonline.com
ReplyDelete
Replies
ChrisSeptember 30, 2009 at 1:37 PM
Google and OCR technology. My prediction is pretty close:
http://detailcode.com/news/future-predictions-of-googles-recaptcha/

Has anyone tested the accuracy when images are slightly skewed/rotated?
ReplyDelete
Replies
AnonymousOctober 1, 2009 at 7:26 PM
Dear Google,

it would be so kind of you if you could further in your endeavour and make Google Docs OCR work for Hebrew and Arabic. Perhaps, Cyrillic too. Certainly, people who read Chinese, Japanese or Hindi have similar levels of enthusiasm too.

This is great news, as I have a lot of books I wish to scan. Contrary to people who love collecting books on shelves but not reading them, I love reading books but not keeping them. Printed books are real heavy stuffs and inconvenient to move around with. Printed books also do not lend themselves willingly towards searching their contents - very bad feature.

I want to be able to read my stuffs anywhere I am in the world. I would scan the books, recycle the paper, but keep the covers in a trunk in an obscure corner of the cellar as proof of my ownership of the books.

BTW, what's the meaning of "performant"? I am afraid the frequent appearance of this word is about to make it join the list of annoying words like "obligated" that have replaced "obliged", "percentage points" instead of "percent" or "presumptive candidate" instead of "presumed candidate" as real words.
ReplyDelete
Replies
AnonymousOctober 2, 2009 at 2:41 AM
Jurgi,
That way, Google can find the criminals much faster and black-list all their property faster.

Hence,
Isn't “candidate” enough?
ReplyDelete
Replies
AnonymousNovember 17, 2009 at 1:26 AM
Good start, the other so called free services suck.
ReplyDelete
Replies
OCR SoftwareJanuary 17, 2010 at 1:57 PM
Tesseract seems to have some quality and formatting issues.
ReplyDelete
Replies
AnonymousJanuary 23, 2010 at 1:26 AM
why is this redirecting me to googlecodesamples.com which doesn't appear to be owned by google? And then asks me for access to my docs. Is this legit?

Registered through: GoDaddy.com, Inc. (http://www.godaddy.com)
Domain Name: GOOGLECODESAMPLES.COM
Created on: 30-Jun-08
Expires on: 30-Jun-10
Last Updated on: 01-Jul-09

Private, Registration GOOGLECODESAMPLES.COM@domainsbyproxy.com
Domains by Proxy, Inc.
DomainsByProxy.com
15111 N. Hayden Rd., Ste 160, PMB 353
Scottsdale, Arizona 85260
United States
(480) 624-2599 Fax -- (480) 624-2598

Domain servers in listed order:
NS1.SLICEHOST.NET
NS2.SLICEHOST.NET
ReplyDelete
Replies
AnonymousJanuary 28, 2010 at 2:29 AM
Tesseract has a number of formatting and accuracy issues. So far, the best solution I have encountered is OCR Terminal (www.ocrterminal.com). 20 free pages a month and very good accuracy.
ReplyDelete
Replies
AnonymousMarch 7, 2010 at 8:37 PM
What's the point of this if you can't upload a PDF file.
ReplyDelete
Replies
AnonymousJune 18, 2010 at 3:49 PM
PDF! PDF! PDF!
ReplyDelete
Replies
AnonymousOctober 8, 2010 at 7:26 AM
Try this online OCR instead. Worked great for me.
http://www.cvisiontech.com/index.php?option=com_docman&task=cat_view&gid=45&&Itemid=206&lang=eng
ReplyDelete
Replies
UnknownDecember 29, 2010 at 12:49 PM
I think that they are using neither Ocropus nor Tesseract. When i gave the same image to Google OCR, Tesseract, and Ocropus, I noticed that Google OCR provided better results than Tesseract, and Tesseract provided much better results than Ocropus, although Ocropus uses Tesseract as the recognition engine....strange!

I nearly tried all free online OCR services. All of them give more or less the same quality. I really like i2OCR because it provides a balance between accuracy and usability.
http://www.sciweavers.org/free-online-ocr

Finally, the secret sauce is DPI. The more DPI, the better the quality of recognition.
ReplyDelete
Replies
prashantDecember 30, 2010 at 4:02 PM
can u pls let me know how can i use api code for my own stuffs to read text from image
ReplyDelete
Replies
lean mediaJanuary 20, 2011 at 6:09 AM
HELLO YUVATANTRA,
i also have the same pb, i need to use the api code for my ownn project.
Are u a developper?
ReplyDelete
Replies
AnonymousMarch 2, 2011 at 7:04 AM
Well, I just tried Google's OCR and it was on pants at b/w 600dpi. Junk character"5 and no formatting. This may have been down to the font (bold Tahoma, don't ask me why, some people just have no sense of style). However, tried the same document with onlineocr.net, and got much better results.
ReplyDelete
Replies
zonenderJuly 23, 2011 at 3:48 PM
Please make this service available in Arabic... good start anyway.
ReplyDelete
Replies
AnonymousSeptember 11, 2011 at 7:40 AM
You should try http://www.ocr-extract.com/
ReplyDelete
Replies

Add comment

Note: Only a member of this blog may post a comment.

Google Operating System

Unofficial news and tips about Google

September 29, 2009

Google Docs OCR

25 comments:

Follow

Labels

Popular Posts

Blog Archive

Recommended Sites