This feature should've been added a long time ago: Google Docs indexes PDF files and you can finally search the contents of all your files.
Google Docs search has been recently improved by adding support for automatic stemming and synonyms, so you can search for [create shortcut] and find documents that contain [creating shortcuts] or [creates a shortcut].
To make things even better, Google should detect scanned PDF documents and use OCR to extract text. This feature is already used by Google's search engine to index scanned documents and it's available as an experiment for Google Docs API.
December 15, 2009
Subscribe to:
Post Comments (Atom)
Another good reason to start using docs. Thanks for the info.
ReplyDeleteDoesn't appear to be working with PDFs that haven't already been OCR'd, which has always been the case since they introduced PDF support. :\
ReplyDeleteIsn't working for now, my pdfs are still only title-indexed.
ReplyDeleteIt's detecting scanned text in a PDF that I uploaded last week. I'm guessing that it just takes time.
ReplyDeleteIt is seriously about time.
ReplyDeleteIt seems to index only the first 100 pages of a pdf file!
ReplyDeleteOnce you have a PDF you want to index edit the keywords in the preferences. I have found that google index the keywords as well.
ReplyDelete