Welcome to ineedhits Blog

Welcome to the ineedhits Search Engine Marketing blog, where we share the latest search engine and online marketing news, releases, industry trends and great DIY tips and advice.



Sunday, November 2, 2008

Google Now Uncovers Scanned Documents

Posted by @ 4:44 pm

Scanned documents, such as academic papers or government reports, used to be off limits to the Googlebot. That’s because a scanned paper is stored as one large image rather than as text.

Now, using Optical Character Recognition (OCR), Google is able to turn these documents into text and will begin including these files in its search results.

Previously, Google could only search the filename and the limited metadata associated with these files in order to include them in search results. Google’s new technology now turns the scanned “images of text” into computer-readable text.

As with traditional PDF files, when you encounter a scanned document you’ll be able to view either the original version or the text-only version Google has created. To see the technology in action, try searching for “repairing aluminum wiring” (the first result should be a scanned document).

This type of technology has been around for a while, but accuracy has always been a problem: some words would come out jumbled or misspelt. So it’s impressive that Google has found a solution accurate enough to use in its search results.

What does this mean for SEO?

If you’ve got any scanned documents on your site, for example press releases, newspaper articles or research papers, this gives your business more chances to appear in Google’s results. By giving Google more content to index, you’ll improve your chances of coming up for queries related to these documents!

P.S. If you’re hiding any information on the web by keeping it as an image, you may want to consider removing those files now :)



Matthew Elshaw Matt is a marketing professional at ineedhits.com, an international search marketing firm. Matt's passion for online marketing began at university and has proved invaluable in steering product development and marketing initiatives at the company. Matt is a regular contributor to the ineedhits search marketing blog.
