Contact Us
 
 

Welcome to ineedhits Blog

Welcome to the ineedhits Search Engine Marketing blog, where we share the latest search engine and online marketing news, releases, industry trends and great DIY tips and advice.



Sunday, November 2, 2008

Google Now Uncovers Scanned Documents

Posted by Matthew Elshaw @ 4:44 pm
0
  •  

Scanned documents such as academic papers, or government reports used to be off limits for the Googlebot. That’s because when scanned, the entire paper appeared as a giant image, instead of text.

Now, using Optical Character Recognition (OCR), Google is able to turn these documents into text and will begin including these files in its search results.

Previously, Google was only able to search the filename and limited meta data associated with these files in order to include them in search results. Google’s new technology now turns the scanned “images of text” into computer readable text itself.

As with traditional PDF files, when you encounter a scanned document, you’ll be able to view the original version, or the text only version Google has created. To see the technology in action, try the search repairing aluminum wiring (the first result should be a scanned document).

This type of technology has been around for a while now, but the scanning accuracy has always been a problem. Some words would get jumbled or miss spelt, so it’s impressive that Google has found a solution that’s accurate enough to be used for their search results.

What does this mean for SEO?

If you’ve got any scanned documents on your site, for example press releases, newspaper articles or research papers, this now gives your business more chances to appear in Google’s results. By giving more content for Google to index, you’ll improve your chances in coming up for queries related to these documents!

P.S. If you’re hiding any information on the web by keeping it as an image, you may want to consider removing those files now :)

Related posts:


Tags:

Matthew Elshaw Matt is a marketing professional at ineedhits.com, an international search marketing firm. Matt's passion for online marketing began at university and has proved invaluable in steering product development and marketing initiatives at the company. Matt is a regular contributor to the ineedhits search marketing blog.

View Matthew Elshaw's profile




Top 10 Listing!
Get more targeted visitors to your site! No click fees!
Find Out More Here

1000+ Guaranteed Visitors
Get thousands of guaranteed website visitors to your site in 30 days!
Get Started Here



Discussion (Not Started)


Add Your Comments







SUBSCRIBE

Keep up to date with the latest from our blogs.

Subscribe to all blog posts

The Newsletter
BROWSE OUR POSTS




  • New Posts
  • Popular
  • Comments
  • Robin: Thanks Courtney....
  • Royce: This is a big concern. Search results are...
  • Courtney Mills: @Robin check out this info page from Link...
  • Robin: Courtney Where do you add your company...
  • Andy Gage: Good Tips! I've also not done much on Lin...
  • Courtney Mills: @ Luke M, Great concept and the website d...
  • sever3d: I am following those tips as well on each...
  • Luke M: Hi Courtney, If you enjoyed Melissa's ...
  • Nick Stamoulis: Every page of a website should have uniqu...
  • faisal: is there major diffrence between fire fox...