On Friday, the Google webmaster central blog announced that the Google search spider will now attempt to fill out web forms to uncover new links and content. Google claims that the technology will help to uncover the “invisible web” that’s been hiding from search engines for many years.
The invisible web is a collection of webpage’s that have traditionally been hard or impossible for search engines to index. This can be for several reasons including pages that aren’t linked from other pages, sites that require a registration/login and content that is generated dynamically. Because these pages are either unlinked, or require some sort of form action to access, spiders such as the Googlebot have been unable to retrieve them.
Now, Google has introduced a new way to find these pages by completing web forms on high quality websites. From the Google blog post:
In the past few months we have been exploring some HTML forms to try to discover new web pages and URLs that we otherwise couldn’t find and index for users who search on Google. Specifically, when we encounter a <form> element on a high-quality site, we might choose to do a small number of queries using the form…. If we ascertain that the web page resulting from our query is valid, interesting, and includes content not in our index, we may include it in our index much as we would include any other web page.
Google claims the technology should help to increase exposure for sites that use <form> elements on their webpages. Popular Google personality Matt Cutts, has also given a great rundown of how this technology can improve web coverage for businesses with many country specific websites.
If you’ve experienced some unusual crawl patterns on your site, let us know if you’ve got any new pages indexed. If not, G-Boost is a great way to get these tricky pages indexed by Google in only 7 days and comes with a full money-back guarantee.