Google has released a new Web indexing system called caffeine. The Google blog mentioned it a “whole new web indexing system” that’s “more than 50 percent fresher than our last index and it’s the largest collection of web content we’ve offered”.
Previously, Google would crawl a fraction of the Web each night, index it and push it out in its results. With Caffeine, as Google crawls the Web and finds new information, it indexes it immediately. "We process it immediately so we can serve it seconds later," said Matt Cutts, the head of Google's webspam team. He unveiled the news at the Search Marketing Expo in Seattle.
The new system, called Caffeine, delivers results that are closer to "live" than Google's previous system, the company said.
Why google did launch a new Indexing system?
Google posted on it official blog “Content on the web is blossoming. It's growing not just in size and numbers but with the advent of video, images, news and real-time updates, the average webpage are richer and more complex. In addition, people's expectations for search are higher than they used to be. Searchers want to find the latest relevant content and publishers expect to be found the instant they publish.”
Content will be available to searchers more quickly:
Previously, Google’s crawling and indexing systems worked as batch processes. Googlebot would crawl a set of pages, then process those pages (extracting content from them, associating data about them, such as anchor text and external links, determining what those pages were about), and finally add them to the index. While this system was continuous, all the documents in the batch had to wait until the whole batch was processed to be pushed live. Now, when Google crawls a page, it processes that page through the entire indexing pipeline and pushes it live nearly instantly. This change has already resulted in a 50 percent fresher index than before.
Note that the introduction of Caffeine doesn’t necessarily mean that pages will be crawled on a faster schedule than before. It simply means that once those pages are crawled, they are made available to searchers much more quickly. (Remember, you can estimate how often your pages are crawled by taking a look at your server logs or checking the cache dates in Google.)
How could you get benefited by this as a website owner or blogger?
As content owners, you will get benefits of Caffeine indexing system without doing anything at all.
This change doesn’t make any of the crawling, indexing, or ranking factors more or less important than before. It does not mean that Google will crawl or index you web page much faster. A benefit that you will get from Caffeine is that it will simply make crawled content available in search results more quickly in compare to old indexing system.
Via: searchengineland , Google Blog
Dear Readers:
|