Monday, November 17, 2008

Secret Exposed: How the Google Search Engine Really Works

By Henry Zeng

While we don't often think about - or even care - how the Google search engine works, it is very important for marketer who is using search engine optimization, or natural search marketing for their online business to understand the basic principles that drive the Big G.

There are three main parts to Google: Googlebot, The Indexer, and the Query Processor. Believe it or not, understanding how each of the parts works is crucial to your online businesses. Let's take a look at each of the elements to see how they work.

Googlebot is the search engine robot that comes to your website and spiders your pages.

It actually works more like a web browser by calling a server, requesting pages, and downloading them - just as you do when you open your web browser, type in an web address, and the page loads.

Googlebot, anyway, is run by numerous computers and performs this task much faster than one user could on their home or work personal computer. Googlebot also doesn't know whether a website exists or not until a URL is put into their Add URL form, or they find a link points to that website on another web page.

When Googlebot visits a page, they take all the links from that page, put them in a queue for crawling, and just repeat this process over and over again. Once Googlebot has found a link, and downloaded the page, it hands that page off to the Indexer.

The Indexer stores the pages in Google's Index Data base. The Index then sorts all of the pages in its data base, alphabetically, by looking at all of the keywords on the pages. The Indexer does not pay any attention to what Google calls stop words, such as is, on, or, why, how, etc. It just pays attention to more important keyword type words.

The Indexer, after Indexing pages, waits for the Google Query Processor to ask it for a list of documents. A keyword is given by the Query Processor, which in turn asks the Indexer for a list of websites that contain that keyword.

The Indexer then supplies the Query processor with a list of the documents, and the Query processor presents that to the user that requested the keyword.

As you see, each element is important to the others. It all starts with the Googlebot finding your webpage link on another page, or by you going to Google and using the Add form to list your website.

About the Author:

No comments: