Google For Dummies Free Open Book

Google For Dummies

Previous Section
 < Day Day Up > 
Next Section

The Google Crawl

As with most search engines, Google’s work has two parts: searching the Web and building an index. When you enter a search request, Google doesn’t really go onto the Web to find matching sites. Instead, it searches its index for matches. Google is special at both ends of its work spectrum: first in the scope of its Web searching (and therefore the size of its index), and second in the method by which it matches keywords to Web pages stored in the index.

Most search engine indexes start with an automatic, wide-flung search of the Web, conducted by automated software fancifully called a spider or crawler. Google’s crawl is farther-flung than most, resulting in an index that includes between three and four billion Web pages, as of this writing.

Google performs two levels of Web crawl. The main survey, often referred to as Google’s deep crawl, is conducted once a month. Google’s spider takes slightly more than a week to accomplish its profound examination of the Web. Then, as a bonus, Google launches a so-called fresh crawl much more frequently. The fresh crawl is an experimental update to Google’s index that began in mid-2002 and runs almost every day, at the company’s discretion. Naturally, the fresh crawl is shallower than the deep crawl and is designed to pick up new material from sites that change often. Material gleaned from the fresh crawl is added to the main Google index, though the schedule for the incorporation of new pages is a company secret.

Remember 

Webmasters can see the fresh crawl in action by searching for their new content in the main Google index. The continual index shifting (sometimes called the Everflux) is all part of the Google dance described in Chapter 14. Eager Webmasters should never forget that the Everflux is unpredictable, and one should never pin one’s hopes on the Google dance. There are no guarantees in the Google index, including one saying that any particular site must be included in the daily crawl. Hold fast to persistence and patience. The daily crawl is by no means designed to provide the Google index with a daily comprehensive update of the Web. Its purpose is to freshen the index with targeted updates.


Previous Section
 < Day Day Up > 
Next Section
Index: [SYMBOL][A][B][C][D][E][F][G][H][I][J][K][L][M][N][O][P][Q][R][S][T][U][V][W][X][Y][Z]


     Main Menu
Table of Contents
BackCover
Google For Dummies
Introduction
Part I: Taming Google
Part II: Specialty Searching
Part III: Putting Google to Work for You
Chapter 9: Google on Your Browser
Chapter 10: Googling in Tongues
Chapter 11: Using Google AdWords
Chapter 12: Bringing Google and Its Users to Your Site
The Google Crawl
Getting into Google
The Folly of Fooling Google
Keeping Google Out
Part IV: Tricks, Games, and Alternatives to Google
Part V: The Part of Tens
Google For Dummies Cheat Sheet
Index
List of Figures
List of Sidebars


More Books
PHP Hacks
Processing Xml With Java - A Guide To Sax, Dom, Jdom, Jaxp, And Trax
The Koran (Holy Qur'an)
Macromedia Flash 8 Bible
Search Engine Optimization for Dummies
YouTube Traffic
PHP 5 for Dummies
Harry Potter and The Chamber of Secrets
Harry Potter and the Sorcerer's Stone
The Pilgrim's Progress
Wireless Hacks
Flash Hacks. 100 Industrial-Strength Tips & Tools
PayPal Hacks. 100 Industrial-Strength Tips and Tools
Amazon Hacks
Pdf Hacks
The Da Vinci Code
Google Hacks
The Holy Bible
Windows XP For Dummies
Harry Potter and the Half-Blood Prince
Seo Book
Upgrading and Repairing Networks
Macromedia Dreamweaver 8 UNLEASHED
Windows XP Annoyances
Windows XP Hacks
Microsoft Windows XP Power Toolkit
Teach Yourself MS Office In 24Hours
iPod & iTunes Missing Manual
PC Hacks 100 Industrial-Strength Tips and Tools
PC Overclocking, Optimization, and Tuning - 2th Edition
PC Hardware In A Nutshell 3rd Edition
PC Hardware in a Nutshell, 2nd Edition
Upgrading and Repairing PCs
Google for Dummies
MySQL Cookbook
Teach Yourself Macromedia Flash 8 In 24 Hours
PHP CookBook
Sams Teach Yourself JavaScript in 24 Hours
PHP5 Manual
Free Games Paper Airplanes
500 Juegos Gratis 500 Giochi Gratis 500 Jeux Gratuits 500 Jogos Gratis 500 Kostenlose Spiele