google ppt by amit

26

Upload: davv

Post on 17-May-2015

8.380 views

Category:

Education


4 download

DESCRIPTION

a brief knowledge of Google working,how Google search so fast

TRANSCRIPT

Page 1: Google ppt by amit
Page 2: Google ppt by amit

Name- Google, because itis a common spelling of googol, or 10100 and fits well with our goal of building very large-scale search

Page 3: Google ppt by amit

1. Google is a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext.

2. Google is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems.

3.large-scale search engine which addresses many of the problems of existing systems. It makes

especially heavy use of the additional structure present in hypertext to provide much higher quality search results

.

Page 4: Google ppt by amit

How Google Works Google consists of three distinct parts, each of which is run on a

distributed network of thousands of low-cost computers and can therefore when we enter a query.

1.carry out fast parallel processing -Parallel processing is a method of computation in which many calculations can be performed simultaneously, significantly speeding up data processing.2. Googlebot- a web crawler that finds and fetches web pages. The indexer that sorts every word on every page and stores the resulting index of words in a huge database.

3. The query processor, which compares your search query to the index and recommends the documents that it considers most relevant.

Page 5: Google ppt by amit

3. The search results are returned tothe user in a fraction of a second.

1. The web server sends the query tothe index servers. The content insidethe index servers is similar to the indexin the back of a book--it tells whichpages contain the words that match anyparticular query term2. The query travels to the doc

servers, which actually retrieve thestored documents. Snippets aregenerated to describe each searchresult.Copyright

Page 6: Google ppt by amit

crawling technology is needed to gather the web documents and keep them up to date. Storage space must be used efficiently to store indices and, optionally, the documents themselves.

indexing The indexing system must process hundreds of gigabytes of data efficiently. Queries must be handled quickly, at a rate of hundreds to thousands per second.

.

Google is designed to scale well to extremely large data sets. It makes efficient use of storage space to store the index. Its data structures are optimized for fast and efficient access Further, we expect that the cost to index and store text or HTML will eventually decline relative to the amount that will be available This will result in favourable scaling properties for centralized systems like Google.

Page 7: Google ppt by amit

The Google search engine has two important features that help it produce high precision results.

First it makes use of the link structure of the Web to calculate a quality ranking for each web page. This rankingis called PageRank.

Second Google utilizes link to improve search results.

Page 8: Google ppt by amit

PageRank: Bringing Order to the Web

The citation (link) graph of the web is an important resource that has largely gone unused in existing web search engines.

they have created maps containing as many as 518 million of these hyperlinks These maps allow rapid calculation of a web page’s "PageRank", anobjective measure of its citation importance that corresponds well with people’s subjective idea of importance Because of this correspondence, PageRank is an excellent way to prioritize the results of web keyword searches For most popular subjects, a simple text matching search that is restricted to webpage titles performs admirably when PageRank prioritizes the results For the type of full text searches in the main Google system, PageRank also helpsa great deal.

Page 9: Google ppt by amit

Anchor TextThe text of links is treated in a special way in our search engine. Most search engines associate the text of a link with the page that the link is on. In addition, we associate it with the page the link points to.This has several advantages. First, anchors often provide more accurate descriptions of web pages thanthe pages themselves. Second, anchors may exist for documents which cannot be indexed by atext-based search engine, such as images, programs, and databases. This makes it possible to return webpages which have not actually been crawled. Note that pages that have not been crawled can causeproblems, since they are never checked for validity before being returned to the user. In this case, thesearch engine can even return a page that never actually existed, but had hyperlinks pointing to it.However, it is possible to sort the results, so that this particular problem rarely happens.

Page 10: Google ppt by amit

Aside from PageRank and the use of anchor text, Google has several other features.

First- it has location information for all hits and so it makes extensive use of proximity in search.

Second- Google keeps track of some visual presentation details such as font size of words. Words in a larger or bolder font areweighted higher than other words.

Third- full raw HTML of pages is available in a repository.

Page 11: Google ppt by amit
Page 12: Google ppt by amit

The Google Advanced Search is of course applicable to texts, terms, files and so on. In thatway is possible to do an advanced search in texts with following terms:• Idioms• Format file• Domains• Books• Codes

Page 13: Google ppt by amit

Figura 1: Google Advanced Image Search

Page 14: Google ppt by amit

1. Parse the query.

2. Convert words into wordIDs.

3. Seek to the start of the doclist in the short barrel for every word

4. Scan through the doclists until there is a document that matches all the search terms.

5. Compute the rank of that document for the query.

6. If we are in the short barrels and at the end of any doclist, seek to the start of the doclist in the full barrel for every word and

go to step 4.

7. If we are not at the end of any doclist go to step 4. Sort the documents that have matched by rank and return the top k.

Page 15: Google ppt by amit
Page 16: Google ppt by amit

What is a query? It's a request for information from a search engine. A query consists of one or more words, numbers, or phrases that youhope you will find in the search results listings. To enter a query, type in descriptive words into Google's search box. You can use either the search box on Google's home page (shown above) or the search box that always appears at the top of a Google results page .Now press the ENTER key or click on the "Google Search" button to view your search results, which include links to pages that match yourquery along with relevant snippets (excerpts) with your search terms in a boldface

Page 17: Google ppt by amit

Search within resultsYou can get the same results in one step fewer by simply specifying additional terms to your previous query.On Internet Explorer and on some other browsers, you can double click on a term to highlight it. Then type a new term or hit theDELETE key to remove the term. Triple click in the search box to highlight your entire query. Enter a new query or hit the DELETE key toremove the old query.l Instead of searching for related topics with a single query, divide the query into several parts. Looking for a job? By searching for tips oneach aspect, you'll find more sites than by searching for sites that describe all the aspects of a job search

Page 18: Google ppt by amit

Google Earth is very famous interactive application mapping program powered by satellite andaerial imagery that covers the vast majority of the planet. Google Earth is generally considered to be remarkably accurate and extremely detailed. Many major cities in the planet have such detailed images that one can zoom in close enough to see vehicles and pedestrians clearly. Consequentlythere have been some concerns about national security implications in despite of the images hasbeen not updated constantly. Google has many others products through the Google Labs notreleased yet due it are still being tested for use by general public.One good differential on Google Search is regarding to logic engine based on Boolean Logiccreated by mathematician Britain George Boole. Therefore the Google engine allows finding words,texts and so on using logic values conditioned to:• The value must be true or false• The value must not be true and false at same time• If true, it is defined as 1 and if false it is defined as 0(zero

Page 19: Google ppt by amit

Now we came to Google Desktop (2) is desktop search software made by Google for Mac OS X,Linux, and Microsoft Windows. The program allows text searches of a user's e-mails, computer files,music, photos, chats, Web pages viewed, and other "Google Gadgets."Google Desktop have the following features:File indexing: After initially installing Google Desktop, the software completes an indexingof all the files in the computer And after the initial indexing is completed, the softwarecontinues to index files as needed. Users can start searching for files immediately afterinstalling the program. After performing searches, results can also be returned in an Internetbrowser on the Google Desktop Home Page much like the results for Google Web searches.

Page 20: Google ppt by amit
Page 21: Google ppt by amit

• Sidebar: Screenshot of gadgets. Google Desktop running on Microsoft Windows Vista. Aprominent feature of Google Desktop is the Sidebar, which holds several common Gadgetsand resides off to one side of the desktop. The Sidebar is available with the MicrosoftWindows version of Google Desktop only. The Sidebar comes pre-installed with thefollowing gadgets: Email - a panel which lets one view one's Gmail messages. Scratch Pad - here one can store random notes; they are saved automatically Photos - displays a slideshow of photos from the "My Pictures" folder .News - shows the latest headlines from Google News, and how long ago they were written. The News panel is personalized depending on the type of news you read. Weather - shows the current weather for a location specified by the user. Web Clips - shows recent posts from RSS news feeds. Google Talk - If Google Talk is installed, double clicking the window title will dock it to one's sidebar

Page 22: Google ppt by amit

Quick Find: When searching in the sidebar, deskbar or floating deskbar, Google Desktopdisplays a "Quick Find" window. This window is filled with 6 (by default) of the most relevantresults from one's computer. These results update as one types so that one can get to whatone wants on one's computer without having to open another browser window.

Page 23: Google ppt by amit

Deskbars: Deskbars are boxes which enable one to type in a search query directly fromone's desktop. Web results will open in a browser window and selected computer results willbe displayed in the "Quick Find" box (see above). A Deskbar can either be a fixed deskbar,which sits in one's Windows Taskbar, or a Floating Deskbar, which one may positionanywhere one wants on one's desktop.

Page 24: Google ppt by amit
Page 25: Google ppt by amit
Page 26: Google ppt by amit

THANK YOU