crawling the web

Post on 23-Feb-2016

23 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

CRAWLING THE WEB. CRAWLING THE WEB. What do you do when you need information from the internet? . Search Engines. directories. Open directory project (DMOZ). Meta-search engines. FINDING INFORMATION ON THE WEB. SEARCH ENGINES DIRECTORIES META-SEARCH ENGINES. - PowerPoint PPT Presentation

TRANSCRIPT

CRAWLING THE WEB

CRAWLING THE WEB

What do you do when you need information from the internet?

SEARCH EN

GIN

ES

DIRECTO

RIESOpen directory project (DMOZ)

META-SEARCH

EN

GIN

ES

FINDING INFORMATION ON THE WEB

SEARCH ENGINES

DIRECTORIES

META-SEARCH ENGINES

HOW DOES A SEARCH ENGINE WORK? Search engines use a computer program

called a SPIDER to roam the World Wide Web pages and their links.

HOW DOES A SEARCH ENGINE WORK? The spider collects the information and

then indexes all the information.

HOW DOES A SEARCH ENGINE WORK? Each search engine’s spider indexes and organizes the

Web pages

While indexing, matches between keywords and Web pages are found.

The sites with the best matches are displayed first. Each search engine has a different way of identifying the best sites.

HOW DOES A SEARCH ENGINE WORK?

HOW DOES A SEARCH ENGINE WORK?1. ROAMS and COLLECTS INFORMATION

2. INDEXES ALL THE INFORMATION

3. MATCHES THE INFORMATION

These 3 tasks are all done WITHOUT ANY HUMAN INVOLVEMENT–so a huge number of sites are indexed quickly.

HOW DOES A DIRECTORY WORK? In a DIRECTORY, PEOPLE, not

computers, put the index together.

HOW DOES A DIRECTORY WORK? Editors evaluate Web sites and organize

them into subject categories.

Because people have chosen them, the sites in directories may be of higher QUALITY.

HOW DOES A DIRECTORY WORK? The number of sites in a DIRECTORY is

usually much SMALLER than in a search engine’s index.

Many people use the term “SEARCH ENGINE” to describe either a search engine or a directory. That is because many search sites offer both services.

HOW DOES A META-SEARCH ENGINE WORK? A META-SEARCH ENGINE sends your

keywords to several search engines at the same time.

The results from each searchengine are organized and displayed on one page.

HOW DOES A META-SEARCH ENGINE WORK?

This type of service is useful when your topic is very NARROW and you want to search as many Web sites as possible.

REMEMBER …

No one search engine, directory or meta-search engine covers the entire Web. So, don’t get stuck in a rut by using only one. Try them all!

top related