internet intelligence

28
Internet Intelligence Robert Crayford Copyright © Halliwells LLP 2008 All rights reserved.

Upload: robcrayford

Post on 11-Jun-2015

837 views

Category:

Documents


3 download

TRANSCRIPT

Page 2: Internet Intelligence

The Internet

“Have you heard of this new thing called the internet? It's giving people new expectations. It's allowing them to become their own expert. Knowledge lies anxious at their fingertips”

Roy H. Williams

Page 3: Internet Intelligence

Internet Intelligence

• Open Source

• Social networking sites.

• Internet footprint

• Questions

Page 4: Internet Intelligence

Open Source searching

• Open source searching refers to any site that, does not need a password or log in to enter.

• The more common open source searches relate to search engines.

Page 5: Internet Intelligence

Deep Web Searching

• The term Deep Web refers to information found on Web sites that is hidden or generally inaccessible through traditional search methods

Page 6: Internet Intelligence

Deep Web searching

• Searching social networking sites and newsgroups/forums is an example of deep web searching.

• The information would not be found from searching search engines.

• It is important to remember that there is a lot of data that can only be found through deep web searching

• To search the deep web you need to locate online databases and forums and search them individually

Page 7: Internet Intelligence

Search Engines

• When you search the web using a search engine, you are always searching a somewhat stale copy of the real web page. When you click on links provided in a search engine's search results, you retrieve from the server the current version of the page.

• Search engine databases are selected and built by computer robot programs called spiders. These "crawl" the web, finding pages for potential inclusion by following the links in the pages they already have in their database (i.e., already "know about").

Page 9: Internet Intelligence

Search engines

• If a web page is never linked to in any other page, search engine spiders cannot find it. The only way a brand new page - one that no other page has ever linked to - can get into a search engine is for its URL to be sent by some human to the search engine companies as a request that the new page be included. All search engine companies offer ways to do this.

• Many web pages are excluded from most search engines by policy. The contents of most of the searchable databases mounted on the web, such as library catalogs and article databases, are excluded because search engine spiders cannot access them. All this material is referred to as the Invisible web-what you don't see in search engine results.

Page 12: Internet Intelligence

One Enough??

• Less than half the searchable Web is fully searchable in Google.

• The percent of total results unique to one search engine was established to be 88.3 percent.

• The percent of total results shared by any two search engines was established to be 8.9 percent.

• The percent of total results shared by three search engines was established to be 2.2 percent.

• The percent of total results shared by the top four search engines was established to be 0.6 percent.

Page 13: Internet Intelligence

One Enough??

• The majority of first page results are unique:

• On average, 69.6 percent of Google first page search results were unique to Google.

• On average, 79.4 percent of Yahoo! first page search results were unique to Yahoo!

• On average, 80.1 percent of Live first page search results were unique to Live.

• On average, 75.0 percent Ask first page search results were unique to Ask.

Page 15: Internet Intelligence

The Top 9 Social Networking Sites by internet visits

Rank Name Domain Market Share %

1 Facebook www.facebook.com 37.7

2 Bebo www.bebo.com 28

3 Myspace www.myspace.com 18.97

4 Faceparty www.faceparty.com 2.01

5 Windows Live Space Spaces.live.com 1.99

6 BBC h2g2 www.bbc.co.uk/dna 1.25

7 Stumble Upon www.stumbleupon.com 1.19

8 Club Penguin www.clubpenguin.com 1.05

9 Friends Reunited www.friendsreunited.co.uk 0.88

Page 17: Internet Intelligence

I.P Addresses

• All computers across the internet are assigned a unique identifier called an IP address. They are used like street addresses so other computers can find them. An IP address could look something like this: 87.242.211.23.

• Websites can log any IP addresses that look at their site.

• IP addresses can then be traced back to the server.

Page 18: Internet Intelligence

IP Address

Page 19: Internet Intelligence

I.P

• They could then Google or yahoo “Halliwells” and “Manchester” to find our address.

• IP Address finder:

• http://www.ip-adress.com/

Page 20: Internet Intelligence

Search Results

• Webmasters can even trace, what search term you used to find their website.

• For example, if you searched for fraudulent people in Liverpool and then clicked on one of the search results, the owner of the site found in the search could see that you were searching for fraudulent people in Liverpool.

Page 21: Internet Intelligence

Search Results

• To avoid this, most search results provide the URL of the results. You can copy and paste this in to a new web browser.

Page 22: Internet Intelligence

Cloaking

• There are many web based proxys that claim to hide your IP address.

• These sites are untested- and this must be considered while using them.

• The websites records information of who blocked who, to look at what.

• http://www.the-cloak.com/anonymous-surfing-home.html

Page 23: Internet Intelligence

Tracing Emails

• You can trace a IP address of the server the email was sent from.

• Web mail tracing would reveal the IP address of the web mail server. e.g. Hotmail.

• The IP address is hidden in the internet header of the email.

• You can either search through the headers to find the IP address or you can paste the header on to an online engine and it will find it for you.

• http://www.ip2location.com/emailtracer.aspx

Page 24: Internet Intelligence

Tracing Emails

Page 25: Internet Intelligence

Tracing Emails

Page 26: Internet Intelligence

BBC news 6/12/1998

Page 27: Internet Intelligence

Halliwells Website 27.11.2004

Page 28: Internet Intelligence

Any Questions

• Robert Crayford

[email protected]

• 0161 618 4312