Google and Beyond: not Google

Upload: karen-blakeman

Post on 08-May-2015

DESCRIPTION

Slides for a workshop held for NHS South West in Exeter and Bristol. This session was on alternatives to Googl

TRANSCRIPT

Page 1: Google and Beyond: NOT Google

11/04/23 www.rba.co.uk 1

Google and Beyond: not Google

NHS South West

Monday, 11th November 2013, Exeter

Thursday, 14th November 2013, Bristol

This presentation is licensed under a Creative Commons Attribution 3.0 License

Karen Blakeman, RBA Information Services

[email protected], http://www.rba.co.uk/search/

twitter.com/karenblakeman, http://google.com/+KarenBlakeman/, http://www.linkedin.com/in/karenblakeman

Slides will be available on http://www.authorstream.com and http://www.slideshare.com/. Also available temporarily at http://www.rba.co.uk/as/

Page 2: Google and Beyond: NOT Google

Bing/Yahoo

http://www.bing.com/ http://www.yahoo.com/

Yahoo now uses Bing’s database, commands and ranking algorithms

No advanced search screen - use commands. List at Advanced Operator Reference http://msdn.microsoft.com/en-us/library/ff795620.aspx

filetype: site: inbody: inurl:

AND, NOT, OR and parentheses for complex Boolean searches

NEAR:n where n is a number, specifies that the terms must be within that number of words of each other and in any order

- director NEAR:3 marketing
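
The commands above can be combined into a single query and turned into a search URL. The following is a minimal sketch, assuming Bing's usual `https://www.bing.com/search?q=...` address form; the function name `bing_search_url` and the example site `nhs.uk` are illustrative and do not come from the slides.

```python
from urllib.parse import urlencode


def bing_search_url(terms, site=None, filetype=None):
    """Build a Bing search URL from free-text terms plus optional commands."""
    parts = [terms]
    if site:
        parts.append(f"site:{site}")          # restrict results to one site
    if filetype:
        parts.append(f"filetype:{filetype}")  # restrict results to one file type
    query = " ".join(parts)
    # urlencode percent-encodes the colons and turns spaces into '+'.
    return "https://www.bing.com/search?" + urlencode({"q": query})


# Hypothetical example: the NEAR search from the slide, limited to PDFs on nhs.uk.
url = bing_search_url("director NEAR:3 marketing", site="nhs.uk", filetype="pdf")
print(url)
```

Typing the same commands directly into the Bing search box gives the same search; the code only shows how the pieces fit together.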

Page 3: Google and Beyond: NOT Google

bing.com

Page 4: Google and Beyond: NOT Google

bingiton.com

Page 5: Google and Beyond: NOT Google

Bing

Results seem to be more consumer/retail focused

– more ‘shopping’ than research

– results improve as soon as you start using the advanced search commands

Sometimes more up to date than Google

– updates sites more frequently

– adds new sites more quickly

– useful if you are looking for information on a new company or organisation

Many features and options available to US users only

– changing your location does not always work

– using anonymous proxy does not always work

Excellent for images

Page 6: Google and Beyond: NOT Google

Bing images

Page 7: Google and Beyond: NOT Google

DuckDuckGo – http://duckduckgo.com/

Does not track, does not personalise

Results are a compilation of about 50 sources including Wikipedia, Wolfram Alpha, Bing, Blekko and its own Web crawler DuckDuckBot

Advanced search commands include:

site: inbody: intitle: filetype:

sort:date to sort by date (uses results from Blekko)

region:cc (e.g. de) to boost a country

DuckDuckGo Syntax http://help.duckduckgo.com/customer/portal/articles/300304

DuckDuckGo – silly name but a neat little search tool http://www.rba.co.uk/wordpress/2011/11/07/duckduckgo-silly-name-but-a-neat-little-search-tool/
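
As an illustration of how the DuckDuckGo commands listed on this slide combine, here is a small sketch that assembles them into one query. It assumes the `https://duckduckgo.com/?q=...` address form and the commands as they were documented in 2013 (sort:date and region: may have changed since); the function name `ddg_search_url` is made up for this example.

```python
from urllib.parse import urlencode


def ddg_search_url(terms, site=None, filetype=None, sort_by_date=False, region=None):
    """Combine search terms with the DuckDuckGo commands listed on the slide."""
    parts = [terms]
    if site:
        parts.append(f"site:{site}")
    if filetype:
        parts.append(f"filetype:{filetype}")
    if sort_by_date:
        parts.append("sort:date")           # sort results by date
    if region:
        parts.append(f"region:{region}")    # boost a country, e.g. "de"
    return "https://duckduckgo.com/?" + urlencode({"q": " ".join(parts)})


# Hypothetical example: recent PDFs, boosting German results.
ddg_url = ddg_search_url("diabetic retinopathy", filetype="pdf",
                         sort_by_date=True, region="de")
print(ddg_url)
```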

Page 8: Google and Beyond: NOT Google

DuckDuckGo

Page 9: Google and Beyond: NOT Google

Million Short http://millionshort.com

Million Short: unearthing information hidden in the dungeons of Google’s results

– http://www.rba.co.uk/wordpress/2012/10/04/million-short-unearthing-stuff-hidden-in-the-dungeons-of-googles-results/

Uses Bing API plus other sources

Great for finding specialist articles that Google buries beyond reach

Removes top 10k sites from results - can change to top million, 100k, 1k, 100

Can add sites back in, can block sites

Can “Boost!” sites so that they always appear at the top

Can use site: and filetype: commands

Country versions give different results (under Manage Settings and Country)
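
The idea of removing the most popular sites, with the option to add some back in, can be sketched in a few lines. This illustrates the principle only and is not Million Short's actual implementation; the function name, the result URLs and the "top sites" set are all invented for the example.

```python
from urllib.parse import urlparse


def remove_top_sites(result_urls, top_sites, add_back=()):
    """Drop results hosted on popular sites, except those explicitly added back."""
    blocked = set(top_sites) - set(add_back)
    kept = []
    for url in result_urls:
        host = urlparse(url).hostname or ""
        if host.startswith("www."):
            host = host[4:]                  # compare without the www. prefix
        if host not in blocked:
            kept.append(url)
    return kept


# Invented sample data: three results and a tiny stand-in for the "top sites" list.
results = [
    "https://www.wikipedia.org/wiki/Diabetes",
    "https://example.org/specialist-article",
    "https://www.nhs.uk/conditions/diabetes/",
]
top_sites = {"wikipedia.org", "nhs.uk"}

print(remove_top_sites(results, top_sites))
print(remove_top_sites(results, top_sites, add_back={"nhs.uk"}))
```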

Page 10: Google and Beyond: NOT Google

Million Short

Page 11: Google and Beyond: NOT Google

Yandex http://www.yandex.com/

– for filetype use mime:

diabetic retinopathy mime:pptx

– has an advanced search screen at http://yandex.com/search/advanced

Blekko http://www.blekko.com/

Ask http://www.ask.com/

Teoma http://www.teoma.com/

– all three support filetype: and site:
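
Because Yandex uses mime: where the other engines on this slide use filetype:, a query written for one needs a small change before it is reused on the other. A minimal sketch of that rewrite follows; the function name `to_yandex_query` is made up for the example.

```python
import re


def to_yandex_query(query):
    """Rewrite filetype:ext commands as mime:ext, leaving the rest of the query alone."""
    return re.sub(r"\bfiletype:(\w+)", r"mime:\1", query)


yandex_query = to_yandex_query("diabetic retinopathy filetype:pptx")
print(yandex_query)  # diabetic retinopathy mime:pptx
```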

Page 12: Google and Beyond: NOT Google

eTools.ch

Page 13: Google and Beyond: NOT Google

WolframAlpha

http://www.wolframalpha.com/

Computational knowledge engine, curated data

Click Examples, Random, or an image in the homepage background to get an idea of what it covers

Page 14: Google and Beyond: NOT Google

Page 15: Google and Beyond: NOT Google

Free specialist tools for research information and grey literature (anything but Google Scholar!)

Page 16: Google and Beyond: NOT Google

Microsoft Academic Search

http://academic.research.microsoft.com/

Journal articles, pre-prints, post-prints, conference proceedings, reports and white papers

Free to use but the full text of some papers can only be viewed on payment of a fee to the original journal publisher

Author may have several different profiles and articles may be assigned to wrong author

Sometimes very slow to load

Uses Silverlight for the charts

Page 17: Google and Beyond: NOT Google

Microsoft Academic Search

Page 18: Google and Beyond: NOT Google

Microsoft Academic Search

Page 19: Google and Beyond: NOT Google

Open access

US

All research publications resulting from work funded by the US National Institutes of Health are expected to be made freely available and deposited in PubMed Central (http://www.ncbi.nlm.nih.gov/pmc/)

– some material embargoed for up to 12 or 24 months (http://www.ncbi.nlm.nih.gov/pmc/journals/)

– Europe PubMed Central (http://europepmc.org/) part of PMC network of international repositories

UK

1st of April 2013 - researchers at UK Research Institutions are expected to publish as open access any peer-reviewed research papers and conference proceedings that acknowledge Research Councils UK funding

Page 20: Google and Beyond: NOT Google

UK Gold versus Green OA

Gold OA

– researchers publish their articles in journals that offer open access publishing (can be established, conventional publishers)

– articles can be made available free of charge to readers immediately

– author or institution/department pays article processing fee

– copyright license CC-BY

Green OA

– researchers deposit copies of articles in an institutional or subject-based repository, subject to copyright/license permissions

– repository makes copies available to the public either immediately or embargoed (more common)

– period of embargo varies (for example http://cdn.elsevier.com/assets/pdf_file/0018/121293/external-embargo-list.pdf)

– copyright license CC-BY-NC

Page 21: Google and Beyond: NOT Google

Fragmentation of open access

Where are the open access publications?

– Individual OA articles within existing subscription journals?

– Separate OA journals?

– Publisher’s web site?

– Author’s website?

– Institutional repositories

– Aggregators e.g. Scopus, Web of Science, Google?

– “Predatory publishers”

• see Jeffrey Beall’s list at http://scholarlyoa.com/publishers/

Page 22: Google and Beyond: NOT Google

Institutional repositories and open access

BASE - Bielefeld Academic Search Engine http://www.base-search.net/

CORE (COnnecting Repositories) http://core.kmi.open.ac.uk/search

DART-Europe E-theses Portal http://www.dart-europe.eu/basic-search.php

DOAJ: Directory of Open Access Journals http://www.doaj.org/doaj

Institutional Repository Search (IRS) http://irs.mimas.ac.uk/

Open DOAR http://opendoar.org/

RIAN - Pathways to Irish Research http://rian.ie

ROAR - Registry of Open Access Repositories http://roar.eprints.org/

OpenAIRE http://www.openaire.eu/

Page 23: Google and Beyond: NOT Google

Specialist search tools for research information

A selection can be found at http://www.rba.co.uk/search/links.shtml#research

BioMed Central http://www.biomedcentral.com/

ChemSpider http://www.chemspider.com/

Deep Web Technologies Mednar http://mednar.com/

Science.gov http://www.science.gov/

Science Research http://scienceresearch.com/

WorldWideScience http://worldwidescience.org/

Page 24: Google and Beyond: NOT Google

Specialist search tools for research information

PubMed Central http://www.ncbi.nlm.nih.gov/pmc/

Europe PubMed Central http://europepmc.org/

Mendeley http://www.mendeley.com/

Open Biology http://rsob.royalsocietypublishing.org/

Scirus http://www.scirus.com/ [closing in 2014] 

Page 25: Google and Beyond: NOT Google

Grey literature

Literature that has been “peer reviewed” or assessed/approved in some way by colleagues or subject experts but is not easy to find or access

Print run may have been small, possibly never published electronically

Published on the web but page or site is no longer available. (Use web archives e.g. http://www.archive.org/).

Research & technical papers, government reports, pre-prints, market surveys, press releases, committee papers, conference papers, presentations

Use advanced search commands to try to track down information

GreyNet International, Grey Literature Network Service – http://www.greynet.org/ – http://www.opengrey.eu/

Page 26: Google and Beyond: NOT Google

Images

Creative Commons and public domain images

– Flickr Creative Commons http://www.flickr.com/creativecommons

– Wikimedia Commons http://commons.wikimedia.org/

– MorgueFile.com http://www.morguefile.com/

– Wellcome Images http://wellcomeimages.org/

– Most of the images on US government web sites

– NASA http://www.nasa.gov/

Page 27: Google and Beyond: NOT Google

Statistics

Page 28: Google and Beyond: NOT Google

http://www.offstats.auckland.ac.nz/

Page 29: Google and Beyond: NOT Google

Official statistics

NHS Statistics Links – http://www.nhs.uk/Pages/LinkListing.aspx?CategoryId=Statistics

UK National Statistics Publication Hub – http://www.statistics.gov.uk/

Office for National Statistics – http://www.ons.gov.uk/

Welsh Government | Statistics – http://wales.gov.uk/topics/statistics/

Welsh Assembly Government StatsWales – http://statswales.wales.gov.uk/

data.gov.uk – http://data.gov.uk/

Eurostat – http://ec.europa.eu/eurostat/

European Union Open Data Portal – http://open-data.europa.eu/en/

Page 30: Google and Beyond: NOT Google

NHS Statistics Links http://www.nhs.uk/Pages/LinkListing.aspx?CategoryId=Statistics

Page 31: Google and Beyond: NOT Google

UK National Statistics & ONS

http://www.statistics.gov.uk/, http://www.ons.gov.uk/

Page 32: Google and Beyond: NOT Google

Publication Hub (http://www.statistics.gov.uk/) is an “index” to what is available and links through to other sites when necessary.

ONS (http://www.ons.gov.uk/) only shows reports since 2008 even if there are earlier editions. Use the Publication Hub to search for the report title/series. Once you have found an edition of the title click on “Current and past editions” to see the list of editions available. Then click on the relevant report.

Page 33: Google and Beyond: NOT Google

Publication Hub search (statistics.gov.uk)

Page 34: Google and Beyond: NOT Google

data.gov.uk http://data.gov.uk/

Page 35: Google and Beyond: NOT Google

data.gov.uk http://data.gov.uk/

Not all of the data on this site is open data – may be restrictions on use

Download links sometimes take you to the wrong dataset

Download links sometimes completely broken

It’s all or nothing! Have to filter the datasets for the information you want and produce your own graphs and charts

Variety of formats
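
Since the datasets come as whole files, the filtering has to be done after download. The following is a minimal sketch of that step using Python's standard csv module. The column names and figures in `SAMPLE_CSV` are invented for illustration and do not come from any real data.gov.uk dataset; the function name `filter_rows` is also made up.

```python
import csv
import io

# Invented sample standing in for a downloaded CSV file.
SAMPLE_CSV = """region,year,admissions
South West,2012,120
South West,2013,135
London,2013,410
"""


def filter_rows(csv_text, **criteria):
    """Return only the rows whose columns match every given value."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row for row in reader
            if all(row[column] == value for column, value in criteria.items())]


rows = filter_rows(SAMPLE_CSV, region="South West")
total = sum(int(row["admissions"]) for row in rows)
print(len(rows), total)  # 2 255
```

For a real download, the text would come from opening the file rather than from a string, and the column names would need checking against the dataset's own documentation.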

Page 36: Google and Beyond: NOT Google

Eurostat http://ec.europa.eu/eurostat

Page 37: Google and Beyond: NOT Google

European Union - Open Data Portal http://open-data.europa.eu/open-data/

Page 38: Google and Beyond: NOT Google

Datamarket http://datamarket.com/

Open portal to datasets worldwide and market research

Creates visualisations of the data

Page 39: Google and Beyond: NOT Google

Zanran http://www.zanran.com/

Searches graphs, charts, tables, PDFs, spreadsheets

Can limit by location of server, date, and filetype

Title in results list is usually the title or caption to the table and not title of the document

Hover over the thumbnail to see a preview of the table or page

Click on the URL button next to the result to view the original URL of the document

– clicking on it may take you to “page not found” 404

Click on the title of the result to see Zanran’s own copy (free registration usually required)

Useful if document no longer available at its original location and can’t be found in any of the web archives

Page 40: Google and Beyond: NOT Google

Zanran

http://www.zanran.com/

Zanran – great for data in tables, charts and graphs http://www.rba.co.uk/wordpress/2013/01/25/zanran-great-for-data-in-tables-charts-and-graphs/

Page 41: Google and Beyond: NOT Google

Tracking down websites and pages that have disappeared

Page 42: Google and Beyond: NOT Google

UK Government Web Archive | The National Archives http://www.nationalarchives.gov.uk/webarchive/

Browse by category or choose your organisation from an A-Z list

Choose the date of the archived version of the website you want to view

Page 43: Google and Beyond: NOT Google

http://www.legislation.gov.uk/

Page 44: Google and Beyond: NOT Google

Wayback Machine http://www.archive.org/
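
Archived copies in the Wayback Machine can be addressed directly if you know the original URL and a rough date. The sketch below builds two addresses: an archived-page URL of the form `https://web.archive.org/web/<timestamp>/<original URL>`, and a query to the Internet Archive's availability lookup at `https://archive.org/wayback/available`. Both forms are stated here as assumptions about the service and may change; the function names and the example date are illustrative. No request is sent; the code only constructs the addresses.

```python
from urllib.parse import urlencode


def wayback_url(original_url, timestamp):
    """Address of the archived copy closest to a YYYYMMDD timestamp."""
    return f"https://web.archive.org/web/{timestamp}/{original_url}"


def availability_lookup_url(original_url, timestamp=None):
    """Address that asks whether an archived copy exists near the timestamp."""
    params = {"url": original_url}
    if timestamp:
        params["timestamp"] = timestamp
    return "https://archive.org/wayback/available?" + urlencode(params)


# Illustrative example: this workshop's home site around the date of the Exeter session.
archived = wayback_url("http://www.rba.co.uk/", "20131111")
lookup = availability_lookup_url("http://www.rba.co.uk/", "20131111")
print(archived)
print(lookup)
```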
