google and beyond: not google
DESCRIPTION
Slides for a workshop held for NHS South West in Exeter and Bristol. This session was on alternatives to GooglTRANSCRIPT
11/04/23 www.rba.co.uk 1
Google and Beyond: not GoogleNHS South West
Monday, 11th November 2013, ExeterThursday, 14th November 2013, Bristol
This presentation is licensed under a Creative Commons Attribution 3.0 License
Karen Blakeman, RBA Information [email protected], http://www.rba.co.uk/search/
twitter.com/karenblakeman, http://google.com/+KarenBlakeman/, http://www.linkedin.com/in/karenblakeman
Slides will be available on http://www.authorstream.com and http://www.slideshare.com/. Also available temporarily at http://www.rba.co.uk/as/
X X X
Bing/Yahoo
http://www.bing.com/ http://www.yahoo.com/
Yahoo now uses Bing’s database, commands and ranking algorithms
No advanced search screen - use commands. List at Advanced Operator Reference http://msdn.microsoft.com/en-us/library/ff795620.aspx
filetype: site: inbody: inurl:
AND, NOT, OR parentheses for complex Boolean searches
NEAR:n where n is a number, specifies that the terms must be within that number of words of each other and in any order
- director NEAR:3 marketing
11/04/23 www.rba.co.uk 2
bing.com
11/04/23 www.rba.co.uk 3
bingiton.com
11/04/23 www.rba.co.uk 4
Bing
Results seem to be more consumer/retail focused– more ‘shopping’ than research
– results improve as soon as you start using the advanced search commands
Sometimes more up to date than Google– updates sites more frequently
– adds new sites more quickly
– useful if you are looking for information on a new company or organisation
Many features and options available to US users only– changing your location does not always work
– using anonymous proxy does not always work
Excellent for images
11/04/23 www.rba.co.uk 5
Bing images
11/04/23 www.rba.co.uk 6
DuckDuckGo – http://duckduckgo.com/
Does not track, does not personalise
Results are a compilation of about 50 sources including Wikipedia, Wolfram Alpha, Bing, Blekko and its own Web crawler DuckDuckBot
Advanced search commands include:
site: inbody: intitle: filetype: sort:date to sort by date (uses results from Blekko)region:cc (e.g. de) to boost a country
DuckDuckGo Syntax http://help.duckduckgo.com/customer/portal/articles/300304
DuckDuckGo – silly name but a neat little search tool http://www.rba.co.uk/wordpress/2011/11/07/duckduckgo-silly-name-but-a-neat-little-search-tool/ 11/04/23 www.rba.co.uk 7
DuckDuckGo
11/04/23 www.rba.co.uk 8
Millionshort http://millionshort.com
Million Short: unearthing information hidden in the dungeons of Google’s results
– http://www.rba.co.uk/wordpress/2012/10/04/million-short-unearthing-stuff-hidden-in-the-dungeons-of-googles-results/
Uses Bing API plus other sources
Great for finding specialist articles that Google buries beyond reach
Removes top 10k sites from results - can change to top million, 100k, 1k, 100
Can add sites back in, can block sites
Can “Boost!” sites so that they always appear at the top
Can use site: and filetype: commands
Country versions give different results (under Manage Settings and Country)11/04/23 www.rba.co.uk 9
Million Short
11/04/23 www.rba.co.uk 10
Yandex http://www.yandex.com/
– for filetype use mime:
diabetic retinopathy mime:pptx
– has an advanced search screen at http://yandex.com/search/advanced
Blekko http://www.blekko.com/
Ask http://www.ask.com/
Teoma http://www.teoma.com/
– all three support filetype: and site:
11/04/23 www.rba.co.uk 11
eTools.ch
11/04/23 www.rba.co.uk 12
WolframAlpha
http://www.wolframalpha.com/
Computational knowledge engine, curated data
Click Examples, Random, or an image in the homepage background to get an idea of what it covers
11/04/23 www.rba.co.uk 13
11/04/23 www.rba.co.uk14
11/04/23 www.rba.co.uk 15
Free specialist tools for research information and grey literature (anything but Google
Scholar!)
Microsoft Academic Search
http://academic.research.microsoft.com/
Journal articles, pre-prints, post-prints, conference proceedings, reports and white papers
Free to use but the full text of some papers can only be viewed on payment of a fee to the original journal publisher
Author may have several different profiles and articles may be assigned to wrong author
Sometimes very slow to load
Uses Silverlight for the charts
11/04/23 www.rba.co.uk 16
Microsoft Academic Search
11/04/23 www.rba.co.uk 17
Microsoft Academic Search
11/04/23 www.rba.co.uk 18
Open access
US
All research publications resulting from work funded by the US National Institutes of Health are expected to be made freely available and deposited in PubMed Central (http://www.ncbi.nlm.nih.gov/pmc/)
– some material embargoed for up to 12 or 24 months (http://www.ncbi.nlm.nih.gov/pmc/journals/)
– Europe PubMed Central (http://europepmc.org/) part of PMC network of international repositories
UK
1st of April 2013 - researchers at UK Research Institutions are expected to publish as open access any peer‐reviewed research papers and conference proceedings that acknowledge Research Council UK funding
11/04/23 www.rba.co.uk 19
UK Gold versus Green OA
Gold OA– researchers publish their articles in journals that offer open
access publishing (can be established, conventional publishers)– articles can be made available free of charge to readers
immediately – author or institution/department pays article processing fee– copyright license CC-BY
Green OA – researchers deposit copies of articles in an institutional or
subject-based repository, subject to copyright/license permissions
– repository makes copies available to the public either immediately or embargoed (more common)
– period of embargo varies (for example http://cdn.elsevier.com/assets/pdf_file/0018/121293/external-embargo-list.pdf)
– copyright license CC-BY-NC11/04/23 www.rba.co.uk 20
Fragmentation of open access
Where are the open access publications?– Individual OA articles within existing subscription
journals?
– Separate OA journals?
– Publishers web site?
– Author’s website
– Institutional repositories
– Aggregators e.g. Scopus, Web of Science, Google?
– “Predatory publishers”
• see Jeffrey Beall’s list at http://scholarlyoa.com/publishers/
11/04/23 www.rba.co.uk 21
Institutional repositories and open access
BASE - Bielefeld Academic Search Engine http://www.base-search.net/
CORE (COnnecting Repositories) http://core.kmi.open.ac.uk/search
DART-Europe E-theses Portal http://www.dart-europe.eu/basic-search.php
DOAJ: Directory of Open Access Journals http://www.doaj.org/doaj
Institutional Repository Search (IRS) http://irs.mimas.ac.uk/
Open DOAR http://opendoar.org/
RIAN - Pathways to Irish Research http://rian.ie
ROAR - Registry of Open Access Repositories http://roar.eprints.org/
OpenAIRE http://www.openaire.eu/
11/04/23 www.rba.co.uk 22
Specialist search tools for research information
A selection can be found at http://www.rba.co.uk/search/links.shtml#research
BioMed Central http://www.biomedcentral.com/
ChemSpider http://www.chemspider.com/
Deep Web TechnologiesMednar http://mednar.com/
Science.gov http://www.science.gov/
Science Research http://scienceresearch.com/
WorldWideScience http://worldwidescience.org/
11/04/23 www.rba.co.uk 23
Specialist search tools for research information
PubMed Central http://www.ncbi.nlm.nih.gov/pmc/
Europe PubMed Central http://europepmc.org/
Mendeley http://www.mendeley.com/
Open Biology http://rsob.royalsocietypublishing.org/
Scirus http://www.scirus.com/ [closing in 2014]
11/04/23 www.rba.co.uk 24
Grey literature
Literature that has been “peer reviewed” or assessed/approved in some way by colleagues or subject experts but is not easy to find or access
Print run may have been small, possibly never published electronically
Published on the web but page or site is no longer available. (Use web archives e.g. http://www.archive.org/).
Research & technical papers, government reports, pre-prints, market surveys, press releases, committee papers, conference papers, presentations
Use advanced search commands to try and track down information
GreyNet International, Grey Literature Network Service – http://www.greynet.org/ – http://www.opengrey.eu/
11/04/23 www.rba.co.uk 25
Images
Creative Commons and public domain images
– Flickr Creative Commons http://www.flickr.com/creativecommons
– Wikimedia Commons http://commons.wikimedia.org/
– MorgueFile.com http://www.morguefile.com/
– Wellcome Images http://wellcomeimages.org/
– Most of the images on US government web sites
– Nasa http://www.nasa.gov/
11 April 2023 Karen Blakeman www.rba.co.uk 26
Statistics
11/04/23 www.rba.co.uk 27
Official statistics
NHS Statistics Links– http://www.nhs.uk/Pages/LinkListing.aspx?CategoryId=Statistics
UK National Statistics Publication Hub– http://www.statistics.gov.uk/
Office for National Statistics– http://www.ons.gov.uk/
Welsh Government | Statistics– http://wales.gov.uk/topics/statistics/
Welsh Assembly Government StatsWales– http://statswales.wales.gov.uk/
data.gov.uk – http://data.gov.uk/
Eurostat – http://ec.europa.eu/eurostat/
European Union Open Data Portal – http://open-dat.europa.eu/en/
11/04/23 www.rba.co.uk 29
NHS Statistics Links http://www.nhs.uk/Pages/LinkListing.aspx?CategoryId=Statistics
11/04/23 www.rba.co.uk 30
UK National Statistics & ONS
http://www.statistics.gov.uk/, http://www.ons.gov.uk/
11/04/23 www.rba.co.uk 31
Publication Hub (http://www.statistics.gov.uk/) is an “index” to what is available and links through to other sites when necessary.
ONS (http://www.ons.gov.uk/) only shows reports since 2008 even if there are earlier editions. Use the Publication Hub to search for the report title/series. Once you have found an edition of the title click on “Current and past editions” to see the list of editions available. Then click on the relevant report.
11/04/23 www.rba.co.uk 32
Publication Hub search (statistics.gov.uk)
11/04/23 www.rba.co.uk 33
data.gov.uk http://data.gov.uk/
11/04/23 www.rba.co.uk 34
data.gov.uk http://data.gov.uk/
Not all of the data on this site is open data – may be restrictions on use
Download links sometimes take you to the wrong dataset
Download links sometimes completely broken
It’s all or nothing! Have to filter the datasets for the information you want and produce your own graphs and charts
Variety of formats
11/04/23 www.rba.co.uk 35
Eurostat http://ec.europa.eu/eurostat
11/04/23 www.rba.co.uk 36
European Union - Open Data Portal http://open-data.europa.eu/open-data/
11/04/23 www.rba.co.uk 37
Datamarket http://datamarket.com/
Open portal to datasets worldwide and market research
Creates visualisations of the data
11/04/23 www.rba.co.uk 38
Zanran http://www.zanran.com/
Searches graphs, charts, tables, PDFs, spreadsheets
Can limit by location of server, date, and filetype
Title in results list is usually the title or caption to the table and not title of the document
Hover over the thumbnail to see a preview of the table or page
Click on the URL button next to the result to view the original URL of the document
– clicking on it may take you to “page not found” 404
Click on the title of the result to see Zanran’s own copy (free registration usually required
Useful if document no longer available at it’s original location and can’t be found in any of the web archives
11/04/23 www.rba.co.uk 39
Zanran
http://www.zanran.com/
Zanran – great for data in tables, charts and graphs http://www.rba.co.uk/wordpress/2013/01/25/zanran-great-for-data-in-tables-charts-and-graphs/
11/04/23 www.rba.co.uk 40
Tracking down websites and pages that have disappeared
11/04/23 www.rba.co.uk 41
UK Government Web Archive | The National Archives http://www.nationalarchives.gov.uk/webarchive/
Browse by category or choose your organisation from an A-Z listChoose the date of the archived version of the website you want to view
11/04/23 www.rba.co.uk 42
http://www.legislation.gov.uk/
11/04/23 www.rba.co.uk 43
Wayback Machine http://www.archive.org/
11/04/23 www.rba.co.uk 44