microsoft academic search overview at nfais 2012 - lee dirks
DESCRIPTION
TRANSCRIPT
Microsoft Academic Search:
Next-Generation Scholarly Discovery
Lee Dirks | Director—Portfolio Strategy Microsoft Research
Agenda
• MSR + Connections Overview• Working with Publishers / Content
Partners• Market Reception• Business Model• Introduction to Academic Search • Demo
• API + usage examples• Next Steps Our Roadmap
2
ProductGroups
5-10 years +2-4 yearsPresent
MicrosoftResearch
Graphics &Multimedia
Human-ComputerInteraction
Machine Learning
Hardware& Devices
Computer Systems & Devices
Communication& Collaboration
ComputationalLinguistics
ComputationalSciences
Information Retrieval& Management
Security &Privacy
MicrosoftLabs
Rich Media Labs
E&D Labs
India Labs
Israel Labs
Live Labs Mobile Labs
Office Labs
Search Labs
Startup BusinessAccelerator
Startup Labs
ProductGroups
Microsoft Research | ConnectionsOutreach. Collaboration. Innovation.
44
• Division within Microsoft Research focused on partnerships between academia, industry and government to advance computer science, education, and research in fields that rely heavily upon advanced computing
• Supporting groundbreaking research to help advance human potential and the wellbeing of our planet
• Developing advanced technologies and services to support every stage of the research process
• Microsoft Research Connections is committed to interoperability and to providing open access, open tools, and open technology
http://research.microsoft.com/collaboration/
Explore over 38.8 million publications…and growinghttp://academic.research.microsoft.com
MSR Academic Search data comes from open access repositories, publishers, and web crawls – Currently 38.8M papers across ~15 domains• 75M+ papers in the queue
– More enhancements to come…
Working with publishers
70+ Content Providers growing weekly
Signed Signed & Indexed
70+ Content Partners SignedPartner Papers AAAS 225,000 American Geophysical Union 141,000 American Institute of Physics 437,000 American Medical Association 125,000 American Psychological Association 194,500 Annual Reviews 30,000 arXiv 680,000 Association for Computing Machinery 80,000 Astrophysics Data System (ADS) 8,700,000 BASE 29,000,000 bepress 500,000 BioMed Central 88,000 BioOne 85,000 BMJ 577,000 Brill 100,000 Cambridge University Press 580,000 Central & Eastern European Online Library: CEEOL 131,200 CERN Document Server 200,000 CiteSeer 750,000 Commonwealth Scientific and Industrial Research Org 50,000 CrossRef 46,000,000 Digital.CSIC 35,153 Elsevier 9,500,000 Emerald 150,000 HighWire 3,197,000 Hindawi Publishing Corporation 40,000 Humanities Text Initiative 50,000 IADIS 6,500 IEEE 625,000 IGI Global 14,000 Information Bridge: DOE 280,000 Institute of Physics 425,046 InTech 8,000 ITHAKA/JSTOR 1,000,000 Karger AG 260,000 M.E. Sharpe 100,000 MedKnow 60,000 MetaPress 5,600,000 MIT Press 23,000 National Institute of Informatics 2,800,000 NDLTD 1,600,000 OAIster 23,000,000 Oxford University Press 650,000 PNAS 103,000 PolicyArchive 30,000 Project Euclid 108,436 Project Muse 200,000 Public Knowledge Project 18,000 Public Library of Science 130,000 Publishing Technology Plc. (Ingenta) 5,200,000 PubMed 22,000,000 Qscience 1,000 RACO 116,054 RePEc 970,000 Royal Society 65,000 Royal Society of Chemistry 300,000 Royal Society of Medicine 18,000 Sage 700,000 Social Science Research Network (SSRN) 263,000 Springer 4,900,000 VGTU Press 1,100
Signed
Partner Papers Mary Ann Liebert, Inc. 80,000 SIAM 40,000 Thieme 300,000
Imminent
In DiscussionPartner Papers AgEcon Search 40,001 Allen Press ???? American Chemical Society 900,799 American Institute of Aeronautics and Astronautics (AIAA) 165,000 American Society of Civil Engineers 80,000 American Society of Mechanical Engineers 60,000 Atypon 10,000,000 De Gruyter 50,000 Nature Publishing Group 700,000 New England Journal of Medicine 185,000 Optical Society of America (OSA) 200,000 Silverchair ???? SPIE 300,000 Taylor & Francis 1,300,000 Trove (National Library of Australia) 800,000 USGS Publications Warehouse 75,000 Wiley-Blackwell 4,000,000 William S. Hein & Co. 1,000,000 Wolters Kluwer Health, Medical Research 1,100,000
“…Meanwhile, Microsoft Academic Search (MAS), which launched in 2009 and has a tool similar to Google Scholar, has over the past few months added a suite of nifty new tools based on its citation metrics (go.nature.com/u1ouut). These include visualizations of citation networks (see 'Mapping the structure of science'); publication trends; and rankings of the leading researchers in a field.”
4 August 2011 | Nature 476, 18 (2011) (doi:10.1038/476018a)
Analyst Coverage
12
Outsell: 10 to Watch Google Scholar and Microsoft Academic Search“Two giants from outside the main STM arena threaten to shake up the space for STM metrics and analytical services. Google Scholar and Microsoft Academic Search announced new functionalities that could disrupt those offered as commercial services from Thomson Reuter’s Web of Knowledge and Elsevier’s Scopus.”
“Now Google Scholar is offering its own citation tracker functionality – free to use, of course – that can allow authors to calculate performance metrics such as the h-index. Microsoft, meanwhile, has focused on adding functionality to its search results, such as dynamic lists of top authors/publications/journals/organisations.”
Outsell: A compelling user experience
13
• “…There are a number of interesting features here which make it a good alternative to Google Scholar [emphasis added] – in particular, …it provides some nice graphics, and some handy links in the left bar to refine your search by particular authors, journals or keywords.
• “The other tools on the Microsoft academic site are really well-implemented and a lot of fun as well, particularly if you’re a published researcher.
• “You can use the co-author graph and citation graph tools to generate beautiful visual representations of who’s citing your work, or who you’re connected to through authorship…”*
* Ware, Mark (2011), “Scientific, Technical & Market Information: 2011 Market Forecast and Trends Report,” Outsell (November 30)
14
demo
PapersEdit
Alerts
Links to fulltext
References & Citing Papers
Citation History and Context
Export
Citation History and Context
publication
keyword
Alerts
Usage History
Definitions
Top …Top …
journal
Top …Top …
Sort
Author network
Citing Papers
Embed
author
Domain Trends
Compare
organization
Call for Papers
conferences
Ranking
Embedding
Public API• Application Programming Interface– Supports queries against all academic entities and their
basic info• With the API, you can– Work with others to share info– Help users to build useful clients
• All openly available to everyone– Targeting the academic community– API is available for non-commercial use only
API details at http://academic.research.microsoft.com/About/Help.htm#5
Entity-Based DiscoveryArticle/Person/Organization/Event
Entity Count
Publications 38,869,302
Citations 87,777,630
Authors 16,875,458
Conferences 2,812
Journals 13,583
Organizations 19,237
Domain/subdomains 219
http://sciencecard.org/
Next up…• Growth
– More content coverage – Authenticated Access / Library Links – Committed to implementing CrossMark– Exploring partnerships – New usage scenarios (including data citation)
• Expanded content types
• Our Commitments– MAS is a web service for researchers, by researchers. – We intend to be an open platform for the community, and we
want to commit to the community to keep this service open, transparent, and a “sandbox” available to researchers for exploration and experimentation.
– We are very interested to evolving this service to better represent how science/academia works. As protocols and/or standards emerge, we hope to embody these as part of MAS moving forward.
(Associate Member)
(Founding Sponsor)
http://www.lounginlizzard.com/mountain-life-t-shirt-1.html
37
Thank you!
Lee DirksDirector, Portfolio Strategy
Microsoft Research | Connections
[email protected] or [email protected] – http://www.microsoft.com/scholarlycomm/Facebook: Scholarly Communication at Microsoft
@MSFTAcademic
38
• Tell us more about your projects, your workflows, your issues. We’re always in “requirements gathering mode”• Email us at [email protected] with questions and
ideas• Especially if you are already utilizing Microsoft
technologies
• Download our add-ins and try them, and then give us feedback! • Let us know if we can facilitate a connection with the
appropriate product group(s)
• Follow announcements via our RSS feed or via our Facebook group
How to engage with Microsoft Research Connections