elsevier smart content ldr semtech 2012

Elsevier Health Sciences SemTechBiz 2012 Conference June 5, 2012 Alan Yagoda VP, Business Technology [email protected] @alanyagoda Smart Content Drives Smart Applications The Future Of Using Knowledge In Healthcare

Upload: alan-yagoda

Post on 30-Apr-2015


DESCRIPTION

A session from SemTech SF in June 2012. Abstract: As access to a richer set of knowledge and research continues to be critical to the healthcare community, users of healthcare and life science solutions are demanding the same level of discoverability, integration, and innovation from their professional tools that they enjoy in their personal applications. Through the Smart Content initiative, Elsevier seeks to semantically enrich its diverse offerings of health sciences content, both to improve the performance of existing online resources and to enable the creation of the next generation of digital products. In this session, Alan Yagoda will discuss Elsevier’s efforts in developing Smart Content capabilities to power a new portfolio of strategic product offerings. The journey into smarter search and discovery resulted in a new infrastructure with a rich set of semantic capabilities, including the development of a standardized medical taxonomy called EMMeT (Elsevier’s Merged Medical Taxonomy), indexing and content enrichment, and linked data services.

TRANSCRIPT

Page 1: Elsevier Smart Content LDR SemTech 2012

Elsevier Health Sciences

SemTechBiz 2012 Conference June 5, 2012

Alan Yagoda VP, Business Technology [email protected] @alanyagoda

Smart Content Drives Smart Applications The Future Of Using Knowledge In Healthcare

Page 2: Elsevier Smart Content LDR SemTech 2012

Elsevier is the largest scientific, technical, and medical publisher in the world. In Health Sciences, Elsevier publishes leading brands including The Lancet, Braunwald’s Heart Disease, Gray’s Anatomy, and the Netter Atlases, among others. In addition, Elsevier produces leading online clinical support tools and products, including:

•  MD Consult
•  Procedures Consult
•  Mosby’s Nursing Consult
•  CPMRC Nursing Care Plans
•  Gold Standard Drug Database
•  MEDai Analytics for Managed Care Plans

Elsevier Proprietary and Confidential

About Elsevier

Page 3: Elsevier Smart Content LDR SemTech 2012

The Challenge

Page 4: Elsevier Smart Content LDR SemTech 2012

The Challenge: Getting doctors the right information to make the best decisions and provide the best clinical care

•  Trusted: Authoritative medical and surgical content from Elsevier.
•  Comprehensive: Integrated Medline and 3rd party journal content.
•  Speed To Answer: Fast discoverability of the most relevant answers and more intuitive searching.


Page 5: Elsevier Smart Content LDR SemTech 2012

Introducing Smart Content


Page 6: Elsevier Smart Content LDR SemTech 2012

Copyright 2011 Outsell Gilbane Services, Inc.

http://www.outsellinc.com http://gilbane.com/xml/2009/11/what-is-smart-content.html#ixzz0hnuRhaBc

Taxonomy-Powered Content = Smart Content

Content today with structured XML

Content with applied taxonomy


Page 7: Elsevier Smart Content LDR SemTech 2012

Smart Content At Elsevier


Entities, concepts and relationships

Smart Content Applications

Better understanding through analysis and visualization
•  Tag clouds
•  Heatmaps
•  Streamgraphs
•  Scatterplots
•  Time series
•  Animations

Better discovery through semantic search & navigation
•  Faceted search & browse
•  Ontology-driven navigation
•  Task-specific results
•  Personalized/localized results
•  Question answering
•  Link to evidence-based content

New knowledge through aggregation and synthesis
•  Topic pages
•  Social network maps
•  Geolocation maps
•  Data mashups
•  Text mining reports

Diagram labels: Elsevier content (images, text, tables); Elsevier knowledge organization systems; linked data from partners and the Web

Page 8: Elsevier Smart Content LDR SemTech 2012

Concept Mapping

Making Smart Content Work in the Clinical Setting

•  250K+ Core Clinical Concepts
•  1M+ Hierarchical Relationships
•  1M+ Ontological Relationships
•  1M+ Synonyms

•  Vast amounts of content made easily discoverable
•  Specialty-specific navigation
•  Dynamic clinical summary creation
•  Meaningful related content recommendations

Diagram labels (content types): Patient Ed, Drug Info, Procedural Videos, Clinical Summaries, Books, Journals, Guidelines, Clinical Trials

Diagram labels (taxonomy): EMMeT, Elsevier Custom, UMLS

Elsevier Merged Medical Taxonomy (EMMeT)


Page 9: Elsevier Smart Content LDR SemTech 2012

Introducing EMMeT (Elsevier Merged Medical Taxonomy)

Medical Name: Malignant Neoplasm of the Breast
Consumer Friendly Name: Breast Cancer
Synonyms: Malignant Tumor of Breast, Malignant Breast Neoplasm, Breast Ca
Codes: ICD9 – 174.9, MeSH – D001943, SNOMED-CT – 190121004
Semantic Type/Group: Neoplastic Process/Disease

Parent Terms
•  Breast Disorders
•  Cancer of the Thorax
•  Mammary Neoplasms
•  More…

Children Terms
•  Breast Sarcoma
•  Familial Breast Cancer
•  Malignant lymphoma of the Breast
•  Malignant Neoplasm of the breast outer quadrant
•  More…

Semantic Relationships
•  Symptoms: Breast Lump, Nipple Retraction, …
•  Diagnostic Procedures: Mammography, Breast Biopsy, …
•  Treatment Procedures: Chemotherapy, Mastectomy, …
•  Medications: Tamoxifen, Doxorubicin, …
•  Risk Factors: Family History, Genetics, Predisposition, …
•  Prevention: Screening, Preemptive Mastectomy, …
•  Complications: Metastatic Cancer, …
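To make the shape of a concept record concrete, here is a minimal Python sketch of how the breast cancer example on this slide could be held in memory and looked up by any of its names. The `Concept` class, its field names, and the `resolve` helper are illustrative assumptions, not EMMeT's actual schema or API; only the values come from the slide.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """Hypothetical record for one taxonomy concept (not EMMeT's real schema)."""
    medical_name: str
    consumer_name: str
    synonyms: list = field(default_factory=list)
    codes: dict = field(default_factory=dict)
    parents: list = field(default_factory=list)
    children: list = field(default_factory=list)
    relations: dict = field(default_factory=dict)

    def labels(self):
        """Every name a user might type, lower-cased for matching."""
        names = [self.medical_name, self.consumer_name, *self.synonyms]
        return {name.lower() for name in names}


# Values copied from the slide; lists are truncated just as the slide truncates them.
breast_cancer = Concept(
    medical_name="Malignant Neoplasm of the Breast",
    consumer_name="Breast Cancer",
    synonyms=["Malignant Tumor of Breast", "Malignant Breast Neoplasm", "Breast Ca"],
    codes={"ICD9": "174.9", "MeSH": "D001943", "SNOMED-CT": "190121004"},
    parents=["Breast Disorders", "Cancer of the Thorax", "Mammary Neoplasms"],
    children=["Breast Sarcoma", "Familial Breast Cancer"],
    relations={
        "Symptoms": ["Breast Lump", "Nipple Retraction"],
        "Medications": ["Tamoxifen", "Doxorubicin"],
    },
)


def build_label_index(concepts):
    """Map each lower-cased label to the concept it names."""
    index = {}
    for concept in concepts:
        for label in concept.labels():
            index[label] = concept
    return index


def resolve(index, query):
    """Return the concept named by the query, or None if no label matches."""
    return index.get(query.strip().lower())


index = build_label_index([breast_cancer])
```

With this index, a clinician's shorthand such as "Breast Ca" and the consumer term "Breast Cancer" both resolve to the same concept, which is the point of merging synonyms under one entry.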


Page 10: Elsevier Smart Content LDR SemTech 2012

Automated Indexing: Weighted Tags for Better Search

Paragraph-level SMART Content tags uncover highly-relevant information not necessarily evident from the title or abstract alone.

Article-level SMART Content tags help confirm relevance and provide a topical overview about a piece of content.
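The slide does not say how tag weights are computed or combined, so the following Python sketch only illustrates the general idea under stated assumptions: paragraph-level tags carry a weight between 0 and 1, article-level tags are obtained by summing those weights per concept, and paragraphs are ranked for a concept by weight. The data, the summing rule, and both function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical paragraph-level tags: (paragraph id, concept, weight in [0, 1]).
paragraph_tags = [
    ("p1", "Delirium", 0.9),
    ("p2", "Delirium", 0.6),
    ("p2", "Hypoglycemia", 0.4),
    ("p3", "Antipsychotic Agents", 0.8),
]


def article_level_tags(tags):
    """Roll paragraph tags up to article tags by summing weights per concept."""
    totals = defaultdict(float)
    for _paragraph, concept, weight in tags:
        totals[concept] += weight
    # Highest total first, giving a topical overview of the article.
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


def best_paragraphs(tags, concept):
    """Paragraphs tagged with a concept, strongest match first."""
    hits = [(pid, weight) for pid, c, weight in tags if c == concept]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)
```

Here `article_level_tags` plays the role of the article-level overview, while `best_paragraphs` surfaces a passage that a title or abstract search alone could miss.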


Page 11: Elsevier Smart Content LDR SemTech 2012

Search & Discovery: ClinicalKey


Page 12: Elsevier Smart Content LDR SemTech 2012

EMMeT Powered Auto-Suggest
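The slide gives only the feature name, so what follows is a generic sketch of taxonomy-backed auto-suggest rather than a description of the ClinicalKey implementation. It assumes suggestions are drawn from a sorted list of concept labels and synonyms and matched by prefix; the label list and the `suggest` function are hypothetical.

```python
import bisect

# Hypothetical label list; in practice it would hold every preferred name and synonym.
labels = sorted(
    label.lower()
    for label in [
        "Breast Cancer",
        "Breast Ca",
        "Breast Lump",
        "Breast Biopsy",
        "Breast Sarcoma",
        "Mammography",
    ]
)


def suggest(prefix, limit=5):
    """Return up to `limit` labels that start with the typed prefix."""
    prefix = prefix.lower()
    # Binary search finds the first label that could match the prefix.
    start = bisect.bisect_left(labels, prefix)
    matches = []
    for label in labels[start:]:
        if not label.startswith(prefix):
            break
        matches.append(label)
        if len(matches) == limit:
            break
    return matches
```

Because the labels are sorted, all matches for a prefix sit next to each other, so the scan stops at the first non-matching label.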


Page 13: Elsevier Smart Content LDR SemTech 2012

Speed to Answer: Most relevant preview

Page 14: Elsevier Smart Content LDR SemTech 2012

Linked Data Repository


Page 15: Elsevier Smart Content LDR SemTech 2012

Linked Data Repository (LDR): Warehouse for Smart Content Enhancements

Example document: Evaluation and management of delirium in hospitalized older patients

Delirium is common in hospitalized older patients and may be a symptom of a medical emergency, such as hypoxia or hypoglycemia. It is characterized by an acute change in cognition and attention, although the symptoms may be subtle and usually fluctuate throughout the day. This heterogeneous syndrome requires prompt recognition and evaluation, because the underlying medical condition may be life threatening. Risk factors for delirium include visual impairment, previous cognitive impairment, severe illness, and an elevated blood urea nitrogen/serum creatinine ratio. Interventions that have been shown to reduce the incidence of delirium in at-risk hospitalized patients include repeated reorientation of the patient to person and place, promotion of good sleep hygiene, early mobilization, correction of dehydration, and the minimization of unnecessary noise and stimuli. The treatment of delirium centers on the identification and management of the medical condition that triggered the delirious state. Nonpharmacologic interventions may be beneficial, but antipsychotic agents may be needed when the cause is nonspecific and other interventions do not sufficiently control symptoms such as severe agitation or psychosis. Although delirium is a temporary condition, it may persist for several months in the most vulnerable patients. Patient outcomes at one year include a higher mortality rate and a lower level of functioning compared with age-matched control patients.

Copyright © 2008 American Academy of Family Physicians.

Annotation types highlighted in the example: Title, Disease, Clinical finding, Drugs, Source

•  Service that provides a rich semantic layer on top of content and enables search and discovery of metadata
•  Transforms content into data to allow exploration of Elsevier-wide knowledge base
•  Opens up discovery and utility of content beyond searchable documents
•  Extends Elsevier extracted knowledge by interlinking data with other related sources of content from partners and the web
•  Optimized for high-volume read-write of RDF data
•  Provides service layer APIs for ease of integration
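As a rough illustration of turning content into data, the Python sketch below tags the first sentence of the delirium abstract with typed annotations and character offsets. The slide does not describe the real extraction method; the dictionary lookup, the `LEXICON` entries, and the assignment of each term to a type are all assumptions made for the example.

```python
import re

# Hypothetical mini-lexicon; type names follow the slide's legend,
# but which term gets which type is assumed here.
LEXICON = {
    "delirium": "Disease",
    "hypoxia": "Clinical finding",
    "hypoglycemia": "Clinical finding",
    "antipsychotic agents": "Drugs",
}


def annotate(text):
    """Return typed annotations with character offsets, in reading order."""
    annotations = []
    for term, kind in LEXICON.items():
        pattern = r"\b" + re.escape(term) + r"\b"
        for match in re.finditer(pattern, text, flags=re.IGNORECASE):
            annotations.append(
                {
                    "start": match.start(),
                    "end": match.end(),
                    "text": match.group(0),
                    "type": kind,
                }
            )
    return sorted(annotations, key=lambda a: a["start"])


sentence = (
    "Delirium is common in hospitalized older patients and may be a symptom "
    "of a medical emergency, such as hypoxia or hypoglycemia."
)
found = annotate(sentence)
```

Each annotation is a small record that can be stored and queried separately from the document text, which is what lets the content be explored as data.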


Page 16: Elsevier Smart Content LDR SemTech 2012

Represent Enhancements and Vocabularies In RDF Satellites


LDR

Example RDF Statements
•  Tags from a taxonomy for a given document
•  Document sections relevant to a given concept
•  Document sections providing answers to a given question
•  Genes mentioned in a given document
•  Documents supporting or disputing conclusions of a given document
•  Concepts in the areas of expertise for a given author

Creation of Satellite Standards
•  Linked data compliant RDF representing metadata objects
•  Leverage common namespaces from dct, pav, rdf, skos
•  Taxonomies in SKOS to enhance portability in the linked data world
•  Subject tagging against a vocabulary representing extracted knowledge
•  Concept URIs that can be equated to URIs in linked data

Delivery Infrastructure
•  Product-specific indexes generating RDF “Smart Tags”
•  Data pipeline transformations for building semantic warehouse
•  Exposed through linked data delivery services
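To show what such statements can look like, the sketch below writes a SKOS concept and a document tag as N-Triples using only the Python standard library. The SKOS and Dublin Core Terms namespaces are the standard ones; the `example.org` base URIs, the identifiers, and the choice of `dct:subject` for tagging are assumptions, since the slides do not show Elsevier's actual URIs or predicates.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
DCT = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Hypothetical base URIs; the slides do not give the real ones.
VOCAB = "http://example.org/vocab/"
DOCS = "http://example.org/doc/"


def uri(value):
    """Wrap an IRI in angle brackets as N-Triples requires."""
    return f"<{value}>"


def literal(text, lang="en"):
    """Serialize a language-tagged string literal with basic escaping."""
    escaped = (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"@{lang}'


def triple(subject, predicate, obj):
    return f"{subject} {predicate} {obj} ."


def concept_triples(concept_id, pref_label, alt_labels=(), broader=()):
    """Describe one taxonomy concept in SKOS."""
    s = uri(VOCAB + concept_id)
    lines = [
        triple(s, uri(RDF_TYPE), uri(SKOS + "Concept")),
        triple(s, uri(SKOS + "prefLabel"), literal(pref_label)),
    ]
    lines += [triple(s, uri(SKOS + "altLabel"), literal(a)) for a in alt_labels]
    lines += [triple(s, uri(SKOS + "broader"), uri(VOCAB + b)) for b in broader]
    return lines


def tag_triple(doc_id, concept_id):
    """State that a document is tagged with a concept (assumed: dct:subject)."""
    return triple(uri(DOCS + doc_id), uri(DCT + "subject"), uri(VOCAB + concept_id))


statements = concept_triples(
    "breast-cancer",
    "Malignant Neoplasm of the Breast",
    alt_labels=["Breast Cancer", "Breast Ca"],
    broader=["breast-disorders"],
) + [tag_triple("article-123", "breast-cancer")]
```

Because the concept has its own URI, the same identifier can appear in the vocabulary description and in any number of document tags, and it can later be equated to URIs in external linked data sets.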

Page 17: Elsevier Smart Content LDR SemTech 2012

LDR Semantic Infrastructure

[Architecture diagram. Labels recovered from the slide; the grouping under each heading is approximate.]

Smart Content Indexing Pipeline
•  Elsevier Content, 3rd Party Content, Instit. Content
•  Tagging and Indexing Services (Concepts, Chapters, Articles, Guidelines, etc.), RDF Generation
•  EMMeT Semantic Network, Vocabulary SKOS Generation
•  Product-specific Smart Content Search Index

Amazon S3
•  Vocab & Annotation RDF Satellites (Linked Data)

Linked Data Pipeline Services (Hadoop)
•  JSON Transform, N-Quads Extract, Reasoning, Interlinking, RDF Validation, Ontology Svcs, …

Linked Data Loader (REST)
•  Annotation Satellites, Asset Satellites, Vocab Satellites, 3rd Party Data

Data Space Services
•  MongoDB NoSQL, SOLR/SIREn, Virtuoso Triplestore, Access & Entitlements

Discovery Services (Semantic Knowledgebase)
•  Discovery Svc API (REST), Ontology Service, SPARQL, Alerts, Atom Feed, Analytics, Admin & Monitoring

AWS Cloud Management
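The diagram names "JSON Transform" and "N-Quads Extract" stages but gives no detail, so the Python sketch below is only one plausible reading: a JSON tagging record is turned into N-Quads lines, with the fourth term naming the graph the statements belong to. The record layout, the `example.org` URIs, and both function names are assumptions.

```python
import json

DCT_SUBJECT = "http://purl.org/dc/terms/subject"

# Hypothetical tagging record as it might leave an indexing step.
record_json = json.dumps(
    {
        "doc": "http://example.org/doc/article-123",
        "graph": "http://example.org/graph/annotations",
        "concepts": [
            "http://example.org/vocab/delirium",
            "http://example.org/vocab/hypoxia",
        ],
    }
)


def to_nquads(raw):
    """Turn one JSON tagging record into N-Quads lines (one per concept)."""
    record = json.loads(raw)
    return [
        f"<{record['doc']}> <{DCT_SUBJECT}> <{concept}> <{record['graph']}> ."
        for concept in record["concepts"]
    ]


def graph_of(quad_line):
    """Return the graph IRI of a simple N-Quads line whose terms are all IRIs."""
    terms = quad_line.rstrip(" .").split()
    return terms[3].strip("<>")


quads = to_nquads(record_json)
```

Keeping the graph name on every line means annotation, asset, and vocabulary statements can be loaded into one store and still be told apart, which fits the separate satellite types named on the slide.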

Page 18: Elsevier Smart Content LDR SemTech 2012

Elsevier Smart Content In Action


Applications powered by Smart Content:
–  Semantic search for practitioners and medical researchers
–  Expose medical taxonomies in SKOS
–  Crossref collaboration of scholarly publishers and funding agencies
–  Lancet application mashups on specialty health topics
–  Sciverse applications
–  Clinical Decision Support Drug Research

Page 19: Elsevier Smart Content LDR SemTech 2012


LDR API Access To Article Metadata

Page 20: Elsevier Smart Content LDR SemTech 2012

Trend Analysis Of Special Health Topics


Page 21: Elsevier Smart Content LDR SemTech 2012


Comprehensive Drug Research

•  Moving world-class content online to Point of Care.
•  Extracted knowledge is linked for further enrichment.
•  Information is condensed, immediate and actionable.

Page 22: Elsevier Smart Content LDR SemTech 2012


•  Discover knowledge from research relevant to a patient profile
•  Alerts on FDA Announcements

Linking Patient Data To Evidence-Based Research

Page 23: Elsevier Smart Content LDR SemTech 2012


Article search on ScienceDirect results in related specialty content recommendations available from The Lancet Journal.

SciVerse Widgets Powered by Smart Content

Page 24: Elsevier Smart Content LDR SemTech 2012

•  Smart content allows publishers to create new products and services through structuring content for better discovery, insight and utility
–  The value is in the structure
–  Creating that structure is hard work
–  The kind of hard work that publishers have traditionally focused on
•  New consumer Internet businesses are using open source software and the cloud to add structure to content today… quickly and on the cheap
•  Publishers and societies both large and small can use the same techniques to follow suit

Smart content is a bridge to the future of publishing


Page 25: Elsevier Smart Content LDR SemTech 2012

Thank you. Alan Yagoda [email protected]
