network building in cytoscape using the biogateway app · sparql query example. biogateway app...
TRANSCRIPT
![Page 1: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/1.jpg)
Network building in Cytoscape using the BioGateway App
benefits of hiding the SPARQL from your eyes
Martin Kuiper, Stian Holmås
Semantic Systems Biology Group
Department of Biology
Norwegian University of Science and Technology
Trondheim, Norway
www.semantic-systems-biology.org
www.biogateway.eu
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 2: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/2.jpg)
Mission of my group
• Develop tools and resources for ‘biologists’
• Provide knowledge in ‘one-stop shop’ for domain biologists
• Participate in systems biology projects
• Demonstrate the use of semantic systems biology
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 3: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/3.jpg)
Some products
Provide knowledge in a ‘one-stop shop’ for domain biologists:
✓ Cell cycle control: Cell Cycle Ontology CCO
✓ Generic RDF triple store: BioGateway
✓ Gene regulation: Gene eXpression Knowledge Base GeXKB
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 4: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/4.jpg)
Some products
Provide knowledge in a ‘one-stop shop’ for domain biologists:
✓ Cell cycle control: Cell Cycle Ontology CCO
✓ Generic RDF triple store: BioGateway
✓ Gene regulation: Gene eXpression Knowledge Base GeXKB
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 5: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/5.jpg)
BioGateway
• Uses Virtuoso Open Server
– Open Source software that can host a triple store
– Can build this from RDF files
– Has a DB backend
• Supports SPARQL* language which allows querying RDF data (graphs)
• Its syntax is similar to that of SQL
*http://www.w3.org/TR/rdf-sparql-query/
http://www.openlinksw.com/virtuoso/
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 6: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/6.jpg)
Semantic Systems Biology
![Page 7: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/7.jpg)
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 8: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/8.jpg)
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 9: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/9.jpg)
SPARQL query example
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 10: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/10.jpg)
the average user experience
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 11: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/11.jpg)
• Plugin for the popular network visualization tool Cytoscape
• Client – server communication with BioGateway
• Translates line-by-line questions into SPARQL
• Uses controlled vocabularies and identifiers for queries
• Supports this by an ‘autocomplete’ function
• Renders the result in the form of a network
• Allows further interactive network exploration and extension
• Full provenance and metadata
The BioGateway App
- operates within Cytoscape -
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 12: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/12.jpg)
Cytoscape has apps for access to RDF
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 13: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/13.jpg)
BioGateway Design- data sources -
Building blocks:
• protein-centric, hence built around the graph 'prot' (from Uniprot)
• ontologies (GO, SIO)
• OMIM
• NCBI Taxonomy
• 'bridge' information:
– 'prot2prot' (IntAct)
– 'prot2onto' (GOA)
– 'prot2phen' (currently only diseases, UniProt)
– 'tfac2gene' (TFactS, TRRUST, HTRI, Signor, IntAct, GOA, ExTRI)
• ExTRI: TF-TG information extracted from PubMed abstracts using a text
mining pipeline - collaboration NTNU/CNIO/BSC - 30.000 quality sentences
→ in our demo: only human data
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 14: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/14.jpg)
BioGateway Design- URIs -
Identification schemes:
• external URIs are re-used as much as possible
• BGW-specific URIs are minted only for:
– protein classes
– classes and instances in the 'bridge' graphs
• all other classes (ontology terms, taxa etc.) and instances (e.g. genes,
proteins) are identified by external URIs
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 15: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/15.jpg)
BioGateway design- data model -
Data model:
• Directed graph (not necessarily acyclic) - natural choice for networks
• 3 types of nodes:
– classes: entities in the domain of discourse, e.g. proteins, diseases, etc (URIs)
– instances: particular interpretations/views of entities, serves to introduce
metadata into the graphs
– attributes: qualities, quantities, etc (literals)
• Multiple types of edges (properties), semantically defined externally (URIs)
• Basic unit of information: subject - predicate - object, a.k.a. triple
• Triples are organized as graphs, 2 types of graphs in BGW:
– defining entities
– defining relations among entities, e.g. protein-protein interaction, TF-TG
interactions, etc.
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 16: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/16.jpg)
BioGateway Design- data model -
Examples BGW 3 (beta):• Entity class
– http://rdf.biogateway.eu/prot/Q9Y6T7-2
• Entity instances– http://identifiers.org/ensembl/ENSP00000386066
– http://identifiers.org/refseq/NP_663733.1
• Bridge class– http://rdf.biogateway.eu/prot-obo/Q9Y6T7-2--GO_0004143
• Bridge instances– http://rdf.biogateway.eu/prot-obo/Q9Y6T7-2--GO_0004143#goa
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 17: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/17.jpg)
BioGateway Design- new data model -
Examples BGW 3.0 (coming soon):• Entity class:- http://rdf.biogateway.eu/prot/9606/UPI0000033B9C
• Entity instances:- http://purl.obolibrary.org/obo/PR_Q9Y6T7-2
- http://identifiers.org/ensembl/ENSP00000386066
- http://identifiers.org/refseq/NP_663733.1
• Bridge class:- http://rdf.biogateway.eu/prot-obo/9606/UPI0000033B9C--GO_0004143
• Bridge instance:- http://rdf.biogateway.eu/prot-obo/9606/UPI0000033B9C--
GO_0004143#goa
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 18: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/18.jpg)
Future developments
Publication plan:
Short-term (next months)
• publish BioGateway 3.0 and the App:
– human protein/gene network building
Medium term (before summer 2019):
• publish BioGateway 3.1
– ENSEMBL proteomes
Longer term (before end of 2019?)
• publish BioGateway 3.2:
– content from signaling databases (e.g. Signor)
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 19: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/19.jpg)
Co-evolution of BioGateway and the App
• Translating App queries to SPARQL posed some constraints
on the BioGateway data model
• Proteins and genes are not really the same and need
deconvolution
• Regulatory networks need to be grounded on transcription
factor - target gene interactions, in the broadest sense:
include text mining results
• Such data in turn implicated further wishes for the App:
– Network inspection triggers hub and cluster-based user focus
on conspicuous nodes and relationships
• node expansion
• link provenance
• provenance checking: curation
• Mechanism to stimulate network-based community curation
BioGateway
backend
Cytoscape
App
user
feedback
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 20: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/20.jpg)
Co-evolution of BioGateway and the App
• Translating App queries to SPARQL posed some constraints
on the BioGateway data model
• Proteins and genes are not really the same and need
deconvolution
• Regulatory networks need to be grounded on transcription
factor - target gene interactions, in the broadest sense:
include text mining results
• Such data in turn implicated further wishes for the App:
– Network inspection triggers hub and cluster-based user focus
on conspicuous nodes and relationships
• node expansion
• link provenance
• provenance checking: curation
• Mechanism to stimulate network-based community curation
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 22: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/22.jpg)
The BioGateway Infotriples of the type:
Protein X
Protein Set
Gene Y
Gene Set
Protein Y
Protein Set
Protein Y
Protein Set
molecularly interacts with
encodes
Protein X
Protein SetTerm Yannotated with GO term
Protein X
Protein SetGene Y
Gene Settranscriptionally regulates
Protein X
Protein SetDisease Y
Disease Setinvolved in
IntAct
GOA
TRRUST
TFactS
Signor
ExTRI
UniProt
OMIM
UniProt
Ensembl
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 23: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/23.jpg)
The BioGateway App
- specifying a wish -
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 24: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/24.jpg)
• Give me all the proteins annotated with GO term X
• Give me all the proteins annotated with GO term Y
• Select from this the subset that shows a physical interaction
• Identify the genes that encode this protein subset
• Identify the transcription factors that regulate these genes
• Show only those transcription factors that are themselves
annotated with GO term Y
The BioGateway App
- a list of wishes -
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 25: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/25.jpg)
The BioGateway App
- a list of wishes -
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 26: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/26.jpg)
The BioGateway App- the SPARQL way -
#QUERY <http://www.semantic-systems-biology.org/biogateway/endpoint>BASE <http://rdf.biogateway.eu/graph/>SELECT DISTINCT ?A ?B ?C ?D ?E WHERE {GRAPH <prot2onto> {
?A <http://purl.obolibrary.org/obo/RO_0002331> <http://purl.obolibrary.org/obo/GO_0016567> .?B <http://purl.obolibrary.org/obo/RO_0002331> <http://purl.obolibrary.org/obo/GO_0000082> .
}GRAPH <prot2prot> {
?A <http://purl.obolibrary.org/obo/RO_0002436> ?B .}GRAPH <gene> {
?C <http://semanticscience.org/resource/SIO_010078> ?A .}GRAPH <gene> {
?D <http://semanticscience.org/resource/SIO_010078> ?B .}GRAPH <tfac2gene> {
?E <http://purl.obolibrary.org/obo/RO_0002428> ?C .?E <http://purl.obolibrary.org/obo/RO_0002428> ?D .
}GRAPH <prot2onto> {
?E <http://purl.obolibrary.org/obo/RO_0002331> <http://purl.obolibrary.org/obo/GO_0000082> .}}FILTER(?A!=?C)FILTER(?A!=?B)FILTER(?A!=?E)FILTER(?A!=?D)FILTER(?C!=?B)FILTER(?C!=?E)FILTER(?C!=?D)FILTER(?B!=?E)FILTER(?B!=?D)FILTER(?E!=?D)
}
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 27: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/27.jpg)
The BioGateway App- the result as Cytoscape network -
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 28: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/28.jpg)
The BioGateway App
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 29: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/29.jpg)
The BioGateway App
one transcription factor that
rules them all – G1/S
Set of genes coding for proteins related to G1/S and ubiquitination
Set of proteins related to G1/S and ubiquitination
GO term G1/S
GO term
ubiquitination
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 30: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/30.jpg)
The BioGateway App
confidence level 1,0
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 31: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/31.jpg)
The BioGateway App
confidence level 1
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 32: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/32.jpg)
The BioGateway App
confidence level 1
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 33: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/33.jpg)
The BioGateway App
confidence level 0,5
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 34: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/34.jpg)
![Page 35: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/35.jpg)
![Page 36: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/36.jpg)
![Page 37: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/37.jpg)
How do we see this being used?- address various questions -
?
protein
or
gene
result
setnetwork
what does
it connect
to?
how are
they
connected?
Can I link my
network with
gene
regulation?
Did I miss
anything?
Can I enrich
with signaling
causality?
(2019)
BioGateway App Presentation EBI-Hinxton Nov 2018
![Page 38: Network building in Cytoscape using the BioGateway App · SPARQL query example. BioGateway App Presentation EBI-Hinxton Nov 2018. the average user experience. BioGateway App Presentation](https://reader033.vdocuments.us/reader033/viewer/2022051902/5ff212d8f5312605bc74a7e2/html5/thumbnails/38.jpg)
How do we see this being used in workflows?
- possible uses -
protein
or genequery
list
network
(bulk)
query
queryimport
network
seed
network
expanded
network
other
Cytoscape
work
copy
type
network export
or merge
BioGateway App Presentation EBI-Hinxton Nov 2018