![Page 1: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/1.jpg)
The Gene Ontology: a real-life ontology, progress and future.
Jane Lomax EMBL-EBI
![Page 2: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/2.jpg)
What is the Gene Ontology?
Controlled vocabulary - GO– Terms and relationships– Bottom-up approach
Annotation of proteins to terms– Gene association files
Software/database development Freely available
![Page 3: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/3.jpg)
The vocabulary
GO is divided into three sub-vocabularies:– biological process
• broad series of events, can either be at the level of the cell or organism e.g. circulation, glycolysis
– molecular function• direct activities e.g. catalysis, binding
– cellular component• site of action e.g. nucleus, ribosome
![Page 4: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/4.jpg)
The vocabulary
Hierarchical
Directed Acyclic Graph– terms have one or more parents
is-a and part-of relations
![Page 5: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/5.jpg)
http://www.godatabase.org/cgi-bin/go.cgi
![Page 6: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/6.jpg)
How is GO maintained?
Several full-time editors Requests from community
– database curators, researchers, software developers
– SourceForge tracker GO Consortium meetings for large
changes Mailing lists
![Page 7: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/7.jpg)
OBO - Open Biological Ontologies
GO is a member vocabulary of OBO A repository for biological structured
vocabularies– Freely available without license– Common syntax– Orthogonal to existing ontologies
http://obo.sourceforge.net/
![Page 8: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/8.jpg)
Future developments
File format– Current GO flat file format
• partly redundant• difficult to parse
– New format• Extensible e.g. new relationship types can be specified• minimal redundancy, but human readable• easier to parse
Moving to a database being the primary form of GO
![Page 9: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/9.jpg)
Formalizing GO
Informality is a common criticism of GO– developed by biologists, for biologists
Now beginning work ‘decomposing’ GO using ProLog– Terms broken down into constituent parts e.g.
regulation of heart development– New terms could be created from orthogonal
ontologies e.g. anotomical Work translating GO in DL, reasoning across
the ontologies
![Page 10: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/10.jpg)
GO into UMLS
GO now released as part of the NLM’s Unified Medical Language System Metathesaurus
Links biomedical vocabularies including MeSH and SNOMED.
The process of including GO in UMLS highlighted problems in both systems
![Page 11: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/11.jpg)
GO synonyms
Text strings associated with GO terms Often do not have identical meaning to
term Reduces utility in e.g. semantic
matching Developed relationships between terms
and synonyms– soon to be fully implemented in GO
![Page 12: The Gene Ontology : a real-life ontology, progress and future](https://reader036.vdocuments.us/reader036/viewer/2022082505/56812c67550346895d90fda6/html5/thumbnails/12.jpg)
www.geneontology.org•FlyBase & Berkeley Drosophila Genome Project
•Saccharomyces Genome Database
• PomBase (Sanger Institute)
• Rat Genome Database
• Genome Knowledge Base (CSHL)
• The Institute for Genomic Research
• Compugen, Inc
•The Arabidopsis Information Resource
•WormBase
•DictyBase
•Mouse Genome Informatics
•Swiss-Prot/TrEMBL/InterPro
•Pathogen Sequencing Unit (Sanger Institute)
The Gene Ontology Consortium is supported by an R01 grant from the National Human Genome Research Institute (NHGRI) [grant HG02273]. SGD is supported by a P41, National Resources, grant from the NHGRI [grant HG01315]; MGD by a P41 from the NHGRI [grant HG00330]; GXD by the National Institute of Child Health and Human Development [grant HD33745]; FlyBase by a P41 from the NHGRI [grant HG00739] and by the Medical Research Council, London. TAIR is supported by the National Science Foundation [grant DBI-9978564]. WormBase is supported by a P41, National Resources, grant from the NHGRI [grant HG02223]; RGD is supported by an R01 grant from the NHLBI [grant HL64541]; DictyBase is supported by an R01 grant from the NIGMS [grant GM064426].