1000 1200 1400 1600 1800 2000 s d i web interface s o ... · a plant integrative ‘omics’...

1
a plant integrative ‘omics’ database Gabi Primary Database http://www.gabipd.org/ Species represented in GabiPD Gene-centric view Barley and Solanaceae genetic maps Source for figures: http://www.gabi.de/ http://en.wikipedia.org/wiki/DNA_microarray http://en.wikipedia.org/wiki/Image:2D_gel_images_dual_channel_warped.PNG Diego Mauricio Riaño-Pachón, Axel Nagel, Robert Wagner, Rico Basekow, Jost Neigenfind, Elke Weber, Sabrina Kleessen, Birgit Kersten GabiPD team, Bioinformatics group, Max Planck Institute of Molecular Plant Physiology, Wissenschaftspark Golm, Am Mühlenberg 1, 14476 Potsdam - Golm, Germany Max Planck Institute of Molecular Plant Physiology [email protected] 6. 3. 4. GabiPD References APG. 2003. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG II. Bot J Lin Soc. 141:399-436 Ball C.A. and Brazma A. 2006. MGED standards: work in progress. Omics. 10:138-144. Bohs L. 2005. Major clades in Solanum based on ndhF sequence data. pp 27-49 in Keating RC, Hollowell VC, and Croat TB (eds.), A festschrift for William G. D'Arcy: the legacy of a taxonomist. Monographs in Systematic Botany from the Missouri Botanical Garden, Vol. 104. Missouri Botanical Garden Press, St. Louis, MO. Bonfield J.K. and Beal K.F., et al. 2002. Trev: a DNA trace editor and viewer. Bioinformatics. 18:194-195. Jenkins H., Hardy N., Beckmann M., et al. 2004. A proposed framework for the description of plant metabolomics experiments and their results. Nat Biotechnol. 22:1601-1606. Knapp S . 2002. Tobacco to tomatoes: a phylogenetic perspective on fruit diversity in the Solanaceae. J Exp Bot. 53:2001-22. Sol Genomics Network (http://www.sgn.cornell.edu/about/about_solanaceae.pl) Soltis PS and Soltis DE. 2004. The origin and diversification of angiosperms. Am J Bot. 91:1614. Stein N., Prasad M., Scholz U., et al. 2007. A 1,000-loci transcript map of the barley genome: new anchoring points for integrative grass genomics. Theor Appl Genet. 114:823-839. The Tree of Life (http://www.tolweb.org/angiosperms) Usadel, B., Nagel, A., Thimm, O., et al. 2005. Extension of the visualization tool MapMan to allow statistical analysis of arrays, display of corresponding genes, and comparison with known responses. Plant Physiol. 138:1195-1204. http://commons.wikimedia.org/wiki/Image:Gouache-arabidopsis-thaliana.jpg http://commons.wikimedia.org/wiki/Image:Arabidopsis_thaliana-flower.jpg http://drnelson.utmem.edu/module5/BSA/Spectrum/BSA60ng%20.jpg GabiPD application structure GabiPD 1000 1200 1400 1800 m/z 2000 1600 1250 1000 750 500 250 Genome, Transcriptome Metabolome Proteome 1. Data Perl API Java API GreenCards BioTools YAMB BioMoby Web Interface Web Services Templates API Generator Schema Data Insertion & Update Routines Database meta information and api generation data flow and data processing generate read read Populus trichocarpa Populus tremula Medicago truncatula Arabidopsis thaliana Brassica napus Vitis vinifera Solanum tuberosum Solanum lycopersicum Capsicum annuum Nicotiana tabacum Beta vulgaris Hordeum vulgare Oryza sativa Angiosperms Salicaceae Salicaceae Fabaceae Brassicaceae Brassicaceae Vitaceae Solanaceae Solanaceae Solanaceae Solanaceae Amaranthaceae Poaceae Core Eudicots Monocots Rosids Asterids Caryophyllales Rosids I Rosids II Poaceae * * A E Schematic overview of the GabiPD application structure. The ´API generator´ translates the ´Templates´ using the database meta-information (shown in blue), generating the database application interface (Java and Perl API). The application logic (shown in yellow), i.e., Web Interface, WebServices and data manipulation routines, interacts with the ´Database´ through the database application interface. ‘omics’ data 2. Phylogenetic tree depicting the evolutionary relationships among species represented in GabiPD. Species for which whole-genome sequences and annotations are available are shown in blue. *Species that will soon be integrated in GabiPD. Species not shown: Solanum bulbocastanum, S. demissum, S. phureja, S. spegazzinii. Example of a keyword search using GreenCards. (A) The user had performed a search for the keywords 'FLOWERING LOCUS T', which retrieves links to the GreenCards of Genes (genome annotation projects), Clones (ESTs) and Plants (mutant plant lines). (B) Display of the Gene GreenCard, corresponding to the Arabidopsis annotated gene AT1G65480.1. Here the users find high confidence matching EST sequences displayed in the 'Related with' section. Sequence features and the sequences themselves are displayed as well. The selected gene has a matching EST (Clone: MPMGp2011E01215), this Clone GreenCard is shown in (C) and links back to the Gene GreenCard and to the original EST trace file, displayed by JTrev (D). A protein spot in rapeseed (Brassica napus) has been identified by 2-DE/MS related to the retrieved gene, and a link directs the user to the 2DE-gel with the identified protein spot highlighted with blue cross-hair (E). The spot identified links to a description of the protein (F) that provides links to the original Gene GreenCard. Functional annotation Functional annotation External resources External resources Sequence features Sequence features B A C Visualisation of expression profile data in MapManWeb. (A) The Affymetrix® NASC Array experiment on programmed cell death in Arabidopsis is displayed (NASCArrays reference number: 30). MapManWeb allows the visualisation of expressed genes in different biological processes; here only probesets (i.e., genes) involved in transcription regulation are shown. (B) Details for a strongly down-regulated probeset, with links to the related Gene GreenCard in A. thaliana. (C) The Gene GreenCard for the selected gene (ANAC79) links back to the probeset of the Affymetrix® ATH1 array. Visualising expression data 5. Visualisation of the genetic maps published by Stein et al . 2007. (A) The region between 0cM and 25cM of barley chromosome I is shown. A selected marker is displayed in red, and links to the Marker GreenCard (B), which contains information on a related EST sequence therewith connecting genomic with transcriptomic information. With the EST description, cluster information is included (Contig30438.1) that links to the schematic representation of all ClusterContig members displayed onto the related consensus sequence (C). EST sequences that were selected from this ClusterContig as representatives for the new 27K barley unigene set are shown in turquoise. B C A C Related clones Related clones Identified in 2DE gels Identified in 2DE gels GabiPD allows a seamless integration of varied ‘omics’ data types obtained from plant systems. Gene expression and metabolic profiling data, are included in GabiPD and integrated in gene- context. E B D Link to provider Link to provider F F This work was supported by the German Ministry for Education and Research (BMBF) [GABI I: 0312272, GABI II: 0313112 and GABI-FUTURE: 0315046], the former German Resource Center for Genome Research (RZPD) GmbH and the Max Planck Society. Funding Genomanaly se im biologischen System Pflanze GabiPD NAR paper doi:10.1093/nar/gkn611 GabiPD NAR paper doi:10.1093/nar/gkn611

Upload: others

Post on 06-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1000 1200 1400 1600 1800 2000 s d i Web Interface s o ... · a plant integrative ‘omics’ database Gabi Primary Database Species represented in GabiPD Gene-centric view Barley

a plant integrative ‘omics’ database

Gabi Primary Database http://www.gabipd.org/

Species represented in GabiPD

Gene-centric view

Barley and Solanaceae genetic maps

Source for figures:http://www.gabi.de/http://en.wikipedia.org/wiki/DNA_microarrayhttp://en.wikipedia.org/wiki/Image:2D_gel_images_dual_channel_warped.PNG

Diego Mauricio Riaño-Pachón, Axel Nagel, Robert Wagner, Rico Basekow, Jost Neigenfind, Elke Weber, Sabrina Kleessen, Birgit KerstenGabiPD team, Bioinformatics group, Max Planck Institute of Molecular Plant Physiology, Wissenschaftspark Golm, Am Mühlenberg 1, 14476 Potsdam - Golm, Germany

Max Planck Instituteof Molecular Plant Physiology

[email protected]

6.

3.

4.

GabiPD

ReferencesAPG. 2003. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG II. Bot J Lin Soc. 141:399-436Ball C.A. and Brazma A. 2006. MGED standards: work in progress. Omics. 10:138-144.Bohs L. 2005. Major clades in Solanum based on ndhF sequence data. pp 27-49 in Keating RC, Hollowell VC, and Croat TB (eds.), A festschrift for William G. D'Arcy: the legacy of a taxonomist. Monographs in Systematic Botany from the Missouri Botanical Garden, Vol. 104. Missouri Botanical Garden Press, St. Louis, MO.Bonfield J.K. and Beal K.F., et al. 2002. Trev: a DNA trace editor and viewer. Bioinformatics. 18:194-195.Jenkins H., Hardy N., Beckmann M., et al. 2004. A proposed framework for the description of plant metabolomics experiments and their results. Nat Biotechnol. 22:1601-1606.Knapp S . 2002. Tobacco to tomatoes: a phylogenetic perspective on fruit diversity in the Solanaceae. J Exp Bot. 53:2001-22.Sol Genomics Network (http://www.sgn.cornell.edu/about/about_solanaceae.pl)Soltis PS and Soltis DE. 2004. The origin and diversification of angiosperms. Am J Bot. 91:1614.Stein N., Prasad M., Scholz U., et al. 2007. A 1,000-loci transcript map of the barley genome: new anchoring points for integrative grass genomics. Theor Appl Genet. 114:823-839.The Tree of Life (http://www.tolweb.org/angiosperms)Usadel, B., Nagel, A., Thimm, O., et al. 2005. Extension of the visualization tool MapMan to allow statistical analysis of arrays, display of corresponding genes, and comparison with known responses. Plant Physiol. 138:1195-1204.

http://commons.wikimedia.org/wiki/Image:Gouache-arabidopsis-thaliana.jpghttp://commons.wikimedia.org/wiki/Image:Arabidopsis_thaliana-flower.jpghttp://drnelson.utmem.edu/module5/BSA/Spectrum/BSA60ng%20.jpg

GabiPD application structure

GabiPD

1000 1200 1400 1800 m/z20001600

1250

1000

750

500

250

Genome,Transcriptome Metabolome Proteome

1.

DataPerl API

Java API

GreenCards

BioTools

YAMB

BioMoby

Web Interface

Web Services

Templates

API Generator

Schema

Data Insertion &Update Routines

Database

meta information and api generation

data flow and data processing

generate

read

read

Populus trichocarpa

Populus tremula

Medicago truncatula

Arabidopsis thaliana

Brassica napus

Vitis vinifera

Solanum tuberosum

Solanum lycopersicum

Capsicum annuum

Nicotiana tabacum

Beta vulgaris

Hordeum vulgare

Oryza sativa

An

gio

sp

erm

s

Salicaceae

Salicaceae

Fabaceae

Brassicaceae

Brassicaceae

Vitaceae

Solanaceae

Solanaceae

Solanaceae

Solanaceae

Amaranthaceae

Poaceae

Co

re E

ud

ico

tsM

on

oco

ts

Ro

sid

sA

ste

rid

sC

ary

op

hylla

les

Ro

sid

s I

Ro

sid

s II

Poaceae

*

*

A

E

Schematic overview of the GabiPD application structure. The ´API generator´ translates the ´Templates´ using the database meta-information (shown in blue), generating the database application interface (Java and Perl API). The application logic (shown in yellow), i.e., Web Interface, WebServices and data manipulation routines, interacts with the ´Database´ through the database application interface.

‘omics’ data2.

Phylogenetic tree depicting the evolutionary relationships among species represented in GabiPD. Species for which whole-genome sequences and annotations are available are shown in blue. *Species that will soon be integrated in GabiPD. Species not shown: Solanum bulbocastanum, S. demissum, S. phureja, S. spegazzinii.

Example of a keyword search using GreenCards. (A) The user had performed a search for the keywords 'FLOWERING LOCUS T', which retrieves links to the GreenCards of Genes (genome annotation projects), Clones (ESTs) and Plants (mutant plant lines). (B) Display of the Gene GreenCard, corresponding to the Arabidopsis annotated gene AT1G65480.1. Here the users find high confidence matching EST sequences displayed in the 'Related with' section. Sequence features and the sequences themselves are displayed as well. The selected gene has a matching EST (Clone: MPMGp2011E01215), this Clone GreenCard is shown in (C) and links back to the Gene GreenCard and to the original EST trace file, displayed by JTrev (D). A protein spot in rapeseed (Brassica napus) has been identified by 2-DE/MS related to the retrieved gene, and a link directs the user to the 2DE-gel with the identified protein spot highlighted with blue cross-hair (E). The spot identified links to a description of the protein (F) that provides links to the original Gene GreenCard.

FunctionalannotationFunctionalannotation

ExternalresourcesExternal

resources

Sequencefeatures

Sequencefeatures

B

A

C

Visualisation of expression profile data in MapManWeb. (A) The Affymetrix®NASC Array experiment on programmed cell death in Arabidopsis is displayed(NASCArrays reference number: 30). MapManWeb allows the visualisation of expressedgenes in different biological processes; here only probesets (i.e., genes) involved intranscription regulation are shown. (B) Details for a strongly down-regulated probeset,with links to the related Gene GreenCard in A. thaliana. (C) The Gene GreenCard for theselected gene (ANAC79) links back to the probeset of the Affymetrix® ATH1 array.

Visualising expression data5.

Visualisation of the genetic maps published by Stein et al. 2007. (A) Theregion between 0cM and 25cM of barley chromosome I is shown. A selected marker isdisplayed in red, and links to the Marker GreenCard (B), which contains information on arelated EST sequence therewith connecting genomic with transcriptomic information.With the EST description, cluster information is included (Contig30438.1) that links to theschematic representation of all ClusterContig members displayed onto the relatedconsensus sequence (C). EST sequences that were selected from this ClusterContig asrepresentatives for the new 27K barley unigene set are shown in turquoise.

B

CA

C

Relatedclones

Relatedclones

Identifiedin 2DE gelsIdentified

in 2DE gels

GabiPD allows a seamless integration of varied ‘omics’ data types obtained from plant systems.Gene expression and metabolic profiling data, are included in GabiPD and integrated in gene-context.

E

B

D

Link to providerLink to provider

FF

This work was supported by the German Ministry for Education and Research (BMBF) [GABI I: 0312272, GABI II: 0313112 and GABI-FUTURE: 0315046], the former German Resource Center for Genome Research (RZPD) GmbH and the Max Planck Society.

Funding

Genomanalyse

im biologischenSystem PflanzeGabiPD NAR paper doi:10.1093/nar/gkn611GabiPD NAR paper doi:10.1093/nar/gkn611