Mai Nguyen, BUS 550, Spring 2013, Dr. Minder Chen
DESCRIPTION
Conceptual data modeling for bioinformatics. Erich Bornberg-Bauer and Norman W. Paton. (2002). Briefings in Bioinformatics. Vol. 3. No. 2. 166-180.
Conceptual data modeling for bioinformatics.
Erich Bornberg-Bauer and Norman W. Paton. (2002). Briefings in Bioinformatics. Vol. 3. No. 2. 166-180.
Mai Nguyen
BUS 550, Spring 2013
Dr. Minder Chen
Bioinformatics
…the field of science in which biology, computer science, and information technology merge to form a single discipline.
Biology (Source)
Data
Information Processing and System
Human Genome Project
- identify all the approximately 20,000-25,000 genes in human DNA,
- determine the sequences of the 3 billion chemical base pairs that make up human DNA,
- store this information in databases,
- improve tools for data analysis,
- transfer related technologies to the private sector, and
- address the ethical, legal, and social issues (ELSI) that may arise from the project.
Data, Data, Data, Data, …
Databases
http://www.taverna.org.uk/introduction/taverna-in-use/bioinformatics/
Conceptual Data Modeling (CDM)
Conceptual data models make explicit the structural properties of data and, as such, are useful for capturing, refining, and communicating details about the data in a laboratory or database.
Entity 1
Entity 2
Attributes/properties
* A type of notation that describes entity types and relationships.
The design process
Requirements Analysis
Conceptual Design
Logical Design
Database Requirements
Conceptual Model
Implementation Model
* There is no universally accepted standard for either the design process or notation.
1. Entity-Relationship Modeling is a type of CDM
Entity Types: An entity type provides a description of the properties of entities in a domain.
Relationships: A relationship represents an association between two or more entity types.
Hemoglobin Myoglobin
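As a rough illustration of entity types and relationships, the sketch below models the two proteins named on this slide in Python. The `Protein`, `Ligand`, and `Binds` names, their attributes, and the choice of an oxygen-binding relationship are assumptions made for this example; they are not taken from the paper or the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Protein:
    """Entity type: the properties every protein entity has (hypothetical)."""
    name: str
    subunits: int  # number of polypeptide chains in the functional protein


@dataclass(frozen=True)
class Ligand:
    """Entity type for a molecule that a protein can bind (hypothetical)."""
    name: str


@dataclass(frozen=True)
class Binds:
    """Relationship: associates one Protein entity with one Ligand entity."""
    protein: Protein
    ligand: Ligand


# Entities are instances of the entity types.
hemoglobin = Protein("Hemoglobin", 4)
myoglobin = Protein("Myoglobin", 1)
oxygen = Ligand("Oxygen")

# Relationship instances link entities together.
bindings = [Binds(hemoglobin, oxygen), Binds(myoglobin, oxygen)]


def proteins_binding(ligand, relationships):
    """Return the names of all proteins related to the given ligand."""
    return [r.protein.name for r in relationships if r.ligand == ligand]


print(proteins_binding(oxygen, bindings))
```

Keeping the relationship as its own type, rather than as a field on `Protein`, mirrors how an ER diagram draws relationships as separate elements between entity types.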
ER modeling for HIV
ER models and fingerprints
ER models can be mapped onto relational tables.
String of attributes/properties that define and identify the relationship and are unique to the entity.
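One way to picture the mapping from an ER model onto relational tables is the minimal sketch below, which uses Python's built-in `sqlite3` module. The table and column names (`protein`, `ligand`, `binds`) are hypothetical and chosen only for illustration: each entity type becomes a table with a primary key, and a many-to-many relationship becomes a table of foreign keys.

```python
import sqlite3

# In-memory database, so the sketch leaves nothing behind on disk.
conn = sqlite3.connect(":memory:")

conn.executescript("""
    -- Each entity type becomes a table; the key identifies each row.
    CREATE TABLE protein (
        protein_id INTEGER PRIMARY KEY,
        name       TEXT NOT NULL UNIQUE
    );
    CREATE TABLE ligand (
        ligand_id INTEGER PRIMARY KEY,
        name      TEXT NOT NULL UNIQUE
    );
    -- The relationship becomes a table that references both entity tables.
    CREATE TABLE binds (
        protein_id INTEGER NOT NULL REFERENCES protein(protein_id),
        ligand_id  INTEGER NOT NULL REFERENCES ligand(ligand_id),
        PRIMARY KEY (protein_id, ligand_id)
    );
""")

conn.executemany(
    "INSERT INTO protein VALUES (?, ?)",
    [(1, "Hemoglobin"), (2, "Myoglobin")],
)
conn.execute("INSERT INTO ligand VALUES (1, 'Oxygen')")
conn.executemany("INSERT INTO binds VALUES (?, ?)", [(1, 1), (2, 1)])

# Joining through the relationship table recovers the associations.
rows = conn.execute("""
    SELECT p.name, l.name
    FROM binds AS b
    JOIN protein AS p ON p.protein_id = b.protein_id
    JOIN ligand  AS l ON l.ligand_id  = b.ligand_id
    ORDER BY p.name
""").fetchall()
print(rows)
```

Because `protein_id` is a primary key, inserting a second row with the same identifier is rejected, which is the table-level counterpart of a key in the ER model.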
2. Unified Modeling Language (UML): another CDM
The standard object modeling language.
Notation is the class diagram, which describes the attributes, operations, and relationships of a set of objects.
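The three parts of a class diagram (attributes, operations, relationships) can be sketched in Python as follows. The `DnaSequence` and `Gene` classes, their fields, and the `gc_content` operation are hypothetical names invented for this example, not classes from the paper.

```python
class DnaSequence:
    """A class box: attributes plus the operations that act on them."""

    def __init__(self, identifier: str, bases: str) -> None:
        # Attributes
        self.identifier = identifier
        self.bases = bases.upper()

    # Operation
    def gc_content(self) -> float:
        """Return the fraction of bases that are G or C (0.0 if empty)."""
        if not self.bases:
            return 0.0
        gc_count = sum(1 for base in self.bases if base in "GC")
        return gc_count / len(self.bases)


class Gene:
    """A second class, linked to DnaSequence by an association."""

    def __init__(self, symbol: str, sequence: DnaSequence) -> None:
        self.symbol = symbol
        # Relationship: each Gene refers to exactly one DnaSequence.
        self.sequence = sequence


gene = Gene("demo_gene", DnaSequence("seq1", "atgc"))
print(gene.symbol, gene.sequence.gc_content())
```

In a class diagram, `Gene` and `DnaSequence` would be two boxes joined by a line; here that line is the `sequence` reference held by each `Gene` object.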
Class Diagram
UML versus ER
ER
- ER was developed earlier for relational databases
- ER uses keys: one or more attributes of an entity that are used to uniquely identify it
- Uses relational tables
- Not truly platform independent
UML
- UML is object-oriented
- Does not support keys
- Not all relationships are given names or roles
- Independent of platform
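The difference between key-based and object-based identity can be illustrated with a small Python sketch. The `Sample` class and the accession value `ACC-001` are made up for this example; the sketch only illustrates the general idea and is not how either notation is defined.

```python
class Sample:
    """Object-style entity: identity comes from the object itself."""

    def __init__(self, accession: str, organism: str) -> None:
        self.accession = accession
        self.organism = organism


# Two objects with identical attribute values are still distinct objects.
first = Sample("ACC-001", "Homo sapiens")
second = Sample("ACC-001", "Homo sapiens")
same_object = first is second

# Key-style identity: the accession value alone identifies the entity,
# so storing the same key twice still yields a single entry.
samples_by_key = {}
samples_by_key["ACC-001"] = "Homo sapiens"
samples_by_key["ACC-001"] = "Homo sapiens"

print(same_object, len(samples_by_key))
```

With keys, equal attribute values mean the same entity; with objects, two instances can carry the same values and still be different things.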
Building Useful Networks
Applications
Personalized Medicine?
Question 1. True/False: Conceptual Data Models (CDMs) are only applicable to bioinformatics research.
Correct Answer: FALSE
Question 2. Which of the following is not an activity in the design process for conceptual modeling?
a. Requirements Analysis
b. Conceptual Design
c. Conceptual Model
d. A, B, and C are all examples of the design process.
e. All of the above and activities can vary as there is no universally accepted process or design.
Question 3. Which of the following is FALSE in regard to Conceptual Data Models (CDMs)?
a. CDMs allow precise statements to be made about the data.
b. CDMs are easier to read than to construct.
c. CDMs easily describe all data without including invalid data.
d. Entity Relationship and Unified Modeling Languages are types of CDMs.
Thank you for your attention!
References and Images (In order of appearance)
- http://campus.murraystate.edu/academic/faculty/eweber/bio101/images/cell_to_dna_sm.jpg
- http://bioinfo.imdik.pan.pl/mediawiki/images/e/e1/Sek3.jpg
- http://asgpb.mhpcc.hawaii.edu/sequence/454sequencer.jpg
- http://blog.chron.com/bakerblog/files/legacy/dna.jpg
- http://www.nature.com/nature/journal/v464/n7289/images/464680a-i1.0.jpg
- http://www.ornl.gov/sci/techresources/Human_Genome/home.shtml
- http://25.media.tumblr.com/tumblr_loa5m3vYuO1qbh26io1_1280.jpg
- http://www.sanger.ac.uk/about/history/hgp/
- http://en.wikipedia.org/wiki/File:Myoglobin.png
- http://en.wikipedia.org/wiki/File:1GZX_Haemoglobin.png
- http://www.databaseanswers.org/data_models/hotels/index.htm
- http://www.schillerlab.net/guides/hiv/image009.png
- http://www.nature.com/scitable/content/6437/10[1].1073_pnas0701361104-fig2b_mid_1.jpg