anotation: gene of which little is known what follows is a simulation of an orf page in the proposed...
TRANSCRIPT
![Page 1: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/1.jpg)
Anotation: Gene of which little is known
What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist. As you go through the simulation please consider what capabilities you would want to serve your research and annotation interests.
A narrative to help you go through the simulation appears in a red-bordered box, such as the one below.
To begin:1. Click on Slide Show, (on the upper toolbar)2. Click View Show3. Click Continue button
Continue
Scenario 1
![Page 2: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/2.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
This is an example of an orf that has only automated annotation, except for annotation carried over automatically from its ortholog in
Synechocystis PCC 6803. The notation System indicates that the assertion has not been confirmed by a human annotator. Clicking on System brings you to a page that provides the basis for the assertion.
Continue
A
![Page 3: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/3.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
Clicking on Chromosome brings you to a zoomable map of the chromosome showing the position of all4312 (see
the more interesting case in Scenario 5).
A
Continue
![Page 4: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/4.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
Clicking on the coordinates 5166997... brings you to a view of the sequence of the region around all4312
(this is the subject of Scenario 6). Continue
A
![Page 5: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/5.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
Clicking on the length 256 amino acids brings you to the amino acid sequence. Both the DNA and amino
acid sequences can be downloaded in this way.Continue
A
![Page 6: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/6.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
There is no experimental data (or any other human annotation) available for all4312. However, there
is microarray data available for its ortholog sll1330. Click to show these experiments.
A
![Page 7: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/7.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to contract) Experiment Suzuki et al (2001) Cold shock response Experiment Hihara et al (2001) low light to high light Experiment Yoshimura et al (2001) cAMP receptor mutant Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
Clicking on the specific experiment gives details concerning protocol and results. Click to contract
the display back to its previous form.
A
![Page 8: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/8.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
A mutant exists for the ortholog in Synechocystis. Mouse over* to Experiment to see who made the
mutant and whether the work is published. *In real life, information will pop up when you mouse to an informative position. In this simulation you’ll also need to click.
A
![Page 9: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/9.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Presumed inviable (failed to segregate) Experiment
Cyanobacterial orthologs:
IMP (Inferred from Mutant Phenotype) is a standard Gene Ontology Consortium descriptor, useful for searching (see
Scenario 3 for more). Mouse away from the region. *In real life, information will go away when you mouse away from an informative position. In this simulation you’ll also need to click anywhere.
IMP: Masahiko Ikeuchiunpublished
A
![Page 10: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/10.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Presumed inviable (failed to segregate) Experiment
Cyanobacterial orthologs:
Clicking on Experiment would bring you to a page describing the experiment and its results. For now,
mouse over* to the Annotator icon to see who made this annotation.
*Remember that in this simulation you have to click to pop up information over a region.
A
![Page 11: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/11.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
Clicking on the icon would bring your designated e-mail screen with the annotator’s e-mail address ready to go.
For now, mouse away from the region.* *In real life, information will go away when you mouse away from an informative position. In this simulation you’ll also need to click anywhere..
Masahiko IkeuchiTokyo University
A
![Page 12: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/12.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
The bottom part of the screen shows the genetic neighborhood of all4312 and its cyanobacterial
orthologs. Orfs are color-coded by function. Mouse over* to the gene downstream from all4312 to see its
identity. Then mouse away.*
*Remember to click to view the popup and click to make it disappear.
AABC transporterATP-binding protein
![Page 13: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/13.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
Mouse over to the red motif bar to see the identity of the motif found in the indicated region of all4312. The
Pfam number and E-value of the match are given. Mouse away from the motif bar.*
*Remember to click to view the popup and click to make it disappear.
AResponse receiver domain
(PF00072): 1.4e-31
![Page 14: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/14.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsOptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
To predict operon structure, it is useful to visualize potential inverted repeats, which may function as
transcriptional termination signals. Operon structure isn’t an issue in this case, but let’s go for it anyway.
Click and then Show inverted repeats.
A
View: graphic
Show cb orthologs
Show hits: <1e-6
Show neighbors: +/- 1
Show motifs
Show alternate starts
Show annotated features
Show predicted features
Show transposon ends
Show tandem repeats
Show inverted repeats
Show dispersed repeats
Show private set
Show inverted repeats
Options
Options
![Page 15: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/15.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsOptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
There are evidently no features annotated yet for this orf, but there may be some that have been automatically predicted. Click and
then Show predicted features.
A
View: graphic
Show cb orthologs
Show hits: <1e-6
Show neighbors: +/- 1
Show motifs
Show alternate starts
Show annotated features
Show predicted features
Show transposon ends
Show tandem repeats
Show inverted repeats
Show dispersed repeats
Show private set
Show predicted features
Options
Options
![Page 16: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/16.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
Surprisingly, the same feature has been predicted for the entire set of orthologs. Mouse over to the feature upstream from
all4312 to identify it.
A
![Page 17: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/17.jpg)
Anab7120:all4312
NostPunc:618.077
TricEryt:5.6053
Syny6803:sll1330
TherElon:tlr1330
Anabaena PCC 7120: all4312OptionsAnnotateMain Menu History
Replicon: Chromosome
Coordinates: 5166997 (stop) <- 5167767 (start-GTG) System Length = 256 amino acids
Strand: Complementary
Function: Two-component response regulator System Syny6803:sll1330: Expression data (click to expand) Experiment
Mutant: None Syny6803:sll1330: Failed to segregate Experiment
Cyanobacterial orthologs:
Clicking on the green triangle would lead to a page that describes the motif and how it was identified.*
*Participant Jackie Collier and Chip Lawrence are developing a method to predict transcriptional binding sites. While the set does not yet exist, the triangles below represent sites that have actually been identified by a means similar to that described in Scenario 5.
A
End
Lawrence/Collier conserved motif set
![Page 18: Anotation: Gene of which little is known What follows is a simulation of an orf page in the proposed graphical interface. The interface does not yet exist](https://reader036.vdocuments.us/reader036/viewer/2022083008/56649ea15503460f94ba48ae/html5/thumbnails/18.jpg)
Anotation: Gene of which little is knownSummary
• The Annotation Page provides links to different views of the region surrounding the gene
• The page links to other pages that facilitate downloading sequences
• The page provides experimental data when available regarding the orf and its cyanobacterial orthologs
• DNA features can be visualized wthin the context of the genetic neighborhood of the orf
Scenario 1
A prototype of the Annotation Page exists in CYORF, but the full capabilities described here
require the resources we are asking of NSF.