hines lab report
TRANSCRIPT
-
8/12/2019 Hines Lab Report
1/9
Insect Pigment Patterning: A Genome-Wide Study of Bombusmelanopygus
Using tanGene
Andrew Walrond
Abstract
Insect pigmentation is a phenomenon that has garnered the attention of the general public and scientists
alike. From the intricate patterning of butterflies to the amazing color pattern variation in bumblebees, it
is a very interesting topic. The mode by which various patterns and pigmentation occurs differs among
organisms, but is similar amongst species.Drosophila melanogasteris a specie which has been studied
extensively to determine the mechanism of melanization, tan is a gene that has been indicated to play a
role in the pathway. In our study tan was selected to determine if it is the gene that affects patterning in
Bombus melanopygus. Using genetic sequencing and genome association to map genomic regions
contributing to the variation in pigment patterning, we attempt to determine if this locus is indeed
involved in the selected specie. Only two locations show single nucleotide polymorphisms (SNPs) that
sort according to phenotype, all other locations indicate sorting by location. This is a great method to
determine if a gene is associated with pigmentation patterning and will prove to be very useful if appliedacross the genome of many individuals, and different species.
Introduction
When studying evolutionary and developmental biology, insect pigmentation is a diverse,
yet in many respects, common trait that can lead to clues in answering various questions. One of
the most basic ways to study the development and subsequent evolution of species is by looking
at genetic changes. Genes interact at the molecular level in order to produce great variation at the
phenotypic level. Studying the mechanism and which genes are involved in these phenotypes
will provide insight into how different characteristics are not only inherited but how they have
changed over time.
Pigmentation research spans all disciplines of biology: from ecology, to physiology, even
to genetics. All of these have contributed to a greater understanding of why these phenotypic
traits are inherited, when they were inherited, and how they are inherited. More recently,
however, studies regarding pigmentation have started to center around the evolutionary and
developmental fields of biology. In insects, pigments are produced by epidermal cells through a
developmental process that includes pigment patterning and synthesis1. Extensive studies have
gone into drosophila pigmentation and butterfly wing patterns. In a study completed by True, J.
et al. theDrosophila melanogasterlocus, tan, was found to encode a novel enzyme required for
two cellular functions2. One of the cellular functions involves cuticular melanization. Tan is a
gene that has been hypothesized to have an effect on the melanization pathway, including theselective loss of expression of certain pigmentation. In a study composed by Jeong, S. it was
demonstrated that tan gene expression was eliminated through the mutation inactivation of a
specific cis-regulatory element and subsequent loss of pigmentation3. Studying the
developmental process by which pigmentation occurs will address evolutionary questions as well
as provide a basis for future examination of various genes hypothesized to affect coloration.
-
8/12/2019 Hines Lab Report
2/9
In our study we (1) explored the mechanisms of pigmentation in insects, (2) decided upon
a gene thought to be involved in the melanization pathway, (3) performed various genetic
analyses on the gene selected, (4) discuss what obtained data conveys about the selected gene.
The gene that was selected was tan, as previously discussed tan has been determined to have a
part in the melanin synthesis of D. melanogasterby reversing the synthesis of Dopamine to
NBAD1the production of brown melanin is achieved. With this information we hypothesize thattan is the gene responsible for the coloration pattern in the four samples covered in this study.
Materials and Methods
Most mutations that are believed to cause morphological variation are found in the cis-
regulatory region of developmental genes4. Exploring the DNA sequence of B. melanopygus,
specifically, the gene tan should provide evidence as to how the patterns of pigmentation arise.
Twelve individuals from the boarder of Oregon and California were selected and their DNA was
extracted.
To begin our study of the tan gene the sequence had to be retrieved. To retrieve the
sequence we usedwww.flybase.org,upon reaching the website a BLAST was performed in
which we searched for tan. Upon finding the gene that we were looking for we downloaded the
cds in FASTA format. That data was then entered into awww.beebase.orgBLAST. From there
we retrieved the scaffold in which the gene was located which was scf_0019. The scaffold was a
strong hit with the expected value of 2e-46, the next hit was .37 which allowed us to draw the
conclusion that this was our gene. The data was then downloaded and each of the exons from the
hit was labeled in the following fashion: tan_exon1..2..3.._Bimp. The genomic region was then
downloaded by clicking on the gene track in gbrowse. The genomic region was also retrieved for
Bombus terrestrisscaffold. In order to double check that we had the proper gene we took one of
the exons and placed it into a tbastx on flybase, this was repeated once more with a different
exon just to be completely certain. Certain we had the right gene the exons that we previouslyobtained were then BLAST in Genbank under tblastn which generated an annotated bumblebee
gene. The FASTA sequence was saved for both impatiens and terrestris. To make sure we had
the correct CDS we did another BLAST on flybase. The FASTA sequences that were obtained
were then added to text wrangler to ensure they were saved in simple text format. The files were
then imported into geneious a sequencing program and saved separately.
http://www.flybase.org/http://www.flybase.org/http://www.flybase.org/http://www.beebase.org/http://www.beebase.org/http://www.beebase.org/http://www.beebase.org/http://www.flybase.org/ -
8/12/2019 Hines Lab Report
3/9
This figure depicts the alignment of the exons to the genome.
The terrestris genome, exons, and impatiens genome were all aligned using the map toreference command. This alignment was then used to generate primers for the qPCR performed
later in the experiment.
The next step was to design primers, general guidelines were used in which: the length is
18-22 bp (20 was used for the primers we designed), the melting temperature is in the range of
52-58 C, and the annealing temperature is in the range of 52-60 C. All other conditions were
determined using the websitewww.primer3plus.comwhich provides a score of different
categories such as self-binding where 0 is the best and 5 or higher is the worst, GC which
gives a percentage of the G and C nucleotide content, and various other options which will aid in
one choosing the proper primer.
Once all of the information was gathered we proceeded with the PCR reaction. Qiagen
products were used in this PCR reaction. A stock solution was prepared which contained 30l of
Taq Master Mix, 1.2 l each of the two primers that were designed, and 21.6 l of water (Qiagen
supplied). To that stock solution 1.5l of bombus melanopygusDNA was added. The DNA that
was supplied was previously extracted by members of the lab. All contents when not being used
were kept in a freezer to ensure minimal degrading and reactions, also while working with any
Primer 1 R
Primer 2 FPrimer 2 F
Exons
Primer 1 F
http://www.primer3plus.com/http://www.primer3plus.com/http://www.primer3plus.com/http://www.primer3plus.com/ -
8/12/2019 Hines Lab Report
4/9
reagent, DNA, etc. they were all kept on ice. When everything was added to the solution the
contents were finger vortexed to ensure proper mixing, then centrifuged to allow the contents on
the walls to enter the mix. A 3-step PCR was used for the reaction with the parameters: 1) 95
for 2min, 2) 95 for 30sec, 3) 52 for 30sec, 4) 72 for 30sec, 5) 72 for 5min, 6) 4 as the
holding temperature.
To visualize the DNA gel electrophoresis was used a 1% gel which contained 1g agarose
and 50ml TAE buffer, this was heated for ~1:00 in the microwave or until a clear product was
obtained to which 2.5 l of DNA visualization liquid was added (gel-star). The liquid was then
transported into a gel mold until it solidified. From our newly obtained PCR sample we took 2 l
and added it to 2 l of loading dye. These samples as well as a DNA ladder were added to the
wells of the agarose gel. The DNA migration was analyzed under UV light by taking a
photograph using the UV pro program.
The next step was to purify the PCR sample; to do this an exosap reagent was used in a
60 l exosap: 30 l H2O dilution. From that dilution 4.3 l was added to 13 l of the PCR
sample. Once that was completed it was cycled at 37 C for 30min, 80 C for 15min, and 10 Cas the holding temperature. Once that product was finished cycling it was sent to be sequenced.
Results
UV pro was used to visualize the gel. The first set of data collected was from an initial
trial with DNA samples from BmelR_002, BmelR_005, BmelB_001, and BmelB_004. The
following gel was obtained:
-
8/12/2019 Hines Lab Report
5/9
Depicted above is the gel visualization for the first set of primers run for tan.
This sample was run under the conditions outlined in the materials and methods section.
From this visualization we can see that the DNA has migrated, but there is a faint set of bands at
the bottom which indicates primer dimer. A second trial was then run on the same sample, but
some of the conditions were changed. The annealing temperature was changed to 51 C and the
DNA concentration was increased from 1.5l to 2 l. This produced a visualization which was
clearer and had less primer dimer in the product. The next samples that were run were the EF1-
gene samples which would serve as our baseline, originally the sample was run at an annealing
temperature of 58 C, but that proved to be too high and there were no bands in the gel when
visualized. To rectify that the annealing temperature was lowered to 52 C, which then produced
visible bands. This method of visualizing the PCR before sequencing was repeated for every
batch of DNA that was going to be sequenced. Once the samples were sent out the results were
then imported into geneious and edited. Consensus sequences were obtained for all samples afterediting was completed. Once all consensus sequences had been obtained they were then aligned
to the bombus impatiens genome for individuals R_002, R_005, B_001, and B_004. An analysis
for where the SNPs were located was then performed:
-
8/12/2019 Hines Lab Report
6/9
Locati
on
B1
Genomic
R2
Genomic
B4
Genomic
R5
Genomic
Exon
1
Exon
2
Exon
3
Exon
4
Exon
5
Exon
6
Exon
7
29228 T T T T C - - - - -
30421 A A A A - G - - - - -
31057 C C C C - - - T - - -
31156 T T T T - - - C - - -
31231 C C C C - - - T - - -
31518 C C C C - - - - T - -
31754 G G G G - - - - - A -
32049 T T T T - - - - - - C
This table depicts the SNPs found in the exons of the tan gene.
Location
B1
Genomic
R2
Genomic
R5
Genomic
B4
Genomic
Tan 1B1
Tan 1R2
Tan 1R5
Tan 2B1
Tan 2R2
Tan 2R5
723 G G C C - - - - - -
724 T T G G - - - - - -
742 C C A A - - - - - -
1030 T T G G - - - - - -
11598 G A G A - - - - - -
11906 T T C C - - - - - -
14440 G A G A - - - - - -
18292 A G A G - - - - - -
22856 T G T G - - - - - -
24070 T A T A - - - - - -
28950 A A C C - - - - - -
30472 A A A A A A C C - -
30479 T T T T T T G G - -
30474 G G G G G G A A - -
30475 T T T T T T A A - -
30476 T T T T T T T A - -
30477 A A A A A A A T - -
30478 A A A A A A A G - -
30489 G G G G G G G C - -
30480 G G G G G G G A - -
30481 A A A A A A A T - -
30485 T T T T T T T A - -
30486 C C C C C C C T - -
30488 A A A A A A A T - -
30489 G G G G G G G A - -
-
8/12/2019 Hines Lab Report
7/9
30490 G G G G G G G A - -
30491 A A A A A A A G - -
30492 A A A A A A A C - -
30493 C C C C C C C G - -
30494 G G G G G G G T - -
30495 G G G G G G G A - -
30498 G G G G G G G T - -
30497 A A A A A A A T - -
30500 C C C C C C C G - -
30501 G G G G G G G C C -
30502 G G G G G G G T T -
30504 G G G G G G G A G -
30508 A A A A A A A T A -
30511 G G G G G G G C G -
30512 T T T T T T T C T -
30445 A A A A A A A T A -
30540 T T T T T T T T T C
30970 T T C C T T C T T C
30985 G G G G G G G C G C
31011 C C C C C C C C C C
31045 T T T T T T T A T T
31056 G G G G G G G G G T
31075 G G G G G G A A G A
33169 A A G G - - - - - -
34708G G A A - - - - - -
44135 C T T C - - - - - -
46451 A C C A - - - - - -
48387 A T A T - - - - - -
48738 A C A C - - - - - -
49736 C T T C - - - - - -
52272 C C T T - - - - - -
54150 T T C C - - - - - -
This table depicts the SNPs across the region of the genome that we selected; this table was modified just
to view the haplotypes that occur across the genome.
The exons are fairly conserved, we only observe eight SNPs across the seven exonsobtained. We see a stretch of haplotypes that sort B1, R2 and B4, R5 from bp 723-742, but then
this sequence is broken up to B1, R5 and R2, B4. There are only two instances in which we get
soring by phenotype (B1, B4 and R2, R5) and that occurs at bp 44135 and 49736. In the primer
sequences that were obtained there seems to be a lot of variation in B1 of the second set of tan
primers, as well as R2 of the second set of tan primers. Overall the association of SNPs seems to
-
8/12/2019 Hines Lab Report
8/9
be by location, with only two instances of phenotypic sorting. At bp 30970 the area surrounding
that location is fairly conserved, but at that bp we found a SNP which sorted R5, B4.
Discussion
The main goal of this research was to determine if the gene tan which has been implied to
have an effect on the melanization pathway is involved with bumblebee patterns. Throughgenetic sequencing we observed whether there is a pattern with the base sorting between the four
individuals R2, R5, B1, and B4. SNPs should occur mostly in non coding regions or regions
where selection is happening, if a SNP is observed repeatedly over a portion of the genome then
there is hypothesized to be an association. In our study there was a recurring pattern in the
sorting of B1, R5 and R2, B4. Unfortunately this is not a sorting by phenotype which is what we
would like to see; instead this is a sorting by location. Having a sorting by location is not very
informative because the variation in habitats is most likely the overlying reason as to why they
sort in that manner. Unfortunately for the designed primers both sequences for B4 did not
produce any data and could not be used in the analysis. Degraded DNA, poor sequencing, or
issues with amplification could have produced this issue. The primer sequence variation that wasobserved could be due to many factors including: improper design of primers, bad sequencing, or
actual variation between the samples. The primer sequences that were obtained did match the
genomic sequence, so there wasnt an issue with the primer design. Since there was no issue with
the primer the main source of error was most likely in the sequencing, when editing the
sequences clear peaks were not observed and the percentage was very low. The culmination of
all this data implies that tan is not the gene that we are looking for to be involved in color
patterning. Of course multiple trials and expansion of the gene to more samples could prove
otherwise, but for the purposes of this experiment there does not seem to be an association
between tan and color pattern. There are many genes which have been identified in the
melanization pathway such as bric a brac, ddc, pale, and countless others. Future researcherscould explore these options as the target gene in the melanin pathway.
-
8/12/2019 Hines Lab Report
9/9
References
1. Wittkopp, P., Beldade, P., 2008 Development and evolution of insect pigmentation: Geneticmechanisms and the potential consequences of pleiotrophy. Elsevier Ltd. Doi:
10.1016/j.semcdb.2008.10.002
2. True, JR, Yeh SD, Hovermann BT, Kemme T, Meinerzhagen IA, et al. (2005) Drosophila tanencodes a novel hydrolase required in pigmentation and vision. PLoS Genet 1(5): e63.
3. Jeong S, Rebeiz M, Andolfatto P, Werner T, True J, Carroll SB. The evolution of gene regulationunderlies a morphological difference between two Drosophila sister species. Cell. 2008 Mar
7;132(5):783-93. doi: 10.1016/j.cell.2008.01.014. PubMed PMID: 18329365.
4. Wittkopp PJ, Carroll SB, Kopp A. Evolution in black and white: genetic control of pigmentpatterns in Drosophila. Trends Genet. 2003 Sep;19(9):495-504. PubMed PMID: 12957543.
5. Bastide, H., A. Betancourt, V. Nolte, R. Tobler, P. Stobe et al., 2013 A genome-wide, fine-scalemap of natural pigmentation variation in Drosophila melanogaster. PLoS Genet. 9: e1003534.
6. Stern DL, Orgogozo V. The loci of evolution: how predictable is genetic evolution? Evolution.2008 Sep;62(9):2155-77. doi: 10.1111/j.1558-5646.2008.00450.x. Epub 2008 Jul 4. Review.
PubMed PMID: 18616572; PubMed Central PMCID: PMC2613234.