dna barcoding to map the microbial communities: current advances and future directions

12
MINI-REVIEW DNA barcoding to map the microbial communities: current advances and future directions Chiranjib Chakraborty & C. George Priya Doss & Bidhan C. Patra & Sanghamitra Bandyopadhyay Received: 12 November 2013 /Revised: 16 January 2014 /Accepted: 17 January 2014 # Springer-Verlag Berlin Heidelberg 2014 Abstract During the last two decades, the DNA barcode development towards microbial community has increased dramatically. DNA barcode development is related to error- free and quick species identification which aid in understand- ing the microbial biodiversity, as well as the diseases related to microbial species. Here, we seek to evaluate the so-called barcoding initiatives for the microbial communities and the emerging trends in this field. In this paper, we describe the development of DNA marker-based DNA barcoding system, comparison between routine species identification and DNA barcode, and microbial biodiversity and DNA barcode for microbial communities. Two major topics, such as the molec- ular diversity of viruses and barcode for viruses have been discussed at the same time. We demonstrate the current status and the maker of DNA barcode for bacteria, algae, fungi, and protozoa. Furthermore, we argue about the promises, limita- tions, and present and future challenges of microbial barcode development. Keywords Microbial community . DNA barcode . Next generation sequencing . Barcode marker gene . Biodiversity Introduction The last decade has proved to be an exciting era for biological science which is moving very fast with the help of new rapid technologies. Compared to the last 10 years, technological advancement has made data collection much easier, faster, and cheaper than what we cant think before. Biological research across different disciplines and researchers are performing collaborative research among cross subjects. Presently, most significant technologies were developed from computer science. Advancement in computer science has become one of the foundation components for the progression of new biological science (Losos et al. 2013; Stein 2008). Computer science has enabled scientists to decode various biological problems and health-related problems, such as ge- netically inherited diseases and drug discovery and develop- ment, etc (Doss et al. 2013; Chakraborty and Agrawal 2013; Chakraborty et al. 2011; Chakraborty and Doss 2013; George et al. 2013). In the ecosystem and biodiversity domain, DNA barcoding is a recent significant effort in this direction. More than 100 million species are existing in this planet, but only little proportion of them have been identified and characterized (May 1988). One of the biggest challenges is the identification of unknown species. Very few taxonomists can significantly identify nearly about 0.01 % of the projected 1015 million species (Hawksworth and Kalin-Arroyo 1995). Hebert et al. (2003a) expressed that a group of 15,000 taxon- omists or more than that will be needed to recognize living organisms morphologically. Again, if we look into the micro- bial world, we find astonishing numbers of microbes, which is very thrilling for microbiologist. In this microbial world, we are not aware of the diversity of the 99 % microbes that are mainly bacteria, archaea, and single-celled eukaryotes (Editorial 2011; Woese 1996). It is an impossible task to prepare the catalogue of microbial diversity by traditional methods following each taxon or each specimen with the C. Chakraborty (*) Department of Bioinformatics, School of Computer and Information Sciences, Galgotias University, Greater Noida, India e-mail: [email protected] C. G. P. Doss Medical Biotechnology Division, School of Biosciences and Technology, VIT University, Vellore 632014, Tamil Nadu, India B. C. Patra Department of Zoology, Vidyasagar University, Midnapore 721 102, West Bengal, India S. Bandyopadhyay Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India Appl Microbiol Biotechnol DOI 10.1007/s00253-014-5550-9

Upload: sanghamitra

Post on 23-Dec-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

MINI-REVIEW

DNA barcoding to map the microbial communities:current advances and future directions

Chiranjib Chakraborty & C. George Priya Doss &Bidhan C. Patra & Sanghamitra Bandyopadhyay

Received: 12 November 2013 /Revised: 16 January 2014 /Accepted: 17 January 2014# Springer-Verlag Berlin Heidelberg 2014

Abstract During the last two decades, the DNA barcodedevelopment towards microbial community has increaseddramatically. DNA barcode development is related to error-free and quick species identification which aid in understand-ing the microbial biodiversity, as well as the diseases related tomicrobial species. Here, we seek to evaluate the so-calledbarcoding initiatives for the microbial communities and theemerging trends in this field. In this paper, we describe thedevelopment of DNA marker-based DNA barcoding system,comparison between routine species identification and DNAbarcode, and microbial biodiversity and DNA barcode formicrobial communities. Two major topics, such as the molec-ular diversity of viruses and barcode for viruses have beendiscussed at the same time. We demonstrate the current statusand the maker of DNA barcode for bacteria, algae, fungi, andprotozoa. Furthermore, we argue about the promises, limita-tions, and present and future challenges of microbial barcodedevelopment.

Keywords Microbial community . DNA barcode . Nextgeneration sequencing . Barcodemarker gene . Biodiversity

Introduction

The last decade has proved to be an exciting era for biologicalscience which is moving very fast with the help of new rapidtechnologies. Compared to the last 10 years, technologicaladvancement has made data collection much easier, faster,and cheaper than what we can’t think before. Biologicalresearch across different disciplines and researchers areperforming collaborative research among cross subjects.Presently, most significant technologies were developed fromcomputer science. Advancement in computer science hasbecome one of the foundation components for the progressionof new biological science (Losos et al. 2013; Stein 2008).Computer science has enabled scientists to decode variousbiological problems and health-related problems, such as ge-netically inherited diseases and drug discovery and develop-ment, etc (Doss et al. 2013; Chakraborty and Agrawal 2013;Chakraborty et al. 2011; Chakraborty and Doss 2013; Georgeet al. 2013). In the ecosystem and biodiversity domain, DNAbarcoding is a recent significant effort in this direction.

More than 100 million species are existing in this planet,but only little proportion of them have been identified andcharacterized (May 1988). One of the biggest challenges is theidentification of unknown species. Very few taxonomists cansignificantly identify nearly about 0.01 % of the projected 10–15 million species (Hawksworth and Kalin-Arroyo 1995).Hebert et al. (2003a) expressed that a group of 15,000 taxon-omists or more than that will be needed to recognize livingorganisms morphologically. Again, if we look into the micro-bial world, we find astonishing numbers of microbes, which isvery thrilling for microbiologist. In this microbial world, weare not aware of the diversity of the 99 % microbes that aremainly bacteria, archaea, and single-celled eukaryotes(Editorial 2011; Woese 1996). It is an impossible task toprepare the catalogue of microbial diversity by traditionalmethods following each taxon or each specimen with the

C. Chakraborty (*)Department of Bioinformatics, School of Computer and InformationSciences, Galgotias University, Greater Noida, Indiae-mail: [email protected]

C. G. P. DossMedical Biotechnology Division, School of Biosciences andTechnology, VIT University, Vellore 632014, Tamil Nadu, India

B. C. PatraDepartment of Zoology, Vidyasagar University, Midnapore 721102, West Bengal, India

S. BandyopadhyayMachine Intelligence Unit, Indian Statistical Institute, Kolkata, India

Appl Microbiol BiotechnolDOI 10.1007/s00253-014-5550-9

morphological explanation (Lawton et al. 1988). Today’semerging technologies can support the task very easily(Godfray 2002). Considerable number of bacteria are notcultured, and few do not form colonies on agar plates(Schloss and Handelsman 2004; Rappe and Giovannoni2003). Especially, it has been observed for those bacteriawhich are living inside or on human beings (Curtin 2009;Friedrich 2008; Stone 2009; Turnbaugh et al. 2007). It is ofgreat challenge for the microbiologist to identify these mi-crobes. Recent advancement in DNA genome-based andDNA sequencing-based technologies can be an essential com-ponent in identifying these microbes. Presently, using DNAsequence data, the current “DNA barcoding” system is pro-viding a proficient platform for species-level identificationand this system ismore challenging. The scientific communityis using this process very efficiently which reflects in thepublication number. The publication in this domain is increas-ing very fast (Fig. 1).

With the advancement of sequencing and computationaltechnologies, the scientists from different genomic centers inboth the academia as well as the industries have standardizedDNA sequence and increased their capabilities in identifyingthe microbes with the help of the short sequences (Brent 2000;Pedersen 2010). Short unique sequence may help in differen-tiating the individual species due to the existence of geneticvariation in the closely related species (Hebert et al. 2003b).Many sequences of microbes are published in the variousmicrobial databases, as well as the public databases such asNCBI. Using these sequences, many efforts have been madein identifying the different strains as well as the developmentof barcode for microbial communities (Segata et al. 2013).Here, we undertook intensive field collections to explore theconcept of biological barcoding in microbial communities.We briefly described how DNA barcode can be proficient,

with a focus on its current standing and trends for the micro-bial communities such as virus, bacteria, algae, fungi, andprotozoa. Finally, an outlook has been provided on promises,limitations, and present and future challenges of microbialbarcode growth.

Comparing routine species identification and DNAbarcode-based species identification

The process of conventional species identification has manylimitations. The first problem is the phenotypic flexibility andgenetic variability in the characters which are accounted forthe different species identification that may direct to inaccu-rate identifications. The second problem is the identificationof cryptic species complex. These complexes were wide-spread in many groups of species (Jarman and Elliott 2000)especially most frequently existing in the marine environment(Knowlton 1993). Thirdly, the taxonomists are depending onmorphological keys for the routine species identification.These keys may vary in particular in life cycle or gender togender. Therefore, many species may be wrongly identified.Highly expert manpower is needed for this type of identifica-tion system.

Conversely, when using next generation high-throughputmethods, “DNA barcoding” is proved to be faster in speciesidentification. This modern-automated method is accurate,economic, and less time-consuming when compared to thetraditional process. This automated process can classify manynumbers of species at the same time. For example, high-throughput system facility could routinely handle 1,000 sam-ples per day or more. Currently, manpower cost is rising fast,whereas the cost of automation keeps on decreasing(Tautz et al. 2003). Therefore, the “DNA barcoding” is foundto be a more cost-effective and promising method as com-pared to the traditional process.

DNA barcoding system and DNA markers

We found that the genetic diversity is varying from species tospecies. The process of the species identification throughbarcoding is generally accomplished by the retrieval of a shortDNA sequence of microbes—from a standard part of thegenome. This short sequence is defined as barcode sequence,and it is specific for a particular gene. This short, consistentsequence can help in differentiating individual species(Hajibabaei et al. 2007; Hebert et al. 2003a).

For animals, the most common short sequence or markermitochondrial cytochrome c oxidase subunit I (COI)—isprojected to serve as a provisional identifier of a specimen.COI is approximately 650 base pair long and a single mtDNA

Fig. 1 Increasing trend in the publication in the DNA barcodingresearch. Keyword searched was performed from PUBMED, NCBIdatabase. We have used the keywords “(DNA barcoding) AND("2003"[Date - Publication] : "2004"[Date - Publication])”. Searchwas performed on 12th of November 2013

Appl Microbiol Biotechnol

gene. The wide range applicability as random PCR primers isthe major advantage of the COI gene (Hebert et al. 2003a, b).

DNA barcoding system has proved to be a trickier task forplants where DNA barcoding is using other markers. Due tothe low rate of nucleotide substitution mitochondrial genomein plants, the researchers are not using COI as a universal plantbarcode. The plastid markers show promise for plants(Fazekas et al. 2008). Different coding and non-coding re-gions have standardized the DNA barcoding system. Fewcoding loci such as rpoB, rpoC1, rbcL, matK, and 23SrDNA and non-coding loci such as trnH-psbA, atpF-atpH,and psbK-psbI were identified and used as DNA barcodingfor plants (Fazekas et al. 2008). Hollingsworth et al. (2011)have described a brief review with combinations of sevenplastid markers by different researchers on plant DNAbarcode development. Different mixtures of these markerswere discussed in the Second International Barcode of LifeConference in Taipei. Chase et al. (2007) suggested a combi-nation of barcode marker rpoC1+rpoB+matK or rpoC1+matK+trnH-psbA, followed by Kress and Erickson (2007)another combination of barcode marker rbcL+trnH-psbA.Lahaye et al. (2008) projected that the matK marker can worksingly as a plant barcode marker. The study of Seberg andPetersen (2009) projected rpoC1, matK, and trnH-psbA.Therefore, development of DNA barcoding loci for plantsremains as a challenging task.

Over two decades, DNA is used as a tool to identify thebacterial strains which was described by Woese (1987). Forbacterial taxonomy, scientists rely on a combination ofcharacter-based data and genome-based data or DNA-baseddata to understand the genetic and ecological diversity (Fraseret al. 2009). DNA-DNA hybridization technique was utilizedfor the identification of bacteria. Recently, genome-orientedmethods are evolving for the description of new species ofbacteria (Stackebrandt et al. 2002; Stackebrandt and Ebers2006; Richter and Rossello-Mora 2009). Since 1990s, 16SrRNA gene, the most common housekeeping genetic markeris being used as a marker gene in bacterial identificationprocess. In this context, many scientists have used PCR-amplified 16S rRNA gene to identify microbes (Schmidt andRelman 1994; Janda and Abbott 2007). This gene is used tostudy bacterial phylogeny and taxonomy for a number ofcauses. There are some benefits to use the gene as a barcodingmarker. First, this gene is present in almost all the bacterialspecies, frequently accessible as a multigene family or op-erons. Secondly, the function of the gene eventually has notchanged. Finally, this gene is approximately 1,500 bp inlength which is sufficient for informatics methods. Due tothe above advantages, 16S rRNA gene is used as a barcodemarker frequently. (Patel 2001; Ranasinghe et al. 2012).Another gene chaperonin-60 (cpn60) was also used as abacterial barcode marker gene. This gene is approximately555 bp long (Links et al. 2012).

Other than this, many other genes have been proposed asspecies-level molecular markers such as cytb, ITS1, ITS2,18S, and 28S RNA (LaJeunesse 2001; Case et al. 2007;Hajibabaei et al. 2007). A list of the marker genes has beenprovided in Table 1 which is routinely used in barcodingstudies of microbial communities. Approximate length ofsome marker gene is noted in Fig. 2. However, the barcodemarker sequence from each unidentified species is comparedwith a library of reference barcode sequences derived fromindividuals of known identity. The final goal of the DNAbarcoding system is to build up a robust and efficient mech-anism for the species identification in a fast manner which isstandardized, simple, and scalable.

Microbial biodiversity and DNA barcode for microbialcommunities

Evolution developed biodiversity. Actually, biodiversity wascreated by the evolution during the 3.5 billion years. About600 million years ago, life of microbes such as archaea,bacteria, protozoa, and similar single-celled organisms weredeveloped and gone through the biodiversity (Wilson 1999;Snedden 2007). Therefore, the microbial world is full ofbiodiversity. In the 1990s, investigation was initiated to un-derstand the diversity of bacteria and archaea. This was per-formed using DNA sequences and DNA taxonomy. The con-cept was that microbes have highly conserved small subunitribosomal RNA (SSrRNA) gene. This gene can be isolatedfrom mass environmental samples as a form of mass “envi-ronmental DNA.” With the examination of SSrRNA genesequence from the DNA sample, we can identify how manydifferent taxa are present. Then, scientists evaluated the rela-tionship of each sequenced organism. They have found thatmicroorganisms lived in main groups which are unknown totraditional microbiology. Finally, it has been noted that thediversity of microbes is almost 100 times higher than whatwas projected earlier (Woese 1996; Pace 1997; Blaxter 2003).

Barcoding for viruses—an open question

Probably, viruses are the most abundant biological creature onearth. It has been estimated that the total number of virusparticles is more than 10 times the total number of cells(Suttle 2007). The molecular diversity of viruses is compli-cated, as virus molecular diversity of genome is also complex(Hulo et al. 2011). The “molecular entity” and virus speciesstill remain as a debate for several scientists (Casiraghi et al.2010). The identification and explanation ofmolecular entitiesof virus should be the major objective of DNA barcoding.Very few works have been incited to identify thepathogenically important virus. Therefore, researchers should

Appl Microbiol Biotechnol

try to develop the barcode for the detection or identification ofthe virus.

Recently, few works have been found in this direction. Weiet al. (2011) described the k-mer-based barcode image toidentify significant pathogen human enteroviruses (HEVs).In this process, Wei et al. used the condition of 1<k<7 for afixed k and a genome barcode was described in terms of the k-mer frequency distribution across the whole genome for allcombinations of k-mers. Bluetongue virus (BTV) is an animalvirus which affects the different mammals such as cattle,buffalo, sheep, deer, goats, etc (Maan et al. 2011). For thedetection of this virus, ultrasensitive technique bio-barcodeamplification assay (BCA)methodwas developed. This meth-od was used for the specific detection of the outer-core proteinVP7 of BTV. Though they have produced protein bases bio-barcode, however, signal DNA annealed to DNA strands

Table 1 Different type of markers that have routinely been included in microbial communities for barcoding studies

Microbialcommunity

Barcoding markergenes

Remark References

Virus No specific marker 1. Nanoparticle-based bio-barcode assay2. Sandwich immunoassay and fluorophore-tagged oligonucleotidesas representative barcodes

Yin et al. 2011; Cao et al. 2009

Bacteria 16S rRNA 1. Nuclear DNA and housekeeping gene2. Coding genes are referred as 16S rDNA

Janda and Abbott 2007;Woo et al. 2008

COI Terminal enzyme in the respiratory chains in many bacteria Michel et al. 1998

cpn60 1. Chaperonin-60 (cpn60) also known as GroEL and Hsp602. A large barcode gap has been noted in cpn60

Links et al. 2012

rpoB Gene that encodes the β subunit of bacterial RNA polymerase Campbell et al. 2001

Algae COI COI was used as the barcode marker for brown and red algae Saunders and McDevit 2012

rbcL 1. Genomic location is plastid, rbcL-3P was used as the barcode markerfor diatoms and macroalgae

2. Protein coding gene

Saunders and McDevit 2012

tufA 1. tufA gene, a chloroplast gene, is a protein coding gene which encodesfor elongation factor Tu

2. It intercede the entry of an amino-acyl-tRNA into the acceptor site of aribosome during protein synthesis

Famà et al. 2002

LSU Genes coding for it are referred to as LSU rDNA such as 28S LSU rDNA Ali et al. 2001

UPA Universal plastid amplicon (UPA) used DNA barcode for red algae Sherwood et al. 2010

ITS Internal transcribed spacer (ITS) region of the ribosomal cistron Schoch et al. 2012

RuBisCO It encoded Ribulose-1,5-bisphosphate carboxylase oxygenase Meyer et al. 2012

Fungi COI COI coding region contains group I introns Cho et al. 1998

ITS Internal transcribed spacer region of the ribosomal cistron Schoch et al. 2012;Begerow et al. 2010

LSU Genes coding for LSU are referred to as LSU rDNA such as 28S LSU rDNA Begerow et al. 2010

SSU Genes coding referred to as SSU rDNA such as 16S SSU rDNA Begerow et al. 2010

RPB1, RPB2 Ribosomal polymerase B1 and B2 Seifert 2009

Protozoa COI COI as a good marker for DNA barcoding of amoebae Kosakyan et al. 2012

18S rRNA DNA encoding protozoal small subunit (SSU) ribosomal RNA Karnati et al. 2003

28S rRNA Evaluated as DNA barcode markers for piroplasma Gou et al. 2012

ITS ITS2 is used for piroplasma Gou et al. 2012

16S rRNA 16S ribosomal RNA, 28S rRNA 28S ribosomal RNA, COXI cytochrome c oxidase I, cpn60 chaperonin-60, rpoB RNA polymerase betasubunit, rbcL ribulose-bisphosphate carboxylase, tufA elongation factor Tu, UPA universal plastid amplicon, LSU large subunit, SSU small subunit, ITSinternal transcribed spacer, RuBisCO ribulose-1,5-bisphosphate carboxylase oxygenase, RPB1/ RPB2 ribosomal polymerase B1/ B2)

Fig. 2 Approximate length of some markers used in microbial commu-nities for barcoding studies

Appl Microbiol Biotechnol

bound with the gold nanoparticles were released by heatingand characterized by PCR and real-time fluorescence PCR(Yin et al. 2011). To detect avian influenza virus (AIV), afluorescent DNA barcode-based immunoassay was developedbased on the application of sandwich immunoassay andfluorophore-tagged oligonucleotides as representativebarcodes Cao et al. (2009).

To understand the viral biodiversity and development ofDNA barcode, no marker DNA or RNA has been developedfor viruses. However, it is the time to understand the viralbiodiversity with DNA barcoding.

DNA barcoding for bacteria

A microbial diversity picture in this planet is still not clear, asmany microbes are difficult to culture. It is very crucial tounderstand microbe association with different environmentsand their function in those environments. Gene sequences of16S rRNA from different samples especially environmentalsamples have reformed our understanding of microbial diver-sity, and it helps us in cataloging the vast diversity of micro-organisms on earth (Rappe and Giovannoni 2003; Schloss andHandelsman 2004; Lozupone and Knight 2007). Over the past20 years, more than 80,000 16S rRNA gene sequences havebeen deposited in GenBank, which can help us to understandbacterial phylogeny (Schloss and Handelsman 2004). To iden-tify different species of bacteria, 16S rRNA gene is describedas an important marker for soil and marine ammonia-oxidizing bacteria (Stephen et al. 1996), mountain lakes(Hiorns et al. 1997), rice paddy soil microcosms (Großkopfet al. 1998), and human clinical samples (Clarridge 2004). So,it is apparent that 16S rRNA gene highly conserved for eachand every species of bacteria, and it can be used as a markerfor DNA barcode for different species.

Two bacteria phyla Firmicutes and Bacteroidetes are med-ically necessary and related to obesity. The ratio, betweenthese bacteria, phyla Firmicutes/Bacteroidetes ratio (F/B ratio)was observed to be significantly higher in obese individualsthan in lean individuals, and the bacterial ratio (F/B ratio) isrelated to weight loss over time (Ley et al. 2006). This initia-tive wasmoved to develop DNA barcode of these two bacteriaphyla Firmicutes and Bacteroidetes using 16S rRNA gene(Armougom and Raoult 2008).

COI gene is used to develop the DNA barcode for bacteria.Using COI marker, DNA barcode was developed for 22species pathogen (Jones et al. 2013). Recently, Smith et al.(2012) developed the DNA barcode forWolbachia, a commonendosymbiotic bacterium, using COI gene, and this gene isone of the five multilocus sequence typing genes which wasapplied for categorizingWolbachia. They have found very fewoverlap with the eukaryotic DNA barcode area. This studycorroborates that the COI gene can be a DNA barcode marker

for bacteria. Chaperonin-60 (cpn60), also known as GroELand Hsp60, is a molecular chaperone conserved in bacteria.Conversely, for evaluating the barcoding targets for Archaeaincluding 16S rRNA, type II chaperonin (ortholog of cpn60)was found to be another option (Chaban and Hill 2012). Linkset al (2012) suggested that cpn60 can be a common target forbacteria barcode. Some other genes are used for bacterialidentification such as rpoB gene which may be used as abarcode marker gene for bacteria (Case et al. 2007). So, wecan infer that 16S rRNA, COI gene, and cpn60 can normallybe used as markers for developing DNA barcode.

DNA barcoding for algae

Most of the area of the earth is covered with the sea. Algae arefound in freshwater and seawater through the planet. Themaximum variety of algae is found in the coasts in bothtemperate and tropical seas. It has been noted that thephylogenic diversity of the algae is very wide-ranging(Metting 1996). For the last two decades with the help of thecomputational biology, increasingly molecular tools were ap-plied to identify algal species. Several molecular markers weresuggested from time to time to identify the algal species.Examples include COI, rbcL (the rubisco operon) (Hugheyet al. 2001), the internal transcribed spacer (ITS) of the ribo-somal cistron (Tai et al. 2001; Ross et al. 2003), tufA, andlarge subunit (LSU) 28S of the ribosomal cistron, 23SUniversal Plastid Amplicon (UPA), etc. Some markers arecoming into the picture for the development of DNA barcodealgae.

Saunders and McDevit (2012) tried to develop a methodfor DNA barcode for macroalgae and diatoms such as brownalgae (Phaeophyceae), red algae (Rhodophyta), green algae(Chlorophyta), and microscopic diatoms (Bacillariophyta).They used three barcode markers such as COI-5P, rbcL-3P,and tufA. COI was used as the barcode marker for brown andred algae, tufA as the barcode marker for green algae, andrbcL-3P was used as the barcode marker for diatoms. ForCarrageenophytes, various molecular markers have been in-troduced for the molecular taxonomy such as COX1 andCOX2-3 spacer, ITS, LSU, rbcL, RuBisCO, and the 23SUPA (Conklin et al. 2009; Zuccarello et al. 2006; Fredericqet al. 1999; Zhao and He 2011). A recent study by Tan et al.(2012) suggested the efficiency of the four markers COX1,COX2, COX2-3, and rbcL in DNA barcoding on selectedKappaphycus and Eucheuma from Southeast Asia. ForDNA barcoding, development of some marine algae especial-ly the Rhodophyta, COX1 “barcode” region was used toassess the marker (Saunders 2005). Buchheim et al. (2011)assessed the internal transcribed spacer two (ITS2) from 591Chlorophycean, 741 Trebouxiophycean, and 938Ulvophycean algae, which is one of the marker development

Appl Microbiol Biotechnol

steps for the DNA barcode development. Using COX1marker,Robba et al. (2006) assessed DNA barcode using 48 samplesplus 31 sequences for six orders of red algae namely Bangiales,Ceramiales, Corallinales, Gigartinales, Gracilariales, andRhodymeniales. They also used COX1 with the plastidRubisco spacer for Porphyra species and discovered that itwas a more susceptible marker in revealing initial speciationand cryptic diversity. Finally, they concluded COX1 gene is apossible DNA barcode marker of red algae. From the aboveevidences, we can understand that the vivid biodiversity of thealgal community has not been neglected, and scientists aretrying to develop the barcode of the biodiversity of the algalcommunity.

DNA barcoding for fungi

Identification of fungi to species level is the primary prob-lem for fungal research. Proper treatment for fungal dis-eases in humans is depending on appropriate identificationof the disease-causing agents (Bialek et al. 2005; Rickertset al. 2006). Moreover, species level identification plays afundamental role in biology conservation. With the help ofmolecular biology and system biology, DNA-based taxon-omy and DNA barcoding are the most interesting researchareas for the rapid level identification of all species. Theabsence of a general DNA barcode marker for fungi is aserious limitation for multitaxon ecological and biodiversi-ty studies (Schoch et al. 2012). It is interesting for molec-ular mycological community that fungal sequences areincreasing day by day in NCBI. Therefore, many studieshave been facilitated in DNA-based taxonomy and DNAbarcoding for fungi (Begerow et al. 2010). Stockinger et al.(2010) described a practical approach to choose DNAbarcode marker for arbuscular mycorrhizal fungi. Theyconsidered different criteria like universality, feasibility,and species resolution. For fungal species, the commonlyused DNA barcoding marker are—COI (Seifert et al. 2007),SSU rRNA gene (Poll et al. 2009), ITS sequences (Hugheset al. 2009), LSU, rRNA gene (Stockinger et al. 2010),ribosomal polymerase B1 and B2 (RPB1, RPB2) (Seifert2009), etc. One recent study has evaluated the potentialityof a number of fungal genes as barcode marker for fungi. Inthis study, from the barcoding database of 2,920 samples,the researchers have chosen a subset of 742 strains withsequences for four markers namely ITS, LSU, SSU, andRPB1. This subset was separated into four taxonomicallydelimited datasets which are 416 strains of Pezizomycotina,81 strains of Saccharomycotina, 202 strains ofBasidiomycota,and 43 strains from the collective lineages, and they haveconcluded that the ITS gene, located between the area of theribosomal cistron, can be used to identify a broad range offungi species; also, the distinct barcode gap has been

produced among interspecific and intraspecific variants(Schoch et al. 2012).

For more than two decades, the rRNA cistron is widelyused for fungal diagnostics and scientists are also using thisas a marker in phylogenetics (Begerow et al. 2010). 28Snuclear ribosomal LSU rRNA gene data are not availablefor some species, although the gene is usually more con-served than the ITS. However, the LSU or a combinedLSU+ITS will increase the degree of promise for barcode(Eberhardt 2010). However, for various yeasts, the D1/D2region of LSU was adopted for characterizing species longbefore the concept of DNA barcoding was promoted(Fell et al. 2000; Scorzetti et al. 2002). To understand themolecular phylogeny, evolution, and biodiversity ofarbuscular mycorrhizal fungi (AMF), two overlapping nu-clear DNA regions, totaling c. 3 kb, were analyzed: thesmall subunit (SSU) rRNA gene (up to 1,800 bp) and afragment spanning c. 250 bp of the SSU rDNA, the ITSregion (c. 475–520 bp), and c. 800 bp of the LSU rRNAgene were analyzed by Krüger et al (2012) which may be areference data for DNA barcoding. Chinese caterpillar fun-gus (Ophiocordyceps sinensis) has a medicinal value. ITSsequence as a DNA barcode was analyzed for this fungus.The result indicated that DNA barcoding is one of the fastand accurate identification methods to identify this fungus(O. sinensis) and closely related species (Xiang et al. 2013).In another study by Li et al. (2013), ITS sequence was usedto identify this fungus. Quaedvlieg et al. (2012) developedDNA barcoding ofMycosphaerella species from Europe. Inthis study, seven nuclear genomic loci were assessed foridentification of Mycosphaerella species as well as associ-ated anamorphs. Here, researchers considered seven nucle-ar genomic loci such as ITS regions of the nrDNA operon,28S nrDNA (LSU), β-tubulin (Btub), calmodulin (Cal),actin (Act), translation elongation factor 1-alpha (EF1α),and RNA polymerase II second largest subunit (RPB2).They concluded that the combination of ITS with eitherEF1α or Btub can be considered as the barcoding loci forEPPO A1/A2-listed Mycosphaerella species. Establishedbarcode loci are important for Cladonia spp. Pino-Bodaset al (2013) studied the loci IGS rDNA, EF1α, RPB2, andCOX1, where they have evaluated a total number of 782sequences from 36 species. In this study, three markersshowed the least intraspecific genetic distance range(COX1, ITS rDNA, and EF1α). Wickerhamomycesanomalus plays a important role in the food industry. β-tubulin gene was evaluated to understand the DNAbarcode marker of Wickerhamomyces strains. Species-specific PCR assay was developed for β-tubulin gene tounderstand the phylogenetic relationships between strainsW. anomalus (Huang et al. 2012).

To understand the fungal biodiversity, the ecological roles,and geographical distribution in pathogenic important fungi,

Appl Microbiol Biotechnol

DNA barcoding was proved to be worthful with enormouspotential. In the near future, DNA barcoding will definitelyplay a crucial role towards mycological research.

DNA barcoding for protozoa

Protozoan species identification and classification are difficulttasks because of the less number of morphological characters(Smirnov and Goodkov 1999), and it is true for protozoa withsimple morphology such as amoebae. The light microscopicfeatures in most of the protozoan species didn’t provide reli-able species differences (Smirnov 2002). Therefore, it is im-portant to develop a novel rapid tool in precise identificationof pathogenic protozoan species. With the development of themolecular system, the protozoa species identification is foundto be an interesting subject with the aid of DNA barcoding.

Amoebae are the most extensive group of protists residingin most of all the environments such as marine, freshwater,and terrestrial biosphere. Recently, attempts have been madeto identify this widespread protozoan species with molecularmarkers. Three genes, namely, SSU, ITS, and COI genes havebeen cloned to develop the molecular marker. This studyconcluded stating COI as a good marker for DNA barcodingof amoebae. However, further studies are needed to confirmthe accuracy of the COI gene as a barcode marker in othergroups of amoebae such as Gymnamoebae species. At thesame time, we have to understand the taxonomic significanceof COI variations in the different amoebaean species(Nassonova et al. 2010). Recently, Hoef-Emden (2012) triedto establish DNA barcoding systems in protists using speciesidentification technique—the blast in search with COI-5P and5′-partial LSU rDNA (domains A to D of the nuclear LSUrRNA gene). In this study, two different 5′-partial LSU rDNAdata sets have been applied to observe the evolutionarymodels. Nature of the frequency distribution of possiblebarcode gaps and genetic distances can be correlated withthe taxon sample shape. For some protozoa in genusTetrahymena such as Tetrahymena hegewischi, Tetrahymenacanadensis, Tetrahymena rostrata, Tetrahymena pyriformis,Tetrahymena thermophila, and Tetrahymena tropicalis,COX1 marker has been used for the development of barcode(Kher et al. 2011). Using molecular tools, species distinctionis a priority for piroplasma. Therefore, 18S rRNA, 28S rRNA,ITS regions, and COI genes have been evaluated as DNAbarcode markers for Piroplasma. ITS2 is defined as the mostideal DNA barcode based on the current database forPiroplasma (Gou et al. 2012). Animal parasites have beenstudied extensively in this direction. Diplostomoidmetacercariae is a freshwater fish parasite distributed world-wide. To study their diversity, barcode marker COI was usedto discriminate species range of 1,088 diplostomoids, which

was isolated from different fish from the St. Lawrence River,Canada (Locke et al. 2010).

In 1993, Plasmodium falciparum, the medically importantparasite, gene was examined through PCR and tried to devel-op “barcodes” to identify parasite stocks and lineages (Arnotet al. 1993). This study reflected the phenomenon that scien-tists are putting their efforts to barcode development for thespeedy identification of medically important parasites.

Present challenges, promises, and limitations of microbialbarcode development

In the last few years, DNA barcoding is moving very fast andmoved from dream to reality. However, list of some of themajor unsolved problems are existing for barcode develop-ment. The “cryptic species” identification is very problematicbecause morphologically these species are very similar.Taxonomies based on morphological analysis of such speciescan be challenging (Hawksworth 2006; Kemler et al. 2006,2009; Paulus et al. 2007). Many studies have shown that theoccurrence is due to convergence evolution (Lorenz et al.2005). However, DNA barcoding can solve this problemand can identify “cryptic species” quickly with support ofthe advanced barcode marker. This work is under progressto identify unique barcode marker considering genetic dis-tance boundary conditions. It is possible to identify geneticdistance boundary conditions within two individuals betweenthe related species. Even though the boundary seems to betaxon related, it was noted that the value of genetic distancebetween two DNA barcode sequences equivalent to or morethan 3 % (D≥0.03) identifies distinct species.

However, using genetic distance approaches, the develop-ment of DNA barcodes have several limitations mainly indefining the species boundaries (Witt et al. 2006). One reasonis that the DNA which we are selecting as a barcode markerhas the rate of evolution that varies substantially between andwithin species and between different groups of species, con-sequently resulting in large overlaps of intraspecific and inter-specific distances (Kipling and Rubinoff 2004; Rubinoff2006; Rubinoff et al. 2006).

Table 2 Numbers of dif-ferent marker genes listedfrom GenBank, NCBI(11 November 2013)

Marker gene Number of sequences

16S rRNA 40,546

cpn60 6,258

rpoB 3,375

COI 2,967

rbcL 1,026

tufA 549

18S RNA SSU 100

28S RNA LSU 87

Appl Microbiol Biotechnol

The second challenge remains as a major challenge inidentifying the appropriate barcode marker development fordifferent microbial communities. Different kinds of DNAmarkers have been described for the microbial species identi-fication. It has been noted that the sequence of differentmarker gene from microbial species is growing which canbe noted from NCBI (Table 2). It is a challenge to select theappropriate sequence also. For example, many barcodemarkers have been described for fungi, such as nuclear largeribosomal subunit (LSU rDNA) (Kurtzman and Robnett1998), nuclear small ribosomal subunit (SSU rDNA)(Baayen et al. 2001), internal transcribed spacer (ITS)(Druzhinina et al. 2005), β-tubulin (BenA) (Geiser et al.2007) , elongation factor 1-α (EF1α) (O'Donnell et al.2008), and second largest subunit of RNA polymerase II(RPB2) (O'Donnell et al. 2008; Ertz et al. 2009). Severalinitiatives were taken in this direction to choose the appropri-ate marker (Santamaria et al. 2009). The software packagenamed DNA-BAR has been developed which can chooseDNA probes exploitable for molecular categorization of mi-croorganisms. This interactive package is produced byDasGupta et al. (2005) and written the script in degenbarwhich finds the near-minimum number of distinguishers andalso allow the users to choose the sets of these distinguishers.

The third challenge is that the morphological identificationof species is sometimes difficult, especially for the microbialcommunity. It has been noted that the morphological charac-teristics varied in most of the life history stages and gender togender. Therefore, it is very much difficult to distinguish thespecies from the mixture of biological species. The DNAbarcoding will make this task much easier. With the help ofthe barcode marker, microbial diseases sample can be identi-fied by the physicians or veterinarians without difficulty. Also,it will be easier to identify the assortment of microbial speciescollected from the environment (Besansky et al. 2003; Wongand Hanner 2008; Valentini et al. 2008). Pons et al. (2006)projected a collective population model with a Yule model ofspeciation (Yule 1924) that permits us to define “species” as acluster of specimens in a particular time frame.

The purpose and objective of the barcoding should provideelectropherograms along with the raw sequences as well assequence quality information (Consortium for the barcode oflife 2009). At present, the sequences are not following most ofthe aim and objectives which are in the public sequencedatabases.

Future challenges and concluding remarks

In summary, DNA barcoding will provide balanced taxonom-ic research, population genetics, phylogenetics, and finally,computational biology for hologram-based barcode develop-ment. It is still debatable how a scientist can identify, discover,

and illustrate the feature of the unidentified species with thehelp of the DNA barcode. It can be noted from the paper ofHebert et al. (2003a, b) that they promised us that “taxonomicexpertise is collapsing”. Conversely, Packer et al. (2009) gavean opinion that systematists should not “panic”. However, thestarting phase of the barcode development is very interestingon the basis of recent developments. It is to be noted that thebarcode databases are growing very fast to more than100,000specimens per year. Therefore, we have also found an opti-mism that synergy is present between barcode developers andpracticing systematists where in many publications their skillsets are coexistent.

International Barcode of Life describes that “DNA se-quence can be used to identify different species, in the sameway a supermarket scanner can use a familiar black strip of theUPC barcode to identify your purchases”. It means that wehave to develop a digital barcode hologram eventually. Withthe digital barcode hologram, we can use the barcode reader todetect the species. Very little work has been initiated so far(Merker et al. 2013). However, the development of digitalbarcode hologram for life is still in its infancy stage. The firststage of such work is to develop the barcode with digitalinformation, i.e., DNA-based digital information. DNA hasmany potential advantages for unchallengeable, high informa-tion storage which can carry digital information also. Churchet al. (2012) published a work related to next-generationdigital information storage in DNA that will help us to readthe barcode sequences. But, more works are needed to under-stand about barcode density, redundant encodings, paritychecks, and error correction to improve density, error rate,and safety for DNA-based barcoding. We are very hopeful tonotice condensed, digital barcode hologram and error-free andappropriate barcode for microbial communities in the nearfuture which will enable us to identify microbial species veryfast.

Acknowledgments The authors take this opportunity to thank themanagement of Galgotias University and VIT University for providingthe facilities and encouragement to carry out this work. The authors havedeclared that no conflict of interest exists.

References

Ali AB, De Baere R, Van der Auwera G, De Wachter R, Van de Peer Y(2001) Phylogenetic relationships among algae based on completelarge-subunit rRNA sequences. Int J Syst Evol Microbiol 51:737–749

Armougom F, Raoult D (2008) Use of pyrosequencing and DNAbarcodes to monitor variations in Firmicutes and Bacteroidetescommunities in the gut microbiota of obese humans. BMCGenomics 9:576

Arnot DE, Roper C, Bayoumi RA (1993) Digital codes from hypervar-iable tandemly repeated DNA sequences in the Plasmodium

Appl Microbiol Biotechnol

falciparum circumsporozoite gene can genetically barcode isolates.Mol Biochem Parasitol 61:15–24

Baayen RP, O'Donnell K, Breeuwsma S, Geiser DM, Waalwijk C (2001)Molecular relationships of fungi within the Fusarium redolens—F. hostae clade. Phytopathology 91:1037–1044

Begerow D, Nilsson H, Unterseher M, Maier W (2010) Current state andperspectives of fungal DNA barcoding and rapid identificationprocedures. Appl Microbiol Biotechnol 87:99–108

Besansky NJ, Severson DW, Ferdig MT (2003) DNA barcoding ofparasites and invertebrate diseases vectors: what you don’t knowcan hurt you. Trends Parasitol 19:545–546

Bialek R, González GM, Begerow D, Zelck UE (2005)Coccidioidomycosis and blastomycosis: advances in molecular di-agnosis. FEMS Immunol Med Microbiol 45:355–360

Blaxter M (2003) Molecular systematics: counting angels with DNA.Nature 421:122–124

Brent R (2000) Genomic biology. Cell 100:169–183Buchheim MA, Keller A, Koetschan C, Förster F, Merget B, Wolf M

(2011) Internal transcribed spacer 2 (nu ITS2 rRNA) sequence-structure phylogenetics: towards an automated reconstruction ofthe green algal tree of life. PLoS One 6:e16931

Campbell EA, Korzheva N, Mustaev A, Murakami K, Nair S, GoldfarbA, Darst SA (2001) Structural mechanism for rifampicin inhibitionof bacterial RNA polymerase. Cell 104:901–912

Cao C, Dhumpa R, Bang DD, Ghavifekr Z, Høgberg J, Wolff A (2009)Detection of avian influenza virus by fluorescent DNA barcode-based immunoassay with sensitivity comparable to PCR. Analyst135:337–342

Case RJ, Boucher Y, Dahllof I, Holmstrom C, Doolittle WF, Kjelleberg S(2007) Use of 16S rRNA and rpoB genes as molecular markers formicrobial ecology studies. Appl Environ Microbiol 73:278–288

Casiraghi M, Labra M, Ferri E, Galimberti A, De Mattia F (2010) DNAbarcoding: a six-question tour to improve users' awareness about themethod. Brief Bioinform 11:440–453

Chaban B, Hill JE (2012) A ‘universal’ type II chaperonin PCR detectionsystem for the investigation of Archaea in complex microbial com-munities. ISME J 6:430–439

Chakraborty C, Agrawal A (2013) Computational analysis of C-reactiveprotein for assessment of molecular dynamics and interaction prop-erties. Cell Biochem Biophys 67:645–656

Chakraborty C, Doss GPC (2013) Sirtuins family—recent developmentas a drug target for aging, metabolism, and age related diseases. CurrDrug Targets 14:666–675

Chakraborty C, Agoramoorthy G, Hsu MJ (2011) Exploring the evolu-tionary relationship of insulin receptor substrate family using com-putational biology. PLoS One 6:e16580

ChaseMW, Cowan RS, Hollingsworth PM, van den Berg C,Madrinan S,Petersen G, Seberg O, Jørgsensen T, Cameron KM, Carine M,Pedersen P, Hedderson TAJ, Conrad F, Salazar GA, RichardsonJE, Hollingsworth ML, Barraclough TG, Kelly L, Wilkinson M(2007) A proposal for a standardised protocol to barcode all landplants. Taxon 56:295–299

Cho Y, Qiu YL, Kuhlmann P, Palmer JD (1998) Explosive invasion ofplant mitochondria by a group I intron. Proc Natl Acad Sci U S A95:14244–14249

Church GM, Gao Y, Kosuri S (2012) Next-generation digital informationstorage in DNA. Science 337:1628

Clarridge JE (2004) Impact of 16S rRNA gene sequence analysis foridentification of bacteria on clinical microbiology and infectiousdiseases. Clin Microbiol Rev 17:840–862

Conklin KY, Kurihara A, Sherwood AR (2009) A molecular method foridentification of the morphologically plastic invasive algal generaEucheuma andKappaphycus (Rhodophyta, Gigartinales) in Hawaii.J Appl Phycol 21:691–699

Consortium for the Barcode of Life (2009) http://www.barcoding.si.eduAccessed 21 December 2009

Curtin C (2009) Humans as host. Genome Tech 38–43DasGupta B, Konwar KM, Mandoiu II, Shvartsman (2005) DNA-BAR:

ditinguisher selection for DNA barcoding. Bioinformatics 21:3424–3426

Doss GPC, Chakraborty C, Rajith B, Nagasundaram N (2013) Modelingand molecular dynamics approach to characterize the functionalnsSNPs in hTERT by means of local DNA sequence context andregularity. J Mol Model 19:3517–3527

Druzhinina IS, Kopchinskiy AG, KomoñM, Bissett J, Szakacs G, KubicekCP (2005) An oligonucleotide barcode for species identification inTrichoderma and Hypocrea. Fungal Genet Biol 42:813–828

Eberhardt U (2010) A constructive step towards selecting a DNA barcodefor fungi. New Phytol 187:265–268

Editorial (2011) Microbiology by numbers. Nat Rev Microbiol 9(9):628Ertz D, Miadlikowska J, Lutzoni F, Dessein S, Raspé O, Vigneron N,

Hofstetter V, Diederich P (2009) Towards a new classification of theArthoniales (Ascomycota) based on a three-gene phylogeny focus-ing on the genus Opegrapha. Mycol Res 113:141–152

Famà P, Wysor B, Kooistra WH, Zuccarello GC (2002) Molecularphylogeny of the genus Caulerpa (Caulerpales, Chlorophyta) in-ferred from chloroplast tufA gene1. J Phycol 38:1040–1050

Fazekas AJ, Burgess KS, Kesanakurti PR, Graham SW, Newmaster SG,Husband BC, Percy DM, Hajibabaei M, Barrett SCH (2008)Multiple multilocus DNA barcodes from the plastid genome dis-criminate plant species equally well. PLoS One 3:e2802

Fell JW, Boekhout T, Fonseca A, Scorzetti G, Statzell-Tallman A (2000)Biodiversity and systematics of basidiomycetous yeasts as deter-mined by large-subunit rDNAD1/D2 domain sequence analysis. IntJ Syst Evol Microbiol 50:1351–1371

Fraser C, Alm EJ, Polz MF, Spratt BG, Hanage WP (2009) The bacterialspecies challenge: making sense of genetic and ecological diversity.Science 323:741–742

Fredericq S, Freshwater DW, Hommersand MH (1999) Observations onthe phylogenetic systematics and biogeography of the Solieriaceae(Gigartinales, Rhodophyta) inferred from rbcL sequences and mor-phological evidence. Hydrobiol 398(399):25–38

Friedrich MJ (2008) Microbiome project seeks to understand humanbody’s microscopic residents. JAMA 300:777–778

Geiser DM, Klich MA, Frisvad JC, Peterson SW, Varga J, Samson RA(2007) The current status of species recognition and identification inAspergillus. Stud Mycol 59:1–10

George PDC, Nagasundaram N, Chakraborty C, Chen L, Zhu H (2013)Extrapolating the effect of deleterious nsSNPs in the binding adapt-ability of flavopiridol with CDK7 protein: a molecular dynamicsapproach. Hum Genomics 7:10. doi:10.1186/1479-7364-7-10

Godfray HJC (2002) Challenges for taxonomy: the discipline will have toreinvent itself if it is to survive and flourish. Nature 417:17–19

Gou H, GuanG, Liu A,MaM,Xu Z, Liu Z, Ren Q, Li Y, Yang J, Chen Z,Yin H, Luo J (2012) A DNA barcode for Piroplasmea. Acta Trop124:92–97

Großkopf R, Janssen PH, LiesackW (1998) Diversity and structure of themethanogenic community in anoxic rice paddy soil microcosms asexamined by cultivation and direct 16S rRNA gene sequence re-trieval. Appl Environ Microbiol 64:960–969

Hajibabaei M, Singer GA, Hebert PD, Hickey DA (2007) DNAbarcoding: how it complements taxonomy, molecular phylogeneticsand population genetics. Trends Genet 23:167–172

Hawksworth DL (2006) Pandora's mycological box: molecular se-quences vs. morphology in understanding fungal relationships andbiodiversity. Rev Iberoam Micol 23:127–133

Hawksworth DL, Kalin-Arroyo MT (1995) Magnitude and distributionof biodiversity. In: Heywood VH (ed) Global biodiversity assess-ment. Cambridge University Press. Cambridge, p. 107–191

Hebert PDN, Cywinska A, Ball SL, deWaard JR (2003a) Biologicalidentifications through DNA barcodes. Proc R Soc Lond B270:313–321

Appl Microbiol Biotechnol

Hebert PDN, Ratnasingham S, deWaard JR (2003b) Barcoding animallife: cytochrome c oxidase subunit 1 divergences among closelyrelated species. Proc R Soc Lond B 270:S96–S99

Hiorns WD, Methe BA, Nierzwicki-Bauer SA, Zehr JP (1997) Bacterialdiversity in Adirondack mountain lakes as revealed by 16S rRNAgene sequences. Appl Environ Microbiol 63:2957–2960

Hoef-Emden K (2012) Pitfalls of establishing DNA barcoding systems inProtists: the Cryptophyceae as a test case. PLoS One 7:e43652

Hollingsworth PM, Graham SW, Little DP (2011) Choosing and using aplant DNA barcode. PLoS One 6:e19254

Huang CH, Chang MT, Huang L (2012) Species identification ofWickerhamomyces anomalus and related taxa using β-tubulin (β-tub) DNA barcode marker. Yeast 29:531–535

Hughes KW, Petersen RH, Lickey EB (2009) Using heterozygosity toestimate a percentage DNA sequence similarity for environmentalspecies' delimitation across Basidiomycete fungi. New Phytol 182:795–798

Hughey JR, Silva PC, Hommersand MH (2001) Solving taxonomic andnomenclatural problems in Pacific Gigartinaceae (Rhodophyta)using DNA from type material. J Phycol 37:1091–1109

Hulo C, de Castro E, Masson P, Bougueleret L, Bairoch A, Xenarios I, LeMercier P (2011) ViralZone: a knowledge resource to understandvirus diversity. Nucleic Acids Res 39(Database issue):D576–582

Janda JM, Abbott SL (2007) 16S rRNA gene sequencing for bacterialidentification in the diagnostic laboratory: pluses, perils, and pitfalls.J Clin Microbiol 45:2761–2764

Jarman SN, Elliott NG (2000) DNA evidence for morphological andcryptic Cenozoic speciations in the Anaspididae, ‘living fossils’from the Triassic. J Evol Biol 13:624–633

Jones YL, Peters SM,Weland C, Ivanova NV, Yancy HF (2013) Potentialuse of DNA barcodes in regulatory science: identification of theU.S. Food and Drug Administration's "Dirty 22," contributors to thespread of foodborne pathogens. J Food Prot 76:144–149

Karnati SK, Yu Z, Sylvester JT, Dehority BA, Morrison M, Firkins JL(2003) Technical note: specific PCR amplification of protozoal 18SrDNA sequences from DNA extracted from ruminal samples ofcows. J Anim Sci 81:812–815

Kemler M, Göker M, Oberwinkler F, Begerow D (2006) Implications ofmolecular characters for the phylogeny of the Microbotryaceae(Basidiomycota: Urediniomycetes). BMC Evol Biol 6:35

Kemler M, Lutz M, Göker M, Oberwinkler F, Begerow D (2009) Hiddendiversity in the non-caryophyllaceous plant-parasitic members ofMicrobotryum (Pucciniomycotina: Microbotryales). Syst Biodivers7:297–306

Kher CP, Doerder FP, Cooper J, Ikonomi P, Achilles-Day U, Küpper FC,Lynn DH (2011) Barcoding Tetrahymena: discriminating speciesand identifying unknowns using the cytochrome c oxidase subunitI (cox-1) barcode. Protist 162:2–13

Kipling WW, Rubinoff D (2004) Myth of the molecule: DNA barcodesfor species cannot replace morphology for identification and classi-fication. Cladistics 20:47–55

Knowlton N (1993) Sibling species in the sea. Annu Rev Ecol Syst 24:189–216

Kosakyan A, Heger TJ, Leander BS, Todorov M, Mitchell EA, Lara E(2012) COI barcoding of Nebelid testate Amoebae (Amoebozoa:Arcellinida): extensive cryptic diversity and redefinition of theHyalospheniidae Schultze. Protist 163:415–434

KressWJ, EricksonDL (2007) A two-locus global DNA barcode for landplants: the coding rbcL gene complements the non-coding trnH-psbA apacer region. PLoS One 2:e508

Krüger M, Krüger C, Walker C, Stockinger H, Schüssler A (2012)Phylogenetic reference data for systematics and phylotaxonomy ofarbuscular mycorrhizal fungi from phylum to species level. NewPhytol 193:970–984

Kurtzman CP, Robnett CJ (1998) Identification and phylogeny ofAscomycetous yeasts from analysis of nuclear large subunit (26S)

ribosomal DNA partial sequences. Antonie Van Leeuwenhoek 73:331–371

Lahaye R, van der Bank M, Bogarin D, Warner J, Pupulin F, Gigot G,Maurin O, Duthoit S, Barraclough TG, Savolainen V (2008) DNAbarcoding the floras of biodiversity hotspots. ProcNatl Acad Sci U SA 105:2923–2928

LaJeunesse TC (2001) Investigating the biodiversity, ecology, and phy-logeny of endosymbiotic dinoflagellates in the genus Symbiodiniumusing the ITS region: in search of a “species” level marker. J Phycol37:866–880

Lawton JH, Bignell DE, Bolton B, Bloemers GF, Eggleton P, HammondPM, Hodda M, Holt RD, Larsen TB, Mawdsley NA, Stork NE,Srivastava DS, Watt AD (1988) Biodiversity inventories, indicatortaxa and effects of habitat modification in tropical forest. Nature391:72–76

Ley RE, Turnbaugh PJ, Klein S, Gordon JI (2006) Microbial ecology:human gut microbes associated with obesity. Nature 444:1022–1023

Li Y, Jiao L, Yao YJ (2013) Non-concerted ITS evolution in fungi, asrevealed from the important medicinal fungus Ophiocordycepssinensis. Mol Phylogenet Evol 68:373–379

Links MG, Dumonceaux TJ, Hemmingsen SM, Hill JE (2012) Thechaperonin-60 universal target is a barcode for bacteria that enablesde novo assembly ofmetagenomic sequence data. PLoSOne 7:e49755

Locke SA, Daniel McLaughlin J, Marcogliese DJ (2010) DNA barcodesshow cryptic diversity and a potential physiological basis for hostspecificity among Diplostomoidea (Platyhelminthes: Digenea) par-asitizing freshwater fishes in the St. Lawrence River, Canada. MolEcol 19:2813–2827

Lorenz JG, Jackson WE, Beck JC, Hanner R (2005) The problems andpromise of DNA barcodes for species diagnosis of primate bioma-terials. Philos Trans R Soc Lond B Biol Sci 360:1869–1877

Losos JB, Arnold SJ, Bejerano G, Brodie ED 3rd, Hibbett D, HoekstraHE,Mindell DP,Monteiro A,Moritz C, Orr HA, PetrovDA, RennerSS, Ricklefs RE, Soltis PS, Turner TL (2013) Evolutionary biologyfor the 21st century. PLoS Biol 11:e1001466

Lozupone CA, Knight R (2007) Global patterns in bacterial diversity.Proc Natl Acad Sci U S A 104:11436–11440

Maan S, Maan NS, Nomikou K, Veronesi E, Bachanek-Bankowska K,Belaganahalli MN, Attoui H, Mertens PP (2011) Complete genomecharacterisation of a novel 26th bluetongue virus serotype fromKuwait. PLoS One 6:e26147

May RM (1988) How many species are there on earth? Science241(4872):1441–1449

Merker JD, O'Grady N, Gojenola L, Dao M, Lenta R, Yeakley JM,Schrijver I (2013) Feasibility of using microbeads with holographicbarcodes to track DNA specimens in the clinical molecular labora-tory. Peer J 1:e91

Metting FB (1996) Biodiversity and application of microalgae. J IndMicrobiol 17:477–489

Meyer MT, Genkov T, Skepper JN, Jouhet J, Mitchell MC, Spreitzer RJ,Griffiths H (2012) Rubisco small-subunitα-helices control pyrenoidformation in Chlamydomonas. Proc Natl Acad Sci U S A 109:19474–19479

Michel H, Behr J, Harrenga A, Kannt A (1998) Cytochrome c oxidase:structure and spectroscopy. Annu Rev Biophys Biomol Struct 27:329–356

Nassonova E, Smirnov A, Fahrni J, Pawlowski J (2010) Barcoding amoe-bae: comparison of SSU, ITS and COI genes as tools for molecularidentification of naked lobose amoebae. Protist 161:102–115

O'Donnell K, Sutton DA, Fothergill A, McCarthy D, Rinaldi MG, BrandtME, Zhang N, Geiser DM (2008) Molecular phylogenetic diversity,multilocus haplotype nomenclature, and in vitro antifungal resis-tance within the Fusarium solani species complex. J Clin MicrobiolK46:2477–2490

Pace NR (1997) A molecular view of microbial diversity and the bio-sphere. Science 276:734–740

Appl Microbiol Biotechnol

Packer L, Gibbs J, Sheffield C, Hanner R (2009) DNA barcoding and themediocrity of morphology. Mol Ecol Resour 9:42–50

Patel JB (2001) 16S rRNA gene sequencing for bacterial pathogenidentification in the clinical laboratory. Mol Diagn 6:313–32

Paulus B, Nilsson RH, Hallenberg N (2007) Phylogenetic studies inHypochnicium (Basidiomycota), with special emphasis on speciesfrom New Zealand. N Z J Bot 45:139–150

Pedersen ML (2010) DNA barcode profiling: a new platform for theinvestigation of genome integrity. Genome Biol 11:P3

Pino-Bodas R, Martín MP, Burgaz AR, Lumbsch HT (2013) Speciesdelimitation in Cladonia (Ascomycota): a challenge to the DNAbarcoding philosophy. Mol Ecol Resour 13:1058–1068

Poll C, Brune T, BegerowD, Kandeler E (2009) Small-scale diversity andsuccession of fungi in the detritusphere of rye residues. Microb Ecol59:130–140

Pons J, Barraclough TG, Gomez-Zurita J, Cardoso A, Duran DP, HazellS, Kamoun S, Sumlin WD, Vogler AP (2006) Sequence-basedspecies delimitation for the DNA taxonomy of undescribed insects.Syst Biol 55:595–609

Quaedvlieg W, Groenewald JZ, de Jesús Yáñez-Morales M, Crous PW(2012) DNA barcoding of Mycosphaerella species of quarantineimportance to Europe. Persoonia 29:101–115

Ranasinghe PD, Satoh H, OshikiM, Oshima K, SudaW, Hattori M,MinoT (2012) Revealing microbial community structures in large- andsmall-scale activated sludge systems by barcoded pyrosequencingof 16S rRNA gene. Water Sci Technol 66:2155–2161

Rappe MS, Giovannoni SJ (2003) The uncultured microbial majority.Annu Rev Microbiol 57:369–394

Richter M, Rossello-Mora R (2009) Shifting the genomic gold standardfor the prokaryotic species definition. Proc Natl Acad Sci U SA 106:19126–19131

Rickerts V, Just-Nubling G, Konrad F, Kern J, Lambrecht E, Bohme A,Jacobi V, Bialek R (2006) Diagnosis of invasive aspergillosis andmucormycosis in immunocompromised patients by seminested PCRassay of tissue samples. Eur J Clin Microbiol Infect Dis 25:8–13

Robba L, Russell SJ, Barker GL, Brodie J (2006) Assessing the use of themitochondrial COX1 marker for use in DNA barcoding of red algae(Rhodophyta). Am J Bot 93:1101–1108

Ross PJ, Donaldson SL, Saunders GW (2003) A molecular investigationof Mazzaella (Gigartinales Rhodophyta) morphologically interme-diate between Mazzaella linearis and M. splendens. Bot Mar 46:202–213

Rubinoff D (2006) Utility of mitochondrial DNA barcodes in speciesconservation. Conserv Biol 20:1026–1033

Rubinoff D, Cameron S, Will K (2006) A genomic perspective on theshortcomings of mitochondrial DNA for “barcoding” identification.J Hered 97:581–594

Santamaria M, Vicario S, Pappadà G, Scioscia G, Scazzocchio C, SacconeC (2009) Towards barcode markers in fungi: an intron map ofAscomycota mitochondria. BMC Bioinforma 10(Suppl 6):S15

Saunders GW (2005) Applying DNA barcoding to red macroalgae: apreliminary appraisal holds promise for future applications. PhilosTrans R Soc Lond B Biol Sci 360:1879–1888

Saunders GW, McDevit DC (2012) Methods for DNA barcoding photo-synthetic protists emphasizing the macroalgae and diatoms.Methods Mol Biol 858:207–222

Schloss PD, Handelsman J (2004) Status of the microbial census.Microbiol Mol Biol Rev 68:686–691

Schmidt TM, Relman DA (1994) Phylogenetic identification of uncul-tured pathogens using ribosomal RNA sequences. MethodsEnzymol 235:205–222

Schoch CL, Seifert KA, Huhndorf S, Robert V, Spouge JL, Levesque CA,Chen W, Fungal Barcoding Consortium; Fungal BarcodingConsortium Author List (2012) Nuclear ribosomal internal tran-scribed spacer (ITS) region as a universal DNA barcode markerfor Fungi. Proc Natl Acad Sci U S A 109:6241–6246

Scorzetti G, Fell JW, Fonseca A, Statzell-Tallman A (2002) Systematicsof basidiomycetous yeasts: a comparison of large subunit D1/D2and internal transcribed spacer rDNA regions. FEMS Yeast Res 2:495–517

Seberg O, Petersen G (2009) Howmany loci does it take to DNA barcodea crocus? PLoS One 4:e4598

Segata N, Boernigen D, Tickle TL, Morgan XC, Garrett WS,Huttenhower C (2013) Computational metaomics for microbialcommunity studies. Mol Syst Biol 14:666

Seifert KA (2009) Progress towards DNA barcoding of fungi. Mol EcolResour 9:83–89

Seifert KA, Samson RA, Dewaard JR, Houbraken J, Levesque CA,Moncalvo JM, Louis-Seize G, Hebert PDN (2007) Prospectsfor fungus identification using COI DNA barcodes, withPenicillium as a test case. Proc Natl Acad Sci U S A 104:3901–3906

Sherwood AR, Sauvage T, Kurihara A, Conklin KY, Presting GG (2010)A comparative analysis of COI, LSU and UPA marker data for theHawaiian florideophyte Rhodophyta: implications for DNAbarcoding of red algae. Cryptogam Algol 31:451–465

Smirnov AV (2002) Re-description of Vannella mira (Gymnamoebia,Vannellidae)—an often mentioned but poorly known amoeba spe-cies. Protistology 2:178–185

Smirnov AV, Goodkov AV (1999) An illustrated list of the basicmorphotypes of Gymnamoebia (Rhizopoda, Lobosea).Protistology 1:20–29

Smith MA, Bertrand C, Crosby K, Eveleigh ES, Fernandez-Triana J,Fisher BL, Gibbs J, Hajibabaei M, Hallwachs W, Hind K, Hrcek J,HuangDW, JandaM, Janzen DH, Li Y,Miller SE, Packer L, QuickeD, Ratnasingham S, Rodriguez J, Rougerie R, Shaw MR, SheffieldC, Stahlhut JK, Steinke D, Whitfield J, Wood M, Zhou X (2012)Wolbachia and DNA barcoding insects: patterns, potential, andproblems. PLoS One 7:e36514

Snedden R (2007) The diversity of life: from single cells to multicellularorganisms. Heinemann Library. com

Stackebrandt E, Ebers J (2006) Taxonomic parameters revisited:tarnished gold standards. Microbiol Today 33:152–155

Stackebrandt E, Frederiksen W, Garrity GM, Grimont PA, Kämpfer P,Maiden MC, Nesme X, Rosselló-Mora R, Swings J, Trüper HG,Vauterin L, Ward AC, Whitman WB (2002) Report of the ad hoccommittee for the re-evaluation of the species definition in bacteri-ology. Int J Syst Evol Microbiol 52:1043–1047

Stein LD (2008) Bioinformatics: alive and kicking. Genome Biol 9:114Stephen JR, McCaig AE, Smith Z, Prosser JI, Embley TM (1996)

Molecular diversity of soil and marine 16S rRNA gene sequencesrelated to beta-subgroup ammonia-oxidizing bacteria. Appl EnvironMicrobiol 62:4147–4154

Stockinger H, Krüger M, Schüßler A (2010) DNA barcoding ofarbuscular mycorrhizal fungi. New Phytol 187:461–474

Stone M (2009) NIH builds substantial human microbiome project.Microbe 4:451–456

Suttle CA (2007)Marine viruses—major players in the global ecosystem.Nat Rev Microbiol 5:801–812

Tai V, Lindstrom SC, Saunders GW (2001) Phylogeny of theDumontiaceae (Gigartinales Rhodophyta) and associated familiesbased on SSU rDNA and internal transcribed spacer sequence data. JPhycol 37:184–196

Tan J, Lim PE, Phang SM, Hong DD, Sunarpi H, Hurtado AQ (2012)Assessment of four molecular markers as potential DNA barcodesfor red algae Kappaphycus Doty and Eucheuma J. Agardh(Solieriaceae, Rhodophyta). PLoS One 7:e52905

Tautz D, Arctander P, Minelli A, Thomas RH, Vogler AP (2003) A pleafor DNA taxonomy. Trends Ecol Evol 18:70–74

Turnbaugh PJ, Ley RE, Hamady M, Fraser-Liggett CM, Knight R,Gordon JI (2007) The human microbiome project. Nature449:804–810

Appl Microbiol Biotechnol

Valentini A, Pompanon F, Taberlet P (2008) DNA barcoding for ecolo-gists. Trends Ecol Evol 24:110–117

Wei C, Wang G, Chen X, Huang H, Liu B, Xu Y, Li F (2011)Identification and typing of human enterovirus: a genomic barcodeapproach. PLoS One 6:e26296

Wilson EO (1999) The diversity of life. WW Norton and Company.Witt JD, Threloff DK, Hebert PD (2006) DNA barcoding reveals extraor-

dinary cryptic diversity in an Amphipod genus: implications fordesert spring conservation. Mol Ecol 15:3073–3082

Woese CR (1987) Bacterial evolution. Microbiol Rev 51:221–271Woese CR (1996) Whither microbiology? Phylogenetic trees. Curr Biol

6:1060–1063Wong EHK, Hanner RH (2008) DNA barcoding detects market

substitution in North American seafood. Food Res Int 41:828–837

Woo PC, Lau SK, Teng JL, Tse H, Yuen KY (2008) Then and now: use of16S rDNA gene sequencing for bacterial identification and

discovery of novel bacteria in clinical microbiology laboratories.Clin Microbiol Infect 14:908–934

Xiang L, Song J, Xin T, Zhu Y, Shi L, Xu X, Pang X, Yao H, Li W, ChenS (2013) DNA barcoding the commercial Chinese caterpillar fun-gus. FEMS Microbiol Lett. doi:10.1111/1574-6968.12233

Yin HQ, Jia MX, Shi LJ, Yang S, Zhang LY, Zhang QM,Wang SQ, Li G,Zhang JG (2011) Nanoparticle-based bio-barcode assay for thedetection of bluetongue virus. J Virol Methods 178:225–228

Yule GU (1924) A mathematical theory of evolution based on theconclusions of Dr. J. C. Willis, FRS. Phil Trans R Soc B 213:21–87

Zhao S, He P (2011) Molecular identification based on ITS sequences forKappaphycus and Eucheuma cultivated in China. Chinese JOceanol Limnol 29:1287–1296

Zuccarello GC, Critchley AT, Smith J, Sieber V, Lhonneur GB, West JA(2006) Systematics and genetic variation in commercialKappaphycus and Eucheuma (Solieriaceae, Rhodophyta). J ApplPhycol 18:643–651

Appl Microbiol Biotechnol