a murrel cysteine protease, cathepsin l: bioinformatics characterization, gene expression and...

12
Biologia 69/3: 395—406, 2014 Section Cellular and Molecular Biology DOI: 10.2478/s11756-013-0326-8 A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity Venkatesh Kumaresan 1 , Prasanth Bhatt 1 , Rajesh Palanisamy 1 , Annie J. Gnanam 2 , Mukesh Pasupuleti 3 & Jesu Arockiaraj 1 * 1 Division of Fisheries Biotechnology & Molecular Biology, Department of Biotechnology, Faculty of Science and Humanities, SRM University, Kattankulathur – 603 203, Chennai, Tamil Nadu, India; e-mail: [email protected] 2 Institute for Cellular and Molecular Biology, The University of Texas at Austin, 1 University Station A4800, Austin, TX 78712, USA 3 Lab PCN 206, CSIR-Central Drug Research Institute, B.S. 10/1, Sector 10, Jankipuram Extension, Sitapur Road, Lucknow 226 031, Uttar Pradesh, India Abstract: Cathepsin L, a lysosomal endopeptidase, is a member of the peptidase C1 family (papain-like family) of cysteine proteinases that cleave peptide bonds of lysosomal proteins. In this study, we report a cathepsin L sequence identified from the constructed cDNA library of striped murrel Channa striatus (designated as CsCath L) using genome sequencing FLX TM technology. The full-length CsCath L contains three eukaryotic thiol protease domains at positions 134-145, 278-288 and 299-318. Phylogenetic analysis revealed that the CsCath L was clustered together with other cathepsin L from teleosts. The three-dimensional structure of CsCath L modelled by the I-Tasser program was compared with structures deposited in the Protein Data Bank to find out the structural similarity of CsCath L with experimentally identified structures. The results showed that the CsCath L exhibits maximum structural identity with pro-cathepsin L from human. The RNA fold structure of CsCath L was predicted along with its minimum free energy (–471.93 kcal/mol). The highest CsCath L gene expression was observed in liver, which was also significantly higher (P< 0.05) than that detected in other tissues taken for analysis. In order to investigate the mRNA transcription profile of CsCath L during infection, C. striatus were injected with fungus (Aphanomyces invadans) and bacteria (Aeromonas hydrophila) and its expression was up-regulated in liver at various time points. Similar to gene expression studies, the highest CsCath L enzyme activity was also observed in liver and its activity was up-regulated by fungal and bacterial infections. Key words: Channa striatus; cathepsin L; bioinformatics analysis; epizootic ulcerative syndrome; gene expression; enzyme activity. Abbreviations: CsCath L, Channa striatus cathepsin L; EUS, epizootic ulcerative syndrome; MFE, minimum free energy; ORF, open reading frame; PBS, phosphate buffer saline; PDB, Protein Data Bank; qRT-PCR, quantitative real time polymerase chain reaction; UTR, untranslated region. Introduction Cathepsin is defined as a ‘lysosomal proteolytic en- zyme’. Lysosome is an organelle with a cystic struc- ture, containing hydrolytic enzymes including phos- phatase, ribonuclease, deoxyribonuclease, cathepsin, B- glucuronidase and acetyl-transferase (Li et al. 2011). Cathepsins are enzymes that have been cleaving pep- tide bonds of lysosomal proteins probably since lyso- somes appeared in early eukaryotes. When the adaptive system emerged in gnathostomes, cathepsins evolved to produce peptides in the major histocompatibility com- plex class II molecules (Uinuk-Ool et al. 2003). During the past few years, many of the cathepsins have been ac- credited with more specific functions in human, includ- ing bone re-modelling, antigen presentation, epidermal homeostasis, pro-hormone processing, maintenance of the central nervous system, angiogenesis, cell death and cancer cell invasion (Reinheckel et al. 2001; Turk et al. 2001; Balaji et al. 2002; Felbor et al. 2002). Cathepsins are classified into three groups based on the active-site residues. They are cysteine proteases including cathepsins B, C, H, F, K, L, S, W, and X/Z; serine proteases including cathepsins A and G, and as- paragine proteases including cathepsins D and E (Rawl- ings et al. 2006). Cathepsin L is synthesized as an inactive pro-enzyme with an N-terminal pro-peptide that is removed upon activation. The pro-peptide re- gion not only acts as an inhibitor of the enzyme ac- tivity, but it is also required for the proper folding of the enzyme synthesis and transport of the pro-enzyme to lysosomes (Matsumoto et al. 1995). In cattle, the * Corresponding author c 2013 Institute of Molecular Biology, Slovak Academy of Sciences

Upload: jesu

Post on 23-Dec-2016

216 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

Biologia 69/3: 395—406, 2014Section Cellular and Molecular BiologyDOI: 10.2478/s11756-013-0326-8

A murrel cysteine protease, cathepsin L: bioinformaticscharacterization, gene expression and proteolytic activity

Venkatesh Kumaresan1, Prasanth Bhatt1, Rajesh Palanisamy1, Annie J. Gnanam2,Mukesh Pasupuleti3 & Jesu Arockiaraj1*1Division of Fisheries Biotechnology & Molecular Biology, Department of Biotechnology, Faculty of Science and Humanities,SRM University, Kattankulathur – 603 203, Chennai, Tamil Nadu, India; e-mail: [email protected] for Cellular and Molecular Biology, The University of Texas at Austin, 1 University Station A4800, Austin, TX78712, USA3Lab PCN 206, CSIR-Central Drug Research Institute, B.S. 10/1, Sector 10, Jankipuram Extension, Sitapur Road, Lucknow– 226 031, Uttar Pradesh, India

Abstract: Cathepsin L, a lysosomal endopeptidase, is a member of the peptidase C1 family (papain-like family) of cysteineproteinases that cleave peptide bonds of lysosomal proteins. In this study, we report a cathepsin L sequence identified fromthe constructed cDNA library of striped murrel Channa striatus (designated as CsCath L) using genome sequencing FLXTM

technology. The full-length CsCath L contains three eukaryotic thiol protease domains at positions 134-145, 278-288 and299-318. Phylogenetic analysis revealed that the CsCath L was clustered together with other cathepsin L from teleosts.The three-dimensional structure of CsCath L modelled by the I-Tasser program was compared with structures depositedin the Protein Data Bank to find out the structural similarity of CsCath L with experimentally identified structures. Theresults showed that the CsCath L exhibits maximum structural identity with pro-cathepsin L from human. The RNA foldstructure of CsCath L was predicted along with its minimum free energy (–471.93 kcal/mol). The highest CsCath L geneexpression was observed in liver, which was also significantly higher (P < 0.05) than that detected in other tissues takenfor analysis. In order to investigate the mRNA transcription profile of CsCath L during infection, C. striatus were injectedwith fungus (Aphanomyces invadans) and bacteria (Aeromonas hydrophila) and its expression was up-regulated in liver atvarious time points. Similar to gene expression studies, the highest CsCath L enzyme activity was also observed in liverand its activity was up-regulated by fungal and bacterial infections.

Key words: Channa striatus; cathepsin L; bioinformatics analysis; epizootic ulcerative syndrome; gene expression; enzymeactivity.

Abbreviations: CsCath L, Channa striatus cathepsin L; EUS, epizootic ulcerative syndrome; MFE, minimum free energy;ORF, open reading frame; PBS, phosphate buffer saline; PDB, Protein Data Bank; qRT-PCR, quantitative real timepolymerase chain reaction; UTR, untranslated region.

Introduction

Cathepsin is defined as a ‘lysosomal proteolytic en-zyme’. Lysosome is an organelle with a cystic struc-ture, containing hydrolytic enzymes including phos-phatase, ribonuclease, deoxyribonuclease, cathepsin, B-glucuronidase and acetyl-transferase (Li et al. 2011).Cathepsins are enzymes that have been cleaving pep-tide bonds of lysosomal proteins probably since lyso-somes appeared in early eukaryotes. When the adaptivesystem emerged in gnathostomes, cathepsins evolved toproduce peptides in the major histocompatibility com-plex class II molecules (Uinuk-Ool et al. 2003). Duringthe past few years, many of the cathepsins have been ac-credited with more specific functions in human, includ-ing bone re-modelling, antigen presentation, epidermal

homeostasis, pro-hormone processing, maintenance ofthe central nervous system, angiogenesis, cell death andcancer cell invasion (Reinheckel et al. 2001; Turk et al.2001; Balaji et al. 2002; Felbor et al. 2002).Cathepsins are classified into three groups based

on the active-site residues. They are cysteine proteasesincluding cathepsins B, C, H, F, K, L, S, W, and X/Z;serine proteases including cathepsins A and G, and as-paragine proteases including cathepsins D and E (Rawl-ings et al. 2006). Cathepsin L is synthesized as aninactive pro-enzyme with an N-terminal pro-peptidethat is removed upon activation. The pro-peptide re-gion not only acts as an inhibitor of the enzyme ac-tivity, but it is also required for the proper folding ofthe enzyme synthesis and transport of the pro-enzymeto lysosomes (Matsumoto et al. 1995). In cattle, the

* Corresponding author

c©2013 Institute of Molecular Biology, Slovak Academy of Sciences

Page 2: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

396 V. Kumaresan et al.

purified cathepsin L protease is used as a vaccinationand it induces high levels of immunologic protectionagainst liver fluke infections (Conus & Simon 2010). Inmammals, cathepsin L functions include degradationof proteins in the lysosome and they are also known tobe involved in antigen processing and toll-like receptorsignalling (Coulombe et al. 1996). Once lysosome pro-duces secretory enzymes, cathepsin L degrades extra-cellular matrix and thus participates in tumour growthand metastasis (Conus & Simon 2010).In fishes, such as zebrafish and rainbow trout,

cathepsin L is known to be involved in oogenesis andembryogenesis (Li et al. 2010, 2011). It has also beenreported that cathepsin L is highly up-regulated inhaemopoietic tissues including liver, kidney and bloodduring bacterial and viral infection in fishes (Aranishiet al. 1997; Yeh & Klesius 2009; Ahn et al. 2010; Chenet al. 2011), which clearly indicates the immunologicalrole of cathepsin L in fishes. Cathepsin L in fish shows astrong proteolytic activity for several proteins includingmyofibrillar proteins in muscles of Oncorhynchus keta(Yamashita & Konagaya 1990), Scomber japonicas (Leeet al. 1993), Engraulis japonica and Atheresthes stomias(Visessanguan et al. 2003) suggesting its participationin intracellular and extracellular protein catabolism infish (Aranishi et al. 1997). Cathepsin L is present infish mucus and it is reported to produce antimicrobialpeptides during infection (Lee et al. 1993).Cathepsin L and its active proteases have been

identified and purified from many teleosts (Aranishi etal. 1997; Heu et al. 1997; Tingaud-Sequeira & Cerda2007; Ahn et al. 2010). Yet, there is no informationavailable on cathepsin L from snakehead or stripedmurrel Channa striatus. Hence, to gain insight intothe characterization of cathepsin L and its role in C.striatus, a full-length cathepsin L cDNA (designated asCsCath L) was identified from the C. striatus cDNA li-brary constructed by Genome Sequencing FLXTM tech-nology. The tissue distribution and mRNA transcrip-tion of CsCath L has been studied using Aphanomycesinvadans and Aeromonas hydrophila infection. The spe-cific enzyme activity of CsCath L in different tissuesand the variation of specific activity in blood of C.striatus after fungal and bacterial infection are also re-ported.

Material and methods

cDNA library construction and CsCath L identificationA full-length cathepsin L was identified from the con-structed C. striatus cDNA library by the genome sequenceFLXTM technology. Briefly, total RNA was isolated usingTri ReagentTM (Life Technologies) from the tissue pool in-cluding spleen, liver, kidney, muscle and gills of healthy C.striatus. Then, mRNA was purified using an mRNA isola-tion kit (Miltenyi Biotech). The first strand cDNA synthe-sis and normalization were carried out with CloneMinerTM

cDNA library construction kit (Invitrogen) and Trimmer Di-rect Kit: cDNA Normalization Kit (BioCat GmbH). There-after, the GS-FLXTM sequencing of C. striatus cDNAwas performed according to the manufacturer’s protocol

(Roche). The raw data were processed with the Rochequality control pipeline using the default settings. Seq-clean (http://compbio.dfci.harvard.edu/tgi/software/) soft-ware was used to screen for and remove normalization adap-tor sequences, homopolymers and reads shorter than 40bp prior to assembly. Further, the sequences were sub-jected to assembly using MIRA (ver. 3.2.0) technology(Chevreux et al. 2004) into full-length cDNAs. From theestablished cDNA library of C. striatus sequence database,we identified a cathepsin L gene, which we designated asCsCathL through BLAST annotation program on NCBI(http://www.blast2go.com/b2ghome).

Bioinformatics characterization of CsCath LThe full-length CsCath L sequence was compared with othersequences available in NCBI database (http://blast.ncbi.nlm.nih.gov/Blast) and the similarities were analyzed. TheORF and amino acid sequence of CsCath L was ob-tained by using DNAssist (ver. 2.2.). Characteristic do-mains and motifs were identified using the PROSITEprofile database (http://prosite.expasy.org/scanprosite/).Percentage identity and similarity analysis of CsCath Lwith other homologous sequences were carried out us-ing matrix global alignment tool (MATGAT). To ob-tain this identity and similarity we used scoring ma-trix of BLOSUM50 with first gap of 16 and extendinggap of 4. The N-terminal transmembrane sequence wasdetermined by DAS transmembrane prediction program(http://www.sbc.su.se/∼miklos/DAS). Signal peptide anal-ysis was done using the SignalP http://www.cbs.dtu.dk/).Multiple sequence alignment was carried out on ClustalW(ver. 2) (http://www.ebi.ac.uk/Tools/msa/clustalw2/) pro-gram to find out the evolutionarily conserved residuesamong the different organisms. The sequences were alignedusing BLOSUM50 method with a gap extension value 0.5and gap open value and gap distance value of 5. The alignedsequences were analyzed on Bioedit (ver. 7.1.3.0). In graphicview, the threshold limit was set to 100% to obtain theexact matches in the aligned sequences (Hall 1999). Theevolutionary history of CsCath L was inferred using theneighbour-joining method. The evolutionary distances werecomputed using the Poisson correction method (Uinuk-Oolet al. 2003). The phylogenetic analysis involved 30 aminoacid sequences including CsCath L. The phylogenetic treewas conducted in MEGA 5 (Tamura et al. 2011). The re-liability of the branching was tested using the bootstrapre-sampling (1,000 pseudo-replicates).

The secondary structure of the CsCath L protein wasanalyzed using SOPMA program and was constructed onPolyview method (http://polyview.cchmc.org/). The ter-tiary structure of the CsCath L deduced amino acid se-quence was predicted by I-Tasser program (http://zhanglab.ccmb.med.umich.edu/I-TASSER). I-Tasser generates full-length model of proteins by excising continuous fragmentsfrom threading alignments and then reassembling themusing replica-exchanged Monte Carlo simulations (Zhang2008; Roy et al. 2010). The structural image was gener-ated using PyMol Molecular Graphics System (ver. 1.5;http://www.pymol.org/). Domain regions in the sequencewere identified by Pfam and SMART databases, along withmotif prediction by PRINTS and motif search databases.The Protein Data Bank (PDB) structure obtained fromI-Tasser program was used to predict the position of theactive sites. Further, CsCath L protein with and with-out inhibitor region was compared in surface view us-ing the PyMol program. Moreover, the cDNA sequence of

Page 3: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

Murrel cysteine protease cathepsin L 397

CsCath L was converted into the corresponding RNA se-quence using DNAssist (ver. 2.2.) to predict the RNA struc-ture of CsCath L. The converted RNA sequence was sub-mitted to RNA fold server (http://rna.tbi.univie.ac.at/cgi-bin/RNAfold.cgi) and the structure of CsCath L RNA waspredicted with minimum free energy (MFE).

Collection and maintenance of fishHealthy C. striatus (average body weight of 40 g) were ob-tained from the Surya Agro Farms Ltd., Erode, Tamil Nadu,India. Fishes were maintained in flat-bottomed plastic tanks(100 L) with aerated and filtered freshwater at 29 ± 2◦Cin the wet lab of Division of Fisheries Biotechnology andMolecular Biology, SRM University. All fishes were accli-matized for a week before being challenged to A. invadansand A. hydrophila. A maximum of 10 fishes per tank weremaintained during the experiments.

Pathogen injection and tissue collectionFor fungus induced mRNA expression analysis, the fisheswere injected with A. invadans (102 spores). A. invandanswere isolated from an epizootic ulcerative syndrome (EUS)infected C. striatusmuscle sample. The infected muscle sam-ple was taken from the EUS infected fish and were placed ina petri dish of algal boost GP medium with 100 units/mLpenicillin G and 100 µg/mL streptomycin. The nutrientmedium was incubated at 25◦C for 12 h and examined un-der a binocular microscope (CoslabTM). The fungal specieswere identified according to the description of Caster &Cole (1990) using potato dextrose agar and Czapek Doxagar (Himedia, Mumbai). For bacterial challenge, the fisheswere injected intraperitoneally with A. hydrophila (5×106CFU/mL) suspended in 1X phosphate buffer saline (PBS;100 µL/fish). A. hydrophila was also isolated and identifiedfrom the muscle sample of EUS infected C. striatus as de-scribed by Dhanaraj et al. (2008). Samples were collectedbefore (0 h), and after injection (3, 6, 12, 24 and 48 h) andwere immediately snap-frozen in liquid nitrogen and storedat –80◦C until total RNA was isolated. Using a sterilizedsyringe, the blood (0.5-1.0 mL per fish) was collected fromthe fish caudal fin and immediately centrifuged at 4000×gfor 10 min at 4◦C to allow blood cell collection for totalRNA extraction. PBS (1X) were prepared and served ascontrol (100 µL/fish). All samples were analyzed in threeduplications and the best representative data was expressedas described by Livak & Schmittgenm (2001).

RNA isolation and cDNA conversionTotal RNA from the control and infected fish was isolatedusing Tri ReagentTM (Life Technologies), according to themanufacture’s protocol with slight modifications (Arock-iaraj et al. 2011, 2012). Using 2.5 µg of RNA, first strandcDNA synthesis was carried out using a SuperScript�VILOTM cDNA Synthesis Kit (Life technologies) (Arock-iaraj et al. 2013). The resulting cDNA solution was storedat –20◦C for further analysis.

Gene expression analysisThe relative expression of CsCath L in blood, gills, liver,heart, spleen, intestine, head kidney, kidney, skin, muscleand brain were measured by quantitative real time poly-merase chain reaction (qRT-PCR). qRT-PCR was carriedout using a ABI 7500 Real-time Detection System (AppliedBiosystems) in 20 µL reaction volume containing 4 µL ofcDNA from each tissue, 10 µL of Fast SYBR� Green Mas-ter Mix, 0.5 µL of each primer (20 pmol/µL) and 5 µLdH2O. The qRT-PCR cycle profile was 1 cycle of 95◦C for

10 s, followed by 35 cycles of 95◦C for 5 s, 58◦C for 10 sand 72 ◦C for 20 s and finally 1 cycle of 95◦C for 15 s,60◦C for 30 s and 95◦C for 15 s. The same qRT-PCR cy-cle profile was used for the internal control gene, β-actin.β-Actin of C. striatus primers was designed from the se-quence of GenBank Accession No. EU570219. The primerdetails of gene specific primer (CsCath L) and internal con-trol (β-actin) are as follows: CsCath L F1: GTG GGA GAAGAA CCT GAA GAA G and CsCath L R2: CAT GTCTCC GAA GTG GTT CAT; β-actin F3: TCT TCC AGCCTT CCT TCC TTG GTA and β-actin R4: GAC GTCGCA CTT CAT GAT GCT GTT. After the PCR program,data were analyzed with ABI 7500 SDS software (AppliedBiosystems). To maintain consistency, the baseline was setautomatically by the software. The comparative CT method(2−∆∆CT method) was used to analyze the expression level ofCsCath L (Livak & Schmittgen 2001). All samples were ana-lyzed in three duplications and the best representative datawas expressed here as described by Livak & Schmittgenm(2001).

CsCath L enzyme activity analysisCsCath L specific activity was measured according to themethodology of Stephens et al ( 2012) with slight modifica-tions. The activity was determined by using the cathepsin Lfluorogenic substrate (Ac-HRYR-ACC) (Merck) in proteinextracts obtained from various organs of C. striatus includ-ing liver, spleen, heart, kidney, head kidney, blood, skin, gill,brain, muscle and intestine. The assays were conducted at30◦C in 96-well plates as follows: 20 µL of protein (concen-tration = 500 µg) extract from fish organs were mixed with abuffer solution (100 mM sodium acetate, 1.5 mM ethylenedi-aminetetraacetic acid, 2 mM dithiothreitol and 0.05% Tri-ton X-100; pH 5.5). The reaction was set by adding thecathepsin L fluorogenic substrate to a final concentrationof 100 µM. The activity was recorded, measured at 440 nmand calculated as explained by manufacturer’s protocol. Thesimilar protocol was followed to study the enzyme activityat various time points (3, 6, 12, 24 and 48 h) in spleen tis-sue infected with fungus and bacteria. PBS (1X) served ascontrol. All the assays were performed in three duplicates.

Statistical analysisFor comparison of relative CsCath L mRNA expression andcathepsin L enzyme activity, statistical analysis was per-formed using the one-way ANOVA and mean comparisonswere performed by Tukey’s Multiple Range Test using SPSS11.5 at the 5% significant level.

Results

CsCath L cDNA analysisThe full-length CsCath L cDNA consisted of 1,334 nu-cleotides along with an ORF (1,021 nucleotides). Theuntranslated region (UTR) at the 5’ end is 70 nu-cleotides and 3’ UTR is 250 nucleotides long. The totalGC content of the ORF is 57%. The ORF was thentranslated to amino acid sequence with 337 residuesusing DNAssit program. The protein has a molecularweight of 38 kDa and the theoretical isoelectric point of5.9. CsCath L sequence was submitted to EMBL data-bank under the accession number HF571334.

Page 4: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

398 V. Kumaresan et al.

Fig. 1. Multiple sequence alignment of CsCath L. Analysis was performed by ClustalW, using representatives of cathepsin L from differ-ent previously known teleosts: Lates calcarifer (GenBank: ABV59078), Danio rerio (NP 997749), rooster Gallus gallus (NP 001161481)and human (AAI42984). The inhibitor region and active sites are clearly marked using double sided arrow mark. The down arrow incleavage site 1 represents the cleavage site of the signal peptide region and in cleavage site 2 represents the cleavage site of pro-peptideregion. The down arrow in active site shows the motif (Cys140, His280 and Asn304) of the respective active site residues. Cathepsin Lsignature ERWNIN and GCNGG are highlighted in green colour. The conserved regions are shaded in black colour.

Domain and motif analysis of CsCath LThe Prosite Scan analysis showed three eukaryotic thiolprotease domains at 134–145, 278–288 and 299–318along with their cysteine (Cys140), histidine (His280)and asparagine (Asn304) active site residues, respec-tively (Fig. 1). In addition, 20 other high probability

common motifs were observed. These 20 motifs belongto 4 different sites including phosphorylation, myris-toylation, glycosylation and amidation groups. SignalPanalysis showed a peak at 18th position of CsCath Lamino acid sequence, thus predicting it as the cleavagesite of deduced amino acid sequence. Hence, it is pre-

Page 5: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

Murrel cysteine protease cathepsin L 399

Table 1. MATGAT analysis showing the identity and similarity (%) of CsCath L amino acid sequence with other homologous sequences.a

No. Organism Accession No. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

1 CsCath L∗ HF571334 91 83 83 83 81 77 67 67 64 25 26 23 24 24 18 16 15 17 16 19 17 18 20 19 22 18 20 21 222 Lates calcarifer∗ ABV59078 96 85 85 88 83 78 67 66 66 25 26 27 25 25 17 16 15 17 16 19 17 19 19 21 21 18 18 20 213 Cyprinus carpio∗ BAD08618 92 94 92 80 82 76 66 66 65 26 26 26 26 26 16 15 15 18 16 19 17 17 20 20 20 19 20 20 194 Danio rerio∗ NP 997749 92 93 97 78 81 75 66 66 64 27 25 25 24 25 16 17 15 19 16 20 17 17 21 19 21 19 20 21 195 Dicentrarchus labrax∗ ACN93991 89 91 88 86 77 71 62 61 61 24 26 26 24 23 17 17 14 17 14 18 17 17 17 17 20 17 19 19 196 Osmerus mordax∗ ACO09031 91 93 91 91 87 77 66 65 64 26 26 25 25 24 19 14 15 16 15 18 17 20 20 20 19 20 21 20 207 Gallus gallus∗ NP 001161481 87 88 85 84 81 86 67 66 65 26 25 25 24 24 15 17 17 17 14 17 17 19 20 20 20 17 18 19 198Macaca mulatta∗ EHH24212 80 80 78 79 75 80 79 96 78 26 26 26 25 26 16 17 18 15 17 18 18 18 20 20 18 16 19 17 199 Homo sapiens∗ AAI42984 79 80 78 79 75 79 79 99 78 26 26 26 24 26 16 17 17 14 17 18 18 19 17 20 16 16 18 18 1810 Myotis davidii∗ ELK33129 80 80 79 78 74 79 78 88 88 25 25 24 24 24 18 17 14 19 18 17 18 19 18 19 18 19 17 17 1911 Oplegnathus fasciatus§ AEA48884 44 44 44 46 43 43 42 44 45 46 81 82 70 69 17 15 16 18 15 15 21 19 17 18 21 19 20 20 2112 Cyprinus carpio§ BAE44111 44 43 46 46 45 45 45 46 45 44 90 91 71 70 16 14 16 16 16 16 18 17 19 19 19 19 20 20 1913 Danio rerio§ NP 998501 42 46 45 46 44 45 45 47 46 46 90 95 70 69 18 16 15 17 16 16 18 17 18 19 19 19 20 19 1714 Homo sapiens§ AAH10240 45 45 47 45 41 45 42 45 43 46 83 84 84 75 18 15 15 16 18 17 19 17 17 20 20 18 20 19 2015 Gallus gallus§ NP 990702 44 46 45 46 42 44 41 46 46 45 81 82 81 86 17 15 14 17 15 18 18 18 19 20 19 19 21 20 1816 Homo sapiens¶ NP 001902 34 34 32 34 30 32 32 32 33 32 30 29 31 30 28 67 33 69 67 18 18 16 16 18 17 19 16 18 1717 Mus musculus¶ NP 031826 32 32 31 33 35 30 31 32 32 31 30 27 29 27 26 79 32 67 67 17 16 18 16 15 17 18 16 17 1718 Xenopus tropicalis¶ NP 001107513 31 32 31 32 30 30 32 31 31 32 31 28 30 29 25 53 53 35 32 15 14 14 14 13 17 16 13 14 1319 Myotis davidii¶ ELK32445 30 30 31 32 31 31 31 29 27 35 32 32 32 31 27 83 79 54 68 18 18 17 16 18 16 19 19 17 1720 Heterocephalus glaber¶ EHB11954 32 32 31 32 28 32 29 30 29 32 28 29 29 32 29 78 78 55 79 17 16 16 14 14 18 19 18 17 1821 Homo sapiensψ NP 004842 35 34 34 33 32 31 35 33 34 32 32 33 32 31 31 31 29 25 29 28 52 56 64 55 45 44 41 45 4522 Drosophila melanogasterψ NP 652013 33 33 33 34 33 34 34 33 32 32 37 35 37 34 32 31 28 26 30 28 69 63 54 71 45 43 40 45 4523 Todarodes pacificusψ BAD15111 33 35 34 34 33 36 34 34 33 35 35 30 34 31 33 30 30 27 29 31 70 77 56 64 46 45 41 47 4524 Danio rerioψ NP 571785 35 36 35 36 33 36 35 34 32 35 34 33 33 34 33 29 29 26 27 26 79 70 70 57 47 47 40 46 4625 Aedes aegyptiψ XP 001657556 36 36 36 33 34 34 36 35 35 32 38 35 38 35 36 28 31 26 29 27 69 83 77 70 45 45 41 45 4526 Homo sapiens∆ NP 001901 38 37 36 38 35 37 39 35 36 35 37 36 35 37 32 28 31 27 29 29 63 62 64 66 65 61 70 90 8827 Rana catesbeiana∆ BAC75398 33 32 34 33 32 33 35 32 31 35 34 34 37 35 35 30 32 24 28 30 65 64 65 67 66 79 50 61 6028 Tupaia chinensis∆ ELW47743 35 33 36 36 33 36 36 35 34 32 37 36 37 36 34 29 28 26 30 29 60 61 62 62 63 83 69 68 6729 Myotis davidii∆ ELK37212 37 36 34 38 35 35 36 35 34 34 34 34 34 35 35 30 31 24 30 31 64 63 64 67 64 93 78 81 9030 Pteropus alecto∆ ELK03306 37 36 35 37 34 36 36 35 35 34 37 36 35 36 32 28 30 26 29 30 63 63 64 66 64 94 77 80 94

a Individual cathepsins are marked as follows: ∗ cathepsin L; § cathepsin B; ¶ cathepsin G; ψ cathepsin D; ∆ cathepsin E. Sequenceidentity (%) – above diagonal, sequence similarity (%) – under diagonal.

dicted that the signal peptide comprises the region ofCsCath L between 1 and 17. Further analysis revealedthat CsCath L contains an inhibitor region along withits peptidase C1 super family active domain at 30–90,which belongs to inhibitor 29 superfamily.

Homologous analysis and multiple sequence alignmentHomologous analysis on BLASTp showed that CsCathL possesses a significant sequence similarity with othercathepsin L from various organisms, especially fromteleost fishes. The active sites are highly conservedamong the sequences taken for homologous analysis.The maximum identity was noticed with Lates calcar-ifer (91%), followed by Cyprinus carpio, Daneio rerioand Dicentrarchus labrax (83%) (Table 1). Moreover,multiple sequence alignment of CsCath L also revealeda high degree of identity with other cathepsin L. CsCathL is highly similar with other sequences taken for anal-ysis at signal peptide region, cleavage site, inhibitor re-gions and active sites (Fig. 1). The cathepsin L signa-ture motifs ERWNIN and GCNGG also remain con-served in the sequences taken for analysis.

Phylogenetic treeA phylogenetic tree was drawn to study the genetic dis-tance of CsCath L using the neighbour-joining methodand is presented in a radiation view (Fig. 2). Phyloge-netic analysis showed that CsCath L exhibits a strongrelationship with other cathepsin L from various organ-

isms. The CsCath L sequence is clustered with otherteleosts cathepsin L. Moreover, to find the evolution-ary position of the CsCath L, it was compared with theother sequences of cathepsin B, D, E and G.

Structural analysis of CsCath LThe secondary structural analysis of CsCath L showedthat the protein contains 29% α-helical region (99amino acid), 17% β-sheets and 54% random coils(Fig. 3a). I-Tasser program predicted five different mod-els of CsCath L protein. Based on the confidence score(c-score) the best model was selected for analysis. Thec-score of the selected model is –0.01. The predictedstructure was viewed in a PDB viewer (Fig. 3b). Theobtained tertiary structure was compared with the PDBto find out the structural similarity of CsCath L withother experimentally identified structures. The resultsshowed that CsCath L has the maximum structuralidentity with pro-cathepsin L from human, whose X-raystructure was experimentally determined (Coulombe etal. 1996.). The surface view analysis of CsCath L pro-tein structure with and without inhibitor region is pre-sented in Figure 4. The comparative images (Fig. 4aand 4b) showed that the eukaryotic thiol protease ac-tive site residues Cys140, His280 and Asn304 are presentin the centre of the protein. Further analysis showedthat the active site residues are masked by an inhibitorin the pro-protein region of CsCath L. The predictedRNA fold structure of CsCath L with MFE is given

Page 6: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

400 V. Kumaresan et al.

Fig. 2. Phylogentic analysis of CsCath L with other species reconstructed by the neighbour-joining method. The evolutionary distanceswere computed using the Poisson correction method. The tree is based on an alignment corresponding to full-length amino acidsequences using MEGA. The numbers at the branches denote bootstrap majority consensus values on 1,000 replicates. The scale barrepresents a genetic distance 0.2 as the frequency of substitutions in pair-wise comparison of two sequences. The tree is expressed inradiation view. The GenBank accession numbers are given in Table 1.

in Figure 5. The MFE of the predicted RNA structureof CsCath L is –471.93 kcal/mol. The prediction showsthat the RNA is mostly paired and very few nucleotidesare left unpaired.

Tissue distribution of CsCath LThe mRNA transcripts of CsCath L could be detectedby qRT-PCR (Fig. 6a). The largest quantity of CsCathL was observed in liver followed by spleen, heart, kidney,head kidney, blood, skin, gill, brain, muscle and intes-tine. Further statistical analysis showed that CsCath LmRNA expression was significantly higher (P < 0.05) inthe liver. Therefore, the fungal (A. invadans) and bacte-rial (A. hydrophila) infected liver tissue was selected to

investigate the temporal expression of CsCath L gene.

CsCath L mRNA expression in liver after fungal andbacterial infectionTo analyze the expression profile of CsCath L dur-ing infection, C. striatus was injected with A. in-vadans and A. hydrophila and the liver was analyzedby real time PCR (Fig. 6b,c). In A. invadans infectedtissue, the highest level of CsCath L mRNA tran-scripts was observed at 24 h post-injection (Fig. 6b).Significant differences (P < 0.05) were found in ex-pression at 3, 6, 12 and 24 h post-injection betweenA. invadans injected and the PBS injected controlgroups. In A. hydrophila injected groups, the level of

Page 7: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

Murrel cysteine protease cathepsin L 401

Fig. 3. The predicted structure of CsCath L. (a) Secondary structure of CsCath L predicted using Polyview method. The red colorzig-zag lines represent the α-helix region, the blue color horizontal lines represent the coils and the green arrows represent β-sheets.(b) Three-dimensional structure of the deduced amino acid sequence of CsCath L. The α-helices, β-strands and random coil regionsare represented in red, blue and green colour, respectively. The three active-site residues including cysteine, histidine and asparagineare represented in yellow, pink and orange, respectively, as ball structure.

CsCath L mRNA transcripts sharply increased until24 h post-injection and the expression was decreasedat 48 h post-injection, almost near to the basal level(Fig. 6c). The expression was significantly different (P< 0.05) in all the post-injection time points betweenA. hydrophila challenged and the PBS injected controlgroups.

CsCath L enzyme activity and its changes by fungal andbacterial infectionThe enzyme activity was observed in all the tissues

taken for analysis. The highest enzyme activity was no-ticed in liver, which was significantly higher (P < 0.05)than that detected in other tissues (Fig. 7a). Moreover,we selected spleen tissue infected with fungal and bac-terial immune stimulants to study the specific activ-ity of CsCath L. Figure 7b shows the enzyme activ-ity of CsCath L at various hours after infected withfungus A. invadans. A significantly (P < 0.05) maxi-mum peak was observed during 24 h post-injection ofA.invadans in enzyme activity (77.0 U/µg). In bacterialA. hydrophila infected murrel, significantly (P < 0.05)

Page 8: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

402 V. Kumaresan et al.

Fig. 4. Surface view of CsCath L in Pymol Viewer. (a) CsCath L surface view along with inhibitor region. (b) The view withoutinhibitor region and showing the active-site residues (Cys140, His280 and Asn304) of CsCath L.

Fig. 5. The predicted structure of CsCath L RNA fold with MFE using RNA fold server. The scale bar (colour map) from violet tored in the figure denotes the probability of nucleotides being unpaired.

highest activity (109.6 U/µg) was observed at 24 h postinjection (Fig. 7c).

Discussion

In this study, we deliver a molecular characterizationof the first cathepsin L from C. striatus. CsCath L

polypeptide contains a signal sequence between theresidues 1 and 17 similar to other cathepsin L ( Li etal. 2010; Ma et al. 2010). The cleavage site was no-ticed in CsCath L at Ala18 followed by an inhibitorregion (otherwise called pro-peptide region). Vernet etal (1995) reported that the cathepsin is synthesized asan inactive pro-enzyme at the N-terminal pro-peptide

Page 9: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

Murrel cysteine protease cathepsin L 403

Fig. 6. Relative quantification of CsCath L gene expression by real-time PCR. (a) Results of tissue distribution analysis of CsCath Lfrom various organs of striped murrel. Data are given as a ratio to CsCath L mRNA expression in intestine. (b,c) The time course ofCsCath L mRNA expression in liver at 0, 3, 6, 12, 24, and 48 h post-injection with A. invadans (b) and A. hydrophila (c).

which gets cleaved and converts into an active protein.A potential N-glycosylation site was noticed betweenthe residues 224 and 227 in CsCath L and is importantfor transportation of cathepsin L proteases into lyso-somes as reported by Ma et al (2010).The cathepsin L signature motifs, ERWNIN and

GCNGG are both present in the pro-domain region ofCsCath L. ERWNIN plays an important role in the inhi-bition of proteolytic activity (Karrer et al. 1993). Vernetet al (1995) pointed out that the GCNGG motif is re-lated to the pH-dependent intra-molecular processing.

Moreover, the cysteine residue in the GCNGG motif isassociated with the formation of a disulfide bridge (Kar-rer et al. 1993). The glutamine residue at the position133 in the CsCath L amino acid sequence helped in theformation of oxyanion hole (Menard & Storer 1992).Further analysis revealed that the CsCath L consistsof six potential substrate binding sites. These bindingsites may vary in different species, for example, channelcatfish Ictalurus punctatus (Yeh & Klesius 2009) andpearl oyster Pinctada fucata (Ma et al. 2010). Hence itis possible to suggest that the functional mechanism of

Page 10: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

404 V. Kumaresan et al.

Fig. 7. The specific activity of CsCath L. (a) Enzyme activity of CsCath L in different tissues. (b,c) CsCath L specific activity profileafter A. invadans (b) and A. hydrophila (c) infection, respectively. Values are shown as mean ± standard deviation.

cathepsin L is not the same in all organisms.Phylogenic analysis of CsCath L produced five sep-

arate clades. The tree exhibited three main branchesthat consist of L and B; E and D; and G respectively.The topology of the tree was in accordance with thecathepsin groups as cathepsins B and L are the largestand best-known cathepsin groups (cysteine proteasesof the papain family); cathepsins D and E are asparticproteases and cathepsin G is the serine protease.The secondary structure analysis indicated that

the CsCath L contains more coils due to the presenceof high amount of glycine (the smallest amino acidwith no side chain and not involved in cross linkages).The three-dimensional structure of the CsCath L pro-

tein was predicted by I-Tasser modelling program andcompared with the human pro-cathepsin. The analysisexhibited that the three active site residues (Cys140,His280, Asn304) of the CsCath L are located at thecentral position of the protein and masked by the in-hibitor region, thus making the protein as non-reactiveto the substrates. Moreover, the tertiary structure ofthe CsCath L pro-protein (with inhibitor region) andthe active protein (without the inhibitor region) wascompared and analyzed in the PyMol surface view. Itis also confirmed that the active sites are masked bythe inhibitor region. As reported earlier (Coulombe etal. 1996), this mask prevents the substrates from bind-ing to the active sites of the protein. When the inhibitor

Page 11: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

Murrel cysteine protease cathepsin L 405

region is cleaved from the pro-protein, the active sitesbecome open to substrate binding site, thus making theprotein functional. The predicted RNA structure indi-cates that the CsCath L RNA is highly stable based onthe high MFE value (–471.93 kcal/mol) and less num-ber of free nucleotides.Tissue distribution results showed that the CsCath

L is expressed in all the tissues taken for analysis, show-ing the active involvement of CsCath L in proteasefunction. Cathepsin expression was studied in many or-ganisms using various immune stimulants (Conus & Si-mon 2010). However, A. invadans and A. hydrophilainfection-induced cathepsin L expression in C. striatusis still unknown and it is reported for the first timehere. In this study, we observed the highest gene ex-pression of CsCath L in liver against fungal and bacte-rial infection. The gradual increment of CsCath L geneexpression in liver infected with A. hydrophila duringthe first 24 h post-injection may be due to the incre-ment of bacterial cell density with the time. These re-sults showed the involvement of CsCath L in C. stria-tus immune response against the fungal and bacterialinfection. These pathogen-induced gene expression wasrelated to inflammation, cytokine activity, antigen pre-sentation and binding activity (Trent et al. 2006).The enzyme activity of CsCath L was noticed in

all the tissues taken for analysis. Similar to the gene ex-pression studies, the highest enzyme activity was alsoobserved in liver. The specific activity of CsCath Lin spleen after challenged with fungus and bacteriumshowed its involvement in the immune process of stripedmurrel. As stated by Press & Evensen (1999), Dong etal. (2007) and Kim et al. (2011) the results of CsCathL enzyme activity analyses in spleen after fungal andbacterial infection indicate that the haematopoietic andlymphoid organs are closely related to the immune re-sponse in fish. Hence it is possible to suggest thatcathepsin L may play multifunctional roles in murrels.

Acknowledgements

This research was supported by DBT’s Prestigious Rama-lingaswami Re-entry Fellowship (D.O.NO.BT/HRD/35/02/2006) funded by Department of Biotechnology, Ministry ofScience and Technology, Government of India, New Delhi.

References

Ahn S., Sung J., Kim N., Lee A., Jeon S., Lee J., Kim J., Chung J.& Lee H. 2010. Molecular cloning, expression and characteri-zation of cathepsin L from mud loach (Misgurnus mizolepis).Appl. Biochem. Biotechnol. 162: 1858–1871.

Aranishi F., Ogata H., Hara K., Osatomi K. & Ishihara T. 1997.Purification and characterization of cathepsin L from hep-atopancreas of carp Cyprinus carpio. Comp. Biochem. Phys-iol. B Biochem. Mol. 118: 531–537.

Arockiaraj J., Avin F.A., Vanaraja P., Easwvaran S., Singh A.,Othman R.Y. & Bhassu S. 2012. Immune role ofMrNFκBI-α,an IκB family member characterized in prawnM. rosenbergii.Fish Shellfish Immunol. 33: 619–625.

Arockiaraj J., Gnanam A., Muthukrishnan D., Pasupuleti M.,Milton J. & Singh A. 2013. An upstream initiator caspase10 of snakehead murrel Channa striatus, containing DED,

p20 and p10 subunits: molecular cloning, gene expression andproteolytic activity. Fish Shellfish Immunol. 34: 505–513.

Arockiaraj J., Vanaraja P., Easwvaran S., Singh A., Alinejaid T.,Othman R. & Bhassu S. 2011. Gene profiling and character-ization of arginine kinase-1 (MrAK-1) from freshwater giantprawn (Macrobrachium rosenbergii). Fish Shellfish Immunol.31: 81–89.

Balaji K., Schaschke N., Machleidt W., Catalfamo M. & HenkartP.A. 2002. Surface cathepsin B protects cytotoxic lympho-cytes from self-destruction after degranulation. J. Exp. Med.196: 493–503.

Caster G. & Cole J.R. 1990. Diagnostic Procedure in VeterinaryBacteriology and Mycology, 5th Edn. Academic Press.

Chen L., Zhang M. & Sun L. 2011. Identification and expres-sional analysis of two cathepsins from half-smooth tonguesole (Cynoglossus semilaevis). Fish Shell?sh Immunol. 31:1270–1277.

Chevreux B., Pfisterer T., Drescher B., Driesel A., Muller W.,Wetter T. & Suhai S. 2004. Using the miraEST assembler forreliable and automated mRNA transcript assembly and SNPdetection in sequenced ESTs. Genome Res. 14: 1147–1159.

Conus S. & Simon H. 2010. Cathepsins and their involvement inimmune responses. Swiss Med. Wkly 140: w13042.

Coulombe R., Grochulski P., Sivaraman J., Menard R., Mort J.& Cygler M. 1996. Structure of human procathepsin L revealsthe molecular basis of inhibition by the prosegment. EMBOJ. 15: 5492–5503.

Dhanaraj M., Haniffa M., Ramakrishnan C. & Singh S.V. 2008.Microbial flora from the Epizootic Ulcerative Syndrome(EUS) infected murrel Channa striatus (Bloch, 1797) inTirunelveli region. Turk. J. Vet. Anim. Sci. 32: 221–224.

Dong W., Xiang L. & Shao J. 2007. Cloning and characterisationof two natural killer enhancing factor genes (NKEF-A andNKEF-B) in pufferfish, Tetraodon nigroviridis. Fish ShellfishImmunol. 22: 1–15.

Felbor U., Kessler B., Mothes W., Goebel H., Ploegh H., BronsonR. & Olsen B. 2002. Neuronal loss and brain atrophy in micelacking cathepsins B and L. Proc. Natl. Acad. Sci. USA 99:7883–7888.

Hall T. 1999. BioEdit: a user-friendly biological sequence align-ment editor and analysis program for Windows 95/98/NT.Nucleic Acid Symp. Ser. 41: 95–98.

Heu M., Kim H., Cho D., Godber J. & Pyeun J. 1997. Purifi-cation and characterization of cathepsin L-like enzyme fromthe muscle of anchovy, Engraulis japonica. Comp. Biochem.Physiol. B Biochem. Mol. 118: 523–529.

Karrer K., Peiffer S. & Ditomas M. 1993. Two distinct gene sub-families within the family of cysteine protease genes. Proc.Natl. Acad. Sci. USA 90: 3063–3067.

Kim J., Jeong J., Park H., Kim E., Kim H., Chae Y., Kim D. &Park C. 2011. Molecular identification and expression anal-ysis of cathepsins O and S from rock bream, Oplegnathusfasciatus. Fish Shellfish Immunol. 31: 578–587.

Lee J., Chen H. & Jiang S. 1993. Purification and character-ization of proteinases identified as cathepsins L and L-like(58 kDa) proteinase from mackerel (Scomber australasicus).Biosci. Biotechnol. Biochem. 57: 1470–1476.

Livak K. & Schmittgen T. 2001. Analysis of relative gene expres-sion data using real-time quantitative PCR and the 2−∆∆CTmethod. Methods 25: 402–408.

Li W., Jin X., He L., Jiang H., Gong Y., Xie Y. & Wang Q.2010. Molecular cloning, characterization, expression and ac-tivity analysis of cathepsin L in Chinese mitten crab, Eri-ocheir sinensis. Fish Shellfish Immunol. 29: 1010–1018.

Li W.W., He L., Jin X.K., Jiang H., Chen L.L., Wang Y. & WangQ. 2011. Molecular cloning, characterization and expressionanalysis of cathepsin A gene in Chinese mitten crab, Eri-ocheir sinensis. Peptides 32: 518–525.

Ma J., Zhang D., Jiang J., Cui S., Pu H. & Jiang S. 2010. Molec-ular characterization and expression analysis of cathepsin L1cysteine protease from pearl oyster Pinctada fucata. FishShellfish Immunol. 29: 501–507.

Matsumoto I., Watanabe H., Abe K., Arai S. & Emori Y. 1995.A putative digestive cysteine proteinase from Drosophila

Page 12: A murrel cysteine protease, cathepsin L: bioinformatics characterization, gene expression and proteolytic activity

406 V. Kumaresan et al.

melanogaster is predominantly expressed in the embryonicand larval midgut. Eur. J. Biochem. 227: 582–587.

Menard R. & Storer A. 1992. Oxyanion hole interactions in serineand cysteine proteases. Biol. Chem. 373: 393–400.

Press C.M. & Evensen O. 1999. The morphology of the immunesystem in teleost fishes. Fish Shellfish Immunol. 9: 309–318.

Rawlings N., Morton F. & Barrett A. 2006. MEROPS: the pep-tidase database. Nucleic Acids Res. 34: D270–D272.

Reinheckel T., Deussing J., Roth W. & Peters C. 2001. Towardsspecific functions of lysosomal cysteine peptidases: pheno-types of mice deficient for cathepsin B or cathepsin L. Biol.Chem. 382: 735–741.

Roy A., Kucukural A. & Zhang Y. 2010. I-TASSER: a unifiedplatform for automated protein structure and function pre-diction. Nat. Protocols 5: 725–738.

Stephens A., Rojo L., Araujo-Bernal S., Garcia-Carreno F. &Muhlia-Almazan A. 2012. Cathepsin B from the white shrimpLitopenaeus vannamei: cDNA sequence analysis, tissues-specific expression and biological activity. Comp. Biochem.Physiol. B Biochem. Mol. Biol. 161: 32–40.

Tamura K., Peterson D., Peterson N., Stecher G., Nei M. & Ku-mar S. 2011. MEGA5: Molecular Evolutionary Genetics Anal-ysis using maximum likelihood, evolutionary distance, andmaximum parsimony methods. Mol. Biol. Evol. 28: 2731–2739.

Tingaud-Sequeira A. & Cerda J. 2007. Phylogenetic relationshipsand gene expression pattern of three different cathepsin L(Ctsl) isoforms in zebrafish: Ctsla is the putative yolk pro-cessing enzyme. Gene 386: 98–106.

Trent M., Stead C., Tran A. & Hankins J. 2006. Diversity ofendotoxin and its impact on pathogenesis. J. Endotoxin Res.12: 205–223.

Turk V., Turk B. & Turk D. 2001. Lysosomal cysteine proteases:facts and opportunities. EMBO J. 20: 4629–4633.

Uinuk-Ool T., Takezaki N., Kuroda N., Figueroa F., Sato A.,Samonte I., Mayer W. & Klein J. 2003. Phylogeny of antigen-processing enzymes: cathepsins of a cephalochordate, an ag-nathan and a bony fish. Scand. J. Immunol. 58: 436–448.

Vernet T., Berti P.J., De Montigny C., Musil R., Tessier D.C.,Menard R., Magny M.C., Storer A.C. & Thomas D.Y. 1995.Processing of the papain precursor. The ionization of a con-served amino acid motif within the pro region participates inthe regulation of intramolecular processing. J. Biol. Chem.270: 10838–10846.

Visessanguan W., Benjakul S. & An H. 2003. Purificationand characterization of cathepsin L in arrowtooth flounder(Atheresthes stomias) muscle. Comp. Biochem. Physiol. BBiochem. Mol. Biol. 134: 477–487.

Yamashita M. & Konagaya S. 1990. Purification and characteri-zation of cathepsin L from the white muscle of chum salmon,Oncorhynchus keta. Comp. Biochem. Physiol. B 96: 247–252.

Yeh H. & Klesius P. 2009. Channel catfish, Ictalurus punctatus,cysteine proteinases: cloning, characterisation and expressionof cathepsin H and L. Fish Shellfish Immunol. 26: 332–338.

Zhang Y. 2008. I-TASSER server for protein 3D structure pre-diction. BMC Bioinformatics 9: 40.

Received July 18, 2013Accepted December 13, 2013