domain analysis of 3 keto acyl-coa synthase for structural variations in vitis vinifera and oryza...
TRANSCRIPT
Interdiscip Sci Comput Life Sci (2014) 6: 1–14
DOI: 10.1007/s12539-013-0017-8
Domain Analysis of 3 Keto Acyl-CoA Synthase for StructuralVariations in Vitis vinifera and Oryza brachyantha using
Comparative Modelling
Mamta Sagar∗, Neetesh Pandey, Naseha Qamar, Brijendra Singh, Akanksha Shukla(Department of Bioinformatics, University Institute of Engineering and Technology, Chhatrapati Shahuji Maharaj
University, Kanpur 208024, India)
Received 14 June 2013 / Revised 30 December 2013 / Accepted 21 January 2014
Abstract: The long chain fatty acids incorporated into plant lipids are derived from the iterative addition of C2units which is provided by malonyl-CoA to an acyl-CoA after interactions with 3-ketoacyl-CoA synthase (KCS),found in several plants. This study provides functional characterization of three 3 ketoacyl CoA synthase likeproteins in Vitis vinifera (one) and Oryza brachyantha (two proteins). Sequence analysis reveals that protein ofOryza brachyantha shows 96% similarity to a hypothetical protein in Sorghum bicolor ; total 11 homologs werepredicted in Sorghum bicolor. Conserved domain prediction confirm the presence of FAE1/Type III polyketidesynthase-like protein, Thiolase-like, subgroup; Thiolase-like and 3-Oxoacyl-ACP synthase III, C-terminal andchalcone synthase like domain but very long chain 3-keto acyl CoA domain is absent. All three proteins were foundto have Chalcone and stilbene synthases C terminal domain which is similar to domain of thiolase and β keto acylsynthase. Its N terminal domain is absent in J3M9Z7 protein of Oryza brachyantha and F6HH63 protein of Vitisvinifera. Differences in N-terminal domain is responsible for distinguish activity. The J3MF16 protein of Oryzabrachyantha contains N terminal domain and C terminal domain and characterized using annotation of thesedomains. Domains Gcs (streptomyces coelicolor) and Chalcone-stilbene synthases (KAS) in 2-pyrone synthase(Gerbera hybrid) and chalcone synthase 2 (Medicago sativa) were found to be present in three proteins. Thissimilarity points toward anthocyanin biosynthetic process. Similarity to chalcone synthase 2 reveals its possiblerole in Naringenine and Chalcone synthase like activity. In 3 keto acyl CoA synthase of Oryza brachyantha.Active site residues C-240, H-407, N-447 are present in J3MF16 protein that are common in these three protein atdifferent positions. Structural variations among dimer interface, product binding site, malonyl-CoA binding sites,were predicted in localized combination of conserved residues.Key words: Vitis vinifera, Oryza brachyantha, 3 keto acyl-CoA synthase, flavonoid, domain analysis, sequenceanalysis, KCS, active site, malonyl CoA binding site, structure alignment, binding site, Pyrone synthase.
List of abbreviations
ACP Acyl carrier proteinCHS Chalcone synthaseFA Fatty acidKCS 3 keto acyl CoA SynthaseGcs Germicidin synthasePS Pyrone synthasePKS polyketide synthaseJ3M9Z7 ORYBR & J3MFI6 ORYBR 3 keto acylCoA Synthase like protein in Oryza brachyanthaF6HH63 VITVI. 3 keto acyl CoA Synthase in VitisviniferaVLCFA Very long chain fatty acids
∗Corresponding author.E-mail: [email protected]
1 Introduction
Very-long-chain fatty acids (VLCFAs) are essentialprecursors of cuticular waxes and aliphatic suberins inroots. Condensation of C (2) units to an acyl CoAby 3-ketoacyl CoA synthase (KCS) is first committedstep in VLCFA biosynthesis. The very long chain fattyacids (VLCFA) incorporated into plant lipids are de-rived from the iterative addition of C2 units providedby malonyl-CoA to an acyl-CoA by the 3-ketoacyl-CoAsynthase (KCS) component of a fatty acid elongase(FAE) complex (Millar, A.A. et al. 1999; Lee SB etal. 2009). An Arabidopsis fatty acid elongase gene,KCS1 encodes a 3- ketoacyl-CoA synthase which is in-volved in very long chain fatty acid synthesis in vegeta-tive tissues, and wax biosynthesis. In Arabidopsis, waxbiosynthesis involves modification of most VLCFA by
2 Interdiscip Sci Comput Life Sci (2014) 6: 1–14
either the acyl reduction or decarbonylation pathway(Hannoufa et al. 1993; Jenks et al. 1995). A smoothlayer of epicuticular wax is found over the epidermalsurface in wild-type Arabidopsis leaf (Jenks, M.A. etal., 1995).
CHS is a key enzyme of plant flavonoid biosynthesis.Although it has various functions in plants but its rolein plant pathogen defence and insect resistance mecha-nism needs more precise definitions (Major, I. T. et al.2006; Robin D. Mellway et al. 2009). However, Palo RT(1984) described role of secondary metabolites as chem-ical defence against herbivores in Betula Spp, Salix Spp,and Populus Spp. CHS and its related genes are foundin several plants. One important related enzyme is 3keto acyl CoA synthase. Allthough 3 keto acyl CoAsynthase is well chracterised in Arabidopsis thaliana,Populus tricocarpa and other organism but this enzymeis not studied in Oryza brachyantha and no study re-ports KCS function in this organism except genomicannotation of whole genome sequence provided by Dr.Mingsheng Chen or Dr. Rod Wing (Jinfeng Chen et al.,2013). This enzyme has transferase activity, transfer-ring acyl groups other than amino-acyl groups. In thisstudy, homologs of 3 keto acyl CoA synthase sequenceof P . Trichocarpa x P.deltoides are predicted usinghomology search, these are similar 3-ketoacyl-synthasesynthase family proteins in Populus trichocarpa, Rici-nus communis and Gosypium hirsutum with similarityof more than 90% similarity. Uncharacterised proteinshomologs were also obtained, which are focus of ourstudy. Herein we have triesd to characterise these pro-teins. Conserved domain analysis, protein functionalanalysis followed by modelling and structural alignmentapproaches are used to predict role of unchracterisedproteins in Vitis vinifera and Oryza brachyantha. Pro-tein functional analysis was done using fold libraryand structure-based alignments for homologous proteinfamilies. This study provides functional annotation of3 keto acyl CoA synthase protein; A9PIU6, F6HH63,J3MFI6 and J3M9Z7 in T . Trichocarpa x T.deltoides,Vitis vinifera and Oryza brachyantha by Comparativedomain analysis based on sequence- sequence and se-quence structure comparison. Activity of these pro-teins has been characterized on the basis of structuralsimilarities and variations. Domains Germicidin syn-thase (streptomyces coelicolor) and Chalcone-stilbenesynthases (KAS) in 2-pyrone synthase (Gerbera hybrid)are analysed to predict role of protein in anthocyaninproduction and narigenin synthesis activity. Other do-mains, CHS 2 (Medicago sativa) B ketoacylsynthaseIII (Streptomyces sp.r1128 ). 3-oxoacyl-ACP synthaseIII (Mycobacterium tuberculosis), 3-oxoacyl-ACP syn-thase (Thermus thermophilus), β keto acyl-acyl carrierprotein synthase III (Escherichia coli) are also anal-ysed. Finally structural variations among dimer inter-face, product binding sites and malony CoA binding
sites are analysed on the basis of amino acids positionsin sequence and structure. Differences between C andN terminal domain are considered to analyse the func-tional variation. All proteins were found to be relatedto Chalcone and stilbene synthases; plant-specific PKSand related enzymes, also called type III PKS whichindicate the presence of cd00831domain.
2 Materials and Methods
2.1 Protein sequences of keto acyl CoA syn-thase were retrieved from NCBI ENTREZ database(www.ncbi.nlm.nih.gov). An isolated gene from Pop-ulus trichocarpa x Populus deltoides exposed to con-tinuous feeding by Malacosoma disstria Hubner (foresttent caterpillar) mid-instar larvae was characterised byits product 3 ketoacyl synthase. 3-ketoacyl-CoA syn-thase in Populus trichocarpa x P.deltoides (S Kawaiet al. 1996). This protein was taken as model pro-tein to perform local and global alignment. Proteinsequences and annotation of homologs of 3 keto-acylCoASynthase (EF148325) were retrieved from Uniprotdatabases (www.uniprot.org/help/uniprotkb).2.2 Uncharacterised proteins were obtained from re-sult of Homology search of 3 keto acyl CoA syn-thase using BLAST. Protein annotation of sequenceof Oryza brachyantha was obtained from Ensemble-plants database (www.plants.ensembl.org).Transcriptid OB09G23410.1, protein id OB09G23410.1 annota-tion is given here-Ave. residue weight: 109.221 g/mol,Charge: 19.5, Isoelectric point: 9.2732, location -Chromosome 9: 12, 008, 506-12, 010, 333 forwardstrand, Number of residues: 483 aa, (Jinfeng Chen,Mingsheng Chen et al., 2013). Four entries were re-lated to acyl-tansferase.2.3 Global alignment of Target sequence was doneto predict its homologs using FASTx program(www.ebi.ac.uk/Tools/sss/fasta/). Gap extensionpenalty and open gap penalty of −2 and −10 for pro-tein sequence are given respectively as negative score;total 50 related sequence of EF148325 (KCS from Pop-ulus hybrid) were predicted. For global sequence align-ment, matrix was based on Needleman wunsch algo-rithm (Needleman et al. 1970).2.4 Domain search analysis and Pair wise alignment(F. Smith et al. 1981) was performed using BLAST2.2.25 (Altschul et al. 1997) and Local alignment wasalso performed using WU BLAST to get homologs of 3-ketoacyl-CoA synthase of P . Trichocarpa x P.deltoides.CDD CDART: (www.ncbi.nlm.nih.gov/Structure/),conserved Domain Architecture Retrieval Tool Program(Marchler-Bauer A et al., 2013). The E-value corre-sponding to a given bit score is calculated
E = mn2−5
where m and n are sequence length and s is bit score,
Interdiscip Sci Comput Life Sci (2014) 6: 1–14 3
It must attain the score x twice in a row for an HSPto get the score 2x, so E value decrease exponentiallywith score.2.5 Protein Functional Analysis (Quevillon E. etal. 2005) using the InterProScan program (http://www.ebi.ac.uk/Tools/services/web iprscan) was per-formed to provide structural view of domains on proteinsequence. InterProScan (E.M. Zdobnov et al. 2001) isa tool that combines different protein signature recog-nition methods from the InterPro.2.6 Sequences were searched against fold library usingenvironment-specific, substitution tables and structure-dependent gap penalties on web based FUGUE(v2.s.07) http://tardis.nibio.go.jp/result/fugue/21367/fugue.html.Fold library and substitution tables arebased on the HOMSTRAD database. (Shi J,, et al.2001). HOMSTRAD (HOMologous STRucture Align-ment Database) (http://tardis. nibio.go.jp/homstrad/)(Mizuguchi K et al. 1998) is a curated databaseof structure-based alignments for homologous proteinfamilies. Sequences of representative members of eachfamily were aligned on the basis of their 3D struc-tures using 3D structure of representative which useprograms MNYFIT, STAMP and COMPARER.2.7 Biological Annotation was retrieved from PDBedatabase http://www.ebi.ac.uk/pdbe-srv/view/entry/1mzj/summary.html via structural biology knowledge-base database. Protein databank (http://www.rcsb.org/pdb/explore/) was used to retrieve protein struc-tures in which domains were predicted.2.8 Geno3D (http://geno3d-pbil.ibcp.fr) was used formodelling uncharacterised protein. Modelling pro-tein was done using template Template pdb1cgzAand pdb3tsyA were used for modelling J3M9Z7 andF6HH63 protein (Combet C., 2002). Comparative(“homology”) modelling approximates the 3D structureof a target protein for which only the sequence is avail-able, provided an empirical 3D “template” structureis available with >30% sequence identity (i) Determi-nation of structurally conserved region (SCRs). (ii)Alignment of amino acid sequence of the unknown pro-tein with those of the reference protein(s) within theSCRs. (iii) Assignment of coordinates in the conservedregions. (iv) Prediction of conformations for the restof the protein chain, including loops. (v) Searchingfor the optimum side chain conformations for residuesthat differ from those in the reference protein. (vi)Energy minimization and molecular dynamics to re-fine the molecular structure so that steric introducedduring the (vii) Model building process can be relieved(Baker et al. 2001; Chothia et al. 1986 and Vitkup etal. 2001. Model of J3M9Z7 protein shows core region(66.4%), allowed region (24.1%), generously allowed re-gions (5.8%), Model energy was −15505.70 K/cal/moleand minimum disallowed region was 3.8%. disallowedregion increases (more than 4.1%-4.9%, if we further
decrease the energy. Therefore we have chosen modelwith −15505.70 kcal/mole energy and 3.8% disallowedregion, which was minimum among all conformations.
Model of F6HH63 shows core region (62%), allowedregion (29.12%), generously allowed regions (5.8%), ifdisallowed regions are kept minimum 2.3 %, and thenenergy of model −15594.60%. If we further decreaseenergy up to −15787.20 Kcal/mole, then disallowed re-gion slightly increases by 0.3% that is 2.6% which isminor difference and we do not prefer amino acids offunctional sites to fall in disallowed region, thereforewe have chosen this second model.
Protein J3MF16 and A9PIU6 9ROSI couldnot be modelled using Geno 3D (Fig. 6(a)),therefore modelling was also done by Swiss pdb(http://swissmodel.expasy.org) modelling tool (ArnoldK et al., 2006; Schwede T et al., 2003; Guex, N et al.,1997). It uses z score and qmean score to evaluatemodels, two models were obtained long and short.From the point of view of covering all functionalsites in structure, only long model was used forvisualisation. For J3MF16 protein, 1qlvA templatewas used with 17.98 identity and 2.10 A distance.For this model qmean z score was −4.73 and e valuewas 0.00e-1. Template 1tedA was used to modelA9PIU6 protein, which shows 17.05% identity and2.25 A distance. Qmean z score was −5.36 and evalue was 0.00e-1 for this protein. Although Qmeanz is very low score for both structures and does notshow good quality, but modelled structure of theseJ3MF16 and A9PIU6s protein have covered 106 to488 and 114 to 496 amino acid residues respectivelyand we have used this to show distant sites, shortrange models were not taken into consideration. TheQMEAN4 score is a composite score consisting ofa linear combination of 4 statistical potential terms(estimated model reliability between 0-1) (Fig. 6(b)). These four terms are C β interaction energy,All-atom pairwise energy, Solvation energy, Torsionangle energy (Benkert P, Biasini M, Schwede T. 2011).Pymol was used for visualisation of structure model.Pymol Visualisation tool (http://www.pymol.org) isan open-source molecular visualisation system createdby Warren Lyford DeLano2.9 Structure-structure alignment was done usingMatchalign programme of chimera. Molecular graphicsand analyses were performed with the UCSF Chimerapackage. Chimera is developed by the Resource for Bio-computing, Visualization, and Informatics at the Uni-versity of California, San Francisco (Pettersen et al.2004). Matchalign align amino acids in two structuresand calculate RMSD value. Structure of putative PKSand 2 pyrone synthase are used as refrence structureand aligned with modelled structure of 3 keto acyl syn-thase like protein in Vitis vinfera and Oryza brachyan-tha.
4 Interdiscip Sci Comput Life Sci (2014) 6: 1–14
Vitis viniferaF6HH63_VITVOryza brachyanthaJ3MF16_ORYBROryza brachyanthaJ3M9Z7_ORYBR
3000
2500
2000
1500
1000
500
0
CH
S_lik
eFA
E1_
CU
T1_
Rpp
AA
CP
_sy
n_II
I_C
KA
S_II
IIn
it_
cond
_en
zym
esC
hal_
sti_
synt
_C
Ket
oacy
l-sy
nt s
uper
fam
ilyC
Ond
_en
zym
es s
uper
fam
ilyP
LN
O21
92P
LN
O28
54P
LN
O23
77P
LN
O29
32P
LN
OO
415
Bcs
AFab
HP
RK
1287
9P
LN
O31
69P
RK
0935
2P
LN
O31
72P
LN
O31
68P
LN
O23
26P
RK
O59
63C
hal_
sti_
synt
_N
PLN
O31
71C
hal_
sti_
synt
Fig. 1 Graph is generated using bit score of domain sim-ilarity search for all protein domains present inthree proteins, shown on the right hand. ProteinJ3M9Z7 ORYBR shows lowest bit score range forall predicted domains.
3 Results and discussion
3.1 Domain analysis using domain profiling
Several domains very long 3 keto acyl CoA syn-thase, FAE1/Type III PKS-like protein (IPR013601),Thiolase-like, subgroup (IPR016038), Thiolase-like(IPR016039) and 3-Oxoacyl-(ACP) synthase III C-terminal (IPR013747) have been predicted in 3 ketoacyl CoA synthase of Populus trichocarpa x Populusdeltoides and its homologs which were obtained fromsimilarity search based on local alignment. Sequencesimilarity search revealed three uncharacterized 3 ketoacyl CoA synthase like protein, one protein F6HH63is from Vitis vinifera and other two proteins (J3MFI6,J3M9Z7) are from Oryza brachyantha. These three pro-teins do not show the presence of all domains. Very long3 keto acyl CoA synthase is found in all other fourtyseven (47) orthologs obtained from blast search. Secondsimilar domain is FAE1/Type III PKS-like proteins,this domain is found in 3-Ketoacyl-CoA synthases, typeIII PKS, fatty acid elongases and fatty acid condens-ing enzymes and are found in both prokaryotic and eu-karyotic species. This domain is mainly found in plantspecies.
Third similar domain is Thiolase-like, subgroup(IPR016038) and Thiolase-like (IPR016039) that aresubgroup of thiolase-like domains and usually occursin two similar copies that are arisen through duplica-tion. Such proteins are related to thiolase and chalconesynthase. The thiolase-like enzymes include thiolase,
B-ketoacyl-ACP synthases types I and Actinorhodinpolyketide β-keto acyl synthases 1 and 2. Thiolase-like (IPR016039) also include fatty oxidation com-plex β subunit (3-ketoacyl-CoA thiolase additionally.The CHS-like enzymes include CHS, Ketoacyl-ACPsynthase III (FabH); PKS, 3-hydroxy-3-methylglutarylCoA synthase, Dihydropinosylvin synthase. Third sim-ilar domain is 3-Oxoacyl-ACP synthase III C-terminal.This domain is found on 3-Oxoacyl-(ACP) synthase III,the enzyme responsible for initiating the chain of reac-tions of the fatty acid synthase in plants and bacteria.
3.2 Conserved domain prediction
Conserved domain prediction was performed usingCDD tool to analyse all conserved domains. Twentyfive (25) domains were predicted (Table 1), Chalconeand stilbene synthases; plant-specific PKS is also foundin 2-Pyrone Synthase complexed with acetoacetyl-Coain Gerbera hybrid cultivar and β keto acyl carrier pro-tein in E.coli. FAE1/Type III PKS-like protein is foundin Brassica napus (rape) as 3 keto acyl synthase. Simi-lar domain putative very-long-chain fatty acid condens-ing enzyme is found in CUT1 of Oryza sativa japonica;Ketoacyl-ACP synthase III (KASIII) initiates the elon-gation in type II “initiating” condensing enzymes are asubclass of decarboxylating condensing enzymes. Mem-bers of this family are described as 3-ketoacyl-CoA syn-thase and similar domain is found in B-Ketoacyl- ACPSynthase Iii (Fabh) of Escherichia coli. 3-Oxoacyl-(ACP) synthase III C terminal Chain B, Priming B-Keto acyl synthase from Polyketide Biosynthetic Path-way of Streptomyces sp. R1128. Chalcone and stilbenesynthases, C-terminal domain; this domain of CHS isreported to be structurally similar to domains in thi-olase and β-ketoacyl synthase. These are related tofatty acid synthesis pathway. Similar domain is foundin Pyrone Synthase (Pys) from Gerbera Hybrida.
Conserved domain prediction reveals domainsPLN02854, PLN02377 and PLN02192 with more sig-nificant negative e value of 0e-00 in Oryza brachyan-tha in comparison to positive e value of 0e+00 inVitis vinifera. Domains PLN00415, PLN02932 havebeen predicted with negative e value of 0e-00 inthese three proteins. These domains are 3 namedas 3 keto acyl CoA synthase. Negative e values de-fine significant alignment, in these protein, this in-dicate partial presence of domain 3 keto acyl CoAsynthase. Other domains have also been predicted,out of these PLN03169 are not characterised (Table1). Out of twenty five domains (25) domains 19 do-mains are common in KCS (J3M9Z7 and J3MFI6)of Vitis vinifera and Oryza brachyantha. (Table 1)Presence and absence of other six domains are ob-served in these organisms. Chalcone & stilbene syn-thase, C- terminal domain is absent in Vitis vinifera(F6HH63 VITVI) and present in Oryza brachayan-
Interdiscip Sci Comput Life Sci (2014) 6: 1–14 5
Table 1 Information obtained from Uniprot databse
Uiprot id/
accession numberName of protein Organism
A9PIU6 9ROSI 3 keto acyl CoA synthase P . Trichocarpa X P.deltoides
F6HH63 VITVI Putative uncharacterized protein OS=Vitis vinifera (Orderedlocus name-VIT 11s0016g04700)
Vitis vinifera
J3MFI6 ORYBR Recommended name-3 keto acyl CoA synthase encoded byOB06G27800 Uncharacterized protein
Oryza brachyantha
J3M9Z7 ORYBR encoded by OB05G34210 uncharacterized protein Oryza brachyantha
tha (J3M9Z7 and J3MFI6). 3 oxoacyl synthase IIIis absent in Oryza brachyantha (J3MFI6 ORYBR)but present in protein (J3M9Z7 ORYBR) of Oryzabrachyantha and Vitis vinifera. Chalcone and stilbenesynthase and N-terminal domain is absent in Oryzabrachyantha (J3M9Z7 ORYBR) and Vitis vinifera(F6HH63 VITVI) but present in Oryza brachyantha(J3MFI6 ORYBR). Chalcone and stillbene synthase ispresent in Vitis vinifera and absent in Oryza brachyan-tha. These are the critical differences of this proteinamong these plant organisms. Presence of N and Cterminal domain is related to biochemical activity inpathway leads to the synthesis of Chalcone stilbene syn-thase. C terminal domain is present in three proteins.It is clear from this study that protein J3MFI6 ORYBRcontain both N and C terminal domain of chalconeand stilbene synthase, thus it is confirmed that thisprotein has functionality of Chalcone and stilbene syn-thase. The C-terminal domain of Chalcone synthase isreported to be structurally similar to domains in thio-lase and β-ketoacyl synthase. All three proteins containC terminal domain which share similar structure withdomain of thiolase and β keto acyl synthase. Differencesin N-terminal domain is responsible for distinguish ac-tivity. N terminal domain is absent in J3M9Z7 proteinof Oryza brachyantha and F6HH63 of Vitis vinifera.Activity of these proteins can be characterised on thebasis of these feature. The protein J3MF16 of Oryzabrachyantha contains N terminal domain and C termi-nal domain and thus can be characterised using anno-tation of these domains but modelling of this proteinwas difficult because of low sequence identity (17.98)with template protein. Several uncharacterised domainPRK09352, PLN03172 and PLN02854 are present inboth varieties of Oryza brachyantha and Vitis vinifera,allthough PRK05963 uncharacterised domain is ab-sent in Oryza brachyantha (J3MFI6 ORYBR) and Vi-tis vinifera but present in J3M9Z7 of Oryza brachyan-tha. PRK05963 uncharacterised domain absent inJ3MFI6 of Oryza brachyanta and Vitis vinifera. Ensm-bleplants database entry shows molecular function ofOryza brrachyantha is transferase, catalytic and trans-ferring acyl groups and annotation source is UniPro-
tKB/TrEMBL: A6MCZ1 9ORYZ.PLN02854, PLN02377, PLN02932, PLN00415 do-
mains present in 3-ketoacyl-CoA synthase enzymewhich are found in bryophytes and pteridophytesplants, other pln domains are related to chalcone syn-thase like protein (Fig. 2). Other predicted domainsare from Populus hybrids which are related to flavonoidpathway.
12
10
8
6
4
2
0
Num
ber
of h
omol
ogs
Camell
ia ole
ifera
Brach
ypod
ium
dista
chyo
n
Glycine m
ax
Cucum
is sativ
us
Arabido
psis
thali
ana
Vitis v
inife
ra
Med
icago
trun
catu
la
Eran
this
hyem
alis
J3MFI6_ORYBR (Oryza brachyantha)
J3M9Z7_ORYBR (Oryza brachyantha)
F6HH63_VITVI (Vitis vinifera)
_ ( y y )
J3M9Z7_ORYBR (Oryza brachyantha)
F6HH63_VITVI (Vitis vinifera)
Fig. 2 Local alignment using NCBI-BLAST: Similarityscore of homologs in various organisms are depicted.
3.2.1 Sequence homology analysis of 3 ketoacyl CoA synthase in Oryza brachyantha(J3M9Z7), Oryza brachyanta (J3MFI6)and Vitis vinifera (F6HH63)
J3MFI6 protein of Oryza brachyanta shows thatit is more similar to hypothetical protein OsI 23590(EAZ01557.1) Oryza sativa Indica Group and Oryzasativa Japonica Group) hypothetical protein SOR-BIDRAFT 10g023290 (Sorghum bicolor) share 94%(100% query coverage) and 96% (96% query coverage)similarity. Twenty protein entries were named as 3 ketoacyl CoA synthase. Out of these, eleven (11) 3 ketoacyl CoA synthase proteins are from Sorghum bicolor.Nine entries were searched from Glycine max (Fig. 2).
6 Interdiscip Sci Comput Life Sci (2014) 6: 1–14
90
80
70
60
50
40
30
20
10
0
Sim
ilari
ty
Glycine m
ax
Vitis v
inife
ra
Cucum
is sativ
us
Arabido
psis
thali
ana
Brach
ypod
ium dist
achy
on
Eran
this
hymali
s
Med
icago
trun
catu
la
Camell
ia ole
ifera
F6HH63_VITVI (Vitis vinifera)
J3M9Z7_ORYBR (Oryza brachyantha)
J3MFI6_ORYBR (Oryza brachyantha)J3MFI6_ORYBR (Oryza brachyantha)
Fig. 3 All homologs of 3 keto acyl CoA synthase are shown in various organisms.
This shows that J3MF16 protein of Oryza brachyanthais highly similar to 3 keto acyl synthase of Sorghum bi-color. All though similarity to KCS of Oryza sativa In-dica, Oryza sativa japonica is not considered significantbecause Oryza brachyantha is originated from Oryza.
Total 32 homologs were predicted in 3 keto acyl CoAsynthase (J3M9Z7) of Oryza brachyantha in differentorganism (see Fig. 2, 3 for more description); Vitisvinifera (72.6%) similarity, Glycine max (79.0%), Cu-cumis sativus (76%).
Total thirty one (31) 3 keto acyl CoA synthase pro-tein of Putative uncharacterized protein Vitis viniferawere predicted in various organisms with differentsimilarity. These are highly similar; Glycine max(77.7%), Vitis vinifera (86.4%) and Cocumis sativus(79.6%). Brachypodium distachyon (81%), Medicagotruncatula, Camellia oleifera (85%). Maximum ho-mologs of 3 keto acyl CoA synthase of Vitis viniferaand Oryza brachyantha were predicted in Glycine max,Vitis vinifera and Cucumus sativus. More homologsof Oryza brachyantha in Brachypodium distachyon arepredicted than other organism (Fig. 2). Similarityis higher in protein of Medicago truncatula, Camelliaoleifera (85%) and Vitis vinifera (86.4).
3.3 Domain prediction using sequence struc-ture comparison
Various domains were predicted in P . Trichocarpa xP.deltoides, Vitis viinfera and Oryza brachyantha bysequence structure comparison. Sequence structureswere aligned for each domain. All four protein showssimilarity to Gcs (hs3v7ia) and a type III PKS pu-tative PKS from Streptomyces coelicolor with 14.23,14.08, 13.15, 13.23 score. Chalcone and stilbene syn-
thases (KAS) was predicted in 3 keto acyl CoA syn-thase like protein of P . Trichocarpa x P.deltoides,Vitis viinfera, Oryza brachyanta with 13.08, 13.93,17.74 16.02 score respectively. These hits are 2-pyrone synthase (Gerbera hybrid) chalcone synthase2 (Medicago sativa), B-ketoacylsynthase III (Strepto-myces sp.r1128 ), 3-oxoacyl-[ACP] synthase III (My-cobacterium tuberculosis) 3-oxoacyl-[acyl-carrier pro-tein] synthase (Thermus thermophilus), B-ketoacyl-ACP synthase III (Escherichia coli).
Other domain hs2f82a profile hits from 3-hydroxy-3-methylglutaryl CoA synthases of Brassica juncea Bras-sica juncea were also predicted in Vitis vinifera, popu-lus hybrids and Oryza brachyantha with 4.01, 4.25, 4.25score. This enzyme is involved in isoprenoid biosyn-thetic process. Amino acids tyrosine, arginine, lysine,serine, asparagines, cystiene and aspartate are involvedin hydrogen bond formation.
3.4 Functional characterisation
Structure and functional characterisation of proteinswas done using their structure alignment with similarcharacterised proteins, obtained from sequence struc-ture alignment using fold library. All protein domainsrepresent transferase activity which is predicted in 3keto acyl CoA synthase like protein of Vitis viniferaand Oryza brachyantha (Table 2). Gcs from Strepto-myces coelicolor is a type III PKS with broad substrateflexibility. Chemler, J.A. and Buchholz, T.J., 2012 pro-vide unanticipated information about the interactionsbetween type III PKSs and ACPs that will facilitate en-gineering of novel polyketides. The Gcs (3v7i) structurecontains an unusual helical bundle of unknown functionthat appears to extend the dimerization interface. A
Interdiscip Sci Comput Life Sci (2014) 6: 1–14 7
Table 2 Conserved domain prediction: All conserved domains present in three uncharacterised protein inOryza brachyantha and Vitis vinifera
Domain Name Domain DescriptionOryza brachyantha
J3M9Z7 ORYBR
Oryza brachyantha
J3MFI6 ORYBR
Vitis vinifera
F6HH63 VITV
CHS like Chalcone and stilbene synthases; plant-specificPKS
2.97e-124 5.51e-129 4.14e-128
FAE1 CUT1 RppA FAE1/Type III PKS-like protein; The mem-bers of this family are described as 3-ketoacyl-CoA synthases
0e-00 0e-00 0e+00
ACP syn III C 3-Oxoacyl-ACP synthase III C terminal; Thisdomain is found on 3-Oxoacyl-ACP
4.66e-11 3.27e-08 4.51e-09
KAS III Ketoacyl-ACP synthase III (KASIII) initiatesthe elongation in type II
4.54e-15 5.29e-17 2.02e-17
Init cond enzymes “initiating” condensing enzymes are a subclassof decarboxylating condensing enzymes
1.19e-11 2.63e-15 1.40e-14
Chal sti synt C Chalcone and stilbene synthases, C-terminaldomain;
3.14e-07 2.56e-05 1.19e-06
Ketoacyl-synt su-perfamily
Ketoacyl-synthase superfamily 4.54e-15 5.29e-17 2.02e-17
Cond enzymessuperfamily
Condensing enzymes 1.19e-11 2.63e-15 1.40e-14
PLN02192,PLN02854,PLN02377
3-ketoacyl-CoA synthase 0e-00 0e-00 0e+00
PLN02932,PLN00415
3-ketoacyl-CoA synthase 0e-00 0e-00 0e-00
BcsA Predicted naringenin-CHS [Secondarymetabolites biosynthesis, transport]
7.90e-26 3.00e-26 8.39e-26
FabH 3-oxoacyl-[acyl-carrier-protein] 6.38e-16 3.57e-18 2.35e-11
PRK12879 3-oxoacyl-(acyl carrier protein) synthase III;Reviewed
2.16e-03 1.12e-04 4.10e-07
PLNO3169 Not characterized 7.09e-08 3.77e-11 3.77e-11
PRK09352 3-oxoacyl-(acyl carrier protein) synthase III;Reviewed
1.99e-06 1.14e-04 8.39e-05
PLN03172 CHS family protein; Provisional 1.29e-05 1.52e-07 4.10e-07
PLN03168 chalcone synthase; Provisional 1.49e-05 1.71e-05 1.06e-04
PLN02326 3-oxoacyl-[acyl-carrier-protein] synthase III 3.14e-03 Absent 2.91e-04
PRK05963 3-oxoacyl-(acyl carrier protein) synthase II;Reviewed
7.24e-03 Absent Absent
Chal sti synt N Chalcone and stilbene synthases, N-terminaldomain; This domain of chalcone
Absent 2.56e-05 Absent
PLNO3171 CHS-like protein; Provisional Absent 1.44e-06 3.90e-07
Chal sti synt Chalcone and stilbene synthases Absent Absent 1.19e-06
pair of arginine residues adjacent to the active site af-fect catalytic activity but not ACP binding. (Chemler,J.A., Buchholz, T.J., 2012).
PKSs generate molecular diversity by utilizing dif-ferent starter molecules for the biosynthesis of anti-microbial phytoalexins, anthocyanin floral pigments,and inducers of Rhizobium nodulation genes. Similar-ity of 3 keto acyl CoA synthase like protein of Vitisvinifera and Oryza brachyantha to Pyrone synthase and
CHS share a common three-dimensional fold, a set ofconserved catalytic residues, and similar CoA bindingsites, but four positions are varied in CHS (Jez, J.M.,Austin, M.B., 2001). KAS 2-pyrone synthase (1EE0)from gerbera hybrid provides information about its pos-sibility to involve in proanthocyanidine production. Bi-ological annotation from databases reveals anthocyaninbiosynthetic process which may be related to proantho-cyanidine production because sequence is obtained from
8 Interdiscip Sci Comput Life Sci (2014) 6: 1–14
Populus hybrid exposed to forest tent caterpillar.CHS (1i88) synthesizes a tetraketide by sequential
condensation of three acetyl anions which is derivedfrom malonyl-CoA decarboxylation to a p-coumaroylmoiety attached (Jez, J.M., Bowman, M.E., 2001) Simi-larity to KAS chalcone synthase 2 from Medicago sativareveal role of these 3 keto acyl CoA synthase like proteinof Vitis vinifera and Oryza brachyantha in Naringenineand CHS like activity.
Similarity to KAS-B-ketoacylsynthase III from Strep-tomyces sp.r1128 and KAS oxoacyl-ACP synthase fromThermus thermophilus reveal, role of these 3 keto acylCoA synthase like protein of Vitis vinifera and Oryzabrachyantha in B-ketoacyl-ACP synthase III activityand 3-oxoacyl-ACP synthase activity. B-keto acyl syn-thase III from Streptomyces sp.r1128 reveals an exten-sive loop region at the dimer interface that appears toaffect the selectivity for the primer unit. (Pan, H., Tsai,S., 2002)
Additionally, Similarity to KAS, 3-oxoacyl-ACP syn-thase III Mycobacterium tuberculosis give hypotheticalview of activities; fatty-acyl-coa binding, Catalytic ac-tivity, 3-oxoacyl-ACP synthase activity, Protein bind-ing and Transferase activity. This also shows theirinvolvement in protein homodimerisation. Protein ofβ-ketoacyl-ACP synthase III (mtFabH) of Mycobac-terium tuberculosis catalyze a Claisen-type condensa-tion (Scarsdale, J.N., Kazanina, G, 2001) B-ketoacyl-ACP synthase III (1HNJ) of E. coli also share KASdomain with these protein.
Protein J3MF16 of Oryza brachyantha has 17.74
score in sequence structure alignment which is highest.Presumably, this protein has activities which are dis-cussed above. Its similarity to KAS 2-pyrone synthase(1EE0) from Gerbera hybrid gives important clue aboutits possibility to involve in proanthocyanidine produc-tion. Allthough other three protein of P . Trichocarpax P.deltoides, Vitis viinfera, Oryza brachyantha alsocontain Chalcone and stilbene synthases (KAS) with13.08, 13.93, 16.02 score respectively. All similar struc-tures has transferase activity, transferring acyl groups,transferring acyl groups other than amino-acyl groups(Fig. 4, 5).3.4.1 Structural similarities and variations
among active site, dimer interface, prod-uct binding sites and malonyl CoA bind-ing site in 3 keto acyl CoA synthase ofVitis vinifera and Oryza brachyantha
Structure obtained from homology modelling of theseproteins were used for predictions of these sites andthier actual positions in the structure. Models wereevaluated on the basis of minimum energy and mini-mum disallowed regions in ramachandran plot (Fig. 6).Amino acids involved in fatty acid synthesis are il-lustrated, starter molecule is loaded onto the activesite cysteine, a carboxylative condensation reaction ex-tends the polyketide chain which incorporated intoplant lipids are derived from the iterative addition ofC2 units. Malonyl-CoA provide C2 unit to an acyl-CoAby the 3-ketoacyl-CoA synthase (KCS). Several malonylCoA binding site have been predicted, these are I-229,DQ-232-233, F-392, V-451, GRA-515-517, these are not
Fig. 4 β-ketoacyl-ACP synthase, type III and PKS: B-Ketoacyl-ACP Synthase III (Fabh) from Escherichia Coli (1EBL A),2-Pyrone Synthase(1 EE0 A) Complexed With Acetoacetyl-Coa from Gerbera hybrid cultivar and other relatedprotein sequences are aligned with active site in yellow colour (number denotes gap between amino acids residues).
Interdiscip Sci Comput Life Sci (2014) 6: 1–14 9
Fig. 5 Chalcone and stilbene synthases domain: Conserved and aligned amino acids are shown in various color according tosecondary structure (helix-red; sheet-blue; 310 helix-maroon; solvent accessible lower case; solvent inaccessible-uppercase, H- bond-main chain –amide-bold; H- bond-main chain carbonyl-underline; disulfide bond-cedilla, positive φtorsion angle-positive) depicted in 2-pyrone synthase (1ee0a; Gerbera hybrid) CHS 2 (1i88a; Medicago sativa), β-ketoacyl synthase III(1mzj; Streptomyces sp. r1128, 3-oxoacyl-ACP synthase III (1hzp; Mycobacterium tuberculosis)3-oxoacyl-[ACP] synthase (1ub7; Thermus thermophilus) β -ketoacyl ACP synthase III (1hnj; Escherichia coli).
16000
14000
12000
10000
8000
6000
4000
2000
0
Energy(kcal/mole)F6HH63_VITVI
Energy(kcal/mole)J3M9Z7_ORYBR
1 2 3 4 5 6 7 8 9 10Model
(a)
1.5
1.0
0.5
0
0.5
1.0
1.5
Nor
mal
ised
QM
EA
N4
scor
e
0 100
Comparison withnon-redundant set of PDB structures
200 300
(b) (c)
400
|Z-score|<1
1<|Z-score|<2
|Z-score|>2
Query model
500 600Protein size
Z-score QMEAN: 4.73Z-score Cbeta: 1.84Z-score all_atom: 3.52Z-score solvation: 1.08Z-score torsion: 3.82
1.5
1.0
0.5
0
0.5
1.0
1.5
Nor
mal
ised
QM
EA
N4
scor
e
0 100
Comparison withnon-redundant set of PDB structures
200 300 400
|Z-score|<1
1<|Z-score|<2
|Z-score|>2
Query model
500 600Protein size
Z-score QMEAN: 4.73Z-score Cbeta: 1.84Z-score all_atom: 3.52Z-score solvation: 1.08Z-score torsion: 3.82
|Z-score|<1
1<|Z-score|<2
|Z-score|>2
Query model
score QMEAN: 4.73score Cbeta: 1.84score all_atom: 3.52score solvation: 1.08score torsion: 3.82
|Z-score|<1
1<|Z-score|<2
|Z-score|>2
Query model
score QMEAN: 4.73score Cbeta: 1.84score all_atom: 3.52score solvation: 1.08score torsion: 3.82
Fig. 6 (a) Histogram depicting energy (-Kcal/mole) used for evaluation of modelled structures of Z3M9Z7 ORYBR andF6HH63 VITVI. (b) Z score was calculated using C β interaction energy, all atom pairwise energy, salvation energyand torsion angle energy for J3MF16 ORYBR model. (c) Then composite Qmean score was calculated using linearcombination of these four zscore (shown in different colour).
10 Interdiscip Sci Comput Life Sci (2014) 6: 1–14
(a) J3M9Z7_Oryza brachyantha (b) J3MF16_Oryza brachyantha
Structural variations among:Dimer interfaceProduct binding siteMalonyl CoA binding site
(c) F6HH63_Vitis vinifera
Fig. 7 Modeled structure of F6HH63 VITVI and J3M9Z7 ORYBR: Structural variation predicted among dimer interface(green), product binding site (blue) and malonyl Co-A binding sites (light orange) are shown in 3 keto acyl CoAsynthase like proteins that are vary with each other.
identical to 2 pyrone synthase (1ee0) but these aminoacids have similar properties. The GRA site is absent inprotein J3M9Z7 ORYBR. Other maolonyl CoA bindingsite in this protein are F-288, V-259, R-32, and N-35andC-36 (Fig. 7).
V-451 and GRA 515-517 is also predicted in KCS likeprotein J3MFI6 ORYBR at 414-416 and 353 positionsrespectively, other site are RC-131, T-128, F291, V-353, and GRA-414 (Table 3). This analysis providesannotation of biologically active sites.
In J3MF16, LR is present and LN is present in othertwo proteins F6HH63 VITVI and J3M9Z7 ORYBR.GNIISYNL is present in F6HH63 VITVI . GNV, ISYNand GNV, VSYN are present in J3MF16 ORYBRand J3M9Z7 ORYBR proteins. QVH is common inF6HH63 VITVI and J3MF16 ORYBR.
Product binding site in F6HH63 VITVI consist of
R, VGV, GV, which shows slight variations withJ3MFI6 ORYBR protein at sites IGV, AV. In place of‘V’, ‘I’ is present and A is present in place of G and R isabsent. In J3M9Z7 ORYBR protein R, TGV, AV, RFwere predicted as product binding sites. T is present inplace of V/I.
Mutated Malonyl CoA binding sites inF6HH63 VITVI are I, DQ, GRA and in J3MFI6ORYBR In other proteins, these sites are RC, TRA,GRA. GRA is absent in J3M9Z7. Malonyl CoAbinding sites in J3M9Z7 ORYBR) are R, N, Cwhich are different from other two proteins (Table4).
When biological active sites of three protein werecompared with A9PIU6 9ROSI protein of Populus tri-chocarpa. All the active site and product binding sitesare common in J3MFI6 ORYBR and J3M9Z7 ORYBR
Interdiscip Sci Comput Life Sci (2014) 6: 1–14 11
Table 3 Results on structure sequence alignment: Domain related to fatty acid synthesis pathway are shownbelow, these domain are predicted in KCS like protein of Vitis vinifera and Oryza brachyantha(J3MFI6 ORYBR and J3M9Z7 ORYBR) with 13.08, 17.74 and 15.81 z score. For putative PKSz score is 14.23 which s only predicted in Vitis vinifera. Structural & functional annotation isobtained from PDB and PDBe database
S.N Profile hit/PDB Code(HOMSTRAD)
Source Name Functional characterization (PDBe; knowl-edgebase)
1 hs3v7ia putativepolyketide synthase(3v7i) Germicidinsynthase
Streptomycescoelicolor
Germicidinsynthase (Gcs)
Catalytic activity, transferase sctivity(Only predicted in Vitis vinifera)
2 KAS2-pyrone synthase(1ee0)
Gerberahybrida
PKSs Catalytic activity, Transferase activity, biosyn-thesis of antimicrobial phytoalexins, antho-cyanin floral pigments
KAS CHS 2. (1i88) Medicagosativa
CHS Catalytic activity, Naringenin-chalcone syn-thase activity,
KASβ−ketoacylsynthaseIII.(1mzj)
Streptomycessp.r1128
ZhuH isa primingketosynthase
Catalytic activity, 3-oxoacyl-ACP synthase ac-tivity, B-ketoacyl-ACP synthase III activity.
KAS3-oxoacyl-ACPsynthase III. (1hzp)
Mycobacteriumtuberculosis
An usual β-ketoacyl ACP synthaseIII (mtFabH)
Fatty-acyl-coa binding, Catalytic activity, 3-oxoacyl-ACP synthase activity, Protein bind-ing, B-keto acyl-ACP synthase III activity,Protein homodimerization activity
KASoxoacyl-ACPsynthase (1ub73)
Thermusthermophilus
oxoacyl-ACPsynthase
Catalytic activity, 3-oxoacyl-ACP synthase ac-tivity, B-ketoacyl-ACP synthase III activity
KAS. (1hnj) Escherichiacoli
B−ketoacyl-acylcarrier proteinsynthase III
Catalytic activity, 3-oxoacyl-ACP synthase ac-tivity, B-ketoacyl-ACP III activity.
Table 4 Amino acids at different active site related to fatty acid synthesis are illustrated. Common sitesare shown in bold
Source Organism/Protein name
Activesite
Product bindingsite
Malonyl CoAbinding site
Dimer interface
Vitis viniferaF6HH63|F6HH63 VITVI
C-344,H-512,N-545
ENI-373-375,R-395, VGV-445-447, GV-455,S-545, S-575
I-229, DQ-232-233, F-392, V-451, GRA-515
LN-68, P-72, PT-317, A-323, N-327, GNIISYNL-333-340, MG-343-344, DL-353-354, D-357, QVH-360-362, T-420, RTH-422-424, VSLS-447-450, K-580
Oryza brachyanthatr|J3MFI6|J3MFI6 ORYBR
C-244,H-411,N-442
ENI-272, IGV-344, AV-354, S-444, S-474
RC-131, T-128F291, V-353,GRA-414
LR-167-168, P-171, PT-215-216,A221, N225, GNV-231-233, ISYN-235-238, MG-242-243, DL-252-253,D-256, QVH-259-262, T-319, RTH-321-323, VSLS-344-347, K-479
Oryza brachyanthaJ3M9Z7 ORYBR
C-240,H-407,N-447
ENI-176-178,R-198, TGV-338-340, AV -348-349,S-450, RF-345
F-288, V-259,R-32, N-35and C-36
LN-163-164, P-167, A-211, N221,GNV-227-229, VSYN-231-234, MG-238-239, DL-248-249, D252,QW-255-256, T315, RTH-317-319,VSLS-342-345, K475
and A9PIU6 9ROSI, few product binding sites and mal-ony CoA binding sites were present in F6HH63 VITVIbut not present in A9PIU6 9ROSI protein. There areno common residues in malonyl CoA binding site and
dimer interface of A9PIU6 9ROSI, J3MF16 ORYBRand J3M9Z7 proteins. Proteine F6HH63 VITVI of vitisvinifera shows common site in dimer interface and twomalonyl CoA binding site (Table 5).
12 Interdiscip Sci Comput Life Sci (2014) 6: 1–14
(a) (b)
Fig. 8 Structure- structure alignment of modeled structure obtained from homology modeling (a) putative PKS (pdb-3v7i;gray colour) of Streptomyces coelicolor is aligned (green)with structure (shown in cornflower blue colour) of Vitisvinifera (b) Structurally aligned and conserved amino acids are shown in peach colour for Putative PKS (pdb-3v7i)of Streptomycin coelicolor, aligned with structure model of Vitis vinifera.
(a) (b)
(c) (d)Fig. 9 (a) KAS chalcone synthase 2(1i88 pdb) structure of Medicago sativa is aligned with J3M9Z7 protein structure of
Oryza brachyantha. (b) Structurally aligned and conserved amino acids are shown in peach colour for domainKAS chalcone synthase 2(1i88 pdb) structure of Medicago sativa aligned with J3M9Z7 protein structure of Oryzabrachyantha. (c) KAS 2-pyrone synthase (1ee0; gray colour) of Gerbera hybrida is aligned with J3MF16 proteinstructure model (shown in cornflower blue colour) of Oryza brachyantha. (d) Structurally aligned and conservedamino acids are shown in peach colour for KAS 2-pyrone synthase (pdb-1ee0; gray colour) of Gerbera hybrid, alignedwith J3MF16 protein structure model (shown in cornflower blue colour) of Oryza brachyantha.
Interdiscip Sci Comput Life Sci (2014) 6: 1–14 13
Table 5 Comparative account of biologically active sites of three proteins with A9PIU6 9ROSI protein ofPopulus trichocarpa
Source Organism/
Protein name
Activesite
Productbinding site
Malonyl CoAbinding site
Dimer Interface
Populus trichocarpaA9PIU6 9ROSI
C-252H-419N-452
ENI-280, R-302, VGV-352,AV-362, S-454,S-484
T-137, ET-140,F-301, V-363,GRA-422
PR-179, M-183, PT-224, A-230, N-234,GNILSYNL-240, MG-250, DL-260, Q-264,QVH-267, C-338, CV-341, VSLS-354, K-487
Vitis viniferaF6HH63 VITVI
C-344,H-512,N-545
ENI-373-375, R-395,VGV-445-447, GV-455,S-545, S-575
I-229, DQ-232-233, F-392, V-451, GRA-515
LN-68, P-72, PT-317, A-323, N-327, GNIISYNL-333-340, MG-343-344, DL-353-354, D-357, QVH-360-362, T-420, RTH-422-424, VSLS-447-450, K-580
Oryza brachyanthaJ3MFI6 ORYBR
C-244,H-411,N-442
ENI-272,IGV-344,AV-354,S-444, S-474
RC-131, T-128F291, V-353,GRA-414
LR-167-168, P-171, PT-215-216, A221,N225, GNV-231-233, ISYN-235-238, MG-242-243, DL-252-253, D-256, QVH-259-262, T-319, RTH-321-323, VSLS-344-347,K-479
Oryza brachyanthaJ3M9Z7 ORYBR
C-240,H-407,N-447
ENI-176-178,R-198, TGV-338-340, AV-348-349, S-450, RF-345
F-288, V-259,R-32, N-35andC-36
LN-163-164, P-167, A-211, N221, GNV-227-229, VSYN-231-234, MG-238-239,DL-248-249, D252, QW-255-256, T315,RTH-317-319, VSLS-342-345, K-475
4 Conclusion
This study provides functional characterization ofthree uncharacterised protein 3 ketoacyl CoA syn-thase like protein in Vitis vinifera and Oryza brachyan-tha with 93.7% and two 3 keto acyl Co synthase inOryza brachyantha with 87.0 and 97.2 similarities to3 keto acyl CoA synthase of Populus hybrid. Se-quence analysis reveals uncharacterized proteins in Vi-tis Vinifera and Oryza brachyantha. One protein inOryza brachyantha is recommended as 3 keto acyl CoAsynthase, Although 3 keto acyl CoA synthase has beencharacterised in several plants and Oryza brachyan-tha; still there is need of potential research to func-tionally characterise of 3 keto acyl CoA synthase pro-tein in Oryza brachyantha. Conserved domains analy-sis reveals presence of pfam08392, cd00830, cd00827,COG3424, PLN03169, cd00831 which are related toflavonoid pathways. This study provides structuraland functional annotation of 3 keto acyl CoA synthaseby comparative analysis of protein sequences in plantsand provides annotation of proteins in Vitis viniferaand Oryza brachyantha. These proteins show pres-ence of Nineteen (19) domains in these three organ-isms. Protein J3MFI6 ORYBR of Oryza brachyanthashow presence of chalcone and stilbene synthase do-main. All three proteins were found to have C ter-minal domain which is similar to domain of thiolaseand β keto acyl synthase. N terminal domain is absentin J3M9Z7 protein of Oryza brachyantha and F6HH63pritein of Vitis vinifera. Differences in N-terminal
domain is responsible for distinguish activity. Max-imum homologs of 3 keto acyl CoA synthase of Vi-tis vinifera and Oryza brachyantha were predicted inGlycine max, Vitis vinifera and Cucumus sativus. KCSHomologs of Oryza brachyantha in Brachypodium dis-tachyon are more similar than other homologs. Gcs(hs3v7ia), a type III PKS putative PKS from strepto-myces coelicolor and Chalcone and stilbene synthases(KAS) from 2-pyrone synthase (Gerbera hybrid) do-main were predicted.Similarities to CHS 2 and 2 py-rone synthase suggest their role in chalcone, naringe-nine synthase activity and anthocyanin biosynthetic ac-tivity respectively. Similarities to these structures maybe valuable information for characterisation of 3 ketoacyl CoA synthase in Vitis viifera and Oryza brachyan-tha. This study confirms that both N and C terminaldomain of chalcone and stilbene synthase are present inprotein J3MFI6 ORYBR, thus it is confirmed that thisprotein perform functions of Chalcone and stilbene syn-thase. All though full domain is absent in this protein.The C-terminal domain of CHS is reported to be struc-turally similar to domains in thiolase and β ketoacylCoA synthase.
In all these sites GRA and V is present in J3MF16and F6HH63 protein at various positions of these threeplant species, but GRA is absent in J3M9Z7 Activesite amino acids Cysteine, Histidine and Asparagine areconserved in protein of three plant species. Cysteine isnecessary to load the starter molecule in 3 keto acylCoA synthase. These sites are not identical but similarto 2-pyrone synthase structure (1ee0) of Gerbera hybrid.
14 Interdiscip Sci Comput Life Sci (2014) 6: 1–14
These sites may be modified by site directed mutagene-sis and genetic engineering approach to enhance the bi-ological activity in KCS like protein of Oryza brachyan-tha and Vitis vinifera. Molecular dynamic simulationmight be useful approach to define precisely the struc-tural and functional behavior of product binding andmalonyl CoA binding sites.
References
[1] Arnold, K., Bordoli, L., Kopp, J., Schwede, T. 2006.The SWISS-MODEL Workspace: A web-based en-vironment for protein structure homology modeling.Bioinformatics 22, 195-201.
[2] Benkert, P., Biasini, M., Schwede, T. 2011. Toward theestimation of the absolute quality of individual proteinstructure models. Bioinformatics 27(3), 343-350.
[3] Chothia, C., Lesk, A.M. 1986. The relation betweenthe divergence of sequence and structure in proteins.EMBO J. 5, 823-826.
[4] Combet, C., Jambon, M., Deleage, G., Geourjon, C.2002. Geno3D: automatic comparative molecular mod-elling of protein. Bioinformatics 18(1), 213-214.
[5] Chen, J.F., Huang, Q.F., Gao, D.Y., Wang, J.Y.,Lang, Y.S., Liu, T.Y., Li, B., Bai, Z.T., Goicoechea,J.L., Liang, C.Z., Chen, C.B., Zhang, W.L., Sun, S.H.,Liao, Y., Zhang, X.M., Yang, L., Song, C.L., Wang,M.J., Shi, J.F., Liu, G., Liu, J.J., Zhou, H.L., Zhou,W.L., Yu, Q.L., An, N., Chen, Y., Cai, Q.L., Wang, B.,Liu, B.H., Min, J.M., Huang, Y., Wu, H.L., Li, Z.Y.,Zhang Y., Yin, Y., Song, W.Q., Jiang, J.M., Jack-son, S.A., Wing, R.A., Wang, J., Chen, M.S. 2013.Whole-genome sequencing of Oryza brachyantha re-veals mechanisms underlying Oryza genome evolutionNature Communication 1595.
[6] Lee, S.B., Jung, S.J., Go, Y.S., Kim, H.U., Kim, J.K.,Cho, H.J., Park, O.K., Suh, M.C. 2009. Two Ara-bidopsis 3-ketoacyl CoA synthase genes, KCS20 andKCS2/DAISY, are functionally redundant in cuticularwax and root suberin biosynthesis, but differentiallycontrolled by osmotic stress. Plant J. 60(3), 462-475.
[7] Mizuguchi, K., Deane, C.M., Blundell, T.L., Overing-ton, J.P. 1998. HOMSTRAD: a database of protein
structure alignments for homologous families. ProteinScience 7, 2469-2471.
[8] Marchler-Bauer, A., Zheng, C., Chitsaz, F., Der-byshire, M.K., Geer, L.Y., et al. 2013. CDD: conserveddomains and protein three-dimensional structure. Nu-cleic Acids Research 41, D384-D352.
[9] Palo, R.T. 1984. Distribution of birch (Betula Spp),willow (Salix Spp), and poplar (Populus Spp) Sec-ondary metabolites and their potential role as chemicaldefence against herbivores. J. Chem. Ecol. 10, 499-520.
[10] Pettersen, E.F., Goddard, T.D., Huang, C.C., Couch,G.S., Greenblatt, D.M., Meng, E.C., Ferrin, T.E. 2004.UCSF Chimera–a visualization system for exploratoryresearch and analysis. J. Comput. Chem. 25(13), 1605-1612.
[11] Quevillon, E., Silventoinen, V., Pillai, S., Harte, N.,Mulder, N., Apweiler, R., Lopez R. 2005 InterProScan:protein domains identifier, Nucleic Acids Research,332.
[12] R. Thamos Palo: Distribution of birch (Betula SPP.),willow (Salix SPP.), and poplar (Populus SPP.) sec-ondary metabolites and their potential role as chem-ical defense against herbivores. Journal of ChemicalEcology 10(3), 499-520.
[13] Robin, D. Mellway, Lan, T. Tran, Michael, B. Prouse2009. The Wound-, Pathogen-, and Ultraviolet BRe-sponsive MYB134 Gene Encodes an R2R3 MYB Tran-scription Factor That Regulates ProanthocyanidinSynthesis in Poplar1. Plant Physiology, American So-ciety of Plant Biologists 150, 924-941.
[14] Schwede, T., Kopp, J., Guex, N., Peitsch, M.C. 2003.SWISS-MODEL: an automated protein homology-modeling server. Nucleic Acids Research 31,3381-3385.
[15] Shi, J., Blundell, T.L., Mizuguchi, K. 2001. Sequence-structure homology recognition using environment spe-cific substitution tables and structure-dependent gappenalties. J Mol Biol. 310(1), 243-57.
[16] Smith, T.F., Waterman, M.S. 1981. Identification ofcommon molecular subsequences. J. Molecular Biology147, 195-197.