computational strategies to address chromatin structure problems · 2016-07-01 · lys5(mammals,s....

18
Phys. Biol. 13 (2016) 035006 doi:10.1088/1478-3975/13/3/035006 PAPER Computational strategies to address chromatin structure problems Ognjen Perišić 1 and Tamar Schlick 2,3 1 Big Blue Genomics, 11000 Belgrade, Serbia 2 Department of Chemistry, New York University, NY 10003, USA 3 Courant Institute of Mathematical Sciences, New York University, NY 10012, USA E-mail: [email protected] Keywords: chromatin structure, molecular modeling, molecular dynamics, coarse grained modeling Abstract While the genetic information is contained in double helical DNA, gene expression is a complex multilevel process that involves various functional units, from nucleosomes to fully formed chromatin bers accompanied by a host of various chromatin binding enzymes. The chromatin ber is a polymer composed of histone protein complexes upon which DNA wraps, like yarn upon many spools. The nature of chromatin structure has been an open question since the beginning of modern molecular biology. Many experiments have shown that the chromatin ber is a highly dynamic entity with pronounced structural diversity that includes properties of idealized zig-zag and solenoid models, as well as other motifs. This diversity can produce a high packing ratio and thus inhibit access to a majority of the wound DNA. Despite much research, chromatins dynamic structure has not yet been fully described. Long stretches of chromatin bers exhibit puzzling dynamic behavior that requires interpretation in the light of gene expression patterns in various tissue and organisms. The properties of chromatin ber can be investigated with experimental techniques, like in vitro biochemistry, in vivo imagining, and high-throughput chromosome capture technology. Those techniques provide useful insights into the bers structure and dynamics, but they are limited in resolution and scope, especially regarding compact bers and chromosomes in the cellular milieu. Complementary but specialized modeling techniques are needed to handle large oppy polymers such as the chromatin ber. In this review, we discuss current approaches in the chromatin structure eld with an emphasis on modeling, such as molecular dynamics and coarse-grained computational approaches. Combinations of these computational techniques complement experiments and address many relevant biological problems, as we will illustrate with special focus on epigenetic modulation of chromatin structure. Introduction Celebrated as one of the highest achievements in science, the rst mapping of the human genome in the dawn of the 21st century [1, 2], raised fundamental questions that may take a whole century to answer satisfactorily. The human genome, and genomes of all other higher organisms, contain much less genes than expected, but the amount of genes expressed in majority of tissues is higher than anticipated [3]. Moreover, simpler organisms can have comparable or even higher number of genes than human. These ndings suggest that the diversity of living species stems not only from the multitude of genes, but also from many ways of their expression [3]. Although short segments of RNA can have enzymatic roles, they are not sufcient to account for the variety of cell types and cell signaling processes. Chromatin, the ber that stores the genomic material in eukaryotes, holds one of the keys to the connection between the limited number of genes and the complex nature of higher organisms. Chromatins primary role, to compress 2 m of total genomic DNA in humans into a micro- meter sized cell nucleus, is accompanied by a more active role, namely the control of gene expression. The chromatin ber is made of double helical DNA wrapped around protein-octamer globules. Nucleosomes, chromatins building blocks, are built of about 147 base pairs of DNA wrapped around highly conserved, four histone proteins (H2A, H2B, H3 and H4) [4], with the addition of the dynamically bound linker histone (LH) H1/H5. The interplay RECEIVED 1 December 2015 REVISED 11 March 2016 ACCEPTED FOR PUBLICATION 8 April 2016 PUBLISHED 24 June 2016 © 2016 IOP Publishing Ltd

Upload: others

Post on 27-Jan-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

Phys. Biol. 13 (2016) 035006 doi:10.1088/1478-3975/13/3/035006

PAPER

Computational strategies to address chromatin structure problems

Ognjen Perišić1 andTamar Schlick2,31 Big BlueGenomics, 11000Belgrade, Serbia2 Department of Chemistry, NewYorkUniversity, NY10003,USA3 Courant Institute ofMathematical Sciences, NewYorkUniversity, NY 10012,USA

E-mail: [email protected]

Keywords: chromatin structure,molecularmodeling,molecular dynamics, coarse grainedmodeling

AbstractWhile the genetic information is contained in double helical DNA, gene expression is a complexmultilevel process that involves various functional units, fromnucleosomes to fully formed chromatinfibers accompanied by a host of various chromatin binding enzymes. The chromatinfiber is a polymercomposed of histone protein complexes uponwhichDNAwraps, like yarn uponmany spools. Thenature of chromatin structure has been an open question since the beginning ofmodernmolecularbiology.Many experiments have shown that the chromatin fiber is a highly dynamic entity withpronounced structural diversity that includes properties of idealized zig-zag and solenoidmodels, aswell as othermotifs. This diversity can produce a high packing ratio and thus inhibit access to amajority of thewoundDNA.Despitemuch research, chromatin’s dynamic structure has not yet beenfully described. Long stretches of chromatinfibers exhibit puzzling dynamic behavior that requiresinterpretation in the light of gene expression patterns in various tissue and organisms. The propertiesof chromatinfiber can be investigatedwith experimental techniques, like in vitro biochemistry, in vivoimagining, and high-throughput chromosome capture technology. Those techniques provide usefulinsights into thefiber’s structure and dynamics, but they are limited in resolution and scope, especiallyregarding compactfibers and chromosomes in the cellularmilieu. Complementary but specializedmodeling techniques are needed to handle largefloppy polymers such as the chromatinfiber. In thisreview, we discuss current approaches in the chromatin structurefieldwith an emphasis onmodeling,such asmolecular dynamics and coarse-grained computational approaches. Combinations of thesecomputational techniques complement experiments and addressmany relevant biological problems,as wewill illustrate with special focus on epigeneticmodulation of chromatin structure.

Introduction

Celebrated as one of the highest achievements inscience, the first mapping of the human genome in thedawn of the 21st century [1, 2], raised fundamentalquestions that may take a whole century to answersatisfactorily. The human genome, and genomes of allother higher organisms, contain much less genes thanexpected, but the amount of genes expressed inmajority of tissues is higher than anticipated [3].Moreover, simpler organisms can have comparable oreven higher number of genes than human. Thesefindings suggest that the diversity of living speciesstems not only from the multitude of genes, but alsofrom many ways of their expression [3]. Althoughshort segments of RNA can have enzymatic roles, they

are not sufficient to account for the variety of cell typesand cell signaling processes. Chromatin, the fiber thatstores the genomic material in eukaryotes, holds oneof the keys to the connection between the limitednumber of genes and the complex nature of higherorganisms. Chromatin’s primary role, to compress2 m of total genomic DNA in humans into a micro-meter sized cell nucleus, is accompanied by a moreactive role, namely the control of gene expression.

The chromatin fiber is made of double helicalDNA wrapped around protein-octamer globules.Nucleosomes, chromatin’s building blocks, are builtof about 147 base pairs of DNA wrapped aroundhighly conserved, four histone proteins (H2A, H2B,H3 and H4) [4], with the addition of the dynamicallybound linker histone (LH) H1/H5. The interplay

RECEIVED

1December 2015

REVISED

11March 2016

ACCEPTED FOR PUBLICATION

8April 2016

PUBLISHED

24 June 2016

© 2016 IOPPublishing Ltd

Page 2: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

between positively charged tails protruding from thehistone cores with DNA and neighboring nucleosomeparticles and various chromatin binding factors con-trols the level of chromatin compaction [5, 6], fromaccessible double stranded DNA to fully formed chro-mosomes. That level may determine the type of cell, itsstatus, and its future. Modern day genomics coupledwith the next generation sequencing [7–9] reveals thatthe perturbation of the gene expression patterns candisrupt cell-signaling processes and affect humanhealth [10, 11]. High throughput genome studiesaimed at deciphering the spatial connectivity of chro-mosomal genomic regions [12] are suggesting that thestructural diversity of chromosomes is related to geneexpression patterns [13] and that genes and corresp-onding regulatory elements can be, sequentially andspatially, far apart in the genome [13, 14]. Therefore,deciphering chromatin structure and dynamics is ofthe crucial importance for understanding gene expres-sion control in various organisms and tissues.

In this paper, we describe challenges associatedwith chromatin structure and the histone code, andthen outline modeling approaches to study chromatinstructure, dynamics, and the effect of epigenetic mod-ifications on chromatin structure. We end by advocat-ing code/resource sharing approaches to accelerateboth experimental and modeling research in this areaand to helpmake the necessary links between them.

Chromatin structure and the histone code

The internal organization of chromosomes and thecorresponding structure of the chromatin fiber are stillopen questions [15, 16]. The internal organization ofthe 30 nm fiber, often observed in vitro via variousoptical techniques, was for a long time a holy grail ofstructural biology [17]. Two theoretical models weredeveloped to address the 30 nm organization, one-start solenoid with bent DNA linkers [18, 19] and two-start zig-zag with straight linkers [20–22]. Zig-zagfibers are often observed under idealized experimentalconditions [23–27], withmild ionic environments thatdiscourage linker bending. Multiple experiments andsimulations showed that the chromatinfiber is a highlydynamic entity with a very pronounced structuraldiversity sharing properties of both theoretical models[15, 16, 28, 29]. For example, cryo-electron micro-scopy and synchrotron x-ray scattering experimentsobserved a fractal-like organization of chromosome inhuman mitotic HeLa cells, without prominent 30 nmfiber like structures [30]. The absence of 30 nm motifwas also demonstrated in recent cross-linking experi-ments of HeLa cells in combination with modelingthat revealed zigzag motifs associated with hierarchi-cally looped chromatin fiber in interphase and meta-phase chromatin [31]. Real-world fibers interact withvarious enzymatic particles that disrupt the simpletwo-start nucleosome ordering. They do not fold into

identically ordered structures in every cell because theentropic cost of such a folding process would beenormous [32]. Increased ion concentrations screenelectrostatic repulsion between positively chargedDNAfibers and allow their bending [28]. The naturallyoccurring variability of DNA linkers connectingsuccessive nucleosome cores introduces a diversity inthe fiber structure [28, 33, 34]. The H1/H5 LHdynamically binds to the nucleosome between twoDNA strands (entering and exiting), draws themtogether, and establishes the nucleosome stem.

Prominent sources of fiber diversity are post-translational modifications of histones and histonetails [35–39]. Histone tails are typical targets for mod-ification because they have prominent roles in inter-and intra-nucleosomal interactions and interactionswith DNA linkers [28, 35, 39–41]. Modifications ofhistone core and tails also influence the stability anddynamics of nucleosomes [36, 38, 42–46], althoughearly experiments did not observe a prominent role oftails in nucleosome stabilization [47, 48]. Therefore,histone modifications either induce partial unfoldingof the chromatin fiber (euchromatin) or stabilize atightly lockedfiber (heterochromatin).

Enzymes introducing histone modifications areusually very selective [49]. They act upon a specificresidue or a small group of selected residues. The threemajor covalent histone tail modifications are acetyla-tion, phosphorylation, and methylation, see figure 1.These chemical changes reduce the tails charges andmodify tails dynamics and thus influence their interac-tions with neighboring nucleosomes andDNA linkers.Other notable modifications with similar influencesare ubiquitylation, sumoylation, ADP ribosylation,histone tail clipping, histone proline isomerizationandβ-N-acetylglucosaminemodification [5].

The histone modifications have been a researchtopic for many years, but their importance came toprominence with the realization that the DNA con-tains much less genes than expected, and that theirexpression strongly depends on those heritable [49],albeit reversible modifications. Seemingly simple,these chemical changes produce perplexing effects,alone or in combination with other factors [5]. Acet-ylation neutralizes the positive charge of lysine and inthis manner weakens interactions between positivelycharged histones and negatively charged DNA andexposes DNA to transcription mechanisms [49–55]. Itis also involved in gene silencing [56], and DNAdamage response [57–59]. Acetylation affects numer-ous lysines on all tails (see table 1), and the high num-ber of possible sites where it may occur is an indicationof the existence of hyper-acetylated regions that aredevoid of tail charges and thus transcriptionally active[60, 61]. Phosphorylation adds a negative charge toserines, threonines and tyrosines, mostly in the his-tone N-terminal tails, but its mechanism of action isnot based on electrostatic screening only. It is a highlydynamic, site-specific modification active in mitosis,

2

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 3: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

meiosis, transcription activation, cell death, DNArepair, DNA replication and recombination [62–67](see table 1). Phosphorylation is a strong signaling fac-tor dependent on the cell cycle, and its dysregulation ishighly correlated with cancer [66, 68]. Methylation is aphysically small modification that does not alter thecharge of a histone protein but can have a profoundeffect as a signaling factor for various cell processes. Itmodifies all histone molecules, including LHs. Methy-lation is associated with transcriptional activation,repression, silencing, euchromatin formation, andantagonistic or supportive cross-talks with othermod-ifications as well as with DNA methylations [69–73](see table 1). Ubiquitylation produces the largest phy-sical modification of all reactions mentioned herebecause it attaches ubiquitin, a 76-amino acid poly-peptide, to the lysine’s Ɛ-amino group. Ubiquitin is acell-signaling factor mostly involved in the degrada-tion of proteins [74]. Histone ubiquitylation induces achange of the overall nucleosome conformation and inturn disrupts intra and inter nucleosomal interactions,as well as interactions with DNA or with other chro-matin-binding factors. It has a role in gene silencing,meiosis, transcriptional activation and in euchromatinformation [75–77]. The same enzymes that attach ubi-quitin are involved in sumoylation [78], a reaction thatattaches small, an ubiquitin-likemodifiermolecules tohistone lysines. This chemical change antagonizesacetylations and ubiquitylations that address the samelysine side chains [79, 80], and it is primarily associatedwith the transcriptional repression, though its directphysical effects are not yet known [5, 77, 79, 81]. Bioti-nylation is a modification that covalently attaches the

vitamin biotin to a protein. Biotin is a small molecule,when compared to ubiquitin, but its binding to his-tone proteins influences gene expression, cell pro-liferation [82], transcription repression and histonedimerization [83], as well as the cell’s response toDNAdamage [84, 85]. Biotinylation may affect other his-tone modifications via crosstalk [82]. Gene expressioncan be also regulated though histone tail clipping, aprocedure that removes residues from a histone tail.The procedure is present in many systems, from yeasttomammals [86–88].

The effects of histone modifications are sequenceand context dependent. Besides directly affectingchromatin structure, they often attract factors thatinduce or remove modifications on neighboring resi-dues, or they impair access to gene promoting or sup-pressing factors [72, 89]. Biochemical andphysiological effects caused by histone modificationscannot be interpreted solely through their linear com-bination and could be perceived as a type of code thatcontrols gene expression [90]. This histone code is,therefore, complementary to the four-letter geneticcode that holds information for protein synthesis[90–92].

Chromatinmodeling

The roles of histone components and histone mod-ifications in chromatin structure formation can beaddressed using various biophysical, biochemical,genetic or epigenomic experimental techniques.These techniques offer insights into the structure, andmore importantly dynamics of chromatin fibers

Figure 1.Positions of targets formajor covalent histonemodifications (pdb ID 1KX5). The histone octamer is white andDNA is pinkand cyan. Acetylations are colored red, phosphorylation blue, ubiquitylations, sumoylations and biotinylations green, andmethylations yellow. The linker histone (PDB IDs 1HST and 1C7Y) is gray.

3

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 4: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

Table 1. List ofmajor histone covalentmodification (A—acetylation, P—phosphorylation, U—ubiquitylation, S—sumoylation,M—methylation) listed by the histone and sequence position. The number of histonemodifying factors (9th column) represents the relativeimportance a residue haswithin a histone code.

Tail Residue A P U S B M#modify-ing factors Description

Lys26 x 1 Transcription silencing [72, 89]

Ser27 x 1 Transcription silencing, chromatin decon-densation [72, 89]

Lys48 x 1 OxidativeDNAdamage [38, 93, 94]

Arg54 x 1 Chromatin compaction, transcription [95]

H1 Lys65 x x 2 OxidativeDNAdamage [94, 96]

Lys66 x 1 OxidativeDNAdamage [94, 96]

Tyr73 x 1 Unknown [96]

Lys92 x x 2 OxidativeDNAdamage [93, 96]

Lys99 x 1 Unknown [93]

Ser1 x 1+ Mitosis, chromatin assembly, transcriptionrepression [97, 98]

Arg3 x 4 Transcription activation, transcriptionrepression [73]

Lys4 (S. cerevisiae) x 1 Transcription activation [99]

Lys5 (mammals, S.cerevisiae)

x 3 Transcription activation, unknown[50, 100, 101]

Lys7 (S. cerevisiae) x 1 Transcription activation [99, 102]

Lys9 x 1 Transcription repression [83]

Lys13 x 1 Transcription repression [83]

Lys36 x 1 Unknown [96, 103]

Arg42 x 1 Unknown [38, 96]

Lys74 x 1 Unknown [104]

Lys75 x 1 Unknown [104]

Arg77 x 1 Unknown [104]

H2A Arg88 x 1 Unknown [96]

Lys95 x 1 Unknown [104]

Gln105 x 1 Transcription [105]

Lys119 (mammals) x 1 Gene silencing [75]

Thr119 (D.melanogaster) x 1 Mitosis [106]

Thr120 (mammals) x 2 Mitosis, transcription repression [107, 108]

Ser122 (S. cerevisiae) x 1 DNA repair [64]

Lys125 x 1 Histone dimerization [83]

Lys126 (S. cerevisiae) x 1 Transcription repression [81]

Lys127 x 1 Histone dimerization [83]

Lys129 x 1 Histone dimerization [83]

4

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 5: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

Table 1. (Continued.)

Tail Residue A P U S B M#modify-ing factors Description

Ser129 (S. cerevisiae) x 2 DNA repair [109, 110]

Ser139 (mammalianH2A.X)

x 3 DNA repair [111–113]

Thr142 (mammalianH2A.X)

x 1 Apoptosis, DNA repair [114]

Lys5 x 2 Transcription activation [50, 115]

Lys6 (S. cerevisiae) x 1 Transcription repression [81]

Lys7 (S. cerevisiae) x 1 Transcription repression [81]

Ser10 (S. cerevisiae) x 1 Apoptosis [116]

Lys11 (S. cerevisiae) x 1 Transcription activation [102]

Lys12 (mammals) x 2 Transcription activation [50, 115]

Ser14 (vertebrates) x 1+ Apoptosis, DNA repair [63, 117]

Lys15 (mammals) x 2 Transcription activation [50, 115]

Lys16 (mammals) x 2 Transcription activation [102]

H2B Lys20 x x 2 Transcription activation [50]

Lys23 x Unknown [96]

Ser33 (D.melanogaster) x 1 Transcription activation

Ser36 x 1 Transcription activation [118]

Lys43 x 1 Unknown [103]

Lys57 x 1 Unknown [96]

Arg79 x 1 Unknown [96]

Lys85 x 1 Unknown [96]

Arg99 x 1 Unknown [96]

Lys120 (mammals) x 1 Meiosis, development [76]

Lys123 (S. cerevisiae) x 1 Transcription activation, elongation,euchromatin [77]

Arg2 x 2 Transcription repression [73]

Thr3 x 2 Mitosis [119]

Lys4 (S. cerevisiae) x x x 8 Transcription activation (tri-me), geneexpression, cell proliferation, permissiveeuchromatin (di-me) [82, 99, 120–124]

Thr6 x 1 Unknown [125]

Arg8 x 3 Transcription activation, transcriptionrepression [126, 127]

Lys9 x x x 10 Transcription activation, transcriptionrepression (tri-me), genomic imprinting,histone deposition, gene expression, cellproliferation [51, 70, 71, 82, 128–131]

Ser10 x 5 Transcription activation,mitosis,meiosis,immediate-early gene activation, DNAmethylation [62, 132–135]

5

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 6: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

Table 1. (Continued.)

Tail Residue A P U S B M#modify-ing factors Description

Thr11 (mammals) x 2 Mitosis [136]

Lys14(12) x 12 Transcription activation (elongation), DNArepair, transcription control, histonedeposition, RNApolymerase II transcrip-tion, cell growth [50, 51, 56–58, 99, 100, 120, 128, 129, 137–139]

Arg17 x 1 Transcription activation [140]

Lys18 x x 3 Transcription activation, histone deposi-tion, gene expression, cell proliferation[50, 51, 82, 140]

H3 Lys23 x 3 Transcription activation, histone deposi-tion,DNA repair [50–52, 128, 140]

Arg26 x 1 Transcription activation [73]

Lys27 x x 3 Transcription silencing, X inactivation[130, 141]

Ser28 (mammals) x 3 Mitosis, immediate-early gene activation[133, 142, 143]

Lys36 x x 1 Transcription activation (elongation)[144, 145]

Tyr41 x 1 Transcription activation [67]

Arg42 x 1 Transcription activation [146]

Thr45 x 1 Apoptosis [147]

Lys56 x x 1 Transcription activation, DNA repair, oxi-dativeDNAdamage [53, 59, 96, 148, 149]

Lys57 x 1 Unknown [150]

Arg63 x 1 Unknown [96]

Lys64 x x 2 Transcription [96, 104, 151]

Lys79 x 1 Euchromatin, transcription activation(elongation), checkpoint response[152–154]

Thr80 x 1 Mitosis [150]

Thr118 x 1 Transcription, DNA repair [103, 155, 156]

Lys122 x x 2 Transcription, DNA repair [94, 96, 157]

Arg128 x 1 Unknown [96]

Ser1 x 2+ Mitosis, chromatin assembly, DNArepair [65, 97]

Arg3 x 5 Transcription activation, transcriptionrepression [73, 126, 158]

Lys5 x 7 Transcription activation, histone deposi-tion,DNA repair[50, 57, 58, 99, 100, 115, 120, 159]

Lys8 x x 9 Transcription activation (elongation), DNArepair[50, 57, 58, 84, 85, 99, 100, 115, 137, 160]

6

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 7: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

[22–27, 168, 169], but they cannot fully describe thechromatin structural patterns. The difficulties stemfrom the tightly packed and still not fully accessiblechromatin fiber interior and from the small size ofhistone modifications. Furthermore, laboratory pre-parations often put analyzed specimens in conditionsnot present in vivo. For example, in vitro fibers haveequally spaced nucleosomes and usually lack histonemodifications [168, 169]. In principle, systematicmodeling can help address the influence of histonemodifications on chromatin structure. This is achallenging task because the effects of histone mod-ifications cover various spatial and temporal scales,which means that different levels of chromatin fiberorganization have to be described using differenttheoretical and computational methods [6]. The first,basic level modeling has to cover is the influence ofhistone modifications on histone tails themselves. Thesecond level is the modeling of the structure andbehavior of nucleosome cores with and withoutmodifications. Next, the behavior and structure ofoligonucleosomes, with and without histone modifi-cations, has to be accurately determined. The finallevel is the modeling of longer stretches of chromatinfiber and the modeling of whole chromosomes. Allthose four levels of modeling have to apply differenttools and strategies in a selective manner, but they

have to be consistent in the interpretation of fiberbehavior.

Nucleosome and oligonucleosomemodeling

Molecular dynamics uses the atom level interpretationof molecules to model their behavior in naturalenvironments (for a detailed review see [170]). Itsnumerical complexity makes MD simulations ofbiomolecules applicable to systems with a limitednumber of atoms, usually less than ten million [171],including solvent and salt atoms [170]. The maximumtime span MD can cover is in the range of micro-seconds [171–173], or milliseconds on special archi-tectures (Anton [174]). The full atom interpretation ofa dinucleosome in explicit solvent requires 800 000atoms [39], which means that besides simulationsof individual tails only a limited number of nucleo-some related simulations have been conducted sofar (see [175] for details). Although limited inscope, those simulations provide important insightsinto the behavior of the nucleosome and itscomponents.

Many methods have been developed in attempt toimprove the efficiency of molecular dynamics (MDs)

Table 1. (Continued.)

Tail Residue A P U S B M#modify-ing factors Description

Lys12 x x 7 Transcription activation (elongation), DNArepair, histone deposition, telomericsilencing, euchromatin[50, 57, 58, 84, 85, 99, 100, 120, 159, 161]

H4 Lys16 x 7 Transcription activation, transcriptionsilencing, DNA repair [39, 56–58, 99, 100, 102, 115, 160, 162]

Lys20 x 5 Transcription activation, transcriptionsilencing (mono-me), heterochromatin(tri-me), checkpoint response [163–165]

Arg35 x Unknown [96]

Ser47 x Nucleosome assembly [166]

Arg55 x Unknown [96]

Lys59 x 1+ Transcription silencing, oxidativeDNAdamage [96, 167]

Arg67 x 1 Unknown [96]

Lys77 x x 2 Unknown [96]

Lys79 x x 2 OxidativeDNAdamage [94, 96]

Tyr88 x 1 Unknown [96]

Lys91 (S. cerevisiae) x . 2 Chromatin assembly [96, 168]

Arg92 x 1 Unknown [96, 104]

7

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 8: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

simulations [176–184]. The most commonly methodused to reduce the numerical cost of MD is implicitsolvent. This approach reduces the number of simu-lated atoms by representing solvent as a continuousmedium, following the assumption that polar orcharged molecules stay in water [185, 186]. Thismethod is used to explore the protein folding anddynamic behavior of biological polymers, DNA, RNAand proteins. Its applicability can be limited in a tightlypacked environment of the chromatin fiber interior,in which individual water molecules may affect thebehavior of small peptides such as histone tails.

Many other methods have been developed, buttheir application to complex systems like the chroma-tinfiber remains to be shown as advantageous.

Coarse-graining interprets biological moleculesless accurately thanMD, but allows simulations of lar-ger systems and makes possible more efficient sam-pling of the configurational space. Early coarse-grained models, although simple, were able to inter-pret basic structural patterns of the chromatin fiberand approach experimental findings [187–191]. Thosemodels usually represented DNA as a worm-like-chain and nucleosomes as simple, rigid particles thatinteract via simple potentials. Advanced models usemore detailed representations of nucleosomes andapply explicit tails.

Woodcock et al applied a model of the oligonu-cleosome fiber, based on a simple representation ofrotatable nucleosome core particle (NCP) and vari-able-length DNA linkers with linker entry-exit angles,to interpret EM chromatin images [187]. A similarearly coarse-grained model by Katritch, Bustamanteand Olson was able to reproduce force-extensioncurves of real chromatin pulling experiments [192].Later on, Wedemann and Langowski modeled NCPsas oblate ellipsoids in an elastic fiber [189]. TheirMonte Carlo simulations of 100-NCP chains withelastic and electrostatic interactions reproducedexperimental properties of 30 nm fibers with mediumDNA linker lengths (they achieved a mass density of6.1 NCP per 11 nm with 32 nm wide fiber). Later on,they added various chromatin structural elements totheir model, such as the nucleosome stem [190] orirregularly spaced nucleosomes [193]. Many othergroups also developed coarse-grained models toaddress the stability of nucleosome, its response tostretching, and to analyze interactions between DNAand cylindrical particles (nucleosomes) under physio-logical conditions [194–198]. See [6] for a recent over-view of coarse-grained and multiscale modeling ofchromatin.

Our group was among the first to address the roleof histone tails in chromatin compaction using adetailed coarse-grainedmodel of the nucleosome. Themodel uses different coarse-graining techniques tomodel basic chromatin building blocks. Nucleosomes,histone tails, DNA linkers and LHs are modeled asseparate entities using appropriate theoretical models

[15, 28, 33–35, 37, 199–205]. The nucleosome core(eight histones plus wrapped DNA), without histonetails, is modeled as a rigid object with irregular surfaceinterpreted via approximately 300 charged beads[201]. The charge of each bead is assigned using thediscrete surface charge optimization algorithm devel-oped by Beard and Schlick [206] based on the Debye–Hückel approximation. The nucleosome core modelrobustly reproduces the electrostatic field of thenucleosome at physiological monovalent salt con-centrations. DNA linkers between consecutive nucleo-somes are interpreted as worm-like-chains. Theirnegative charges are modeled using Stigter’s proce-dure [202]. The core histone tails are separately mod-eled, using the Warshel–Levitt united-atom proteinmodel, with one bead per five amino-acids [35]. TheLH was initially modeled as a rigid three-bead struc-ture [15]. A recent refinement accounts for the inher-ently dynamic and disordered nature of the LH[199, 203]. The refined model includes flexible C-terminal domains and a globular head attached to thenucleosome [200]. Chromatin fiber configurations aresampled using an efficient set of Monte Carlomoves [35, 37].

Comparison with real-world experimental data hasbeen used to validate the model. For instance, themodel reproduces values of sedimentation coefficients,radios of gyration and packing ratios encountered inin vitro experiments [15, 28, 33, 34, 37, 200, 204, 205].Various applications have detailed the influence ofdynamic tails [35], variable linker lengths [207] anddivalent Mg2+ ions [15], and the LH binding affinity[205, 208] on chromatin’s structure [28]. Furthermore,fiber models share properties of both theoretical mod-els (zig-zag and solenoid) [28]. The model withimproved LH was able to explain the binding asym-metry and relate the LH condensation and nucleosomestem formation with the global condensation of chro-matin [200]. A linear relationship between the LH con-centration and the DNA linker length observedexperimentally could also be related to the formation ofa compact zig-zag fiber [209]. Forced induced unfold-ings showed that LH increases fiber’s resistance tounfolding, and that dynamic binding/unbinding of LHreduces it [204, 205]. Heterogeneous elements promotesuper beads-on-a-string configurations during stretch-ing. Those configurations are biologically advantageousbecause they selectively expose DNA as 10 nm linkersbetween super-beads. The fibers with non-uniform lin-ker lengths also exhibit smoother transitions because ofa more continuous range of similar stable configura-tions [210]. The absence of 30 nm fiberswas also shownin recent modeling work in collaboration with cross-linking experiments. Persistence of zig-zag motifswithin a new model of hierarchical looping for meta-phase chromosomes helps reconcile current models ofpolymer melts on one hand and other models thatdefine clear chromosomal boundaries [31].

8

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 9: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

The histone tails have been a major topic of mod-eling studies due to their intrinsic disorder and theirroles in fiber compaction. The tails were addressedusing full-atom MD [45, 211] as well as coarse-grain-ing [35, 37, 39, 41]. The researchers explored the rolesof individual tails and their interactions with nucleo-somes and DNA before shifting their focus to tailmodifications. Our early studies, although lessdetailed than full-atomMD simulations, revealed thattails have multiple roles and are crucial for fiber com-paction. They delineated roles of each tail, especiallythe role of the H4 tail in mediating internucleosomalinteractions in highly compacted fibers with LHs, fol-lowed by roles of ‘H3,H2A, andH2B tails in decreasingorder of importance’ [35, 37], a result concordant withexperimental finding on the role of H4 tail in fibercondensation [54]. The nanosecond timescale MDsimulations of the nucleosome core by Roccatano andcoworkers [211] showed that at physiological salt con-centrations histone tails adopt conformations inwhich tail segments preferentially interact with majorand minor grooves of DNA and produce a slightlymore compact and solvent-protected nucleosome, aresult that contradicts experimental finding that tailsadopt a largely solvent-exposed structure at salt con-centrations above 50 mM NaCl [212]. H3 in theirsimulations exhibited the most noticeable conforma-tional changes, correlated with the increased numberof close tail-DNA contacts, while H2A domainsshowed affinity toward the DNA minor groove andhigher mobility, likely caused by the presence of theH2A’s long C-terminal tail. Roccatano and coworkersalso showed that the canonical nucleosome particle ismostly rigid under physiological conditions during thenanosecond simulation runs. Over longer time scales,however, nucleosomes exhibit conformational het-erogeneity and spontaneous unwrapping/rewrappingof nucleosomal DNA, as shown by the Langowski andWidom groups [213, 214]. MD simulations by Biswasand coworkers [45] showed that nucleosome simula-tions with truncated H2A and H3 tails produce lesscompact nucleosomes. The removal of these tails dis-rupts the electrostatic potential around the H2A his-tone, which in turn destabilizes the docking betweenthe H2A–H2B dimer and the H3–H4 tetramer. Themultiscale study of nucleosome unwrapping by theLangowski group [41], based on coarse-grained mod-eling and all-atom MD simulations, suggested thathistone tails could have an opposite effect on themononucleosome. While the attraction between theH3 tail and the ‘acidic patch’, formed on the nucleo-some surface by seven acidic residues from H2A andH2B, can trigger partial unwrapping of DNA, and pre-vent the DNA rewrapping, the attraction between theH4 tail and the patch promotes full wrapping of thenucleosome. That indicates that the H3 tails activelyparticipate in the initiation of the nucleosome remo-deling. A recent study from the same group [40], basedon replica-exchange and MDs simulations showed

that the H2B and H4 tails have a single dominantbinding configuration with DNA, while the H2B andH3 tails have multiple DNA binding configurations.However, large portions of tails were found not to bebound to DNA, which is an indirect conformation oftheir complex roles. The Rippe group studied thenucleosome’s resistance to forced unwrapping usingthe steeredMD (SMD) [215]. The analysis of SMD tra-jectories proposed a multistep process in which his-tone tails have prominent roles in the resistance tounwrapping. The authors also suggested that there aretwo main base-pair related barriers to unwrapping.However, results obtained with SMD simulations haveto be taken with precaution because SMD perturba-tions are five to six orders of magnitude faster thanexperimental techniques. The Langowski groupapplied Brownian dynamics to a coarse-grainedmodelwith a uniform, distance-dependent potential toexamine the mechanical unfolding of DNA from thetorus-like nucleosome core [216]. Their simulationssuggested a gradual unwrapping of DNA from nucleo-some and reproduced force-extension curves for thelow-force loading rate. Dobrovolskaia and Arya usedcoarse-grained models of nucleosome and DNA withexperimentally derived position-dependent freeenergy profile to address the influence of non-uniforminteractions between histone octamer and wrappedDNA on the nucleosome’s resistance to externalforce [217].

An effective modeling of histone modificationsrequires both MD and coarse-grained approaches todescribe the direct influence of modifications on thenucleosome and their influence on longer stretchesof the chromatin fiber. Yang and Arya [218] con-firmed the experimental finding [219] that the inter-actions between the H3 and H4 histone tails and thenucleosome’s acid patch play a crucial role in the reg-ulation of the nucleosome structure and may have asignificant role in the overall fiber architecture. Theysuggested that the acetylation of the lysine 16 fromthe H4 tail reduces the alpha-helix forming pro-pensity of H4 and destabilizes its binding to the acidicpatch. In a similar study, Potoyan and Papopian[220] found that the acetylation of H4K16 inducespartial ordering of the H4 histone and enhances itspropensity toward alpha-helical organization. Theordering of the intrinsically disorder protein reducesthe tail’s binding affinity toward neighboring nucleo-somes and weakens internuclosomal contacts, aresult in concordance with experimental findings[54]. The acetylation of the H4 tail, despite reducingthe positive tail charge produces a stronger attractionbetween the tail and DNA by inducing a partial col-lapse of the tail that enables it to make more hydro-phobic contacts with the surface of DNA [220]. Ourrecent multiscale study follows the path establishedby our earlier studies and addresses seven major his-tone acetylations [39]. In that study Collepardo-Gue-vara and coworkers show through combinedMD and

9

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 10: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

coarse-grained Monte Carlo simulations of 24 oligo-nucleotides that acetylations change the overall his-tone-tail flexibility and decrease the disorder of tails.The more folded tails have reduced interaction sur-face areas that limit inter-nucleosomal interactionsin oligonucleosomes and trigger chromatin fiberopening. For instance, a H4 tail with acetylated resi-dues has a limited ability to extend and reach neigh-boring nucleosomes but has a higher affinity towardparental DNA. The results clearly underscore manyfactors that affect the chromatin compaction incooperative manner, and confirm that the hyper-acetylation is not a prerequisite for a significant struc-tural change. The acetylation of a single H4 lysine(H4K16) prohibits the formation of compact fiberand affects the formation of both higher order chro-matin structures and functional interactions betweenchromatin fiber and non-histone proteins [54, 220].

Chromatin-protein interactions are an attractiveresearch topic also. An MD study by Papamokos andcoworkers [221] suggested that the binding of a pro-tein important for the formation of the hypercon-densed mitotic chromosomes (HeterochromatinProtein 1—HP1, [6]) to the H3 tail is strongly affectedby the H3S10 phosphorylation. That study discoveredthat the phosphorylated H3S10 residue forms a saltbridge with H3R8 and thus impacts the HP1 bindingto the methylated H3K9me2/3. A similar MD basedstudy depicted the non-covalent binding of lysinespecific demethylase-1 enzyme (LSD1) with its co-repressor protein (CoREST) to chromatin [222]. Thisstudy is notable because LSD1 is one of the most pro-mising epigenetic targets for drug discovery againstcancer and neurodegenerative diseases [222]. Theauthors showed that the LSD1/CoREST complexbinds to H3 via an induced-fit mechanism, an infor-mation potentially useful in designing LSD1 inhibi-tors. The combined docking, Brownian dynamics,and normal mode analysis by the Wade group [199]showed that the LH H1/H5 can adopt various dock-ing positions near the nucleosome’s dyad axis, aresults in concordance with the latest experimentalstudies that suggests that asymmetric, on- and off-dyad, binding of the LH globular domain may leadtoward distinct higher order chromatin structures[223]. The variable LH positioning is importantbecause it allows the LH to influence the higher orderstructure of chromatin fiber in adaptable fashionthrough the modulation of entering/exiting angles ofDNA linkers [224].

Polymer and continuummodels

Polymer models are less refined than mesoscalemodels, but cover much larger spatial scales. Theyinterpret chromatin as a chain of spherical beads thatinteracts through simple harmonic and Lennard–Jones systems, where each bead represents more than

one nucleosome core. The polymer models have beendeveloped as a response to large-scale experimentalobservations by various chromatin conformationcapture techniques based on cross-linking of sequen-tially distant DNA segments. Those techniques pro-duce kilo-base (3C), mega-base (4C and 5C) andgenome-wide (Hi-C) interaction frequencies betweengenome loci in the nucleus as statistical averages overcell populations [225]. Interaction frequencies insingle cells can be analyzed via 3C [226] and fluores-cence in situ hybridization techniques [227]. Conti-nuum models, on the other hand, were developed tointerpret experimental data related to in vivo celldynamics. They primarily use analytical approaches,based on fluid dynamics, to explain the behavior oflarge segments of chromatin in the cell nucleus.

Both experimental observations and simulationsestablished the fractal model of the structural organi-zation of chromosomes (first introduced by Grosbergand coworkers [228, 229]). The model assumes globu-lar organization of chromosomes at almost all scales.In that case the contact probability of genomic loci as afunction of the genome distance (or sub-chain length)follows the same scaling law. That law is the outcomeof the self-similar polymer (chromatin) domains orga-nized into non-equilibrium hierarchical structureswith open-state and untangled topologies that rarelyinterfere with each other [230]. The Mirny lab intro-duced the fractal framework using a 20 nm bead poly-mer chain in which each bead covers six nucleosomes[231]. They were able to generate a fractal-like organi-zation of their chain with a power law contact prob-ability with the −1 exponent. However, theequilibration of that chain produced a uniform dis-tribution, incompatible with the interphase chromo-somes. Later on, they decreased the bead size to 10 nmand applied attractive and repulsive Lennard–Jonespotentials, softened at short distances to allow chaincrossing [232]. This refined model produced contactprobabilities consistent with experimental data andproposed local loop formations, without reproducibleradial positions, with the rest of their model chromo-some being in disordered state. Themodel suggested apower law dependency of contact probability on geno-mic distance with the exponential factor −0.5 forshort distances, that sharply decays with the increasein distance [232]. Barbieri and coworkers showed thatfractal organization also depends on the concentrationof protein cofactors able to bridge different genomicregions [233]. Simulations based on a polymer modelcomposed of two set of beads by the Langowski lab[234] showed that yeast chromosomes have pre-ferential positions in cell nucleus with dynamical clus-tering of functional elements of genomes. Jost andcoworkers addressed epigenetic modificationsthrough block organized chromatin model [235]. Intheir interpretation, the chromatin chain is organizedinto various blocks of identical monomers that pre-ferentially interact with other monomers of the same

10

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 11: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

chromatin type over interactions with monomers ofother types. They applied Langevin dynamics andGaussian like potentials to sample fiber configura-tions. Their results are consistent with Hi-C dataobtained from 10 Mbp drosophila regions. The topo-logical domains they generated are related epigeneti-cally and form a multistable fiber with coiled andcollapsed microphase regions that separate differentepigenetic domains. Bruinsma and coworkers [236]produced a two-fluid analytical model (nucleoplasmas a solvent and chromatin as a solute) to interpretexperimental observation of coherent movementsbeyond single chromosomes [237]. They showed thatthe nucleus in ATP-depleted cells is characterized withpassive longitudinal thermal fluctuations, while ATP-active cells have intense transverse long wavelengthvelocity fluctuations driven by force dipoles depen-dent on ATP. Isaacson and coworkers examined thetime required to find specific DNA binding site as afunction of the chromatin density (expressed as thevolume exclusivity, a potential term that excludes adiffusing protein from a given volume filled with chro-matin) [238, 239]. Their results based on a continuummodel indicate that the time sharply decreases as theexclusivity increases and reaches a minimum valueand then slowly increases with the increase in volumeexclusivity.

Discussion

The chromatin structure problem has remained achallenge despite many recent advances in bothexperimentation and modeling. Its appeal comes notonly from the puzzling nature of chromatin architec-ture and still unknown relationship between local fiberproperties and behavior of the chromatin fiber at large,but also from the chromatin’s key role in geneexpression. And the deciphering that role has verypractical implications because the disruption of DNApackaging has serious implications on health. Indeed,chromatin structure can be easily disrupted by histonemodifications. That disruption can affect overallchromosome integrity and alter gene expression,including the aberrant regulation of oncogenes and/or tumor suppressors. However, new treatment ave-nues arise because the majority of histone modifica-tions is reversible.

The detailed modeling of histone modifications isrequired for a proper interpretation of chromatin fiberstructure and dynamics because experimentation is stilllimited in scope and resolution. The current modelingefforts addressed only general properties of chromatinfiber and described a very small number of histonemodifications. To understand the chromatin fiberstructure and its behavior, all modifications and theirinterplay have to be addressed in a systematic fashion.

Figure 2.Multiscale chromatinmodeling covers all levels ofmodeling, from atomicmodels of nucleosomes and histonemodifications, via oligomers and polymermodels to continuousmodels that interpret global chromatinmotion in cell nucleus.

11

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 12: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

That enormous task will require MD simulations of alltail modifications, within mononucleosome and with-out it. The puzzling role and behavior of LH has to befurther examined, using docking, MD and coarse-graind simulations. Furthermore,MD simulations haveto be performed on structures larger than dinucleo-somes, with and without histone modifications. Simu-lations with four, or even more, nucleosomes arenecessary if we want to fully grasp the role of tails ininternucleosomal interactions. Those simulations haveto based on different crystal structures becausemajorityof current simulations has been based on the same highresolution structure, 1KX5 [175]. The role of DNAsequence in nucleosome positioning and fiber dynam-ics also has to be addressed in a systematic fashion. Thatwill require all atom representations of DNA linkersand, probably, combination of classical and quantumMD. The nucleosome’s mechanical resistance tounfolding has been addressed using only fast SMD[215], orders of magnitude faster that experimentalstretching.New simulations have to be performedusingmuch slower pulling on single nucleosomes with andwithout modifications, and, if possible, on equilibratedsystemswithmorenucleosome cores.

All-atom simulations are, despite recent advancesin hardware, numerically still very expensive. Thatmeans that longer stretches of fiber have to be simu-lated using coarse grained models. Current models arevery detailed, but have to be improved to include all his-tone modifications, as well as LH and nucleosome coredynamics, especially during mechanical stretching.Moreover, they have to accurately interpret the influ-ences of various binding factors and ion environments.

To facilitate research and collaboration, themodel-ing approaches should be able to record chromatinsimulation trajectories, corresponding starting config-urations, and force filed parameters in a standardized,human readable format(s) based on attribute-valuepairs. Similarly, software tools should be open sourcedand deposited in easily accessible online repositories.This can enable an approach to modeling where anappropriate algorithm can be called if necessary insemi-autonomous fashion. For instance, a coarse-grained model can invoke an atom-based model tointerpret an unusual configuration. This could, in prin-ciple, allow zooming-in and zooming-out of chromatinregions, see figure 2, because information betweenmodeling levels can be easily exchanged using a com-mon data format. In that case, the long chromatin fiberis already in energetically favorable configurationwhichmeans that only sampling on a limited scale with spatialconstraints, using local neighborhoods would have tobe performed. Such a gradual, object-oriented model-ing enables inheritance of general properties from onemodeling level to another without the burden ofincreased numerical complexity. However, it has to beapplied with caution because an apparently small mod-ification with a limited influence on a local fiber

neighborhood can have a crucial role in the behavior oflonger stretches of the chromatin fiber. Those longerstretches often exhibit puzzling dynamic behavior thatrequires a detailed interpretation in the light of geneexpression patterns in various tissue andorgans.

Chromatin structure and its relation to the histonecode remains a central question in structural/mole-cular biophysics and genetics. Addressing it fullyrequires a joint effort of experimentalist andmodelers.To facilitate these efforts, a more facile informationflow and data/program sharing between differentteams could be valuable.

References

[1] InternationalHumanGenome Sequencing Consortium2001Initial sequencing and analysis of the human genomeNature409 860–921

[2] Venter J C et al 2001The sequence of the human genomeScience 291 1304–51

[3] UhlénM et al 2015Tissue-basedmap of the humanproteomeScience 347 1260419

[4] Luger K,Mäder AW,RichmondRK, SargentD F andRichmondT J 1997Crystal structure of the nucleosome coreparticle at 2.8 Å resolutionNature 389 251–60

[5] Bannister A J andKouzarides T 2011Regulation of chromatinby histonemodificationsCell Res. 21 381–95

[6] OzerG, LuqueA and Schlick T 2015The chromatinfiber:multiscale problems and approachesCurr. Opin. Struct. Biol.31 124–39

[7] Mardis ER 2008The impact of next-generation sequencingtechnology on geneticsTrends Genet. 24 133–41

[8] QuailMA, SmithM,Coupland P,Otto TD,Harris S R,Connor TR, Bertoni A, SwerdlowHP andGuY 2012A taleof three next generation sequencing platforms: comparisonof IonTorrent, Pacific Biosciences and IlluminaMiSeqsequencersBMCGenomics 13

[9] Pabinger S, Dander A, FischerM, Snajder R, SperkM,EfremovaM,Krabichler B, SpeicherMR, Zschocke J andTrajanoski Z 2013A survey of tools for variant analysis ofnext-generation genome sequencing dataBriefingsBioinformatics 15 256–78

[10] TheCancerGenomeAtlas ResearchNetwork 2008Comprehensive genomic characterization defines humanglioblastoma genes and core pathwaysNature 455 1061–8

[11] BrennanCW et al 2013The somatic genomic landscape ofglioblastomaCell 155 462–77

[12] Dekker J, Rippe K,DekkerMandKlecknerN 2002Capturingchromosome conformation Science 295 1306–11

[13] Dekker J 2008Gene regulation in the third dimension Science319 1793–4

[14] NaumovaN andDekker J 2010 Integrating one-dimensionaland three-dimensionalmaps of genomes J. Cell Sci. 1231979–88

[15] Grigoryev S, AryaG, Correll S,WoodcockC and Schlick T2009 Evidence for heteromorphic chromatin fibers fromanalysis of nucleosome interactions Proc. Natl Acad. Sci 10613317–22

[16] Schlick T,Hayes T andGrigoryev S 2012Towardconvergence of experimental studies and theoreticalmodeling of the chromatin fiber J. Biol. Chem. 287 5183–91

[17] Grigoryev S andWoodcockCL 2012Chromatinorganization—the 30 nmfiberExp. Cell Res. 318 1448–55

[18] ThomaF,Koller T andKlugA 1979 Involvement of histoneH1 in the organization of thenucleosome and of the salt-dependent superstructures of chromatin J. Cell Biol. 83 403–27

[19] Finch J T andKlugA 1979 Solenoidalmodel forsuperstructure in chromatin Proc. Natl Acad. Sci. 731897–901

12

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 13: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

[20] StaynovDZ 1983 Possible nucleosome arrangements in thehigher-order structure of chromatin Int. J. Biol.Macromol. 53–9

[21] Williams S P, Athey BD,Muglia L J, Schappe R S,GoughAHand Langmore J P 1986Chromatinfibers are left-handed double helices with diameter andmass per unitlength that depend on linker lengthBiophys. J. 49 233–48

[22] Athey BD, SmithMF, Rankert DA,Williams S P andLangmore J P 1990The diameters of frozen-hydratedchromatin fibers increase withDNA linker length: evidencein support of variable diametermodels for chromatin J. CellBiol. 111 795–806

[23] Horowitz RA, AgardDA, Sedat JWandWoodcockCL 1994The three-dimensional architecture of chromatin in situ:electron tomography reveals fibers composed of acontinuously variable zig-zag nucleosomal ribbon J. Cell Biol.125 1–10

[24] Dorigo B, Schalch T, Kulangara A,Duda S, Schroeder RR andRichmondT J 2004Nucleosome arrays reveal the two-startorganization of the chromatin fiber Science 306 1571–3

[25] Schalch T,Duda S, SargentDF andRichmondT J 2005X-raystructure of a tetranucleosome and its implications for thechromatin fibreNature 436 138–41

[26] SchefferMP, EltsovMand Frangakis A S 2011 Evidence forshort-range helical order in the 30 nmchromatin fibers oferythrocyte nucleiProc. Natl Acad. Sci. 108 16992–7

[27] Song F, Chen P, SunD,WangM,Dong L, LiangD, XuRM,ZhuP and Li G 2014Cryo-EM study of the chromatinfiberreveals a double helix twisted by tetranucleosomal unitsScience 344 376–80

[28] PerišićO,Collepardo-Guevara R and Schlick T 2010Modeling studies of chromatin fiber structure as a function ofDNA linker length J.Mol. Biol. 403 777–802

[29] EltsovM,MacLellanKM,MaeshimaK, Frangakis A S andDubochet J 2008Analysis of cryo-electronmicroscopyimages does not support the existence of 30 nmchromatinfibers inmitotic chromosomes in situ Proc. Natl Acad. Sci. 10519732–7

[30] Nishino Y et al 2012Humanmitotic chromosomes consistpredominantly of irregularly folded nucleosome fibreswithout a 30 nmchromatin structureEMBO J. 31 1644–53

[31] Grigoryev SA, BascomG, Buckwalter JM, SchubertMB,WoodcockCL and Schlick T 2016Hierarchical looping ofzigzag nucleosome chains inmetaphase chromosomes Proc.Natl Acad. Sci. 113 1238–43

[32] Mirny LA 2011The fractal globule as amodel of chromatinarchitecture in the cellChromosomeRes. 19 37–51

[33] Schlick T and PerišićO2009Mesoscale simulations of twonucleosome-repeat length oligonucleosomes Phys. Chem.Chem. Phys. 11 10729–37

[34] Collepardo-Guevara R and Schlick T 2014Chromatinfiberpolymorphism triggered by variations ofDNA linker lengthsProc. Natl Acad. Sci. 111 8061–6

[35] AryaG and Schlick T 2006Role of histone tails in chromatinfolding revealed by a newmesoscopic oligonucleosomemodel Proc. Natl Acad. Sci. 103 16236–41

[36] Mersfelder E L and ParthunMR2006The tale beyond thetail: histone core domainmodifications and the regulation ofchromatin structureNucleic Acids Res. 34 2653–62

[37] AryaG and Schlick T 2009A tale of tails: howhistone tailsmediate chromatin compaction in different salt and linkerhistone environments J. Phys. Chem.A 113 4045–59

[38] Tessarz P andKouzarides T 2014Histone coremodificationsregulating nucleosome structure and dynamicsNat. Rev.Mol.Cell Biol. 15 703–8

[39] Collepardo-Guevara R, Portella G, FrenkelMD,Schlick T andOrozcoM2015Chromatin unfolding byepigeneticmodifications explained by dramatic impairmentof internucleosome interactions: amultiscale computationalstudy J. Am.Chem. Soc. 137 10205–15

[40] Erler J, Zhang R, Petridis L, ChengX, Smith J C andLangowski J 2014The role of histone tails in the nucleosome:a computational studyBiophys. J. 107 2911–22

[41] Voltz K, Trylska J, CalimetN, Smith J C and Langowski J 2012Unwrapping of nucleosomalDNA ends: amultiscalemolecular dynamics studyBiophys. J. 102 849–58

[42] Brower-Toland B,WackerDA, Fulbright RM, Lis J T,KrausWL andWangMD2005 Specific contributions ofhistone tails and their acetylation to themechanical stabilityof nucleosomes J.Mol. Biol. 346 135–46

[43] FerreiraH, Somers J,Webster R, Flaus A andOwen-Hughes T 2007Histone tails and theH3 alpha-Nhelixregulate nucleosomemobility and stabilityMol. Cell. Biol. 274037–48

[44] Henikoff S 2008Nucleosome destabilization in the epigeneticregulation of gene expressionNat. Rev. Genet. 9 15–26

[45] BiswasM,Voltz K, Smith JC and Langowski J 2011Role ofhistone tails in structural stability of the nucleosome PLoSComput. Biol. 7 e1002279

[46] IwasakiW,Miya Y,HorikoshiN,Osakabe A, TaguchiH,TachiwanaH, Shibata T, KagawaWandKurumizakaH2013Contribution of histoneN-terminal tails to the structure andstability of nucleosomes FEBSOpen Biol. 3 363–9

[47] WidlundHR,Vitolo JM, Thiriet C andHayes J J 2000DNAsequence- dependent contributions of core histone tails tonucleosome stability: differential effects of acetylation andproteolytic tail removalBiochemistry 39 3835–41

[48] Gottesfeld JM andLuger K 2001 Energetics and affinity of thehistone octamer for definedDNA sequencesBiochemistry 4010927–33

[49] Marmorstein R andTrievel RC 2009Histonemodifyingenzymes: structures,mechanisms, and specificitiesBiochim.Biophys. Acta 1789 58–68

[50] Schiltz R L,MizzenCA,Vassilev A, CookRG,Allis CD andNakatani Y 1999Overlapping but distinct patterns of histoneacetylation by the human coactivators p300 and PCAFwithinnucleosomal substrates J. Biol. Chem. 274 1189–92

[51] Grant PA, Eberharter A, John S, CookRG, Turner BMandWorkman J L 1999 Expanded lysine acetylation specificity ofGcn5 in native complexes J. Biol. Chem. 274 5895–900

[52] Howe L, AustonD,Grant P, John S, CookRG,Workman J L and Pillus L 2001HistoneH3 specificacetyltransferases are essential for cell cycle progressionGenesDev. 15 3144–54

[53] XuF, ZhangK andGrunsteinM2005Acetylation in histoneH3 globular domain regulates gene expression in yeastCell121 375–85

[54] Shogren-KnaakM, IshiiH, Sun JM,PazinMJ,Davie J R andPetersonCL2006HistoneH4-K16 acetylation controlschromatin structure andprotein interactions Science311 844–7

[55] YangX J and Seto E 2007HATs andHDACs: From structure,function and regulation to novel strategies for therapy andpreventionOncogene 26 5310–8

[56] SuttonA, ShiaW J, BandD,Kaufman PD,Osada S,Workman J L and Sternglanz R 2003 Sas4 and Sas5 arerequired for the histone acetyltransferase activity of Sas2 inthe SAS complex J. Biol. Chem. 278 16887–92

[57] Bird AW,YuDY, Pray-GrantMG,QiuQ,HarmonKE,Megee PC,Grant PA, SmithMMandChristmanMF 2002Acetylation of histoneH4 by esa1 is required forDNAdouble-strand break repairNature 419 411–5

[58] Ikura T,OgryzkoVV,GrigorievM,GroismanR,Wang J,HorikoshiM, Scully R,Qin J andNakatani Y 2000Involvement of the TIP60 histone acetylase complex inDNArepair and apoptosisCell 102 463–73

[59] MasumotoH,HawkeD, Kobayashi R andVerreault A 2005Arole for cell-cycle-regulated histoneH3 lysine 56 acetylationin theDNAdamage responseNature 436 294–8

[60] Kiefer CM,HouC, Little J A andDeanA 2008 Epigenetics ofbeta-globin gene regulationMutat. Res. 647 68–76

[61] WangZ et al 2008Combinatorial patterns of histoneacetylations andmethylations in the human genomeNat.Genet. 40 897–903

[62] HendzelM J,Wei Y,ManciniMA,VanHooser A, Ranalli T,Brinkley BR, Bazett-JonesDP andAllis CD1997Mitosis-specific phosphorylation of histoneH3 initiates primarily

13

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 14: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

within pericentromeric heterochromatin duringG2 andspreads in an ordered fashion coincident withmitoticchromosome condensationChromosoma 106 348–60

[63] Fernandez-CapetilloO, Allis CD andNussenzweig A 2004Phosphorylation of histoneH2B atDNAdouble-strandbreaks J. Exp.Med. 199 1671–7

[64] Harvey AC, Jackson S P andDowns J A 2005 Saccharomycescerevisiae histoneH2A Ser122 facilitates DNA repairGenetics170 543–53

[65] CheungWL, Turner F B, Krishnamoorthy T,Wolner B,Ahn SH, FoleyM,Dorsey J A, PetersonCL, Berger S L andAllis CD 2005 Phosphorylation of histoneH4 serine 1 duringDNAdamage requires casein kinase II in S. cerevisiae Curr.Biol. 15 656–60

[66] OkiM, AiharaH and Ito T 2007Role of histonephosphorylation in chromatin dynamics and its implicationsin diseases Subcell Biochem. 41 319–36

[67] DawsonMA, Bannister A J, Göttgens B, Foster SD, Bartke T,GreenAR andKouzarides T 2009 JAK2 phosphorylateshistoneH3Y41 and excludesHP1alpha from chromatinNature 461 819–22

[68] Yamada Y,WarrenA J, DobsonC, Forster A, Pannell R andRabbitts TH 1998TheT cell leukemia LIMprotein Lmo2 isnecessary for adultmouse hematopoiesis Proc. Natl Acad. Sci.95 3890–5

[69] Rea S et al 2000Regulation of chromatin structure by site-specific histoneH3methyltransferasesNature 406 593–9

[70] TamaruH and Selker EU 2001AhistoneH3methyltransferase controlsDNAmethylation inNeurosporacrassaNature 414 277–83

[71] Johnson L, CaoX and Jacobsen S 2002 Interplay between twoepigeneticmarksDNAmethylation and histoneH3 lysine 9methylationCurr. Biol. 12 1360–7

[72] Daujat S, Zeissler U,WaldmannT,HappelN and Schneider R2005HP1 binds specifically to Lys26-methylated histoneH1.4, whereas simultaneous Ser27 phosphorylation blocksHP1binding J. Biol. Chem. 280 38090–5

[73] Di Lorenzo A andBedfordMT2011Histone argininemethylation FEBS Lett. 585 2024–31

[74] GlickmanMHandCiechanover A 2002The ubiquitin-proteasome proteolytic pathway: destruction for the sake ofconstruction Physiol. Rev. 82 373–428

[75] WangH,Wang L, Erdjument-BromageH,VidalM,Tempst P, Jones R S andZhang Y 2004Role of histoneH2Aubiquitination in polycomb silencingNature 431 873–8

[76] ZhuB, Zheng Y, PhamAD,Mandal S S,Erdjument-BromageH, Tempst P andReinbergD2005Monoubiquitination of human histoneH2B: the factorsinvolved and their roles inHOXgene regulationMol. Cell 20601–11

[77] RobzykK, Recht J andOsleyMA2000Rad6-dependentubiquitination of histoneH2B in yeast Science 287 501–4

[78] Seeler J S andDejeanA 2003Nuclear and unclear functions ofSUMONat. Rev.Mol. Cell Biol. 4 690–9

[79] Shiio Y and EisenmanRN2003Histone sumoylation isassociatedwith transcriptional repressionProc. Natl Acad.Sci. 100 13225–30

[80] NathanD et al 2006Histone sumoylation is a negativeregulator in saccharomyces cerevisiae and shows dynamicinterplay with positive-acting histonemodificationsGenesDev. 20 966–76

[81] NathanD et al 2006Histone sumoylation is a negativeregulator in saccharomyces cerevisiae and shows dynamicinterplay with positive-acting histonemodificationsGenesDev. 20 966–76

[82] KobzaK, Camporeale G, Rueckert B, KuehA,Griffin J B,SarathG andZempleni J 2005K4, K9 andK18 in humanhistoneH3 are targets for biotinylation by biotinidase FEBS J.272 4249–59

[83] ChewYC, Camporeale G, Kothapalli N, SarathG andZempleni J 2006 Lysine residues inN-terminal andC-

terminal regions of human histoneH2A are targets forbiotinylation by biotinidase J. Nutritional Biochem. 17 225–33

[84] Camporeale G, Shubert E E, SarathG, Cerny R andZempleni J 2004K8 andK12 are biotinylated in humanhistoneH4Eur. J. Biochem. 271 2257–63

[85] Kothapalli N, SarathG andZempleni J 2005 Biotinylation ofK12 in histoneH4 decreases in response toDNAdouble-strand breaks in human JAr choriocarcinoma cellsJ. Nutrition 135 2337–42

[86] Santos-RosaH et al 2009HistoneH3 tail clipping regulatesgene expressionNat. Struct.Mol. Biol. 16 17–22

[87] Duncan EM et al 2008Cathepsin L proteolytically processeshistoneH3duringmouse embryonic stem cell differentiationCell 135 284–94

[88] Allis CD, Bowen JK, AbrahamGN,Glover CV andGorovskyMA1980 Proteolytic processing of histoneH3 inchromatin: a physiologically regulated event inTetrahymenamicronuclei Cell 20 55–64

[89] Kuzmichev A, Jenuwein T, Tempst P andReinbergD 2004Different EZH2-containing complexes targetmethylation ofhistoneH1or nucleosomal histoneH3Mol. Cell 14 189–93

[90] Jenuwein T andAllis CD 2001Translating the histone codeScience 293 1074–80

[91] Strahl BD andAllis CD 2000The language of covalenthistonemodificationsNature 403 41–5

[92] RandoO J 2012Combinatorial complexity in chromatinstructure and function: revisiting the histone codeCurr.Opin. Genet. Dev. 222 148–55

[93] Wisniewski J R, ZougmanA, Krüger S andMannM2006Mass spectrometricmapping of linker histoneH1 variantsrevealsmultiple acetylations,methylations, andphosphorylation aswell as differences between cell cultureand tissueMol. Cell. Proteomics 6 72–87

[94] Jiang T, ZhouX, TaghizadehK,DongMandDedonPC 2007N-formylation of lysine in histone proteins as a secondarymodification arising fromoxidativeDNAdamage Proc. Natl.Acad. Sci. 104 60–5

[95] ChristophorouMA et al 2014Citrullination regulatespluripotency and histoneH1 binding to chromatinNature507 104–8

[96] TanM et al 2011 Identification of 67 histonemarks andhistone lysine crotonylation as a new type of histonemodificationCell 146 1016–28

[97] Barber CM et al 2004The enhancement of histoneH4 andH2A serine 1 phosphorylation duringmitosis and S-phase isevolutionarily conservedChromosoma 112 360–71

[98] Zhang Y,GriffinK,MondalN and Parvin JD 2004Phosphorylation of histoneH2A inhibits transcription onchromatin templates J. Biol. Chem. 279 21866–72

[99] Clarke A S, Lowell J E, Jacobson S J and Pillus L 1999 Esa1p isan essential histone acetyltransferase required for cell cycleprogressionMol. Cell. Biol. 19 2515–26

[100] KimuraA andHorikoshiM1998Tip60 acetylates six lysines ofa specific class in core histones in vitroGenesCells 3 789–800

[101] Verreault A,KaufmanPD,Kobayashi R andStillmanB1997NucleosomalDNAregulates the core-histone-binding subunitof the humanHat1 acetyltransferaseCurr. Biol.8 96–108

[102] SukaN, Suka Y, CarmenAA,Wu J andGrunsteinM2001Highly specific antibodies determine histone acetylation siteusage in yeast heterochromatin and euchromatinMol. Cell 8473–9

[103] Zhang L, Eugeni E E, ParthunMRandFreitasMA2003Identification of novel histone post-translationalmodifications by peptidemassfingerprintingChromosoma112 77–86

[104] FreitasMA, Sklenar AR and ParthunMR2004Applicationofmass spectrometry to the identification and quantificationof histone post-translationalmodifications J. Cell Biochem. 92691–700

[105] Tessarz P, Santos-RosaH, Robson SC, SylvestersenKB,NelsonC J,NielsenML andKouzarides T 2014Glutamine

14

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 15: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

methylation in histoneH2A is anRNA-polymerase-I-dedicatedmodificationNature 505 564–8

[106] AiharaH et al 2004Nucleosomal histone kinase-1phosphorylates H2AThr 119 duringmitosis in the earlyDrosophila embryoGenesDev. 18 877–88

[107] Yamagishi Y,Honda T, TannoY andWatanabe Y 2010Twohistonemarks establish the inner centromere andchromosome bi-orientation Science 330 239–43

[108] KimK,Kim JM,Kim J S, Choi J, Lee Y S,Neamati N, Song J S,HeoK andAnW2013VprBPHas intrinsic kinase activitytargeting histoneH2A and represses gene transcriptionMol.Cell 52 459–67

[109] Downs JA, LowndesN F and Jackson S P 2000A role forsaccharomyces cerevisiae histoneH2A inDNA repairNature408 1001–4

[110] Shroff R, Arbel-EdenA, PilchD, IraG, BonnerWM,Petrini JH,Haber J E and LichtenM2004Distribution anddynamics of chromatinmodification induced by a definedDNAdouble-strand breakCurr. Biol. 14 1703–11

[111] Ward IMandChen J 2001HistoneH2AX is phosphorylatedin anATR-dependentmanner in response to replicationalstress J. Biol. Chem. 276 47759–62

[112] Burma S, Chen BP,MurphyM,Kurimasa A andChenD J2001ATMphosphorylates histoneH2AX in response toDNAdouble-strand breaks J. Biol. Chem. 276 42462–7

[113] Park E J, ChanDW, Park JH,OettingerMA andKwon J2003DNA-PK is activated by nucleosomes andphosphorylates H2AXwithin the nucleosomes in anacetylation-dependentmannerNucleic Acids Res. 31 6819–27

[114] XiaoA et al 2008WSTF regulates theH2A.XDNAdamageresponse via a novel tyrosine kinase activityNature 457 57–62

[115] KawasakiH, Schiltz L, Chiu R, Itakura K, Taira K,Nakatani Y andYokoyamaKK2000ATF-2 has intrinsichistone acetyltransferase activity which ismodulated byphosphorylationNature 405 195–200

[116] Ahn SH,CheungWL,Hsu J Y, Diaz R L, SmithMMandAllis CD 2005 Sterile 20 kinase phosphorylates histoneH2Bat serine 10 during hydrogen peroxide-induced apoptosis inS. cerevisiae Cell 120 25–36

[117] CheungWL et al 2003Apoptotic phosphorylation of histoneH2B ismediated bymammalian sterile twenty kinaseCell 113507–17

[118] BungardD, FuerthB J, ZengPY, FaubertB,MaasNL,Viollet B,CarlingD,ThompsonCB, JonesRGandBerger SL 2010Signaling kinaseAMPKactivates stress-promoted transcriptionvia histoneH2Bphosphorylation Science3291201–5

[119] Dai J, Sultan S, Taylor S S andHiggins JM2005The kinasehaspin is required formitotic histoneH3Thr 3phosphorylation and normalmetaphase chromosomealignmentGenes Dev. 19

[120] Angus-HillML,Dutnall RN,TafrovST,SternglanzR andRamakrishnanV1999Crystal structure of thehistoneacetyltransferaseHpa2: a tetramericmemberof theGcn5-relatedN-acetyltransferase superfamily J.Mol. Biol.2941311–25

[121] Briggs SD, BrykM, Strahl BD,CheungWL,Davie J K,Dent S Y,Winston F andAllis CD2001HistoneH3 lysine 4methylation ismediated by Set1 and required for cell growthand rDNA silencing in Saccharomyces cerevisiaeGenesDev.15 3286–95

[122] WangH,CaoR, Xia L, Erdjument-BromageH, Borchers C,Tempst P andZhang Y 2001 Purification and functionalcharacterization of a histoneH3-lysine 4-specificmethyltransferaseMol. Cell 8 1207–17

[123] Nakamura T,Mori T, Tada S, KrajewskiW, Rozovskaia T,Wassell R,Dubois G,MazoA,CroceCMandCanaani E 2002ALL-1 is a histonemethyltransferase that assembles asupercomplex of proteins involved in transcriptionalregulationMol. Cell 10 1119–28

[124] Sedkov Y, ChoE, Petruk S, Cherbas L, Smith ST, Jones R S,Cherbas P, Canaani E, Jaynes J B andMazoA 2003Methylation at lysine 4 of histoneH3 in ecdysone-dependentdevelopment of drosophilaNature 426 78–83

[125] Metzger E et al 2010 Phosphorylation of histoneH3T6 byPKCbeta(I) controls demethylation at histoneH3K4Nature464 792–6

[126] Pal S, Vishwanath SN, Erdjument-BromageH, Tempst P andSif S 2004Human SWI/SNF-associated PRMT5methylateshistoneH3 arginine 8 and negatively regulates expression ofST7 andNM23 tumor suppressor genesMol. Cell Biol. 249630–45

[127] SuX, ZhuG,DingX, Lee S Y,DouY, ZhuB,WuWand LiH2014Molecular basis underlying histoneH3 lysine-argininemethylation pattern readout by spin/Ssty repeats ofSpindlin1GenesDev. 28 622–36

[128] Sobel R E,Cook RG, PerryCA, Annunziato AT andAllis CD1995Conservation of deposition-related acetylation sites innewly synthesized histonesH3 andH4 Proc. Natl Acad. Sci. 921237–41

[129] Spencer TE et al 1997 Steroid receptor coactivator-1 is ahistone acetyltransferaseNature 389 194–8

[130] TachibanaM, SugimotoK, FukushimaT and Shinkai Y 2001Set domain-containing protein, G9a, is a novel lysine-preferringmammalian histonemethyltransferase withhyperactivity and specific selectivity to lysines 9 and 27 ofhistoneH3 J. Biol. Chem. 276 25309–17

[131] SchultzDC,AyyanathanK,NegorevD,MaulGG andRauscher F J 3rd 2002 SETDB1: a novel KAP-1-associatedhistoneH3, lysine 9-specificmethyltransferase thatcontributes toHP1-mediated silencing of euchromatic genesbyKRAB zinc-finger proteinsGenesDev. 16 919–32

[132] Hsu J Y et al 2000Mitotic phosphorylation of histoneH3 isgoverned by Ipl1/aurora kinase andGlc7/PP1 phosphatasein budding yeast and nematodesCell 102 279–91

[133] Soloaga A, Thomson S,WigginGR, RampersaudN,DysonMH,Hazzalin CA,Mahadevan LC andArthur J S2003MSK2 andMSK1mediate themitogen- and stress-induced phosphorylation of histoneH3 andHMG-14 EMBOJ. 22 2788–97

[134] Anest V,Hanson J L, Cogswell PC, Steinbrecher KA,Strahl BD andBaldwinA S 2003Anucleosomal function forIkappaB kinase-alpha inNF-kappaB-dependent geneexpressionNature 423 659–63

[135] LoWS,Duggan L, EmreNC, Belotserkovskya R, LaneWS,Shiekhattar R andBerger S L 2001 Snf1—a histone kinase thatworks in concert with the histone acetyltransferaseGcn5 toregulate transcription Science 293 1142–6

[136] PreussU, LandsbergG and ScheidtmannKH2003Novelmitosis-specific phosphorylation of histoneH3 at Thr11mediated byDlk/ZIP kinaseNucleic Acid Res. 31 878–85

[137] Winkler G S, KristjuhanA, Erdjument-BromageH,Tempst P and Svejstrup JQ2002 Elongator is a histoneH3andH4 acetyltransferase important for normal histoneacetylation levels in vivo Proc. Natl Acad. Sci. 99 3517–22

[138] Hsieh Y J, KunduTK,WangZ, KovelmanR andRoeder RG1999The TFIIIC90 Subunit of TFIIIC Interacts withmultiplecomponents of the RNApolymerase IIImachinery andcontains a histone-specific acetyltransferase activityMol. Cell.Biol. 19 7697–704

[139] MizzenCA et al 1996The TAF(II)250 subunit of TFIID hashistone acetyltransferase activityCell 87 1261–70

[140] Daujat S, BauerUM, ShahV, Turner B, Berger S andKouzarides T 2002Crosstalk betweenCARM1methylationandCBP acetylation on histoneH3Curr. Biol. 12 2090–7

[141] CaoR,Wang L,WangH,Xia L, Erdjument-BromageH,Tempst P, Jones R S andZhang Y 2002Role of histoneH3lysine 27methylation in Polycomb-group silencing Science298 1039–43

[142] GotoH, Yasui Y,Nigg EA and InagakiM2002Aurora-Bphosphorylates HistoneH3 at serine28with regard to themitotic chromosome condensationGenes to Cells 7 11–7

[143] Zhong S, JansenC, SheQB,GotoH, InagakiM, BodeAM,MaWYandDong Z 2001Ultraviolet b-inducedphosphorylation of histone h3 at serine 28 ismediated bymsk1 J. Biol. Chem. 276 33213–9

15

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 16: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

[144] Morris SA,RaoB,GarciaBA,Hake SB,DiazRL,Shabanowitz J,HuntDF,AllisCD, Lieb JDandStrahlBD2007Identificationof histoneH3 lysine 36 acetylationas a highlyconservedhistonemodification J. Biol. Chem.282 7632–40

[145] KroganN J et al 2003Methylation of histoneH3 by Set2 insaccharomyces cerevisiae is linked to transcriptionalelongation by RNApolymerase IIMol. Cell. Biol. 23 4207–18

[146] Casadio F, LuX, Pollock S B, LeRoyG, Garcia BA,Muir TW,Roeder RG andAllis CD 2013H3R42me2a is a histonemodificationwith positive transcriptional effectsProc. NatlAcad. Sci. 110 14894–9

[147] Hurd P J et al 2009 Phosphorylation of histoneH3Thr-45 islinked to apoptosis J. Biol. Chem. 284 16575–83

[148] Ozdemir A, Spicuglia S, Lasonder E, VermeulenM,Campsteijn C, StunnenbergHGand Logie C 2005Characterization of lysine 56 of histoneH3 as an acetylationsite in Saccharomyces cerevisiae J. Biol. Chem. 280 25949–52

[149] YuY, SongC, ZhangQ,DiMaggio PA,Garcia BA, YorkA,CareyMF andGrunsteinM2012HistoneH3 lysine 56methylation regulates DNA replication through itsinteractionwith PCNAMol. Cell 46 7–17

[150] VermeulenM et al 2010Quantitative interaction proteomicsand genome-wide profiling of epigenetic histonemarks andtheir readersCell 142 967–80

[151] DiCerboV et al 2014Acetylation of histoneH3 at lysine 64regulates nucleosome dynamics and facilitates transcriptioneLife 3 1–23

[152] FengQ,WangH,NgHH, Erdjument-BromageH, Tempst P,Struhl K andZhang Y 2002Methylation ofH3-lysine 79 ismediated by a new family ofHMTases without a SET domainCurr. Biol. 25 1052–8

[153] KroganN J et al 2003The Paf1 complex is required forhistoneH3methylation byCOMPASS andDot1p: linkingtranscriptional elongation to histonemethylationMol. Cell 11721–9

[154] HuyenY, ZgheibO,Ditullio RA Jr, Gorgoulis VG,Zacharatos P, Petty T J, Sheston EA,MellertH S,Stavridi E S andHalazonetis TD2004Methylated lysine 79 ofhistoneH3 targets 53BP1 toDNAdouble-strand breaksNature 432 406–11

[155] North J A et al 2011 Phosphorylation of histoneH3(T118)alters nucleosome dynamics and remodelingNucleic AcidsRes. 39 6465–74

[156] KrugerW, PetersonCL, Sil A, CoburnC, Arents G,Moudrianakis EN andHerskowitz I 1995Amino acidsubstitutions in the structured domains of histonesH3 andH4 partially relieve the requirement of the yeast SWI/SNFcomplex for transcriptionGenesDev. 9 2770–9

[157] Tropberger P et al 2013Regulation of transcription throughacetylation ofH3K122 on the lateral surface of the histoneoctamerCell 152 859–72

[158] Strahl BD et al 2001Methylation of histoneH4 at arginine 3occurs in vivo and ismediated by the nuclear receptorcoactivator PRMT1Curr. Biol. 11 996–1000

[159] ParthunMR,Widom J andGottschling D E 1996 Themajor cytoplasmic histone acetyltransferase in yeast: linksto chromatin replication and histonemetabolismCell 8785–94

[160] KuoMH,Brownell J E, Sobel R E, Ranalli TA, CookRG,EdmondsonDG, Roth SY andAllis CD 1996Transcription-linked acetylation byGcn5p of histonesH3 andH4 at specificlysinesNature 383 269–72

[161] Kelly T J,Qin S, GottschlingDE and ParthunMR2000TypeB histone acetyltransferaseHat1p participates in telomericsilencingMol. Cell Biol. 20 7051–8

[162] Hilfiker A,Hilfiker-KleinerD, Pannuti A and Lucchesi J C1997mof, a putative acetyl transferase gene related to theTip60 andMOZhuman genes and to the SAS genes of yeast, isrequired for dosage compensation in drosophila EMBO J. 162054–60

[163] NishiokaK et al 2002 PR-Set7 is a nucleosome-specificmethyltransferase thatmodifies lysine 20 of histoneH4 and isassociatedwith silent chromatinMol. Cell 9 1201–13

[164] Schotta G, LachnerM, SarmaK, Ebert A, Sengupta R,Reuter G, ReinbergD and Jenuwein T 2004A silencingpathway to induceH3-K9 andH4-K20 trimethylation atconstitutive heterochromatinGenesDev. 18 1251–62

[165] Sanders S L, PortosoM,Mata J, Bähler J, Allshire RC andKouzarides T 2004Methylation of histoneH4 lysine 20controls recruitment of Crb2 to sites ofDNAdamageCell 119603–14

[166] Kang B, PuM,HuG,WenW,DongZ, ZhaoK,Stillman B andZhang Z 2011 Phosphorylation ofH4 Ser 47promotesHIRA-mediated nucleosome assemblyGenesDev.25 1359–64

[167] Ye J, Ai X, Eugeni E E, Zhang L, Carpenter L R, JelinekMA,FreitasMA and ParthunMR2005HistoneH4 lysine 91acetylation a core domainmodification associatedwithchromatin assemblyMol. Cell 18 123–30

[168] SimpsonRT, ThomaF andBrubaker JM1985Chromatinreconstituted from tandemly repeated clonedDNAfragments and core histones: amodel system for study ofhigher order structureCell 42 799–808

[169] Lowary PT andWidom J 1998NewDNA sequence rules forhighaffinity binding to histone octamer and sequence-directed nucleosome positioning J.Mol. Biol. 276 19–42

[170] Schlick T, Collepardo-Guevara R,Halvorsen LA, Jung S andXiaoX 2011 Biomolecularmodeling and simulation: afieldcoming of ageQ.Rev. Biophys. 44 191–228

[171] SunY, ZhengG,Mei C, BohmE J, Phillips J C andKalé LV2012Optimizing fine-grained communication in abiomolecular simulation application on cray XK6 Proc. SC ’12Proc. Int. Conf. onHigh Performance Computing, Networking,Storage andAnalysis 55 1–11

[172] Phillips J C, BraunR,WangW,Gumbart J, Tajkhorshid E,Villa E, Chipot C, Skeel RD, Kalé L and SchultenK 2005Scalablemolecular dynamics withNAMD J. Comput. Chem.26 1781–802

[173] Kumar S,HuangC, ZhengG, BohmE, Bhatele A, Phillips J C,YuHandKalé LV 2008 Scalablemolecular dynamics withNAMDon the IBMblue gene/L system IBM J. Res. Dev. 52177–88

[174] Grossman J P, Towles B, GreskampB and ShawDE2015Filtering, reductions and synchronization in theAnton 2network IEEE 29th Int. Parallel andDistributed Proc. Symp.860–70

[175] BiswasM, Langowski J and BishopTC2013Atomisticsimulations of nucleosomesWIReSComput.Mol. Sci. 3378–92

[176] Schlick T 2009Monte Carlo, harmonic approximation, andcoarse-graining approaches for enhanced sampling ofbiomolecular structure F1000 Biol. Rep. 1

[177] Schlick T 2009Molecular-dynamics based approaches forenhanced sampling of long-time, large-scale conformationalchanges in biomolecules F1000 Biol. Rep. 1

[179] Wabik J, Kmiecik S, GrontD, KouzaMandKolinski A 2013Combining coarse-grained proteinmodels with replica-exchange all-atommolecular dynamics Int. J.Mol. Sci. 149893–905

[179] PerišićOand LuH2014On the improvement of free-energycalculation from steeredmolecular dynamics simulationsusing adaptive stochastic perturbation protocols PLoSOne 9e101810

[180] AbramsC andBussi G 2014 Enhanced sampling inmoleculardynamics usingmetadynamics, replica-exchange, andtemperature-acceleration Entropy 16 163–99

[181] AkimovAV and PrezhdoOV2015 Large-scale computationsin chemistry: a bird’s eye view of a vibrant fieldChem. Rev.115 5797–890

[182] Bernardi RC,MeloMCR and SchultenK2015 Enhancedsampling techniques inmolecular dynamics simulations ofbiological systemsBiochim. Biophys. Acta 1850 872–7

[183] FujisakiH,MoritsuguK,Matsunaga Y,Morishita T andMaragliano L 2015 Extended phase-spacemethods forenhanced sampling inmolecular simulations: a reviewFrontiers Bioeng. Biotechnol. 3

16

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 17: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

[184] PanAC,Weinreich TM, Piana S and ShawDE2016Demonstrating anOrder-of-Magnitude SamplingEnhancementMolecular Dynamics Simulations of ComplexProtein Systems Journal of Chemical Theory andComputation12 1360–7

[185] Richards FM1977Areas, volumes, packing, and proteinstructureAnnu. Rev. Biophys. Bioeng. 6 151–76

[186] RouxB and SimonsonT 1999 Implicit solventmodelsBiophys. Chem. 78 1–20

[187] WoodcockC,Grigoryev S,Horowitz R andWhitakerN 1993A chromatin foldingmodel that incorporates linkervariability generates fibers resembling the native structuresProc. Natl Acad. Sci. 90 9021–5

[188] Schiessel H,GelbartWMandBruinsmaR 2001DNA folding:structural andmechanical properties of the two-anglemodelfor chromatinBiophys. J. 80 1940–56

[189] WedemannGand Langowski J 2002Computer simulation ofthe 30 nm chromatinfiberBiophys. J. 82 2847–59

[190] Stehr R, KepperN, RippeK andWedemannG2008Theeffect of internucleosomal interaction on folding of thechromatin fiberBiophys. J. 95 3677–91

[191] Stehr R, Schöpflin R, Ettig R, KepperN, Rippe K andWedemannG2010 Exploring the conformational space ofchromatin fibers and their stability by numerical dynamicphase diagramsBiophys. J. 98 1028–37

[192] KatritchV, BustamanteC andOlsonW2000Pullingchromatin fibers: computer simulations of direct physicalmicromanipulations J.Mol. Biol. 295 29–40

[193] MüllerO, KepperN, Schöpflin R, Ettig R, Rippe K andWedemannG2014Changing chromatinfiber conformationby nucleosome repositioningBiophys. J. 107 2141–50

[194] KunzeK-K andNetz RR 2000 Salt-inducedDNA-Histonecomplexation Phys. Rev. Lett. 85 4389–92

[195] Schiessel H 2003The physics of chromatin J. Phys. Condens.Matter 15 699–774

[196] Cherstvy AG andWinkler RG 2004Complexation ofsemiflexible chainswith oppositely charged cylinder J. Chem.Phys. 120 9394–400

[197] KorolevN, Lyubartsev A P and LaaksonenA 2004Electrostatic background of chromatin fiber stretchingJ. Biomol. Struct. Dyn. 22 215–26

[198] Fenley AT, AdamsDA andOnufriev AV2010Charge stateof the globular histone core controls stability of thenucleosomeBiophys. J. 99 1577–85

[199] PachovGV,Gabdoulline RR andWadeRC2011On thestructure and dynamics of the complex of the nucleosomeand the linker histoneNucleic Acids Res. 39 5255–63

[201] Luque A,Collepardo-Guevara R,Grigoryev S and Schlick T2014Dynamic condensation of linker histoneC-terminaldomain regulates chromatin structureNucleic Acids Res. 427553–60

[201] ZhangQ, BeardD and Schlick T 2003Constructing irregularsurfaces to enclosemacromolecular complexes formesoscalemodeling using the discrete surface charge optimization(DISCO) algorithm J. Comput. Chem. 24 2063–74

[202] Schlick T, Li B andOlsonWK1994The influence of salt onthe structure and energetics of supercoiledDNABiophys. J. 672146–66

[203] Caterino TL andHayes J J 2011 Structure of theH1C-terminal domain and function in chromatin condensationBiochem. Cell Biol. 89 35–44

[204] Collepardo-Guevara R and Schlick T 2011The effect of linkerhistone’s nucleosome binding affinity on chromatinunfoldingmechanismsBiophys. J. 101 1670–80

[205] Collepardo-Guevara R and Schlick T 2012Crucial role ofdynamic linker histone binding and divalent ions forDNAaccessibility and gene regulation revealed bymesoscalemodeling of oligonucleosomesNucleic Acids Res. 40 8803–17

[206] BeardD and Schlick T 2001Modeling salt-mediatedelectrostatics ofmacromolecules: the discrete surface chargeoptimization algorithm and its application to the nucleosomeBiopolymers 58 106–15

[207] Collepardo-Guevara R and Schlick T 2014Chromatinfiberpolymorphism triggered by variations of DNA linker lengthsProc. Natl Acad. Sci. 111 8061–6

[208] Collepardo-Guevara R and Schlick T 2011The effect of linkerhistone’s nucleosome binding affinity on chromatinunfoldingmechanismsBiophys. J. 111 1670–80

[209] Luque. A and Schlick T 2016A sensitive interplay betweennucleosome repeat length, linker histone density, and histonetail interactionsBiophys. J. in revision

[210] OzerG, Collepard-Guevara R and Schlick T 2015 Forcedunraveling of chromatin fibers with nonuniform linkerDNAlengths J. Phys.: Condens.Matter. 27 064113

[211] RoccatanoD, Barthel A andZachariasM2007 Structuralflexibility of the nucleosome core particle at atomicresolution studied bymolecular dynamics simulationBiopolymers 85 407–21

[212] Mangenot S, Leforestier A, Vachette P, DurandD andLivolant F 2002 Salt-induced conformation andinteraction changes of nucleosome core particlesBiophys. J.82 345–56

[213] GansenA, Valeri A,Hauger F, Felekyan S, Kalinin S, TothK,Langowski J and Seidel CAM2009Nucleosome disassemblyintermediates characterized by single-molecule FRETProc.Natl Acad. Sci. 106 15308–13

[214] LiG, LevitusM, Bustamante C andWidom J 2004Rapidspontaneous accessibility of nucleosomalDNANat. Struct.Mol. Biol. 12 46–53

[215] Ettig R, KepperN, Stehr R,WedemannGandRippeK2011DissectingDNA-histone interactions in the nucleosome bymolecular dynamics simulations ofDNAunwrappingBiophys. J. 101 1999–2008

[216] WocjanT, Klenin K and Langowski J 2009 Browniandynamics simulation ofDNAunrolling from the nucleosomeJ. Phys. Chem.B 113 2639–46

[217] Dobrovolskaia I V andAryaG 2012Dynamics of forcednucleosome unraveling and role of nonuniformhistone-DNA interactionsBiophys. J. 103 989–98

[218] YangD andAryaG 2011 Structure and binding of theH4histone tail and the effects of lysine 16 acetylation Phys. Chem.Chem. Phys. 13 2911–21

[219] Zhou J, Fan J Y, RangasamyDandTremethickD J 2007Thenucleosome surface regulates chromatin compaction andcouples it with transcriptional repressionNat. Struct.Mol.Biol. 14 1070–6

[220] PotoyanDA and PapoianGA2012Regulation of theH4 tailbinding and folding landscapes via Lys-16 acetylation Proc.Natl Acad. Sci. 109

[221] PapamokosGV, TziatzosG, PapageorgiouDG,Georgatos SD, PolitouA S andKaxiras E 2012 Structural roleof RKSmotifs in chromatin interactions: amoleculardynamics study ofHP1 bound to a variablymodified histonetailBiophys. J. 102 1926–33

[223] VelloreNA andBaronR 2013Molecular dynamicssimulations indicate an induced-fitmechanism for LSD1/CoREST-H3-histonemolecular recognitionBMCBiophys.6 15

[223] ZhouBR, Jiang J, FengH,GhirlandoR, XiaoT S andBai Y2015 Structuralmechanisms of nucleosome recognition bylinker histonesMol. Cell 59 1–11

[224] ZhouBR, FengH,KatoH,Dai L, Yang Y, ZhouY andBai Y2013 Structural insights into the histoneH1-nucleosomecomplexProc. Natl Acad. Sci. 110 19390–5

[225] Marti-RenomMAandMirny LA 2011 Bridging theresolution gap in structuralmodeling of 3D genomeorganization PLoSComput. Biol. 7 e1002125

[227] NaganoT, Lubling Y, Stevens T J, Schoenfelder S, Yaffe E,DeanW, Laue ED, TanayA and Fraser P 2013 Single-cell Hi-C reveals cell-to-cell variability in chromosome structureNature 502 59–64

[227] Rouquette J, CremerC, Cremer T and Fakan S 2010Functional nuclear architecture studied bymicroscopy:present and future Int. Rev. CellMol. Biol. 282 1–90

17

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick

Page 18: Computational strategies to address chromatin structure problems · 2016-07-01 · Lys5(mammals,S. cerevisiae) x 3 Transcriptionactivation,unknown [50,100,101] Lys7(S.cerevisiae)

[228] Grosberg AYu,Nechaev SK and Shakhnovich E I 1988Therole of topological constraints in the kinetics of collapse ofmacromolecules J. Phys. 49 2095–100

[229] Grosberg A, Rabin Y,Havlin S andNeer A 1993Crumpledglobulemodel of the three-dimensional structure of DNAEurophys. Lett. 23 373–8

[230] Mirny LA 2011The fractal globule as amodel ofchromatin architecture in the cellChromosomeRes. 1937–51

[231] Lieberman-Aiden E et al 2009Comprehensivemapping oflong-range interactions reveals folding principles of thehuman genome Science 326 289–93

[232] NaumovaN, ImakaevM, FudenbergG, ZhanY, Lajoie BR,Mirny LA andDekker J 2013Organization of themitoticchromosome Science 342 948–53

[233] BarbieriM,ChotaliaM, Fraser J, Lavitas LM,Dostie J,PomboA andNicodemiM2012Complexity ofchromatin folding is captured by the strings andbinders switchmodel Proc. Natl Acad. Sci. 10916173–8

[234] Gehlen LR,Gruenert G, JonesMB, Rodley CD,Langowski J andO’Sullivan J 2012Chromosome positioningand the clustering of functionally related loci in yeast is drivenby chromosomal interactionsNucleus 3 370–83

[235] Jost D, Carrivain P, Cavalli G andVaillant C 2014Modelingepigenome folding: formation and dynamics of topologicallyassociated chromatin domainsNucleic Acids Res. 42 9553–61

[236] BruinsmaR,Grosberg AY, Rabin Y andZidovska A 2014Chromatin hydrodynamicsBiophys. J. 106 1871–81

[237] Zidovska A,WeitzDA andMitchison T J 2013Micron-scalecoherence in interphase chromatin dynamics Proc. Natl Acad.Sci. 110 15555–60

[238] Isaacson SA,McQueenDMandPeskinC S 2011Theinfluence of volume exclusion by chromatin on the timerequired tofind specificDNAbinding sites by diffusion Proc.Natl Acad. Sci. 108 3815–20

[239] Isaacson SA, Larabell CA, LeGrosMA,McQueenDMandPeskinC S 2013The influence of spatial variation inchromatin density determined by x-ray tomograms on thetime to findDNAbinding sitesBull.Math. Biol. 75 2093–117

18

Phys. Biol. 13 (2016) 035006 OPerišić andT Schlick