Understanding Spatial Genome Organization: Methods and Insights
2016-03-31VijayRamaniJayShendureZhijunDuan
Vijay Ramani, Jay Shendure, Zhijun Duan
Understanding Spatial Genome Organization: Methods and Insights
Vijay Ramani1,a, Jay Shendure1,5,b, Zhijun Duan2,3,4,*,c
1Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
2Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA 98109, USA
3Division of Hematology, University of Washington, Seattle, WA 98195, USA
4Department of Medicine, University of Washington, Seattle, WA 98195, USA
5Howard Hughes Medical Institute, Seattle, WA 98195, USA
Received 31 December 2015; revised 20 January 2016; accepted 21 January 2016
Available online 11 February 2016
Handled by Zhihua Zhang
E-mail: zjduan@uw.edu (Duan Z).
aORCID: 0000-0003-3345-5960.
bORCID: 0000-0002-1516-1865.
cORCID: 0000-0002-8147-793X.
Peer review under responsibility of Beijing Institute of Genomics,
Chinese Academy of Sciences and Genetics Society of China.
http://dx.doi.org/10.1016/j.gpb.2016.01.002
1672-0229ⓒ2016 The Authors. Production and hosting by Elsevier B.V. on behalf of Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China.
This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
KEYWORDS
Chromatin; Chromosome; Epigenomics; 4D nucleome; Hi-C
Abstract The manner by which eukaryotic genomes are packaged into nuclei while maintaining crucial nuclear functions remains one of the fundamental mysteries in biology. Over the last ten years, we have witnessed rapid advances in both microscopic and nucleic acid-based approaches to map genome architecture, and the application of these approaches to the dissection of higherorder chromosomal structures has yielded much new information. It is becoming increasingly clear, for example, that interphase chromosomes form stable, multilevel hierarchical structures. Among them, self-associating domains like so-called topologically associating domains (TADs) appear to be building blocks for large-scale genomic organization. This review describes features of these broadly-defined hierarchical structures, insights into the mechanisms underlying their formation, our current understanding of how interactions in the nuclear space are linked to gene regulation, and important future directions for the field.
Introduction
The human body consists of many trillions of cells harboring nearly identical genomes, and yet subsets of these cells are distinct both functionally and morphologically. It is widely accepted that‘epigenetic”mechanisms are responsible for the differential regulation of shared genetic information, and thus for the generation of a diverse array of terminal cell types through zygotic development.
The physical organization of eukaryotic chromosomes within a nucleus is crucially intertwined with the reading, interpretation, and propagation of genetic information by these epigenetic mechanisms. Metazoan cells package genomic DNA up to 2 m long into a tiny nuclear space~10 μm in diameter via hierarchy of organizational structures [1]. The compaction begins with the wrapping of 147 base pairs (bp)of DNA around a histone octamer to form the nucleosome; this nucleoprotein complex serves as the basic repeating unit of chromatin. The histone octamer itself is composed of eight subunits that assemble as one histone H3–H4 tetramer and two histone H2A–H2B dimers. Both DNA and histone components of the nucleosome particle can be subjected to a diversity of chemical modifications [2] (e.g., CpG methylation and lysine tail acetylation), and several histone variants exist [3] (e.g., H2A.Z, CENPA), thus enabling an epigenetic diversity at even this most basic level of chromatin organization. The next level of compaction is commonly believed to be the organization of nucleosomes into a 10 nm‘beads-on-string”chromatin fiber [4]. Additional nucleosomal organization into higher-order structures on the order of 30 nm or 100 nm has been hotly debated, and the existence of native structure beyond the 10 nm fiber has been questioned [5,6]. For instance, a recent study has proposed that native chromatin fibers in Saccharomyces cerevisiae are formed by heterogeneous clutches of nucleosomes that are interspersed with nucleosome-depleted regions, arguing against the existence of highly-ordered structures such as the 30 nm fiber [7]. Regardless of physical model, however, the chromatin fiber must ultimately fold into highly-condensed interphase chromosomes, a process that remains poorly understood.
Though we still know little of the dynamics of in vivo chromatin folding, we have gained important insights into the higher-order spatial organization of eukaryotic genomes, thanks to significant advances in DNA imaging technology and high-throughput biochemical techniques [8–11]. Eukaryotic genomes are clearly organized in the nucleus in a nonrandom way. In mammalian genomes, individual chromosomes preferentially occupy distinct nuclear areas, termed chromosome territories (CTs) [12]. Transcriptionally-silent regions are generally localized near the nuclear envelope and perinucleolar space, whereas transcriptionally-active regions occupy the remaining nuclear space [13,14]. At the cytological level, the eukaryotic genome is partitioned into euchromatin and heterochromatin [1]. At the molecular level, the nucleus is geometrically compartmentalized in mammalian cells to contain morphologically and molecularly distinct sub-structures (e.g., nuclear bodies), suggesting that nuclear activities are also spatially organized [15,16]. Individual chromosomes are partitioned into various physical or functional compartments and domains such as topologically associating domains (TADs) and chromatin loops [10,17–19]. Given the strong link between these common organizational features and cellular functions (e.g., transcription), it is tempting to speculate that the modulation of chromatin organization itself is a basic mechanism by which cellular functions are enacted. However, crucial experiments, some of which are mentioned later in this review, are still required to elucidate whether these observed structural features play some general causal roles, or are simply a correlative of the cellular functions. Even still, we can be reasonably certain that features like chromatin loops, clustered highlytranscribed genomic loci, and large-scale chromosome domains are basic elements of chromatin folding, and as such are invaluable when analyzing genome organization in the context of developmental and environmental cues.
Here, we first review well-established and emerging echnologies that are revolutionizing our understanding of higher-order genome architecture. We then discuss our current understanding of spatial genome organization in greater detail, providing insights into the mechanisms underlying structure formation, and the links between chromatin folding and gene regulation. Finally, we propose a handful of pressing questions we believe to be central to an ultimate understanding of the spatiotemporal organization and functions of nucleome.
Tools for exploring the 3D genome
Three largely-orthogonal approaches are commonly used to study the structure and function of the three-dimensional (3D) genome. Microscopy-based DNA imaging techniques and high-throughput genomic mapping tools based on massively-parallel sequencing have been used to delineate higher-order genomic architecture, while genome perturbation tools (i.e., genome-editing) have then been used to ascertain the functional significance of specific architectural elements. Over the last ten years, the field has witnessed tremendous methodological advances in all three areas [20–23].
Traditionally, chromosome and nuclear structure have been viewed through DNA imaging technologies, which can be based on electron microscopy [24,25], or light microscopy [25,26]. Electron microscopic techniques, including transmission electron microscopy (TEM) and cryo-electron microscopy (Cryo-EM), have typically been used to characterize cell-free systems. Cryo-EM, in particular, has become an increasingly popular structural biological tool, owing in part to dramatic improvements in resolution and ease of sample preparation [27]. Recently, Cryo-EM was used to determine an 11 A˚-resolution structure of 30-nm chromatin fibers assembled from arrays of 12 nucleosomes [28].
Before the advent of massively-parallel analyses by microarray and later high-throughput sequencing, our knowledge of 3D genome organization was largely derived from studies using fluorescence labeling-based light microscopy, such as DNA fluorescence in situ hybridization (FISH) [29] and live-cell imaging [30]. FISH and live-cell imaging can directly measure physical distances between DNA loci and visualize the nuclear position of loci and/or whole chromosomes within single cells. Today, many variants of the FISH technique exist, including conventional two-dimensional FISH (2D-FISH), 3D-FISH [31], and cryo-FISH [32], with the resolution approaching 100 kb [33]. More recently, a highthroughput imaging position mapping platform (HIPmap) has been implemented [34], presenting a crucial breakthrough in overcoming the limitations of scalability and throughput that are associated with conventional FISH techniques. While FISH assays are typically used to characterize only a few loci at a time, HIPmap enables large-scale (384-well format), automated, high-resolution localization of 3D gene positions in single cells. In addition to HIPmap, a quantitative high-resolution imaging approach, which combines FISH, array tomography (AT) imaging, and multiplexed immunostaining, has also been implemented for investigating 3D chromatin organization in complex tissues [35]. The development of the automated image analysis toolkits such as the aforementioned ones is likely to be critical as the field moves toward visualizing chromatin architecture in a large number of diverse contexts.
One commonly-cited limitation of light microscopic techniques, despite their versatility, is the resolution limit owing to the wavelength of light. To overcome this, several superresolution fluorescence microscopy approaches, such as structured illumination microscopy (SIM), stimulated emission depletion (STED), and photoactivation localization microscopy/stochastic optical reconstruction microscopy (PALM/ STORM), have been developed during the last decade (reviewed in [36]). These techniques have been applied to study higher-order nuclear architecture [37–40]. In addition, combined with more advanced fluorescent labeling techniques, these techniques have also been used to image chromosome dynamics with unprecedented spatiotemporal resolution in live cells at the single-molecule level [41]. In typical chromatin visualization experiments, either chromatin-associated proteins (e.g., core histone proteins) or the DNA itself must be labeled [37]. While the LacO/LacI DNA tagging system has long been used in live-cell imaging [14], recent developments in live-cell chromatin imaging have used fluorescently-tagged transcription activator-like effector (TALE) proteins [42] or the clustered regulatory interspaced short palindromic repeat (CRISPR)/Cas9 system [43,44] to specifically label loci.
Complementary to microscopy-based DNA imaging tools, biochemical tools decipher nuclear organization by measuring physical contacts between different genomic regions or between genomic DNA and other nuclear components. Initially coupled with oligonucleotide-decorated array (microarray) technology, and now typically paired with massively-parallel DNA sequencing [45], these biochemical tools enable genome-wide characterization of myriad aspects of higher-order chromosome structure and organization. Moreover, they can also allow for reconstruction and modeling of 3D genome architecture with the aid of sophisticated computational algorithms [46].
Broadly, the current state-of-the-art for these biochemical techniques can be classified into three groups, based on the biological origin of the chromatin contacts being assayed (Figure 1; Table 1). Methods that can detect DNA-protein interactions, such as chromatin immunoprecipitation (ChIP-seq), DNA adenine methyltransferase identification (DamID), and sedimentation fractionation, have been used for probing physical contacts between genomic loci and nuclear landmarks such as the nuclear envelope or nucleolus, providing information about where particular genomic loci are localized within the nucleus. In ChIP techniques [47], antibodies specific to a nuclear complex of interest are used to immunoprecipitate chemically-crosslinked sheared chromatin, and the associated DNA is used to create a high-throughput sequencing library. In DamID [48], bacterial adenine methyltransferase is fused to a protein of interest and allowed to interact with physically-proximal DNA. Sequences containing methylated adenine are enriched through digestion with Dam-specific restriction enzymes, and the products are then sequenced. In sedimentation fractionation [49], chromatin is subjected to ultracentrifugation and fractionation, and the DNA present in desired fractions is sequenced. Both ChIP and DamID have been used to identify genomic regions associated with nuclear pore complexes (NPCs) [20], while sedimentation fractionation has been used to isolate nucleolus-associated domains (NADs) [49]. Most commonly, the DamID approach has been used to catalog genomic regions that interact with the inner face of nuclear membrane, the so-called lamina-associated domains (LADs) [50–52].
The second class of methods includes those that probe chromatin–RNA interactions, a hotly debated class of interactions that may eventually be used to define chromatin domains or sub-nuclear bodies. Currently, there are three different methods for identifying chromatin–RNA interactions: chromatin isolation by RNA purification (ChIRP) [53], capture hybridization analysis of RNA targets (CHART) [54], and RNA antisense purification (RAP) [55]. All three techniques follow the same basic schema: crosslinked chromatin is sheared and then hybridized to biotinylated anti-sense oligonucleotides that are specific to a transcript or transcripts of interest. Following a streptavidin enrichment step, DNA coenriched with targeted RNA is subjected to deep sequencing. All three of these techniques have been used to study long noncoding RNAs (lncRNAs), including RoX in Drosophila melanogaster [56] and Xist in mouse [54,55], both of which play crucial roles in each species’respective dosage compensation mechanisms.
The third group of techniques covers the chromosome conformation capture (3C) family of methods [21,57], which measure the relative spatial proximity between individual genomic loci through digestion and re-ligation of physically-proximal chemically-crosslinked fragments of chromatin. 3C techniques are probably the most popular tools for mapping chromatin interactions, and a diversity of methods based on 3C have been developed during the past decade. 3C derivatives themselves can be classified into two groups (Table 2): (1) for globally mapping genome-scale chromatin interactions occurring in a nucleus, including Hi-C [58], tethered conformation capture (TCC) [59], single-cell Hi-C [60], DNase Hi-C [61], in situ Hi-C [62], in situ DNase Hi-C [63], and Micro-C [64]; and (2) for targeted detection of a subset of chromatin interactions, such as 3C [57], ChIP-loop [65], circularized chromosome conformation capture (4C) [66,67], enhanced 4C (e4C) [68], carbon-copy chromosome conformation capture (5C) [69], chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) [70,71], Capture-C [72], Capture-Hi-C [73], and targeted DNase Hi-C [61]. Since 2009, Hi-C and its variants have been used to generate whole-genome contact probability maps in bacteria [74–76], budding and fission yeast [64,77,78], a pathogenic eukaryote (Plasmodium falciparum) [79], plants [80,81], worm [82],fly [83,84], mouse [63,85], and human [58,62,85,86]. Depending on the protocol and depth of highthroughput sequencing used, the resolution of Hi-C-derived contact probability maps can be multiple orders of magnitude lower than that of genomic annotations at base-pair level. In the absence of incredibly-high sequencing depth, Hi-C and its variants are most suitable for identifying chromatin conformation signatures at the sub-megabase (Mb) or Mb scale, such as CTs, chromatin compartments, and TADs. Other chromatin conformation signatures, including so-called loops between promoters and other cis-elements, or pairs of binding sites for the transcription factor CTCF, are best carried out using the second group of approaches, which are each designed to map a specific set of chromatin interactions and thus allow for considerably-higher resolution for a given sequencing depth. Indeed, 4C, 5C, ChIA-PET, Capture-C, Capture-Hi-C, and targeted DNase Hi-C have all successfully been used to map specific regulatory interactions.
Figure 1 High-throughput biochemical techniques for probing the nucleomeHigh-throughput methods for probing the nucleome can broadly be grouped into three classes. (1) Methods detecting protein–DNA interactions include ChIP-seq, where antibodies specific to proteins of interest are used to co-precipitate crosslinked genomic DNA, and DNA adenine methyltransferase identification (DamID), in which a bacterial adenine methylase is used to methylate physically-proximal adenines. (2) Methods detecting RNA–DNA interactions include ChIRP, CHART, and RAP. Crosslinked chromatin is sheared and then hybridized to biotinylated anti-sense oligonucleotides specific to a transcript or transcripts of interest. In all of these methods, tagged or purified DNA is used to create a massively-parallel sequencing library. (3) The 3C family of methods are used to probe DNA–DNA interactions. While there are many different types of 3C assay, all 3C-based methods share the same core concept: chromatin interactions are measured by proximity ligation of fragmented and crosslinked chromatin. The key differences between these methods lie in how chromatin interactions are detected following proximity ligation. In ChIA-PET, crosslinked chromatin complexes are fragmented by sonication and chromatin interactions mediated by a protein of interest are enriched by ChIP before performing the proximity ligation. In 3C, 4C, and 5C, chromatin interactions of interest are enriched by PCR using locus-specific primers. In Hi-C and its variants, the valid chromatin interactions are enriched through a streptavidin-biotin-mediated pull-down. In targeted Hi-C methods, such as Capture-C, Capture Hi-C, and targeted DNase Hi-C, chromatin interactions of interest are enriched by applying hybrid capture technologies to 3C or Hi-C libraries.
Table 1 Biochemical tools for probing genomic interactions
Table 2 The 3C family
Crucially, data generated using 3C technologies may also be used to generate 3D predictions of genomic structure. The various computational approaches for tackling this problem have been reviewed in great detail elsewhere [87], and as such this review will not further address this large and still growing body of work.
The biochemical methods discussed above are able to offer detailed molecular views of chromosome structure. However, these assays are all performed on many thousands to millions of cells per experiment, thus masking the variability inherent between individual cells. Single-cell versions of ChIP-seq [88], Dam-ID [52], and Hi-C [60,89] have all been recently described, though in all cases the sensitivity of the assay is markedly low due to the difficulty in obtaining large amounts of DNA from single cells. Still, this field of single-cell chromatin profiling by high-throughput biochemical methods is nascent, and offers an interesting complement to traditional single-cell assays carried out through microscopy. The most accurate models for the spatiotemporal organization of eukaryotic genome architecture will likely be derived using a combination of high-resolution microscopy-based imaging technologies (FISH and live-cell imaging) and high-throughput, genome-wide single-cell biochemical approaches.
A major goal in the field of chromatin biology concerns characterizing the functional significance of the genomic regions identified using the aforementioned techniques. Several genome editing tools are currently available, including the zinc-finger nucleases (ZFNs) [90], transcription activator-like effector nucleases (TALENs), and the RNA-guided CRISPR/Cas9 system [22]. All of these tools have been used to perturb higher-order chromatin architecture through genome and epigenome editing [91–98]. Due to limited space, this review will not cover these tools and their applications, which have been reviewed elsewhere [99].
Organizational features of eukaryotic genomes and their relation to nuclear activities
Microscopy-based and high-throughput biochemical studies have revealed common organizational structures in eukaryotic genomes, including CTs, chromatin, and nuclear compartments, various types of chromatin domains (e.g., NADs, LADs, and TADs), and chromatin loops. In this section, we discuss their respective biophysical characteristics, and links between these structural features.
CTs and the nuclear position of chromosomes
The non-randomness of genome organization in the nuclear space at chromosome level was observed more than a century ago. The Rabl configuration, with centromeres and telomeres at opposite poles of the nucleus, was proposed by Carl Rabl in 1885 [100] and later confirmed by both microscopic and molecular studies in yeast and some plants [77,80,81]. In 1909, Theodor Boveri suggested that animal interphase chromosomes occupied distinct regions within the nucleus, for which Boveri introduced the term CTs. Since then, microscopic studies and genome-wide chromatin interaction mapping have revealed several features of CTs. First, although the existence of CTs in yeast and some plants is debatable, CTs as an organizational feature exist in the nuclei of a wide range of species, particularly mammals [85,86]. Second, each CT is predominantly a self-interacting entity that still harbors interactions with other CTs [12]. The physical clustering of centromeres, ribosomal DNA (rDNA) genes, and tRNA genes located on different chromosomes, which can be seen in species as divergent as S. cerevisiae and human, is a prime example of contacts occurring between different CTs [58,59,77,83,84]. Third, although the position of each CT is stochastic in a cell population (i.e., not the same in each cell), individual CTs show preferences for nuclear positioning in mammalian cells, which may correlate with genomic properties (e.g., GC content, gene density, and chromosome size), as well as with genomic functions (e.g., transcriptional activity and replication timing) [101–105]. In general, large and gene-poor chromosomes tend to be located near the nuclear periphery, whereas small and gene-rich chromosomes group together near the center of the nucleus. For example, human chromosomes 18 (gene-poor) and 19 (gene-rich) are localized preferentially to the periphery and center of the nucleus in human lymphocytes, respectively [102]. Interestingly, homologous chromosomes in diploid cells are generally found to be far apart from each other in the interphase [106]. Fourth, in each cell, the relative position of CTs is stably maintained from mid G1 to late G2/early prophase during the cell cycle; this has been demonstrated in both HeLa cells and normal rat kidney (NRK) cells [107,108]. Whether these global chromosomal arrangements are transmitted through mitosis, however, remains unknown. In NRK cells, this is believed to be the case [107], while in HeLa and HT1080 fibrosarcoma cells, this appears to not be the case [108,109]. Fifth and the last, while the functional significance of a given CT’s positional preference remains unknown, the spatial configurations of chromosomes relative to one another are tissue-specific [110] and may even be evolutionarily-conserved [111]. As an example of tissue specificity, X chromosomes are localized more peripherally in liver cells compared to kidney cells [110].
Chromatin folding and compartmentalization of nuclear activities
At any given time within a living cell’s interphase chromosomes, certain genomic loci may be embedded in a constitutive heterochromatin region, some may associate with the nuclear lamina, some may be attached to the nucleolus, and others may be embedded in the various sub-nuclear bodies, engaging in specific nuclear activities. One widely-held model for transcription postulates that active genes may co-localize into discrete‘transcription factories”, where high local concentrations of RNA polymerase II (RNAPII) and basal transcriptional machinery enforce gene expression [112]. Thus, in any given nucleus of a eukaryotic cell, along an interphase chromatin fiber, packing state is heterogeneous and tightly associated with local epigenetic state. This supports the notion that chromatin folding is somehow influenced by various nuclear processes (e.g., transcription and DNA replication/repair) and constrained by nuclear context (e.g., geometrical heterogeneity). Microscopic and molecular studies have identified several chromatin domains, with each representing some aspect of chromatin folding. Here we summarize the characteristics of the most commonly-discussed chromatin domains and review how they relate among each other.
A/B compartments
Hi-C studies have revealed that within CTs, chromosomes are partitioned into large compartments at the multi-Mb scale, containing either the active and open (A compartments) or inactive and closed chromatin (B compartments) [58]. The open A compartments contain high GC-content regions, are generich, and are generally highly transcribed. They are enriched in DNase I hypersensitivity and histone modifications marking active (H3K36me3) and poised chromatin (H3K27me3). In contrast, B compartments are gene-poor, less transcriptionally active, and enriched in high levels of the silencing H3K9me3 mark [58]. It is interesting to consider the extent to which A/ B compartments are correlated with cytogenetically-defined euchromatin/heterochromatin. The A compartments preferentially cluster with other A compartments throughout the genome, as do B compartments. B compartments are also highly correlated with late replication timing and LADs, suggesting that their nuclear position might be close to the nuclear periphery [113]. A recent high-resolution Hi-C study found that the two compartments can be further subdivided into sixsub-compartments (A1, A2, and B1-B4) [62]. A/B compartments and sub-compartments have also been found to be celltype specific and are each associated with distinct chromatin patterns [58,62]. This represents, a sensible finding given that different cell types express gene sets driven by distinct groups of regulatory elements. Thus, the compartmentalization of CTs into distinct A/B compartments and sub-compartments is directly correlated with the cell type-specific gene expression and chromatin status of the genome. Indeed, A/B compartments revealed by Hi-C can be reconstructed by using a variety of epigenomic data, reflecting genome-wide DNA methylation or chromatin accessibility patterns [114].
Self-interacting domains With increases in resolution provided by a greater depth of sequencing, recent Hi-C and 5C studies have revealed that CTs and A/B compartments may be broken down further into smaller self-interacting domains, which have been identified in the genomes of a wide range of species from bacteria to human [115,116]. In metazoan genomes, these chromatin-folding modules are called physical domains in flies [84] or TADs in mammalian cells [85,117], while in bacteria and yeast, these domains are typically referred to as chromosomal interacting domains (CIDs) [64,74]. TADs in mammalian genomes are several hundred kb up to 1–2 Mb in size (with a median size of about 800 kb in mouse) [85,117], and are smaller in flies (60 kb) [83,84], while CIDs are typically smaller [64,74].
While the formal definition for these self-interacting domains is quite broad, they all share common core properties. First, they are characterized by a greater frequency of withindomain interactions as compared to external interactions. This is in fact how TADs are identified in Hi-C data, through a measure of the directionality index (DI) of ligation pairs across a chromosome [85]. Identification of self-interacting domains is thus strongly dependent on the resolution of the Hi-C data set analyzed. This is evidenced by the much smaller selfinteracting domains (median length 185 kb), identified in both mouse and human cells in a recent high-resolution Hi-C study [62]. Second, domain boundary regions are generally enriched in transcription start sites, active transcription, active chromatin marks, housekeeping genes, tRNA genes, and short interspersed nuclear elements (SINEs), as well as binding sites for architectural proteins like CTCF and cohesin [85]. A recent study also highlighted the role of histone acetylation in the formation of TADs, suggesting that TADs are primarily built from nonacetylated nucleosomes and that TAD boundaries are composed of acetylated nucleosomes [118]. Third, TADs are evolutionarily conserved and cell-type independent [115,116], a feature that is expected, given the presence of housekeeping genes at TAD boundaries. Fourth, selfinteracting domains represent basic units of chromatin folding. This is supported by early microscopic studies showing that CTs consist of chromosomal domains (CDs) spanning 100 kb–1 Mb in size [119], the same length scale as for the recently defined self-interacting domains. This suggests that TADs and similar domains may represent the same structures as microscopy-defined CDs. Recent lines of evidence further strengthened this by linking TADs and chromatin packing directly in fly [120]. Hi-C studies on Drosophila polytene chromosomes revealed equivalence between polytene bands/ inter-bands and TAD/TAD boundaries, suggesting that different types of TADs correspond to distinct packing states. For example, inactive TADs, which contain fully-condensed chromatin at the nuclear periphery, correspond to classical heterochromatin, whereas active TADs (partially-packaged) and TAD boundaries (fully-extended chromatin fibers) correspond to classic euchromatin (less dense chromatin in the nuclear interior). Since these polytene bands are observed in single salivary gland cells, the correspondence of TADs to polytene bands also suggests that TADs are unlikely to be a statistical feature of population-level Hi-C experiments, but rather exist at the level of single cells. Recently, a super-resolution microscopy study on human and mouse cells using STORM revealed that nucleosomes are grouped into discrete clutches along the fiber, with areas of relative depletion between them [7]. The relationship between these‘clutches”of nucleosomes and self-interacting domains in metazoans remains unknown, though the recently published Micro-C method in yeast hints at a strong linkage between the two [64].
Though the definition of self-interacting domains has greatly helped our understanding of how chromatin might be organized in the nucleus, the functional relevance of these domains and the mechanisms underlying their formation remain poorly understood. To get at the function of particular domain boundaries, recent studies have employed genome editing to edit out or invert CTCF sites [94,96,98]. In some cases, this editing led to drastic changes in gene expression, particularly when single nucleotide polymorphisms (SNPs) in these CTCF sites were already implicated in genome-wide association studies (GWAS) for a particular syndrome. Another naturally-occurring example of this was recently shown in the context of brain cancer, where hypermethylation at particular CTCF sites in low-grade IDH1-mutant gliomas leads to differential CTCF binding, changes in genome topology, and consequent dysregulation of proto-oncogenes [121]. In other cases, however, inversion or deletion led to only slight changes in gene expression. The results of such experiments hint at the underlying complexity of gene regulation, perhaps suggesting that genome architecture alone is not the master regulator of gene expression.
Gene clustering in transcription factories
One common model for transcription posits the existence of transcription factories—discrete nuclear foci in eukaryotic nuclei where transcription occurs [122]. Biochemical purification of transcription factories associated with RNAPI, II, or III has demonstrated that transcription factories consist of nascent RNAs, genomic templates and regulatory DNA elements (e.g., enhancers), and a variety of proteins involved in transcription initiation, elongation, and regulation [122,123]. Several features of transcription factories have been revealed: (i)>95% of all nuclear transcription activities occur within transcription factories [124]; (ii) each transcription factory contains only one type of RNAP (I, II, or III) and the number of the RNAP molecules in a factory is variable among different cell types [124]; (iii) genes sharing the same factory can be on the same chromosome or on different chromosomes, and may be co-regulated or functionally unrelated [125]; (iv) the number of transcription factories found per nucleus depends largely on the species studied and the detection method used,ranging from a few hundreds to a few thousands [124]; and (v) the size of the factory varies depending on both the RNAP featured and cell types [124]. Given this model, the question of whether transcription factory formation is a byproduct of the process of transcription, or whether these are stable structures whose formation, in fact, precedes and/or drives transcription itself, remains unanswered. What is clear, however, is that the colocalization of genomic loci into these‘factories”is a strongly tissue-specific mark of both chromatin folding and 3D genome organization in the nucleus.
Nucleolar associating domains
The nucleolus is the largest subnuclear organelle in the nucleus of eukaryotic cells and is the prototype for transcription factories, as it serves as the primary site of rRNA biogenesis. In addition to its primary role as the site of rRNA transcription and maturation, the nucleolus also hosts several other biological processes, including viral replication, signal recognition particle biosynthesis, and sequestration of proteins (reviewed in [126]). Nucleoli assemble around the rDNA genes clustered from different chromosomes, where the genes are transcribed by RNAPI. In a given nucleus, only a subset of rDNA loci are transcribed at once, where they are looped into the nucleolus. The remaining rDNA loci are located at the periphery of the nucleolus to form constitutive heterochromatin. Genomic regions that interact frequently with the nucleolus are called nucleolar associating domains (NADs) [49,127]. NADs are characterized by repetitive DNA elements, mostly from centromeric and pericentromeric regions, are gene poor, and typically contain silent chromatin (e.g., regions of the inactive X chromosome (Xi), repressed olfactory receptor genes, tissuespecifically repressed RNAPII genes), and several RNAPIII-transcribed genes. NADs cover about 4% of the human genome and are significantly overlapped with LADs (discussed in further detail below), indicating that a certain amount of redistribution occurs between the nuclear lamina and nucleolar periphery after mitosis [49,127]. Mechanisms for this redistribution remain poorly understood, though it has been shown that nucleolus tethering may be mediated by trans acting factors such as CTCF, chromatin assembly factor (CAF)-1, nucleolar proteins, and potentially lncRNAs [126].
LADs
LADs refer to the regions of the genome that interact with the nuclear lamina at the interior of the nuclear envelope. LADs were first characterized using the DamID technique, which has revealed that mammalian LADs are large, gene-poor domains spanning 40 kb–30 Mb and covering~40% of the genome [50,128]. LADs are enriched for heterochromatic silencing marks, largely overlap with the previously-identified H3K9me2 locks, and show very sharp borders that are significantly enriched for bidirectional transcription, CpG islands, and CTCF binding sites [50]. These features are reminiscent of the borders found at self-associating domains. As with NADs, the mechanisms underlying tethering of LADs to the nuclear periphery largely remain unclear. However, a recent single-cell study has revealed that LADs showing stable contact (i.e., contact across many single cells) with the nuclear lamina (NL) are extremely gene poor, suggesting a structural role, whereas LADs with variable NL contacts tend to be cell-type specific [52]. Moreover, the consistency of NL contacts is inversely linked to gene activity in single cells and correlates positively with the heterochromatic histone modification H3K9me3 [52], suggesting that the tethering of LADs to the NL plays an important role in physically and functionally compartmentalizing eukaryotic genomes.
Chromatin loops and gene regulation
Looping is an intrinsic property of chromatin fibers and serves as the basic mechanism of chromatin folding. In as early as 1878, Walther Flemming observed large chromosomal loops in the so-called lampbrush chromosomes of amphibian oocytes [125]. Ptashne and others have since posited that long-range looping interactions may be key effectors of gene expression [129], a hypothesis that has gained credence, thanks to recent mapping efforts via 3C-based methods. The chromatin loop is likely tightly related to the formation of self-associating domains. For example, a recent work has shown that the stability of a TAD is determined by specific long-range loops within it [130]. The best-studied chromatin loops are those between genes and their distal regulatory elements, such as enhancers. One such example is the observation of an active chromatin hub (ACH) at the active beta- and alpha-globin loci. The ACH configuration is formed when multiple regulatory elements are juxtaposed against one another in 3D space via looping to coordinate gene expression [131].
Recent genome-wide mapping of chromatin interactions has uncovered general features of this type of loop. First, ~50% of active genes are engaged in long-range chromatin interactions in the cell types examined [132,133]. Notably, those active genes that are not found to interact with a distal enhancer are enriched in housekeeping genes [132]. Second, in addition to promoter–enhancer interactions, promoter–promoter and enhancer–enhancer loops have also been detected, and there is extensive co-localization among multiple promoters and/or multiple distal-acting enhancers [133,61,134,135]. Given that 3C-based methods are designed to detect second order interactions (i.e., pairs of interacting loci), the question remains whether an element interacts with multiple other elements simultaneously within the same nuclear environment, or whether these interactions actually occur within different single cells. As discussed in further detail below, arriving at an answer to these questions may become possible through the further development of single-cell epigenomic technologies. Third, promoter–enhancer interactions generally show high cell type specificity and are correlated with cell type-specific transcription [133,61,134,135], though it has been argued that promoter–enhancer loops are generally unchanged across tissue contexts and across development [132,136]. Collectively, these findings nonetheless underscore that chromatin looping is an important mechanism by which long-range interaction between distal regulatory elements and genes may be achieved.
Building upon these findings, recent functional studies using gene editing tools have further suggested a causal link between chromatin looping and gene regulation. It has long remained unclear whether looped interactions are a prerequisite for or merely a consequence of gene regulation. Direct evidence has been obtained recently, demonstrating that chromatin looping between a gene promoter and a strongenhancer can lead to transcriptional activation [137,138]. The Blobel group, in collaboration with synthetic ZFN pioneers Sangamo Biosciences, recently showed that chromatin loops may be induced between the globin locus control region (LCR) and the beta-globin promoter in GATA1 knock-out murine cells using synthetic zinc-finger proteins tethered to the self-association domain of Ldb1. These induced chromatin loops led to substantial activation of ß-globin transcription in the absence of GATA1 [137]. Using the same approach, the group also more recently demonstrated that forced LCR–promoter looping could lead to transcriptional reactivation of the developmentally-silenced fetal γ-globin gene in adult murine erythroblasts [138]. These new insights argue that, in the proper context, forced chromatin looping can directly guide transcriptional activity [99,139].
Many factors, including transcription factors (e.g., CTCF, YY1, and NRSF), co-activators (e.g., mediators), chromatin structural proteins (e.g., cohesin), and ncRNAs (e.g., Xist [140], Firre [141,142], and HOTTIP [143]) have been shown to play roles in mediating chromatin looping. The roles of CTCF and cohesin in spatial genome organization are by far the best characterized. Both CTCF and cohesin have been found to bind thousands to tens of thousands genomic sites, a significant portion of which are co-occupied by both proteins in mammalian cells [18,125,144]. Early studies also established CTCF as a transcription factor with versatile roles in transcription activation and repression, as well as a global insulator protein [145,146]. Cohesin is best known for its role in sister chromatid cohesion, chromosome segregation, and DNA repair [147]. Insights obtained from recent studies have also suggested that CTCF and cohesin play important roles in the hierarchical folding of the interphase chromosome, from chromatin looping to establishment of chromatin domains. It has been found that CTCF mediates thousands of chromatin loops in mouse and human genomes, which account for a substantial portion of all the loops detected in a genome [62,71,105,148–150]. The formation of CTCF-mediated loops requires cohesin, which also co-localizes with mediators to facilitate tissuespecific promoter–enhancer looping [151]. Moreover, it has been revealed that the orientation of CTCF binding guides directional chromatin looping [62,96,98,152]. This is in agreement with an extrusion model of loop formation [98,153].
It is believed that CTCF and cohesin also play important roles at the chromatin-domain level. CTCF binding has been found enriched at LAD boundaries, suggesting involvement in the formation of LADs [50]. CTCF and cohesin are also enriched at the boundaries of TADs, and depletion of cohesion and CTCF results in widespread changes in topological organization [154]. As mentioned briefly above, this was also shown recently by a study demonstrating that IDH mutations promote gliomagenesis by disrupting CTCF binding via hypermethylation, in turn disrupting TAD boundaries and allowing aberrant enhancer–promoter interactions to activate normallyinsulated oncogenes [121]. These results suggest a general role for CTCF and cohesin in chromatin folding and genome compartmentalization.
Future directions
The synthesis of classical microscopy-based approaches and more recent high-throughput biochemical techniques has led to an explosion in our knowledge of the physical organization of eukaryotic genomes. Through a diverse array of techniques including electron microscopy, FISH, ChIP-seq, DamID, as well as 3C and its derivatives, we are generating increasingly fine-scale catalogs of the chromatin loops, self-associating domains, and CTs that comprise the eukaryotic nuclear genome. Given this dense catalog of structural elements, then, we believe that the field will eventually move into two primary directions: (i) functional dissection of this vast catalog of structural elements, and (ii) large-scale characterization of the dynamics and mechanisms of chromatin folding both across biological processes such as differentiation, and across homogenous and heterogeneous cell populations.
Functional dissection of structural elements
The advent of CRISPR/Cas9 as an easy to use, highlymultiplexable system for perturbing primary sequence has opened up considerable avenues to testing the functional significance of genomic elements. We predict the continued use of genome editing reagents in validating key structural elements (e.g., CTCF binding sites), with respect to various phenotypes of interest (e.g., pathogenicity and dysregulation of global and local gene expression). Already, several groups have successfully utilized Cas9-mediated genome editing to generate clonal populations harboring inverted or deleted transcription factor binding sites, and have performed assays like Hi-C and RNA-seq to link structural and functional changes [94,96,98].
As low-throughput (e.g., test of single edited clones) approaches become more popular, we anticipate the eventual development of high-throughput screens for large-scale characterization of structural elements. Already, genome editingbased lentiviral and in vivo saturation mutagenesis screens have been employed, to dissect the functional significance of genes [155–157], codons [158], small insertions/deletions (indels) [159,160], and SNPs [158]. A key next step in determining the functional significance of cataloged elements will be employing such approaches to perturb key structural features in a variety of biological contexts; these experiments may be critical to eventually understanding the link between human disease phenotypes (e.g., cancer) and dysregulation of chromatin architecture.
Characterizing structural dynamics across time and space
Questions regarding the dynamics of chromatin—the processes by which chromatin architecture and state change as a function of a given biological process—remain largely unanswered. The nascent field of single-cell epigenomics [161], however, has offered a key set of tools that may finally be able to address such questions. While traditional epigenomic assays must be performed on populations of cells, single-cell epigenomics provide an opportunity to characterize heterogeneity within populations—an invaluable tool for both defining novel cell types from a heterogenous population (e.g., an organ system), and for characterizing transitory states in biological processes such as differentiation. Recently-published approaches such as single-cell DamID [52], single-cell ChIP-seq [88], and singlecell Hi-C [60] all provide valuable proof-of-concept for such assays. The next step, then, is to scale these approaches to easily process hundreds of thousands of single cells. Werecently described a method that leverages combinatorial DNA barcoding of single cells to provide chromatin accessibility information from thousands of cells in a single experiment [162]. Such approaches may be adapted to other epigenomic assays, including Hi-C, DamID, and ChIP, thus providing a way forward to achieving the required throughput to confidently define new cell types, or organize populations of cells going through some biological process into some sort of ‘pseudotime.”
It may also be useful to consider the marriage of single-cell biochemical techniques with complimentary microscopeacquired in situ transcriptomic datasets [163–166]. In situ transcriptomics may, for example, be necessary to properly spatially organize large populations of tissue-derived nuclei in some biologically meaningful way. Furthermore, by matching in situ transcriptomic data with replicate single-cell epigenomic experiments in this way, one may be able to link differential genome architectural features with gene regulatory phenomena, thus furthering our progress toward ultimately understanding the links between 3D genome architecture and gene regulation.
Of course, the application and development of any of these techniques is intertwined with the development of data analytical techniques. While algorithmic development will have to keep pace with the development of these technologies, we believe that incredible strides already made in the relatively young field of single-cell RNA sequencing [167] are a positive indicator that analytical methods will be able to keep pace with this exploding field.
Closing remarks
There are many fundamental and long-standing biological questions linked to 3D genome architecture, and we close by echoing a handful of them below. Does genome architecture itself define cellular identity? How does chromatin state (i.e., histone modifications and DNA methylation) impact higher-order chromatin structure? How might defects or differences in 3D genome architecture lead to human disease? Obtaining the knowledge necessary to answer these questions requires a multi-pronged approach employing creative microscopic, biochemical, and computational tools. As reviewed here, these are thankfully requirements that the field is actively addressing, suggesting that we will be well-positioned to answer many if not all of these pressing questions in the years to come.
Competing interests
The authors have declared that no competing interests exist.
Acknowledgments
This work is supported by the UW-CNOF Mapping Technology Development (Grant No. 1U54DK107979) from the National Institutes of Health, USA to JS and the UW Bridge Fund from the University of Washington, USA to ZD.
References
[1] Felsenfeld G, Groudine M. Controlling the double helix. Nature 2003;421:448–53.
[2] Zhou VW, Goren A, Bernstein BE. Charting histone modifications and the functional organization of mammalian genomes. Nat Rev Genet 2011;12:7–18.
[3] Talbert PB, Henikoff S. Histone variants–ancient wrap artists of the epigenome. Nat Rev Mol Cell Biol 2010;11:264–75.
[4] Oudet P, Gross-Bellard M, Chambon P. Electron microscopic and biochemical evidence that chromatin structure is a repeating unit. Cell 1975;4:281–300.
[5] Fussner E, Strauss M, Djuric U, Li R, Ahmed K, Hart M, et al. Open and closed domains in the mouse genome are configured as 10-nm chromatin fibres. EMBO Rep 2012;13:992–6.
[6] Joti Y, Hikima T, Nishino Y, Kamada F, Hihara S, Takata H, et al. Chromosomes without a 30-nm chromatin fiber. Nucleus 2012;3:404–10.
[7] Ricci MA, Manzo C, Garcı´a-Parajo MF, Lakadamyali M, Cosma MP. Chromatin fibers are formed by heterogeneous groups of nucleosomes in vivo. Cell 2015;160:1145–58.
[8] Gorkin DU, Leung D, Ren B. The 3D genome in transcriptional regulation and pluripotency. Cell Stem Cell 2014;14:762–75.
[9] Misteli T. Beyond the sequence: cellular organization of genome function. Cell 2007;128:787–800.
[10] Gibcus JH, Dekker J. The hierarchy of the 3D genome. Mol Cell 2013;49:773–82.
[11] Pombo A, Dillon N. Three-dimensional genome architecture: players and mechanisms. Nat Rev Mol Cell Biol 2015;16:245–57. [12] Cremer T, Cremer M. Chromosome territories. Cold Spring Harb Perspect Biol 2010;2:a003889.
[13] Finlan LE, Sproul D, Thomson I, Boyle S, Kerr E, Perry P, et al. Recruitment to the nuclear periphery can alter expression of genes in human cells. PLoS Genet 2008;4:e1000039.
[14] Gonzalez-Sandoval A, Towbin BD, Kalck V, Cabianca DS, Gaidatzis D, Hauer MH, et al. Perinuclear anchoring of H3K9-methylated chromatin stabilizes induced cell fate in C. elegans embryos. Cell 2015;163:1333–47.
[15] Dundr M, Misteli T. Biogenesis of nuclear bodies. Cold Spring Harb Perspect Biol 2010;2:a000711.
[16] Schneider R, Grosschedl R. Dynamics and interplay of nuclear architecture, genome organization, and gene expression. Genes Dev 2007;21:3027–43.
[17] Duan Z, Blau CA. The genome in space and time: does form always follow function? How does the spatial and temporal organization of a eukaryotic genome reflect and influence its functions? BioEssays 2012;34:800–10.
[18] Bouwman BA, de Laat W. Getting the genome in shape: the formation of loops, domains and compartments. Genome Biol 2015;16:154.
[19] Sexton T, Cavalli G. The role of chromosome domains in shaping the functional genome. Cell 2015;160:1049–59.
[20] van Steensel B, Dekker J. Genomics tools for unraveling chromosome architecture. Nat Biotechnol 2010;28:1089–95.
[21] de Wit E, de Laat W. A decade of 3C technologies: insights into nuclear organization. Genes Dev 2012;26:11–24.
[22] Gaj T, Gersbach CA, Barbas 3 CF. ZFN, TALEN, and CRISPR/Cas-based methods for genome engineering. Trends Biotechnol 2013;31:397–405.
[23] Risca VI, Greenleaf WJ. Unraveling the 3D genome: genomics tools for multiscale exploration. Trends Genet 2015;31:357–72.
[24] Daban JR. Electron microscopy and atomic force microscopy studies of chromatin and metaphase chromosome structure. Micron 2011;42:733–50.
[25] Rapkin LM, Anchel DR, Li R, Bazett-Jones DP. A view of the chromatin landscape. Micron 2012;43:150–8.
[26] Huang B, Babcock H, Zhuang X. Breaking the diffraction barrier: super-resolution imaging of cells. Cell 2010;143:1047–58.
[27] Callaway E. The revolution will not be crystallized: a new method sweeps through structural biology. Nature 2015;525:172–4.
[28] Song F, Chen P, Sun D, Wang M, Dong L, Liang D, et al. Cryo-EM study of the chromatin fiber reveals a double helix twisted by tetranucleosomal units. Science 2014;344:376–80.
[29] Langer-Safer PR, Levine M, Ward DC. Immunological method for mapping genes on Drosophila polytene chromosomes. Proc Natl Acad Sci U S A 1982;79:4381–5.
[30] Tsukamoto T, Hashiguchi N, Janicki SM, Tumbar T, Belmont AS, Spector DL. Visualization of gene activity in living cells. Nat Cell Biol 2000;2:871–8.
[31] Cremer M, Grasser F, Lanctot C, Muller S, Neusser M, Zinner R, et al. Multicolor 3D fluorescence in situ hybridization for imaging interphase chromosomes. Methods Mol Biol 2008;463:205–39.
[32] Branco MR, Branco T, Ramirez F, Pombo A. Changes in chromosome organization during PHA-activation of resting human lymphocytes measured by cryo-FISH. Chromosome Res 2008;16:413–26.
[33] Beliveau BJ, Joyce EF, Apostolopoulos N, Yilmaz F, Fonseka CY, McCole RB, et al. Versatile design and synthesis platform for visualizing genomes with Oligopaint FISH probes. Proc Natl Acad Sci U S A 2012;109:21301–6.
[34] Shachar S, Voss TC, Pegoraro G, Sciascia N, Misteli T. Identification of gene positioning factors using high-throughput imaging mapping. Cell 2015;162:911–23.
[35] Linhoff MW, Garg SK, Mandel G. A high-resolution imaging approach to investigate chromatin architecture in complex tissues. Cell 2015;163:246–55.
[36] Toomre D, Bewersdorf J. A new wave of cellular imaging. Annu Rev Cell Dev Biol 2010;26:285–314.
[37] Lakadamyali M, Cosma MP. Advanced microscopy methods for visualizing chromatin structure. FEBS Lett 2015;589:3023–30.
[38] Markaki Y, Smeets D, Fiedler S, Schmid VJ, Schermelleh L, Cremer T, et al. The potential of 3D-FISH and super-resolution structured illumination microscopy for studies of 3D nuclear architecture: 3D structured illumination microscopy of defined chromosomal structures visualized by 3D (immuno)-FISH opens new perspectives for studies of nuclear architecture. BioEssays 2012;34:412–26.
[39] Schermelleh L, Carlton PM, Haase S, Shao L, Winoto L, Kner P, et al. Subdiffraction multicolor imaging of the nuclear periphery with 3D structured illumination microscopy. Science 2008;320:1332–6.
[40] Wang W, Li GW, Chen C, Xie XS, Zhuang X. Chromosome organization by a nucleoid-associated protein in live bacteria. Science 2011;333:1445–9.
[41] Liu Z, Lavis LD, Betzig E. Imaging live-cell dynamics and structure at the single-molecule level. Mol Cell 2015;58:644–59.
[42] Miyanari Y, Ziegler-Birling C, Torres-Padilla ME. Live visualization of chromatin dynamics with fluorescent TALEs. Nat Struct Mol Biol 2013;20:1321–4.
[43] Chen B, Gilbert LA, Cimini BA, Schnitzbauer J, Zhang W, Li GW, et al. Dynamic imaging of genomic loci in living human cells by an optimized CRISPR/Cas system. Cell 2013;155:1479–91.
[44] Knight SC, Xie L, Deng W, Guglielmi B, Witkowsky LB, Bosanac L, et al. Dynamics of CRISPR-Cas9 genome interrogation in living cells. Science 2015;350:823–6.
[45] Shendure J, Lieberman Aiden E. The expanding scope of DNA sequencing. Nat Biotechnol 2012;30:1084–94.
[46] Ay F, Noble WS. Analysis methods for studying the 3D architecture of the genome. Genome Biol 2015;16:183.
[47] Park PJ. ChIP-seq: advantages and challenges of a maturing technology. Nat Rev Genet 2009;10:669–80.
[48] van Steensel B, Henikoff S. Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase. Nat Biotechnol 2000;18:424–8.
[49] Nemeth A, Conesa A, Santoyo-Lopez J, Medina I, Montaner D, Peterfia B, et al. Initial genomics of the human nucleolus. PLoS Genet 2010;6:e1000889.
[50] Guelen L, Pagie L, Brasset E, Meuleman W, Faza MB, Talhout W, et al. Domain organization of human chromosomes revealed by mapping of nuclear lamina interactions. Nature 2008;453:948–51.
[51] Kind J, Pagie L, Ortabozkoyun H, Boyle S, de Vries SS, Janssen H, et al. Single-cell dynamics of genome-nuclear lamina interactions. Cell 2013;153:178–92.
[52] Kind J, Pagie L, de Vries SS, Nahidiazar L, Dey SS, Bienko M, et al. Genome-wide maps of nuclear lamina interactions in single human cells. Cell 2015;163:134–47.
[53] Chu C, Qu K, Zhong FL, Artandi SE, Chang HY. Genomic maps of long noncoding RNA occupancy reveal principles of RNA–chromatin interactions. Mol Cell 2011;44:667–78.
[54] Simon MD, Pinter SF, Fang R, Sarma K, Rutenberg-Schoenberg M, Bowman SK, et al. High-resolution Xist binding maps reveal two-step spreading during X-chromosome inactivation. Nature 2013;504:465–9.
[55] Engreitz JM, Pandya-Jones A, McDonel P, Shishkin A, Sirokman K, Surka C, et al. The Xist lncRNA exploits threedimensional genome architecture to spread across the X chromosome. Science 2013;341:1237973.
[56] Quinn JJ, Ilik IA, Qu K, Georgiev P, Chu C, Akhtar A, et al. Revealing long noncoding RNA architecture and functions using domain-specific chromatin isolation by RNA purification. Nat Biotechnol 2014;32:933–40.
[57] Dekker J, Rippe K, Dekker M, Kleckner N. Capturing chromosome conformation. Science 2002;295:1306–11.
[58] Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, et al. Comprehensive mapping of longrange interactions reveals folding principles of the human genome. Science 2009;326:289–93.
[59] Kalhor R, Tjong H, Jayathilaka N, Alber F, Chen L. Genome architectures revealed by tethered chromosome conformation capture and population-based modeling. Nat Biotechnol 2012;30:90–8.
[60] Nagano T, Lubling Y, Stevens TJ, Schoenfelder S, Yaffe E, Dean W, et al. Single-cell Hi-C reveals cell-to-cell variability in chromosome structure. Nature 2013;502:59–64.
[61] Ma W, Ay F, Lee C, Gulsoy G, Deng X, Cook S, et al. Fine-scale chromatin interaction maps reveal the cis-regulatory landscape of human lincRNA genes. Nat Methods 2015;12:71–8.
[62] Rao SS, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 2014;159:1665–80.
[63] Deng X, Ma W, Ramani V, Hill A, Yang F, Ay F, et al. Bipartite structure of the inactive mouse X chromosome. Genome Biol 2015;16:152.
[64] Hsieh TH, Weiner A, Lajoie B, Dekker J, Friedman N, Rando OJ. Mapping nucleosome resolution chromosome folding in yeast by Micro-C. Cell 2015;162:108–19.
[65] Horike S, Cai S, Miyano M, Cheng JF, Kohwi-Shigematsu T. Loss of silent-chromatin looping and impaired imprinting of DLX5 in Rett syndrome. Nat Genet 2005;37:31–40.
[66] Simonis M, Klous P, Splinter E, Moshkin Y, Willemsen R, de Wit E, et al. Nuclear organization of active and inactive chromatin domains uncovered by chromosome conformation capture-on-chip (4C). Nat Genet 2006;38:1348–54.
[67] Zhao Z, Tavoosidana G, Sjolinder M, Gondor A, Mariano P, Wang S, et al. Circular chromosome conformation capture (4C) uncovers extensive networks of epigenetically regulated intraand interchromosomal interactions. Nat Genet 2006;38:1341–7.
[68] Schoenfelder S, Sexton T, Chakalova L, Cope NF, Horton A, Andrews S, et al. Preferential associations between co-regulated genes reveal a transcriptional interactome in erythroid cells. Nat Genet 2010;42:53–61.
[69] Dostie J, Richmond TA, Arnaout RA, Selzer RR, Lee WL, Honan TA, et al. Chromosome Conformation Capture Carbon Copy (5C): a massively parallel solution for mapping interactions between genomic elements. Genome Res 2006;16:1299–309.
[70] Fullwood MJ, Liu MH, Pan YF, Liu J, Xu H, Mohamed YB, et al. An oestrogen-receptor-alpha-bound human chromatin interactome. Nature 2009;462:58–64.
[71] Tang Z, Luo OJ, Li X, Zheng M, Zhu JJ, Szalaj P, et al. CTCF-mediated human 3D genome architecture reveals chromatin topology for transcription. Cell 2015;163:1611–27.
[72] Hughes JR, Roberts N, McGowan S, Hay D, Giannoulatou E, Lynch M, et al. Analysis of hundreds of cis-regulatory landscapes at high resolution in a single, high-throughput experiment. Nat Genet 2014;46:205–12.
[73] Mifsud B, Tavares-Cadete F, Young AN, Sugar R, Schoenfelder S, Ferreira L, et al. Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C. Nat Genet 2015;47:598–606.
[74] Le TB, Imakaev MV, Mirny LA, Laub MT. High-resolution mapping of the spatial organization of a bacterial chromosome. Science 2013;342:731–4.
[75] Burton JN, Liachko I, Dunham MJ, Shendure J. Species-level deconvolution of metagenome assemblies with Hi-C-based contact probability maps. G3 (Bethesda) 2014;4:1339–46.
[76] Marbouty M, Le Gall A, Cattoni DI, Cournac A, Koh A, Fiche JB, et al. Condensin- and replication-mediated bacterial chromosome folding and origin condensation revealed by Hi-C and Super-resolution imaging. Mol Cell 2015;59:588–602.
[77] Duan Z, Andronescu M, Schutz K, McIlwain S, Kim YJ, Lee C, et al. A three-dimensional model of the yeast genome. Nature 2010;465:363–7.
[78] Mizuguchi T, Fudenberg G, Mehta S, Belton JM, Taneja N, Folco HD, et al. Cohesin-dependent globules and heterochromatin shape 3D genome architecture in S. pombe. Nature 2014;516:432–5.
[79] Ay F, Bunnik EM, Varoquaux N, Bol SM, Prudhomme J, Vert JP, et al. Three-dimensional modeling of the P. falciparum genome during the erythrocytic cycle reveals a strong connection between genome architecture and gene expression. Genome Res 2014;24:974–88.
[80] Feng S, Cokus SJ, Schubert V, Zhai J, Pellegrini M, Jacobsen SE. Genome-wide Hi-C analyses in wild-type and mutants reveal high-resolution chromatin interactions in Arabidopsis. Mol Cell 2014;55:694–707.
[81] Grob S, Schmid MW, Grossniklaus U. Hi-C analysis in Arabidopsis identifies the KNOT, a structure with similarities to the flamenco locus of Drosophila. Mol Cell 2014;55:678–93.
[82] Crane E, Bian Q, McCord RP, Lajoie BR, Wheeler BS, Ralston EJ, et al. Condensin-driven remodelling of X chromosome topology during dosage compensation. Nature 2015;523:240–4.
[83] Hou C, Li L, Qin ZS, Corces VG. Gene density, transcription, and insulators contribute to the partition of the Drosophila genome into physical domains. Mol Cell 2012;48:471–84.
[84] Sexton T, Yaffe E, Kenigsberg E, Bantignies F, Leblanc B, Hoichman M, et al. Three-dimensional folding and functional organization principles of the Drosophila genome. Cell 2012;148:458–72.
[85] Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 2012;485:376–80.
[86] Dixon JR, Jung I, Selvaraj S, Shen Y, Antosiewicz-Bourget JE, Lee AY, et al. Chromatin architecture reorganization during stem cell differentiation. Nature 2015;518:331–6.
[87] Dekker J, Marti-Renom MA, Mirny LA. Exploring the threedimensional organization of genomes: interpreting chromatin interaction data. Nat Rev Genet 2013;14:390–403.
[88] Rotem A, Ram O, Shoresh N, Sperling RA, Goren A, Weitz DA, et al. Single-cell ChIP-seq reveals cell subpopulations defined by chromatin state. Nat Biotechnol 2015;33:1165–72.
[89] Nagano T, Lubling Y, Yaffe E, Wingett SW, Dean W, Tanay A, et al. Single-cell Hi-C for genome-wide detection of chromatin interactions that occur simultaneously in a single cell. Nat Protoc 2015;10:1986–2003.
[90] Urnov FD, Rebar EJ, Holmes MC, Zhang HS, Gregory PD. Genome editing with engineered zinc finger nucleases. Nat Rev Genet 2010;11:636–46.
[91] Maeder ML, Angstman JF, Richardson ME, Linder SJ, Cascio VM, Tsai SQ, et al. Targeted DNA demethylation and activation of endogenous genes using programmable TALE-TET1 fusion proteins. Nat Biotechnol 2013;31:1137–42.
[92] Mendenhall EM, Williamson KE, Reyon D, Zou JY, Ram O, Joung JK, et al. Locus-specific editing of histone modifications at endogenous enhancers. Nat Biotechnol 2013;31:1133–6.
[93] Therizols P, Illingworth RS, Courilleau C, Boyle S, Wood AJ, Bickmore WA. Chromatin decondensation is sufficient to alter nuclear organization in embryonic stem cells. Science 2014;346:1238–42.
[94] Lupianez DG, Kraft K, Heinrich V, Krawitz P, Brancati F, Klopocki E, et al. Disruptions of topological chromatin domains cause pathogenic rewiring of gene–enhancer interactions. Cell 2015;161:1012–25.
[95] Thakore PI, D’Ippolito AM, Song L, Safi A, Shivakumar NK, Kabadi AM, et al. Highly specific epigenome editing by CRISPR-Cas9 repressors for silencing of distal regulatory elements. Nat Methods 2015;12:1143–9.
[96] Guo Y, Xu Q, Canzio D, Shou J, Li J, Gorkin DU, et al. CRISPR Inversion of CTCF sites alters genome topology and enhancer/promoter function. Cell 2015;162:900–10.
[97] Hilton IB, D’Ippolito AM, Vockley CM, Thakore PI, Crawford GE, Reddy TE, et al. Epigenome editing by a CRISPR-Cas9-based acetyltransferase activates genes from promoters and enhancers. Nat Biotechnol 2015;33:510–7.
[98] Sanborn AL, Rao SS, Huang SC, Durand NC, Huntley MH, Jewett AI, et al. Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc Natl Acad Sci U S A 2015;112:E6456–65.
[99] Deng W, Blobel GA. Manipulating nuclear architecture. Curr Opin Genet Dev 2014;25:1–7.
[100] Cremer T, Cremer M, Hubner B, Strickfaden H, Smeets D, Popken J, et al. The 4D nucleome: evidence for a dynamic nuclear landscape based on co-aligned active and inactive nuclear compartments. FEBS Lett 2015;589:2931–43.
[101] Boyle S, Gilchrist S, Bridger JM, Mahy NL, Ellis JA, Bickmore WA. The spatial organization of human chromosomes within the nuclei of normal and emerin-mutant cells. Hum Mol Genet 2001;10:211–9.
[102] Croft JA, Bridger JM, Boyle S, Perry P, Teague P, Bickmore WA. Differences in the localization and morphology of chromosomes in the human nucleus. J Cell Biol 1999;145:1119–31.
[103] Grasser F, Neusser M, Fiegler H, Thormeyer T, Cremer M, Carter NP, et al. Replication-timing-correlated spatial chromatin arrangements in cancer and in primate interphase nuclei. J Cell Sci 2008;121:1876–86.
[104] Kosak ST, Groudine M. Form follows function: the genomic organization of cellular differentiation. Genes Dev 2004;18:1371–84.
[105] Takizawa T, Meaburn KJ, Misteli T. The meaning of gene positioning. Cell 2008;135:9–13.
[106] Heride C, Ricoul M, Kieu K, Hase von J, Guillemot V, Cremer C, et al. Distance between homologous chromosomes resultsfrom chromosome positioning constraints. J Cell Sci 2010;123:4063–75.
[107] Gerlich D, Beaudouin J, Kalbfuss B, Daigle N, Eils R, Ellenberg J. Global chromosome positions are transmitted through mitosis in mammalian cells. Cell 2003;112:751–64.
[108] Walter J, Schermelleh L, Cremer M, Tashiro S, Cremer T. Chromosome order in HeLa cells changes during mitosis and early G1, but is stably maintained during subsequent interphase stages. J Cell Biol 2003;160:685–97.
[109] Thomson I, Gilchrist S, Bickmore WA, Chubb JR. The radial positioning of chromatin is not inherited through mitosis but is established de novo in early G1. Curr Biol 2004;14:166–72.
[110] Parada LA, McQueen PG, Misteli T. Tissue-specific spatial organization of genomes. Genome Biol 2004;5:R44.
[111] Tanabe H, Muller S, Neusser M, Hase von J, Calcagno E, Cremer M, et al. Evolutionary conservation of chromosome territory arrangements in cell nuclei from higher primates. Proc Natl Acad Sci U S A 2002;99:4424–9.
[112] Eskiw CH, Cope NF, Clay I, Schoenfelder S, Nagano T, Fraser P. Transcription factories and nuclear organization of the genome. Cold Spring Harb Symp Quant Biol 2010;75:501–6.
[113] Ryba T, Hiratani I, Lu J, Itoh M, Kulik M, Zhang J, et al. Evolutionarily conserved replication timing profiles predict longrange chromatin interactions and distinguish closely related cell types. Genome Res 2010;20:761–70.
[114] Fortin JP, Hansen KD. Reconstructing A/B compartments as revealed by Hi-C using long-range correlations in epigenetic data. Genome Biol 2015;16:180.
[115] Ciabrelli F, Cavalli G. Chromatin-driven behavior of topologically associating domains. J Mol Biol 2015;427:608–25.
[116] Dekker J, Heard E. Structural and functional diversity of Topologically Associating Domains. FEBS Lett 2015;589:2877–84.
[117] Nora EP, Lajoie BR, Schulz EG, Giorgetti L, Okamoto I, Servant N, et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature 2012;485:381–5.
[118] Ulianov SV, Khrameeva EE, Gavrilov AA, Flyamer IM, Kos P, Mikhaleva EA, et al. Active chromatin and transcription play a key role in chromosome partitioning into topologically associating domains. Genome Res 2015;26:70–84.
[119] Cook PR, Brazell IA. Supercoils in human DNA. J Cell Sci 1975;19:261–79.
[120] Eagen KP, Hartl TA, Kornberg RD. Stable chromosome condensation revealed by chromosome conformation capture. Cell 2015;163:934–46.
[121] Flavahan WA, Drier Y, Liau BB, Gillespie SM, Venteicher AS, Stemmer-Rachamimov AO, et al. Insulator dysfunction and oncogene activation in IDH mutant gliomas. Nature 2016;529:110–4.
[122] Cook PR. A model for all genomes: the role of transcription factories. J Mol Biol 2010;395:1–10.
[123] Melnik S, Deng B, Papantonis A, Baboo S, Carr IM, Cook PR. The proteomes of transcription factories containing RNA polymerases I, II or III. Nat Methods 2011;8:963–8.
[124] Buckley MS, Lis JT. Imaging RNA Polymerase II transcription sites in living cells. Curr Opin Genet Dev 2014;25:126–30.
[125] Fraser J, Williamson I, Bickmore WA, Dostie J. An overview of genome organization and how we got there: from FISH to Hi-C. Microbiol Mol Biol Rev 2015;79:347–72.
[126] Matheson TD, Kaufman PD. Grabbing the genome by the NADs. Chromosoma 2015.
[127] van Koningsbruggen S, Gierlinski M, Schofield P, Martin D, Barton GJ, Ariyurek Y, et al. High-resolution whole-genome sequencing reveals that specific chromatin domains from most human chromosomes associate with nucleoli. Mol Biol Cell 2010;21:3735–48.
[128] Peric-Hupkes D, Meuleman W, Pagie L, Bruggeman SW, Solovei I, Brugman W, et al. Molecular maps of the reorganization of genome-nuclear lamina interactions during differentiation. Mol Cell 2010;38:603–13.
[129] Griffith J, Hochschild A, Ptashne M. DNA loops induced by cooperative binding of γ repressor. Nature 1986;322:750–2.
[130] Giorgetti L, Galupa R, Nora EP, Piolot T, Lam F, Dekker J, et al. Predictive polymer modeling reveals coupled fluctuations in chromosome conformation and transcription. Cell 2014;157:950–63.
[131] Tolhuis B, Palstra RJ, Splinter E, Grosveld F, de Laat W. Looping and interaction between hypersensitive sites in the active beta-globin locus. Mol Cell 2002;10:1453–65.
[132] Jin F, Li Y, Dixon JR, Selvaraj S, Ye Z, Lee AY, et al. A highresolution map of the three-dimensional chromatin interactome in human cells. Nature 2013;503:290–4.
[133] Li G, Ruan X, Auerbach RK, Sandhu KS, Zheng M, Wang P, et al. Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation. Cell 2012;148:84–98.
[134] Sanyal A, Lajoie BR, Jain G, Dekker J. The long-range interaction landscape of gene promoters. Nature 2012;489:109–13.
[135] Zhang Y, Wong CH, Birnbaum RY, Li G, Favaro R, Ngan CY, et al. Chromatin connectivity maps reveal dynamic promoterenhancer long-range associations. Nature 2013;504:306–10.
[136] Ghavi-Helm Y, Klein FA, Pakozdi T, Ciglar L, Noordermeer D, Huber W, et al. Enhancer loops appear stable during development and are associated with paused polymerase. Nature 2014;512:96–100.
[137] Deng W, Lee J, Wang H, Miller J, Reik A, Gregory PD, et al. Controlling long-range genomic interactions at a native locus by targeted tethering of a looping factor. Cell 2012;149:1233–44.
[138] Deng W, Rupon JW, Krivega I, Breda L, Motta I, Jahn KS, et al. Reactivation of developmentally silenced globin genes by forced chromatin looping. Cell 2014;158:849–60.
[139] Dekker J, Misteli T. Long-range chromatin interactions. Cold Spring Harb Perspect Biol 2015;7:a019356.
[140] Lee JT. Epigenetic regulation by long noncoding RNAs. Science 2012;338:1435–9.
[141] Hacisuleyman E, Goff LA, Trapnell C, Williams A, Henao-Mejia J, Sun L, et al. Topological organization of multichromosomal regions by the long intergenic noncoding RNA Firre. Nat Struct Mol Biol 2014;21:198–206.
[142] Yang F, Deng X, Ma W, Berletch JB, Rabaia N, Wei G, et al. The lncRNA Firre anchors the inactive X chromosome to the nucleolus by binding CTCF and maintains H3K27me3 methylation. Genome Biol 2015;16:52.
[143] Wang KC, Yang YW, Liu B, Sanyal A, Corces-Zimmerman R, Chen Y, et al. A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression. Nature 2011;472:120–4.
[144] Phillips JE, Corces VG. CTCF: master weaver of the genome. Cell 2009;137:1194–211.
[145] Lobanenkov VV, Nicolas RH, Adler VV, Paterson H, Klenova EM, Polotskaja AV, et al. A novel sequence-specific DNA binding protein which interacts with three regularly spaced direct repeats of the CCCTC-motif in the 5'-flanking sequence of the chicken c-myc gene. Oncogene 1990;5:1743–53.
[146] Bell AC, West AG, Felsenfeld G. The protein CTCF is required for the enhancer blocking activity of vertebrate insulators. Cell 1999;98:387–96.
[147] Wood AJ, Severson AF, Meyer BJ. Condensin and cohesin complexity: the expanding repertoire of functions. Nat Rev Genet 2010;11:391–404.
[148] Dowen JM, Fan ZP, Hnisz D, Ren G, Abraham BJ, Zhang LN, et al. Control of cell identity genes occurs in insulated neighborhoods in mammalian chromosomes. Cell 2014;159:374–87.
[149] Handoko L, Xu H, Li G, Ngan CY, Chew E, Schnapp M, et al. CTCF-mediated functional chromatin interactome in pluripotent cells. Nat Genet 2011;43:630–8.
[150] Ji X, Dadon DB, Powell BE, Fan ZP, Borges-Rivera D, Shachar S, et al. 3D chromosome regulatory landscape of human pluripotent cells. Cell Stem Cell 2015.
[151] Kagey MH, Newman JJ, Bilodeau S, Zhan Y, Orlando DA, van Berkum NL, et al. Mediator and cohesin connect gene expression and chromatin architecture. Nature 2010;467:430–5.
[152] de Wit E, Vos ES, Holwerda SJ, Valdes-Quezada C, Verstegen MJ, Teunissen H, et al. CTCF binding polarity determines chromatin looping. Mol Cell 2015;60:676–84.
[153] Alipour E, Marko JF. Self-organization of domain structures by DNA-loop-extruding enzymes. Nucleic Acids Res 2012;40:11202–12.
[154] Zuin J, Dixon JR, van der Reijden MIJA, Ye Z, Kolovos P, Brouwer RWW, et al. Cohesin and CTCF differentially affect chromatin architecture and gene expression in human cells. Proc Natl Acad Sci U S A 2014;111:996–1001.
[155] Shalem O, Sanjana NE, Hartenian E, Shi X, Scott DA, Mikkelsen TS, et al. Genome-scale CRISPR-Cas9 knockout screening in human cells. Science 2014;343:84–7.
[156] Wang T, Wei JJ, Sabatini DM, Lander ES. Genetic screens in human cells using the CRISPR-Cas9 system. Science 2014;343:80–4.
[157] Wang T, Birsoy K, Hughes NW, Krupczak KM, Post Y, Wei JJ, et al. Identification and characterization of essential genes in the human genome. Science 2015;350:1096–101.
[158] Findlay GM, Boyle EA, Hause RJ, Klein JC, Shendure J. Saturation editing of genomic regions by multiplex homologydirected repair. Nature 2014;513:120–3.
[159] Vierstra J, Reik A, Chang K-H, Stehling-Sun S, Zhou Y, Hinkley SJ, et al. Functional footprinting of regulatory DNA. Nat Methods 2015;12:927–30.
[160] Canver MC, Smith EC, Sher F, Pinello L, Sanjana NE, Shalem O, et al. BCL11A enhancer dissection by Cas9-mediated in situ saturating mutagenesis. Nature 2015;527:192–7.
[161] Schwartzman O, Tanay A. Single-cell epigenomics: techniques and emerging applications. Nat Rev Genet 2015;16:716–26.
[162] Cusanovich DA, Daza R, Adey A, Pliner HA, Christiansen L, Gunderson KL, et al. Epigenetics. Multiplex single-cell profiling of chromatin accessibility by combinatorial cellular indexing. Science 2015;348:910–4.
[163] Lubeck E, Cai L. Single-cell systems biology by super-resolution imaging and combinatorial labeling. Nat Methods 2012;9:743–8.
[164] Lubeck E, Coskun AF, Zhiyentayev T, Ahmad M, Cai L. Singlecell in situ RNA profiling by sequential hybridization. Nat Methods 2014;11:360–1.
[165] Lee JH, Daugharthy ER, Scheiman J, Kalhor R, Yang JL, Ferrante TC, et al. Highly multiplexed subcellular RNA sequencing in situ. Science 2014;343:1360–3.
[166] Chen KH, Boettiger AN, Moffitt JR, Wang S, Zhuang X. RNA imaging. Spatially resolved, highly multiplexed RNA profiling in single cells. Science 2015;348, aaa6090-0.
[167] Trapnell C. Defining cell types and states with single-cell genomics. Genome Res 2015;25:1491–8.
REVIEW
*Corresponding author.
杂志排行
Genomics,Proteomics & Bioinformatics的其它文章
- Roles, Functions, and Mechanisms of Long Non-coding RNAs in Cancer
- Single-cell Transcriptome Study as Big Data
- Translational Bioinformatics: Past, Present, and Future
- A Bipartite Network-based Method for Prediction of Long Non-coding RNA–protein Interactions
- EDISON-WMW: Exact Dynamic Programing Solution of the Wilcoxon–Mann–Whitney Test
- Call for Papers Special Issue on‘‘Biomarkers for Human Diseases and Translational Medicine’’