21 research outputs found

    Genome-wide profiling of forum domains in Drosophila melanogaster

    Get PDF
    Forum domains are stretches of chromosomal DNA that are excised from eukaryotic chromosomes during their spontaneous non-random fragmentation. Most forum domains are 50–200 kb in length. We mapped forum domain termini using FISH on polytene chromosomes and we performed genome-wide mapping using a Drosophila melanogaster genomic tiling microarray consisting of overlapping 3 kb fragments. We found that forum termini very often correspond to regions of intercalary heterochromatin and regions of late replication in polytene chromosomes. We found that forum domains contain clusters of several or many genes. The largest forum domains correspond to the main clusters of homeotic genes inside BX-C and ANTP-C, cluster of histone genes and clusters of piRNAs. PRE/TRE and transcription factor binding sites often reside inside domains and do not overlap with forum domain termini. We also found that about 20% of forum domain termini correspond to small chromosomal regions where Ago1, Ago2, small RNAs and repressive chromatin structures are detected. Our results indicate that forum domains correspond to big multi-gene chromosomal units, some of which could be coordinately expressed. The data on the global mapping of forum domains revealed a strong correlation between fragmentation sites in chromosomes, particular sets of mobile elements and regions of intercalary heterochromatin

    Genome-Wide Study of Colocalization between Genomic Stretches: A Method and Applications to the Regulation of Gene Expression

    No full text
    In this paper, we describe a method for the study of colocalization effects between stretch–stretch and stretch–point genome tracks based on a set of indices varying within the (–1, +1) interval. The indices combine the distances between the centers of neighboring stretches and their lengths. The extreme boundaries of the interval correspond to the complete colocalization of the genome tracks or its complete absence. We also obtained the relevant criteria of statistical significance for such indices using the complete permutation test. The method is robust with respect to strongly inhomogeneous positioning and length distribution of the genome tracks. On the basis of this approach, we created command-line software, the Genome Track Colocalization Analyzer. The software was tested, compared with other available packages, and applied to particular problems related to gene expression. The package, Genome Track Colocalization Analyzer (GTCA), is freely available to the users. GTCA complements our previous software, the Genome Track Analyzer, intended for the search for pairwise correlations between point-like genome tracks (also freely available). The corresponding details are provided in Data Availability Statement at the end of the text

    Structural coordinates: A novel approach to predict protein backbone conformation

    No full text
    Motivation Local protein structure is usually described via classifying each peptide to a unique class from a set of pre-defined structures. These classifications may differ in the number of structural classes, the length of peptides, or class attribution criteria. Most methods that predict the local structure of a protein from its sequence first rely on some classification and only then proceed to the 3D conformation assessment. However, most classification methods rely on homologous proteins' existence, unavoidably lose information by attributing a peptide to a single class or suffer from a suboptimal choice of the representative classes. Results To alleviate the above challenges, we propose a method that constructs a peptide's structural representation from the sequence, reflecting its similarity to several basic representative structures. For 5-mer peptides and 16 representative structures, we achieved the Q16 classification accuracy of 67.9%, which is higher than what is currently reported in the literature. Our prediction method does not utilize information about protein homologues but relies only on the amino acids' physicochemical properties and the resolved structures' statistics. We also show that the 3D coordinates of a peptide can be uniquely recovered from its structural coordinates, and show the required conditions under various geometric constraints

    DNA double-strand breaks coupled with PARP1 and HNRNPA2B1 binding sites flank coordinately expressed domains in human chromosomes.

    Get PDF
    Genome instability plays a key role in multiple biological processes and diseases, including cancer. Genome-wide mapping of DNA double-strand breaks (DSBs) is important for understanding both chromosomal architecture and specific chromosomal regions at DSBs. We developed a method for precise genome-wide mapping of blunt-ended DSBs in human chromosomes, and observed non-random fragmentation and DSB hot spots. These hot spots are scattered along chromosomes and delimit protected 50-250 kb DNA domains. We found that about 30% of the domains (denoted forum domains) possess coordinately expressed genes and that PARP1 and HNRNPA2B1 specifically bind DNA sequences at the forum domain termini. Thus, our data suggest a novel type of gene regulation: a coordinated transcription or silencing of gene clusters delimited by DSB hot spots as well as PARP1 and HNRNPa2B1 binding sites

    rDNA Clusters Make Contact with Genes that Are Involved in Differentiation and Cancer and Change Contacts after Heat Shock Treatment

    No full text
    Human rDNA clusters form numerous contacts with different chromosomal regions as evidenced by chromosome conformation capture data. Heterochromatization of rDNA genes leads to heterochromatization in different chromosomal regions coupled with the activation of the transcription of genes related to differentiation. These data suggest a role for rDNA clusters in the regulation of many human genes. However, the genes that reside within the rDNA-contacting regions have not been identified. The purpose of this study was to detect and characterize the regions where rDNA clusters make frequent contacts and to identify and categorize genes located in these regions. We analyzed the regions that contact rDNA using 4C data and show that these regions are enriched with genes specifying transcription factors and non-coding RNAs involved in differentiation and development. The rDNA-contacting genes are involved in neuronal development and are associated with different cancers. Heat shock treatment led to dramatic changes in the pattern of rDNA-contacting sites, especially in the regions possessing long stretches of H3K27ac marks. Whole-genome analysis of rDNA-contacting sites revealed specific epigenetic marks and the transcription sites of 20–100 nt non-coding RNAs in these regions. The rDNA-contacting genes jointly regulate many genes that are involved in the control of transcription by RNA polymerase II and the development of neurons. Our data suggest a role for rDNA clusters in the differentiation of human cells and carcinogenesis

    Dynamics of Whole-Genome Contacts of Nucleoli in Drosophila Cells Suggests a Role for rDNA Genes in Global Epigenetic Regulation

    No full text
    Chromosomes are organized into 3D structures that are important for the regulation of gene expression and differentiation. Important role in formation of inter-chromosome contacts play rDNA clusters that make up nucleoli. In the course of differentiation, heterochromatization of rDNA units in mouse cells is coupled with the repression or activation of different genes. Furthermore, the nucleoli of human cells shape the direct contacts with genes that are involved in differentiation and cancer. Here, we identified and categorized the genes located in the regions where rDNA clusters make frequent contacts. Using a 4C approach, we demonstrate that in Drosophila S2 cells, rDNA clusters form contacts with genes that are involved in chromosome organization and differentiation. Heat shock treatment induces changes in the contacts between nucleoli and hundreds of genes controlling morphogenesis. We show that nucleoli form contacts with regions that are enriched with active or repressive histone marks and where small non-coding RNAs are mapped. These data indicate that rDNA contacts are involved in the repression and activation of gene expression and that rDNA clusters orchestrate large groups of Drosophila genes involved in differentiation

    Induction of the Erythroid Differentiation of K562 Cells Is Coupled with Changes in the Inter-Chromosomal Contacts of rDNA Clusters

    No full text
    The expression of clusters of rDNA genes influences pluripotency; however, the underlying mechanisms are not yet known. These clusters shape inter-chromosomal contacts with numerous genes controlling differentiation in human and Drosophila cells. This suggests a possible role of these contacts in the formation of 3D chromosomal structures and the regulation of gene expression in development. However, it has not yet been demonstrated whether inter-chromosomal rDNA contacts are changed during differentiation. In this study, we used human leukemia K562 cells and induced their erythroid differentiation in order to study both the changes in rDNA contacts and the expression of genes. We observed that approximately 200 sets of rDNA-contacting genes are co-expressed in different combinations in both untreated and differentiated K562 cells. rDNA contacts are changed during differentiation and coupled with the upregulation of genes whose products are mainly located in the nucleus and are highly associated with DNA- and RNA-binding, along with the downregulation of genes whose products mainly reside in the cytoplasm or intra- or extracellular vesicles. The most downregulated gene is ID3, which is known as an inhibitor of differentiation, and thus should be switched off to allow for differentiation. Our data suggest that the differentiation of K562 cells leads to alterations in the inter-chromosomal contacts of rDNA clusters and 3D structures in particular chromosomal regions as well as to changes in the expression of genes located in the corresponding chromosomal domains. We conclude that approximately half of the rDNA-contacting genes are co-expressed in human cells and that rDNA clusters are involved in the global regulation of gene expression

    Six Highly Conserved Targets of RNAi Revealed in HIV-1-Infected Patients from Russia Are Also Present in Many HIV-1 Strains Worldwide

    No full text
    RNAi has been suggested for use in gene therapy of HIV/AIDS, but the main problem is that HIV-1 is highly variable and could escape attack from the small interfering RNAs (siRNAs) due to even single nucleotide substitutions in the potential targets. To exhaustively check the variability in selected RNA targets of HIV-1, we used ultra-deep sequencing of six regions of HIV-1 from the plasma of two independent cohorts of patients from Russia. Six RNAi targets were found that are invariable in 82%–97% of viruses in both cohorts and are located inside the domains specifying reverse transcriptase (RT), integrase, vpu, gp120, and p17. The analysis of mutation frequencies and their characteristics inside the targets suggests a likely role for APOBEC3G (apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 3G, A3G) in G-to-A mutations and a predominant effect of RT biases in the detected variability of the virus. The lowest frequency of mutations was detected in the central part of all six targets. We also discovered that the identical RNAi targets are present in many HIV-1 strains from many countries and from all continents. The data are important for both the understanding of the patterns of HIV-1 mutability and properties of RT and for the development of gene therapy approaches using RNAi for the treatment of HIV/AIDS. Keywords: HIV-1, RNAi targets, gene therapy, ultra-deep sequencing, conserved HIV-1 sequence

    Chromosomal Translocations in NK-Cell Lymphomas Originate from Inter-Chromosomal Contacts of Active rDNA Clusters Possessing Hot Spots of DSBs

    No full text
    Endogenous hot spots of DNA double-strand breaks (DSBs) are tightly linked with transcription patterns and cancer. There are nine hot spots of DSBs (denoted Pleiades) in human rDNA units that are located exclusively inside the intergenic spacer (IGS). Profiles of Pleiades coincide with the profiles of γ-H2AX, suggesting a high level of in vivo breakage inside rDNA genes. The data were confirmed by microscopic observation of the largest γ-H2AX foci inside nucleoli in interphase chromosomes. Circular chromosome conformation capture (4C) data indicate that the rDNA units often make contact with a specific set of chromosomal regions containing genes that are involved in differentiation and cancer. Interestingly, these regions also often possess hot spots of DSBs that provide the potential for Robertsonian and oncogenic translocations. In this study, we searched for translocations in which rDNA clusters are involved. The whole genome sequence (WGS) data of normal T cells and NK-cell lymphomas from the same individuals revealed numerous translocations in which Pleiades were involved. The sites of these translocations in normal T cells and in the lymphomas were mostly different, although there were also some common sites. The genes at translocations in normal cells and in lymphomas are associated with predominantly non-overlapping lists of genes that are depleted with silenced genes. Our data indicate that rDNA-mediated translocations occur at about the same frequency in the normal T cells and NK-lymphoma cells but differ at particular sites that correspond to open chromatin. We conclude that oncogenic translocations lead to dysregulation of a specific set of genes controlling development. In normal T cells and in NK cells, there are hot spots of translocations at sites possessing strong H3K27ac marks. The data indicate that Pleiades are involved in rDNA-mediated translocation
    corecore