67 research outputs found
Recommended from our members
DNA replication timing and higher-order nuclear organization determine single nucleotide substitution patterns in cancer genomes
Single nucleotide substitutions (SNS) are a defining characteristic of cancer genomes. Many SNS in cancer genomes arise due to errors in DNA replication, which is spatio-temporally stratified. Here we propose that DNA replication patterns help shape the mutational landscapes of normal and cancer genomes. Using data on five fully sequenced cancer types and two personal genomes, we determined that the frequency of intergenic SNS is significantly higher in late DNA replication timing regions, even after controlling for a number of genomic features. Furthermore, some substitution signatures are more frequent in certain DNA replication timing zones. Finally, integrating data on higher-order nuclear organization, we found that genomic regions in close spatial proximity to late replicating domains display similar mutation spectra as the late replicating regions themselves. These data suggest that DNA replication timing together with higher-order genomic organization contribute to the patterns of SNS in normal and cancer genomes
Histone Modifications Are Associated with Transcript Isoform Diversity in Normal and Cancer Cells
Mechanisms that generate transcript diversity are of fundamental importance in eukaryotes. Although a large fraction of human protein-coding genes and lincRNAs produce more than one mRNA isoform each, the regulation of this phenomenon is still incompletely understood. Much progress has been made in deciphering the role of sequence-specific features as well as DNA-and RNA-binding proteins in alternative splicing. Recently, however, several experimental studies of individual genes have revealed a direct involvement of epigenetic factors in alternative splicing and transcription initiation. While histone modifications are generally correlated with overall gene expression levels, it remains unclear how histone modification enrichment affects relative isoform abundance. Therefore, we sought to investigate the associations between histone modifications and transcript diversity levels measured by the rates of transcription start-site switching and alternative splicing on a genome-wide scale across protein-coding genes and lincRNAs. We found that the relationship between enrichment levels of epigenetic marks and transcription start-site switching is similar for protein-coding genes and lincRNAs. Furthermore, we found associations between splicing rates and enrichment levels of H2az, H3K4me1, H3K4me2, H3K4me3, H3K9ac, H3K9me3, H3K27ac, H3K27me3, H3K36me3, H3K79me2, and H4K20me, marks traditionally associated with enhancers, transcription initiation, transcriptional repression, and others. These patterns were observed in both normal and cancer cell lines. Additionally, we developed a novel computational method that identified 840 epigenetically regulated candidate genes and predicted transcription start-site switching and alternative exon splicing with up to 92% accuracy based on epigenetic patterning alone. Our results suggest that the epigenetic regulation of transcript isoform diversity may be a relatively common genome-wide phenomenon representing an avenue of deregulation in tumor development
Interaction preferences across protein-protein interfaces of obligatory and non-obligatory components are different
BACKGROUND: A polypeptide chain of a protein-protein complex is said to be obligatory if it is bound to another chain throughout its functional lifetime. Such a chain might not adopt the native fold in the unbound form. A non-obligatory polypeptide chain associates with another chain and dissociates upon molecular stimulus. Although conformational changes at the interaction interface are expected, the overall 3-D structure of the non-obligatory chain is unaltered. The present study focuses on protein-protein complexes to understand further the differences between obligatory and non-obligatory interfaces. RESULTS: A non-obligatory chain in a complex of known 3-D structure is recognized by its stable existence with same fold in the bound and unbound forms. On the contrary, an obligatory chain is detected by its existence only in the bound form with no evidence for the native-like fold of the chain in the unbound form. Various interfacial properties of a large number of complexes of known 3-D structures thus classified are comparatively analyzed with an aim to identify structural descriptors that distinguish these two types of interfaces. We report that the interaction patterns across the interfaces of obligatory and non-obligatory components are different and contacts made by obligatory chains are predominantly non-polar. The obligatory chains have a higher number of contacts per interface (20 ± 14 contacts per interface) than non-obligatory chains (13 ± 6 contacts per interface). The involvement of main chain atoms is higher in the case of obligatory chains (16.9 %) compared to non-obligatory chains (11.2 %). The β-sheet formation across the subunits is observed only among obligatory protein chains in the dataset. Apart from these, other features like residue preferences and interface area produce marginal differences and they may be considered collectively while distinguishing the two types of interfaces. CONCLUSION: These results can be useful in distinguishing the two types of interfaces observed in structures determined in large-scale in the structural genomics initiatives, especially for those multi-component protein assemblies for which the biochemical characterization is incomplete
Patterns of evolutionary constraints on genes in humans
<p>Abstract</p> <p>Background</p> <p>Different regions in a genome evolve at different rates depending on structural and functional constraints. Some genomic regions are highly conserved during metazoan evolution, while other regions may evolve rapidly, either in all species or in a lineage-specific manner. A strong or even moderate change in constraints in functional regions, for example in coding regions, can have significant evolutionary consequences.</p> <p>Results</p> <p>Here we discuss a novel framework, 'BaseDiver', to classify groups of genes in humans based on the patterns of evolutionary constraints on polymorphic positions in their coding regions. Comparing the nucleotide-level divergence among mammals with the extent of deviation from the ancestral base in the human lineage, we identify patterns of evolutionary pressure on nonsynonymous base-positions in groups of genes belonging to the same functional category. Focussing on groups of genes in functional categories, we find that transcription factors contain a significant excess of nonsynonymous base-positions that are conserved in other mammals but changed in human, while immunity related genes harbour mutations at base-positions that evolve rapidly in all mammals including humans due to strong preference for advantageous alleles. Genes involved in olfaction also evolve rapidly in all mammals, and in humans this appears to be due to weak negative selection.</p> <p>Conclusion</p> <p>While recent studies have identified genes under positive selection in humans, our approach identifies evolutionary constraints on Gene Ontology groups identifying changes in humans relative to some of the other mammals.</p
Functional protein divergence in the evolution of Homo sapiens.
BACKGROUND: Protein-coding regions in a genome evolve by sequence divergence and gene gain and loss, altering the gene content of the organism. However, it is not well understood how this has given rise to the enormous diversity of metazoa present today. RESULTS: To obtain a global view of human genomic evolution, we quantify the divergence of proteins by functional category at different evolutionary distances from human. CONCLUSION: This analysis highlights some general systems-level characteristics of human evolution: regulatory processes, such as signal transducers, transcription factors and receptors, have a high degree of plasticity, while core processes, such as metabolism, transport and protein synthesis, are largely conserved. Additionally, this study reveals a dynamic picture of selective forces at short, medium and long evolutionary timescales. Certain functional categories, such as 'development' and 'organogenesis', exhibit temporal patterns of sequence divergence in eukaryotes relative to human. This framework for a grammar of human evolution supports previously postulated theories of robustness and evolvability.RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are
Interfacing systems biology and synthetic biology
A report of BioSysBio 2009, the IET conference on Synthetic Biology, Systems Biology and Bioinformatics, Cambridge, UK, 23-25 March 2009
- …