153 research outputs found

    A Model-Based Analysis of GC-Biased Gene Conversion in the Human and Chimpanzee Genomes

    Get PDF
    GC-biased gene conversion (gBGC) is a recombination-associated process that favors the fixation of G/C alleles over A/T alleles. In mammals, gBGC is hypothesized to contribute to variation in GC content, rapidly evolving sequences, and the fixation of deleterious mutations, but its prevalence and general functional consequences remain poorly understood. gBGC is difficult to incorporate into models of molecular evolution and so far has primarily been studied using summary statistics from genomic comparisons. Here, we introduce a new probabilistic model that captures the joint effects of natural selection and gBGC on nucleotide substitution patterns, while allowing for correlations along the genome in these effects. We implemented our model in a computer program, called phastBias, that can accurately detect gBGC tracts about 1 kilobase or longer in simulated sequence alignments. When applied to real primate genome sequences, phastBias predicts gBGC tracts that cover roughly 0.3% of the human and chimpanzee genomes and account for 1.2% of human-chimpanzee nucleotide differences. These tracts fall in clusters, particularly in subtelomeric regions; they are enriched for recombination hotspots and fast-evolving sequences; and they display an ongoing fixation preference for G and C alleles. They are also significantly enriched for disease-associated polymorphisms, suggesting that they contribute to the fixation of deleterious alleles. The gBGC tracts provide a unique window into historical recombination processes along the human and chimpanzee lineages. They supply additional evidence of long-term conservation of megabase-scale recombination rates accompanied by rapid turnover of hotspots. Together, these findings shed new light on the evolutionary, functional, and disease implications of gBGC. The phastBias program and our predicted tracts are freely available. © 2013 Capra et al

    Predicting cell types and genetic variations contributing to disease by combining GWAS and epigenetic data

    Get PDF
    Genome-wide association studies (GWASs) identify single nucleotide polymorphisms (SNPs) that are enriched in individuals suffering from a given disease. Most disease-associated SNPs fall into non-coding regions, so that it is not straightforward to infer phenotype or function; moreover, many SNPs are in tight genetic linkage, so that a SNP identified as associated with a particular disease may not itself be causal, but rather signify the presence of a linked SNP that is functionally relevant to disease pathogenesis. Here, we present an analysis method that takes advantage of the recent rapid accumulation of epigenomics data to address these problems for some SNPs. Using asthma as a prototypic example; we show that non-coding disease-associated SNPs are enriched in genomic regions that function as regulators of transcription, such as enhancers and promoters. Identifying enhancers based on the presence of the histone modification marks such as H3K4me1 in different cell types, we show that the location of enhancers is highly cell-type specific. We use these findings to predict which SNPs are likely to be directly contributing to disease based on their presence in regulatory regions, and in which cell types their effect is expected to be detectable. Moreover, we can also predict which cell types contribute to a disease based on overlap of the disease-associated SNPs with the locations of enhancers present in a given cell type. Finally, we suggest that it will be possible to re-analyze GWAS studies with much higher power by limiting the SNPs considered to those in coding or regulatory regions of cell types relevant to a given disease

    Implementation of the Time-to-Event Continuous Reassessment Method Design in a Phase I Platform Trial Testing Novel Radiotherapy-Drug Combinations-CONCORDE

    Get PDF
    \ua9 2022 by American Society of Clinical Oncology. PURPOSE CONCORDE is the first phase I drug-radiotherapy (RT) combination platform in non-small-cell lung cancer, designed to assess multiple different DNA damage response inhibitors in combination with radical thoracic RT. Time-to-event continuous reassessment method (TiTE-CRM) methodology will inform dose escalation individually for each different DNA damage response inhibitor-RT combination and a randomized calibration arm will aid attribution of toxicities. We report in detail the novel statistical design and implementation of the TiTE-CRM in the CONCORDE trial. METHODS Statistical parameters were calibrated following recommendations by Lee and Cheung. Simulations were performed to assess the operating characteristics of the chosen models and were written using modified code from the R package dfcrm. RESULTS The results of the simulation work showed that the proposed statistical model setup can answer the research questions under a wide range of potential scenarios. The proposed models work well under varying levels of recruitment and with multiple adaptations to the original methodology. CONCLUSION The results demonstrate how TiTE-CRM methodology may be used in practice in a complex dose-finding platform study. We propose that this novel phase I design has potential to overcome some of the logistical barriers that for many years have prevented timely development of novel drug-RT combinations

    Variation analysis and gene annotation of eight MHC haplotypes: The MHC Haplotype Project

    Get PDF
    The human major histocompatibility complex (MHC) is contained within about 4 Mb on the short arm of chromosome 6 and is recognised as the most variable region in the human genome. The primary aim of the MHC Haplotype Project was to provide a comprehensively annotated reference sequence of a single, human leukocyte antigen-homozygous MHC haplotype and to use it as a basis against which variations could be assessed from seven other similarly homozygous cell lines, representative of the most common MHC haplotypes in the European population. Comparison of the haplotype sequences, including four haplotypes not previously analysed, resulted in the identification of >44,000 variations, both substitutions and indels (insertions and deletions), which have been submitted to the dbSNP database. The gene annotation uncovered haplotype-specific differences and confirmed the presence of more than 300 loci, including over 160 protein-coding genes. Combined analysis of the variation and annotation datasets revealed 122 gene loci with coding substitutions of which 97 were non-synonymous. The haplotype (A3-B7-DR15; PGF cell line) designated as the new MHC reference sequence, has been incorporated into the human genome assembly (NCBI35 and subsequent builds), and constitutes the largest single-haplotype sequence of the human genome to date. The extensive variation and annotation data derived from the analysis of seven further haplotypes have been made publicly available and provide a framework and resource for future association studies of all MHC-associated diseases and transplant medicine

    Characterization of the past and current duplication activities in the human 22q11.2 region

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Segmental duplications (SDs) on 22q11.2 (LCR22), serve as substrates for meiotic non-allelic homologous recombination (NAHR) events resulting in several clinically significant genomic disorders.</p> <p>Results</p> <p>To understand the duplication activity leading to the complicated SD structure of this region, we have applied the A-Bruijn graph algorithm to decompose the 22q11.2 SDs to 523 fundamental duplication sequences, termed subunits. Cross-species syntenic analysis of primate genomes demonstrates that many of these LCR22 subunits emerged very recently, especially those implicated in human genomic disorders. Some subunits have expanded more actively than others, and young <it>Alu </it>SINEs, are associated much more frequently with duplicated sequences that have undergone active expansion, confirming their role in mediating recombination events. Many copy number variations (CNVs) exist on 22q11.2, some flanked by SDs. Interestingly, two chromosome breakpoints for 13 CNVs (mean length 65 kb) are located in paralogous subunits, providing direct evidence that SD subunits could contribute to CNV formation. Sequence analysis of PACs or BACs identified extra CNVs, specifically, 10 insertions and 18 deletions within 22q11.2; four were more than 10 kb in size and most contained young <it>AluY</it>s at their breakpoints.</p> <p>Conclusions</p> <p>Our study indicates that <it>AluY</it>s are implicated in the past and current duplication events, and moreover suggests that DNA rearrangements in 22q11.2 genomic disorders perhaps do not occur randomly but involve both actively expanded duplication subunits and <it>Alu </it>elements.</p

    Evidence for Transcript Networks Composed of Chimeric RNAs in Human Cells

    Get PDF
    The classic organization of a gene structure has followed the Jacob and Monod bacterial gene model proposed more than 50 years ago. Since then, empirical determinations of the complexity of the transcriptomes found in yeast to human has blurred the definition and physical boundaries of genes. Using multiple analysis approaches we have characterized individual gene boundaries mapping on human chromosomes 21 and 22. Analyses of the locations of the 5â€Č and 3â€Č transcriptional termini of 492 protein coding genes revealed that for 85% of these genes the boundaries extend beyond the current annotated termini, most often connecting with exons of transcripts from other well annotated genes. The biological and evolutionary importance of these chimeric transcripts is underscored by (1) the non-random interconnections of genes involved, (2) the greater phylogenetic depth of the genes involved in many chimeric interactions, (3) the coordination of the expression of connected genes and (4) the close in vivo and three dimensional proximity of the genomic regions being transcribed and contributing to parts of the chimeric RNAs. The non-random nature of the connection of the genes involved suggest that chimeric transcripts should not be studied in isolation, but together, as an RNA network

    The Tetraodon nigroviridis reference transcriptome: Developmental transition, length retention and microsynteny of long non-coding RNAs in a compact vertebrate genome

    Get PDF
    Pufferfish such as fugu and tetraodon carry the smallest genomes among all vertebrates and are ideal for studying genome evolution. However, comparative genomics using these species is hindered by the poor annotation of their genomes. We performed RNA sequencing during key stages of maternal to zygotic transition of Tetraodon nigroviridis and report its first developmental transcriptome. We assembled 61,033 transcripts (23,837 loci) representing 80% of the annotated gene models and 3816 novel coding transcripts from 2667 loci. We demonstrate the similarities of gene expression profiles between pufferfish and zebrafish during maternal to zygotic transition and annotated 1120 long non-coding RNAs (lncRNAs) many of which differentially expressed during development. The promoters for 60% of the assembled transcripts result validated by CAGE-seq. Despite the extreme compaction of the tetraodon genome and the dramatic loss of transposons, the length of lncRNA exons remain comparable to that of other vertebrates and a small set of lncRNAs appears enriched for transposable elements suggesting a selective pressure acting on lncRNAs length and composition. Finally, a set of lncRNAs are microsyntenic between teleost and vertebrates, which indicates potential regulatory interactions between lncRNAs and their flanking coding genes. Our work provides a fundamental molecular resource for vertebrate comparative genomics and embryogenesis studies

    Physical education undergraduate students’ perceptions of their learning using the jigsaw learning method

    Get PDF
    Recognising the limited research around the use of cooperative learning in higher education, this case study sought to explore physical education students’ perceptions of learning using the jigsaw learning method. It examined the impact of two different aesthetic activities and two different groupings on students’ perceptions of their learning. A purposive sample of 36 third-year undergraduates was selected for the study. Data were collected using focus group interviews and reflective journals. Inductive analysis illustrated students’ perceptions of their own and others’ abilities, students’ empathy towards their peers, and how their perceptions of gymnastics and dance impacted on their perceptions of learning. Students felt that heterogeneous and friendship groupings have the potential to encourage high-order social and cognitive learning. However, those students with limited psychomotor abilities appear to be better served in friendship groupings to facilitate such learning. Students also favoured the ‘structured’ nature of gymnastics in comparison to dance for their own teaching and learning purposes. Irrespective of aesthetic activity or grouping utilised, students felt their psychomotor learning was limited. It is recommended that university staff consider using a mixture of groupings with a single cohort dependent on the practical ability of students and the use of more ‘structured’ activities. In doing so, students’ perceptions of their social, cognitive and psychomotor learning may improve and thereby encourage greater and more effective use of this innovative method in schools

    Long non-coding RNA RAMS11 promotes metastatic colorectal cancer progression

    Get PDF
    Colorectal cancer (CRC) is the most common gastrointestinal malignancy in the U.S.A. and approximately 50% of patients develop metastatic disease (mCRC). Despite our understanding of long non-coding RNAs (lncRNAs) in primary colon cancer, their role in mCRC and treatment resistance remains poorly characterized. Therefore, through transcriptome sequencing of normal, primary, and distant mCRC tissues we find 148 differentially expressed RNAs Associated with Metastasis (RAMS). We prioritize RAMS11 due to its association with poor disease-free survival and promotion of aggressive phenotypes in vitro and in vivo. A FDA-approved drug high-throughput viability assay shows that elevated RAMS11 expression increases resistance to topoisomerase inhibitors. Subsequent experiments demonstrate RAMS11-dependent recruitment of Chromobox protein 4 (CBX4) transcriptionally activates Topoisomerase II alpha (TOP2α). Overall, recent clinical trials using topoisomerase inhibitors coupled with our findings of RAMS11-dependent regulation of TOP2α supports the potential use of RAMS11 as a biomarker and therapeutic target for mCRC
    • 

    corecore