155 research outputs found

    Using machine learning to detect the differential usage of novel gene isoforms

    BACKGROUND: Differential isoform usage is an important driver of inter-individual phenotypic diversity and is linked to various diseases and traits. However, accurately detecting the differential usage of different gene transcripts between groups can be difficult, in particular in less well annotated genomes where the spectrum of transcript isoforms is largely unknown. RESULTS: We investigated whether machine learning approaches can detect differential isoform usage based purely on the distribution of reads across a gene region. We illustrate that gradient boosting and elastic net approaches can successfully identify large numbers of genes showing potential differential isoform usage between Europeans and Africans, that are enriched among relevant biological pathways and significantly overlap those identified by previous approaches. We demonstrate that diversity at the 3′ and 5′ ends of genes is a primary driver of these differences between populations. CONCLUSION: Machine learning methods can effectively detect differential isoform usage from read fraction data, and can provide novel insights into the biological differences between groups. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04576-3

    Sequence level mechanisms of human epigenome evolution

    DNA methylation and chromatin states play key roles in development and disease. However, the extent of recent evolutionary divergence in the human epigenome and the influential factors that have shaped it are poorly understood. To determine the links between genome sequence and human epigenome evolution, we examined the divergence of DNA methylation and chromatin states following segmental duplication events in the human lineage. Chromatin and DNA methylation states were found to have been generally well conserved following a duplication event, with the evolution of the epigenome largely uncoupled from the total number of genetic changes in the surrounding DNA sequence. However, the epigenome at tissue-specific, distal regulatory regions was observed to be unusually prone to diverge following duplication, with particular sequence differences, altering known sequence motifs, found to be associated with divergence in patterns of DNA methylation and chromatin. Alu elements were found to have played a particularly prominent role in shaping human epigenome evolution, and we show that human-specific AluY insertion events are strongly linked to the evolution of the DNA methylation landscape and gene expression levels, including at key neurological genes in the human brain. Studying paralogous regions within the same sample enables the study of the links between genome and epigenome evolution while controlling for biological and technical variation. We show DNA methylation and chromatin divergence between duplicated regions are linked to the divergence of particular genetic motifs, with Alu elements having played a disproportionate role in the evolution of the epigenome in the human lineage

    Clinical evaluation of Corridor disease in Bos indicus (Boran) cattle naturally infected with buffalo-derived Theileria parva

    Corridor disease (CD) is a fatal condition of cattle caused by buffalo-derived Theileria parva. Unlike the related condition, East Coast fever, which results from infection with cattle-derived T. parva, CD has not been extensively studied. We describe in detail the clinical and laboratory findings in cattle naturally infected with buffalo-derived T. parva. Forty-six cattle were exposed to buffalo-derived T. parva under field conditions at the Ol Pejeta Conservancy, Kenya, between 2013 and 2018. The first signs of disease observed in all animals were nasal discharge (mean day of onset was 9 days post-exposure), enlarged lymph nodes (10 days post-exposure), and pyrexia (13.7 days post-exposure). Coughing and labored breathing were observed in more than 50% of animals (14 days post-exposure). Less commonly observed signs, corneal edema (22%) and diarrhea (11%), were observed later in the disease progression (19 days post-exposure). All infections were considered clinically severe, and 42 animals succumbed to infection. The mean time to death across all studies was 18.4 days. The mean time from onset of clinical signs to death was 9 days and from pyrexia to death was 4.8 days, indicating a relatively short duration of clinical illness. There were significant relationships between days to death and the days to first temperature (chi2 = 4.00, p = 0.046), and days to peak temperature (chi2 = 25.81, p = 0.001); animals with earlier-onset pyrexia died sooner. These clinical indicators may be useful for assessing the severity of disease in the future. All infections were confirmed by the presence of macroschizonts in lymph node biopsies (mean time to parasitosis was 11 days). Piroplasms were detected in the blood of two animals (4%) and 20 (43%) animals seroconverted. In this study, we demonstrate the successful approach to an experimental field study for CD in cattle. We also describe the clinical progression of CD in naturally infected cattle, including the onset and severity of clinical signs and pathology. Laboratory diagnoses based on examination of blood samples are unreliable, and alternatives may not be available to cattle keepers. The rapid development of CD requires recognition of the clinical signs, which may be useful for early diagnosis of the disease and effective intervention for affected animals.

    Inherited tolerance in cattle to the apicomplexan protozoan Theileria parva is associated with decreased proliferation of parasite-infected lymphocytes

    Theileria parva is the causative agent of East Coast fever and Corridor disease, which are fatal, economically important diseases of cattle in eastern, central and southern Africa. Improved methods of control of the diseases are urgently required. The parasite transforms host lymphocytes, resulting in a rapid, clonal expansion of infected cells. Resistance to the disease has long been reported in cattle from T. parva-endemic areas. We reveal here that first- and second-generation descendants of a single Bos indicus bull survived severe challenge with T. parva (overall survival rate 57.3% compared to 8.7% for unrelated animals) in a series of five field studies. Tolerant cattle displayed a delayed and less severe parasitosis and febrile response than unrelated animals. The in vitro proliferation of cells from surviving cattle was much reduced compared to those from animals that succumbed to infection. Additionally, some pro-inflammatory cytokines such as IL1β, IL6, TNFα or TGFβ, which are usually strongly expressed in susceptible animals and are known to regulate cell growth or motility, remain low in tolerant animals. This correlates with the reduced proliferation and less severe clinical reactions observed in tolerant cattle. The results show for the first time that the inherited tolerance to T. parva is associated with decreased proliferation of infected lymphocytes. The results are discussed in terms of whether the reduced proliferation is the result of a perturbation of the transformation mechanism induced in infected cells or is due to an innate immune response present in the tolerant cattle.

    Examining the Impact of Imputation Errors on Fine-Mapping Using DNA Methylation QTL as a Model Trait

    Genetic variants disrupting DNA methylation at CpG dinucleotides (CpG-SNP) provide a set of known causal variants to serve as models for testing fine-mapping methodology. We use 1716 CpG-SNPs to test three fine-mapping approaches (BIMBAM, BSLMM, and the J-test), assessing the impact of imputation errors and the choice of reference panel by using both whole-genome sequence (WGS) and genotype array data on the same individuals (n=1166). The choice of imputation reference panel had a strong effect on imputation accuracy, with the 1000 Genomes Phase 3 (1000G) reference panel (n=2504 from 26 populations) giving a mean non-reference discordance rate between imputed and sequenced genotypes of 3.2% compared to 1.6% when using the Haplotype Reference Consortium (HRC) reference panel (n=32470 Europeans). These imputation errors affected whether the CpG-SNP was included in the 95% credible set, with a difference of ∼ 23% and ∼ 7% between the WGS and the 1000G and HRC imputed datasets respectively. All of the fine-mapping methods failed to reach the expected 95% coverage of the CpG-SNP. This is attributed to secondary cis genetic effects that cannot be statistically separated from the CpG-SNP, and to a masking mechanism where the effect of the methylation-disrupting allele at the CpG-SNP is hidden by the effect of a nearby SNP that has strong LD with the CpG-SNP. The reduced accuracy in fine-mapping a known causal variant in a low-level biological trait with imputed genetic data has implications for the study of higher-order complex traits and disease.

    Endomicroscopic and transcriptomic analysis of impaired barrier function and malabsorption in environmental enteropathy

    Introduction: Environmental enteropathy (EE) is associated with growth failure, micronutrient malabsorption and impaired responses to oral vaccines. We set out to define cellular mechanisms of impaired barrier function in EE and explore protective mechanisms. Methods: We studied 49 adults with environmental enteropathy in Lusaka, Zambia using confocal laser endomicroscopy (CLE); histology, immunohistochemistry and mRNA sequencing of small intestinal biopsies; and correlated these with plasma lipopolysaccharide (LPS) and a zinc uptake test. Results: CLE images (median 134 for each study) showed virtually ubiquitous small intestinal damage. Epithelial defects, imaged by histology and claudin 4 immunostaining, were predominantly seen at the tips of villi and corresponded with leakage imaged in vivo by CLE. In multivariate analysis, circulating log-transformed LPS was correlated with cell shedding events (β = 0.83; P = 0.035) and with serum glucagon-like peptide-2 (GLP-2) (β = -0.13; P = 0.007). Zinc uptake from a test dose of 25 mg was attenuated in 30/47 (64%) individuals and in multivariate analysis was reduced by HIV, but positively correlated with GLP-2 (β = 2.72; P = 0.03). There was a U-shaped relationship between circulating LPS and villus surface area. Transcriptomic analysis identified 23 differentially expressed genes in severe enteropathy, including protective peptides and proteins. Conclusions: Confocal endomicroscopy, claudin 4 immunostaining and histology identify epithelial defects which are probably sites of bacterial translocation, in the presence of which increased epithelial surface area increases the burden of translocation. GLP-2 and other protective peptides may play an important role in mucosal protection in EE.

    Dynamics of Barred Galaxies

    Some 30% of disc galaxies have a pronounced central bar feature in the disc plane and many more have weaker features of a similar kind. Kinematic data indicate that the bar constitutes a major non-axisymmetric component of the mass distribution and that the bar pattern tumbles rapidly about the axis normal to the disc plane. The observed motions are consistent with material within the bar streaming along highly elongated orbits aligned with the rotating major axis. A barred galaxy may also contain a spheroidal bulge at its centre, spirals in the outer disc and, less commonly, other features such as a ring or lens. Mild asymmetries in both the light and kinematics are quite common. We review the main problems presented by these complicated dynamical systems and summarize the effort so far made towards their solution, emphasizing results which appear secure. (Truncated) Comment: This old review appeared in 1993. Plain tex with macro file. 82 pages, 18 figures. A pdf version with figures at full resolution (3.24MB) is available at http://www.physics.rutgers.edu/~sellwood/bar_review.pd

    A genome-wide screen in human embryonic stem cells reveals novel sites of allele-specific histone modification associated with known disease loci

    BACKGROUND: Chromatin structure at a given site can differ between chromosome copies in a cell, and such imbalances in chromatin structure have been shown to be important in understanding the molecular mechanisms controlling several disease loci. Human genetic variation, DNA methylation, and disease have been intensely studied, uncovering many sites of allele-specific DNA methylation (ASM). However, little is known about the genome-wide occurrence of sites of allele-specific histone modification (ASHM) and their relationship to human disease. The aim of this study was to investigate the extent and characteristics of sites of ASHM in human embryonic stem cells (hESCs). RESULTS: Using a statistically rigorous protocol, we investigated the genomic distribution of ASHM in hESCs, and their relationship to sites of allele-specific expression (ASE) and DNA methylation. We found that, although they were rare, sites of ASHM were substantially enriched at loci displaying ASE. Many were also found at known imprinted regions, hence sites of ASHM are likely to be better markers of imprinted regions than sites of ASM. We also found that sites of ASHM and ASE in hESCs colocalize at risk loci for developmental syndromes mediated by deletions, providing insights into the etiology of these disorders. CONCLUSION: These results demonstrate the potential importance of ASHM patterns in the interpretation of disease loci, and the protocol described provides a basis for similar studies of ASHM in other cell types to further our understanding of human disease susceptibility.

    The host ubiquitin-dependent segregase VCP/p97 is required for the onset of human cytomegalovirus replication

    The human cytomegalovirus (HCMV) major immediate early proteins IE1 and IE2 are critical drivers of virus replication and are considered pivotal in determining the balance between productive and latent infection. IE1 and IE2 are derived from the same primary transcript by alternative splicing and regulation of their expression likely involves a complex interplay between cellular and viral factors. Here we show that knockdown of the host ubiquitin-dependent segregase VCP/p97 results in loss of IE2 expression, subsequent suppression of early and late gene expression and, ultimately, failure in virus replication. RNAseq analysis showed increased levels of IE1 splicing, with a corresponding decrease in IE2 splicing following VCP knockdown. Global analysis of viral transcription showed the expression of a subset of viral genes is not reduced despite the loss of IE2 expression, including UL112/113. Furthermore, immunofluorescence studies demonstrated that VCP strongly colocalised with the viral replication compartments in the nucleus. Finally, we show that NMS-873, a small molecule inhibitor of VCP, is a potent HCMV antiviral with potential as a novel host-targeting therapeutic for HCMV infection.