172 research outputs found

    A Bare Molecular Cloud at z~0.45

    Get PDF
    Several neutral species (MgI, SiI, CaI, FeI) have been detected in a weak MgII absorption line system (W_r(2796)~0.15 Angstroms) at z~0.45 along the sightline toward HE0001-2340. These observations require extreme physical conditions, as noted in D'Odorico (2007). We place further constraints on the properties of this system by running a wide grid of photoionization models, determining that the absorbing cloud that produces the neutral absorption is extremely dense (~100-1000/cm^3), cold (<100 K), and has significant molecular content (~72-94%). Structures of this size and temperature have been detected in Milky Way CO surveys, and have been predicted in hydrodynamic simulations of turbulent gas. In order to explain the observed line profiles in all neutral and singly ionized chemical transitions, the lines must suffer from unresolved saturation and/or the absorber must partially cover the broad emission line region of the background quasar. In addition to this highly unusual cloud, three other ordinary weak MgII clouds (within densities of ~0.005/cm^3 and temperatures of ~10000K) lie within 500 km/s along the same sightline. We suggest that the "bare molecular cloud", which appears to reside outside of a galaxy disk, may have had in situ star formation and may evolve into an ordinary weak MgII absorbing cloud.Comment: 15 pages, 4 figures, 4 tables, ApJ accepte

    Adding gene transcripts into genomic prediction improves accuracy and reveals sampling time dependence.

    Get PDF
    Recent developments allowed generating multiple high-quality \u27omics\u27 data that could increase the predictive performance of genomic prediction for phenotypes and genetic merit in animals and plants. Here, we have assessed the performance of parametric and nonparametric models that leverage transcriptomics in genomic prediction for 13 complex traits recorded in 478 animals from an outbred mouse population. Parametric models were implemented using the best linear unbiased prediction, while nonparametric models were implemented using the gradient boosting machine algorithm. We also propose a new model named GTCBLUP that aims to remove between-omics-layer covariance from predictors, whereas its counterpart GTBLUP does not do that. While gradient boosting machine models captured more phenotypic variation, their predictive performance did not exceed the best linear unbiased prediction models for most traits. Models leveraging gene transcripts captured higher proportions of the phenotypic variance for almost all traits when these were measured closer to the moment of measuring gene transcripts in the liver. In most cases, the combination of layers was not able to outperform the best single-omics models to predict phenotypes. Using only gene transcripts, the gradient boosting machine model was able to outperform best linear unbiased prediction for most traits except body weight, but the same pattern was not observed when using both single nucleotide polymorphism genotypes and gene transcripts. Although the GTCBLUP model was not able to produce the most accurate phenotypic predictions, it showed the highest accuracies for breeding values for 9 out of 13 traits. We recommend using the GTBLUP model for prediction of phenotypes and using the GTCBLUP for prediction of breeding values

    Prediction performance of linear models and gradient boosting machine on complex phenotypes in outbred mice.

    Get PDF
    We compared the performance of linear (GBLUP, BayesB, and elastic net) methods to a nonparametric tree-based ensemble (gradient boosting machine) method for genomic prediction of complex traits in mice. The dataset used contained genotypes for 50,112 SNP markers and phenotypes for 835 animals from 6 generations. Traits analyzed were bone mineral density, body weight at 10, 15, and 20 weeks, fat percentage, circulating cholesterol, glucose, insulin, triglycerides, and urine creatinine. The youngest generation was used as a validation subset, and predictions were based on all older generations. Model performance was evaluated by comparing predictions for animals in the validation subset against their adjusted phenotypes. Linear models outperformed gradient boosting machine for 7 out of 10 traits. For bone mineral density, cholesterol, and glucose, the gradient boosting machine model showed better prediction accuracy and lower relative root mean squared error than the linear models. Interestingly, for these 3 traits, there is evidence of a relevant portion of phenotypic variance being explained by epistatic effects. Using a subset of top markers selected from a gradient boosting machine model helped for some of the traits to improve the accuracy of prediction when these were fitted into linear and gradient boosting machine models. Our results indicate that gradient boosting machine is more strongly affected by data size and decreased connectedness between reference and validation sets than the linear models. Although the linear models outperformed gradient boosting machine for the polygenic traits, our results suggest that gradient boosting machine is a competitive method to predict complex traits with assumed epistatic effects

    Genome-wide transcript and protein analysis highlights the role of protein homeostasis in the aging mouse heart.

    Get PDF
    Investigation of the molecular mechanisms of aging in the human heart is challenging because of confounding factors, such as diet and medications, as well as limited access to tissues from healthy aging individuals. The laboratory mouse provides an ideal model to study aging in healthy individuals in a controlled environment. However, previous mouse studies have examined only a narrow range of the genetic variation that shapes individual differences during aging. Here, we analyze transcriptome and proteome data from 185 genetically diverse male and female mice at ages 6, 12, and 18 mo to characterize molecular changes that occur in the aging heart. Transcripts and proteins reveal activation of pathways related to exocytosis and cellular transport with age, whereas processes involved in protein folding decrease with age. Additional changes are apparent only in the protein data including reduced fatty acid oxidation and increased autophagy. For proteins that form complexes, we see a decline in correlation between their component subunits with age, suggesting age-related loss of stoichiometry. The most affected complexes are themselves involved in protein homeostasis, which potentially contributes to a cycle of progressive breakdown in protein quality control with age. Our findings highlight the important role of post-transcriptional regulation in aging. In addition, we identify genetic loci that modulate age-related changes in protein homeostasis, suggesting that genetic variation can alter the molecular aging process

    A multi-tissue full lifespan epigenetic clock for mice.

    Get PDF
    Human DNA-methylation data have been used to develop highly accurate biomarkers of aging ( epigenetic clocks ). Recent studies demonstrate that similar epigenetic clocks for mice (Mus Musculus) can be slowed by gold standard anti-aging interventions such as calorie restriction and growth hormone receptor knock-outs. Using DNA methylation data from previous publications with data collected in house for a total 1189 samples spanning 193,651 CpG sites, we developed 4 novel epigenetic clocks by choosing different regression models (elastic net- versus ridge regression) and by considering different sets of CpGs (all CpGs vs highly conserved CpGs). We demonstrate that accurate age estimators can be built on the basis of highly conserved CpGs. However, the most accurate clock results from applying elastic net regression to all CpGs. While the anti-aging effect of calorie restriction could be detected with all types of epigenetic clocks, only ridge regression based clocks replicated the finding of slow epigenetic aging effects in dwarf mice. Overall, this study demonstrates that there are trade-offs when it comes to epigenetic clocks in mice. Highly accurate clocks might not be optimal for detecting the beneficial effects of anti-aging interventions

    Genetic dissection of the pluripotent proteome through multi-omics data integration.

    Get PDF
    Genetic background drives phenotypic variability in pluripotent stem cells (PSCs). Most studies to date have used transcript abundance as the primary molecular readout of cell state in PSCs. We performed a comprehensive proteogenomics analysis of 190 genetically diverse mouse embryonic stem cell (mESC) lines. The quantitative proteome is highly variable across lines, and we identified pluripotency-associated pathways that were differentially activated in the proteomics data that were not evident in transcriptome data from the same lines. Integration of protein abundance to transcript levels and chromatin accessibility revealed broad co-variation across molecular layers as well as shared and unique drivers of quantitative variation in pluripotency-associated pathways. Quantitative trait locus (QTL) mapping localized the drivers of these multi-omic signatures to genomic hotspots. This study reveals post-transcriptional mechanisms and genetic interactions that underlie quantitative variability in the pluripotent proteome and provides a regulatory map for mESCs that can provide a basis for future mechanistic studies

    Regulation of protein abundance in genetically diverse mouse populations.

    Get PDF
    Genetically diverse mouse populations are powerful tools for characterizing the regulation of the proteome and its relationship to whole-organism phenotypes. We used mass spectrometry to profile and quantify the abundance of 6,798 proteins in liver tissue from mice of both sexes across 58 Collaborative Cross (CC) inbred strains. We previously collected liver proteomics data from the related Diversity Outbred (DO) mice and their founder strains. We show concordance across the proteomics datasets despite being generated from separate experiments, allowing comparative analysis. We map protein abundance quantitative trait loci (pQTLs), identifying 1,087 local and 285 distal in the CC mice and 1,706 local and 414 distal in the DO mice. We find that regulatory effects on individual proteins are conserved across the mouse populations, in particular for local genetic variation and sex differences. In comparison, proteins that form complexes are often co-regulated, displaying varying genetic architectures, and overall show lower heritability and map fewer pQTLs. We have made this resource publicly available to enable quantitative analyses of the regulation of the proteome

    Genetic Analysis of Hematological Parameters in Incipient Lines of the Collaborative Cross

    Get PDF
    Hematological parameters, including red and white blood cell counts and hemoglobin concentration, are widely used clinical indicators of health and disease. These traits are tightly regulated in healthy individuals and are under genetic control. Mutations in key genes that affect hematological parameters have important phenotypic consequences, including multiple variants that affect susceptibility to malarial disease. However, most variation in hematological traits is continuous and is presumably influenced by multiple loci and variants with small phenotypic effects. We used a newly developed mouse resource population, the Collaborative Cross (CC), to identify genetic determinants of hematological parameters. We surveyed the eight founder strains of the CC and performed a mapping study using 131 incipient lines of the CC. Genome scans identified quantitative trait loci for several hematological parameters, including mean red cell volume (Chr 7 and Chr 14), white blood cell count (Chr 18), percent neutrophils/lymphocytes (Chr 11), and monocyte number (Chr 1). We used evolutionary principles and unique bioinformatics resources to reduce the size of candidate intervals and to view functional variation in the context of phylogeny. Many quantitative trait loci regions could be narrowed sufficiently to identify a small number of promising candidate genes. This approach not only expands our knowledge about hematological traits but also demonstrates the unique ability of the CC to elucidate the genetic architecture of complex traits
    corecore