1,311 research outputs found

    Rating of electrical wires in vacuum environments

    Get PDF
    Electric conductors used in vacuum environments have smaller cross sections. This report provides data on the correct size wire for a required current load in free-air, low-pressure oxygen, and vacuum environments

    Cleaning Genotype Data from Diversity Outbred Mice.

    Get PDF
    Data cleaning is an important first step in most statistical analyses, including efforts to map the genetic loci that contribute to variation in quantitative traits. Here we illustrate approaches to quality control and cleaning of array-based genotyping data for multiparent populations (experimental crosses derived from more than two founder strains), using MegaMUGA array data from a set of 291 Diversity Outbred (DO) mice. Our approach employs data visualizations that can reveal problems at the level of individual mice or with individual SNP markers. We find that the proportion of missing genotypes for each mouse is an effective indicator of sample quality. We use microarray probe intensities for SNPs on the X and Y chromosomes to confirm the sex of each mouse, and we use the proportion of matching SNP genotypes between pairs of mice to detect sample duplicates. We use a hidden Markov model (HMM) reconstruction of the founder haplotype mosaic across each mouse genome to estimate the number of crossovers and to identify potential genotyping errors. To evaluate marker quality, we find that missing data and genotyping error rates are the most effective diagnostics. We also examine the SNP genotype frequencies with markers grouped according to their minor allele frequency in the founder strains. For markers with high apparent error rates, a scatterplot of the allele-specific probe intensities can reveal the underlying cause of incorrect genotype calls. The decision to include or exclude low-quality samples can have a significant impact on the mapping results for a given study. We find that the impact of low-quality markers on a given study is often minimal, but reporting problematic markers can improve the utility of the genotyping array across many studies

    Dynamical frustration in ANNNI model and annealing

    Full text link
    Zero temperature quench in the Axial Next Nearest Neighbour Ising (ANNNI) model fails to bring it to its ground state for a certain range of values of the frustration parameter κ\kappa, the ratio of the next nearest neighbour antiferromagnetic interaction strength to the nearest neighbour one. We apply several annealing methods, both classical and quantum, and observe that the behaviour of the residual energy and the order parameter depends on the value of κ\kappa strongly. Classical or thermal annealing is found to be adequate for small values of κ\kappa. However, neither classical nor quantum annealing is effective at values of κ\kappa close to the fully frustrated point κ=0.5\kappa=0.5, where the residual energy shows a very slow algebraic decay with the number of MCS.Comment: 6 pages,10 figures, to be published in Proceedings of " The International Workshop on Quantum annealing and other Optimization Methods

    Prediction performance of linear models and gradient boosting machine on complex phenotypes in outbred mice.

    Get PDF
    We compared the performance of linear (GBLUP, BayesB, and elastic net) methods to a nonparametric tree-based ensemble (gradient boosting machine) method for genomic prediction of complex traits in mice. The dataset used contained genotypes for 50,112 SNP markers and phenotypes for 835 animals from 6 generations. Traits analyzed were bone mineral density, body weight at 10, 15, and 20 weeks, fat percentage, circulating cholesterol, glucose, insulin, triglycerides, and urine creatinine. The youngest generation was used as a validation subset, and predictions were based on all older generations. Model performance was evaluated by comparing predictions for animals in the validation subset against their adjusted phenotypes. Linear models outperformed gradient boosting machine for 7 out of 10 traits. For bone mineral density, cholesterol, and glucose, the gradient boosting machine model showed better prediction accuracy and lower relative root mean squared error than the linear models. Interestingly, for these 3 traits, there is evidence of a relevant portion of phenotypic variance being explained by epistatic effects. Using a subset of top markers selected from a gradient boosting machine model helped for some of the traits to improve the accuracy of prediction when these were fitted into linear and gradient boosting machine models. Our results indicate that gradient boosting machine is more strongly affected by data size and decreased connectedness between reference and validation sets than the linear models. Although the linear models outperformed gradient boosting machine for the polygenic traits, our results suggest that gradient boosting machine is a competitive method to predict complex traits with assumed epistatic effects

    Adding gene transcripts into genomic prediction improves accuracy and reveals sampling time dependence.

    Get PDF
    Recent developments allowed generating multiple high-quality \u27omics\u27 data that could increase the predictive performance of genomic prediction for phenotypes and genetic merit in animals and plants. Here, we have assessed the performance of parametric and nonparametric models that leverage transcriptomics in genomic prediction for 13 complex traits recorded in 478 animals from an outbred mouse population. Parametric models were implemented using the best linear unbiased prediction, while nonparametric models were implemented using the gradient boosting machine algorithm. We also propose a new model named GTCBLUP that aims to remove between-omics-layer covariance from predictors, whereas its counterpart GTBLUP does not do that. While gradient boosting machine models captured more phenotypic variation, their predictive performance did not exceed the best linear unbiased prediction models for most traits. Models leveraging gene transcripts captured higher proportions of the phenotypic variance for almost all traits when these were measured closer to the moment of measuring gene transcripts in the liver. In most cases, the combination of layers was not able to outperform the best single-omics models to predict phenotypes. Using only gene transcripts, the gradient boosting machine model was able to outperform best linear unbiased prediction for most traits except body weight, but the same pattern was not observed when using both single nucleotide polymorphism genotypes and gene transcripts. Although the GTCBLUP model was not able to produce the most accurate phenotypic predictions, it showed the highest accuracies for breeding values for 9 out of 13 traits. We recommend using the GTBLUP model for prediction of phenotypes and using the GTCBLUP for prediction of breeding values

    The prevalence of systemic autoimmune rheumatic diseases in Canadian pediatric populations: administrative database estimates

    Get PDF
    CI 17.9, 29.2). SARDs were more common in females than in males across all provinces. There was a slightly higher prevalence among those living in urban compared to rural areas of Alberta (rate difference 14.4, 95 % CI 8.6, 20.1) and Saskatchewan (rate difference 13.8, 95 % CI 1.0, 26.6). Our results provide population-based prevalence estimates of pediatric SARDs in four Canadian provinces. Keywords Pediatric rheumatic diseases · Systemic autoimmune rheumatic diseases · Epidemiology · Disease prevalence Abstract To estimate systemic autoimmune rheumatic disease (SARD) prevalence using administrative data for pediatric populations in four Canadian provinces. Physician billing claims and inpatient hospitalizations from Alberta, Manitoba, Quebec, and Saskatchewan were used to define cases aged ≤18 years with a SARD diagnosis code in: one or more hospitalization, two or more physician visits within 2 years and at least 2 months apart, or one or more physician visit to a rheumatologist. Estimates ranged from 15.9/100,000 in Quebec [95 % confidence interval (95 % CI) 14.1, 18.0] to 23.0/100,000 in Manitoba (95 % Rheumatology INTERNATIONA

    A General Bayesian Approach to Analyzing Diallel Crosses of Inbred Strains

    Get PDF
    The classic diallel takes a set of parents and produces offspring from all possible mating pairs. Phenotype values among the offspring can then be related back to their respective parentage. When the parents are diploid, sexed, and inbred, the diallel can characterize aggregate effects of genetic background on a phenotype, revealing effects of strain dosage, heterosis, parent of origin, epistasis, and sex-specific versions thereof. However, its analysis is traditionally intricate, unforgiving of unplanned missing information, and highly sensitive to imbalance, making the diallel unapproachable to many geneticists. Nonetheless, imbalanced and incomplete diallels arise frequently, albeit unintentionally, as by-products of larger-scale experiments that collect F1 data, for example, pilot studies or multiparent breeding efforts such as the Collaborative Cross or the Arabidopsis MAGIC lines. We present a general Bayesian model for analyzing diallel data on dioecious diploid inbred strains that cleanly decomposes the observed patterns of variation into biologically intuitive components, simultaneously models and accommodates outliers, and provides shrinkage estimates of effects that automatically incorporate uncertainty due to imbalance, missing data, and small sample size. We further present a model selection procedure for weighing evidence for or against the inclusion of those components in a predictive model. We evaluate our method through simulation and apply it to incomplete diallel data on the founders and F1's of the Collaborative Cross, robustly characterizing the genetic architecture of 48 phenotypes
    • …
    corecore