651 research outputs found
Genetic prediction of complex traits: integrating infinitesimal and marked genetic effects
Genetic prediction for complex traits is usually based on models including individual (infinitesimal) or marker effects. Here, we concentrate on models including both the individual and the marker effects. In particular, we develop a ''Mendelian segregation'' model combining infinitesimal effects for base individuals and realized Mendelian sampling in descendants described by the available DNA data. The model is illustrated with an example and the analyses of a public simulated data file. Further, the potential contribution of such models is assessed by simulation. Accuracy, measured as the correlation between true (simulated) and predicted genetic values, was similar for all models compared under different genetic backgrounds. As expected, the segregation model is worthwhile when markers capture a low fraction of total genetic variance. (Résumé d'auteur
Supersymmetric Extension of GCA in 2d
We derive the infinite dimensional Supersymmetric Galilean Conformal Algebra
(SGCA) in the case of two spacetime dimensions by performing group contraction
on 2d superconformal algebra. We also obtain the representations of the
generators in terms of superspace coordinates. Here we find realisations of the
SGCA by considering scaling limits of certain 2d SCFTs which are non-unitary
and have their left and right central charges become large in magnitude and
opposite in sign. We focus on the Neveu-Schwarz sector of the parent SCFTs and
develop, in parallel to the GCA studies recently in (arXiv:0912.1090), the
representation theory based on SGCA primaries, Ward identities for their
correlation functions and their descendants which are null states.Comment: La TeX file, 32 pages; v2: typos corrected, journal versio
Accuracy of breeding values of 'unrelated' individuals predicted by dense SNP genotyping
<p>Abstract</p> <p>Background</p> <p>Recent developments in SNP discovery and high throughput genotyping technology have made the use of high-density SNP markers to predict breeding values feasible. This involves estimation of the SNP effects in a training data set, and use of these estimates to evaluate the breeding values of other 'evaluation' individuals. Simulation studies have shown that these predictions of breeding values can be accurate, when training and evaluation individuals are (closely) related. However, many general applications of genomic selection require the prediction of breeding values of 'unrelated' individuals, i.e. individuals from the same population, but not particularly closely related to the training individuals.</p> <p>Methods</p> <p>Accuracy of selection was investigated by computer simulation of small populations. Using scaling arguments, the results were extended to different populations, training data sets and genome sizes, and different trait heritabilities.</p> <p>Results</p> <p>Prediction of breeding values of unrelated individuals required a substantially higher marker density and number of training records than when prediction individuals were offspring of training individuals. However, when the number of records was 2*N<sub>e</sub>*L and the number of markers was 10*N<sub>e</sub>*L, the breeding values of unrelated individuals could be predicted with accuracies of 0.88 â 0.93, where N<sub>e </sub>is the effective population size and L the genome size in Morgan. Reducing this requirement to 1*N<sub>e</sub>*L individuals, reduced prediction accuracies to 0.73â0.83.</p> <p>Conclusion</p> <p>For livestock populations, 1N<sub>e</sub>L requires about ~30,000 training records, but this may be reduced if training and evaluation animals are related. A prediction equation is presented, that predicts accuracy when training and evaluation individuals are related. For humans, 1N<sub>e</sub>L requires ~350,000 individuals, which means that human disease risk prediction is possible only for diseases that are determined by a limited number of genes. Otherwise, genotyping and phenotypic recording need to become very common in the future.</p
Advances in understanding of air-sea exchange and cycling of greenhouse gases in the upper ocean
\ua9 2024 University of California Press. All rights reserved. The airâsea exchange and oceanic cycling of greenhouse gases (GHG), including carbon dioxide (CO2), nitrous oxide (N2O), methane (CH4), carbon monoxide (CO), and nitrogen oxides (NOx \ubc NO \ufe NO2), are fundamental in controlling the evolution of the Earthâs atmospheric chemistry and climate. Significant advances have been made over the last 10 years in understanding, instrumentation and methods, as well as deciphering the production and consumption pathways of GHG in the upper ocean (including the surface and subsurface ocean down to approximately 1000 m). The global ocean under current conditions is now well established as a major sink for CO2, a major source for N2O and a minor source for both CH4 and CO. The importance of the ocean as a sink or source of NOx is largely unknown so far. There are still considerable uncertainties about the processes and their major drivers controlling the distributions of N2O, CH4, CO, and NOx in the upper ocean. Without having a fundamental understanding of oceanic GHG production and consumption pathways, our knowledge about the effects of ongoing major oceanic changesâwarming, acidification, deoxygenation, and eutrophicationâon the oceanic cycling and airâsea exchange of GHG remains rudimentary at best. We suggest that only through a comprehensive, coordinated, and interdisciplinary approach that includes data collection by global observation networks as well as joint process studies can the necessary data be generated to (1) identify the relevant microbial and phytoplankton communities, (2) quantify the rates of ocean GHG production and consumption pathways, (3) comprehend their major drivers, and (4) decipher economic and cultural implications of mitigation solutions
Accounting for genomic pre-selection in national BLUP evaluations in dairy cattle
<p>Abstract</p> <p>Background</p> <p>In future Best Linear Unbiased Prediction (BLUP) evaluations of dairy cattle, genomic selection of young sires will cause evaluation biases and loss of accuracy once the selected ones get progeny.</p> <p>Methods</p> <p>To avoid such bias in the estimation of breeding values, we propose to include information on all genotyped bulls, including the culled ones, in BLUP evaluations. Estimated breeding values based on genomic information were converted into genomic pseudo-performances and then analyzed simultaneously with actual performances. Using simulations based on actual data from the French Holstein population, bias and accuracy of BLUP evaluations were computed for young sires undergoing progeny testing or genomic pre-selection. For bulls pre-selected based on their genomic profile, three different types of information can be included in the BLUP evaluations: (1) data from pre-selected genotyped candidate bulls with actual performances on their daughters, (2) data from bulls with both actual and genomic pseudo-performances, or (3) data from all the genotyped candidates with genomic pseudo-performances. The effects of different levels of heritability, genomic pre-selection intensity and accuracy of genomic evaluation were considered.</p> <p>Results</p> <p>Including information from all the genotyped candidates, i.e. genomic pseudo-performances for both selected and culled candidates, removed bias from genetic evaluation and increased accuracy. This approach was effective regardless of the magnitude of the initial bias and as long as the accuracy of the genomic evaluations was sufficiently high.</p> <p>Conclusions</p> <p>The proposed method can be easily and quickly implemented in BLUP evaluations at the national level, although some improvement is necessary to more accurately propagate genomic information from genotyped to non-genotyped animals. In addition, it is a convenient method to combine direct genomic, phenotypic and pedigree-based information in a multiple-step procedure.</p
Using the realized relationship matrix to disentangle confounding factors for the estimation of genetic variance components of complex traits
Background: In the analysis of complex traits, genetic effects can be confounded with non-genetic effects, especially when using full-sib families. Dominance and epistatic effects are typically confounded with additive genetic and non-genetic effects. This confounding may cause the estimated genetic variance components to be inaccurate and biased
Using the Pareto principle in genome-wide breeding value estimation
Genome-wide breeding value (GWEBV) estimation methods can be classified based on the prior distribution assumptions of marker effects. Genome-wide BLUP methods assume a normal prior distribution for all markers with a constant variance, and are computationally fast. In Bayesian methods, more flexible prior distributions of SNP effects are applied that allow for very large SNP effects although most are small or even zero, but these prior distributions are often also computationally demanding as they rely on Monte Carlo Markov chain sampling. In this study, we adopted the Pareto principle to weight available marker loci, i.e., we consider that x% of the loci explain (100 - x)% of the total genetic variance. Assuming this principle, it is also possible to define the variances of the prior distribution of the 'big' and 'small' SNP. The relatively few large SNP explain a large proportion of the genetic variance and the majority of the SNP show small effects and explain a minor proportion of the genetic variance. We name this method MixP, where the prior distribution is a mixture of two normal distributions, i.e. one with a big variance and one with a small variance. Simulation results, using a real Norwegian Red cattle pedigree, show that MixP is at least as accurate as the other methods in all studied cases. This method also reduces the hyper-parameters of the prior distribution from 2 (proportion and variance of SNP with big effects) to 1 (proportion of SNP with big effects), assuming the overall genetic variance is known. The mixture of normal distribution prior made it possible to solve the equations iteratively, which greatly reduced computation loads by two orders of magnitude. In the era of marker density reaching million(s) and whole-genome sequence data, MixP provides a computationally feasible Bayesian method of analysis
Best Linear Unbiased Prediction of Genomic Breeding Values Using a Trait-Specific Marker-Derived Relationship Matrix
With the availability of high density whole-genome single nucleotide polymorphism chips, genomic selection has become a promising method to estimate genetic merit with potentially high accuracy for animal, plant and aquaculture species of economic importance. With markers covering the entire genome, genetic merit of genotyped individuals can be predicted directly within the framework of mixed model equations, by using a matrix of relationships among individuals that is derived from the markers. Here we extend that approach by deriving a marker-based relationship matrix specifically for the trait of interest
Maternal angiotensinogen (AGT) haplotypes, fetal renin (REN) haplotypes and risk of preeclampsia; estimation of gene-gene interaction from family-triad data
Background Preeclampsia is a debilitating disorder affecting approximately 3% of pregnant women in the Western world. Although inconclusive, current evidence suggests that the renin-angiotensin system may be involved in hypertension. Therefore, our objective was to determine whether the genes for placental renin (REN) and maternal angiotensinogen (AGT) interact to influence the risk of preeclampsia. Methods Three haplotype-tagging SNPs (htSNPs) covering REN (rs5705, rs1464818, and rs3795575) and another three covering AGT (rs2148582, rs2478545 and rs943580) were genotyped in 99 mother-father-child triads of preeclampsia pregnancies. We estimated relative risks (RR) conferred by maternal AGT and fetal REN haplotypes using HAPLIN, a statistical software designed to detect multi-marker transmission distortion among triads. To assess a combined effect of maternal AGT and fetal REN haplotypes, the preeclamptic triads were first stratified by presence/absence of maternal AGT haplotype C-T-A and tested for an effect of fetal REN across these strata. Results We found evidence that mothers carrying the most frequent AGT haplotype, C-T-A, had a reduced risk of preeclampsia (RR of 0.4, 95% CI = 0.2-0.8 for heterozygotes and 0.6, 95% CI = 0.2-1.5 for homozygotes). Mothers homozygous for AGT haplotypes t-c-g and C-c-g appeared to have a higher risk, but only the former was statistically significant. We found only weak evidence of an overall effect of fetal REN haplotypes and no support for our hypothesis that an effect of REN depended on whether the mother carried the C-T-A haplotype of AGT (p = 0.33). Conclusion Our findings indicate that the mother's AGT haplotypes affect her risk for developing preeclampsia. However, this risk is not influenced by fetal REN haplotypes.publishedVersio
Bootstrap, Bayesian probability and maximum likelihood mapping: exploring new tools for comparative genome analyses
BACKGROUND: Horizontal gene transfer (HGT) played an important role in shaping microbial genomes. In addition to genes under sporadic selection, HGT also affects housekeeping genes and those involved in information processing, even ribosomal RNA encoding genes. Here we describe tools that provide an assessment and graphic illustration of the mosaic nature of microbial genomes. RESULTS: We adapted the Maximum Likelihood (ML) mapping to the analyses of all detected quartets of orthologous genes found in four genomes. We have automated the assembly and analyses of these quartets of orthologs given the selection of four genomes. We compared the ML-mapping approach to more rigorous Bayesian probability and Bootstrap mapping techniques. The latter two approaches appear to be more conservative than the ML-mapping approach, but qualitatively all three approaches give equivalent results. All three tools were tested on mitochondrial genomes, which presumably were inherited as a single linkage group. CONCLUSIONS: In some instances of interphylum relationships we find nearly equal numbers of quartets strongly supporting the three possible topologies. In contrast, our analyses of genome quartets containing the cyanobacterium Synechocystis sp. indicate that a large part of the cyanobacterial genome is related to that of low GC Gram positives. Other groups that had been suggested as sister groups to the cyanobacteria contain many fewer genes that group with the Synechocystis orthologs. Interdomain comparisons of genome quartets containing the archaeon Halobacterium sp. revealed that Halobacterium sp. shares more genes with Bacteria that live in the same environment than with Bacteria that are more closely related based on rRNA phylogeny . Many of these genes encode proteins involved in substrate transport and metabolism and in information storage and processing. The performed analyses demonstrate that relationships among prokaryotes cannot be accurately depicted by or inferred from the tree-like evolution of a core of rarely transferred genes; rather prokaryotic genomes are mosaics in which different parts have different evolutionary histories. Probability mapping is a valuable tool to explore the mosaic nature of genomes
- âŠ