Search CORE

39,981 research outputs found

A Penalized Multi-trait Mixed Model for Association Mapping in Pedigree-based GWAS

Author: Huang Jian
Li Cong
Liu Jin
Ma Shuangge
Shi Xingjie
Yang Can
Zhao Hongyu
Publication venue
Publication date: 19/05/2013
Field of study

In genome-wide association studies (GWAS), penalization is an important approach for identifying genetic markers associated with trait while mixed model is successful in accounting for a complicated dependence structure among samples. Therefore, penalized linear mixed model is a tool that combines the advantages of penalization approach and linear mixed model. In this study, a GWAS with multiple highly correlated traits is analyzed. For GWAS with multiple quantitative traits that are highly correlated, the analysis using traits marginally inevitably lose some essential information among multiple traits. We propose a penalized-MTMM, a penalized multivariate linear mixed model that allows both the within-trait and between-trait variance components simultaneously for multiple traits. The proposed penalized-MTMM estimates variance components using an AI-REML method and conducts variable selection and point estimation simultaneously using group MCP and sparse group MCP. Best linear unbiased predictor (BLUP) is used to find predictive values and the Pearson's correlations between predictive values and their corresponding observations are used to evaluate prediction performance. Both prediction and selection performance of the proposed approach and its comparison with the uni-trait penalized-LMM are evaluated through simulation studies. We apply the proposed approach to a GWAS data from Genetic Analysis Workshop (GAW) 18

arXiv.org e-Print Archive

CiteSeerX

Principal variable selection to explain grain yield variation in winter wheat from features extracted from UAV imagery

Author: Baenziger P. Stephen
Belamkar Vikas
Bhatta Madhav
Garst Nicholas D.
Ge Yufeng
Howard Reka
Li Jiating
Li Jiating
Stoll Hannah
Veeranampalayam-Sivakumar Arun-Narenthiran
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2019
Field of study

Background: Automated phenotyping technologies are continually advancing the breeding process. However, collecting various secondary traits throughout the growing season and processing massive amounts of data still take great efforts and time. Selecting a minimum number of secondary traits that have the maximum predictive power has the potential to reduce phenotyping efforts. The objective of this study was to select principal features extracted from UAV imagery and critical growth stages that contributed the most in explaining winter wheat grain yield. Five dates of multispectral images and seven dates of RGB images were collected by a UAV system during the spring growing season in 2018. Two classes of features (variables), totaling to 172 variables, were extracted for each plot from the vegetation index and plant height maps, including pixel statistics and dynamic growth rates. A parametric algorithm, LASSO regression (the least angle and shrinkage selection operator), and a non-parametric algorithm, random forest, were applied for variable selection. The regression coefficients estimated by LASSO and the permutation importance scores provided by random forest were used to determine the ten most important variables influencing grain yield from each algorithm. Results: Both selection algorithms assigned the highest importance score to the variables related with plant height around the grain filling stage. Some vegetation indices related variables were also selected by the algorithms mainly at earlier to mid growth stages and during the senescence. Compared with the yield prediction using all 172 variables derived from measured phenotypes, using the selected variables performed comparable or even better. We also noticed that the prediction accuracy on the adapted NE lines (r = 0.58–0.81) was higher than the other lines (r = 0.21–0.59) included in this study with different genetic backgrounds. Conclusions: With the ultra-high resolution plot imagery obtained by the UAS-based phenotyping we are now able to derive more features, such as the variation of plant height or vegetation indices within a plot other than just an averaged number, that are potentially very useful for the breeding purpose. However, too many features or variables can be derived in this way. The promising results from this study suggests that the selected set from those variables can have comparable prediction accuracies on the grain yield prediction than the full set of them but possibly resulting in a better allocation of efforts and resources on phenotypic data collection and processing

Individual classification of ADHD patients by integrating multiscale neuroimaging markers and advanced pattern recognition techniques

Author: Jianfeng eFeng
Jianfeng eFeng
Jie eZhang
Wei eCheng
Wei eCheng
Xiaoxi eJi
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2012
Field of study

Accurate classification or prediction of the brain state across individual subject, i.e., healthy, or with brain disorders, is generally a more difficult task than merely finding group differences. The former must be approached with highly informative and sensitive biomarkers as well as effective pattern classification/feature selection approaches. In this paper, we propose a systematic methodology to discriminate attention deficit hyperactivity disorder (ADHD) patients from healthy controls on the individual level. Multiple neuroimaging markers that are proved to be sensitive features are identified, which include multiscale characteristics extracted from blood oxygenation level dependent (BOLD) signals, such as regional homogeneity (ReHo) and amplitude of low-frequency fluctuations. Functional connectivity derived from Pearson, partial, and spatial correlation is also utilized to reflect the abnormal patterns of functional integration, or, dysconnectivity syndromes in the brain. These neuroimaging markers are calculated on either voxel or regional level. Advanced feature selection approach is then designed, including a brain-wise association study (BWAS). Using identified features and proper feature integration, a support vector machine (SVM) classifier can achieve a cross-validated classification accuracy of 76.15% across individuals from a large dataset consisting of 141 healthy controls and 98 ADHD patients, with the sensitivity being 63.27% and the specificity being 85.11%. Our results show that the most discriminative features for classification are primarily associated with the frontal and cerebellar regions. The proposed methodology is expected to improve clinical diagnosis and evaluation of treatment for ADHD patient, and to have wider applications in diagnosis of general neuropsychiatric disorders

Directory of Open Access Journals

Frontiers - Publisher Connector

Warwick Research Archives Portal Repository

Is the Universe homogeneous?

Author: Maartens Roy
Publication venue: 'The Royal Society'
Publication date: 01/01/2011
Field of study

The standard model of cosmology is based on the existence of homogeneous surfaces as the background arena for structure formation. Homogeneity underpins both general relativistic and modified gravity models and is central to the way in which we interpret observations of the CMB and the galaxy distribution. However, homogeneity cannot be directly observed in the galaxy distribution or CMB, even with perfect observations, since we observe on the past lightcone and not on spatial surfaces. We can directly observe and test for isotropy, but to link this to homogeneity, we need to assume the Copernican Principle. First, we discuss the link between isotropic observations on the past lightcone and isotropic spacetime geometry: what observations do we need to be isotropic in order to deduce spacetime isotropy? Second, we discuss what we can say with the Copernican assumption. The most powerful result is based on the CMB: the vanishing of the dipole, quadrupole and octupole of the CMB is sufficient to impose homogeneity. Real observations lead to near-isotropy on large scales - does this lead to near-homogeneity? There are important partial results, and we discuss why this remains a difficult open question. Thus we are currently unable to prove homogeneity of the Universe on large-scales, even with the Copernican Principle. However we can use observations of the CMB, galaxies and clusters to test homogeneity itself.Comment: Based on an invited talk at a Theo Murphy Meeting "Testing general relativity with cosmology". Minor corrections, references updated. To appear Phil. Trans. R. Soc.

arXiv.org e-Print Archive

When Do People Trust Their Social Groups?

Author: Edmondson Amy C
Fukuyama Francis
Gambetta Diego
Gulati Ranjay
Holtz David
La Macchia Stephen T
Meyerson Debra
Putnam Robert D
Putnam Robert D
Resnick Paul
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 13/05/2019
Field of study

Trust facilitates cooperation and supports positive outcomes in social groups, including member satisfaction, information sharing, and task performance. Extensive prior research has examined individuals' general propensity to trust, as well as the factors that contribute to their trust in specific groups. Here, we build on past work to present a comprehensive framework for predicting trust in groups. By surveying 6,383 Facebook Groups users about their trust attitudes and examining aggregated behavioral and demographic data for these individuals, we show that (1) an individual's propensity to trust is associated with how they trust their groups, (2) smaller, closed, older, more exclusive, or more homogeneous groups are trusted more, and (3) a group's overall friendship-network structure and an individual's position within that structure can also predict trust. Last, we demonstrate how group trust predicts outcomes at both individual and group level such as the formation of new friendship ties.Comment: CHI 201

arXiv.org e-Print Archive