Search CORE

221 research outputs found

Genome-wide selection by mixed model ridge regression and extensions based on geostatistical models

Author: A Coster
ADR McQuarrie
CM Hurvich
D Gianola
D Ruppert
Hans-Peter Piepho
HP Piepho
HP Piepho
JAK Suykens
O Schabenberger
P Craven
R Bernardo
THE Meuwissen
Torben Schulz-Streeck
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

A comparison of random forests, boosting and support vector machines for genomic selection

Author: A Liaw
A Statnikov
CM Bishop
G Moser
Hans-Peter Piepho
HP Piepho
Joseph O Ogutu
L Breiman
THE Meuwissen
TJ Hastie
Torben Schulz-Streeck
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Genomic selection (GS) involves estimating breeding values using molecular markers spanning the entire genome. Accurate prediction of genomic breeding values (GEBVs) presents a central challenge to contemporary plant and animal breeders. The existence of a wide array of marker-based approaches for predicting breeding values makes it essential to evaluate and compare their relative predictive performances to identify approaches able to accurately predict breeding values. We evaluated the predictive accuracy of random forests (RF), stochastic gradient boosting (boosting) and support vector machines (SVMs) for predicting genomic breeding values using dense SNP markers and explored the utility of RF for ranking the predictive importance of markers for pre-screening markers or discovering chromosomal locations of QTLs

Crossref

Springer - Publisher Connector

PubMed Central

CGSpace

Breeding progress, variation, and correlation of grain and quality traits in winter rye hybrid and population varieties and national on-farm progress in Germany over 26 years

Author: Alexandra Huesken
Dirk Rentel
F Laidig
F Laidig
F Laidig
F Wehmann
Friedrich Laidig
Hans-Peter Piepho
HB Hansen
HFW Rattunde
HP Piepho
HP Piepho
J Kucerova
NW Simmonds
P Peltonen-Sainio
T Miedaner
Thomas Drobek
Uwe Meyer
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Codominant scoring of AFLP in association panels

Author: A Wong
AP Dempster
AX Deniau
B Garel
BG Lindsay
C Fraley
D Böhning
Fred A. van Eeuwijk
G Gort
G Gort
G McLachlan
Gerrit Gort
GJ McLachlan
HJ Eck van
HM Meudt
HP Piepho
HP Piepho
J Bezdek
J Chen
J Chen
J Cuesta-Albertos
JW Heath
Keygene Products BV
M Pérez-Enciso
M Vuylsteke
P Castiglioni
P Li
P McCullagh
P Vos
R Berloo van
R Berloo van
R Ihaka
RC Jansen
RC Jansen
RC Jansen
SM Reamon-Büttner
W Jank
Y Lo
Z Liu
ZD Feng
Publication venue: Springer-Verlag
Publication date: 01/01/2010
Field of study

A study on the codominant scoring of AFLP markers in association panels without prior knowledge on genotype probabilities is described. Bands are scored codominantly by fitting normal mixture models to band intensities, illustrating and optimizing existing methodology, which employs the EM-algorithm. We study features that improve the performance of the algorithm, and the unmixing in general, like parameter initialization, restrictions on parameters, data transformation, and outlier removal. Parameter restrictions include equal component variances, equal or nearly equal distances between component means, and mixing probabilities according to Hardy–Weinberg Equilibrium. Histogram visualization of band intensities with superimposed normal densities, and optional classification scores and other grouping information, assists further in the codominant scoring. We find empirical evidence favoring the square root transformation of the band intensity, as was found in segregating populations. Our approach provides posterior genotype probabilities for marker loci. These probabilities can form the basis for association mapping and are more useful than the standard scoring categories A, H, B, C, D. They can also be used to calculate predictors for additive and dominance effects. Diagnostics for data quality of AFLP markers are described: preference for three-component mixture model, good separation between component means, and lack of singletons for the component with highest mean. Software has been developed in R, containing the models for normal mixtures with facilitating features, and visualizations. The methods are applied to an association panel in tomato, comprising 1,175 polymorphic markers on 94 tomato hybrids, as part of a larger study within the Dutch Centre for BioSystems Genomics

Crossref

Springer - Publisher Connector

PubMed Central

Wageningen University & Research Publications

Reflexo da interação genótipo x ambiente sobre o melhoramento genético de feijão

Crossref

Characterisation and correction of signal fluctuations in successive acquisitions of microarray images

Author: AM Dudley
Annie Glatigny
C Romualdi
CS Brown
DS Skibbe
E Graudens
H Bengtsson
Hervé Delacroix
HP Piepho
JA Timlin
JL DeRisi
KF Kerr
L Shi
Lawrence Aggerbeck
M Schena
Marie-Hélène Mucchielli-Giorgi
MC Yang
MD Abramoff
MR Khondoker
Nicolas François
T Tang
Thomas Tang
TK Jenssen
TK Karakach
X Wang
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background There are many sources of variation in dual labelled microarray experiments, including data acquisition and image processing. The final interpretation of experiments strongly relies on the accuracy of the measurement of the signal intensity. For low intensity spots in particular, accurately estimating gene expression variations remains a challenge as signal measurement is, in this case, highly subject to fluctuations. Results To evaluate the fluctuations in the fluorescence intensities of spots, we used series of successive scans, at the same settings, of whole genome arrays. We measured the decrease in fluorescence and we evaluated the influence of different parameters (PMT gain, resolution and chemistry of the slide) on the signal variability, at the level of the array as a whole and by intensity interval. Moreover, we assessed the effect of averaging scans on the fluctuations. We found that the extent of photo-bleaching was low and we established that 1) the fluorescence fluctuation is linked to the resolution e.g. it depends on the number of pixels in the spot 2) the fluorescence fluctuation increases as the scanner voltage increases and, moreover, is higher for the red as opposed to the green fluorescence which can introduce bias in the analysis 3) the signal variability is linked to the intensity level, it is higher for low intensities 4) the heterogeneity of the spots and the variability of the signal and the intensity ratios decrease when two or three scans are averaged. Conclusion Protocols consisting of two scans, one at low and one at high PMT gains, or multiple scans (ten scans) can introduce bias or be difficult to implement. We found that averaging two, or at most three, acquisitions of microarrays scanned at moderate photomultiplier settings (PMT gain) is sufficient to significantly improve the accuracy (quality) of the data and particularly those for spots having low intensities and we propose this as a general approach. For averaging and precise image alignment at sub-pixel levels we have made a program freely available on our web-site <url>http://bioinfome.cgm.cnrs-gif.fr</url> to facilitate implementation of this approach.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Effects of scanning sensitivity and multiple scan algorithms on microarray data quality

Author: AM Dudley
Andrew Williams
BM Bolstad
C Romualdi
D Zhu
DS Skibbe
E Wit
EM Thomson
Errol M Thomson
H Lyng
HP Piepho
IV Yang
J García de la Nava
KF Kerr
L Gautier
LE Dodd
M Khondoker
MR Khondoker
R Development Core Team
R Gupta
R Gupta
S Draghici
V Sharov
WN Venables
Publication venue: BioMed Central
Publication date: 12/03/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Mixed model approaches for the identification of QTLs within a maize hybrid breeding program

Two outlines for mixed model based approaches to quantitative trait locus (QTL) mapping in existing maize hybrid selection programs are presented: a restricted maximum likelihood (REML) and a Bayesian Markov Chain Monte Carlo (MCMC) approach. The methods use the in-silico-mapping procedure developed by Parisseaux and Bernardo (2004) as a starting point. The original single-point approach is extended to a multi-point approach that facilitates interval mapping procedures. For computational and conceptual reasons, we partition the full set of relationships from founders to parents of hybrids into two types of relations by defining so-called intermediate founders. QTL effects are defined in terms of those intermediate founders. Marker based identity by descent relationships between intermediate founders define structuring matrices for the QTL effects that change along the genome. The dimension of the vector of QTL effects is reduced by the fact that there are fewer intermediate founders than parents. Furthermore, additional reduction in the number of QTL effects follows from the identification of founder groups by various algorithms. As a result, we obtain a powerful mixed model based statistical framework to identify QTLs in genetic backgrounds relevant to the elite germplasm of a commercial breeding program. The identification of such QTLs will provide the foundation for effective marker assisted and genome wide selection strategies. Analyses of an example data set show that QTLs are primarily identified in different heterotic groups and point to complementation of additive QTL effects as an important factor in hybrid performance

Crossref

Springer - Publisher Connector

PubMed Central

University of Queensland eSpace

Gene and QTL detection in a three-way barley cross under selection by a mixed model with kinship information using SNPs

Author: A Borner
A Cuesta-Marcos
Ana M. Casas
B Stich
C Zhu
CJ Jiang
CS Haley
D Laurie
DA Laurie
E Potokina
ES Lander
F Eeuwijk van
Fred A. van Eeuwijk
H Li
H Piepho
H-P Piepho
HH Zhao
HP Piepho
I Mackay
J Yu
J Zitzewitz
JD Franckowiak
José-Luis Molina-Cano
K Mathews
K Sato
K Zhao
LC Emebiri
Luke Ramsay
M Lynch
M Malosetti
M Malosetti
M Malosetti
M Moralejo
M Sameri
M-J Paulo
Marcos Malosetti
Marian Moralejo
Martin P. Boer
MD McMullen
MP Boer
Mónica Elía
O Chloupek
Prasanna R. Bhat
Q Jia
R Aghnoum
R Takahashi
R Thompson
R Waugh
R Wu
RC Jansen
RW Payne
S Crepieux
S Xu
T Close
W Powell
WTB Thomas
ZB Zeng
Publication venue: Springer-Verlag
Publication date: 01/01/2011
Field of study

Quantitative trait locus (QTL) detection is commonly performed by analysis of designed segregating populations derived from two inbred parental lines, where absence of selection, mutation and genetic drift is assumed. Even for designed populations, selection cannot always be avoided, with as consequence varying correlation between genotypes instead of uniform correlation. Akin to linkage disequilibrium mapping, ignoring this type of genetic relatedness will increase the rate of false-positives. In this paper, we advocate using mixed models including genetic relatedness, or ‘kinship’ information for QTL detection in populations where selection forces operated. We demonstrate our case with a three-way barley cross, designed to segregate for dwarfing, vernalization and spike morphology genes, in which selection occurred. The population of 161 inbred lines was screened with 1,536 single nucleotide polymorphisms (SNPs), and used for gene and QTL detection. The coefficient of coancestry matrix was estimated based on the SNPs and imposed to structure the distribution of random genotypic effects. The model incorporating kinship, coancestry, information was consistently superior to the one without kinship (according to the Akaike information criterion). We show, for three traits, that ignoring the coancestry information results in an unrealistically high number of marker–trait associations, without providing clear conclusions about QTL locations. We used a number of widely recognized dwarfing and vernalization genes known to segregate in the studied population as landmarks or references to assess the agreement of the mapping results with a priori candidate gene expectations. Additional QTLs to the major genes were detected for all traits as well

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

PubMed Central

Wageningen University & Research Publications

Repositori Obert UdL

Digital.CSIC

Extension of the bayesian alphabet for genomic selection

Author: B Hayes
BJ Hayes
CR Henderson
CR Henderson
D Garrick
D Gianola
D Habier
D Habier
D Habier
D Sorensen
David Habier
Dorian J Garrick
EL Heffner
HD Daetwyler
HP Piepho
JL Jannink
Kadir Kizilkaya
LA García-Cortés
PM VanRaden
PM VanRaden
RL Fernando
RL Fernando
Rohan L Fernando
S Karlin
S Zhong
SJ Godsill
T Ohta
THE Meuwissen
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Two Bayesian methods, BayesC<it>π </it>and BayesD<it>π</it>, were developed for genomic prediction to address the drawback of BayesA and BayesB regarding the impact of prior hyperparameters and treat the prior probability <it>π </it>that a SNP has zero effect as unknown. The methods were compared in terms of inference of the number of QTL and accuracy of genomic estimated breeding values (GEBVs), using simulated scenarios and real data from North American Holstein bulls. Results Estimates of <it>π </it>from BayesC<it>π</it>, in contrast to BayesD<it>π</it>, were sensitive to the number of simulated QTL and training data size, and provide information about genetic architecture. Milk yield and fat yield have QTL with larger effects than protein yield and somatic cell score. The drawback of BayesA and BayesB did not impair the accuracy of GEBVs. Accuracies of alternative Bayesian methods were similar. BayesA was a good choice for GEBV with the real data. Computing time was shorter for BayesC<it>π </it>than for BayesD<it>π</it>, and longest for our implementation of BayesA. Conclusions Collectively, accounting for computing effort, uncertainty as to the number of QTL (which affects the GEBV accuracy of alternative methods), and fundamental interest in the number of QTL underlying quantitative traits, we believe that BayesC<it>π </it>has merit for routine applications.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central