Search CORE

207 research outputs found

Estimating conformational traits in dairy cattle with deepAPS : A two-step deep learning automated phenotyping and segmentation approach

Author: Nye Jessica
Perez-Enciso Miguel
Zingaretti Laura M.
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2020
Field of study

Assessing conformation features in an accurate and rapid manner remains a challenge in the dairy industry. While recent developments in computer vision has greatly improved automated background removal, these methods have not been fully translated to biological studies. Here, we present a composite method (DeepAPS) that combines two readily available algorithms in order to create a precise mask for an animal image. This method performs accurately when compared with manual classification of proportion of coat color with an adjusted R2 = 0.926. Using the output mask, we are able to automatically extract useful phenotypic information for 14 additional morphological features. Using pedigree and image information from a web catalog (www.semex.com), we estimated high heritabilities (ranging from h2 = 0.18-0.82), indicating that meaningful biological information has been extracted automatically from imaging data. This method can be applied to other datasets and requires only a minimal number of image annotations (50) to train this partially supervised machinelearning approach. DeepAPS allows for the rapid and accurate quantification of multiple phenotypic measurements while minimizing study cost. The pipeline is available at https://github.com/lauzingaretti/deepaps

Diposit Digital de Documents de la UAB

Digital.CSIC

The genetic variability of pigs is greater than was thought

Author: Folch Josep M
Ojeda Ana
Perez-Enciso Miguel
Rozas Julio
Publication venue
Publication date: 01/01/2007
Field of study

L'espècie porcina és molt més variable del que es pensava. Els investigadors han trobat una variabilitat genètica per al gen FABP4 (que està implicat en la quantitat de grasa que deposita l'animal) deu vegades superior a la trobada a l'espècie humana i similar a la de les espècies silvestres, no domesticades. A més, han descobert que el porc ibèric és molt variable.La especie porcina es mucho más variable de lo que se pensaba. Los investigadores han encontrado una variabilidad genética para el gen FABP4 (que está implicado en la cantidad de grasa que deposita el animal) diez veces superior a la encontrada en la especie humana y similar a la de las especies silvestres, no domesticadas. Además, han descubierto que el cerdo Ibérico es muy variable.The porcine species is much more variable than was once thought. Researchers have found a genetic variability for the gene FABP4 (which is involved in the quantity of fat that the animal deposits) ten times greater than that found in humans and similar to that of wild, undomesticated species. In addition they have discovered that the Iberian pig is very variable

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Diposit Digital de Documents de la UAB

SeqBreed : a python tool to evaluate genomic prediction in complex scenarios

Author: Perez-Enciso Miguel
Ramírez-Ayala Lino C.
Zingaretti Laura M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Background: Genomic prediction (GP) is a method whereby DNA polymorphism information is used to predict breeding values for complex traits. Although GP can significantly enhance predictive accuracy, it can be expensive and difficult to implement. To help design optimum breeding programs and experiments, including genome-wide association studies and genomic selection experiments, we have developed SeqBreed, a generic and flexible forward simulator programmed in python3. Results: SeqBreed accommodates sex and mitochondrion chromosomes as well as autopolyploidy. It can simulate any number of complex phenotypes that are determined by any number of causal loci. SeqBreed implements several GP methods, including genomic best linear unbiased prediction (GBLUP), single-step GBLUP, pedigree-based BLUP, and mass selection. We illustrate its functionality with Drosophila genome reference panel (DGRP) sequence data and with tetraploid potato genotype data. Conclusions: SeqBreed is a flexible and easy to use tool that can be used to optimize GP or genome-wide association studies. It incorporates some of the most popular GP methods and includes several visualization tools. Code is open and can be freely modified. Software, documentation, and examples are available at https://github.com/miguelperezenciso/SeqBreed

Diposit Digital de Documents de la UAB

Digital.CSIC

Genome-Wide Footprints of Pig Domestication and Selection Revealed through Massive Parallel Sequencing of Pooled DNA

Author: Amaral A.J.
Crooijmans R.P.M.A.
Ferretti L.
Groenen M.A.M.
Megens H.J.W.C.
Nie H.
Perez-Enciso M.
Ramos-Onsins S.E.
Schook L.B.
Publication venue
Publication date: 01/01/2011
Field of study

Background Artificial selection has caused rapid evolution in domesticated species. The identification of selection footprints across domesticated genomes can contribute to uncover the genetic basis of phenotypic diversity. Methodology/Main Findings Genome wide footprints of pig domestication and selection were identified using massive parallel sequencing of pooled reduced representation libraries (RRL) representing ~2% of the genome from wild boar and four domestic pig breeds (Large White, Landrace, Duroc and Pietrain) which have been under strong selection for muscle development, growth, behavior and coat color. Using specifically developed statistical methods that account for DNA pooling, low mean sequencing depth, and sequencing errors, we provide genome-wide estimates of nucleotide diversity and genetic differentiation in pig. Widespread signals suggestive of positive and balancing selection were found and the strongest signals were observed in Pietrain, one of the breeds most intensively selected for muscle development. Most signals were population-specific but affected genomic regions which harbored genes for common biological categories including coat color, brain development, muscle development, growth, metabolism, olfaction and immunity. Genetic differentiation in regions harboring genes related to muscle development and growth was higher between breeds than between a given breed and the wild boar. Conclusions/Significance These results, suggest that although domesticated breeds have experienced similar selective pressures, selection has acted upon different genes. This might reflect the multiple domestication events of European breeds or could be the result of subsequent introgression of Asian alleles. Overall, it was estimated that approximately 7% of the porcine genome has been affected by selection events. This study illustrates that the massive parallel sequencing of genomic pools is a cost-effective approach to identify footprints of selection

Wageningen University & Research Publications

Correction to : Opportunities and limits of combining microbiome and genome data for complex trait prediction

Author: de los Campos Gustavo
Perez-Enciso Miguel
Ramayo-Caldas Yuliaxis
Zingaretti Laura M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

PubMed Central

Diposit Digital de Documents de la UAB

Transposable element polymorphisms improve prediction of complex agronomic traits in rice

Author: Casacuberta Josep M.
Castanera Raúl
Perez-Enciso Miguel
Ramos-Onsins Sebastián E.
Vourlaki Ioanna-Theoni
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

Acord transformatiu CRUE-CSICKey message: Transposon insertion polymorphisms can improve prediction of complex agronomic traits in rice compared to using SNPs only, especially when accessions to be predicted are less related to the training set. Abstract: Transposon insertion polymorphisms (TIPs) are significant sources of genetic variation. Previous work has shown that TIPs can improve detection of causative loci on agronomic traits in rice. Here, we quantify the fraction of variance explained by single nucleotide polymorphisms (SNPs) compared to TIPs, and we explore whether TIPs can improve prediction of traits when compared to using only SNPs. We used eleven traits of agronomic relevance from by five different rice population groups (Aus, Indica, Aromatic, Japonica, and Admixed), 738 accessions in total. We assess prediction by applying data split validation in two scenarios. In the within-population scenario, we predicted performance of improved Indica varieties using the rest of Indica accessions. In the across population scenario, we predicted all Aromatic and Admixed accessions using the rest of populations. In each scenario, Bayes C and a Bayesian reproducible kernel Hilbert space regression were compared. We find that TIPs can explain an important fraction of total genetic variance and that they also improve genomic prediction. In the across population prediction scenario, TIPs outperformed SNPs in nine out of the eleven traits analyzed. In some traits like leaf senescence or grain width, using TIPs increased predictive correlation by 30-50%. Our results evidence, for the first time, that TIPs genotyping can improve prediction on complex agronomic traits in rice, especially when accessions to be predicted are less related to training accessions

PubMed Central

Diposit Digital de Documents de la UAB

Digital.CSIC

On the holobiont 'predictome' of immunocompetence in pigs

Author: Ballester Devis Maria
Calle-García Joan
Perez-Enciso Miguel
Quintanilla Raquel
Ramayo-Caldas Yuliaxis
Zingaretti Laura M.
Publication venue
Publication date: 01/01/2023
Field of study

Gut microbial composition plays an important role in numerous traits, including immune response. Integration of host genomic information with microbiome data is a natural step in the prediction of complex traits, although methods to optimize this are still largely unexplored. In this paper, we assess the impact of different modelling strategies on the predictive capacity for six porcine immunocompetence traits when both genotype and microbiota data are available. We used phenotypic data on six immunity traits and the relative abundance of gut bacterial communities on 400 Duroc pigs that were genotyped for 70 k SNPs. We compared the predictive accuracy, defined as the correlation between predicted and observed phenotypes, of a wide catalogue of models: reproducing kernel Hilbert space (RKHS), Bayes C, and an ensemble method, using a range of priors and microbial clustering strategies. Combined (holobiont) models that include both genotype and microbiome data were compared with partial models that use one source of variation only. Overall, holobiont models performed better than partial models. Host genotype was especially relevant for predicting adaptive immunity traits (i.e., concentration of immunoglobulins M and G), whereas microbial composition was important for predicting innate immunity traits (i.e., concentration of haptoglobin and C-reactive protein and lymphocyte phagocytic capacity). None of the models was uniformly best across all traits. We observed a greater variability in predictive accuracies across models when microbiability (the variance explained by the microbiome) was high. Clustering microbial abundances did not necessarily increase predictive accuracy. Gut microbiota information is useful for predicting immunocompetence traits, especially those related to innate immunity. Modelling microbiome abundances deserves special attention when microbiability is high. Clustering microbial data for prediction is not recommended by default. The online version contains supplementary material available at 10.1186/s12711-023-00803-4

Diposit Digital de Documents de la UAB

Challenges of poor surface water drainage and wastewater management in refugee camps

Author: Bosse Mirte
Crooijmans Richard P. M. A.
Groenen Martien AM
Herrero-Medrano Juan Manuel
Megens Hendrik-Jan
Perez-Enciso Miguel
Publication venue: IIETA (International Information and Engineering Technology Association)
Publication date: 01/01/2014
Field of study

Since refugee camps are meant to be temporary and setting them up usually require urgency, little attention has been given to provision of surface water drainage and to a lesser extent wastewater management. As the population of refugees in these camps continues to grow, the effectiveness of drainage infrastructure continues to diminish. In addition, availability of sufficient safe drinking water and wastewater management have become difficult in the refugee camps across the world. The present situation in refugee camps across the world, such as flooding and outbreak of water-related diseases in South Sudan refugee camps, has made the need for sustainable approach to solving the problems to be very urgent. One sustainable way of solving the problems of flooding and outbreak of diseases in refugee camps is to provide effective drainage and wastewater infrastructure that ensures all the wastewater are properly collected, treated and reused for various purposes such as agriculture, drinking, laundry and other relevant uses. This paper therefore presents the current state of drainage and wastewater management in two refugee camps and propose low-cost technologies for stormwater management, wastewater collection, treatment and potential reuse, suitable for these refugee camps

Crossref

Greenwich Academic Literature Archive

Springer - Publisher Connector

UWE Bristol Research Repository

PubMed Central

Diposit Digital de Documents de la UAB

Digital.CSIC

Quantitative trait locus analysis of hybrid pedigrees: variance-components model, inbreeding parameter, and power

Author: AG Comuzzie
C Chevalet
C Ovilo
C Xie
CC Li
CI Amos
CI Amos
CS Haley
CS Haley
DD Kosambi
DE Goldgar
ES Lander
Gulnara R Svischeva
J Blangero
JK Haseman
JT Williams
K Lange
L Almasy
LL Lo
M Kendall
M Perez-Enciso
M Pérez-Enciso
M Pérez-Enciso
M Pérez-Enciso
NJ Schork
O Martinez
PC Sham
R Jansen
TI Axenovich
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background For the last years reliable mapping of quantitative trait loci (QTLs) has become feasible through linkage analysis based on the variance-components method. There are now many approaches to the QTL analysis of various types of crosses within one population (breed) as well as crosses between divergent populations (breeds). However, to analyse a complex pedigree with dominance and inbreeding, when the pedigree's founders have an inter-population (hybrid) origin, it is necessary to develop a high-powered method taking into account these features of the pedigree. Results We offer a universal approach to QTL analysis of complex pedigrees descended from crosses between outbred parental lines with different QTL allele frequencies. This approach improves the established variance-components method due to the consideration of the genetic effect conditioned by inter-population origin and inbreeding of individuals. To estimate model parameters, namely additive and dominant effects, and the allelic frequencies of the QTL analysed, and also to define the QTL positions on a chromosome with respect to genotyped markers, we used the maximum-likelihood method. To detect linkage between the QTL and the markers we propose statistics with a non-central χ2-distribution that provides the possibility to deduce analytical expressions for the power of the method and therefore, to estimate the pedigree's size required for 80% power. The method works for arbitrarily structured pedigrees with dominance and inbreeding. Conclusion Our method uses the phenotypic values and the marker information for each individual of the pedigree under observation as initial data and can be valuable for fine mapping purposes. The power of the method is increased if the QTL effects conditioned by inter-population origin and inbreeding are enhanced. Several improvements can be developed to take into account fixed factors affecting trait formation, such as age and sex.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Exploring deep learning for complex trait genomic prediction in polyploid outcrossing species

Author: Ferrão Luis Felipe V.
Gezan Salvador Alejandro
Monfort Amparo
Muñoz Patricio R.
Osorio Luis F.
Perez-Enciso Miguel
Whitaker Vance M.
Zingaretti Laura M.
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2020
Field of study

Genomic prediction (GP) is the procedure whereby the genetic merits of untested candidates are predicted using genome wide marker information. Although numerous examples of GP exist in plants and animals, applications to polyploid organisms are still scarce, partly due to limited genome resources and the complexity of this system. Deep learning (DL) techniques comprise a heterogeneous collection of machine learning algorithms that have excelled at many prediction tasks. A potential advantage of DL for GP over standard linear model methods is that DL can potentially take into account all genetic interactions, including dominance and epistasis, which are expected to be of special relevance in most polyploids. In this study, we evaluated the predictive accuracy of linear and DL techniques in two important small fruits or berries: strawberry and blueberry. The two datasets contained a total of 1,358 allopolyploid strawberry (2n=8x=112) and 1,802 autopolyploid blueberry (2n=4x=48) individuals, genotyped for 9,908 and 73,045 single nucleotide polymorphism (SNP) markers, respectively, and phenotyped for five agronomic traits each. DL depends on numerous parameters that influence performance and optimizing hyperparameter values can be a critical step. Here we show that interactions between hyperparameter combinations should be expected and that the number of convolutional filters and regularization in the first layers can have an important effect on model performance. In terms of genomic prediction, we did not find an advantage of DL over linear model methods, except when the epistasis component was important. Linear Bayesian models were better than convolutional neural networks for the full additive architecture, whereas the opposite was observed under strong epistasis. However, by using a parameterization capable of taking into account these non-linear effects, Bayesian linear models can match or exceed the predictive accuracy of DL. A semiautomatic implementation of the DL pipeline is available at https://github.com/lauzingaretti/deepGP/

Diposit Digital de Documents de la UAB

Digital.CSIC