Treatment of organic resources before soil incorporation in semi-arid regions improves resilience to El Niño, and increases crop production and economic returns
We are grateful for support from the DFID-NERC El Niño programme in project NE P004830, “Building Resilience in Ethiopia’s Awassa region to Drought (BREAD)”, the ESRC NEXUS programme in project IEAS/POO2501/1, “Improving organic resource use in rural Ethiopia (IPORE)”, and the NERC ESPA programme in project NEK0104251, “Alternative carbon investments in ecosystems for poverty alleviation (ALTER)”. We are also grateful to Anke Fischer (James Hutton Institute) for her comments on the paper.
Pathogenetics of alveolar capillary dysplasia with misalignment of pulmonary veins.
Alveolar capillary dysplasia with misalignment of pulmonary veins (ACDMPV) is a lethal lung developmental disorder caused by heterozygous point mutations or genomic deletion copy-number variants (CNVs) of FOXF1 or its upstream enhancer involving fetal lung-expressed long noncoding RNA genes LINC01081 and LINC01082. Using custom-designed array comparative genomic hybridization, Sanger sequencing, whole exome sequencing (WES), and bioinformatic analyses, we studied 22 new unrelated families (20 postnatal and two prenatal) with clinically diagnosed ACDMPV. We describe novel deletion CNVs at the FOXF1 locus in 13 unrelated ACDMPV patients. Together with the previously reported cases, all 31 genomic deletions in 16q24.1 that are pathogenic for ACDMPV and for which parental origin was determined arose de novo, with 30 of them occurring on the maternally inherited chromosome 16, strongly implicating genomic imprinting of the FOXF1 locus in human lungs. Surprisingly, we have also identified four ACDMPV families with pathogenic variants in the FOXF1 locus that arose on the paternal chromosome 16. Interestingly, a combination of severe cardiac defects, including hypoplastic left heart, and single umbilical artery was observed only in children with deletion CNVs involving FOXF1 and its upstream enhancer. Our data demonstrate that genomic imprinting at 16q24.1 plays an important role in variable ACDMPV manifestation, likely through long-range regulation of FOXF1 expression, and may also be responsible for key phenotypic features of maternal uniparental disomy 16. Moreover, in one family, WES revealed a de novo missense variant in ESRP1, potentially implicating FGF signaling in the etiology of ACDMPV.
Common genetic variants in the CLDN2 and PRSS1-PRSS2 loci alter risk for alcohol-related and sporadic pancreatitis
Pancreatitis is a complex, progressively destructive inflammatory disorder. Alcohol was long thought to be the primary causative agent, but genetic contributions have been of interest since the discovery that rare PRSS1, CFTR, and SPINK1 variants were associated with pancreatitis risk. We now report two significant genome-wide associations identified and replicated at PRSS1-PRSS2 (p < 1×10⁻¹²) and X-linked CLDN2 (p < 1×10⁻²¹) through a two-stage genome-wide study (Stage 1, 676 cases and 4507 controls; Stage 2, 910 cases and 4170 controls). The PRSS1 variant affects susceptibility by altering expression of the primary trypsinogen gene. The CLDN2 risk allele is associated with atypical localization of claudin-2 in pancreatic acinar cells. The homozygous (or hemizygous male) CLDN2 genotype confers the greatest risk, and its alleles interact with alcohol consumption to amplify risk. These results could partially explain the high frequency of alcohol-related pancreatitis in men: the male hemizygote frequency is 0.26, whereas the female homozygote frequency is 0.07.
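The two genotype frequencies quoted at the end of this abstract are consistent with simple X-linked arithmetic. The sketch below is illustrative only: it assumes Hardy-Weinberg equilibrium and the same risk-allele frequency in both sexes, neither of which is stated in the abstract, and the function name is hypothetical.

```python
# Illustrative only: assumes Hardy-Weinberg equilibrium and an identical
# risk-allele frequency in males and females (assumptions, not from the abstract).

def x_linked_high_risk_frequencies(risk_allele_freq):
    """Return (male hemizygote, female homozygote) frequencies for an X-linked allele."""
    if not 0.0 <= risk_allele_freq <= 1.0:
        raise ValueError("allele frequency must be between 0 and 1")
    male_hemizygote = risk_allele_freq          # males carry a single X chromosome
    female_homozygote = risk_allele_freq ** 2   # females need the allele on both X chromosomes
    return male_hemizygote, female_homozygote


male, female = x_linked_high_risk_frequencies(0.26)
print(f"male hemizygote: {male:.2f}, female homozygote: {female:.2f}")
```

Under these assumptions, a risk-allele frequency of 0.26 gives a female homozygote frequency of 0.26² = 0.0676, which rounds to the reported 0.07.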
The genetic architecture of the human cerebral cortex
The cerebral cortex underlies our complex cognitive capabilities, yet little is known about the specific genetic loci that influence human cortical structure. To identify genetic variants that affect cortical structure, we conducted a genome-wide association meta-analysis of brain magnetic resonance imaging data from 51,665 individuals. We analyzed the surface area and average thickness of the whole cortex and 34 regions with known functional specializations. We identified 199 significant loci and found significant enrichment for loci influencing total surface area within regulatory elements that are active during prenatal cortical development, supporting the radial unit hypothesis. Loci that affect regional surface area cluster near genes in Wnt signaling pathways, which influence progenitor expansion and areal identity. Variation in cortical structure is genetically correlated with cognitive function, Parkinson's disease, insomnia, depression, neuroticism, and attention deficit hyperactivity disorder.
Large expert-curated database for benchmarking document similarity detection in biomedical literature search
Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that covers a variety of research fields, such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article(s). The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.
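To make one of the baseline methods named above concrete, here is a minimal sketch of Okapi BM25 ranking in a common textbook formulation. The consortium's actual implementation, tokenization and parameters are not given in the abstract; the values `k1=1.5` and `b=0.75`, the smoothed IDF variant, the function name and the toy corpus are all assumptions made for illustration.

```python
import math
from collections import Counter


def bm25_scores(query, documents, k1=1.5, b=0.75):
    """Score each tokenized document against a tokenized query with Okapi BM25."""
    n_docs = len(documents)
    avg_len = sum(len(doc) for doc in documents) / n_docs
    # Document frequency: number of documents that contain each term.
    doc_freq = Counter(term for doc in documents for term in set(doc))
    scores = []
    for doc in documents:
        term_freq = Counter(doc)
        # Length normalization: longer-than-average documents are penalized.
        length_norm = k1 * (1 - b + b * len(doc) / avg_len)
        score = 0.0
        for term in set(query):
            if term not in term_freq:
                continue
            # Smoothed, non-negative IDF variant (the "+ 1" inside the log).
            idf = math.log((n_docs - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5) + 1)
            tf = term_freq[term]
            score += idf * tf * (k1 + 1) / (tf + length_norm)
        scores.append(score)
    return scores


# Hypothetical toy corpus of tokenized titles, not data from the benchmark.
seed = "genetic variants pancreatitis risk".split()
candidates = [
    "common genetic variants alter pancreatitis risk".split(),
    "genetic architecture of the cerebral cortex".split(),
    "soil organic resources and crop production".split(),
]
scores = bm25_scores(seed, candidates)
ranking = sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)
print(ranking)
```

In this toy case the first candidate shares four terms with the seed, the second shares one, and the third shares none, so the candidates are ranked in that order.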
Genome-wide structural variant analysis identifies risk loci for non-Alzheimer’s dementias
We characterized the role of structural variants, a largely unexplored type of genetic variation, in two non-Alzheimer’s dementias, namely Lewy body dementia (LBD) and frontotemporal dementia (FTD)/amyotrophic lateral sclerosis (ALS). To do this, we applied an advanced structural variant calling pipeline (GATK-SV) to short-read whole-genome sequence data from 5,213 European-ancestry cases and 4,132 controls. We discovered, replicated, and validated a deletion in TPCN1 as a novel risk locus for LBD and detected the known structural variants at the C9orf72 and MAPT loci as associated with FTD/ALS. We also identified rare pathogenic structural variants in both LBD and FTD/ALS. Finally, we assembled a catalog of structural variants that can be mined for new insights into the pathogenesis of these understudied forms of dementia.
Comparative performances of machine learning methods for classifying Crohn Disease patients using genome-wide genotyping data
Crohn Disease (CD) is a complex genetic disorder for which more than 140 genes have been identified using genome-wide association studies (GWAS). However, the genetic architecture of the trait remains largely unknown. The recent development of machine learning (ML) approaches prompted us to apply them to classify healthy and diseased people according to their genomic information. The Immunochip dataset containing 18,227 CD patients and 34,050 healthy controls enrolled and genotyped by the international Inflammatory Bowel Disease genetic consortium (IIBDGC) has been re-analyzed using a set of ML methods: penalized logistic regression (LR), gradient boosted trees (GBT) and artificial neural networks (NN). The main score used to compare the methods was the Area Under the ROC Curve (AUC) statistic. An assessment of the impact of quality control (QC), imputation and coding methods on LR results showed that QC methods and imputation of missing genotypes may artificially increase the scores. In contrast, neither the patient/control ratio nor marker preselection or coding strategies significantly affected the results. LR methods, including Lasso, Ridge and ElasticNet, provided similar results with a maximum AUC of 0.80. GBT methods like XGBoost, LightGBM and CatBoost, together with dense NN with one or more hidden layers, provided similar AUC values, suggesting limited epistatic effects in the genetic architecture of the trait. ML methods detected nearly all the genetic variants previously identified by GWAS among the best predictors, plus additional predictors with lower effects. The robustness and complementarity of the different methods are also studied. Compared to LR, non-linear models such as GBT or NN may provide robust complementary approaches to identify and classify genetic markers.
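The AUC statistic used above to compare methods can be read as the probability that a randomly chosen patient receives a higher model score than a randomly chosen control. The sketch below computes it by direct pairwise comparison. It is a minimal illustration, not the study's evaluation code; the function name, labels and scores are hypothetical.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (case, control) pairs ranked correctly; ties count 0.5."""
    cases = [s for y, s in zip(labels, scores) if y == 1]
    controls = [s for y, s in zip(labels, scores) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for case_score in cases:
        for control_score in controls:
            if case_score > control_score:
                wins += 1.0
            elif case_score == control_score:
                wins += 0.5
    return wins / (len(cases) * len(controls))


# Hypothetical labels (1 = patient, 0 = control) and classifier scores.
labels = [1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
print(roc_auc(labels, scores))
```

Here 11 of the 12 patient/control pairs are ordered correctly, so the AUC is 11/12 (about 0.917). The pairwise loop is quadratic in the number of samples; a rank-based computation would be preferable at the scale of the dataset described above.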
Genome-wide association analysis of more than 120,000 individuals identifies 15 new susceptibility loci for breast cancer
Genome-wide association studies (GWAS) and large-scale replication studies have identified common variants in 79 loci associated with breast cancer, explaining ∼14% of the familial risk of the disease. To identify new susceptibility loci, we performed a meta-analysis of 11 GWAS, comprising 15,748 breast cancer cases and 18,084 controls together with 46,785 cases and 42,892 controls from 41 studies genotyped on a 211,155-marker custom array (iCOGS). Analyses were restricted to women of European ancestry. We generated genotypes for more than 11 million SNPs by imputation using the 1000 Genomes Project reference panel, and we identified 15 new loci associated with breast cancer at P < 5 × 10⁻⁸. Combining association analysis with ChIP-seq chromatin binding data in mammary cell lines and ChIA-PET chromatin interaction data from ENCODE, we identified likely target genes in two regions: SETBP1 at 18q12.3 and RNF115 and PDZK1 at 1q21.1. One association appears to be driven by an amino acid substitution encoded in EXO1.