332 research outputs found

    Detecting discordance enrichment among a series of two-sample genome-wide expression data sets

    Get PDF
    Background With the current microarray and RNA-seq technologies, two-sample genome-wide expression data have been widely collected in biological and medical studies. The related differential expression analysis and gene set enrichment analysis have been frequently conducted. Integrative analysis can be conducted when multiple data sets are available. In practice, discordant molecular behaviors among a series of data sets can be of biological and clinical interest. Methods In this study, a statistical method is proposed for detecting discordance gene set enrichment. Our method is based on a two-level multivariate normal mixture model. It is statistically efficient with linearly increased parameter space when the number of data sets is increased. The model-based probability of discordance enrichment can be calculated for gene set detection. Results We apply our method to a microarray expression data set collected from forty-five matched tumor/non-tumor pairs of tissues for studying pancreatic cancer. We divided the data set into a series of non-overlapping subsets according to the tumor/non-tumor paired expression ratio of gene PNLIP (pancreatic lipase, recently shown it association with pancreatic cancer). The log-ratio ranges from a negative value (e.g. more expressed in non-tumor tissue) to a positive value (e.g. more expressed in tumor tissue). Our purpose is to understand whether any gene sets are enriched in discordant behaviors among these subsets (when the log-ratio is increased from negative to positive). We focus on KEGG pathways. The detected pathways will be useful for our further understanding of the role of gene PNLIP in pancreatic cancer research. Among the top list of detected pathways, the neuroactive ligand receptor interaction and olfactory transduction pathways are the most significant two. Then, we consider gene TP53 that is well-known for its role as tumor suppressor in cancer research. The log-ratio also ranges from a negative value (e.g. more expressed in non-tumor tissue) to a positive value (e.g. more expressed in tumor tissue). We divided the microarray data set again according to the expression ratio of gene TP53. After the discordance enrichment analysis, we observed overall similar results and the above two pathways are still the most significant detections. More interestingly, only these two pathways have been identified for their association with pancreatic cancer in a pathway analysis of genome-wide association study (GWAS) data. Conclusions This study illustrates that some disease-related pathways can be enriched in discordant molecular behaviors when an important disease-related gene changes its expression. Our proposed statistical method is useful in the detection of these pathways. Furthermore, our method can also be applied to genome-wide expression data collected by the recent RNA-seq technology

    Concordant integrative gene set enrichment analysis of multiple large-scale two-sample expression data sets

    Get PDF
    Background Gene set enrichment analysis (GSEA) is an important approach to the analysis of coordinate expression changes at a pathway level. Although many statistical and computational methods have been proposed for GSEA, the issue of a concordant integrative GSEA of multiple expression data sets has not been well addressed. Among different related data sets collected for the same or similar study purposes, it is important to identify pathways or gene sets with concordant enrichment. Methods We categorize the underlying true states of differential expression into three representative categories: no change, positive change and negative change. Due to data noise, what we observe from experiments may not indicate the underlying truth. Although these categories are not observed in practice, they can be considered in a mixture model framework. Then, we define the mathematical concept of concordant gene set enrichment and calculate its related probability based on a three-component multivariate normal mixture model. The related false discovery rate can be calculated and used to rank different gene sets. Results We used three published lung cancer microarray gene expression data sets to illustrate our proposed method. One analysis based on the first two data sets was conducted to compare our result with a previous published result based on a GSEA conducted separately for each individual data set. This comparison illustrates the advantage of our proposed concordant integrative gene set enrichment analysis. Then, with a relatively new and larger pathway collection, we used our method to conduct an integrative analysis of the first two data sets and also all three data sets. Both results showed that many gene sets could be identified with low false discovery rates. A consistency between both results was also observed. A further exploration based on the KEGG cancer pathway collection showed that a majority of these pathways could be identified by our proposed method. Conclusions This study illustrates that we can improve detection power and discovery consistency through a concordant integrative analysis of multiple large-scale two-sample gene expression data sets

    Global Multidimensional Poverty Index 2021: Unmasking disparities by ethnicity, caste and gender

    Get PDF
    This report provides a comprehensive picture of acute multidimensional poverty to inform the work of countries and communities building a more just future for the global poor. Part I focuses on where we are now. It examines the levels and composition of multidimensional poverty across 109 countries covering 5.9 billion people. It also discusses trends among more than 5 billion people in 80 countries, 70 of which showed a statistically significant reduction in Multidimensional Poverty Index value during at least one of the time periods presented. While the COVID-19 pandemic's impact on developed countries is already an active area of research, this report offers a multidimensional poverty perspective on the experience of developing countries. It explores how the pandemic has affected three key development indicators (social protection, livelihoods and school attendance), in association with multidimensional poverty, with a focus predominantly on Sub-Saharan Africa. Part II profiles disparities in multidimensional poverty with new research that scrutinizes estimates disaggregated by ethnicity or race and by caste to identify who and how people are being left behind. It also explores the proportion of multidimensionally poor people who live in a household in which no female member has completed at least six years of schooling and presents disparities in multidimensional poverty by gender of the household head. Finally, it probes interconnections between the incidence of multidimensional poverty and intimate partner violence against women and girls

    Association Study with 77 SNPs Confirms the Robust Role for the rs10830963/G of MTNR1B Variant and Identifies Two Novel Associations in Gestational Diabetes Mellitus Development

    Get PDF
    CONTEXT: Genetic variation in human maternal DNA contributes to the susceptibility for development of gestational diabetes mellitus (GDM). OBJECTIVE: We assessed 77 maternal single nucleotide gene polymorphisms (SNPs) for associations with GDM or plasma glucose levels at OGTT in pregnancy. METHODS: 960 pregnant women (after dropouts 820: case/control: m99'WHO: 303/517, IADPSG: 287/533) were enrolled in two countries into this case-control study. After genomic DNA isolation the 820 samples were collected in a GDM biobank and assessed using KASP (LGC Genomics) genotyping assay. Logistic regression risk models were used to calculate ORs according to IADPSG/m'99WHO criteria based on standard OGTT values. RESULTS: The most important risk alleles associated with GDM were rs10830963/G of MTNR1B (OR = 1.84/1.64 [IADPSG/m'99WHO], p = 0.0007/0.006), rs7754840/C (OR = 1.51/NS, p = 0.016) of CDKAL1 and rs1799884/T (OR = 1.4/1.56, p = 0.04/0.006) of GCK. The rs13266634/T (SLC30A8, OR = 0.74/0.71, p = 0.05/0.02) and rs7578326/G (LOC646736/IRS1, OR = 0.62/0.60, p = 0.001/0.006) variants were associated with lower risk to develop GDM. Carrying a minor allele of rs10830963 (MTNR1B); rs7903146 (TCF7L2); rs1799884 (GCK) SNPs were associated with increased plasma glucose levels at routine OGTT. CONCLUSIONS: We confirmed the robust association of MTNR1B rs10830963/G variant with GDM binary and glycemic traits in this Caucasian case-control study. As novel associations we report the minor, G allele of the rs7578326 SNP in the LOC646736/IRS1 region as a significant and the rs13266634/T SNP (SLC30A8) as a suggestive protective variant against GDM development. Genetic susceptibility appears to be more preponderant in individuals who meet both the modified 99'WHO and the IADPSG GDM diagnostic criteria

    Experimental confirmation of efficient island divertor operation and successful neoclassical transport optimization in Wendelstein 7-X

    Get PDF

    Overview of progress in European medium sized tokamaks towards an integrated plasma-edge/wall solution

    Get PDF
    Integrating the plasma core performance with an edge and scrape-off layer (SOL) that leads to tolerable heat and particle loads on the wall is a major challenge. The new European medium size tokamak task force (EU-MST) coordinates research on ASDEX Upgrade (AUG), MAST and TCV. This multi-machine approach within EU-MST, covering a wide parameter range, is instrumental to progress in the field, as ITER and DEMO core/pedestal and SOL parameters are not achievable simultaneously in present day devices. A two prong approach is adopted. On the one hand, scenarios with tolerable transient heat and particle loads, including active edge localised mode (ELM) control are developed. On the other hand, divertor solutions including advanced magnetic configurations are studied. Considerable progress has been made on both approaches, in particular in the fields of: ELM control with resonant magnetic perturbations (RMP), small ELM regimes, detachment onset and control, as well as filamentary scrape-off-layer transport. For example full ELM suppression has now been achieved on AUG at low collisionality with n  =  2 RMP maintaining good confinement HH(98,y2)0.95{{H}_{\text{H}\left(98,\text{y}2\right)}}\approx 0.95 . Advances have been made with respect to detachment onset and control. Studies in advanced divertor configurations (Snowflake, Super-X and X-point target divertor) shed new light on SOL physics. Cross field filamentary transport has been characterised in a wide parameter regime on AUG, MAST and TCV progressing the theoretical and experimental understanding crucial for predicting first wall loads in ITER and DEMO. Conditions in the SOL also play a crucial role for ELM stability and access to small ELM regimes

    Disruption prediction at JET through deep convolutional neural networks using spatiotemporal information from plasma profiles

    Get PDF
    In view of the future high power nuclear fusion experiments, the early identification of disruptions is a mandatory requirement, and presently the main goal is moving from the disruption mitigation to disruption avoidance and control. In this work, a deep-convolutional neural network (CNN) is proposed to provide early detection of disruptive events at JET. The CNN ability to learn relevant features, avoiding hand-engineered feature extraction, has been exploited to extract the spatiotemporal information from 1D plasma profiles. The model is trained with regularly terminated discharges and automatically selected disruptive phase of disruptions, coming from the recent ITER-like-wall experiments. The prediction performance is evaluated using a set of discharges representative of different operating scenarios, and an in-depth analysis is made to evaluate the performance evolution with respect to the considered experimental conditions. Finally, as real-time triggers and termination schemes are being developed at JET, the proposed model has been tested on a set of recent experiments dedicated to plasma termination for disruption avoidance and mitigation. The CNN model demonstrates very high performance, and the exploitation of 1D plasma profiles as model input allows us to understand the underlying physical phenomena behind the predictor decision
    corecore