5 research outputs found
Data_Sheet_1_Genetic Pathways and Functional Subnetworks for the Complex Nature of Bipolar Disorder in Genome-Wide Association Study.csv
Bipolar disorder is a complex psychiatric trait that is also recognized as a high substantial heritability from a worldwide distribution. The success in identifying susceptibility loci for bipolar disorder (BPD) has been limited due to its complex genetic architecture. Growing evidence from association studies including genome-wide association (GWA) studies points to the need of improved analytic strategies to pinpoint the missing heritability for BPD. More importantly, many studies indicate that BPD has a strong association with dementia. We conducted advanced pathway analytics strategies to investigate synergistic effects of multilocus within biologically functional pathways, and further demonstrated functional effects among proteins in subnetworks to examine mechanisms underlying the complex nature of bipolarity using a GWA dataset for BPD. We allowed bipolar susceptible loci to play a role that takes larger weights in pathway-based analytic approaches. Having significantly informative genes identified from enriched pathways, we further built function-specific subnetworks of protein interactions using MetaCore. The gene-wise scores (i.e., minimum p-value) were corrected for the gene-length, and the results were corrected for multiple tests using Benjamini and Hochberg’s method. We found 87 enriched pathways that are significant for BPD; of which 36 pathways were reported. Most of them are involved with several metabolic processes, neural systems, immune system, molecular transport, cellular communication, and signal transduction. Three significant and function-related subnetworks with multiple hotspots were reported to link with several Gene Ontology processes for BPD. Our comprehensive pathway-network frameworks demonstrated that the use of prior knowledge is promising to facilitate our understanding between complex psychiatric disorders (e.g., BPD) and dementia for the access to the connection and clinical implications, along with the development and progression of dementia.</p
DataSheet_3_An advanced systems biology framework of feature engineering for cold tolerance genes discovery from integrated omics and non-omics data in soybean.xlsx
Soybean is sensitive to low temperatures during the crop growing season. An urgent demand for breeding cold-tolerant cultivars to alleviate the production loss is apparent to cope with this scenario. Cold-tolerant trait is a complex and quantitative trait controlled by multiple genes, environmental factors, and their interaction. In this study, we proposed an advanced systems biology framework of feature engineering for the discovery of cold tolerance genes (CTgenes) from integrated omics and non-omics (OnO) data in soybean. An integrative pipeline was introduced for feature selection and feature extraction from different layers in the integrated OnO data using data ensemble methods and the non-parameter random forest prioritization to minimize uncertainties and false positives for accuracy improvement of results. In total, 44, 143, and 45 CTgenes were identified in short-, mid-, and long-term cold treatment, respectively, from the corresponding gene-pool. These CTgenes outperformed the remaining genes, the random genes, and the other candidate genes identified by other approaches in an independent RNA-seq database. Furthermore, we applied pathway enrichment and crosstalk network analyses to uncover relevant physiological pathways with the discovery of underlying cold tolerance in hormone- and defense-related modules. Our CTgenes were validated by using 55 SNP genotype data of 56 soybean samples in cold tolerance experiments. This suggests that the CTgenes identified from our proposed systematic framework can effectively distinguish cold-resistant and cold-sensitive lines. It is an important advancement in the soybean cold-stress response. The proposed pipelines provide an alternative solution to biomarker discovery, module discovery, and sample classification underlying a particular trait in plants in a robust and efficient way.</p
DataSheet_2_An advanced systems biology framework of feature engineering for cold tolerance genes discovery from integrated omics and non-omics data in soybean.xlsx
Soybean is sensitive to low temperatures during the crop growing season. An urgent demand for breeding cold-tolerant cultivars to alleviate the production loss is apparent to cope with this scenario. Cold-tolerant trait is a complex and quantitative trait controlled by multiple genes, environmental factors, and their interaction. In this study, we proposed an advanced systems biology framework of feature engineering for the discovery of cold tolerance genes (CTgenes) from integrated omics and non-omics (OnO) data in soybean. An integrative pipeline was introduced for feature selection and feature extraction from different layers in the integrated OnO data using data ensemble methods and the non-parameter random forest prioritization to minimize uncertainties and false positives for accuracy improvement of results. In total, 44, 143, and 45 CTgenes were identified in short-, mid-, and long-term cold treatment, respectively, from the corresponding gene-pool. These CTgenes outperformed the remaining genes, the random genes, and the other candidate genes identified by other approaches in an independent RNA-seq database. Furthermore, we applied pathway enrichment and crosstalk network analyses to uncover relevant physiological pathways with the discovery of underlying cold tolerance in hormone- and defense-related modules. Our CTgenes were validated by using 55 SNP genotype data of 56 soybean samples in cold tolerance experiments. This suggests that the CTgenes identified from our proposed systematic framework can effectively distinguish cold-resistant and cold-sensitive lines. It is an important advancement in the soybean cold-stress response. The proposed pipelines provide an alternative solution to biomarker discovery, module discovery, and sample classification underlying a particular trait in plants in a robust and efficient way.</p
Data_Sheet_1_Genetic Pathways and Functional Subnetworks for the Complex Nature of Bipolar Disorder in Genome-Wide Association Study.pdf
Bipolar disorder is a complex psychiatric trait that is also recognized as a high substantial heritability from a worldwide distribution. The success in identifying susceptibility loci for bipolar disorder (BPD) has been limited due to its complex genetic architecture. Growing evidence from association studies including genome-wide association (GWA) studies points to the need of improved analytic strategies to pinpoint the missing heritability for BPD. More importantly, many studies indicate that BPD has a strong association with dementia. We conducted advanced pathway analytics strategies to investigate synergistic effects of multilocus within biologically functional pathways, and further demonstrated functional effects among proteins in subnetworks to examine mechanisms underlying the complex nature of bipolarity using a GWA dataset for BPD. We allowed bipolar susceptible loci to play a role that takes larger weights in pathway-based analytic approaches. Having significantly informative genes identified from enriched pathways, we further built function-specific subnetworks of protein interactions using MetaCore. The gene-wise scores (i.e., minimum p-value) were corrected for the gene-length, and the results were corrected for multiple tests using Benjamini and Hochberg’s method. We found 87 enriched pathways that are significant for BPD; of which 36 pathways were reported. Most of them are involved with several metabolic processes, neural systems, immune system, molecular transport, cellular communication, and signal transduction. Three significant and function-related subnetworks with multiple hotspots were reported to link with several Gene Ontology processes for BPD. Our comprehensive pathway-network frameworks demonstrated that the use of prior knowledge is promising to facilitate our understanding between complex psychiatric disorders (e.g., BPD) and dementia for the access to the connection and clinical implications, along with the development and progression of dementia.</p
DataSheet_1_An advanced systems biology framework of feature engineering for cold tolerance genes discovery from integrated omics and non-omics data in soybean.pdf
Soybean is sensitive to low temperatures during the crop growing season. An urgent demand for breeding cold-tolerant cultivars to alleviate the production loss is apparent to cope with this scenario. Cold-tolerant trait is a complex and quantitative trait controlled by multiple genes, environmental factors, and their interaction. In this study, we proposed an advanced systems biology framework of feature engineering for the discovery of cold tolerance genes (CTgenes) from integrated omics and non-omics (OnO) data in soybean. An integrative pipeline was introduced for feature selection and feature extraction from different layers in the integrated OnO data using data ensemble methods and the non-parameter random forest prioritization to minimize uncertainties and false positives for accuracy improvement of results. In total, 44, 143, and 45 CTgenes were identified in short-, mid-, and long-term cold treatment, respectively, from the corresponding gene-pool. These CTgenes outperformed the remaining genes, the random genes, and the other candidate genes identified by other approaches in an independent RNA-seq database. Furthermore, we applied pathway enrichment and crosstalk network analyses to uncover relevant physiological pathways with the discovery of underlying cold tolerance in hormone- and defense-related modules. Our CTgenes were validated by using 55 SNP genotype data of 56 soybean samples in cold tolerance experiments. This suggests that the CTgenes identified from our proposed systematic framework can effectively distinguish cold-resistant and cold-sensitive lines. It is an important advancement in the soybean cold-stress response. The proposed pipelines provide an alternative solution to biomarker discovery, module discovery, and sample classification underlying a particular trait in plants in a robust and efficient way.</p
