4,660 research outputs found

    Unsupervised Learning and Multipartite Network Models: A Promising Approach for Understanding Traditional Medicine

    Get PDF
    The ultimate goal of precision medicine is to determine right treatment for right patients based on precise diagnosis. To achieve this goal, correct stratification of patients using molecular features and clinical phenotypes is crucial. During the long history of medical science, our understanding on disease classification has been improved greatly by chemistry and molecular biology. Nowadays, we gain access to large scale patient-derived data by high-throughput technologies, generating a greater need for data science including unsupervised learning and network modeling. Unsupervised learning methods such as clustering could be a better solution to stratify patients when there is a lack of predefined classifiers. In network modularity analysis, clustering methods can be also applied to elucidate the complex structure of biological and disease networks at the systems level. In this review, we went over the main points of clustering analysis and network modeling, particularly in the context of Traditional Chinese medicine (TCM). We showed that this approach can provide novel insights on the rationale of classification for TCM herbs. In a case study, using a modularity analysis of multipartite networks, we illustrated that the TCM classifications are associated with the chemical properties of the herb ingredients. We concluded that multipartite network modeling may become a suitable data integration tool for understanding the mechanisms of actions of traditional medicine.Peer reviewe

    What Symptoms and How Long? An Interpretable AI Approach for Depression Detection in Social Media

    Full text link
    Depression is the most prevalent and serious mental illness, which induces grave financial and societal ramifications. Depression detection is key for early intervention to mitigate those consequences. Such a high-stake decision inherently necessitates interpretability. Although a few depression detection studies attempt to explain the decision based on the importance score or attention weights, these explanations misalign with the clinical depression diagnosis criterion that is based on depressive symptoms. To fill this gap, we follow the computational design science paradigm to develop a novel Multi-Scale Temporal Prototype Network (MSTPNet). MSTPNet innovatively detects and interprets depressive symptoms as well as how long they last. Extensive empirical analyses using a large-scale dataset show that MSTPNet outperforms state-of-the-art depression detection methods with an F1-score of 0.851. This result also reveals new symptoms that are unnoted in the survey approach, such as sharing admiration for a different life. We further conduct a user study to demonstrate its superiority over the benchmarks in interpretability. This study contributes to IS literature with a novel interpretable deep learning model for depression detection in social media. In practice, our proposed method can be implemented in social media platforms to provide personalized online resources for detected depressed patients.Comment: 56 pages, 10 figures, 21 table

    Network Pharmacology Approaches for Understanding Traditional Chinese Medicine

    Get PDF
    Traditional Chinese medicine (TCM) has obvious efficacy on disease treatments and is a valuable source for novel drug discovery. However, the underlying mechanism of the pharmacological effects of TCM remains unknown because TCM is a complex system with multiple herbs and ingredients coming together as a prescription. Therefore, it is urgent to apply computational tools to TCM to understand the underlying mechanism of TCM theories at the molecular level and use advanced network algorithms to explore potential effective ingredients and illustrate the principles of TCM in system biological aspects. In this thesis, we aim to understand the underlying mechanism of actions in complex TCM systems at the molecular level by bioinformatics and computational tools. In study Ⅰ, a machine learning framework was developed to predict the meridians of the herbs and ingredients. Finally, we achieved high accuracy of the meridians prediction for herbs and ingredients, suggesting an association between meridians and the molecular features of ingredients and herbs, especially the most important features for machine learning models. Secondly, we proposed a novel network approach to study the TCM formulae by quantifying the degree of interactions of pairwise herb pairs in study Ⅱ using five network distance methods, including the closest, shortest, central, kernel, as well as separation. We demonstrated that the distance of top herb pairs is shorter than that of random herb pairs, suggesting a strong interaction in the human interactome. In addition, center methods at the ingredient level outperformed the other methods. It hints to us that the central ingredients play an important role in the herbs. Thirdly, we explored the associations between herbs or ingredients and their important biological characteristics in study III, such as properties, meridians, structures, or targets via clusters from community analysis of the multipartite network. We found that herbal medicines among the same clusters tend to be more similar in the properties, meridians. Similarly, ingredients from the same cluster are more similar in structure and protein target. In summary, this thesis intends to build a bridge between the TCM system and modern medicinal systems using computational tools, including the machine learning model for meridian theory, network modelling for TCM formulae, as well as multipartite network analysis for herbal medicines and their ingredients. We demonstrated that applying novel computational approaches on the integrated high-throughput omics would provide insights for TCM and accelerate the novel drug discovery as well as repurposing from TCM.Perinteinen kiinalainen lääketiede (TCM) on ilmeinen tehokkuus taudin hoidoissa ja on arvokas lähde uuden lääkkeen löytämiseen. TCM: n farmakologisten vaikutusten taustalla oleva mekanismi pysyy kuitenkin tuntemattomassa, koska TCM on monimutkainen järjestelmä, jossa on useita yrttejä ja ainesosia, jotka tulevat yhteen reseptilääkkeeksi. Siksi on kiireellistä soveltaa Laskennallisia työkaluja TCM: lle ymmärtämään TCM-teorioiden taustalla oleva mekanismi molekyylitasolla ja käyttävät kehittyneitä verkkoalgoritmeja tutkimaan mahdollisia tehokkaita ainesosia ja havainnollistavat TCM: n periaatteita järjestelmän biologisissa näkökohdissa. Tässä opinnäytetyössä pyrimme ymmärtämään monimutkaisten TCM-järjestelmien toimintamekanismia molekyylitasolla bioinformaattilla ja laskennallisilla työkaluilla. Tutkimuksessa kehitettiin koneen oppimiskehystä yrttien ja ainesosien meridialaisista. Lopuksi saavutimme korkean tarkkuuden meridiaaneista yrtteistä ja ainesosista, mikä viittaa meridiaaneihin ja ainesosien ja yrtteihin liittyvien molekyylipiirin välillä, erityisesti koneen oppimismalleihin tärkeimmät ominaisuudet. Toiseksi ehdoimme uuden verkon lähestymistavan TCM-kaavojen tutkimiseksi kvantitoimisella vuorovaikutteisten yrttiparien vuorovaikutuksen tutkimuksessa ⅱ käyttämällä viisi verkkoetäisyyttä, mukaan lukien lähin, lyhyt, keskus, ydin sekä erottaminen. Osoitimme, että ylä-yrttiparien etäisyys on lyhyempi kuin satunnaisten yrttiparien, mikä viittaa voimakkaaseen vuorovaikutukseen ihmisellä vuorovaikutteisesti. Lisäksi Center-menetelmät ainesosan tasolla ylittivät muut menetelmät. Se vihjeitä meille, että keskeiset ainesosat ovat tärkeässä asemassa yrtteissä. Kolmanneksi tutkimme yrttien tai ainesosien välisiä yhdistyksiä ja niiden tärkeitä biologisia ominaisuuksia tutkimuksessa III, kuten ominaisuudet, meridiaanit, rakenteet tai tavoitteet klustereiden kautta moniparite-verkoston yhteisön analyysistä. Löysimme, että kasviperäiset lääkkeet samoilla klusterien keskuudessa ovat yleensä samankaltaisia ominaisuuksissa, meridiaaneissa. Samoin saman klusterin ainesosat ovat samankaltaisempia rakenteissa ja proteiinin tavoitteessa. Yhteenvetona tämä opinnäytetyö aikoo rakentaa silta TCM-järjestelmän ja nykyaikaisten lääkevalmisteiden välillä laskentatyökaluilla, mukaan lukien Meridian-teorian koneen oppimismalli, TCM-kaavojen verkkomallinnus sekä kasviperäiset lääkkeet ja niiden ainesosat Osoitimme, että uusien laskennallisten lähestymistapojen soveltaminen integroidulle korkean suorituskyvyttömiehille tarjosivat TCM: n näkemyksiä ja nopeuttaisivat romaanin huumeiden löytöä sekä toistuvat TCM: stä

    Mining a Small Medical Data Set by Integrating the Decision Tree and t-test

    Get PDF
    [[abstract]]Although several researchers have used statistical methods to prove that aspiration followed by the injection of 95% ethanol left in situ (retention) is an effective treatment for ovarian endometriomas, very few discuss the different conditions that could generate different recovery rates for the patients. Therefore, this study adopts the statistical method and decision tree techniques together to analyze the postoperative status of ovarian endometriosis patients under different conditions. Since our collected data set is small, containing only 212 records, we use all of these data as the training data. Therefore, instead of using a resultant tree to generate rules directly, we use the value of each node as a cut point to generate all possible rules from the tree first. Then, using t-test, we verify the rules to discover some useful description rules after all possible rules from the tree have been generated. Experimental results show that our approach can find some new interesting knowledge about recurrent ovarian endometriomas under different conditions.[[journaltype]]國外[[incitationindex]]EI[[booktype]]紙本[[countrycodes]]FI

    A whole-genome population structure analysis within cattle breeds

    Get PDF

    Disentangling causal webs in the brain using functional Magnetic Resonance Imaging: A review of current approaches

    Get PDF
    In the past two decades, functional Magnetic Resonance Imaging has been used to relate neuronal network activity to cognitive processing and behaviour. Recently this approach has been augmented by algorithms that allow us to infer causal links between component populations of neuronal networks. Multiple inference procedures have been proposed to approach this research question but so far, each method has limitations when it comes to establishing whole-brain connectivity patterns. In this work, we discuss eight ways to infer causality in fMRI research: Bayesian Nets, Dynamical Causal Modelling, Granger Causality, Likelihood Ratios, LiNGAM, Patel's Tau, Structural Equation Modelling, and Transfer Entropy. We finish with formulating some recommendations for the future directions in this area

    Statistical methods for gene selection and genetic association studies

    Get PDF
    This dissertation includes five Chapters. A brief description of each chapter is organized as follows. In Chapter One, we propose a signed bipartite genotype and phenotype network (GPN) by linking phenotypes and genotypes based on the statistical associations. It provides a new insight to investigate the genetic architecture among multiple correlated phenotypes and explore where phenotypes might be related at a higher level of cellular and organismal organization. We show that multiple phenotypes association studies by considering the proposed network are improved by incorporating the genetic information into the phenotype clustering. In Chapter Two, we first illustrate the proposed GPN to GWAS summary statistics. Then, we assess contributions to constructing a well-defined GPN with a clear representation of genetic associations by comparing the network properties with a random network, including connectivity, centrality, and community structure. The network topology annotations based on the sparse representations of GPN can be used to understand the disease heritability for the highly correlated phenotypes. In applications of phenome-wide association studies, the proposed GPN can identify more significant pairs of genetic variant and phenotype categories. In Chapter Three, a powerful and computationally efficient gene-based association test is proposed, aggregating information from different gene-based association tests and also incorporating expression quantitative trait locus information. We show that the proposed method controls the type I error rates very well and has higher power in the simulation studies and can identify more significant genes in the real data analyses. In Chapter Four, we develop six statistical selection methods based on the penalized regression for inferring target genes of a transcription factor (TF). In this study, the proposed selection methods combine statistics, machine learning , and convex optimization approach, which have great efficacy in identifying the true target genes. The methods will fill the gap of lacking the appropriate methods for predicting target genes of a TF, and are instrumental for validating experimental results yielding from ChIP-seq and DAP-seq, and conversely, selection and annotation of TFs based on their target genes. In Chapter Five, we propose a gene selection approach by capturing gene-level signals in network-based regression into case-control association studies with DNA sequence data or DNA methylation data, inspired by the popular gene-based association tests using a weighted combination of genetic variants to capture the combined effect of individual genetic variants within a gene. We show that the proposed gene selection approach have higher true positive rates than using traditional dimension reduction techniques in the simulation studies and select potentially rheumatoid arthritis related genes that are missed by existing methods
    corecore