8,125 research outputs found
A FRAME WORK FOR IDENTIFICATION OF RELATIONSHIP BETWEEN GENE AND DISEASE CAUSING MUTATION USING BIOLOGICAL TEXT MINING
We have gone through various papers describing the mutations in between them and associated disease in a rapid pace. The articles of previous studies show that there is a need to acquire knowledge of gene mutation causing diseases and its association. The need cannot be solved manually, but it has to be automated, so our study is based to develop a framework which gathers information of disease association mutation for knowledge sharing to doctors and researchers. Our work is done using texting mining for extraction of disease causing mutation and its associated NLP from previous abstracts. Our proposed system extracts mutation causing gene using NLP.DMLtool consists of modules of NLP that process text input using semantic and synaptic patterns to gain disease mutation. DML developed gives recall and precision high with F-score 0.87 , 0.89 and 0.91, which were evaluated on 3 various datasets related to associated disease mutations.  In DML we used a special module which extracts mentioned mutation and its gene text associated with it. Various types of datasets have been evaluated on our framework and its performance has been check with performance metric.The obtained results shows better performance compared to the existing on association of disease-mutation and also solves problems of low precision and their approaches.LMA is applied to large data sets of different type of abstracts in Pubmed, it extracts associated disease-mutations and its related information of patients, population of data and its type size. The gained result from our work is stored in a database, which can be acquired by query processing. In our work we conclude that using text mining method, we can increase high throughput, this gives potential to the research and also assist the research in identifying mutation causing disease and it’s associated with
Literature-based discovery of diabetes- and ROS-related targets
Abstract Background Reactive oxygen species (ROS) are known mediators of cellular damage in multiple diseases including diabetic complications. Despite its importance, no comprehensive database is currently available for the genes associated with ROS. Methods We present ROS- and diabetes-related targets (genes/proteins) collected from the biomedical literature through a text mining technology. A web-based literature mining tool, SciMiner, was applied to 1,154 biomedical papers indexed with diabetes and ROS by PubMed to identify relevant targets. Over-represented targets in the ROS-diabetes literature were obtained through comparisons against randomly selected literature. The expression levels of nine genes, selected from the top ranked ROS-diabetes set, were measured in the dorsal root ganglia (DRG) of diabetic and non-diabetic DBA/2J mice in order to evaluate the biological relevance of literature-derived targets in the pathogenesis of diabetic neuropathy. Results SciMiner identified 1,026 ROS- and diabetes-related targets from the 1,154 biomedical papers (http://jdrf.neurology.med.umich.edu/ROSDiabetes/). Fifty-three targets were significantly over-represented in the ROS-diabetes literature compared to randomly selected literature. These over-represented targets included well-known members of the oxidative stress response including catalase, the NADPH oxidase family, and the superoxide dismutase family of proteins. Eight of the nine selected genes exhibited significant differential expression between diabetic and non-diabetic mice. For six genes, the direction of expression change in diabetes paralleled enhanced oxidative stress in the DRG. Conclusions Literature mining compiled ROS-diabetes related targets from the biomedical literature and led us to evaluate the biological relevance of selected targets in the pathogenesis of diabetic neuropathy.http://deepblue.lib.umich.edu/bitstream/2027.42/78315/1/1755-8794-3-49.xmlhttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/2/1755-8794-3-49-S7.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/3/1755-8794-3-49-S10.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/4/1755-8794-3-49-S8.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/5/1755-8794-3-49-S3.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/6/1755-8794-3-49-S1.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/7/1755-8794-3-49-S4.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/8/1755-8794-3-49-S2.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/9/1755-8794-3-49-S12.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/10/1755-8794-3-49-S11.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/11/1755-8794-3-49-S9.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/12/1755-8794-3-49-S5.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/13/1755-8794-3-49-S6.XLShttp://deepblue.lib.umich.edu/bitstream/2027.42/78315/14/1755-8794-3-49.pdfPeer Reviewe
Analysis of the human diseasome reveals phenotype modules across common, genetic, and infectious diseases
Phenotypes are the observable characteristics of an organism arising from its
response to the environment. Phenotypes associated with engineered and natural
genetic variation are widely recorded using phenotype ontologies in model
organisms, as are signs and symptoms of human Mendelian diseases in databases
such as OMIM and Orphanet. Exploiting these resources, several computational
methods have been developed for integration and analysis of phenotype data to
identify the genetic etiology of diseases or suggest plausible interventions. A
similar resource would be highly useful not only for rare and Mendelian
diseases, but also for common, complex and infectious diseases. We apply a
semantic text- mining approach to identify the phenotypes (signs and symptoms)
associated with over 8,000 diseases. We demonstrate that our method generates
phenotypes that correctly identify known disease-associated genes in mice and
humans with high accuracy. Using a phenotypic similarity measure, we generate a
human disease network in which diseases that share signs and symptoms cluster
together, and we use this network to identify phenotypic disease modules
An automated identification and analysis of ontological terms in gastrointestinal diseases and nutrition-related literature provides useful insights
With an unprecedented growth in the biomedical literature, keeping up to date with
the new developments presents an immense challenge. Publications are often studied
in isolation of the established literature, with interpretation being subjective and
often introducing human bias. With ontology-driven annotation of biomedical data
gaining popularity in recent years and online databases offering metatags with rich
textual information, it is now possible to automatically text-mine ontological terms
and complement the laborious task of manual management, interpretation, and
analysis of the accumulated literature with downstream statistical analysis. In this
paper, we have formulated an automated workflow through which we have identified
ontological information, including nutrition-related terms in PubMed abstracts
(from 1991 to 2016) for two main types of Inflammatory Bowel Diseases: Crohn’s
Disease and Ulcerative Colitis; and two other gastrointestinal (GI) diseases, namely,
Coeliac Disease and Irritable Bowel Syndrome. Our analysis reveals unique clustering
patterns as well as spatial and temporal trends inherent to the considered GI diseases
in terms of literature that has been accumulated so far. Although automated
interpretation cannot replace human judgement, the developed workflow shows
promising results and can be a useful tool in systematic literature reviews. The
workflow is available at https://github.com/KociOrges/pytag
- …