55,333 research outputs found
Using Neural Networks for Relation Extraction from Biomedical Literature
Using different sources of information to support automated extracting of
relations between biomedical concepts contributes to the development of our
understanding of biological systems. The primary comprehensive source of these
relations is biomedical literature. Several relation extraction approaches have
been proposed to identify relations between concepts in biomedical literature,
namely, using neural networks algorithms. The use of multichannel architectures
composed of multiple data representations, as in deep neural networks, is
leading to state-of-the-art results. The right combination of data
representations can eventually lead us to even higher evaluation scores in
relation extraction tasks. Thus, biomedical ontologies play a fundamental role
by providing semantic and ancestry information about an entity. The
incorporation of biomedical ontologies has already been proved to enhance
previous state-of-the-art results.Comment: Artificial Neural Networks book (Springer) - Chapter 1
WormBase 2012: more genomes, more data, new website
Since its release in 2000, WormBase (http://www.wormbase.org) has grown from a small resource focusing on a single species and serving a dedicated research community, to one now spanning 15 species essential to the broader biomedical and agricultural research fields. To enhance the rate of curation, we have automated the identification of key data in the scientific literature and use similar methodology for data extraction. To ease access to the data, we are collaborating with journals to link entities in research publications to their report pages at WormBase. To facilitate discovery, we have added new views of the data, integrated large-scale datasets and expanded descriptions of models for human disease. Finally, we have introduced a dramatic overhaul of the WormBase website for public beta testing. Designed to balance complexity and usability, the new site is species-agnostic, highly customizable, and interactive. Casual users and developers alike will be able to leverage the public RESTful application programming interface (API) to generate custom data mining solutions and extensions to the site. We report on the growth of our database and on our work in keeping pace with the growing demand for data, efforts to anticipate the requirements of users and new collaborations with the larger science community
Recommended from our members
A Network of SLC and ABC Transporter and DME Genes Involved in Remote Sensing and Signaling in the Gut-Liver-Kidney Axis.
Genes central to drug absorption, distribution, metabolism and elimination (ADME) also regulate numerous endogenous molecules. The Remote Sensing and Signaling Hypothesis argues that an ADME gene-centered network-including SLC and ABC "drug" transporters, "drug" metabolizing enzymes (DMEs), and regulatory genes-is essential for inter-organ communication via metabolites, signaling molecules, antioxidants, gut microbiome products, uremic solutes, and uremic toxins. By cross-tissue co-expression network analysis, the gut, liver, and kidney (GLK) formed highly connected tissue-specific clusters of SLC transporters, ABC transporters, and DMEs. SLC22, SLC25 and SLC35 families were network hubs, having more inter-organ and intra-organ connections than other families. Analysis of the GLK network revealed key physiological pathways (e.g., involving bile acids and uric acid). A search for additional genes interacting with the network identified HNF4Ī±, HNF1Ī±, and PXR. Knockout gene expression data confirmed ~60-70% of predictions of ADME gene regulation by these transcription factors. Using the GLK network and known ADME genes, we built a tentative gut-liver-kidney "remote sensing and signaling network" consisting of SLC and ABC transporters, as well as DMEs and regulatory proteins. Together with protein-protein interactions to prioritize likely functional connections, this network suggests how multi-specificity combines with oligo-specificity and mono-specificity to regulateĀ homeostasis of numerous endogenous small molecules
A heuristic optimization method for mitigating the impact of a virus attack
Taking precautions before or during the start of a virus outbreak can heavily
reduce the number of infected. The question which individuals should be
immunized in order to mitigate the impact of the virus on the rest of
population has received quite some attention in the literature. The dynamics of
the of a virus spread through a population is often represented as information
spread over a complex network. The strategies commonly proposed to determine
which nodes are to be selected for immunization often involve only one
centrality measure at a time, while often the topology of the network seems to
suggest that a single metric is insufficient to capture the influence of a node
entirely.
In this work we present a generic method based on a genetic algorithm (GA)
which does not rely explicitly on any centrality measures during its search but
only exploits this type of information to narrow the search space. The fitness
of an individual is defined as the estimated expected number of infections of a
virus following SIR dynamics. The proposed method is evaluated on two contact
networks: the Goodreau's Faux Mesa high school and the US air transportation
network. The GA method manages to outperform the most common strategies based
on a single metric for the air transportation network and its performance is
comparable with the best performing strategy for the high school network.Comment: To appear in the proceedings of the International Conference on
Computational Science (ICCS) in Barcelona. 11 pages, 5 figure
Patient-specific data fusion for cancer stratification and personalised treatment
According to Cancer Research UK, cancer is a leading cause of death accounting for more than one in four of all deaths in 2011. The recent advances in experimental technologies in cancer research have resulted in the accumulation of large amounts of patient-specific datasets, which provide complementary information on the same cancer type. We introduce a versatile data fusion (integration) framework that can effectively integrate somatic mutation data, molecular interactions and drug chemical data to address three key challenges in cancer research: stratification of patients into groups having different clinical outcomes, prediction of driver genes whose mutations trigger the onset and development of cancers, and repurposing of drugs treating particular cancer patient groups. Our new framework is based on graph-regularised non-negative matrix tri-factorization, a machine learning technique for co-clustering heterogeneous datasets. We apply our framework on ovarian cancer data to simultaneously cluster patients, genes and drugs by utilising all datasets.We demonstrate superior performance of our method over the state-of-the-art method, Network-based Stratification, in identifying three patient subgroups that have significant differences in survival outcomes and that are in good agreement with other clinical data. Also, we identify potential new driver genes that we obtain by analysing the gene clusters enriched in known drivers of ovarian cancer progression. We validated the top scoring genes identified as new drivers through database search and biomedical literature curation. Finally, we identify potential candidate drugs for repurposing that could be used in treatment of the identified patient subgroups by targeting their mutated gene products. We validated a large percentage of our drug-target predictions by using other databases and through literature curation
- ā¦