3 research outputs found

Biomedical Information Extraction Pipelines for Public Health in the Age of Deep Learning

Publication date: 01/01/2019

Abstract: Unstructured texts containing biomedical information from sources such as electronic health records, scientific literature, discussion forums, and social media offer an opportunity to extract information for a wide range of applications in biomedical informatics. Building scalable and efficient pipelines for natural language processing and extraction of biomedical information plays an important role in the implementation and adoption of applications in areas such as public health. Advancements in machine learning and deep learning techniques have enabled rapid development of such pipelines. This dissertation presents entity extraction pipelines for two public health applications: virus phylogeography and pharmacovigilance. For virus phylogeography, geographical locations are extracted from biomedical scientific texts for metadata enrichment in the GenBank database containing 2.9 million virus nucleotide sequences. For pharmacovigilance, tools are developed to extract adverse drug reactions from social media posts to open avenues for post-market drug surveillance from non-traditional sources. Across these pipelines, high variance is observed in extraction performance among the entities of interest while using state-of-the-art neural network architectures. To explain the variation, linguistic measures are proposed to serve as indicators for entity extraction performance and to provide deeper insight into the domain complexity and the challenges associated with entity extraction. For both the phylogeography and pharmacovigilance pipelines presented in this work, the annotated datasets and applications are open source and freely available to the public to foster further research in public health.

Dissertation/Thesis; Doctoral Dissertation; Biomedical Informatics 201

ASU Digital Repository

Extreme Learning Machine for Multi-Label Classification

Author: Changmeng Jiang
Feijuan He
Jingting Xu
Jun Feng
Su-Shing Chen
Xia Sun
Publication venue: 'MDPI AG'
Publication date: 01/06/2016

Extreme learning machine (ELM) techniques have received considerable attention in the computational intelligence and machine learning communities because of the significantly low computational time required for training new classifiers. ELM provides solutions for regression, clustering, binary classification, multiclass classification and so on, but not for multi-label learning. Multi-label learning deals with objects that have multiple labels simultaneously, which are common in real-world applications. Therefore, a thresholding method-based ELM is proposed in this paper to adapt ELM to multi-label classification, called extreme learning machine for multi-label classification (ELM-ML). ELM-ML outperforms other multi-label classification methods on several standard data sets in most cases, especially for applications which only have a small labeled data set.
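The general idea of pairing an ELM with a score threshold can be sketched as follows. This is a minimal illustration, not the ELM-ML method from the paper: it assumes a sigmoid hidden layer with random, untrained weights, output weights fitted by a pseudo-inverse least-squares solve, and a single fixed threshold of 0 applied to every label score. The class name `ThresholdELM`, its parameters, and the toy data are all hypothetical.

```python
import numpy as np


class ThresholdELM:
    """Illustrative single-hidden-layer ELM with a fixed score threshold.

    Assumption: one global threshold for all labels; the paper's own
    thresholding scheme may differ.
    """

    def __init__(self, n_hidden=20, threshold=0.0, seed=0):
        self.n_hidden = n_hidden
        self.threshold = threshold
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activations of the random hidden layer (never trained).
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, Y):
        X = np.asarray(X, dtype=float)
        # Map binary label indicators {0, 1} to regression targets {-1, +1}.
        T = 2.0 * np.asarray(Y, dtype=float) - 1.0
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        # Output weights: least-squares solution via the pseudo-inverse.
        self.beta = np.linalg.pinv(H) @ T
        return self

    def predict(self, X):
        scores = self._hidden(np.asarray(X, dtype=float)) @ self.beta
        # A label is assigned whenever its score exceeds the threshold.
        return (scores > self.threshold).astype(int)


# Toy data (hypothetical): label j is active when feature j equals 1.
X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])
Y = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])

model = ThresholdELM(n_hidden=20, seed=0).fit(X, Y)
pred = model.predict(X)
print(pred)
```

Because only the output weights are fitted, training reduces to one linear solve, which is the source of the low training cost the abstract mentions. Each row of `pred` can contain zero, one, or several active labels, which is what distinguishes multi-label from multiclass output.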

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

Applying Regularization Least Squares Canonical Correlation Analysis in Extreme Learning Machine for Multi-label Classification Problems

Author: A.J. Clare
C.D. Brown
G. Madjarov
G. Tsoumakas
G.B. Huang
H. Hotelling
J. Fürnkranz
K.H. Zou
L. Sun
M.L. Zhang
N. Liu
Q.Y. Zhu
R. Tibshirani
T. Fawcett
T.F. Wu
Y. Freund
Y. Lan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015

Crossref