1,077 research outputs found
NetiNeti : Discovery of Scientific Names from Text Using Machine Learning Methods Figure 1
Figure 1 demonstrates a series of training experiments with the Naïve Bayes classifier using different neighborhoods for contextual features, different sizes of positive and
negative training examples and evaluated the resulting classifiers with our annotated
gold standard corpus.
The data sets are the results of running NetiNeti on subset of 136 PubMedCentral tagged open access articles and with no stop list.A scientific name for an organism can be associated with almost all biological data.
Name identification is an important step in many text mining tasks aiming to extract
useful information from biological, biomedical and biodiversity text sources. A
scientific name acts as an important metadata element to link biological information.We present NetiNeti, a machine learning based approach for identification and
discovery of scientific names. The system implementing the approach can be accessed
at http://namefinding.ubio.org we present the comparison results of various machine
learning algorithms on our annotated corpus. Naïve Bayes and Maximum Entropy
with Generalized Iterative Scaling (GIS) parameter estimation are the top two
performing algorithms
Chemical named entities recognition: a review on approaches and applications
The rapid increase in the flow rate of published digital information in all disciplines has resulted in a pressing need for techniques that can simplify the use of this information. The chemistry literature is very rich with information about chemical entities. Extracting molecules and their related properties and activities from the scientific literature to "text mine" these extracted data and determine contextual relationships helps research scientists, particularly those in drug development. One of the most important challenges in chemical text mining is the recognition of chemical entities mentioned in the texts. In this review, the authors briefly introduce the fundamental concepts of chemical literature mining, the textual contents of chemical documents, and the methods of naming chemicals in documents. We sketch out dictionary-based, rule-based and machine learning, as well as hybrid chemical named entity recognition approaches with their applied solutions. We end with an outlook on the pros and cons of these approaches and the types of chemical entities extracte
- …