58,335 research outputs found
High-Precision Extraction of Emerging Concepts from Scientific Literature
Identification of new concepts in scientific literature can help power
faceted search, scientific trend analysis, knowledge-base construction, and
more, but current methods are lacking. Manual identification cannot keep up
with the torrent of new publications, while the precision of existing automatic
techniques is too low for many applications. We present an unsupervised concept
extraction method for scientific literature that achieves much higher precision
than previous work. Our approach relies on a simple but novel intuition: each
scientific concept is likely to be introduced or popularized by a single paper
that is disproportionately cited by subsequent papers mentioning the concept.
From a corpus of computer science papers on arXiv, we find that our method
achieves a Precision@1000 of 99%, compared to 86% for prior work, and a
substantially better precision-yield trade-off across the top 15,000
extractions. To stimulate research in this area, we release our code and data
(https://github.com/allenai/ForeCite).Comment: Accepted to SIGIR 202
TechMiner: Extracting Technologies from Academic Publications
In recent years we have seen the emergence of a variety of scholarly datasets. Typically these capture ‘standard’ scholarly entities and their connections, such as authors, affiliations, venues, publications, citations, and others. However, as the repositories grow and the technology improves, researchers are adding new entities to these repositories to develop a richer model of the scholarly domain. In this paper, we introduce TechMiner, a new approach, which combines NLP, machine learning and semantic technologies, for mining technologies from research publications and generating an OWL ontology describing their relationships with other research entities. The resulting knowledge base can support a number of tasks, such as: richer semantic search, which can exploit the technology dimension to support better retrieval of publications; richer expert search; monitoring the emergence and impact of new technologies, both within and across scientific fields; studying the scholarly dynamics associated with the emergence of new technologies; and others. TechMiner was evaluated on a manually annotated gold standard and the results indicate that it significantly outperforms alternative NLP approaches and that its semantic features improve performance significantly with respect to both recall and precision
AUGUR: Forecasting the Emergence of New Research Topics
Being able to rapidly recognise new research trends is strategic for many stakeholders, including universities, institutional funding bodies, academic publishers and companies. The literature presents several approaches to identifying the emergence of new research topics, which rely on the assumption that the topic is already exhibiting a certain degree of popularity and consistently referred to by a community of researchers. However, detecting the emergence of a new research area at an embryonic stage, i.e., before the topic has been consistently labelled by a community of researchers and associated with a number of publications, is still an open challenge. We address this issue by introducing Augur, a novel approach to the early detection of research topics. Augur analyses the diachronic relationships between research areas and is able to detect clusters of topics that exhibit dynamics correlated with the emergence of new research topics. Here we also present the Advanced Clique Percolation Method (ACPM), a new community detection algorithm developed specifically for supporting this task. Augur was evaluated on a gold standard of 1,408 debutant topics in the 2000-2011 interval and outperformed four alternative approaches in terms of both precision and recall
Forecasting the Spreading of Technologies in Research Communities
Technologies such as algorithms, applications and formats are an important part of the knowledge produced and reused in the research process. Typically, a technology is expected to originate in the context of a research area and then spread and contribute to several other fields. For example, Semantic Web technologies have been successfully adopted by a variety of fields, e.g., Information Retrieval, Human Computer Interaction, Biology, and many others. Unfortunately, the spreading of technologies across research areas may be a slow and inefficient process, since it is easy for researchers to be unaware of potentially relevant solutions produced by other research communities. In this paper, we hypothesise that it is possible to learn typical technology propagation patterns from historical data and to exploit this knowledge i) to anticipate where a technology may be adopted next and ii) to alert relevant stakeholders about emerging and relevant technologies in other fields. To do so, we propose the Technology-Topic Framework, a novel approach which uses a semantically enhanced technology-topic model to forecast the propagation of technologies to research areas. A formal evaluation of the approach on a set of technologies in the Semantic Web and Artificial Intelligence areas has produced excellent results, confirming the validity of our solution
Unsupervised Extraction of Representative Concepts from Scientific Literature
This paper studies the automated categorization and extraction of scientific
concepts from titles of scientific articles, in order to gain a deeper
understanding of their key contributions and facilitate the construction of a
generic academic knowledgebase. Towards this goal, we propose an unsupervised,
domain-independent, and scalable two-phase algorithm to type and extract key
concept mentions into aspects of interest (e.g., Techniques, Applications,
etc.). In the first phase of our algorithm we propose PhraseType, a
probabilistic generative model which exploits textual features and limited POS
tags to broadly segment text snippets into aspect-typed phrases. We extend this
model to simultaneously learn aspect-specific features and identify academic
domains in multi-domain corpora, since the two tasks mutually enhance each
other. In the second phase, we propose an approach based on adaptor grammars to
extract fine grained concept mentions from the aspect-typed phrases without the
need for any external resources or human effort, in a purely data-driven
manner. We apply our technique to study literature from diverse scientific
domains and show significant gains over state-of-the-art concept extraction
techniques. We also present a qualitative analysis of the results obtained.Comment: Published as a conference paper at CIKM 201
A quantitative taxonomy of human hand grasps
Background: A proper modeling of human grasping and of hand movements is fundamental for robotics,
prosthetics, physiology and rehabilitation. The taxonomies of hand grasps that have been proposed in scientific
literature so far are based on qualitative analyses of the movements and thus they are usually not quantitatively
justified.
Methods: This paper presents to the best of our knowledge the first quantitative taxonomy of hand grasps based on
biomedical data measurements. The taxonomy is based on electromyography and kinematic data recorded from 40
healthy subjects performing 20 unique hand grasps. For each subject, a set of hierarchical trees are computed for
several signal features. Afterwards, the trees are combined, first into modality-specific (i.e. muscular and kinematic)
taxonomies of hand grasps and then into a general quantitative taxonomy of hand movements. The modality-specific
taxonomies provide similar results despite describing different parameters of hand movements, one being muscular
and the other kinematic.
Results: The general taxonomy merges the kinematic and muscular description into a comprehensive hierarchical
structure. The obtained results clarify what has been proposed in the literature so far and they partially confirm the
qualitative parameters used to create previous taxonomies of hand grasps. According to the results, hand movements
can be divided into five movement categories defined based on the overall grasp shape, finger positioning and
muscular activation. Part of the results appears qualitatively in accordance with previous results describing kinematic
hand grasping synergies.
Conclusions: The taxonomy of hand grasps proposed in this paper clarifies with quantitative measurements what
has been proposed in the field on a qualitative basis, thus having a potential impact on several scientific fields
- …