Multilingual log analysis: LogCLEF

Author: Leveling Johannes
Mandl Thomas
Nunzio Giorgio Maria Di
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011

The current lack of recent and long-term query logs severely limits the verifiability and repeatability of log analysis experiments. A first attempt to address this was made within the Cross-Language Evaluation Forum in 2009, in a track named LogCLEF, which aims to stimulate research on user behaviour in multilingual environments and to promote standard evaluation collections of log data. We report on similarities and differences of the most recent activities for LogCLEF.

Crossref

Irish Universities

DCU Online Research Access Service

Archivio istituzionale della ricerca - Università di Padova

Building ontologies from folksonomies and linked data: Data structures and Algorithms

Author: Corcho Oscar
García-Castro Alexander
García-Castro Leyla Jael
García-Silva A.
Gómez-Pérez A.
Publication venue: Facultad de Informática (UPM)
Publication date: 01/05/2012

We present the data structures and algorithms used in our approach for building domain ontologies from folksonomies and linked data. In this approach we extract domain terms from folksonomies and enrich them with semantic information from the Linked Open Data cloud. As a result, we obtain a domain ontology that combines the emergent knowledge of social tagging systems with formal knowledge from ontologies.

Archivo Digital UPM

Natural Language Query in the Biochemistry and Molecular Biology Domains Based on Cognition Search™

Author: Elizabeth J. Goldsmith
Kathleen Dahlgren
Radha Akella
Saurabh Mendiratta
Publication date: 19/09/2008

Motivation: With the tremendous growth in scientific literature, it is necessary to improve upon the standard pattern-matching style of the available search engines. Semantic NLP may be the solution to this problem. Cognition Search (CSIR) is a natural language technology. It is best used by asking a simple question that might be answered in the textual data being queried, such as MEDLINE. CSIR has a large English dictionary and semantic database. Cognition’s semantic map enables the search process to be based on meaning rather than statistical word pattern matching and therefore returns more complete and relevant results. The Cognition Search engine uses downward reasoning and synonymy, which also improve recall. It improves precision through phrase parsing and word sense disambiguation.
Result: Here we have carried out several projects to "teach" the CSIR lexicon medical, biochemical and molecular biological language and acronyms from curated, free web-based sources. Vocabulary from the Alliance for Cell Signaling (AfCS), the HUGO Gene Nomenclature Committee (HGNC), the Unified Medical Language System (UMLS) Metathesaurus, and the International Union of Pure and Applied Chemistry (IUPAC) was introduced into the CSIR dictionary and curated. The resulting system was used to interpret MEDLINE abstracts. Meaning-based search of MEDLINE abstracts yields high precision (estimated at >90%) and high recall (estimated at >90%) where synonym information has been encoded. The present implementation can be found at http://MEDLINE.cognition.com.
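
As a rough illustration of why synonymy and downward reasoning can raise recall, the sketch below expands a query term with its synonyms and all narrower terms before matching, then computes precision and recall. It is not the CSIR implementation: the lexicon, the documents, the relevance judgements, and the function names (`expand`, `search`, `precision_recall`) are all hypothetical, and word sense disambiguation and phrase parsing are not modelled.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy lexicon for illustration only; not CSIR's dictionary.
SYNONYMS: Dict[str, Set[str]] = {"kinase": {"phosphotransferase"}}
HYPONYMS: Dict[str, Set[str]] = {"kinase": {"mapk", "pka"}, "mapk": {"erk2"}}


def expand(term: str) -> Set[str]:
    """Return the term, its synonyms, and all narrower terms (transitively)."""
    expanded = {term} | SYNONYMS.get(term, set())
    stack = [term]
    while stack:
        current = stack.pop()
        for narrower in HYPONYMS.get(current, set()):
            if narrower not in expanded:
                expanded.add(narrower)
                stack.append(narrower)
    return expanded


def search(term: str, docs: List[str], use_expansion: bool) -> List[int]:
    """Return indices of documents containing any of the query terms."""
    terms = expand(term) if use_expansion else {term}
    return [i for i, doc in enumerate(docs) if terms & set(doc.lower().split())]


def precision_recall(retrieved: List[int], relevant: List[int]) -> Tuple[float, float]:
    """Compute precision and recall of a retrieved set against relevant ids."""
    retrieved_set, relevant_set = set(retrieved), set(relevant)
    hits = len(retrieved_set & relevant_set)
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall


# Made-up documents and relevance judgements (assumptions, not MEDLINE data).
DOCS = [
    "the kinase binds atp",
    "erk2 activation requires dual phosphorylation",
    "a phosphotransferase moves phosphate groups",
    "ribosome assembly in yeast",
]
RELEVANT = [0, 1, 2]

plain = search("kinase", DOCS, use_expansion=False)
expanded = search("kinase", DOCS, use_expansion=True)
print(precision_recall(plain, RELEVANT))
print(precision_recall(expanded, RELEVANT))
```

In this toy setting the literal match finds only the document that contains the word "kinase", while the expanded query also retrieves the documents that mention a synonym or a narrower term.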

PubMed Central

Nature Precedings