Search CORE

29 research outputs found

A Cross-Lingual Similarity Measure for Detecting Biomedical Term Translations

Author: A Cichocki
C Ding
CD Manning
Danushka Bollegala
E Morin
FJ Och
Georgios Kontonatsios
GH Golub
H Wold
H Wold
K Frantzi
L Breiman
L van der Maaten
ME Tipping
N Okazaki
Neil R. Smalheiser
NT Duc
P Geladi
P Turney
PD Turney
PD Turney
R Rosipal
Sophia Ananiadou
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/06/2015
Field of study

Bilingual dictionaries for technical terms such as biomedical terms are an important resource for machine translation systems as well as for humans who would like to understand a concept described in a foreign language. Often a biomedical term is first proposed in English and later it is manually translated to other languages. Despite the fact that there are large monolingual lexicons of biomedical terms, only a fraction of those term lexicons are translated to other languages. Manually compiling large-scale bilingual dictionaries for technical domains is a challenging task because it is difficult to find a sufficiently large number of bilingual experts. We propose a cross-lingual similarity measure for detecting most similar translation candidates for a biomedical term specified in one language (source) from another language (target). Specifically, a biomedical term in a language is represented using two types of features: (a) intrinsic features that consist of character n-grams extracted from the term under consideration, and (b) extrinsic features that consist of unigrams and bigrams extracted from the contextual windows surrounding the term under consideration. We propose a cross-lingual similarity measure using each of those feature types. First, to reduce the dimensionality of the feature space in each language, we propose prototype vector projection (PVP)—a non-negative lower-dimensional vector projection method. Second, we propose a method to learn a mapping between the feature spaces in the source and target language using partial least squares regression (PLSR). The proposed method requires only a small number of training instances to learn a cross-lingual similarity measure. The proposed PVP method outperforms popular dimensionality reduction methods such as the singular value decomposition (SVD) and non-negative matrix factorization (NMF) in a nearest neighbor prediction task. Moreover, our experimental results covering several language pairs such as English–French, English–Spanish, English–Greek, and English–Japanese show that the proposed method outperforms several other feature projection methods in biomedical term translation prediction tasks

University of Liverpool Repository

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

Edge Hill University Research Information Repository

PubMed Central

The University of Manchester - Institutional Repository

FigShare

Event extraction of bacteria biotopes: a knowledge-intensive NLP-based approach

Author: A Airola
A Culotta
AP Manine
AR Aronson
BJ Grosz
C Jacquemin
C Nédellec
D Bollegala
D Field
D Zelenko
G Erkan
I Segura-Bedmar
JD Kim
JO Korbel
K Fundel
K Liolios
M Torii
N Kambhatla
Pierre Warnier
R Bossy
R Bossy
S Aubin
S Lappin
SA Kripke
SP Lapage
T Hamon
T Ono
Wiktoria Golik
Y Lin
Z GuoDong
Zorana Ratkovic
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

International audienceBackground: Bacteria biotopes cover a wide range of diverse habitats including animal and plant hosts, natural, medical and industrial environments. The high volume of publications in the microbiology domain provides a rich source of up-to-date information on bacteria biotopes. This information, as found in scientific articles, is expressed in natural language and is rarely available in a structured format, such as a database. This information is of great importance for fundamental research and microbiology applications (e.g., medicine, agronomy, food, bioenergy). The automatic extraction of this information from texts will provide a great benefit to the field

Crossref

Springer - Publisher Connector

PubMed Central

HAL Descartes

Hal-Diderot

Health care providers’ attitudes towards transfer and transition in young persons with long term illness- a web-based survey

Author: AB Burström
Carina Sparud-Lundin
CT Cunningham
D Hilderson
D Hilderson
D Wild
DF Polit
Ewa-Lena Bratt
HM Sonneveld
JC Suris
JE McDonagh
L Fegran
MA Attiah
MA McManus
Malin Berghammer
N Bollegala
N Walleghem Van
Philip Moons
R Crowley
WC Cooley
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Taxonomy Construction Using Compound Similarity Measure

Author: A. Hotho
A. Maedche
C. Leacock
D. Bollegala
M. Shamsfard
N. Weber
P. Cimiano
P. Cimiano
P. Resnik
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Crossref

CGSPN : cascading gated self-attention and phrase-attention network for sentence modeling

Author: B Babic
D Bollegala
G Gazdar
G Rao
H Palangi
J Fürnkranz
JR Haddock
N Sharma
P Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Supporting Systematic Reviews using Text Mining

Author: Bollegala D.
Brian Rea
Carroll J.
Hartswood M.
James Thomas
Jirotka M.
Lin C.Y.
Naoaki Okazaki
Okazaki N.
Okazaki N.
Rob Procter
Sasaki Y.
Sophia Ananiadou
Publication venue
Publication date: 01/01/2009
Field of study

In this article, we describe how we are using text mining solutions to enhance the production of systematic reviews. The aims of this collaborative project are the development of a text mining framework to support systematic reviews and the provision of a service exemplar serving as a test bed for deriving requirements for the development of more generally applicable text mining tools and services

CiteSeerX

Crossref

UCL Discovery

Warwick Research Archives Portal Repository

The University of Manchester - Institutional Repository