207 research outputs found

The textual characteristics of traditional and Open Access scientific journals are similar

Author: A Knebel
A Swan
C Blaschke
D Biber
D Ferrucci
DP Corney
G Eysenbach
K Bretonnel Cohen
K Curran
K Verspoor
Karin Verspoor
KB Cohen
L Tanabe
Lawrence Hunter
M Krallinger
M Palmer
MP Marcus
P Rayson
PK Shah
S Kullback
T Dunning
W Hersh
Publication venue: BioMed Central
Publication date: 01/01/2009

Abstract. Background: Recent years have seen an increased amount of natural language processing (NLP) work on full text biomedical journal publications. Much of this work is done with Open Access journal articles. Such work assumes that Open Access articles are representative of biomedical publications in general and that methods developed for analysis of Open Access full text publications will generalize to the biomedical literature as a whole. If this assumption is wrong, the cost to the community will be large, including not just wasted resources, but also flawed science. This paper examines that assumption. Results: We collected two sets of documents, one consisting only of Open Access publications and the other consisting only of traditional journal publications. We examined them for differences in surface linguistic structures that have obvious consequences for the ease or difficulty of natural language processing and for differences in semantic content as reflected in lexical items. Regarding surface linguistic structures, we examined the incidence of conjunctions, negation, passives, and pronominal anaphora, and found that the two collections did not differ. We also examined the distribution of sentence lengths and found that both collections were characterized by the same mode. Regarding lexical items, we found that the Kullback-Leibler divergence between the two collections was low, and was lower than the divergence between either collection and a reference corpus. Where small differences did exist, log likelihood analysis showed that they were primarily in the area of formatting and in specific named entities. Conclusion: We did not find structural or semantic differences between the Open Access and traditional journal collections.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

Gender equality and girls education: Investigating frameworks, disjunctures and meanings of quality education

Author: Aikman S
Aikman S
Arnot M
ASPBAE and UNGEI
Bandyopadhyay M
Barber L
Budlender D
Chaudhury N
Croft A
Geeves R
Herz B
Hickling-Hudson A
Lewin KM
Lewis M
Marshall H
Mitchell C
Morrell R
Ramachandran V
Rao N
Rao N
Sieder R
Stromquist N
Tomasevski K
UNESCO
UNICEF
UNICEF and UNGEI
Unterhalter E
Unterhalter E
Verspoor A
Wood J
Publication venue: SAGE Publications
Publication date: 08/11/2012

The article draws on qualitative educational research across a diversity of low-income countries to examine the gendered inequalities in education as complex, multi-faceted and situated rather than a series of barriers to be overcome through linear input–output processes focused on isolated dimensions of quality. It argues that frameworks for thinking about educational quality often result in analyses of gender inequalities that are fragmented and incomplete. However, by considering education quality more broadly as a terrain of quality it investigates questions of educational transitions, teacher supply and community participation, and develops understandings of how education is experienced by learners and teachers in their gendered lives and their teaching practices. By taking an approach based on theories of human development the article identifies dynamics of power underpinning gender inequalities in the literature and played out in diverse contexts and influenced by social, cultural and historical contexts. The review and discussion indicate that attaining gender equitable quality education requires recognition and understanding of the ways in which inequalities intersect and interrelate in order to seek out multi-faceted strategies that address not only different dimensions of girls’ and women’s lives, but understand gendered relationships and structurally entrenched inequalities between women and men, girls and boys.

Crossref

University of East Anglia digital repository

Motivation for or from bilingual education? A comparative study of learner views in the Netherlands

Author: Azarnoosh M.
Baetens Beardsmore H.
Banegas D. L.
Boone H. N.
Cohen J.
Coleman L.
Coyle D.
de Graaff R.
Deci E. L.
Dörnyei Z.
Friedman H. H.
Gajo L.
Gardner R. C.
Koster A.
Lasagabaster D.
Maljers A.
Markus H.
Mearns T.
Nuffield
Rumlich D.
Somers T.
Sylvén L. K.
Ting Y. L. T.
Ushioda E.
Verspoor M.
Verspoor M.
Weenink D.
Publication venue: Informa UK Limited
Publication date: 01/01/2017
Field of study

Teaching and Teacher Learning (ICLON

Crossref

Edinburgh Research Explorer

Leiden University Scholarly Publications

The structural and content aspects of abstracts versus bodies of full text journal articles are different

Author: Alias-i
B Settles
BM Szmrecsányi
C Blaschke
C Friedman
C Gasperin
C Gasperin
Christophe Roeder
D Jurafsky
D Klein
DP Corney
G Leroy
Helen L Johnson
I Goldin
J Lin
JG Caporaso
K Bretonnel Cohen
K Verspoor
Karin Verspoor
L Hirschman
L Tanabe
Lawrence E Hunter
M Krallinger
N Elhadad
PG Mutalik
PI Nakov
R Leaman
S Abney
S Agarwal
T McIntosh
W Chapman
W Chapman
W Hersh
WA Baumgartner Jr
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract. Background: An increase in work on the full text of journal articles and the growth of PubMedCentral have the opportunity to create a major paradigm shift in how biomedical text mining is done. However, until now there has been no comprehensive characterization of how the bodies of full text journal articles differ from the abstracts that until now have been the subject of most biomedical text mining research. Results: We examined the structural and linguistic aspects of abstracts and bodies of full text articles, the performance of text mining tools on both, and the distribution of a variety of semantic classes of named entities between them. We found marked structural differences, with longer sentences in the article bodies and much heavier use of parenthesized material in the bodies than in the abstracts. We found content differences with respect to linguistic features. Three out of four of the linguistic features that we examined were statistically significantly differently distributed between the two genres. We also found content differences with respect to the distribution of semantic features. There were significantly different densities per thousand words for three out of four semantic classes, and clear differences in the extent to which they appeared in the two genres. With respect to the performance of text mining tools, we found that a mutation finder performed equally well in both genres, but that a wide variety of gene mention systems performed much worse on article bodies than they did on abstracts. POS tagging was also more accurate in abstracts than in article bodies. Conclusions: Aspects of structure and content differ markedly between article abstracts and article bodies. A number of these differences may pose problems as the text mining field moves more into the area of processing full-text articles. However, these differences also present a number of opportunities for the extraction of data types, particularly that found in parenthesized text, that is present in article bodies but not in article abstracts.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Patterns of natural selection acting on the mitochondrial genome of a locally adapted fish species

Author: A Vasemägi
AA Makhrov
AC Dalziel
AD Foote
B Murrell
B Murrell
BH Letcher
BJ Crespi
C Garcia de Leaniz
C Garcia de Leaniz
C Moritz
Carlos Garcia de Leaniz
CD Meiklejohn
D Mishmar
DA McClellan
DC Wallace
DK Dowling
DL Parrish
DM Rand
E Bazin
E Kazancıoğlu
E Verspoor
E Verspoor
E Verspoor
E Verspoor
Elgan John
Eric Verspoor
G Bucci
G Guillot
G Guillot
HAL Tuppen
HK Anandatheerthavarada
IH Tomasco
J Carroll
J Das
J Felsenstein
J Gilbey
J Luterbacher
J Nilsson
JB Stewart
JC Avise
JWO Ballard
K Tamura
K Yang
KD Friedland
KH Brown
KL Ciborowski
L Yu
M Albu
M Gershoni
M Murai
M Yasuike
MJ Domanico
MR Garvin
N Aubin-Horth
N Osheroff
O Fridjonsson
O Rossignol
P Cardol
PU Blier
PU Blier
R Frankham
R Moreno-Sánchez
RG Efremov
RG Efremov
RJ Simes
RR Fonseca da
S Consuegra
S Consuegra
S DiMauro
S Einum
S Einum
S Einum
S Woolley
S Yokoyama
SL Kosakovsky Pond
Sofia Consuegra
T Birt
T Regnier
T Régnier
TL King
TL King
U Brandt
V Bourret
W Delport
WD Wilson
Y Bai
ZH Yang
Ø Skaala
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2015

Crossref

Springer - Publisher Connector

Cronfa at Swansea University

Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters

Author: A Aronson
A Doms
A Jimeno
A Koike
A Sokolov
AT McCray
B Settles
Benjamin Garcia
C Brewster
C Jonquet
C Roeder
C Verspoor
Christophe Roeder
Christopher Funk
D Ferrucci
D Hancock
D Rebholz-Schuhmann
DA Natale
DL Wheeler
DS DeLuca
FM Couto
H Liu
H Yu
HM Muller
IBM
J Bard
JC Denny
JC Denny
JG Caporaso
K Bretonnel Cohen
K Degtyarenko
K Eilbeck
K Verspoor
K Verspoor
K Verspoor
K Verspoor
Karin Verspoor
KB Cohen
L Hunter
L Reeve
L Yao
Lawrence E Hunter
M Bada
M Bada
M Krallinger
M Tanenblatt
Michael Bada
MJ Schuemie
N Kang
N Shah
Ontology Consortium The Gene
P Khatri
PV Ogren
Q Zou
R Leaman
S Ray
S Van Landeghem
SA Stewart
T Rocktaschel
William Baumgartner
WW Chu
Publication venue: Springer Science and Business Media LLC

Crossref

Entity recognition in the biomedical domain using a hybrid approach

Author: A Tharatipyakul
C Funk
CD Paice
CS Funk
D Campos
D Koning
D Maglott
D Szklarczyk
DM Jessop
E Pafilis
E Tseytlin
F Rinaldi
F Rinaldi
F Rinaldi
F Rinaldi
G Sheikhshab
K Degtyarenko
K Eilbeck
K Verspoor
K Verspoor
M Ashburner
M Bada
M Basaldella
M Basaldella
MF Porter
N Pudota
P Lopez
PD Turney
R Core Team
R Leaman
R Leaman
S Aubin
S Eltyeb
S Tulkens
SA Akhondi
T Groza
T Munkhdalai
U Leser
Y Sasaki
Publication venue: Springer Science and Business Media LLC

Crossref

The Dagstuhl Perspectives Workshop on Performance Modeling and Prediction

Author: Castells Pablo
Daly Elizabeth M.
Declerck Thierry
Ekstrand Michael D.
Ferro Nicola
Fuhr Norbert
Geyer Werner
Gonzalo Julio
Grefenstette Gregory
Konstan Joseph A.
Kuflik Tsvi
Lindén Krister
Magnini Bernardo
Nie Jian-Yun
Perego Raffaele
Shapira Bracha
Soboroff Ian
Tintarev Nava
Verspoor Karin
Willemsen Martijn C.
Zobel Justin
Publication date: 01/01/2018

Non peer reviewed

Repository TU/e

Pure OAI Repository

Helsingin yliopiston digitaalinen arkisto

Archivio istituzionale della ricerca - Università di Padova

From Evaluating to Forecasting Performance: How to Turn Information Retrieval, Natural Language Processing and Recommender Systems into Predictive Sciences

Author: Castells Pablo
Daly Elizabeth M.
Declerck Thierry
Ekstrand Michael D.
Ferro Nicola
Fuhr Norbert
Geyer Werner
Gonzalo Julio
Grefenstette Gregory
Konstan Joseph A.
Kuflik Tsvi
Lindén Krister
Magnini Bernardo
Nie Jian-Yun
Perego Raffaele
Shapira Bracha
Soboroff Ian
Tintarev Nava
Verspoor Karin
Willemsen Martijn C.
Zobel Justin
Publication date: 01/01/2018

Non peer reviewed

Dagstuhl Research Online Publication Server

Helsingin yliopiston digitaalinen arkisto

Archivio istituzionale della ricerca - Università di Padova

Benchmarking Ontologies: Bigger or Better?

Author: A Faatz
A Gangemi
A Gomez-Perez
A Gómez-Pérez
A Mädche
A Mädche
A Rzhetsky
A Spooner
Andrey Rzhetsky
Anna Divoli
AR Aronson
AR Aronson
AR Aronson
AT McCray
AT McCray
AT McCray
AT McCray
AT McCray
B Smith
BA Kipfer
C Brewster
C Brewster
C Brewster
C Laird
C Rosse
CE Lipscomb
CJ Bult
CL Smith
D Lin
D Maynard
DL Cook
E Riloff
FB Rogers
G Jurasinski
G Miller
I Scholastic
I Sim
Ilya Mayzus
J Brank
J Devlin
J Evermann
J Yu
JA Blake
James A. Evans
JC Park
JI Rodale
JR Firth
JS Justeson
K Dellschaft
K Toutanova
K Toutanova
K Verspoor
K Verspoor
K. Bretonnel Cohen
KB Cohen
Lixia Yao
LM Spencer
M Ashburner
M Grüninger
M Minsky
M Missikoff
M Sabou
N Guarino
O Bodenreider
P Buitelaar
P Cimiano
PD Karp
R Cornet
R Navigli
S Hyun
S Kiritchenko
S Schulz
S York
S Zhang
SH Brown
TR Gruber
U Hahn
V Walden
W Ceusters
Y Sure
Z Harris
Publication venue: Public Library of Science
Publication date: 01/01/2011

A scientific ontology is a formal representation of knowledge within a domain, typically including central concepts, their properties, and relations. With the rise of computers and high-throughput data collection, ontologies have become essential to data mining and sharing across communities in the biomedical sciences. Powerful approaches exist for testing the internal consistency of an ontology, but not for assessing the fidelity of its domain representation. We introduce a family of metrics that describe the breadth and depth with which an ontology represents its knowledge domain. We then test these metrics using (1) four of the most common medical ontologies with respect to a corpus of medical documents and (2) seven of the most popular English thesauri with respect to three corpora that sample language from medicine, news, and novels. Here we show that our approach captures the quality of ontological representation and guides efforts to narrow the breach between ontology and collective discourse within a domain. Our results also demonstrate key features of medical ontologies, English thesauri, and discourse from different domains. Medical ontologies have a small intersection, as do English thesauri. Moreover, dialects characteristic of distinct domains vary strikingly as many of the same words are used quite differently in medicine, news, and novels. As ontologies are intended to mirror the state of knowledge, our methods to tighten the fit between ontology and domain will increase their relevance for new areas of biomedical science and improve the accuracy and power of inferences computed across them.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central