4 research outputs found

Collocation analysis for UMLS knowledge-based word sense disambiguation

Author: A Aronson
A Jimeno-Yepes
A Purandare
Alan R Aronson
Antonio Jimeno-Yepes
B McInnes
B Rosario
Bridget T McInnes
C Leacock
C Manning
D Yarowsky
H Schütze
M Schuemie
M Stevenson
M Weeber
O Bodenreider
PR Cohen
S Humphrey
S Patwardhan
Publication venue: BioMed Central
Publication date: 01/01/2011

BACKGROUND: The effectiveness of knowledge-based word sense disambiguation (WSD) approaches depends in part on the information available in the reference knowledge resource. Off the shelf, these resources are not optimized for WSD and might lack terms needed to model the context properly. In addition, they might include noisy terms that contribute to false positives in the disambiguation results.
METHODS: We analyzed several collocation types that could improve the performance of knowledge-based disambiguation methods. Collocations are obtained by extracting candidate collocations from MEDLINE and then assigning them to one of the senses of an ambiguous word. We performed this assignment using either semantic group profiles or a knowledge-based disambiguation method. In addition to collocations, we used second-order features from a previously implemented approach. Specifically, we measured the effect of these collocations in two knowledge-based WSD methods. The first method, AEC, uses the knowledge from the UMLS to collect examples from MEDLINE, which are used to train a Naïve Bayes approach. The second method, MRD, builds a profile for each candidate sense based on the UMLS and compares the profile to the context of the ambiguous word. We used two WSD test sets containing disambiguation cases that are mapped to UMLS concepts. The first one, the NLM WSD set, was developed manually by several domain experts and contains words that occur with high frequency in MEDLINE. The second one, the MSH WSD set, was developed automatically using the MeSH indexing in MEDLINE. It contains a larger set of words and covers a larger number of UMLS semantic types.
RESULTS: The results indicate an improvement after the use of collocations, although the approaches perform differently depending on the data set. In the NLM WSD set, the improvement is larger for the MRD disambiguation method using second-order features. Assignment of collocations to a candidate sense based on UMLS semantic group profiles is more effective in the AEC method. In the MSH WSD set, the increment in performance is modest for all the methods. Collocations combined with the MRD disambiguation method have the best performance. The MRD disambiguation method and second-order features provide an insignificant change in performance. The AEC disambiguation method gives a modest improvement in performance. Assignment of collocations to a candidate sense based on knowledge-based methods has better performance.
CONCLUSIONS: Collocations improve the performance of knowledge-based disambiguation methods, although results vary depending on the test set and method used. Generally, the AEC method is sensitive to query drift. Using AEC, just a few selected terms provide a large improvement in disambiguation performance. The MRD method handles noisy terms better but requires a larger set of terms to improve performance.
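The profile-comparison idea behind the MRD method described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the bag-of-words representation, the cosine similarity measure, the function names, and the toy sense profiles below are all assumptions made for illustration, and the profiles are not drawn from the UMLS.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Lower-case the text, split on whitespace, and count the tokens."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def disambiguate(context, sense_profiles):
    """Return the sense whose profile is most similar to the context."""
    ctx = bag_of_words(context)
    return max(
        sense_profiles,
        key=lambda sense: cosine(ctx, bag_of_words(sense_profiles[sense])),
    )


# Hypothetical profiles for the ambiguous word "cold" (illustrative only).
profiles = {
    "common_cold": "viral infection nose throat cough fever",
    "cold_temperature": "low temperature weather exposure freezing",
}
chosen = disambiguate("patient presented with cough and fever after a cold", profiles)
print(chosen)
```

In this sketch, adding collocations to a sense would amount to appending extra terms to its profile string, which is one way to see why a profile-based method could need a larger set of terms before its choices change.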

Crossref

Springer - Publisher Connector

PubMed Central

University of Melbourne Institutional Repository

Global diversity and distribution of macrofungi

Author: Buyck B
Cifuentes Blanco Joaquín
Desjardin DE
Halling RE
Hjortstam K
Iturriaga T
Larsson KH
Leacock PR
Lodge DJ
May TW
Minter D
Mueller GM
Rajchenberg M
Redhead SA
Ryvarden L
Schmit JP
Trappe JM
Watling R
Wu QW
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2007

Data on macrofungal diversity and distribution patterns were compiled for major geographical regions of the world. Macrofungi are defined here to include ascomycetes and basidiomycetes with large, easily observed spore-bearing structures that form above or below ground. Each coauthor provided either data on a particular taxonomic group of macrofungi or information on the macrofungi of a specific geographic area. We then employed a meta-analysis to investigate species overlaps between areas, levels of endemism, centers of diversity, and the estimated percentage of species known for each taxonomic group, for each geographic area, and for the combined macrofungal data set. Thus, the study provides both a meta-analysis of current data and a gap assessment to help identify research needs. In all, 21,679 names of macrofungi were compiled. The percentage of unique names for each region ranged from 37% for temperate Asia to 72% for Australasia. Approximately 35,000 macrofungal species were estimated to be "unknown" by the contributing authors. This would give an estimated total of 56,679 macrofungi. Our compiled species list does not include data from most of S.E. Europe, Africa, western Asia, or tropical eastern Asia. Even so, combining our list of names with the estimates from contributing authors is in line with our calculated estimate of between 53,000 and 110,000 macrofungal species derived using plant/macrofungal species ratio data. The estimates developed in this study are consistent with a hypothesis of high overall fungal species diversity.

Red Mexicana de Repositorios Institucionales

On differences and deficits: A critique of the theoretical and methodological underpinnings of the word gap

Author: Bomer R
Cazden CB
Cooc N
Deutsch M
Dudley-Marling C
Dyson AH
Escobar K
Gee JP
Geertz C
Hart B
Heath SB
Hewlett BS
Highmore B
Hoff E
Huttenlocher PR
Hymes D
Kozol J
Labov W
Lareau U
Leacock EB
Lee VE
Lewis O
Miller P
Miller PJ
Nunberg G
Ochs E
Payne RK
Phillips DA
Raver C
Rogoff B
Rosenberg T
Silverstein M
Tamis-LeMonda CS
Thelen E
Tomasello M
Vernon-Feagans L
Yana Kuchirko
Publication venue: SAGE Publications

Crossref

Zebrafish as a Model for Human Osteosarcoma

Author: AA Sandberg
AB Mohseny
AE Rosenberg
AJ Chou
AM Cleton-Jansen
BR Bill
C Hall
C Khanna
CE de Andrea
CJ Bayne
CM Press
CMCL Press
CR Walkley
D Hanahan
DM Langenau
E Mayordomo
EP Buddingh
ES Bromage
FE Moore
FJ Roca
G Bulut
G Gohring
G Kari
G Merlino
H Feitsma
HH Truong
HM Stern
HP Spaink
IJ Lewis
IJ Marques
J Etchin
J Tolar
J Zou
JA Bridge
JF Amatruda
JK Anninga
JM Parant
JV Forment
JW Holland
K Peng
K Stoletov
KJ Clark
L Cade
L Du Pasquier
LM Lee
M Ganeshkumar
M Lehner
M Mione
M van der Vaart
MC Carroll
MI Wiweger
MJ Manning
ML Kuijjer
MS Lee
N Hagner
P Herbomel
P Huang
P Zwollo
PJ Stephens
PR Rauta
R Smolowitz
RC Deo
RM White
S Berghmans
S Bird
S Brattgjerd
S Chen
S He
SA Renshaw
SA Savage
SH Lam
SL Kaattari
SM Onnebo
SS Bielack
SW Leacock
T Davoli
T Wang
TJ Bowden
TJ Dahlem
TY Chang
U Fischer
VM Bedell
W Goessling
YO Ahn
Publication venue: Springer Science and Business Media LLC

Crossref