Search CORE

11 research outputs found

Objective and automated protocols for the evaluation of biomedical search engines using No Title Evaluation protocols

Author: AM Cohen
D Demner-Fushman
E Amitay
EM Voorhees
Fabien Campagne
I Soboroff
JA Aslam
K Sparck Jones
K Sparck Jones
KC Dorff
M Fuller
P Boldi
P Dong
R Nuray
S Buttcher
SE Robertson
SF Kim
Y Yue
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background The evaluation of information retrieval techniques has traditionally relied on human judges to determine which documents are relevant to a query and which are not. This protocol is used in the Text Retrieval Evaluation Conference (TREC), organized annually for the past 15 years, to support the unbiased evaluation of novel information retrieval approaches. The TREC Genomics Track has recently been introduced to measure the performance of information retrieval for biomedical applications. Results We describe two protocols for evaluating biomedical information retrieval techniques without human relevance judgments. We call these protocols No Title Evaluation (NT Evaluation). The first protocol measures performance for focused searches, where only one relevant document exists for each query. The second protocol measures performance for queries expected to have potentially many relevant documents per query (high-recall searches). Both protocols take advantage of the clear separation of titles and abstracts found in Medline. We compare the performance obtained with these evaluation protocols to results obtained by reusing the relevance judgments produced in the 2004 and 2005 TREC Genomics Track and observe significant correlations between performance rankings generated by our approach and TREC. Spearman's correlation coefficients in the range of 0.79–0.92 are observed comparing bpref measured with NT Evaluation or with TREC evaluations. For comparison, coefficients in the range 0.86–0.94 can be observed when evaluating the same set of methods with data from two independent TREC Genomics Track evaluations. We discuss the advantages of NT Evaluation over the TRels and the data fusion evaluation protocols introduced recently. Conclusion Our results suggest that the NT Evaluation protocols described here could be used to optimize some search engine parameters before human evaluation. Further research is needed to determine if NT Evaluation or variants of these protocols can fully substitute for human evaluations.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Probing Chemical Space with Alkaloid-Inspired Libraries

Author: A Ganesan
A Verma
A Wrobleski
AA Shelat
AM Boldi
AM Meyer
AM Szpilman
BE Evans
BM Trost
BT Smith
CA Lipinski
CD Duarte
D Camp
DF Veber
Digamber Rane
DJ Newman
E Fenster
EG Perez
F Kopp
F Lovering
F Lovering
GL Thomas
Gurpreet Singh
H Lachance
I Paterson
J Zuegg
James N. Plampin
Jeffrey Aubé
Jenna L. Wang
JP Nandy
JT Njardarson
JW Daly
K. J. Frankowski
L Costantino
L Costantino
L Xue
MD Burke
Michael C. McLeod
MS Butler
N Gericke
P MacLellan
PA Clemons
RA Bauer
RA Pilli
RA Pilli
RW DeSimone
RW Huigens
S Wetzel
SM Weinreb
V Gracias
Victor W. Day
WHB Sauer
X Li
Y Zeng
YK Kim
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2014
Field of study

Screening of small molecule libraries is an important aspect of probe and drug discovery science. Numerous authors have suggested that bioactive natural products are attractive starting points for such libraries, due to their structural complexity and sp3-rich character. Here, we describe the construction of a screening library based on representative members of four families of biologically active alkaloids (Stemonaceae, the structurally related cyclindricine and lepadiformine families, lupin, and Amaryllidaceae). In each case, scaffolds were based on structures of the naturally occurring compounds or a close derivative. Scaffold preparation was pursued following the development of appropriate enabling chemical methods. Diversification provided 686 new compounds suitable for screening. The libraries thus prepared had structural characteristics, including sp3 content, comparable to a basis set of representative natural products and were highly rule-of-five compliant

Crossref

KU ScholarWorks

PubMed Central