3 research outputs found

QiSampler: evaluation of scoring schemes for high-throughput datasets using a repetitive sampling strategy on gold standards

Author: A Subramanian
Bernhard Suter
C Jacques
E Marcotte
F Ramirez
Jean F Fontaine
JF Fontaine
K Venkatesan
ME Sowa
Miguel A Andrade-Navarro
O Mete
P Smialowski
R Jansen
RDC Team
RM Ewing
T Barrett
T Fawcett
T Sing
W Xu
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract

Background: High-throughput biological experiments can produce a large amount of data showing little overlap with current knowledge. This may be a problem when evaluating alternative scoring mechanisms for such data according to a gold standard dataset because standard statistical tests may not be appropriate.

Findings: To address this problem we have implemented the QiSampler tool that uses a repetitive sampling strategy to evaluate several scoring schemes or experimental parameters for any type of high-throughput data given a gold standard. We provide two example applications of the tool: selection of the best scoring scheme for a high-throughput protein-protein interaction dataset by comparison to a dataset derived from the literature, and evaluation of functional enrichment in a set of tumour-related differentially expressed genes from a thyroid microarray dataset.

Conclusions: QiSampler is implemented as an open source R script and a web server, which can be accessed at http://cbdm.mdc-berlin.de/tools/sampler/.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MDC Repository

Optimal values of recall and precision

Author: Moss
Olafsen
Publication venue: Wiley

Crossref

Optimal values of recall and precision

Author: Ackoff
Cleverdon
Cooper
Garvey
Hirshleifer
Robertson
Salton
Samuelson
Schoolman
Swets
Taylor
Van der Meulen
Vickery
Williams
Publication venue: Wiley

Crossref