Search CORE

333 research outputs found

Improving Screening Processes via Calibrated Subset Selection

Author: Gomez Rodriguez M.
Joachims T.
Wang L.
Publication venue
Publication date: 01/01/2022
Field of study

MPG.PuRe

Antibodies to the Chlamydial 60 Kilodalton Heat Shock Protein in Women With Tubal Factor Infertility

Author: B. D. Statland
D. I. L. Dozier
J. Gunter
K. A. Ault
M. L. Joachims
M. M. Smith King
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2007
Field of study

Crossref

Boric acid vaginal suppositories: a brief review.

Author: Ault K A
Dozier D I
Gunter J
Joachims M L
King M M
Statland B D
Publication venue
Publication date: 01/01/1998
Field of study

OBJECTIVE: The purpose of this study was to determine the utility of serum CA125 determinations in diagnosing acute salpingitis. METHODS: CA125 levels were determined for 34 women with the clinical diagnosis of pelvic inflammatory disease (PID). Acute salpingitis was confirmed laparoscopically in 28 women (82.3%). RESULTS: Twenty patients (71.4%) with laparoscopically confirmed acute salpingitis had CA125 levels greater than 7.5 units, compared with no patients (0/6) with laparoscopically normal tubes (P = 0.002). The degree of elevation of CA125 levels correlated with the severity of tubal inflammation noted at laparoscopy. All patients with levels above 16 units had laparoscopically severe salpingitis. CONCLUSIONS: We conclude that while CA125 levels above 7.5 units may modestly improve the ability of the clinical diagnosis of PID to accurately reflect visually confirmed acute salpingitis, limitations of the test make its clinical utility questionable

Crossref

Directory of Open Access Journals

PubMed Central

VCU Scholars Compass

Hyperparameter Importance Across Datasets

Author: Bergstra J.
Bonilla E. V.
Brazdil P.
Demvsar J.
Feurer M.
Jamieson K.
Joachims T.
Klein A.
Li L.
Loshchilov I.
Sobol I. M.
van Rijn J. N.
van Rijn J. N.
Wistuba M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 29/05/2018
Field of study

With the advent of automated machine learning, automated hyperparameter optimization methods are by now routinely used in data mining. However, this progress is not yet matched by equal progress on automatic analyses that yield information beyond performance-optimizing hyperparameter settings. In this work, we aim to answer the following two questions: Given an algorithm, what are generally its most important hyperparameters, and what are typically good values for these? We present methodology and a framework to answer these questions based on meta-learning across many datasets. We apply this methodology using the experimental meta-data available on OpenML to determine the most important hyperparameters of support vector machines, random forests and Adaboost, and to infer priors for all their hyperparameters. The results, obtained fully automatically, provide a quantitative basis to focus efforts in both manual algorithm design and in automated hyperparameter optimization. The conducted experiments confirm that the hyperparameters selected by the proposed method are indeed the most important ones and that the obtained priors also lead to statistically significant improvements in hyperparameter optimization.Comment: \c{opyright} 2018. Copyright is held by the owner/author(s). Publication rights licensed to ACM. This is the author's version of the work. It is posted here for your personal use, not for redistribution. The definitive Version of Record was published in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Minin

arXiv.org e-Print Archive

Crossref

Semi-supervised prediction of protein interaction sentences exploiting semantically encoded metrics

Author: D.D. Lewis
E.M. Marcotte
J.D. Kim
K. Lund
L. Azzopardi
M. Girolami
M.N. Jones
M.N. Jones
R. Bunescu
S. Padó
S. Pyysalo
S. Rogers
T. Joachims
T.K. Landauer
Z. Minier
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Protein-protein interaction (PPI) identification is an integral component of many biomedical research and database curation tools. Automation of this task through classification is one of the key goals of text mining (TM). However, labelled PPI corpora required to train classifiers are generally small. In order to overcome this sparsity in the training data, we propose a novel method of integrating corpora that do not contain relevance judgements. Our approach uses a semantic language model to gather word similarity from a large unlabelled corpus. This additional information is integrated into the sentence classification process using kernel transformations and has a re-weighting effect on the training features that leads to an 8% improvement in F-score over the baseline results. Furthermore, we discover that some words which are generally considered indicative of interactions are actually neutralised by this process

Detrended fluctuation analysis as a statistical tool to monitor the climate

Author: Bodri L
Bodri L
Eichner J
Fluegeman R H
Ivanova K
Ivanova K
Joachims T
Koeppen W
Koscielny-Bunde E
Koscielny-Bunde E Kantelhardt J W Braun P Bunde A Havlin S
Kurnaz M L
Livina V
M L Kurnaz
Mandelbrot B B
Mandelbrot B B
Peng C K
Talkner P
Vapnik V N
Vjushin D
Wang Y
Publication venue: 'IOP Publishing'
Publication date: 27/03/2004
Field of study

Detrended fluctuation analysis is used to investigate power law relationship between the monthly averages of the maximum daily temperatures for different locations in the western US. On the map created by the power law exponents, we can distinguish different geographical regions with different power law exponents. When the power law exponents obtained from the detrended fluctuation analysis are plotted versus the standard deviation of the temperature fluctuations, we observe different data points belonging to the different climates, hence indicating that by observing the long-time trends in the fluctuations of temperature we can distinguish between different climates.Comment: 8 pages, 4 figures, submitted to JSTA

arXiv.org e-Print Archive

Crossref

Transductive Learning for Spatial Data Classification

Author: A. Appice
A. Frank
A. Gammerman
A. Mukerjee
D. Malerba
D. Malerba
D. Malerba
D. Malerba
D. McIver
F. Esposito
G. Góra
J. Han
J. Sander
J.A. Robinson
K. Koperski
K.P. Bennett
L. Džeroski
L. Raedt De
L. Raedt De
M. Ceci
M. Ceci
M. Ceci
M. Ester
M. Krogel
M. Kukar
M.-A. Krogel
M.J. Egenhofer
N. Lavrač
P. Legendre
R.S. Michalski
S. Muggleton
S. Shekhar
S. Shekhar
S. Shekhar
T. Joachims
T. Joachims
T. Mitchell
V. Vapnik
V. Vapnik
W. Klösgen
Y. Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Learning classifiers of spatial data presents several issues, such as the heterogeneity of spatial objects, the implicit definition of spatial relationships among objects, the spatial autocorrelation and the abundance of unlabelled data which potentially convey a large amount of information. The first three issues are due to the inherent structure of spatial units of analysis, which can be easily accommodated if a (multi-)relational data mining approach is considered. The fourth issue demands for the adoption of a transductive setting, which aims to make predictions for a given set of unlabelled data. Transduction is also motivated by the contiguity of the concept of positive autocorrelation, which typically affect spatial phenomena, with the smoothness assumption which characterize the transductive setting. In this work, we investigate a relational approach to spatial classification in a transductive setting. Computational solutions to the main difficulties met in this approach are presented. In particular, a relational upgrade of the nave Bayes classifier is proposed as discriminative model, an iterative algorithm is designed for the transductive classification of unlabelled data, and a distance measure between relational descriptions of spatial objects is defined in order to determine the k-nearest neighbors of each example in the dataset. Computational solutions have been tested on two real-world spatial datasets. The transformation of spatial data into a multi-relational representation and experimental results are reported and commented

Crossref

Archivio istituzionale della ricerca - Università di Bari

Kent Academic Repository

Machine Learning in Automated Text Categorization

Author: ANDROUTSOPOULOS I.
ATTARDI G.
BAKER L.D.
BIEBRICHER P.
CAROPRESO M.F.
CAVNAR W.B.
CHAKRABARTI S.
CLACK C.
CLEVERDON C.
COHEN W. W.
COHEN W. W.
COHEN W.W.
DAGAN I.
DEERWESTER S.
DENOYER L.
DIAZ ESTEBAN A.
DRUCKER H.
DUMAIS S.T.
DUMAIS S.T.
ESCUDERO G.
Fabrizio Sebastiani
FIELD B.
FORSYTH R. S.
FUHR N.
FUHR N.
FUHR N.
FURNKRANZ J.
GALAVOTTI L.
GALE W. A.
GOVERT N.
GRAY W.A.
GUTHRIE L.
HAYES P.J.
HEAPS H.
HERSH W.
HULL D. A.
HULL D. A.
ITTNER D.J.
IWAYAMA M.
IYER R.D.
JOACHIMS T.
JOACHIMS T.
JOACHIMS T.
JOHN G. H.
JUNKER M.
JUNKER M.
KESSLER B.
KIM Y.-H.
KLINKENBERG R.
KNORZ G.
KOLLER D.
LAM S.L.
LAM W.
LAM W.
LANG K.
LARKEY L. S.
LARKEY L. S.
LARKEY L.S.
LEWIS D. D.
LEWIS D. D.
LEWIS D. D.
LEWIS D. D.
LEWIS D.D.
LEWIS D.D.
LEWIS D.D.
LEWIS D.D.
LEWIS D.D.
LI H.
LI Y.H.
LIERE R.
LIM J. H.
MASAND B.
MASAND B.
MCCALLUM A. K.
MCCALLUM A.K.
MLADENIC D.
MLADENIC D.
MOULINIER I.
MOULINIER I.
MYERS K.
NG H.T.
OH H.-J.
PAZIENZA M. T.
RILOFF E.
ROBERTSON S.E.
ROBERTSON S.E.
ROTH D.
RUIZ M.E.
SABLE C.L.
SARACEVIC T.
SCHAPIRE R. E.
SCHUTZE H.
SCHUTZE H.
SCOTT S.
SEBASTIANI F.
SINGHAL A.
SLONIM N.
TAIRA H.
TUMER K.
TZERAS K.
VAN RIJSBERGEN C. J.
WIENER E.D.
YANG Y.
YANG Y.
YANG Y.
YANG Y.
YU K.L.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2001
Field of study

The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach to this problem is based on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of preclassified documents, the characteristics of the categories. The advantages of this approach over the knowledge engineering approach (consisting in the manual definition of a classifier by domain experts) are a very good effectiveness, considerable savings in terms of expert manpower, and straightforward portability to different domains. This survey discusses the main approaches to text categorization that fall within the machine learning paradigm. We will discuss in detail issues pertaining to three different problems, namely document representation, classifier construction, and classifier evaluation.Comment: Accepted for publication on ACM Computing Survey

arXiv.org e-Print Archive

CiteSeerX

Crossref

Enabling multi-level relevance feedback on PubMed by integrating rank learning into DBMS

Author: B Suomela
C Burges
C Sneiderman
D States
F Radlinski
G Poulter
G Salton
H Oh
H Yu
H Yu
Hwanjo Yu
Ilhwan Ko
J Xu
Jinoh Oh
L Murphy
M Siadaty
Sungchul Kim
T Joachims
T Joachims
T Qin
Taehoon Kim
V Cherkassky
W Hersh
Wook-Shin Han
X Geng
Y Cao
Y Lin
Yoo Illhoi
Z Lu
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Background: Finding relevant articles from PubMed is challenging because it is hard to express the user's specific intention in the given query interface, and a keyword query typically retrieves a large number of results. Researchers have applied machine learning techniques to find relevant articles by ranking the articles according to the learned relevance function. However, the process of learning and ranking is usually done offline without integrated with the keyword queries, and the users have to provide a large amount of training documents to get a reasonable learning accuracy. This paper proposes a novel multi-level relevance feedback system for PubMed, called RefMed, which supports both ad-hoc keyword queries and a multi-level relevance feedback in real time on PubMed. Results: RefMed supports a multi-level relevance feedback by using the RankSVM as the learning method, and thus it achieves higher accuracy with less feedback. RefMed "tightly" integrates the RankSVM into RDBMS to support both keyword queries and the multi-level relevance feedback in real time; the tight coupling of the RankSVM and DBMS substantially improves the processing time. An efficient parameter selection method for the RankSVM is also proposed, which tunes the RankSVM parameter without performing validation. Thereby, RefMed achieves a high learning accuracy in real time without performing a validation process. RefMed is accessible at http://dm.postech.ac.kr/refmed. Conclusions: RefMed is the first multi-level relevance feedback system for PubMed, which achieves a high accuracy with less feedback. It effectively learns an accurate relevance function from the user's feedback and efficiently processes the function to return relevant articles in real time.1114Nsciescopu

Crossref

Springer - Publisher Connector

PubMed Central

포항공과대학교

Notch signaling during human T cell development

Author: A Galy
A Krueger
A Sambandam
A Wilson
A Wolfer
AC Jaleco
AI Garbe
AP Weng
B Blom
B Reizis
B Vandekerckhove
BN Weber
C Benne
CC Tydell
CN Ting
D Hockemeyer
D Hockemeyer
E Robey
EM Six
ES David-Fung
F Klein
F Radtke
F Radtke
F Timmermans
F Weerkamp
F Weerkamp
G Awong
H Hirata
H Neves
H Wang
HT Petrie
HY Kueh
I Hoebeke
I Maillard
I Maillard
I Walle Van de
I Walle Van de
J Buer
J Gotter
J Plum
J Shi
JS Yuan
K Heinzel
K Hozumi
K Li
K Maki
K Tanigaki
L Li
M Ciofani
M Ciofani
M Ciofani
M Garcia-Peydro
M Ghisi
M Magri
M Smedt De
M Smedt De
M Smedt De
M Smedt De
MA Yui
MD Green
ML Joachims
N Lefort
P Beatus
P Doerfler
P Li
QL Hao
QL Hao
R Haddad
RN Motte-Mohs La
S Coppernolle Van
S Doulatov
S Gonzalez-Garcia
S Suliman
S Verbeek
SK Durum
SK Ye
SM Lehar
SY Lee
T Hosoya
T Ikawa
T Ikawa
T Kreslavsky
T Kreslavsky
T Palomero
T Taghon
T Taghon
T Taghon
T Taghon
T Taghon
T Taghon
TB Feyerabend
TM Schmitt
TN Taghon
W Dontje
WA Dik
Y Wu
YR Carrasco
Z Galic
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Notch signaling is critical during multiple stages of T cell development in both mouse and human. Evidence has emerged in recent years that this pathway might regulate T-lineage differentiation differently between both species. Here, we review our current understanding of how Notch signaling is activated and used during human T cell development. First, we set the stage by describing the developmental steps that make up human T cell development before describing the expression profiles of Notch receptors, ligands, and target genes during this process. To delineate stage-specific roles for Notch signaling during human T cell development, we subsequently try to interpret the functional Notch studies that have been performed in light of these expression profiles and compare this to its suggested role in the mouse

Crossref

Ghent University Academic Bibliography