On Horizontal and Vertical Separation in Hierarchical Text Classification

Author: Chen M.
Kim D.-k.
Lewis D. D.
McCallum A.
Sigurbjörnsson B.
Song Y.
Sun A.
Zhou D.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2016

Hierarchy is a common and effective way of organizing data and representing their relationships at different levels of abstraction. However, hierarchical data dependencies cause difficulties in the estimation of "separable" models that can distinguish between the entities in the hierarchy. Extracting separable models of hierarchical entities requires us to take their relative position into account and to consider the different types of dependencies in the hierarchy. In this paper, we present an investigation of the effect of separability in text-based entity classification and argue that in hierarchical classification, a separation property should be established between entities not only in the same layer, but also in different layers. Our main findings are as follows. First, we analyse the importance of separability on the data representation in the task of classification and, based on that, we introduce a "Strong Separation Principle" for optimizing the expected effectiveness of classifiers' decisions based on the separation property. Second, we present Hierarchical Significant Words Language Models (HSWLM), which capture all, and only, the essential features of hierarchical entities according to their relative position in the hierarchy, resulting in horizontally and vertically separable models. Third, we validate our claims on real-world data and demonstrate how HSWLM improves the accuracy of classification and how it provides transferable models over time. Although discussions in this paper focus on the classification problem, the models are applicable to any information access task on data that has, or can be mapped to, a hierarchical structure. Comment: Full paper (10 pages) accepted for publication in proceedings of ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR'16)

arXiv.org e-Print Archive

Crossref

International Migration, Integration and Social Cohesion online publications

UvA-DARE

MLPerf Inference Benchmark

Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devices to data-center solutions. Fueling the hardware are a dozen or more software frameworks and libraries. The myriad combinations of ML hardware and ML software make assessing ML-system performance in an architecture-neutral, representative, and reproducible manner challenging. There is a clear need for industry-wide standard ML benchmarking and evaluation criteria. MLPerf Inference answers that call. In this paper, we present our benchmarking method for evaluating ML inference systems. Driven by more than 30 organizations as well as more than 200 ML engineers and practitioners, MLPerf prescribes a set of rules and best practices to ensure comparability across systems with wildly differing architectures. The first call for submissions garnered more than 600 reproducible inference-performance measurements from 14 organizations, representing over 30 systems that showcase a wide range of capabilities. The submissions attest to the benchmark's flexibility and adaptability. Comment: ISCA 202

arXiv.org e-Print Archive

Crossref

Online Optimization Methods for the Quantification Problem

Author: Cesa-Bianchi Nicolò
Gentile Claudio
Kar Purushottam
Marthinus
Narasimhan Harikrishna
Parambath Shameem P.
Shalev-Shwartz Shai
Zalinescu Constantin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 13/06/2016

The estimation of class prevalence, i.e., the fraction of a population that belongs to a certain class, is a very useful tool in data analytics and learning, and finds applications in many domains such as sentiment analysis and epidemiology. For example, in sentiment analysis, the objective is often not to estimate whether a specific text conveys a positive or a negative sentiment, but rather to estimate the overall distribution of positive and negative sentiments during an event window. A popular way of performing the above task, often dubbed quantification, is to use supervised learning to train a prevalence estimator from labeled data. Contemporary literature cites several performance measures used to measure the success of such prevalence estimators. In this paper we propose the first online stochastic algorithms for directly optimizing these quantification-specific performance measures. We also provide algorithms that optimize hybrid performance measures that seek to balance quantification and classification performance. Our algorithms present a significant advancement in the theory of multivariate optimization and we show, by a rigorous theoretical analysis, that they exhibit optimal convergence. We also report extensive experiments on benchmark and real data sets which demonstrate that our methods significantly outperform existing optimization techniques used for these performance measures. Comment: 26 pages, 6 figures. A short version of this manuscript will appear in the proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 201
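
To make the notion of class prevalence concrete, here is a minimal sketch of two textbook baseline estimators, "classify and count" and its adjusted variant. These are not the online algorithms proposed in the paper; the function names, the 0/1 label encoding, and the example rates are assumptions made for illustration only.

```python
def classify_and_count(predictions):
    """Raw prevalence estimate: fraction of items predicted positive (labels 0/1)."""
    if not predictions:
        raise ValueError("predictions must be non-empty")
    return sum(predictions) / len(predictions)


def adjusted_count(predictions, tpr, fpr):
    """Correct the raw count using the classifier's true/false positive rates.

    tpr and fpr are assumed to have been estimated on held-out labeled data.
    """
    if tpr == fpr:
        raise ValueError("tpr and fpr must differ for the correction to be defined")
    raw = classify_and_count(predictions)
    estimate = (raw - fpr) / (tpr - fpr)
    # Clip to [0, 1], since the correction can overshoot on small samples.
    return min(1.0, max(0.0, estimate))


def absolute_error(true_prevalence, estimated_prevalence):
    """One simple (illustrative) way to score a prevalence estimate."""
    return abs(true_prevalence - estimated_prevalence)


# Hypothetical example: 40 of 100 texts are predicted positive.
predictions = [1] * 40 + [0] * 60
raw = classify_and_count(predictions)
adjusted = adjusted_count(predictions, tpr=0.8, fpr=0.2)
```

The adjusted estimate differs from the raw one whenever the classifier is imperfect, which is why a quantifier is scored on the prevalence it reports rather than on per-item accuracy.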

arXiv.org e-Print Archive

Crossref

Can we identify non-stationary dynamics of trial-to-trial variability?

Author: A Ben-Hur
A Bouchachia
A Faisal
A Ledberg
A Litwin-Kumar
A Provenzale
A Renart
A Scaglione
Alejandro Tabas-Diaz
B Bathellier
B Staude
B Whitsel
C Lapish
C Quiroga Lombard
D Bernal-Casas
D Durstewitz
D Sussillo
DAJ Blythe
Dante R. Chialvo
E Balaguer-Ballester
E Zohary
Emili Balaguer-Ballester
F Freyer
G Deco
G Nason
G Werner
I Zliobaite
J Beck
J Braun
J Chen
J Du
J Gama
J Gomez-Sanchis
J Ha
J Hyman
J Niessing
J Robinson
J Toups
K Saadi
K Stiefel
L Kuncheva
L Milad
L Paninski
M Budka
M Churchland
M Mann
M Priestley
M Rabinovich
M Volpi
Marcin Budka
O Mazor
P Holmes
P Honeine
P von Bunau
R Kobayashi
R Moreno-Bote
R Schapire
R Stein
S Deneve
S Haraa
S Sabarathinama
T Duong
T Masquelier
T Omi
T Sauer
T Wills
Z Feng
ZP Jiang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 25/04/2014

Identifying sources of the apparent variability in non-stationary scenarios is a fundamental problem in many biological data analysis settings. For instance, neurophysiological responses to the same task often vary from each repetition of the same experiment (trial) to the next. The origin and functional role of this observed variability is one of the fundamental questions in neuroscience. The nature of such trial-to-trial dynamics, however, remains largely elusive to current data analysis approaches. A range of strategies have been proposed in modalities such as electro-encephalography, but gaining a fundamental insight into latent sources of trial-to-trial variability in neural recordings is still a major challenge. In this paper, we present a proof-of-concept study of the analysis of trial-to-trial variability dynamics founded on non-autonomous dynamical systems. At this initial stage, we evaluate the capacity of a simple statistic based on the behaviour of trajectories in classification settings, the trajectory coherence, to identify trial-to-trial dynamics. First, we derive the conditions leading to observable changes in datasets generated by a compact dynamical system (the Duffing equation). This canonical system plays the role of a ubiquitous model of non-stationary supervised classification problems. Second, we estimate the coherence of class-trajectories in an empirically reconstructed space of system states. We show how this analysis can discern variations attributable to non-autonomous deterministic processes from stochastic fluctuations. The analyses are benchmarked using simulated data and two different real datasets which have been shown to exhibit attractor dynamics. As an illustrative example, we focused on the analysis of the rat's frontal cortex ensemble dynamics during a decision-making task. Results suggest that, in line with recent hypotheses, rather than internal noise, it is the deterministic trend which most likely underlies the observed trial-to-trial variability. Thus, the empirical tool developed within this study potentially allows us to infer the source of variability in in-vivo neural recordings.
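
The abstract names the Duffing equation as its canonical non-autonomous system. As a rough illustration of what such a system looks like, the sketch below integrates the standard forced, damped form x'' + delta*x' + alpha*x + beta*x^3 = gamma*cos(omega*t) with a classical fourth-order Runge-Kutta scheme. The parameter values, function names, and choice of integrator are assumptions for illustration; they are not taken from the paper, and the trajectory coherence statistic is not reproduced here.

```python
import math


def duffing_rhs(t, x, v, delta, alpha, beta, gamma, omega):
    """First-order form of x'' + delta*x' + alpha*x + beta*x**3 = gamma*cos(omega*t)."""
    dx = v
    dv = -delta * v - alpha * x - beta * x**3 + gamma * math.cos(omega * t)
    return dx, dv


def integrate_duffing(x0, v0, dt, steps,
                      delta=0.3, alpha=-1.0, beta=1.0, gamma=0.5, omega=1.2):
    """Integrate with classical RK4; returns a list of (t, x, v) tuples."""
    p = (delta, alpha, beta, gamma, omega)
    t, x, v = 0.0, x0, v0
    trajectory = [(t, x, v)]
    for _ in range(steps):
        k1x, k1v = duffing_rhs(t, x, v, *p)
        k2x, k2v = duffing_rhs(t + dt / 2, x + dt / 2 * k1x, v + dt / 2 * k1v, *p)
        k3x, k3v = duffing_rhs(t + dt / 2, x + dt / 2 * k2x, v + dt / 2 * k2v, *p)
        k4x, k4v = duffing_rhs(t + dt, x + dt * k3x, v + dt * k3v, *p)
        x += dt / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
        v += dt / 6 * (k1v + 2 * k2v + 2 * k3v + k4v)
        t += dt
        trajectory.append((t, x, v))
    return trajectory


def energy(x, v, alpha=-1.0, beta=1.0):
    """Mechanical energy of the unforced, undamped system."""
    return 0.5 * v * v + 0.5 * alpha * x * x + 0.25 * beta * x**4


# The explicit time dependence (gamma != 0) is what makes the system non-autonomous.
forced = integrate_duffing(x0=0.5, v0=0.0, dt=0.01, steps=2000)
# With gamma = 0 the system is autonomous and damping steadily removes energy.
unforced = integrate_duffing(x0=0.5, v0=0.0, dt=0.01, steps=2000, gamma=0.0)
```

Comparing the forced and unforced runs from the same initial state gives a simple picture of the distinction the abstract draws between deterministic, time-dependent drift and an otherwise stationary system.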

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Bournemouth University Research Online

FigShare