
    A foundation for machine learning in design

    This paper presents a formalism for considering the issues of learning in design. A foundation for machine learning in design (MLinD) is defined so as to provide answers to basic questions on learning in design, such as "What types of knowledge can be learnt?", "How does learning occur?", and "When does learning occur?". Five main elements of MLinD are presented: the input knowledge, knowledge transformers, output knowledge, goals/reasons for learning, and learning triggers. Using this foundation, published MLinD systems were reviewed; the systematic review provides a basis for validating the presented foundation. The paper concludes that considerable work remains to be carried out in order to fully formalize the foundation of MLinD.
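    The five elements listed in the abstract can be pictured as a single record describing one learning episode. A minimal sketch — all class, field, and example values below are illustrative assumptions, not identifiers from the paper:

    ```python
    from dataclasses import dataclass

    # Hypothetical encoding of the five MLinD elements; names are
    # invented for illustration, not taken from the paper.
    @dataclass
    class LearningEpisode:
        input_knowledge: list[str]   # what the learner starts from
        knowledge_transformer: str   # mechanism producing new knowledge
        output_knowledge: list[str]  # what is learnt
        goal: str                    # reason for learning
        trigger: str                 # event that initiates learning

    episode = LearningEpisode(
        input_knowledge=["past design cases"],
        knowledge_transformer="induction over case features",
        output_knowledge=["design rule: thin walls need stiffening ribs"],
        goal="reduce redesign iterations",
        trigger="a failed design evaluation",
    )
    print(episode.trigger)
    ```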

    Resource Constrained Structured Prediction

    We study the problem of structured prediction under test-time budget constraints. We propose a novel approach applicable to a wide range of structured prediction problems in computer vision and natural language processing. Our approach adaptively generates computationally costly features at test time in order to reduce the computational cost of prediction while maintaining prediction performance. We show that training the adaptive feature generation system can be reduced to a series of structured learning problems, allowing efficient training with existing structured learning algorithms. This framework also provides theoretical justification for several existing heuristic approaches in the literature. We evaluate the proposed adaptive system on two structured prediction tasks, optical character recognition (OCR) and dependency parsing, and show strong reductions in feature cost without degrading accuracy.
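    The core idea — pay for expensive features only where cheap features leave the prediction uncertain — can be sketched with a toy two-stage policy. Everything here (the confidence function, costs, and threshold) is an invented illustration, not the paper's actual system:

    ```python
    # Toy test-time adaptive feature acquisition: cheap features are
    # always computed; costly features are bought only when the cheap
    # prediction is uncertain. All scores and costs are illustrative.

    def cheap_confidence(x):
        return x  # stand-in for a confidence score from cheap features

    def predict(x, threshold=0.7):
        cost = 1.0                    # cheap features always paid for
        conf = cheap_confidence(x)
        if conf < threshold:          # uncertain: acquire costly features
            cost += 10.0
            conf = min(1.0, conf + 0.5)
        return conf, cost

    inputs = [0.9, 0.8, 0.3, 0.95, 0.5]
    adaptive_cost = sum(predict(x)[1] for x in inputs)
    always_costly = 11.0 * len(inputs)
    print(adaptive_cost, always_costly)  # 25.0 vs 55.0
    ```

    Only the two uncertain inputs trigger the expensive feature computation, so the adaptive policy pays 25.0 instead of 55.0 on this toy batch.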

    Salience-based selection: attentional capture by distractors less salient than the target

    Current accounts of attentional capture predict that the most salient stimulus is invariably selected first. However, existing salience and visual search models assume noise in the map computation or selection process. Consequently, they predict the first selection to be stochastically dependent on salience, implying that attention could even be captured first by the second most salient (instead of the most salient) stimulus in the field. Yet capture by less salient distractors has not been reported, and salience-based selection accounts claim that the distractor has to be more salient in order to capture attention. We tested this prediction using empirical and modeling approaches to the visual-search distractor paradigm. For the empirical part, we manipulated the salience of target and distractor parametrically and measured reaction time interference when a distractor was present compared to when it was absent. Reaction time interference was strongly correlated with distractor salience relative to the target. Moreover, even distractors less salient than the target captured attention, as measured by reaction time interference and oculomotor capture. In the modeling part, we simulated first selection in the distractor paradigm using behavioral measures of salience and considering the time course of selection, including noise. We were able to replicate the result pattern obtained in the empirical part. We conclude that each salience value follows a specific selection time distribution, and attentional capture occurs when the selection time distributions of target and distractor overlap. Hence, selection is stochastic in nature, and attentional capture occurs with a certain probability depending on relative salience.
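    The stochastic-selection account can be illustrated with a small Monte Carlo sketch: each item's selection time is drawn from a noisy distribution whose mean decreases with salience, and capture occurs whenever the distractor's sampled time beats the target's. The distributions and parameter values below are invented for illustration, not fitted to the paper's data:

    ```python
    import random

    # Toy model: higher salience -> shorter mean selection time.
    # Gaussian noise makes selection stochastic, so overlapping
    # distributions produce occasional capture by a LESS salient item.
    def selection_time(salience, rng, noise_sd=20.0):
        return rng.gauss(100.0 / salience, noise_sd)

    def capture_probability(target_sal, distractor_sal,
                            trials=100_000, seed=0):
        rng = random.Random(seed)
        captures = sum(
            selection_time(distractor_sal, rng) < selection_time(target_sal, rng)
            for _ in range(trials)
        )
        return captures / trials

    # Distractor less salient than the target, yet capture
    # probability is clearly above zero (and below 0.5).
    p = capture_probability(target_sal=2.0, distractor_sal=1.5)
    print(round(p, 3))
    ```

    Because the two selection-time distributions overlap, the less salient distractor still wins the race on a minority of trials — capture with a certain probability depending on relative salience, as the abstract concludes.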

    Probabilistic Bag-Of-Hyperlinks Model for Entity Linking

    Many fundamental problems in natural language processing rely on determining what entities appear in a given text. Commonly referred to as entity linking, this step is a fundamental component of many NLP tasks such as text understanding, automatic summarization, semantic search, and machine translation. Name ambiguity, word polysemy, context dependencies, and a heavy-tailed distribution of entities contribute to the complexity of this problem. Here we propose a probabilistic approach that makes use of an effective graphical model to perform collective entity disambiguation. Input mentions (i.e., linkable token spans) are disambiguated jointly across an entire document by combining a document-level prior of entity co-occurrences with local information captured from mentions and their surrounding context. The model is based on simple sufficient statistics extracted from data, leaving few parameters to be learned. Our method requires neither extensive feature engineering nor an expensive training procedure. We use loopy belief propagation to perform approximate inference. The low complexity of our model makes this step sufficiently fast for real-time usage. We demonstrate the accuracy of our approach on a wide range of benchmark datasets, showing that it matches, and in many cases outperforms, existing state-of-the-art methods.
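    The joint objective — local mention scores plus a document-level co-occurrence prior — can be made concrete with a tiny two-mention example scored exhaustively; this is the objective that loopy belief propagation would approximate on larger documents. All entities, scores, and the pairwise bonus below are invented for illustration:

    ```python
    from itertools import product

    # Toy collective disambiguation: each mention has candidate
    # entities with a local (context) score; jointly chosen entity
    # pairs earn a co-occurrence bonus. Scores are illustrative.
    local = {
        "Paris": {"Paris_city": 0.6, "Paris_Hilton": 0.4},
        "Seine": {"Seine_river": 0.9, "Seine_dept": 0.1},
    }
    pairwise = {("Paris_city", "Seine_river"): 1.0}  # co-occurrence prior

    def joint_score(assignment):
        score = sum(local[m][e] for m, e in assignment.items())
        chosen = set(assignment.values())
        for pair, bonus in pairwise.items():
            if set(pair) <= chosen:
                score += bonus
        return score

    mentions = list(local)
    best = max(
        (dict(zip(mentions, combo))
         for combo in product(*(local[m] for m in mentions))),
        key=joint_score,
    )
    print(best)
    ```

    Here the co-occurrence bonus pulls both mentions toward the mutually coherent reading (the city and the river), which is exactly the collective effect the model's document-level prior provides.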