58 research outputs found

Electrostatic Field Classifier for Deficient Data

Author: A.P. Dempster
B. Gabrys
D. Ruta
J.L. Schafer
K. Torkkola
W. Outhwaite
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

This paper investigates the suitability of recently developed models based on physical field phenomena for classification problems with incomplete datasets. An original approach to exploiting incomplete training data with missing features and labels, involving extensive use of an electrostatic charge analogy, has been proposed. Classification of incomplete patterns has been investigated using a local dimensionality reduction technique, which aims at exploiting all available information rather than trying to estimate the missing values. The performance of all proposed methods has been tested on a number of benchmark datasets for a wide range of missing data scenarios and compared to the performance of some standard techniques. Several modifications of the original electrostatic field classifier aiming at improving speed and robustness in higher dimensional spaces are also discussed.

Crossref

Bournemouth University Research Online

Self-explaining AI as an alternative to interpretable AI

Author: B Goertzel
BA Richards
C Rudin
E Linfoot
F Sahigara
K Torkkola
L Breiman
M Belkin
MD Zeiler
N Bostrom
N-M Aliman
P McClure
S Bach
S Shen
S Spigler
U Hasson
WJ Murdoch
WR Ashby
WR Swartout
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/07/2020

The ability to explain decisions made by AI systems is highly sought after, especially in domains where human lives are at stake, such as medicine or autonomous vehicles. While it is often possible to approximate the input-output relations of deep neural networks with a few human-understandable rules, the discovery of the double descent phenomenon suggests that such approximations do not accurately capture the mechanism by which deep neural networks work. Double descent indicates that deep neural networks typically operate by smoothly interpolating between data points rather than by extracting a few high level rules. As a result, neural networks trained on complex real world data are inherently hard to interpret and prone to failure if asked to extrapolate. To show how we might be able to trust AI despite these problems, we introduce the concept of self-explaining AI. Self-explaining AIs are capable of providing a human-understandable explanation of each decision along with confidence levels for both the decision and explanation. For this approach to work, it is important that the explanation actually be related to the decision, ideally capturing the mechanism used to arrive at the explanation. Finally, we argue it is important that deep learning based systems include a "warning light" based on techniques from applicability domain analysis to warn the user if a model is asked to extrapolate outside its training distribution. For a video presentation of this talk see https://www.youtube.com/watch?v=Py7PVdcu7WY& .

arXiv.org e-Print Archive

Crossref

Preferred Spatial Frequencies for Human Face Processing Are Associated with Optimal Class Discrimination in the Machine

Author: A Fiorentini
A Hayes
Agata Lapedriza
C Howe
David Masip
E Peli
F Attneave
F Harris
F Long
Guorong Xuan
H Barlow
J Atick
Jordi Vitria
K Fukunaga
K Torkkola
Keinosuke Fukunaga
M Hellman
M Keil
M Srinivasan
M Turk
Matthias S. Keil
N Costen
Ojanpää
P Lenny
R Baddeley
R Duda
R Fisher
R Linsker
R Lotto
R Näsänen
Robert P. Futrelle
S Laughlin
S Nundy
T Hastie
T Hosoya
T Tieger
W Levy
Publication venue: Public Library of Science
Publication date: 01/01/2008

Psychophysical studies suggest that humans preferentially use a narrow band of low spatial frequencies for face recognition. Here we asked whether artificial face recognition systems have an improved recognition performance at the same spatial frequencies as humans. To this end, we estimated recognition performance over a large database of face images by computing three discriminability measures: Fisher Linear Discriminant Analysis, Non-Parametric Discriminant Analysis, and Mutual Information. In order to address frequency dependence, discriminabilities were measured as a function of (filtered) image size. All three measures revealed a maximum at the same image sizes, where the spatial frequency content corresponds to the psychophysically found frequencies. Our results therefore support the notion that the critical band of spatial frequencies for face recognition in humans and machines follows from inherent properties of face images, and that the use of these frequencies is associated with optimal face recognition performance.

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

The Oberta in open access

Diposit Digital de la Universitat de Barcelona

Automatically finding the control variables for complex system behavior

Author: A. Saltelli
B. Boehm
B. Fischer
C. Bishop
D.E. Goldberg
E. Tuv
G. Antoniol
G. Gay
G. Towell
G.J. Holzmann
Gregory Gay
H. Jing
H.B. Mann
I.H. Witten
J. Gu
J. Oakley
K. Rose
K. Torkkola
Karen Gundy-Burlet
Misty Davies
N. Metropolis
P. Austin
P.E. Gill
R. Dechter
R. Kohavi
R. Quinlan
R. Spear
R.C. Holte
S. Kirkpatrick
S. Sendall
T. Uribe
Tim Menzies
V. Eruhimov
Publication venue: 'Springer Science and Business Media LLC'
Publication date

Crossref

Towards Comprehensive Foundations of Computational Intelligence

Author: A Cichocki
A Gifi
A Gutkin
A Hyvärinen
A Konar
A Newell
A Pouget
A Roy
AM Callataÿ de
B Bakker
B Kégl
B Schölkopf
C Giraud-Carrier
C Jones
C Wendelken
CD Manning
CS Ong
D Michie
D Nauck
D Rousseau
D Wolpert
DL Wang
E Bauer
E Pekalska
E Salinas
E Simoncelli
EM Iyoda
F Corbacho
F Crestani
F Schwenker
FR Bach
G Giacinto
G-B Huang
GA Carpenter
GE Hinton
GRG Lanckriet
GS Cree
H Haas
H Leung
H Lodhi
I Guyon
J-P Vert
JA Anderson
JG Wolff
JH Friedman
JSR Jang
K Grabczewski
K Torkkola
K Tsuda
KP Unnikrishnan
KS Fu
L Goldfarb
L Györfi
L Shastri
LI Kuncheva
M Blachnik
M Grochowski
M Kordos
M Leshno
MJ Kearns
MJD Powell
N Chater
N Jankowski
N Kunstman
NI Achieser
O Chapelle
P Dayan
P Matykiewicz
P Smyth
PH Winston
PM Baggenstoss
R Avnimelech
R Hecht-Nielsen
R Raizada
RE Schapire
RF Thompson
RL Gorsuch
RO Duda
RS Sutton
S Anuj
S Deneve
S Grossberg
S Haykin
S Mitra
S Roweis
SF Walker
SJ Russell
SK Pal
T Bilgiç
T Kohonen
T Poggio
T Wieczorek
TG Dietterich
TJ McCabe
TM Cover
V Kecman
W Duch
W Maass
W Shoujue
Y Bengio
Y Burnod
YH Pao
YJ Lee
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007

Although computational intelligence (CI) covers a vast variety of different methods, it still lacks an integrative theory. Several proposals for CI foundations are discussed: computing and cognition as compression, meta-learning as search in the space of data models, (dis)similarity based methods providing a framework for such meta-learning, and a more general approach based on chains of transformations. Many useful transformations that extract information from features are discussed. Heterogeneous adaptive systems are presented as a particular example of transformation-based systems, and the goal of learning is redefined to facilitate creation of simpler data models. The need to understand data structures leads to techniques for logical and prototype-based rule extraction, and to generation of multiple alternative models, while the need to increase predictive power of adaptive models leads to committees of competent models. Learning from partial observations is a natural extension towards reasoning based on perceptions, and an approach to intuitive solving of such problems is presented. Throughout the paper neurocognitive inspirations are frequently used and are especially important in modeling of the higher cognitive functions. Promising directions such as liquid and laminar computing are identified and many open problems presented.

CiteSeerX

Crossref

Text Mining with an Augmented Version of the Bisecting K-Means Algorithm

Author: K. Torkkola
M. Dittenbach
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

Crossref

Feature extraction using information-theoretic learning

Author: D. Erdogmus
J.C. Principe
K. Torkkola
K.E. Hild
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date

Crossref

Active Learning with Ensembles for Image Classification

Author: H. Liu
K. Torkkola
Liu And Mandvikar
P. Foschi
Publication venue
Publication date

In many real-world tasks of image classification, limited amounts of labeled data are available to train automatic classifiers. Consequently, extensive human expert involvement is required for verification

CiteSeerX

Recursive Approach for Real-Time Blind Source Separation of Acoustic Signals

Author: A. Cichocki
K. Torkkola
L. Parra
M. Kawamoto
S. Haykin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2003

Crossref

Prediction by Categorical Features: Generalization Properties and Application to Feature Ranking

Author: A. Antos
J.R. Quinlan
K. Torkkola
T. Hastie
T.M. Mitchell
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007

Crossref