Search CORE

4,678 research outputs found

Vision systems with the human in the loop

Author: Bauckhage Christian
Hanheide Marc
Kaster Thomas
Pfeiffer Michael
Sagerer Gerhard
Wrede Sebastian
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2005
Field of study

The emerging cognitive vision paradigm deals with vision systems that apply machine learning and automatic reasoning in order to learn from what they perceive. Cognitive vision systems can rate the relevance and consistency of newly acquired knowledge, they can adapt to their environment and thus will exhibit high robustness. This contribution presents vision systems that aim at flexibility and robustness. One is tailored for content-based image retrieval, the others are cognitive vision systems that constitute prototypes of visual active memories which evaluate, gather, and integrate contextual knowledge for visual analysis. All three systems are designed to interact with human users. After we will have discussed adaptive content-based image retrieval and object and action recognition in an office environment, the issue of assessing cognitive systems will be raised. Experiences from psychologically evaluated human-machine interactions will be reported and the promising potential of psychologically-based usability experiments will be stressed

University of Lincoln Institutional Repository

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Publications at Bielefeld University

The Neural Basis of Cognitive Efficiency in Motor Skill Performance from Early Learning to Automatic Stages

Author: A Floyer-Lea
A Karni
A Karni
A Kubler
A Moors
AJ Pearce
AM Gentile
AM Gentile
Angela R. Laird
AW Salmoni
B Abernethy
B Almåsbakk
BA Clegg
C Gerloff
C-HJ Lin
CA Thorn
CA Thorn
CE Lang
CM Korsgaard
CS Carter
CS Sherrington
CS Sherrington
D Kahneman
D LaBerge
D Wright
David Coynel
DL Porretta
DL Wright
DL Wright
DL Wright
DM Little
DM Wolpert
DW Chaney
DW Fendrich
E Dayan
E Hazeltine
E Koechlin
EL Abrahamse
EL Abrahamse
EM Robertson
EM Robertson
ES Cross
F Brady
F Brady
FG Ashby
FG Ashby
FG Ashby
G Buccino
G Stein
GA Miller
H Kondo
H Pashler
HA Whitaker
HE Schendan
Henry H Yin
HH Yin
HP Bahrick
I Toni
IH Jenkins
J Breton
J Doyon
J Doyon
J Doyon
J Doyon
J Jonides
J Toner
JA Taylor
JB Shea
JD Cohen
JL Leavitt
JL Wambaugh
JM Fuster
Julien Doyon
JW Krakauer
JX O’Reilly
K Sakai
KC Engel
KJ Friston
KR Lohse
L Bezzola
L Pauwels
L Proteau
L Shmuelof
L Solomons
M Kawato
M Paola Di
M. Herdener
M. Jueptner
M. Jueptner
MA Immink
MA Immink
MA Immink
Maria-Felice Ghilardi
MI Jordan
MI Posner
MJ Nissen
MP Walker
N Bernstein
NF Wymbs
O Hikosaka
O Jastrow
Okihide Hikosaka
P Nachev
P Rey Del
PD McLeod
PJ Smith
PM Fitts
PM Fitts
R Shadmehr
R Shadmehr
RA Magill
RA Poldrack
RA Schmidt
RA Schmidt
RA Schmidt
RC Miall
RD Seidler
RE Burke
RE Passingham
RM Hardwick
RN Singer
RW Pew
Rüdiger J. Seitz
S Goode
S Helie
S Lehericy
S Ollis
S Tunovic
Scott T. Grafton
SG Adams
SP Swinnen
SS Kantak
ST Grafton
ST Grafton
ST Grafton
ST Klapp
SW Keele
T Kim
T Wu
T Wu
T Wu
T Wu
Tiziana Marilena Florio
TL Brown
TR Knock
V Puttemans
VB Penhune
VB Penhune
W James
W Schneider
W Schneider
WB Verwey
WB Verwey
WB Verwey
WB Verwey
WB Verwey
Willem B. Verwey
Y Li
Y Matsuzaka
Publication venue: Springer
Publication date: 01/01/2020
Field of study

Crossref

University of Twente Research Information

Resonant Neural Dynamics of Speech Perception

Author: Grossberg Stephen
Publication venue: Boston University Center for Adaptive Systems and Department of Cognitive and Neural Systems
Publication date: 01/09/2002
Field of study

What is the neural representation of a speech code as it evolves in time? How do listeners integrate temporally distributed phonemic information across hundreds of milliseconds, even backwards in time, into coherent representations of syllables and words? What sorts of brain mechanisms encode the correct temporal order, despite such backwards effects, during speech perception? How does the brain extract rate-invariant properties of variable-rate speech? This article describes an emerging neural model that suggests answers to these questions, while quantitatively simulating challenging data about audition, speech and word recognition. This model includes bottom-up filtering, horizontal competitive, and top-down attentional interactions between a working memory for short-term storage of phonetic items and a list categorization network for grouping sequences of items. The conscious speech and word recognition code is suggested to be a resonant wave of activation across such a network, and a percept of silence is proposed to be a temporal discontinuity in the rate with which such a resonant wave evolves. Properties of these resonant waves can be traced to the brain mechanisms whereby auditory, speech, and language representations are learned in a stable way through time. Because resonances are proposed to control stable learning, the model is called an Adaptive Resonance Theory, or ART, model.Air Force Office of Scientific Research (F49620-01-1-0397); National Science Foundation (IRI-97-20333); Office of Naval Research (N00014-01-1-0624)

Boston University Institutional Repository (OpenBU)

Symbolic and Deep Learning Based Data Representation Methods for Activity Recognition and Image Understanding at Pixel Level

Author: Karki Manohar
Publication venue: LSU Digital Commons
Publication date: 01/01/2017
Field of study

Efficient representation of large amount of data particularly images and video helps in the analysis, processing and overall understanding of the data. In this work, we present two frameworks that encapsulate the information present in such data. At first, we present an automated symbolic framework to recognize particular activities in real time from videos. The framework uses regular expressions for symbolically representing (possibly infinite) sets of motion characteristics obtained from a video. It is a uniform framework that handles trajectory-based and periodic articulated activities and provides polynomial time graph algorithms for fast recognition. The regular expressions representing motion characteristics can either be provided manually or learnt automatically from positive and negative examples of strings (that describe dynamic behavior) using offline automata learning frameworks. Confidence measures are associated with recognitions using Levenshtein distance between a string representing a motion signature and the regular expression describing an activity. We have used our framework to recognize trajectory-based activities like vehicle turns (U-turns, left and right turns, and K-turns), vehicle start and stop, person running and walking, and periodic articulated activities like digging, waving, boxing, and clapping in videos from the VIRAT public dataset, the KTH dataset, and a set of videos obtained from YouTube. Next, we present a core sampling framework that is able to use activation maps from several layers of a Convolutional Neural Network (CNN) as features to another neural network using transfer learning to provide an understanding of an input image. The intermediate map responses of a Convolutional Neural Network (CNN) contain information about an image that can be used to extract contextual knowledge about it. Our framework creates a representation that combines features from the test data and the contextual knowledge gained from the responses of a pretrained network, processes it and feeds it to a separate Deep Belief Network. We use this representation to extract more information from an image at the pixel level, hence gaining understanding of the whole image. We experimentally demonstrate the usefulness of our framework using a pretrained VGG-16 model to perform segmentation on the BAERI dataset of Synthetic Aperture Radar (SAR) imagery and the CAMVID dataset. Using this framework, we also reconstruct images by removing noise from noisy character images. The reconstructed images are encoded using Quadtrees. Quadtrees can be an efficient representation in learning from sparse features. When we are dealing with handwritten character images, they are quite susceptible to noise. Hence, preprocessing stages to make the raw data cleaner can improve the efficacy of their use. We improve upon the efficiency of probabilistic quadtrees by using a pixel level classifier to extract the character pixels and remove noise from the images. The pixel level denoiser uses a pretrained CNN trained on a large image dataset and uses transfer learning to aid the reconstruction of characters. In this work, we primarily deal with classification of noisy characters and create the noisy versions of handwritten Bangla Numeral and Basic Character datasets and use them and the Noisy MNIST dataset to demonstrate the usefulness of our approach

Louisiana State University

The Mechanics of Embodiment: A Dialogue on Embodiment and Computational Modeling

Author: Angelo Cangelosi
Giovanni Pezzulo
Giovanni Pezzulo
Ken eMcRae
Lawrence W Barsalou
Martin H Fischer
Michael Spivey
Publication venue
Publication date: 01/01/2011
Field of study

Embodied theories are increasingly challenging traditional views of cognition by arguing that conceptual representations that constitute our knowledge are grounded in sensory and motor experiences, and processed at this sensorimotor level, rather than being represented and processed abstractly in an amodal conceptual system. Given the established empirical foundation, and the relatively underspecified theories to date, many researchers are extremely interested in embodied cognition but are clamouring for more mechanistic implementations. What is needed at this stage is a push toward explicit computational models that implement sensory-motor grounding as intrinsic to cognitive processes. In this article, six authors from varying backgrounds and approaches address issues concerning the construction of embodied computational models, and illustrate what they view as the critical current and next steps toward mechanistic theories of embodiment. The first part has the form of a dialogue between two fictional characters: Ernest, the �experimenter�, and Mary, the �computational modeller�. The dialogue consists of an interactive sequence of questions, requests for clarification, challenges, and (tentative) answers, and touches the most important aspects of grounded theories that should inform computational modeling and, conversely, the impact that computational modeling could have on embodied theories. The second part of the article discusses the most important open challenges for embodied computational modelling

Scholarship@Western

ZENODO

Directory of Open Access Journals

PubMed Central

Frontiers - Publisher Connector

Plymouth Electronic Archive and Research Library

The University of Manchester - Institutional Repository

Enlighten

PUblication MAnagement