Search CORE

707 research outputs found

Simulation of Crowd Problems for Computer Vision

Author: Andrade Ernesto
Fisher Bob
Publication venue
Publication date: 01/01/2005
Field of study

Human behavior analysis in video surveillance: A Social Signal Processing perspective

Author: Alessio Del Bue
Marco Cristani
R. Raghavendra
Vittorio Murino
Publication venue: 'Elsevier BV'
Publication date: 01/01/2013
Field of study

The analysis of human activities is one of the most intriguing and important open issues for the automated video surveillance community. Since few years ago, it has been handled following a mere Computer Vision and Pattern Recognition perspective, where an activity corresponded to a temporal sequence of explicit actions (run, stop, sit, walk, etc.). Even under this simplistic assumption, the issue is hard, due to the strong diversity of the people appearance, the number of individuals considered (we may monitor single individuals, groups, crowd), the variability of the environmental conditions (indoor/outdoor, different weather conditions), and the kinds of sensors employed. More recently, the automated surveillance of human activities has been faced considering a new perspective, that brings in notions and principles from the social, affective, and psychological literature, and that is called Social Signal Processing (SSP). SSP employs primarily nonverbal cues, most of them are outside of conscious awareness, like face expressions and gazing, body posture and gestures, vocal characteristics, relative distances in the space and the like. This paper is the first review analyzing this new trend, proposing a structured snapshot of the state of the art and envisaging novel challenges in the surveillance domain where the cross-pollination of Computer Science technologies and Sociology theories may offer valid investigation strategies

Catalogo dei prodotti della ricerca

Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense

Author: Achinstein
Amsel
Argall
Baber
Baillargeon
Baillargeon
Baillargeon
Baillargeon
Baker
Baldwin
Baldwin
Bandara
Bardenhagen
Barrow
Bassili
Battaglia
Beck
Bentham
Benz
Bernstein
Berry
Biederman
Bird
Biro
Blake
Blakemore
Bloom
Blum
Bohg
Bonet
Bower
Bower
Brackbill
Bramley
Bridson
Broadbent
Brooks
Brubaker
Buehner
Burgoon
Byrne
Carey
Carpenter
Casasola
Catrambone
Chen
Chen
Chen
Cheng
Chickering
Choi
Chrastil
Clark
Cohn
Connell
Corrigan
Creem-Regehr
Csibra
Csibra
Dale
Daviet
Davis
Davis
Davis
Dayan
Dehaene
Delorme
Dennett
Dennett
DiManzo
DiSessa
Dittrich
Doeller
Donea
Dupoux
Edmonds
Edmonds
Elsner
Elsner
Emery
Fan
Fang
Fang
Fang
Feng
Fire
Fischer
Fisher
Fleming
Foerster
Foerster
Foley
Foo
Foster
Frank
Fu
Fyhn
Gao
Gao
Gao
Gao
Gast
Gaume
Gentner
Gentner
Gentner
Gergely
Gergely
Gergely
Gerstenberg
Gerstenberg
Gibson
Gibson
Gibson
Gibson
Gibson
Gick
Gillner
Glennerster
Gogel
Goldman
Goodall
Goodman
Goodman
Gootjes-Dreesbach
Gosselin
Graham
Greene
Greene
Grice
Guilkey
Guo
Hafting
Haidt
Hamlin
Hammer
Hammond
Han
Hartley
Haryu
He
Heider
Heider
Henrich
Hespos
Hespos
Hespos
Hespos
Hesse
Ho
Hoai
Holtzen
Holyoak
Holyoak
Holyoak
House
Hu
Hu
Huang
Huang
Hunt
Hurka
Iacoboni
Ibsen-Jensen
Ikeuchi
Imbens
Indow
Inhelder
Jacobs
Jaeggi
Jiang
Jiang
Jiang
Jiang
Johnson
Julesz
Julesz
Jäger
Kaiser
Kaiser
Kaiser
Kanade
Kaplan
Kellman
Killian
Kim
Kinney
Kleiman-Weiner
Kleinke
Klár
Koenderink
Koffka
Kohler
Kubricht
Köhler
Land
Lawson
Legg
Leslie
Levine
Lewis
Li
Li
Li
Liu
Liu
Liu
Liu
Liu
Liu
Liu
Lowe
Lowe
Lowe
Lu
Luneburg
Luo
Ma
Maclaurin
Malament
Malcolm
Marr
Marr
McCloskey
McCloskey
McCollough
McCoy
Melis
Meltzoff
Meyer
Michotte
Mikhail
Mill
Min
Minsky
Mishkin
Mnih
Mo
Monaghan
Mueller
Mueller
Mumford
Mumford
Nairn
Nakahashi
Narang
Needham
Nelson
Newton
Nisan
Nishant
Oakley
Ogle
Oliva
Oliva
Osa
O’Keefe
Pearl
Pentland
Peters
Pham
Piaget
Piaget
Potter
Potter
Premack
Qing
Rajalingham
Raven
Rawls
Reid
Rescorla
Rizzolatti
Robb
Robb
Roberts
Rolfs
Rosenbaum
Rousselet
Rubin
Russell
Sadanand
Sadeghirad
Santos
Scholl
Scholl
Scholl
Scholl
Schreyer
Schrier
Schwartz
Schwing
Schyns
Schyns
Serre
Shah
Shanks
Shao
Shapley
Shu
Shukla
Silver
Snow
Solso
Sperber
Spirtes
Stam
Stark
Stomakhin
Stomakhin
Strickland
Sulsky
Sulsky
Sulsky
Tampubolon
Tampuu
Thorpe
Thorpe
Tolman
Tomasello
Turek
Tversky
Ullman
Vuong
Wagemans
Wagemans
Wagner
Waldmann
Waltz
Wang
Wang
Wang
Wang
Wang
Wang
Warren
Weir
Wertheimer
White
Whiten
Winston
Winston
Wittgenstein
Wolper
Woodward
Woodward
Xie
Xie
Yamanobe
Yao
Yao
Yartsev
York
Yue
Zago
Zhang
Zhang
Zhao
Zhao
Zheng
Zhu
Zhu
Zhu
Publication venue: 'Elsevier BV'
Publication date: 20/04/2020
Field of study

Recent progress in deep learning is essentially based on a "big data for small tasks" paradigm, under which massive amounts of data are used to train a classifier for a single narrow task. In this paper, we call for a shift that flips this paradigm upside down. Specifically, we propose a "small data for big tasks" paradigm, wherein a single artificial intelligence (AI) system is challenged to develop "common sense", enabling it to solve a wide range of tasks with little training data. We illustrate the potential power of this new paradigm by reviewing models of common sense that synthesize recent breakthroughs in both machine and human vision. We identify functionality, physics, intent, causality, and utility (FPICU) as the five core domains of cognitive AI with humanlike common sense. When taken as a unified concept, FPICU is concerned with the questions of "why" and "how", beyond the dominant "what" and "where" framework for understanding vision. They are invisible in terms of pixels but nevertheless drive the creation, maintenance, and development of visual scenes. We therefore coin them the "dark matter" of vision. Just as our universe cannot be understood by merely studying observable matter, we argue that vision cannot be understood without studying FPICU. We demonstrate the power of this perspective to develop cognitive AI systems with humanlike common sense by showing how to observe and apply FPICU with little training data to solve a wide range of challenging tasks, including tool use, planning, utility inference, and social learning. In summary, we argue that the next generation of AI must embrace "dark" humanlike common sense for solving novel tasks.Comment: For high quality figures, please refer to http://wellyzhang.github.io/attach/dark.pd

arXiv.org e-Print Archive

DSpace@MIT

Crossref

Unsupervised Long-Term Routine Modelling Using Dynamic Bayesian Networks

Author: Bull David
Damen Dima
Xu Yangdi
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/02/2018
Field of study

Explore Bristol Research

Semantic multimedia analysis using knowledge and context

Author: Nikolopoulos Spyridon
Publication venue: 'Queen Mary University of London'
Publication date: 01/01/2012
Field of study

PhDThe difficulty of semantic multimedia analysis can be attributed to the extended diversity in form and appearance exhibited by the majority of semantic concepts and the difficulty to express them using a finite number of patterns. In meeting this challenge there has been a scientific debate on whether the problem should be addressed from the perspective of using overwhelming amounts of training data to capture all possible instantiations of a concept, or from the perspective of using explicit knowledge about the concepts’ relations to infer their presence. In this thesis we address three problems of pattern recognition and propose solutions that combine the knowledge extracted implicitly from training data with the knowledge provided explicitly in structured form. First, we propose a BNs modeling approach that defines a conceptual space where both domain related evi- dence and evidence derived from content analysis can be jointly considered to support or disprove a hypothesis. The use of this space leads to sig- nificant gains in performance compared to analysis methods that can not handle combined knowledge. Then, we present an unsupervised method that exploits the collective nature of social media to automatically obtain large amounts of annotated image regions. By proving that the quality of the obtained samples can be almost as good as manually annotated images when working with large datasets, we significantly contribute towards scal- able object detection. Finally, we introduce a method that treats images, visual features and tags as the three observable variables of an aspect model and extracts a set of latent topics that incorporates the semantics of both visual and tag information space. By showing that the cross-modal depen- dencies of tagged images can be exploited to increase the semantic capacity of the resulting space, we advocate the use of all existing information facets in the semantic analysis of social media

Queen Mary Research Online

Knowledge Extraction in Video Through the Interaction Analysis of Activities

Author: Florez Omar Ulises
Publication venue: DigitalCommons@USU
Publication date: 01/05/2013
Field of study

Video is a massive amount of data that contains complex interactions between moving objects. The extraction of knowledge from this type of information creates a demand for video analytics systems that uncover statistical relationships between activities and learn the correspondence between content and labels. However, those are open research problems that have high complexity when multiple actors simultaneously perform activities, videos contain noise, and streaming scenarios are considered. The techniques introduced in this dissertation provide a basis for analyzing video. The primary contributions of this research consist of providing new algorithms for the efficient search of activities in video, scene understanding based on interactions between activities, and the predicting of labels for new scenes

DigitalCommons@USU

Proceedings of the 2020 Joint Workshop of Fraunhofer IOSB and Institute for Anthropomatics, Vision and Fusion Laboratory

Author: Beyerer Jürgen
Zander Tim
Publication venue: KIT Scientific Publishing, Karlsruhe
Publication date: 01/01/2021
Field of study

In 2020 fand der jährliche Workshop des Faunhofer IOSB und the Lehrstuhls für interaktive Echtzeitsysteme statt. Vom 27. bis zum 31. Juli trugen die Doktorranden der beiden Institute über den Stand ihrer Forschung vor in Themen wie KI, maschinellen Lernen, computer vision, usage control, Metrologie vor. Die Ergebnisse dieser Vorträge sind in diesem Band als technische Berichte gesammelt

KITopen

SEGMENTATION, RECOGNITION, AND ALIGNMENT OF COLLABORATIVE GROUP MOTION

Author: Li Ruonan
Publication venue
Publication date: 01/01/2011
Field of study

Modeling and recognition of human motion in videos has broad applications in behavioral biometrics, content-based visual data analysis, security and surveillance, as well as designing interactive environments. Significant progress has been made in the past two decades by way of new models, methods, and implementations. In this dissertation, we focus our attention on a relatively less investigated sub-area called collaborative group motion analysis. Collaborative group motions are those that typically involve multiple objects, wherein the motion patterns of individual objects may vary significantly in both space and time, but the collective motion pattern of the ensemble allows characterization in terms of geometry and statistics. Therefore, the motions or activities of an individual object constitute local information. A framework to synthesize all local information into a holistic view, and to explicitly characterize interactions among objects, involves large scale global reasoning, and is of significant complexity. In this dissertation, we first review relevant previous contributions on human motion/activity modeling and recognition, and then propose several approaches to answer a sequence of traditional vision questions including 1) which of the motion elements among all are the ones relevant to a group motion pattern of interest (Segmentation); 2) what is the underlying motion pattern (Recognition); and 3) how two motion ensembles are similar and how we can 'optimally' transform one to match the other (Alignment). Our primary practical scenario is American football play, where the corresponding problems are 1) who are offensive players; 2) what are the offensive strategy they are using; and 3) whether two plays are using the same strategy and how we can remove the spatio-temporal misalignment between them due to internal or external factors. The proposed approaches discard traditional modeling paradigm but explore either concise descriptors, hierarchies, stochastic mechanism, or compact generative model to achieve both effectiveness and efficiency. In particular, the intrinsic geometry of the spaces of the involved features/descriptors/quantities is exploited and statistical tools are established on these nonlinear manifolds. These initial attempts have identified new challenging problems in complex motion analysis, as well as in more general tasks in video dynamics. The insights gained from nonlinear geometric modeling and analysis in this dissertation may hopefully be useful toward a broader class of computer vision applications

Digital Repository at the University of Maryland