Search CORE

2,556 research outputs found

Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition

Authors: Alahi Alexandre, Bagautdinov Timur, Fleuret François, Fua Pascal, Savarese Silvio
Publication date: 28/11/2016

We present a unified framework for understanding human social behaviors in raw image sequences. Our model jointly detects multiple individuals, infers their social actions, and estimates the collective actions with a single feed-forward pass through a neural network. We propose a single architecture that does not rely on external detection algorithms but rather is trained end-to-end to generate dense proposal maps that are refined via a novel inference scheme. Temporal consistency is handled via a person-level matching Recurrent Neural Network. The complete model takes as input a sequence of frames and outputs detections along with the estimates of individual actions and collective activities. We demonstrate state-of-the-art performance of our algorithm on multiple publicly available benchmarks.

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Crossref

FollowMe: Efficient Online Min-Cost Flow Tracking with Bounded Memory and Computation

Authors: Geiger Andreas, Lenz Philip, Urtasun Raquel
Publication date: 25/12/2014

One of the most popular approaches to multi-target tracking is tracking-by-detection. Current min-cost flow algorithms, which solve the data association problem optimally, have three main drawbacks: they are computationally expensive, they assume that the whole video is given as a batch, and they scale badly in memory and computation with the length of the video sequence. In this paper, we address each of these issues, resulting in a computationally and memory-bounded solution. First, we introduce a dynamic version of the successive shortest-path algorithm which solves the data association problem optimally while reusing computation, resulting in significantly faster inference than standard solvers. Second, we address the optimal solution to the data association problem when dealing with an incoming stream of data (i.e., the online setting). Finally, we present our main contribution, an approximate online solution with bounded memory and computation that can handle videos of arbitrary length while tracking in real time. We demonstrate the effectiveness of our algorithms on the KITTI and PETS2009 benchmarks and show state-of-the-art performance, while being significantly faster than existing solvers.

arXiv.org e-Print Archive

CiteSeerX

Crossref

MPG.PuRe

A Study on Anomaly Detection Methods for Surveillance Camera Video Using Generative Adversarial Networks

Author: Saypadith Savath

Osaka University Knowledge Archive