Search CORE

25 research outputs found

Group Norm for Learning Structured SVMs with Unstructured Latent Variables

Author: Batra Dhruv
Chen Daozheng
Freeman William T.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/12/2013
Field of study

Latent variables models have been applied to a number of computer vision problems. However, the complexity of the latent space is typically left as a free design choice. A larger latent space results in a more expressive model, but such models are prone to over fitting and are slower to perform inference with. The goal of this paper is to regularize the complexity of the latent space and learn which hidden states are really relevant for prediction. Specifically, we propose using group-sparsity-inducing regularizers such as ℓ[subscript 1]-ℓ[subscript 2] to estimate the parameters of Structured SVMs with unstructured latent variables. Our experiments on digit recognition and object detection show that our approach is indeed able to control the complexity of latent space without any significant loss in accuracy of the learnt model.Quanta Computer (Firm)Google (Firm

DSpace@MIT

Crossref

Применение алгоритмов искусственного интеллекта для определения буллинга в видеопотоке в средних учебных заведениях

Author: Аубакирова Г.Г.
Куандыков А.А.
Куатбаева А.А.
Омаров А.Р.
Салаватов И.Г.
Publication venue: International Information Technology University
Publication date: 22/07/2021
Field of study

The article presents the basic concept of development of software for determination of bulling in the video stream.В статье представлена основная концепция разработки ПО для определения буллинга в видеопотоке

International Journal of Information and Communication Technologies

Алгоритмы определения и оценки позы человека на изображении и видеопоследовательности

Author: Аубакирова Г.Г.
Куандыков А.А.
Куатбаева А.А.
Омаров А.Р.
Салаватов И.Г.
Publication venue: International Information Technology University
Publication date: 22/07/2021
Field of study

The article presents the basic concept of developing algorithms for determining and evaluating the position of a person on an image and a video sequence.В статье представлена основная концепция разработки алгоритмов определения и оценки позы человека на изображении и видеопоследовательности

International Journal of Information and Communication Technologies

Deformable Part Models with Individual Part Scaling

Author: Dubout Charles
Fleuret François
Publication venue
Publication date: 01/01/2013
Field of study

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Multi-modal RGB–Depth–Thermal Human Body Segmentation

Author: Bahnsen Chris
Clapés Albert
Escalera Sergio
Moeslund Thomas B.
Møgelmose Andreas
Palmero Cristina
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/04/2016
Field of study

Crossref

VBN

Learning Separable Filters

Author: Fua Pascal
Lepetit Vincent
Rigamonti Roberto
Sironi Amos
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/02/2013
Field of study

Learning filters to produce sparse image representations in terms of overcomplete dictionaries has emerged as a powerful way to create image features for many different purposes. Unfortunately, these filters are usually both nu-merous and non-separable, making their use computation-ally expensive. In this paper, we show that such filters can be computed as linear combinations of a smaller number of separable ones, thus greatly reducing the computational complexity at no cost in terms of performance. This makes filter learning approaches practical even for large images or 3D volumes, and we show that we significantly outperform state-of-the-art methods on the linear structure extraction task, in terms of both accuracy and speed. Moreover, our approach is gen-eral and can be used on generic filter banks to reduce the complexity of the convolutions. 1

Infoscience - École polytechnique fédérale de Lausanne

CiteSeerX

Tracking Multiple Players using a Single Camera

Author: Ben Shitrit Horesh
Fleuret François
Fua Pascal
Raca Mirko
Publication venue: Springer Verlag
Publication date: 13/12/2012
Field of study

It has been shown that multi-people tracking could be successfullly formulated as a Linear Program to process the output of multiple fixed and synchronized cameras with overlapping fields of view. In this paper, we extend this approach to the more challenging single-camera case and show that it yields excellent performance, even when the camera moves. We validate our approach on a number of basketball matches and argue that using a properly retrained people detector is key to producing the probabilities of presence that are used as input to the Linear Program

Infoscience - École polytechnique fédérale de Lausanne

The Fastest Deformable Part Model for Object Detection

Author: Junjie Yan
Longyin Wen
Stan Z. Li
Zhen Lei
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2014
Field of study

This paper solves the speed bottleneck of deformable part model (DPM), while maintaining the accuracy in de-tection on challenging datasets. Three prohibitive steps in cascade version of DPM are accelerated, including 2D cor-relation between root filter and feature map, cascade part pruning and HOG feature extraction. For 2D correlation, the root filter is constrained to be low rank, so that 2D cor-relation can be calculated by more efficient linear combi-nation of 1D correlations. A proximal gradient algorithm is adopted to progressively learn the low rank filter in a dis-criminative manner. For cascade part pruning, neighbor-hood aware cascade is proposed to capture the dependence in neighborhood regions for aggressive pruning. Instead of explicit computation of part scores, hypotheses can be pruned by scores of neighborhoods under the first order ap-proximation. For HOG feature extraction, look-up tables are constructed to replace expensive calculations of orien-tation partition and magnitude with simpler matrix index operations. Extensive experiments show that (a) the pro-posed method is 4 times faster than the current fastest DPM method with similar accuracy on Pascal VOC, (b) the pro-posed method achieves state-of-the-art accuracy on pedes-trian and face detection task with frame-rate speed. 1

CiteSeerX

Crossref

Random Forest with Adaptive Local Template for Pedestrian Detection

Author: Mao Ye
Tao Li
Tao Xiang
Zijian Liu
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2015
Field of study

Pedestrian detection with large intraclass variations is still a challenging task in computer vision. In this paper, we propose a novel pedestrian detection method based on Random Forest. Firstly, we generate a few local templates with different sizes and different locations in positive exemplars. Then, the Random Forest is built whose splitting functions are optimized by maximizing class purity of matching the local templates to the training samples, respectively. To improve the classification accuracy, we adopt a boosting-like algorithm to update the weights of the training samples in a layer-wise fashion. During detection, the trained Random Forest will vote the category when a sliding window is input. Our contributions are the splitting functions based on local template matching with adaptive size and location and iteratively weight updating method. We evaluate the proposed method on 2 well-known challenging datasets: TUD pedestrians and INRIA pedestrians. The experimental results demonstrate that our method achieves state-of-the-art or competitive performance

Crossref

Directory of Open Access Journals