314 research outputs found
Comparing feature matching for object categorization in video surveillance
In this paper we consider an object categorization system using local HMAX features. Two feature matching techniques are compared: the MAX technique, originally proposed in the HMAX framework, and the histogram technique originating from Bag-of-Words literature. We have found that each of these techniques have their own field of operation. The histogram technique clearly outperforms the MAX technique with 5-15% for small dictionaries up to 500-1,000 features, favoring this technique for embedded (surveillance) applications. Additionally, we have evaluated the influence of interest point operators in the system. A first experiment analyzes the effect of dictionary creation and has showed that random dictionaries outperform dictionaries created from Hessian-Laplace points. Secondly, the effect of operators in the dictionary matching stage has been evaluated. Processing all image points outperforms the point selection from the Hessian-Laplace operator
Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks
Deep neural networks have emerged as a widely used and effective means for
tackling complex, real-world problems. However, a major obstacle in applying
them to safety-critical systems is the great difficulty in providing formal
guarantees about their behavior. We present a novel, scalable, and efficient
technique for verifying properties of deep neural networks (or providing
counter-examples). The technique is based on the simplex method, extended to
handle the non-convex Rectified Linear Unit (ReLU) activation function, which
is a crucial ingredient in many modern neural networks. The verification
procedure tackles neural networks as a whole, without making any simplifying
assumptions. We evaluated our technique on a prototype deep neural network
implementation of the next-generation airborne collision avoidance system for
unmanned aircraft (ACAS Xu). Results show that our technique can successfully
prove properties of networks that are an order of magnitude larger than the
largest networks verified using existing methods.Comment: This is the extended version of a paper with the same title that
appeared at CAV 201
A visual category filter for Google images
We extend the constellation model to include heterogeneous parts which may represent either the appearance or the geometry of a region of the object. The pans and their spatial configuration are learnt simultaneously and automatically, without supervision, from cluttered images.
We describe how this model can be employed for ranking the output of an image search engine when searching for object categories. It is shown that visual consistencies in the output images can be identified, and then used to rank the images according to their closeness to the visual object category.
Although the proportion of good images may be small, the algorithm is designed to be robust and is capable of learning in either a totally unsupervised manner, or with a very limited amount of supervision.
We demonstrate the method on image sets returned by Google's image search for a number of object categories including bottles, camels, cars, horses, tigers and zebras
Defining the essence of innovation how important terms in promoting of transformation processes in Ukraine
Feature hierarchies are essential to many visual object recognition systems and are well motivated by observations in biological systems. The present paper proposes an algorithm to incrementally compute feature hierarchies. The features are represented as estimated densities, using a variant of local soft histograms. The kernel functions used for this estimation in conjunction with their unitary extension establish a tight frame and results from framelet theory apply. Traversing the feature hierarchy requires resampling of the spatial and the feature bins. For the resampling, we derive a multi-resolution scheme for quadratic spline kernels and we derive an optimization algorithm for the upsampling. We complement the theoretic results by some illustrative experiments, consideration of convergence rate and computational efficiency.DIPLECSGARNICSELLII
Patch-Based Experiments with Object Classification in Video Surveillance
We present a patch-based algorithm for the purpose of object classification in video surveillance. Within detected regions-of-interest (ROIs) of moving objects in the scene, a feature vector is calculated based on template matching of a large set of image patches. Instead of matching direct image pixels, we use Gabor-filtered versions of the input image at several scales. This approach has been adopted from recent experiments in generic object-recognition tasks. We present results for a new typical video surveillance dataset containing over 9,000 object images. Furthermore, we compare our system performance with another existing smaller surveillance dataset. We have found that with 50 training samples or higher, our detection rate is on the average above 95%. Because of the inherent scalability of the algorithm, an embedded system implementation is well within reach
Integrated information increases with fitness in the evolution of animats
One of the hallmarks of biological organisms is their ability to integrate
disparate information sources to optimize their behavior in complex
environments. How this capability can be quantified and related to the
functional complexity of an organism remains a challenging problem, in
particular since organismal functional complexity is not well-defined. We
present here several candidate measures that quantify information and
integration, and study their dependence on fitness as an artificial agent
("animat") evolves over thousands of generations to solve a navigation task in
a simple, simulated environment. We compare the ability of these measures to
predict high fitness with more conventional information-theoretic processing
measures. As the animat adapts by increasing its "fit" to the world,
information integration and processing increase commensurately along the
evolutionary line of descent. We suggest that the correlation of fitness with
information integration and with processing measures implies that high fitness
requires both information processing as well as integration, but that
information integration may be a better measure when the task requires memory.
A correlation of measures of information integration (but also information
processing) and fitness strongly suggests that these measures reflect the
functional complexity of the animat, and that such measures can be used to
quantify functional complexity even in the absence of fitness data.Comment: 27 pages, 8 figures, one supplementary figure. Three supplementary
video files available on request. Version commensurate with published text in
PLoS Comput. Bio
Natural images from the birthplace of the human eye
Here we introduce a database of calibrated natural images publicly available
through an easy-to-use web interface. Using a Nikon D70 digital SLR camera, we
acquired about 5000 six-megapixel images of Okavango Delta of Botswana, a
tropical savanna habitat similar to where the human eye is thought to have
evolved. Some sequences of images were captured unsystematically while
following a baboon troop, while others were designed to vary a single parameter
such as aperture, object distance, time of day or position on the horizon.
Images are available in the raw RGB format and in grayscale. Images are also
available in units relevant to the physiology of human cone photoreceptors,
where pixel values represent the expected number of photoisomerizations per
second for cones sensitive to long (L), medium (M) and short (S) wavelengths.
This database is distributed under a Creative Commons Attribution-Noncommercial
Unported license to facilitate research in computer vision, psychophysics of
perception, and visual neuroscience.Comment: Submitted to PLoS ON
A survey of energy drink consumption patterns among college students
<p>Abstract</p> <p>Background</p> <p>Energy drink consumption has continued to gain in popularity since the 1997 debut of Red Bull, the current leader in the energy drink market. Although energy drinks are targeted to young adult consumers, there has been little research regarding energy drink consumption patterns among college students in the United States. The purpose of this study was to determine energy drink consumption patterns among college students, prevalence and frequency of energy drink use for six situations, namely for insufficient sleep, to increase energy (in general), while studying, driving long periods of time, drinking with alcohol while partying, and to treat a hangover, and prevalence of adverse side effects and energy drink use dose effects among college energy drink users.</p> <p>Methods</p> <p>Based on the responses from a 32 member college student focus group and a field test, a 19 item survey was used to assess energy drink consumption patterns of 496 randomly surveyed college students attending a state university in the Central Atlantic region of the United States.</p> <p>Results</p> <p>Fifty one percent of participants (<it>n </it>= 253) reported consuming greater than one energy drink each month in an average month for the current semester (defined as energy drink user). The majority of users consumed energy drinks for insufficient sleep (67%), to increase energy (65%), and to drink with alcohol while partying (54%). The majority of users consumed one energy drink to treat most situations although using three or more was a common practice to drink with alcohol while partying (49%). Weekly jolt and crash episodes were experienced by 29% of users, 22% reported ever having headaches, and 19% heart palpitations from consuming energy drinks. There was a significant dose effect only for jolt and crash episodes.</p> <p>Conclusion</p> <p>Using energy drinks is a popular practice among college students for a variety of situations. Although for the majority of situations assessed, users consumed one energy drink with a reported frequency of 1 – 4 days per month, many users consumed three or more when combining with alcohol while partying. Further, side effects from consuming energy drinks are fairly common, and a significant dose effect was found with jolt and crash episodes. Future research should identify if college students recognize the amounts of caffeine that are present in the wide variety of caffeine-containing products that they are consuming, the amounts of caffeine that they are consuming in various situations, and the physical side effects associated with caffeine consumption.</p
Recurrent Connections Aid Occluded Object Recognition by Discounting Occluders
Recurrent connections in the visual cortex are thought to aid object
recognition when part of the stimulus is occluded. Here we investigate if and
how recurrent connections in artificial neural networks similarly aid object
recognition. We systematically test and compare architectures comprised of
bottom-up (B), lateral (L) and top-down (T) connections. Performance is
evaluated on a novel stereoscopic occluded object recognition dataset. The task
consists of recognizing one target digit occluded by multiple occluder digits
in a pseudo-3D environment. We find that recurrent models perform significantly
better than their feedforward counterparts, which were matched in parametric
complexity. Furthermore, we analyze how the network's representation of the
stimuli evolves over time due to recurrent connections. We show that the
recurrent connections tend to move the network's representation of an occluded
digit towards its un-occluded version. Our results suggest that both the brain
and artificial neural networks can exploit recurrent connectivity to aid
occluded object recognition.Comment: 13 pages, 5 figures, accepted at the 28th International Conference on
Artificial Neural Networks, published in Springer Lecture Notes in Computer
Science vol 1172
- …