1,120 research outputs found
Using the Forest to See the Trees: Exploiting Context for Visual Object Detection and Localization
Recognizing objects in images is an active area of research in computer vision. In the last two decades, there has been much progress and there are already object recognition systems operating in commercial products. However, most of the algorithms for detecting objects perform an exhaustive search across all locations and scales in the image comparing local image regions with an object model. That approach ignores the semantic structure of scenes and tries to solve the recognition problem by brute force. In the real world, objects tend to covary with other objects, providing a rich collection of contextual associations. These contextual associations can be used to reduce the search space by looking only in places in which the object is expected to be; this also increases performance, by rejecting patterns that look like the target but appear in unlikely places.
Most modeling attempts so far have defined the context of an object in terms of other previously recognized objects. The drawback of this approach is that inferring the context becomes as difficult as detecting each object. An alternative view of context relies on using the entire scene information holistically. This approach is algorithmically attractive since it dispenses with the need for a prior step of individual object recognition. In this paper, we use a probabilistic framework for encoding the relationships between context and object properties and we show how an integrated system provides improved performance. We view this as a significant step toward general purpose machine vision systems.United States. National Geospatial-Intelligence Agency (NEGI-1582-04-0004)United States. Army Research Office. Multidisciplinary University Research Initiative (Grant Number N00014-06-1-0734)National Science Foundation (U.S.). (Contract IIS-0413232)National Defense Science and Engineering Graduate Fellowshi
'Part'ly first among equals: Semantic part-based benchmarking for state-of-the-art object recognition systems
An examination of object recognition challenge leaderboards (ILSVRC,
PASCAL-VOC) reveals that the top-performing classifiers typically exhibit small
differences amongst themselves in terms of error rate/mAP. To better
differentiate the top performers, additional criteria are required. Moreover,
the (test) images, on which the performance scores are based, predominantly
contain fully visible objects. Therefore, `harder' test images, mimicking the
challenging conditions (e.g. occlusion) in which humans routinely recognize
objects, need to be utilized for benchmarking. To address the concerns
mentioned above, we make two contributions. First, we systematically vary the
level of local object-part content, global detail and spatial context in images
from PASCAL VOC 2010 to create a new benchmarking dataset dubbed PPSS-12.
Second, we propose an object-part based benchmarking procedure which quantifies
classifiers' robustness to a range of visibility and contextual settings. The
benchmarking procedure relies on a semantic similarity measure that naturally
addresses potential semantic granularity differences between the category
labels in training and test datasets, thus eliminating manual mapping. We use
our procedure on the PPSS-12 dataset to benchmark top-performing classifiers
trained on the ILSVRC-2012 dataset. Our results show that the proposed
benchmarking procedure enables additional differentiation among
state-of-the-art object classifiers in terms of their ability to handle missing
content and insufficient object detail. Given this capability for additional
differentiation, our approach can potentially supplement existing benchmarking
procedures used in object recognition challenge leaderboards.Comment: Extended version of our ACCV-2016 paper. Author formatting modifie
Realización de un filtro activo de potencia empleando una FPGA
En este artículo se presenta un sistema para compensación de reactiva y eliminación de armónicos en la conexión de una carga a una red eléctrica. El sistema emplea una simple tarjeta que incorpora una FPGA, un microprocesador, una memoria doble puerta y convertidores A/D, y que genera directamente los pulsos de disparo de los elementos de conmutación de un inversor trifásico
Intestinal fungi contribute to development of alcoholic liver disease
This study was supported in part by NIH grants R01 AA020703, U01 AA021856 and by Award Number I01BX002213 from the Biomedical Laboratory Research & Development Service of the VA Office of Research and Development (to B.S.). K.H. was supported by a DFG (Deutsche Forschungsgemeinschaft) fellowship (HO/ 5690/1-1). S.B. was supported by a grant from the Swiss National Science Foundation (P2SKP3_158649). G.G. received funding from the Yale Liver Center NIH P30 DK34989 and R.B. from NIAAA grant U01 AA021908. A.K. received support from NIH grants RC2 AA019405, R01 AA020216 and R01 AA023417. G.D.B. is supported by funds from the Wellcome Trust. We acknowledge the Human Tissue and Cell Research (HTCR) Foundation for making human tissue available for research and Hepacult GmbH (Munich, Germany) for providing primary human hepatocytes for in vitro analyses. We thank Dr. Chien-Yu Lin Department of Medicine, Fu-Jen Catholic University, Taiwan for statistical analysis.Peer reviewedPublisher PD
Modelling search for people in 900 scenes: A combined source model of eye guidance
How predictable are human eye movements during search in real world scenes? We recorded 14 observers’ eye movements as they performed a search task (person detection) in 912 outdoor scenes. Observers were highly consistent in the regions fixated during search, even when the target was absent from the scene. These eye movements were used to evaluate computational models of search guidance from three sources: Saliency, target features, and scene context. Each of these models independently outperformed a cross-image control in predicting human fixations. Models that combined sources of guidance ultimately predicted 94% of human agreement, with the scene context component providing the most explanatory power. None of the models, however, could reach the precision and fidelity of an attentional map defined by human fixations. This work puts forth a benchmark for computational models of search in real world scenes. Further improvements in modelling should capture mechanisms underlying the selectivity of observers’ fixations during search.National Eye Institute (Integrative Training Program in Vision grant T32 EY013935)Massachusetts Institute of Technology (Singleton Graduate Research Fellowship)National Science Foundation (U.S.) (Graduate Research Fellowship)National Science Foundation (U.S.) (CAREER Award (0546262))National Science Foundation (U.S.) (NSF contract (0705677))National Science Foundation (U.S.) (Career Award (0747120)
O(N) methods in electronic structure calculations
Linear scaling methods, or O(N) methods, have computational and memory
requirements which scale linearly with the number of atoms in the system, N, in
contrast to standard approaches which scale with the cube of the number of
atoms. These methods, which rely on the short-ranged nature of electronic
structure, will allow accurate, ab initio simulations of systems of
unprecedented size. The theory behind the locality of electronic structure is
described and related to physical properties of systems to be modelled, along
with a survey of recent developments in real-space methods which are important
for efficient use of high performance computers. The linear scaling methods
proposed to date can be divided into seven different areas, and the
applicability, efficiency and advantages of the methods proposed in these areas
is then discussed. The applications of linear scaling methods, as well as the
implementations available as computer programs, are considered. Finally, the
prospects for and the challenges facing linear scaling methods are discussed.Comment: 85 pages, 15 figures, 488 references. Resubmitted to Rep. Prog. Phys
(small changes
Natural images from the birthplace of the human eye
Here we introduce a database of calibrated natural images publicly available
through an easy-to-use web interface. Using a Nikon D70 digital SLR camera, we
acquired about 5000 six-megapixel images of Okavango Delta of Botswana, a
tropical savanna habitat similar to where the human eye is thought to have
evolved. Some sequences of images were captured unsystematically while
following a baboon troop, while others were designed to vary a single parameter
such as aperture, object distance, time of day or position on the horizon.
Images are available in the raw RGB format and in grayscale. Images are also
available in units relevant to the physiology of human cone photoreceptors,
where pixel values represent the expected number of photoisomerizations per
second for cones sensitive to long (L), medium (M) and short (S) wavelengths.
This database is distributed under a Creative Commons Attribution-Noncommercial
Unported license to facilitate research in computer vision, psychophysics of
perception, and visual neuroscience.Comment: Submitted to PLoS ON
Microbial sucession dynamics in the forefield of Breiðamerkurjokull Glacier (Iceland)
FEMS 2017 (7th. 2017. Valencia)Backgrounds One key consequence of glacier recession, as effect of climatic change, is the creation of new habitats for colonization. In glacier forefields, primary succession occurs simultaneously in soils and rocks recently discovered offering a type of natural experiment in which temporal colonization dynamics can be analyzed. Objectives A chronosequence established at Breiðamerkurjökull Glacier forefield, was used as a framework to analyze primary microbial succession processes in subarctic regions. This outlet glacier stretches to southeast from Vatnajökull Glacier and has been dramatically retreating during the 20th century. Methods Soil samples from different succession stages were collected. Microbial community structure was analyzed by high-throughput amplicon sequencing. Potential microbial activity (microbial respiration, N mineralization) as well as different soil attributes were also measured in these samples
Highlights from the Pierre Auger Observatory
The Pierre Auger Observatory is the world's largest cosmic ray observatory.
Our current exposure reaches nearly 40,000 km str and provides us with an
unprecedented quality data set. The performance and stability of the detectors
and their enhancements are described. Data analyses have led to a number of
major breakthroughs. Among these we discuss the energy spectrum and the
searches for large-scale anisotropies. We present analyses of our X
data and show how it can be interpreted in terms of mass composition. We also
describe some new analyses that extract mass sensitive parameters from the 100%
duty cycle SD data. A coherent interpretation of all these recent results opens
new directions. The consequences regarding the cosmic ray composition and the
properties of UHECR sources are briefly discussed.Comment: 9 pages, 12 figures, talk given at the 33rd International Cosmic Ray
Conference, Rio de Janeiro 201
Multi-resolution anisotropy studies of ultrahigh-energy cosmic rays detected at the Pierre Auger Observatory
We report a multi-resolution search for anisotropies in the arrival
directions of cosmic rays detected at the Pierre Auger Observatory with local
zenith angles up to and energies in excess of 4 EeV ( eV). This search is conducted by measuring the angular power spectrum
and performing a needlet wavelet analysis in two independent energy ranges.
Both analyses are complementary since the angular power spectrum achieves a
better performance in identifying large-scale patterns while the needlet
wavelet analysis, considering the parameters used in this work, presents a
higher efficiency in detecting smaller-scale anisotropies, potentially
providing directional information on any observed anisotropies. No deviation
from isotropy is observed on any angular scale in the energy range between 4
and 8 EeV. Above 8 EeV, an indication for a dipole moment is captured; while no
other deviation from isotropy is observed for moments beyond the dipole one.
The corresponding -values obtained after accounting for searches blindly
performed at several angular scales, are in the case of
the angular power spectrum, and in the case of the needlet
analysis. While these results are consistent with previous reports making use
of the same data set, they provide extensions of the previous works through the
thorough scans of the angular scales.Comment: Published version. Added journal reference and DOI. Added Report
Numbe
- …
