13,246 research outputs found
Feature fusion reveals slow and fast visual memories
Although the visual system can achieve a coarse classification of its inputs in a relatively short time, the synthesis of qualia-rich and detailed percepts can take substantially more time. If these prolonged computations were to take place in a retinotopic space, moving objects would generate extensive smear. However, under normal viewing conditions, moving objects appear relatively sharp and clear, suggesting that a substantial part of visual short-term memory takes place at a nonretinotopic locus. By using a retinotopic feature fusion and a nonretinotopic feature attribution paradigm, we provide evidence for a relatively fast retinotopic buffer and a substantially slower nonretinotopic memory. We present a simple model that can account for the dynamics of these complementary memory processes. Taken together, our results indicate that the visual system can accomplish temporal integration of information while avoiding smear by breaking off sensory memory into fast and slow components that are implemented in retinotopic and nonretinotopic loci, respectively
Acoustic and Device Feature Fusion for Load Recognition
Appliance-specific Load Monitoring (LM) provides a possible solution to the problem of energy conservation which is becoming increasingly challenging, due to growing energy demands within offices and residential spaces. It is essential to perform automatic appliance recognition and monitoring for optimal resource utilization. In this paper, we study the use of non-intrusive LM methods that rely on steady-state appliance signatures for classifying most commonly used office appliances, while demonstrating their limitation in terms of accurately discerning the low-power devices due to overlapping load signatures. We propose a multilayer decision architecture that makes use of audio features derived from device sounds and fuse it with load signatures acquired from energy meter. For the recognition of device sounds, we perform feature set selection by evaluating the combination of time-domain and FFT-based audio features on the state of the art machine learning algorithms. The highest recognition performance however is shown by support vector machines, for the device and audio recognition experiments. Further, we demonstrate that our proposed feature set which is a concatenation of device audio feature and load signature significantly improves the device recognition accuracy in comparison to the use of steady-state load signatures only
Attentional Feature Fusion
Feature fusion, the combination of features from different layers or
branches, is an omnipresent part of modern network architectures. It is often
implemented via simple operations, such as summation or concatenation, but this
might not be the best choice. In this work, we propose a uniform and general
scheme, namely attentional feature fusion, which is applicable for most common
scenarios, including feature fusion induced by short and long skip connections
as well as within Inception layers. To better fuse features of inconsistent
semantics and scales, we propose a multi-scale channel attention module, which
addresses issues that arise when fusing features given at different scales. We
also demonstrate that the initial integration of feature maps can become a
bottleneck and that this issue can be alleviated by adding another level of
attention, which we refer to as iterative attentional feature fusion. With
fewer layers or parameters, our models outperform state-of-the-art networks on
both CIFAR-100 and ImageNet datasets, which suggests that more sophisticated
attention mechanisms for feature fusion hold great potential to consistently
yield better results compared to their direct counterparts. Our codes and
trained models are available online.Comment: Accepted by WACV 202
Feature-Fused SSD: Fast Detection for Small Objects
Small objects detection is a challenging task in computer vision due to its
limited resolution and information. In order to solve this problem, the
majority of existing methods sacrifice speed for improvement in accuracy. In
this paper, we aim to detect small objects at a fast speed, using the best
object detector Single Shot Multibox Detector (SSD) with respect to
accuracy-vs-speed trade-off as base architecture. We propose a multi-level
feature fusion method for introducing contextual information in SSD, in order
to improve the accuracy for small objects. In detailed fusion operation, we
design two feature fusion modules, concatenation module and element-sum module,
different in the way of adding contextual information. Experimental results
show that these two fusion modules obtain higher mAP on PASCALVOC2007 than
baseline SSD by 1.6 and 1.7 points respectively, especially with 2-3 points
improvement on some smallobjects categories. The testing speed of them is 43
and 40 FPS respectively, superior to the state of the art Deconvolutional
single shot detector (DSSD) by 29.4 and 26.4 FPS. Code is available at
https://github.com/wnzhyee/Feature-Fused-SSD. Keywords: small object detection,
feature fusion, real-time, single shot multi-box detectorComment: Artificial Intelligence;8 pages,8 figure
- …