Search CORE

4,038 research outputs found

Temporal Interlacing Network

Author: Liu Yu
Qian Shengju
Shao Hao
Publication venue
Publication date: 17/01/2020
Field of study

For a long time, the vision community tries to learn the spatio-temporal representation by combining convolutional neural network together with various temporal models, such as the families of Markov chain, optical flow, RNN and temporal convolution. However, these pipelines consume enormous computing resources due to the alternately learning process for spatial and temporal information. One natural question is whether we can embed the temporal information into the spatial one so the information in the two domains can be jointly learned once-only. In this work, we answer this question by presenting a simple yet powerful operator -- temporal interlacing network (TIN). Instead of learning the temporal features, TIN fuses the two kinds of information by interlacing spatial representations from the past to the future, and vice versa. A differentiable interlacing target can be learned to control the interlacing process. In this way, a heavy temporal model is replaced by a simple interlacing operator. We theoretically prove that with a learnable interlacing target, TIN performs equivalently to the regularized temporal convolution network (r-TCN), but gains 4% more accuracy with 6x less latency on 6 challenging benchmarks. These results push the state-of-the-art performances of video understanding by a considerable margin. Not surprising, the ensemble model of the proposed TIN won the

1^{st}

place in the ICCV19 - Multi Moments in Time challenge. Code is made available to facilitate further research at https://github.com/deepcs233/TINComment: Accepted to AAAI 2020. Winning entry of ICCV Multi-Moments in Time Challenge 2019. Code is available at https://github.com/deepcs233/TI

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Disease Localization in Multilayer Networks

Author: Cozzo Emanuele
de Arruda Guilherme Ferraz
Moreno Yamir
Peixoto Tiago P.
Rodrigues Francisco A.
Publication venue: 'American Physical Society (APS)'
Publication date: 26/12/2016
Field of study

We present a continuous formulation of epidemic spreading on multilayer networks using a tensorial representation, extending the models of monoplex networks to this context. We derive analytical expressions for the epidemic threshold of the SIS and SIR dynamics, as well as upper and lower bounds for the disease prevalence in the steady state for the SIS scenario. Using the quasi-stationary state method we numerically show the existence of disease localization and the emergence of two or more susceptibility peaks, which are characterized analytically and numerically through the inverse participation ratio. Furthermore, when mapping the critical dynamics to an eigenvalue problem, we observe a characteristic transition in the eigenvalue spectra of the supra-contact tensor as a function of the ratio of two spreading rates: if the rate at which the disease spreads within a layer is comparable to the spreading rate across layers, the individual spectra of each layer merge with the coupling between layers. Finally, we verified the barrier effect, i.e., for three-layer configuration, when the layer with the largest eigenvalue is located at the center of the line, it can effectively act as a barrier to the disease. The formalism introduced here provides a unifying mathematical approach to disease contagion in multiplex systems opening new possibilities for the study of spreading processes.Comment: Revised version. 25 pages and 18 figure

arXiv.org e-Print Archive

Repositorio Universidad de Zaragoza

Directory of Open Access Journals

Anyone here? Smart embedded low-resolution omnidirectional video sensor to measure room occupancy

Author: Callemein Timothy
Goedemé Toon
Van Beeck Kristof
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 09/07/2020
Field of study

In this paper, we present a room occupancy sensing solution with unique properties: (i) It is based on an omnidirectional vision camera, capturing rich scene info over a wide angle, enabling to count the number of people in a room and even their position. (ii) Although it uses a camera-input, no privacy issues arise because its extremely low image resolution, rendering people unrecognisable. (iii) The neural network inference is running entirely on a low-cost processing platform embedded in the sensor, reducing the privacy risk even further. (iv) Limited manual data annotation is needed, because of the self-training scheme we propose. Such a smart room occupancy rate sensor can be used in e.g. meeting rooms and flex-desks. Indeed, by encouraging flex-desking, the required office space can be reduced significantly. In some cases, however, a flex-desk that has been reserved remains unoccupied without an update in the reservation system. A similar problem occurs with meeting rooms, which are often under-occupied. By optimising the occupancy rate a huge reduction in costs can be achieved. Therefore, in this paper, we develop such system which determines the number of people present in office flex-desks and meeting rooms. Using an omnidirectional camera mounted in the ceiling, combined with a person detector, the company can intelligently update the reservation system based on the measured occupancy. Next to the optimisation and embedded implementation of such a self-training omnidirectional people detection algorithm, in this work we propose a novel approach that combines spatial and temporal image data, improving performance of our system on extreme low-resolution images

arXiv.org e-Print Archive

Crossref

Perfect State Transfer, Effective Gates and Entanglement Generation in Engineered Bosonic and Fermionic Networks

Author: B. H. Bransden
B. N. Parlett
G. M. L. Gladwell
J. H. Wilkinson
M. A. Nielsen
M.-H. Yung
Man-Hong Yung
S. Sachdev
Sougato Bose
Publication venue: 'American Physical Society (APS)'
Publication date: 10/04/2005
Field of study

We show how to achieve perfect quantum state transfer and construct effective two-qubit gates between distant sites in engineered bosonic and fermionic networks. The Hamiltonian for the system can be determined by choosing an eigenvalue spectrum satisfying a certain condition, which is shown to be both sufficient and necessary in mirror-symmetrical networks. The natures of the effective two-qubit gates depend on the exchange symmetry for fermions and bosons. For fermionic networks, the gates are entangling (and thus universal for quantum computation). For bosonic networks, though the gates are not entangling, they allow two-way simultaneous communications. Protocols of entanglement generation in both bosonic and fermionic engineered networks are discussed.Comment: RevTeX4, 6 pages, 1 figure; replaced with a more general example and clarified the sufficient and necessary condition for perfect state transfe

arXiv.org e-Print Archive

Crossref

Subjective Quality Evaluation of H.264 High-Definition Video Coding versus Spatial Up-Scaling and Interlacing

Author: Barkowsky Marcus
Le Callet Patrick
Pitrey Yohann
Pépion Romuald
Publication venue: HAL CCSD
Publication date: 09/06/2010
Field of study

International audienceThe upcoming High-De nition format for video display provides high-quality content, especially when displayed on adapted devices. When combined with video coding techniques such as MPEG-4 AVC/H.264, the transmission of High-De nition video content on broadcast networks becomes possible. Nonetheless, transmitting and decoding such video content is a real challenge. Therefore, intermediate formats based on lower frame resolutions or interlaced coding are still provided to address targets with limited resources. Using these formats, the nal video quality depends on the postprocessing tools employed at the receiver to upsample and de-interlace these streams. In this paper, we compare the full-HD format to three possible scenarios to generate a full-HD stream from intermediate formats. We present the results of subjective tests that compare the visual quality of each scenario when using the same bitrate. The results show that using the same bitrate, the videos generated from lower-resolution formats reach similar quality compared to the full-HD videos