380 research outputs found
Speaker-following Video Subtitles
We propose a new method for improving the presentation of subtitles in video
(e.g. TV and movies). With conventional subtitles, the viewer has to constantly
look away from the main viewing area to read the subtitles at the bottom of the
screen, which disrupts the viewing experience and causes unnecessary eyestrain.
Our method places on-screen subtitles next to the respective speakers to allow
the viewer to follow the visual content while simultaneously reading the
subtitles. We use novel identification algorithms to detect the speakers based
on audio and visual information. Then the placement of the subtitles is
determined using global optimization. A comprehensive usability study indicated
that our subtitle placement method outperformed both conventional
fixed-position subtitling and another previous dynamic subtitling method in
terms of enhancing the overall viewing experience and reducing eyestrain
Harvesting Discriminative Meta Objects with Deep CNN Features for Scene Classification
Recent work on scene classification still makes use of generic CNN features
in a rudimentary manner. In this ICCV 2015 paper, we present a novel pipeline
built upon deep CNN features to harvest discriminative visual objects and parts
for scene classification. We first use a region proposal technique to generate
a set of high-quality patches potentially containing objects, and apply a
pre-trained CNN to extract generic deep features from these patches. Then we
perform both unsupervised and weakly supervised learning to screen these
patches and discover discriminative ones representing category-specific objects
and parts. We further apply discriminative clustering enhanced with local CNN
fine-tuning to aggregate similar objects and parts into groups, called meta
objects. A scene image representation is constructed by pooling the feature
response maps of all the learned meta objects at multiple spatial scales. We
have confirmed that the scene image representation obtained using this new
pipeline is capable of delivering state-of-the-art performance on two popular
scene benchmark datasets, MIT Indoor 67~\cite{MITIndoor67} and
Sun397~\cite{Sun397}Comment: To Appear in ICCV 201
Magnetic Proximity Effect and Interlayer Exchange Coupling of Ferromagnetic/Topological Insulator/Ferromagnetic Trilayer
Magnetic proximity effect between topological insulator (TI) and
ferromagnetic insulator (FMI) is considered to have great potential in
spintronics. However, a complete determination of interfacial magnetic
structure has been highly challenging. We theoretically investigate the
interlayer exchange coupling of two FMIs separated by a TI thin film, and show
that the particular electronic states of the TI contributing to the proximity
effect can be directly identified through the coupling behavior between two
FMIs, together with a tunability of coupling constant. Such FMI/TI/FMI
structure not only serves as a platform to clarify the magnetic structure of
FMI/TI interface, but also provides insights into designing the magnetic
storage devices with ultrafast response.Comment: 7 pages, 4 figure
Recommended from our members
Supplement of the radiance-based method to validate satellite-derived land surface temperature products over heterogeneous land surfaces
Land surface temperature (LST) retrieved from satellite remote sensing data has become a key parameter in research on global environmental change; therefore, the acquisition of accurate satellite-derived LST information is crucial for the diagnosis and analysis of global change. However, it is relatively difficult to obtain the true value of a pixel due to the scale mismatch between in situ measurements and satellite-based observations, especially for commonly heterogeneous and nonisothermal land areas, which greatly increases the difficulty in estimating pixel-representative LST values from in situ measurements for validation of satellite-based LST products. In this study, a supplemented radiance-based (SR-based) validation method was developed to evaluate the latest moderate resolution imaging spectroradiometer (MODIS) Collection 6 Level 2 daily LST/land surface emissivity (LSE) products over a heterogeneous and nonisothermal region of the Heihe Watershed Allied Telemetry Experimental Research (HiWATER) project, West China. In the SR-based framework, pixel-representative LST values are simulated by the MODTRAN model from the corresponding in situ measurements, such as LSE and atmospheric profile measurements, to evaluate the MODIS LST products. The validation results show that the MODIS daytime LST products from the Aqua satellite (MYD11_L2) have a greater accuracy than those from the Terra satellite (MOD11_L2). Analyses of the effect factors indicate a strong correlation between the errors in the MOD11_L2 LST product and the corresponding difference in the MODIS brightness temperature between bands 31 and 32. Although the requirement of synchronous or quasisynchronous in situ measurements for the validated LST products may limit the applicability of the SR-based method, it is still an effective and simple method for validating satellite-derived LST products over mixed pixels. Our method is an indispensable supplement for the validation methods of satellite-derived LST products, and it can be applied in West China and other areas with heterogeneous land surfaces
A novel direct power control for open-winding brushless doubly-fed reluctance generators fed by dual two-level converters using a common DC bus
A new direct power control (DPC) strategy for open-winding brushless doubly-fed reluctance generators (BDFRGs) with variable speed constant frequency is proposed. The control winding is open-circuited and fed by dual traditional two-level three phase converters using a common DC bus, and the DPC strategy aiming at maximum power point tracking and common mode voltage elimination is designed. Compared to the traditional three-level converter systems, the DC bus voltage, the voltage rating of power devices and capacity of the single two-level converter are all reduced by 50% while the reliability, redundancy and fault tolerance of the proposed system still greatly improved. Consequently its effectiveness is evaluated by simulation tests on a 42 kW prototype generator in MATLAB/SIMULINK
Photo2Relief: Let Human in the Photograph Stand Out
In this paper, we propose a technique for making humans in photographs
protrude like reliefs. Unlike previous methods which mostly focus on the face
and head, our method aims to generate art works that describe the whole body
activity of the character. One challenge is that there is no ground-truth for
supervised deep learning. We introduce a sigmoid variant function to manipulate
gradients tactfully and train our neural networks by equipping with a loss
function defined in gradient domain. The second challenge is that actual
photographs often across different light conditions. We used image-based
rendering technique to address this challenge and acquire rendering images and
depth data under different lighting conditions. To make a clear division of
labor in network modules, a two-scale architecture is proposed to create
high-quality relief from a single photograph. Extensive experimental results on
a variety of scenes show that our method is a highly effective solution for
generating digital 2.5D artwork from photographs.Comment: 10 pages, 11 figure
- …