349 research outputs found
MixUp as Locally Linear Out-Of-Manifold Regularization
MixUp is a recently proposed data-augmentation scheme, which linearly
interpolates a random pair of training examples and correspondingly the one-hot
representations of their labels. Training deep neural networks with such
additional data is shown capable of significantly improving the predictive
accuracy of the current art. The power of MixUp, however, is primarily
established empirically and its working and effectiveness have not been
explained in any depth. In this paper, we develop an understanding for MixUp as
a form of "out-of-manifold regularization", which imposes certain "local
linearity" constraints on the model's input space beyond the data manifold.
This analysis enables us to identify a limitation of MixUp, which we call
"manifold intrusion". In a nutshell, manifold intrusion in MixUp is a form of
under-fitting resulting from conflicts between the synthetic labels of the
mixed-up examples and the labels of original training data. Such a phenomenon
usually happens when the parameters controlling the generation of mixing
policies are not sufficiently fine-tuned on the training data. To address this
issue, we propose a novel adaptive version of MixUp, where the mixing policies
are automatically learned from the data using an additional network and
objective function designed to avoid manifold intrusion. The proposed
regularizer, AdaMixUp, is empirically evaluated on several benchmark datasets.
Extensive experiments demonstrate that AdaMixUp improves upon MixUp when
applied to the current art of deep classification models.Comment: Accepted by AAAI201
Latent Embeddings for Collective Activity Recognition
Rather than simply recognizing the action of a person individually,
collective activity recognition aims to find out what a group of people is
acting in a collective scene. Previ- ous state-of-the-art methods using
hand-crafted potentials in conventional graphical model which can only define a
limited range of relations. Thus, the complex structural de- pendencies among
individuals involved in a collective sce- nario cannot be fully modeled. In
this paper, we overcome these limitations by embedding latent variables into
feature space and learning the feature mapping functions in a deep learning
framework. The embeddings of latent variables build a global relation
containing person-group interac- tions and richer contextual information by
jointly modeling broader range of individuals. Besides, we assemble atten- tion
mechanism during embedding for achieving more com- pact representations. We
evaluate our method on three col- lective activity datasets, where we
contribute a much larger dataset in this work. The proposed model has achieved
clearly better performance as compared to the state-of-the- art methods in our
experiments.Comment: 6pages, accepted by IEEE-AVSS201
Influence of Waterside Buildings’ Layout on Wind Environment and the Relation with Design Based on a Case Study of the She Kou Residential District
It is important to improve residential thermal comfort in the high dense cities, in which wind environment is crucial. Waterside buildings take an advantage of micro-hydrological-climate in summer that should be used to enhance residential thermal comfort especially in the subtropical region. In order to propose design approaches according to the outdoor thermal comfort of the waterside residential, a case study of Shenzhen She Kou residential district has been made. It focused on various factors that could have influence on wind environment for improving thermal comfort. Using wind velocity ratio (ΔRi) criterion, factors of building development volume, building direction and layout pattern, open space arrangement etc. have been broadly explored using FLUENT simulation. To planning parameters, the Floor Area Ratio (FAR) is significantly influence wind environment, the smaller FAR is better. To the vertical layout of the buildings, multi-storey layout and multi-storey & sub high-rise mixed layout would provide better wind environment. To the horizontal layout, the determinant is better than the peripheral. Other factors such as the buildings’ direction towards the road, buildings’ height, and open space setting, have influence on wind environment yet. In general, the more benefit of design layout for wind breezing, the better wind environment it could ge
Phase perturbation improves channel robustness for speech spoofing countermeasures
In this paper, we aim to address the problem of channel robustness in speech
countermeasure (CM) systems, which are used to distinguish synthetic speech
from human natural speech. On the basis of two hypotheses, we suggest an
approach for perturbing phase information during the training of time-domain CM
systems. Communication networks often employ lossy compression codec that
encodes only magnitude information, therefore heavily altering phase
information. Also, state-of-the-art CM systems rely on phase information to
identify spoofed speech. Thus, we believe the information loss in the phase
domain induced by lossy compression codec degrades the performance of the
unseen channel. We first establish the dependence of time-domain CM systems on
phase information by perturbing phase in evaluation, showing strong
degradation. Then, we demonstrated that perturbing phase during training leads
to a significant performance improvement, whereas perturbing magnitude leads to
further degradation.Comment: 5 pages; Accepted to INTERSPEECH 202
- …