349 research outputs found

    MixUp as Locally Linear Out-Of-Manifold Regularization

    Full text link
    MixUp is a recently proposed data-augmentation scheme, which linearly interpolates a random pair of training examples and correspondingly the one-hot representations of their labels. Training deep neural networks with such additional data is shown capable of significantly improving the predictive accuracy of the current art. The power of MixUp, however, is primarily established empirically and its working and effectiveness have not been explained in any depth. In this paper, we develop an understanding for MixUp as a form of "out-of-manifold regularization", which imposes certain "local linearity" constraints on the model's input space beyond the data manifold. This analysis enables us to identify a limitation of MixUp, which we call "manifold intrusion". In a nutshell, manifold intrusion in MixUp is a form of under-fitting resulting from conflicts between the synthetic labels of the mixed-up examples and the labels of original training data. Such a phenomenon usually happens when the parameters controlling the generation of mixing policies are not sufficiently fine-tuned on the training data. To address this issue, we propose a novel adaptive version of MixUp, where the mixing policies are automatically learned from the data using an additional network and objective function designed to avoid manifold intrusion. The proposed regularizer, AdaMixUp, is empirically evaluated on several benchmark datasets. Extensive experiments demonstrate that AdaMixUp improves upon MixUp when applied to the current art of deep classification models.Comment: Accepted by AAAI201

    Latent Embeddings for Collective Activity Recognition

    Full text link
    Rather than simply recognizing the action of a person individually, collective activity recognition aims to find out what a group of people is acting in a collective scene. Previ- ous state-of-the-art methods using hand-crafted potentials in conventional graphical model which can only define a limited range of relations. Thus, the complex structural de- pendencies among individuals involved in a collective sce- nario cannot be fully modeled. In this paper, we overcome these limitations by embedding latent variables into feature space and learning the feature mapping functions in a deep learning framework. The embeddings of latent variables build a global relation containing person-group interac- tions and richer contextual information by jointly modeling broader range of individuals. Besides, we assemble atten- tion mechanism during embedding for achieving more com- pact representations. We evaluate our method on three col- lective activity datasets, where we contribute a much larger dataset in this work. The proposed model has achieved clearly better performance as compared to the state-of-the- art methods in our experiments.Comment: 6pages, accepted by IEEE-AVSS201

    Influence of Waterside Buildings’ Layout on Wind Environment and the Relation with Design Based on a Case Study of the She Kou Residential District

    Get PDF
    It is important to improve residential thermal comfort in the high dense cities, in which wind environment is crucial. Waterside buildings take an advantage of micro-hydrological-climate in summer that should be used to enhance residential thermal comfort especially in the subtropical region. In order to propose design approaches according to the outdoor thermal comfort of the waterside residential, a case study of Shenzhen She Kou residential district has been made. It focused on various factors that could have influence on wind environment for improving thermal comfort. Using wind velocity ratio (ΔRi) criterion, factors of building development volume, building direction and layout pattern, open space arrangement etc. have been broadly explored using FLUENT simulation. To planning parameters, the Floor Area Ratio (FAR) is significantly influence wind environment, the smaller FAR is better. To the vertical layout of the buildings, multi-storey layout and multi-storey & sub high-rise mixed layout would provide better wind environment. To the horizontal layout, the determinant is better than the peripheral. Other factors such as the buildings’ direction towards the road, buildings’ height, and open space setting, have influence on wind environment yet. In general, the more benefit of design layout for wind breezing, the better wind environment it could ge

    Phase perturbation improves channel robustness for speech spoofing countermeasures

    Full text link
    In this paper, we aim to address the problem of channel robustness in speech countermeasure (CM) systems, which are used to distinguish synthetic speech from human natural speech. On the basis of two hypotheses, we suggest an approach for perturbing phase information during the training of time-domain CM systems. Communication networks often employ lossy compression codec that encodes only magnitude information, therefore heavily altering phase information. Also, state-of-the-art CM systems rely on phase information to identify spoofed speech. Thus, we believe the information loss in the phase domain induced by lossy compression codec degrades the performance of the unseen channel. We first establish the dependence of time-domain CM systems on phase information by perturbing phase in evaluation, showing strong degradation. Then, we demonstrated that perturbing phase during training leads to a significant performance improvement, whereas perturbing magnitude leads to further degradation.Comment: 5 pages; Accepted to INTERSPEECH 202
    • …
    corecore