17 research outputs found

    Evaluating Supervision Levels Trade-Offs for Infrared-Based People Counting

    Full text link
    Object detection models are commonly used for people counting (and localization) in many applications but require a dataset with costly bounding box annotations for training. Given the importance of privacy in people counting, these models rely more and more on infrared images, making the task even harder. In this paper, we explore how weaker levels of supervision can affect the performance of deep person counting architectures for image classification and point-level localization. Our experiments indicate that counting people using a CNN Image-Level model achieves competitive results with YOLO detectors and point-level models, yet provides a higher frame rate and a similar amount of model parameters.Comment: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 202

    J Regularization Improves Imbalanced Multiclass Segmentation

    Get PDF
    We propose a new loss formulation to further advance the multiclass segmentation of cluttered cells under weakly supervised conditions. When adding a Youden's J statistic regularization term to the cross entropy loss we improve the separation of touching and immediate cells, obtaining sharp segmentation boundaries with high adequacy. This regularization intrinsically supports class imbalance thus eliminating the necessity of explicitly using weights to balance training. Simulations demonstrate this capability and show how the regularization leads to correct results by helping advancing the optimization when cross entropy stagnates. We build upon our previous work on multiclass segmentation by adding yet another training class representing gaps between adjacent cells. This addition helps the classifier identify narrow gaps as background and no longer as touching regions. We present results of our methods for 2D and 3D images, from bright field images to confocal stacks containing different types of cells, and we show that they accurately segment individual cells after training with a limited number of images, some of which are poorly annotated

    FERAtt: Facial Expression Recognition with Attention Net

    Full text link
    We present a new end-to-end network architecture for facial expression recognition with an attention model. It focuses attention in the human face and uses a Gaussian space representation for expression recognition. We devise this architecture based on two fundamental complementary components: (1) facial image correction and attention and (2) facial expression representation and classification. The first component uses an encoder-decoder style network and a convolutional feature extractor that are pixel-wise multiplied to obtain a feature attention map. The second component is responsible for obtaining an embedded representation and classification of the facial expression. We propose a loss function that creates a Gaussian structure on the representation space. To demonstrate the proposed method, we create two larger and more comprehensive synthetic datasets using the traditional BU3DFE and CK+ facial datasets. We compared results with the PreActResNet18 baseline. Our experiments on these datasets have shown the superiority of our approach in recognizing facial expressions

    Domain Generalization by Rejecting Extreme Augmentations

    Full text link
    Data augmentation is one of the most effective techniques for regularizing deep learning models and improving their recognition performance in a variety of tasks and domains. However, this holds for standard in-domain settings, in which the training and test data follow the same distribution. For the out-of-domain case, where the test data follow a different and unknown distribution, the best recipe for data augmentation is unclear. In this paper, we show that for out-of-domain and domain generalization settings, data augmentation can provide a conspicuous and robust improvement in performance. To do that, we propose a simple training procedure: (i) use uniform sampling on standard data augmentation transformations; (ii) increase the strength transformations to account for the higher data variance expected when working out-of-domain, and (iii) devise a new reward function to reject extreme transformations that can harm the training. With this procedure, our data augmentation scheme achieves a level of accuracy that is comparable to or better than state-of-the-art methods on benchmark domain generalization datasets. Code: \url{https://github.com/Masseeh/DCAug

    Influence of the type of rare-earth cation on electrical properties of Ba(Zr,Ti)O3 ceramics

    Get PDF
    En este artículo es investigado el efecto del radio iónico de diferentes cationes de tierras raras sobre las propiedades eléctricas del compuesto Ba(Zr,Ti)O3. La cerámica BaZr0.09Ti0.9O3 fue dopada con iones de Gd3+, Pr3+ y La3+, sustituyendo un 5 % de contenido de cationes Ba2+. Los resultados muestran que el coeficiente de difusividad se incrementa con el aumento del radio iónico, mientras el carácter de la transición de fase permanece casi constante, de acuerdo con el modelo fenomenológico de Santos-Eiras. En todas las muestras se reporta un comportamiento no relajador. El proceso de conductividad iónica en el BaZr0.09Ti0.9O3 aparece a 200 ºC, valor inferior al observado en el material cerámico BaTiO3. El valor de energía de activación en el proceso de conductividad del BaZr0.09Ti0.9O3 (Ea= 0.81 eV) indica un mecanismo de compensación de carga en la estructura cristalina similar al BaTiO3, es decir vacancias de oxígeno simplemente ionizadas. Para el compuesto BaZr0.09Ti0.9O3 dopado con iones Gd3+, Pr3+ y La3+ aparece además un mecanismo de conducción de electrones, debido a la sustitución del Ba2+.Postprint (published version

    J Regularization Improves Imbalanced Multiclass Segmentation

    Get PDF
    We propose a new loss formulation to further advance the multiclass segmentation of cluttered cells under weakly supervised conditions. When adding a Youden's J statistic regularization term to the cross entropy loss we improve the separation of touching and immediate cells, obtaining sharp segmentation boundaries with high adequacy. This regularization intrinsically supports class imbalance thus eliminating the necessity of explicitly using weights to balance training. Simulations demonstrate this capability and show how the regularization leads to correct results by helping advancing the optimization when cross entropy stagnates. We build upon our previous work on multiclass segmentation by adding yet another training class representing gaps between adjacent cells. This addition helps the classifier identify narrow gaps as background and no longer as touching regions. We present results of our methods for 2D and 3D images, from bright field images to confocal stacks containing different types of cells, and we show that they accurately segment individual cells after training with a limited number of images, some of which are poorly annotated
    corecore