483 research outputs found

    Unsupervised Domain Adaptation for Multispectral Pedestrian Detection

    Get PDF
    Multimodal information (e.g., visible and thermal) can generate robust pedestrian detections to facilitate around-the-clock computer vision applications, such as autonomous driving and video surveillance. However, it still remains a crucial challenge to train a reliable detector working well in different multispectral pedestrian datasets without manual annotations. In this paper, we propose a novel unsupervised domain adaptation framework for multispectral pedestrian detection, by iteratively generating pseudo annotations and updating the parameters of our designed multispectral pedestrian detector on target domain. Pseudo annotations are generated using the detector trained on source domain, and then updated by fixing the parameters of detector and minimizing the cross entropy loss without back-propagation. Training labels are generated using the pseudo annotations by considering the characteristics of similarity and complementarity between well-aligned visible and infrared image pairs. The parameters of detector are updated using the generated labels by minimizing our defined multi-detection loss function with back-propagation. The optimal parameters of detector can be obtained after iteratively updating the pseudo annotations and parameters. Experimental results show that our proposed unsupervised multimodal domain adaptation method achieves significantly higher detection performance than the approach without domain adaptation, and is competitive with the supervised multispectral pedestrian detectors

    Embedded real-time object detection for a UAV warning system

    Get PDF

    Eye Detection and Face Recognition Across the Electromagnetic Spectrum

    Get PDF
    Biometrics, or the science of identifying individuals based on their physiological or behavioral traits, has increasingly been used to replace typical identifying markers such as passwords, PIN numbers, passports, etc. Different modalities, such as face, fingerprint, iris, gait, etc. can be used for this purpose. One of the most studied forms of biometrics is face recognition (FR). Due to a number of advantages over typical visible to visible FR, recent trends have been pushing the FR community to perform cross-spectral matching of visible images to face images from higher spectra in the electromagnetic spectrum.;In this work, the SWIR band of the EM spectrum is the primary focus. Four main contributions relating to automatic eye detection and cross-spectral FR are discussed. First, a novel eye localization algorithm for the purpose of geometrically normalizing a face across multiple SWIR bands for FR algorithms is introduced. Using a template based scheme and a novel summation range filter, an extensive experimental analysis show that this algorithm is fast, robust, and highly accurate when compared to other available eye detection methods. Also, the eye locations produced by this algorithm provides higher FR results than all other tested approaches. This algorithm is then augmented and updated to quickly and accurately detect eyes in more challenging unconstrained datasets, spanning the EM spectrum. Additionally, a novel cross-spectral matching algorithm is introduced that attempts to bridge the gap between the visible and SWIR spectra. By fusing multiple photometric normalization combinations, the proposed algorithm is not only more efficient than other visible-SWIR matching algorithms, but more accurate in multiple challenging datasets. Finally, a novel pre-processing algorithm is discussed that bridges the gap between document (passport) and live face images. It is shown that the pre-processing scheme proposed, using inpainting and denoising techniques, significantly increases the cross-document face recognition performance

    Modelos de aprendizaje automático en la detección e identificación de personas: una revisión de literatura

    Get PDF
    Introduction: This article is the result of research entitled "Development of a prototype to optimize access conditions to the SENA-Pescadero using artificial intelligence and open-source tools", developed at the Servicio Nacional de Aprendizaje in 2020.   Problem: How to identify Machine Learning Techniques applied to computer vision processes through a literature review? Objective: Determine the application, as well as advantages and disadvantages of machine learning techniques focused on the detection and identification of people. Methodology: Systematic literature review in 4 high-impact bibliographic and scientific databases, using search filters and information selection criteria. Results: Machine Learning techniques defined as Principal Component Analysis, Weak Label Regularized Local Coordinate Coding, Support Vector Machines, Haar Cascade Classifiers and EigenFaces and FisherFaces, as well as their applicability in detection and identification processes.   Conclusion: The research led to the identification of the main computational intelligence techniques based on machine learning, applied to the detection and identification of people. Their influence was shown in several application cases, but most of them were focused on the implementation and optimization of access control systems, or tasks in which the identification of people was required for the execution of processes. Originality: Through this research, we studied and defined the main machine learning techniques currently used for the detection and identification of people. Limitations: The systematic review is limited to information available in the 4 databases consulted, and the amount of information is variable as articles are deposited in the databases.Introducción: Este artículo es el resultado de la investigación titulada " Desarrollo de un prototipo para optimizar las condiciones de acceso al SENA-Pescadero utilizando inteligencia artificial y herramientas de código abierto", desarrollada en el Servicio Nacional de Aprendizaje en 2020. Problema: ¿Cómo identificar las técnicas de aprendizaje automático aplicadas a los procesos de visión por computador a través de una revisión bibliográfica? Objetivo: Determinar la aplicación, así como las ventajas y desventajas de las técnicas de aprendizaje automático enfocadas a la detección e identificación de personas. Metodología: Revisión sistemática de la literatura en 4 bases de datos bibliográficas y científicas de alto impacto, utilizando filtros de búsqueda y criterios de selección de información. Resultados: Técnicas de aprendizaje automático definidas como Análisis de Componentes Principales, Codificación Local de Coordenadas Regularizada de Etiquetas Débiles, Máquinas de Vectores de Soporte, Clasificadores en Cascada de Haar y EigenFaces y FisherFaces, así como su aplicabilidad en procesos de detección e identificación. Conclusiones: La investigación permitió identificar las principales técnicas de inteligencia computacional basadas en machine learning aplicadas a la detección e identificación de personas. Su influencia se mostró en varios casos de aplicación, pero la mayoría de ellos se centraron en la implementación y optimización de sistemas de control de acceso, o tareas en las que se requería la identificación de personas para la ejecución de procesos Originalidad: A través de esta investigación se estudiaron y definieron las principales técnicas de machine learning utilizadas actualmente para la detección e identificación de personas
    corecore