8 research outputs found

    Fusion of Multispectral Data Through Illumination-aware Deep Neural Networks for Pedestrian Detection

    Get PDF
    Multispectral pedestrian detection has received extensive attention in recent years as a promising solution to facilitate robust human target detection for around-the-clock applications (e.g. security surveillance and autonomous driving). In this paper, we demonstrate illumination information encoded in multispectral images can be utilized to significantly boost performance of pedestrian detection. A novel illumination-aware weighting mechanism is present to accurately depict illumination condition of a scene. Such illumination information is incorporated into two-stream deep convolutional neural networks to learn multispectral human-related features under different illumination conditions (daytime and nighttime). Moreover, we utilized illumination information together with multispectral data to generate more accurate semantic segmentation which are used to boost pedestrian detection accuracy. Putting all of the pieces together, we present a powerful framework for multispectral pedestrian detection based on multi-task learning of illumination-aware pedestrian detection and semantic segmentation. Our proposed method is trained end-to-end using a well-designed multi-task loss function and outperforms state-of-the-art approaches on KAIST multispectral pedestrian dataset

    Nonparametric Spatio-Temporal Joint Probabilistic Data Association Coupled Filter and Interfering Extended Target Tracking

    Full text link
    Extended target tracking estimates the centroid and shape of the target in space and time. In various situations where extended target tracking is applicable, the presence of multiple targets can lead to interference, particularly when they maneuver behind one another in a sensor like a camera. Nonetheless, when dealing with multiple extended targets, there's a tendency for them to share similar shapes within a group, which can enhance their detectability. For instance, the coordinated movement of a cluster of aerial vehicles might cause radar misdetections during their convergence or divergence. Similarly, in the context of a self-driving car, lane markings might split or converge, resulting in inaccurate lane tracking detections. A well-known joint probabilistic data association coupled (JPDAC) filter can address this problem in only a single-point target tracking. A variation of JPDACF was developed by introducing a nonparametric Spatio-Temporal Joint Probabilistic Data Association Coupled Filter (ST-JPDACF) to address the problem for extended targets. Using different kernel functions, we manage the dependency of measurements in space (inside a frame) and time (between frames). Kernel functions are able to be learned using a limited number of training data. This extension can be used for tracking the shape and dynamics of nonparametric dependent extended targets in clutter when targets share measurements. The proposed algorithm was compared with other well-known supervised methods in the interfering case and achieved promising results.Comment: 12 pages, 8 figures, Journa

    Multimodal fusion architectures for pedestrian detection

    Get PDF
    Pedestrian detection provides a crucial functionality in many human-centric applications, such as video surveillance, urban scene analysis, and autonomous driving. Recently, multimodal pedestrian detection has received extensive attention since the fusion of complementary information captured by visible and infrared sensors enables robust human target detection under daytime and nighttime scenes. In this chapter, we systematically evaluate the performance of different multimodal fusion architectures in order to identify the optimal solutions for pedestrian detection. We made two important observations: (1) it is useful to combine the most commonly used concatenation fusion scheme with a global scene-aware mechanism to learn both human-related features and correlation between visible and thermal feature maps; (2) the two-stream segmentation supervision without multimodal fusion provides the most effective scheme to infuse segmentation information as supervision for learning human-related features. Based on these studies, we present a unified multimodal fusion framework for joint training of target detection and segmentation supervision which achieves the state-of-the-art multimodal pedestrian detection performance on the public KAIST benchmark dataset.</p

    Parametrização de algoritmos para deteção de estrada a bordo do AtlasCar

    Get PDF
    Mestrado em Engenharia MecânicaO veículo AtlasCar é um protótipo desenvolvido pelo Laboratório de Automação e Robótica do Departamento de Engenharia Mecânica da Universidade de Aveiro, para o estudo de sistemas de segurança ativos e passivos e soluções de condução autónoma. O objetivo deste trabalho _e dotar o AtlasCar de um sistema de visão artificial capaz de reconhecer os limites da estrada, sendo para isso necessária a procura de marcas na estrada e/ou características das mesmas. Este trabalho encontra-se dividido em duas partes principais: o desenvolvimento de módulos de software para a implementação de algoritmos com metodologias de deteção distintas e o estudo dos parâmetros que os compõem. O primeiro algoritmo baseia-se no método de RANSAC para efetuar a aproximação de uma spline aos limites da estrada, por sua vez o segundo algoritmo utiliza a transformada de Hough para efetuar esta aproximação. Os métodos apresentados foram testados em condições reais e foram analisadas as situações de falha, vantagens e desvantagens da aplicação de um método em relativamente ao outro. Com a implementação deste sistema ser a possível a distinguir as zonas navegáveis das não navegáveis em contexto de estrada, permitindo assim a outros processos fazer um planeamento local da navegação.The AtlasCar vehicle is a prototype developed by the Laboratory of Automation and Robotics, Department of Mechanical Engineering of the University of Aveiro, for the study of active and passive safety systems and autonomous driving solutions. The objective of this work is to provide the AtlasCar an artificial vision system able to recognize road borders, identifying road lane markings and their characteristics. This work is divided into two main parts: development of software modules to implement algorithms with different detection methodologies and the study of parameters that compose them. The first algorithm is based on RANSAC method for performing spline fitting to the road limits, the second algoritm uses the Hough transform to make this fitting. The methods performed were tested under real conditions, and failure situations, advantages and drawbacks of their applications are analysed and compared. With the implementation of this system it will be possible to distinguish navigable from non-navigable road surface, thus endowing path planning algorithms to operate in local context

    Exploración de tolerancia a imprecisiones en aplicaciones ADAS y su efecto en un sistema de control

    Get PDF
    Proyecto de Graduación (Licenciatura en Ingeniería Mecatrónica) Instituto Tecnológico de Costa Rica. Área Académica de Ingeniería Mecatrónica, 2018Los vehículos que día a día son conducidos en las calles poseen Sistemas Avanzados de Asistencia el Conductor (Advanced Driver Assistance Systems, ADAS). Estas aplicaciones ADAS le brindan apoyo al conductor para facilitarle la tarea de conducir e inclusive pueda desentenderse de algunas tareas. Este desarrollo tecnológico en la industria automotriz tiene como meta llegar a la conducción autónoma de vehículos en un 100%. El reto que se planteó para el desarrollo de este proyecto fue desarrollar 3 de estas aplicaciones que involucren Visión por Computadora (Computer Vision, CV) y comprobar su tolerancia a imprecisiones mediante la implementación Computación Aproximada (Aproximate Computing, AC), lo cual aunado a un medio de pruebas donde permite observar su efecto en un sistema de control y comprobar así contra el funcionamiento “exacto" de las mismas aplicaciones.The day-to-day vehicles that are driven on the streets have Advanced Driver Assistance Systems (ADAS). These ADAS applications provide support to the driver to facilitate the task of driving and even can get off tasks. This technological development in the automotive industry aims to reach the autonomous driving of vehicles by 100%. The challenge for the development of this project is to develop 3 of these applications involving Computer Vision (CV) and test their tolerance to errors through Approximate Computing (AC) implementation, this coupled with a test medium where its effect will be seen in a system of Control and will be checked against the "exact" operation of the same applications

    Políticas de Copyright de Publicações Científicas em Repositórios Institucionais: O Caso do INESC TEC

    Get PDF
    A progressiva transformação das práticas científicas, impulsionada pelo desenvolvimento das novas Tecnologias de Informação e Comunicação (TIC), têm possibilitado aumentar o acesso à informação, caminhando gradualmente para uma abertura do ciclo de pesquisa. Isto permitirá resolver a longo prazo uma adversidade que se tem colocado aos investigadores, que passa pela existência de barreiras que limitam as condições de acesso, sejam estas geográficas ou financeiras. Apesar da produção científica ser dominada, maioritariamente, por grandes editoras comerciais, estando sujeita às regras por estas impostas, o Movimento do Acesso Aberto cuja primeira declaração pública, a Declaração de Budapeste (BOAI), é de 2002, vem propor alterações significativas que beneficiam os autores e os leitores. Este Movimento vem a ganhar importância em Portugal desde 2003, com a constituição do primeiro repositório institucional a nível nacional. Os repositórios institucionais surgiram como uma ferramenta de divulgação da produção científica de uma instituição, com o intuito de permitir abrir aos resultados da investigação, quer antes da publicação e do próprio processo de arbitragem (preprint), quer depois (postprint), e, consequentemente, aumentar a visibilidade do trabalho desenvolvido por um investigador e a respetiva instituição. O estudo apresentado, que passou por uma análise das políticas de copyright das publicações científicas mais relevantes do INESC TEC, permitiu não só perceber que as editoras adotam cada vez mais políticas que possibilitam o auto-arquivo das publicações em repositórios institucionais, como também que existe todo um trabalho de sensibilização a percorrer, não só para os investigadores, como para a instituição e toda a sociedade. A produção de um conjunto de recomendações, que passam pela implementação de uma política institucional que incentive o auto-arquivo das publicações desenvolvidas no âmbito institucional no repositório, serve como mote para uma maior valorização da produção científica do INESC TEC.The progressive transformation of scientific practices, driven by the development of new Information and Communication Technologies (ICT), which made it possible to increase access to information, gradually moving towards an opening of the research cycle. This opening makes it possible to resolve, in the long term, the adversity that has been placed on researchers, which involves the existence of barriers that limit access conditions, whether geographical or financial. Although large commercial publishers predominantly dominate scientific production and subject it to the rules imposed by them, the Open Access movement whose first public declaration, the Budapest Declaration (BOAI), was in 2002, proposes significant changes that benefit the authors and the readers. This Movement has gained importance in Portugal since 2003, with the constitution of the first institutional repository at the national level. Institutional repositories have emerged as a tool for disseminating the scientific production of an institution to open the results of the research, both before publication and the preprint process and postprint, increase the visibility of work done by an investigator and his or her institution. The present study, which underwent an analysis of the copyright policies of INESC TEC most relevant scientific publications, allowed not only to realize that publishers are increasingly adopting policies that make it possible to self-archive publications in institutional repositories, all the work of raising awareness, not only for researchers but also for the institution and the whole society. The production of a set of recommendations, which go through the implementation of an institutional policy that encourages the self-archiving of the publications developed in the institutional scope in the repository, serves as a motto for a greater appreciation of the scientific production of INESC TEC
    corecore