73 research outputs found

    Adapting heterogeneous ensembles with particle swarm optimization for video face recognition

    Get PDF
    In video-based face recognition applications, matching is typically performed by comparing query samples against biometric models (i.e., an individual’s facial model) that is designed with reference samples captured during an enrollment process. Although statistical and neural pattern classifiers may represent a flexible solution to this kind of problem, their performance depends heavily on the availability of representative reference data. With operators involved in the data acquisition process, collection and analysis of reference data is often expensive and time consuming. However, although a limited amount of data is initially available during enrollment, new reference data may be acquired and labeled by an operator over time. Still, due to a limited control over changing operational conditions and personal physiology, classification systems used for video-based face recognition are confronted to complex and changing pattern recognition environments. This thesis concerns adaptive multiclassifier systems (AMCSs) for incremental learning of new data during enrollment and update of biometric models. To avoid knowledge (facial models) corruption over time, the proposed AMCS uses a supervised incremental learning strategy based on dynamic particle swarm optimization (DPSO) to evolve a swarm of fuzzy ARTMAP (FAM) neural networks in response to new data. As each particle in a FAM hyperparameter search space corresponds to a FAM network, the learning strategy adapts learning dynamics by co-optimizing all their parameters – hyperparameters, weights, and architecture – in order to maximize accuracy, while minimizing computational cost and memory resources. To achieve this, the relationship between the classification and optimization environments is studied and characterized, leading to these additional contributions. An initial version of this DPSO-based incremental learning strategy was applied to an adaptive classification system (ACS), where the accuracy of a single FAM neural network is maximized. It is shown that the original definition of a classification system capable of supervised incremental learning must be reconsidered in two ways. Not only must a classifier’s learning dynamics be adapted to maintain a high level of performance through time, but some previously acquired learning validation data must also be used during adaptation. It is empirically shown that adapting a FAM during incremental learning constitutes a type III dynamic optimization problem in the search space, where the local optima values and their corresponding position change in time. Results also illustrate the necessity of a long term memory (LTM) to store previously acquired data for unbiased validation and performance estimation. The DPSO-based incremental learning strategy was then modified to evolve the swarm (or pool) of FAM networks within an AMCS. A key element for the success of ensembles is tackled: classifier diversity. With several correlation and diversity indicators, it is shown that genoVIII type (i.e., hyperparameters) diversity in the optimization environment is correlated with classifier diversity in the classification environment. Following this result, properties of a DPSO algorithm that seeks to maintain genotype particle diversity to detect and follow local optima are exploited to generate and evolve diversified pools of FAMclassifiers. Furthermore, a greedy search algorithm is presented to perform an efficient ensemble selection based on accuracy and genotype diversity. This search algorithm allows for diversified ensembles without evaluating costly classifier diversity indicators, and selected ensembles also yield accuracy comparable to that of reference ensemble-based and batch learning techniques, with only a fraction of the resources. Finally, after studying the relationship between the classification environment and the search space, the objective space of the optimization environment is also considered. An aggregated dynamical niching particle swarm optimization (ADNPSO) algorithm is presented to guide the FAM networks according two objectives: FAM accuracy and computational cost. Instead of purely solving a multi-objective optimization problem to provide a Pareto-optimal front, the ADNPSO algorithm aims to generate pools of classifiers among which both genotype and phenotype (i.e., objectives) diversity are maximized. ADNPSO thus uses information in the search spaces to guide particles towards different local Pareto-optimal fronts in the objective space. A specialized archive is then used to categorize solutions according to FAMnetwork size and then capture locally non-dominated classifiers. These two components are then integrated to the AMCS through an ADNPSO-based incremental learning strategy. The AMCSs proposed in this thesis are promising since they create ensembles of classifiers designed with the ADNPSO-based incremental learning strategy and provide a high level of accuracy that is statistically comparable to that obtained through mono-objective optimization and reference batch learning techniques, and yet requires a fraction of the computational cost

    Adaptive classifier ensembles for face recognition in video-surveillance

    Get PDF
    Lors de l’implĂ©mentation de systĂšmes de sĂ©curitĂ© tels que la vidĂ©o-surveillance intelligente, l’utilisation d’images de visages prĂ©sente de nombreux avantages par rapport Ă  d’autres traits biomĂ©triques. En particulier, cela permet de dĂ©tecter d’éventuels individus d’intĂ©rĂȘt de maniĂšre discrĂšte et non intrusive, ce qui peut ĂȘtre particuliĂšrement avantageux dans des situations comme la dĂ©tection d’individus sur liste noire, la recherche dans des donnĂ©es archivĂ©es ou la rĂ©-identification de visages. MalgrĂ© cela, la reconnaissance de visages reste confrontĂ©e Ă  de nombreuses difficultĂ©s propres Ă  la vidĂ©o surveillance. Entre autres, le manque de contrĂŽle sur l’environnement observĂ© implique de nombreuses variations dans les conditions d’éclairage, la rĂ©solution de l’image, le flou de mouvement, l’orientation et l’expression des visages. Pour reconnaĂźtre des individus, des modĂšles de visages sont habituellement gĂ©nĂ©rĂ©s Ă  l’aide d’un nombre limitĂ© d’images ou de vidĂ©os de rĂ©fĂ©rence collectĂ©es lors de sessions d’inscription. Cependant, ces acquisitions ne se dĂ©roulant pas nĂ©cessairement dans les mĂȘmes conditions d’observation, les donnĂ©es de rĂ©fĂ©rence reprĂ©sentent pas toujours la complexitĂ© du problĂšme rĂ©el. D’autre part, bien qu’il soit possible d’adapter les modĂšles de visage lorsque de nouvelles donnĂ©es de rĂ©fĂ©rence deviennent disponibles, un apprentissage incrĂ©mental basĂ© sur des donnĂ©es significativement diffĂ©rentes expose le systĂšme Ă  un risque de corruption de connaissances. Enfin, seule une partie de ces connaissances est effectivement pertinente pour la classification d’une image donnĂ©e. Dans cette thĂšse, un nouveau systĂšme est proposĂ© pour la dĂ©tection automatique d’individus d’intĂ©rĂȘt en vidĂ©o-surveillance. Plus particuliĂšrement, celle-ci se concentre sur un scĂ©nario centrĂ© sur l’utilisateur, oĂč un systĂšme de reconnaissance de visages est intĂ©grĂ© Ă  un outil d’aide Ă  la dĂ©cision pour alerter un opĂ©rateur lorsqu’un individu d’intĂ©rĂȘt est dĂ©tectĂ© sur des flux vidĂ©o. Un tel systĂšme se doit d’ĂȘtre capable d’ajouter ou supprimer des individus d’intĂ©rĂȘt durant son fonctionnement, ainsi que de mettre Ă  jour leurs modĂšles de visage dans le temps avec des nouvelles donnĂ©es de rĂ©fĂ©rence. Pour cela, le systĂšme proposĂ© se base sur de la dĂ©tection de changement de concepts pour guider une stratĂ©gie d’apprentissage impliquant des ensembles de classificateurs. Chaque individu inscrit dans le systĂšme est reprĂ©sentĂ© par un ensemble de classificateurs Ă  deux classes, chacun Ă©tant spĂ©cialisĂ© dans des conditions d’observation diffĂ©rentes, dĂ©tectĂ©es dans les donnĂ©es de rĂ©fĂ©rence. De plus, une nouvelle rĂšgle pour la fusion dynamique d’ensembles de classificateurs est proposĂ©e, utilisant des modĂšles de concepts pour estimer la pertinence des classificateurs vis-Ă -vis de chaque image Ă  classifier. Enfin, les visages sont suivis d’une image Ă  l’autre dans le but de les regrouper en trajectoires, et accumuler les dĂ©cisions dans le temps. Au Chapitre 2, la dĂ©tection de changement de concept est dans un premier temps utilisĂ©e pour limiter l’augmentation de complexitĂ© d’un systĂšme d’appariement de modĂšles adoptant une stratĂ©gie de mise Ă  jour automatique de ses galeries. Une nouvelle approche sensible au contexte est proposĂ©e, dans laquelle seules les images de haute confiance capturĂ©es dans des conditions d’observation diffĂ©rentes sont utilisĂ©es pour mettre Ă  jour les modĂšles de visage. Des expĂ©rimentations ont Ă©tĂ© conduites avec trois bases de donnĂ©es de visages publiques. Un systĂšme d’appariement de modĂšles standard a Ă©tĂ© utilisĂ©, combinĂ© avec un module de dĂ©tection de changement dans les conditions d’illumination. Les rĂ©sultats montrent que l’approche proposĂ©e permet de diminuer la complexitĂ© de ces systĂšmes, tout en maintenant la performance dans le temps. Au Chapitre 3, un nouveau systĂšme adaptatif basĂ© des ensembles de classificateurs est proposĂ© pour la reconnaissance de visages en vidĂ©o-surveillance. Il est composĂ© d’un ensemble de classificateurs incrĂ©mentaux pour chaque individu inscrit, et se base sur la dĂ©tection de changement de concepts pour affiner les modĂšles de visage lorsque de nouvelles donnĂ©es sont disponibles. Une stratĂ©gie hybride est proposĂ©e, dans laquelle des classificateurs ne sont ajoutĂ©s aux ensembles que lorsqu’un changement abrupt est dĂ©tectĂ© dans les donnĂ©es de rĂ©fĂ©rence. Lors d’un changement graduel, les classificateurs associĂ©s sont mis Ă  jour, ce qui permet d’affiner les connaissances propres au concept correspondant. Une implĂ©mentation particuliĂšre de ce systĂšme est proposĂ©e, utilisant des ensembles de classificateurs de type Fuzzy-ARTMAP probabilistes, gĂ©nĂ©rĂ©s et mis Ă  jour Ă  l’aide d’une stratĂ©gie basĂ©e sur une optimisation par essaims de particules dynamiques, et utilisant la distance de Hellinger entre histogrammes pour dĂ©tecter des changements. Les simulations rĂ©alisĂ©es sur la base de donnĂ©e de vidĂ©o-surveillance Faces in Action (FIA) montrent que le systĂšme proposĂ© permet de maintenir un haut niveau de performance dans le temps, tout en limitant la corruption de connaissance. Il montre des performances de classification supĂ©rieure Ă  un systĂšme similaire passif (sans dĂ©tection de changement), ainsi qu’a des systĂšmes de rĂ©fĂ©rence de type kNN probabiliste, et TCM-kNN. Au Chapitre 4, une Ă©volution du systĂšme prĂ©sentĂ© au Chapitre 3 est proposĂ©e, intĂ©grant des mĂ©canismes permettant d’adapter dynamiquement le comportement du systĂšme aux conditions d’observation changeantes en mode opĂ©rationnel. Une nouvelle rĂšgle de fusion basĂ©e sur de la pondĂ©ration dynamique est proposĂ©e, assignant Ă  chaque classificateur un poids proportionnel Ă  son niveau de compĂ©tence estimĂ© vis-Ă -vis de chaque image Ă  classifier. De plus, ces compĂ©tences sont estimĂ©es Ă  l’aide des modĂšles de concepts utilisĂ©s en apprentissage pour la dĂ©tection de changement, ce qui permet un allĂšgement des ressources nĂ©cessaires en mode opĂ©rationnel. Une Ă©volution de l’implĂ©mentation proposĂ©e au Chapitre 3 est prĂ©sentĂ©e, dans laquelle les concepts sont modĂ©lisĂ©s Ă  l’aide de l’algorithme de partitionnement Fuzzy C-Means, et la fusion de classificateurs rĂ©alisĂ©e avec une moyenne pondĂ©rĂ©e. Les simulation expĂ©rimentales avec les bases de donnĂ©es de vidĂ©o-surveillance FIA et Chokepoint montrent que la mĂ©thode de fusion proposĂ©e permet d’obtenir des rĂ©sultats supĂ©rieurs Ă  la mĂ©thode de sĂ©lection dynamique DSOLA, tout en utilisant considĂ©rablement moins de ressources de calcul. De plus, la mĂ©thode proposĂ©e montre des performances de classification supĂ©rieures aux systĂšmes de rĂ©fĂ©rence de type kNN probabiliste, TCM-kNN et Adaptive Sparse Coding

    Adaptive multi-classifier systems for face re-identification applications

    Get PDF
    In video surveillance, decision support systems rely more and more on face recognition (FR) to rapidly determine if facial regions captured over a network of cameras correspond to individuals of interest. Systems for FR in video surveillance are applied in a range of scenarios, for instance in watchlist screening, face re-identification, and search and retrieval. The focus of this Thesis is video-to-video FR, as found in face re-identification applications, where facial models are designed on reference data, and update is archived on operational captures from video streams. Several challenges emerge from the task of recognizing individuals of interest from faces captured with video cameras. Most notably, it is often assumed that the facial appearance of target individuals do not change over time, and the proportions of faces captured for target and non-target individuals are balanced, known a priori and remain fixed. However, faces captured during operations vary due to several factors, including illumination, blur, resolution, pose expression, and camera interoperability. In addition, facial models used matching are commonly not representative since they are designed a priori, with a limited amount of reference samples that are collected and labeled at a high cost. Finally, the proportions of target and non-target individuals continuously change during operations. In literature, adaptive multiple classifier systems (MCSs) have been successfully applied to video-to-video FR, where the facial model for each target individual is designed using an ensemble of 2-class classifiers (trained using target vs. non-target reference samples). Recent approaches employ ensembles of 2-class Fuzzy ARTMAP classifiers, with a DPSO strategy to generate a pool of classifiers with optimized hyperparameters, and Boolean combination to merge their responses in the ROC space. Besides, the skew-sensitive ensembles were recently proposed to adapt the fusion function of an ensemble according to class imbalance measured on operational data. These active approaches estimate target vs. non-target proportions periodically during operations distance, and the fusion of classifier ensembles are adapted to such imbalance. Finally, face tracking can be used to regroup the system responses linked to a facial trajectory (facial captures from a single person in the scene) for robust spatio-temporal recognition, and to update facial models over time using operational data. In this Thesis, new techniques are proposed to adapt the facial models for individuals enrolled to a video-to-video FR system. Trajectory-based self-updating is proposed to update the system, considering gradual and abrupt changes in the classification environment. Then, skew-sensitive ensembles are proposed to adapt the system to the operational imbalance. In Chapter 2, an adaptive framework is proposed for partially-supervised learning of facial models over time based on facial trajectories. During operations, information from a face tracker and individual-specific ensembles is integrated for robust spatio-temporal recognition and for self-update of facial models. The tracker defines a facial trajectory for each individual in video. Recognition of a target individual is done if the positive predictions accumulated along a trajectory surpass a detection threshold for an ensemble. If the accumulated positive predictions surpass a higher update threshold, then all target face samples from the trajectory are combined with non-target samples (selected from the cohort and universal models) to update the corresponding facial model. A learn-and-combine strategy is employed to avoid knowledge corruption during self-update of ensembles. In addition, a memory management strategy based on Kullback-Leibler divergence is proposed to rank and select the most relevant target and non-target reference samples to be stored in memory as the ensembles evolves. The proposed system was validated with synthetic data and real videos from Face in Action dataset, emulating a passport checking scenario. Initially, enrollment trajectories were used for supervised learning of ensembles, and videos from three capture sessions were presented to the system for FR and self-update. Transaction-level analysis shows that the proposed approach outperforms baseline systems that do not adapt to new trajectories, and provides comparable performance to ideal systems that adapt to all relevant target trajectories, through supervised learning. Subject-level analysis reveals the existence of individuals for which self-updated ensembles provide a considerable benefit. Trajectory-level analysis indicates that the proposed system allows for robust spatio-temporal video-to-video FR. In Chapter 3, an extension and a particular implementation of the ensemble-based system for spatio-temporal FR is proposed, and is characterized in scenarios with gradual and abrupt changes in the classification environment. Transaction-level results show that the proposed system allows to increase AUC accuracy by about 3% in scenarios with abrupt changes, and by about 5% in scenarios with gradual changes. Subject-based analysis reveals the difficulties of FR with different poses, affecting more significantly the lamb- and goat-like individuals. Compared to reference spatio-temporal fusion approaches, the proposed accumulation scheme produces the highest discrimination. In Chapter 4, adaptive skew-sensitive ensembles are proposed to combine classifiers trained by selecting data with varying levels of imbalance and complexity, to sustain a high level the performance for video-to-video FR. During operations, the level of imbalance is periodically estimated from the input trajectories using the HDx quantification method, and pre-computed histogram representations of imbalanced data distributions. Ensemble scores are accumulated of trajectories for robust skew-sensitive spatio-temporal recognition. Results on synthetic data show that adapting the fusion function with the proposed approach can significantly improve performance. Results on real data show that the proposed method can outperform reference techniques in imbalanced video surveillance environments

    Tackling Uncertainties and Errors in the Satellite Monitoring of Forest Cover Change

    Get PDF
    This study aims at improving the reliability of automatic forest change detection. Forest change detection is of vital importance for understanding global land cover as well as the carbon cycle. Remote sensing and machine learning have been widely adopted for such studies with increasing degrees of success. However, contemporary global studies still suffer from lower-than-satisfactory accuracies and robustness problems whose causes were largely unknown. Global geographical observations are complex, as a result of the hidden interweaving geographical processes. Is it possible that some geographical complexities were not expected in contemporary machine learning? Could they cause uncertainties and errors when contemporary machine learning theories are applied for remote sensing? This dissertation adopts the philosophy of error elimination. We start by explaining the mathematical origins of possible geographic uncertainties and errors in chapter two. Uncertainties are unavoidable but might be mitigated. Errors are hidden but might be found and corrected. Then in chapter three, experiments are specifically designed to assess whether or not the contemporary machine learning theories can handle these geographic uncertainties and errors. In chapter four, we identify an unreported systemic error source: the proportion distribution of classes in the training set. A subsequent Bayesian Optimal solution is designed to combine Support Vector Machine and Maximum Likelihood. Finally, in chapter five, we demonstrate how this type of error is widespread not just in classification algorithms, but also embedded in the conceptual definition of geographic classes before the classification. In chapter six, the sources of errors and uncertainties and their solutions are summarized, with theoretical implications for future studies. The most important finding is that, how we design a classification largely pre-determines what we eventually get out of it. This applies for many contemporary popular classifiers including various types of neural nets, decision tree, and support vector machine. This is a cause of the so-called overfitting problem in contemporary machine learning. Therefore, we propose that the emphasis of classification work be shifted to the planning stage before the actual classification. Geography should not just be the analysis of collected observations, but also about the planning of observation collection. This is where geography, machine learning, and survey statistics meet

    VLSI Design

    Get PDF
    This book provides some recent advances in design nanometer VLSI chips. The selected topics try to present some open problems and challenges with important topics ranging from design tools, new post-silicon devices, GPU-based parallel computing, emerging 3D integration, and antenna design. The book consists of two parts, with chapters such as: VLSI design for multi-sensor smart systems on a chip, Three-dimensional integrated circuits design for thousand-core processors, Parallel symbolic analysis of large analog circuits on GPU platforms, Algorithms for CAD tools VLSI design, A multilevel memetic algorithm for large SAT-encoded problems, etc

    Learning as a Nonlinear Line of Attraction for Pattern Association, Classification and Recognition

    Get PDF
    Development of a mathematical model for learning a nonlinear line of attraction is presented in this dissertation, in contrast to the conventional recurrent neural network model in which the memory is stored in an attractive fixed point at discrete location in state space. A nonlinear line of attraction is the encapsulation of attractive fixed points scattered in state space as an attractive nonlinear line, describing patterns with similar characteristics as a family of patterns. It is usually of prime imperative to guarantee the convergence of the dynamics of the recurrent network for associative learning and recall. We propose to alter this picture. That is, if the brain remembers by converging to the state representing familiar patterns, it should also diverge from such states when presented by an unknown encoded representation of a visual image. The conception of the dynamics of the nonlinear line attractor network to operate between stable and unstable states is the second contribution in this dissertation research. These criteria can be used to circumvent the plasticity-stability dilemma by using the unstable state as an indicator to create a new line for an unfamiliar pattern. This novel learning strategy utilizes stability (convergence) and instability (divergence) criteria of the designed dynamics to induce self-organizing behavior. The self-organizing behavior of the nonlinear line attractor model can manifest complex dynamics in an unsupervised manner. The third contribution of this dissertation is the introduction of the concept of manifold of color perception. The fourth contribution of this dissertation is the development of a nonlinear dimensionality reduction technique by embedding a set of related observations into a low-dimensional space utilizing the result attained by the learned memory matrices of the nonlinear line attractor network. Development of a system for affective states computation is also presented in this dissertation. This system is capable of extracting the user\u27s mental state in real time using a low cost computer. It is successfully interfaced with an advanced learning environment for human-computer interaction

    Sensors and Technologies in Spain: State-of-the-Art

    Get PDF
    The aim of this special issue was to provide a comprehensive view on the state-of-the-art sensor technology in Spain. Different problems cause the appearance and development of new sensor technologies and vice versa, the emergence of new sensors facilitates the solution of existing real problems. [...

    Language impairment and colour categories

    Get PDF
    Goldstein (1948) reported multiple cases of failure to categorise colours in patients that he termed amnesic or anomic aphasics. these patients have a particular difficulty in producing perceptual categories in the absence of other aphasic impairments. we hold that neuropsychological evidence supports the view that the task of colour categorisation is logically impossible without labels

    How culture might constrain color categories

    Get PDF

    Error management in ATLAS TDAQ : an intelligent systems approach

    Get PDF
    This thesis is concerned with the use of intelligent system techniques (IST) within a large distributed software system, specifically the ATLAS TDAQ system which has been developed and is currently in use at the European Laboratory for Particle Physics(CERN). The overall aim is to investigate and evaluate a range of ITS techniques in order to improve the error management system (EMS) currently used within the TDAQ system via error detection and classification. The thesis work will provide a reference for future research and development of such methods in the TDAQ system. The thesis begins by describing the TDAQ system and the existing EMS, with a focus on the underlying expert system approach, in order to identify areas where improvements can be made using IST techniques. It then discusses measures of evaluating error detection and classification techniques and the factors specific to the TDAQ system. Error conditions are then simulated in a controlled manner using an experimental setup and datasets were gathered from two different sources. Analysis and processing of the datasets using statistical and ITS techniques shows that clusters exists in the data corresponding to the different simulated errors. Different ITS techniques are applied to the gathered datasets in order to realise an error detection model. These techniques include Artificial Neural Networks (ANNs), Support Vector Machines (SVMs) and Cartesian Genetic Programming (CGP) and a comparison of the respective advantages and disadvantages is made. The principle conclusions from this work are that IST can be successfully used to detect errors in the ATLAS TDAQ system and thus can provide a tool to improve the overall error management system. It is of particular importance that the IST can be used without having a detailed knowledge of the system, as the ATLAS TDAQ is too complex for a single person to have complete understanding of. The results of this research will benefit researchers developing and evaluating IST techniques in similar large scale distributed systems
    • 

    corecore