147 research outputs found

    Online Vehicle Logo Recognition Using Cauchy Prior Logistic Regression

    Get PDF
    Vehicle logo recognition is an important part of vehicle identification in intelligent transportation systems. State-of-the-art vehicle logo recognition approaches typically consider training models on large datasets. However, there might only be a small training dataset to start with and more images can be obtained during the real-time applications. This paper proposes an online image recognition framework which provides solutions for both small and large datasets. Using this recognition framework, models are built efficiently using a weight updating scheme. Another novelty of this work is that the Cauchy prior logistic regression with conjugate gradient descent is proposed to deal with the multinomial classification tasks. The Cauchy prior results in a quicker convergence speed for the weight updating process which could decrease the computational cost for both online and offline methods. By testing with a publicly available dataset, the Cauchy prior logistic regression deceases the classification time by 59%. An accuracy of up to 98.80% is achieved when the proposed framework is applied

    Online Vehicle Logo Recognition Using Cauchy Prior Logistic Regression

    Get PDF
    Vehicle logo recognition is an important part of vehicle identification in intelligent transportation systems. State-of-the-art vehicle logo recognition approaches typically consider training models on large datasets. However, there might only be a small training dataset to start with and more images can be obtained during the real-time applications. This paper proposes an online image recognition framework which provides solutions for both small and large datasets. Using this recognition framework, models are built efficiently using a weight updating scheme. Another novelty of this work is that the Cauchy prior logistic regression with conjugate gradient descent is proposed to deal with the multinomial classification tasks. The Cauchy prior results in a quicker convergence speed for the weight updating process which could decrease the computational cost for both online and offline methods. By testing with a publicly available dataset, the Cauchy prior logistic regression deceases the classification time by 59%. An accuracy of up to 98.80% is achieved when the proposed framework is applied

    Machine Learning Methods for Autonomous Object Recognition and Restoration in Images

    Get PDF
    Image recognition and image restoration are important tasks in the field of image processing. Image recognition are becoming very popular due to the state-of-the-art deep learning methods. However, these models usually require big datasets and high computational costs, which could be challenging. This thesis proposes an online learning framework that deals with both small and big datasets. For small datasets, a Cauchy prior logistic regression classifier is proposed to provide a quick convergence, and the online weight updating scheme is efficient due to the previously trained weights being reused. For big datasets, convolutional neural network could be implemented. For image recognition, non-parametric classifiers are often used for image recognition such as K-nearest neighbours, however, K-nearest neighbours are vulnerable to noise and high dimensional features. This thesis proposes a non-parametric classifier based on Bayesian compressive sensing; the developed classifier is robust and it does not need a training stage. For image restoration, which is usually performed before image recognition as a preprocessing process. This thesis proposes such a joint framework that performs image recognition and restoration simultaneously. In image restoration, image rotation and occlusion are common problems but convolutional neural networks are not suitable to solve these due to the limitation of the convolutional process and pooling process. This thesis develops a joint framework based on capsule networks. The developed joint capsule framework could achieve a good result on recognition, image de-noising, recovering rotation and removing occlusion. The developed algorithms have been evaluated for vehicle logo restoration and recognition, however, they are transferable to other implementations. This thesis also developed an automatic detection and recognition framework for badger monitoring for the first time. Badger plays a key role in the transmission of bovine tuberculosis, which is described by government as the most pressing animal health problem in the UK. An automatic badger monitoring system could help researcher to understand the transmission mechanisms and thereby to develop methods to deal with the transmission between species

    Beyond Searching: Understanding How People Use Search to Support Their Creative Endeavors

    Get PDF
    Creativity is an essential part of people's daily life and work across a range of everyday tasks. However, little prior work has explored how people use search engines and information resources as part of their creative processes, and how systems might better support users' information needs when working on tasks that involve creative endeavors. In this dissertation research, I sought to investigate the types of information seeking tools and strategies that people currently use in practice when they engage in projects that involve everyday creativity. The dissertation includes two parts. In the first part, an online survey with 175 participants was conducted to get a general understanding of how people use search engines and other existing information tools to support their everyday creativity tasks, the types of creative process stages that are involved in their tasks, and how they use different tools to support different creative stages. To get a deeper understanding of people's behaviors and their creative processes, in the second part, I conducted a two-week diary study to investigate users' in-situ search behaviors in their design-related projects from different perspectives (e.g., types of information sought in a project, intents to use the information found online, strategies of using different resources or tools in creative processes, and challenges encountered in creative processes). At the end of this dissertation, I discuss the implications of this research and provide recommendations for future research and the future design of search systems.Doctor of Philosoph

    Advances in Artificial Intelligence: Models, Optimization, and Machine Learning

    Get PDF
    The present book contains all the articles accepted and published in the Special Issue “Advances in Artificial Intelligence: Models, Optimization, and Machine Learning” of the MDPI Mathematics journal, which covers a wide range of topics connected to the theory and applications of artificial intelligence and its subfields. These topics include, among others, deep learning and classic machine learning algorithms, neural modelling, architectures and learning algorithms, biologically inspired optimization algorithms, algorithms for autonomous driving, probabilistic models and Bayesian reasoning, intelligent agents and multiagent systems. We hope that the scientific results presented in this book will serve as valuable sources of documentation and inspiration for anyone willing to pursue research in artificial intelligence, machine learning and their widespread applications

    Intelligent video surveillance

    Get PDF
    In the focus of this thesis are the new and modified algorithms for object detection, recognition and tracking within the context of video analytics. The manual video surveillance has been proven to have low effectiveness and, at the same time, high expense because of the need in manual labour of operators, which are additionally prone to erroneous decisions. Along with increase of the number of surveillance cameras, there is a strong need to push for automatisation of the video analytics. The benefits of this approach can be found both in military and civilian applications. For military applications, it can help in localisation and tracking of objects of interest. For civilian applications, the similar object localisation procedures can make the criminal investigations more effective, extracting the meaningful data from the massive video footage. Recently, the wide accessibility of consumer unmanned aerial vehicles has become a new threat as even the simplest and cheapest airborne vessels can carry some cargo that means they can be upgraded to a serious weapon. Additionally they can be used for spying that imposes a threat to a private life. The autonomous car driving systems are now impossible without applying machine vision methods. The industrial applications require automatic quality control, including non-destructive methods and particularly methods based on the video analysis. All these applications give a strong evidence in a practical need in machine vision algorithms for object detection, tracking and classification and gave a reason for writing this thesis. The contributions to knowledge of the thesis consist of two main parts: video tracking and object detection and recognition, unified by the common idea of its applicability to video analytics problems. The novel algorithms for object detection and tracking, described in this thesis, are unsupervised and have only a small number of parameters. The approach is based on rigid motion segmentation by Bayesian filtering. The Bayesian filter, which was proposed specially for this method and contributes to its novelty, is formulated as a generic approach, and then applied to the video analytics problems. The method is augmented with optional object coordinate estimation using plain two-dimensional terrain assumption which gives a basis for the algorithm usage inside larger sensor data fusion models. The proposed approach for object detection and classification is based on the evolving systems concept and the new Typicality-Eccentricity Data Analytics (TEDA) framework. The methods are capable of solving classical problems of data mining: clustering, classification, and regression. The methods are proposed in a domain-independent way and are capable of addressing shift and drift of the data streams. Examples are given for the clustering and classification of the imagery data. For all the developed algorithms, the experiments have shown sustainable results on the testing data. The practical applications of the proposed algorithms are carefully examined and tested

    Pameran Reka Cipta, Penyelidikan dan Inovasi (PRPI) 2009

    Get PDF
    PRPI 2009 kini telah memasuki tahun penganjurannya yang ke-7. Pameran penyelidikan di UPM telah bermula sejak tahun 1997 semasa Exhibition & Seminar Harnessing for Industry Advantage. Pada tahun 2002, Pameran Reka Cipta dan Penyelidikan (PRP) buat pertama kali telah diadakan dengan menggunakan konsep pertandingan hasil projek penyelidikan yang telah dijalankan oleh para penyelidik UPM. Kejayaan penganjuran PRP 2002 telah merintis usaha untuk menjadikannya sebagai aktiviti tahunan UPM dan ianya terus berkembang sejajar dengan nama baharunya yang ditukar kepada Pameran Reka Cipta, Penyelidikan dan Inovasi yang bermula penganjurannya pada tahun 2005. Sebagai kesinambungan daripada kejayaan penganjuran PRPI 2006, 2007 dan 2008 yang lalu dan status UPM sebagai salah sebuah Universiti Penyelidikan, PRPI 2009 kali ini yang merupakan pameran penyelidikan yang terbesar di UPM terus dilaksanakan dengan aspirasi dan semangat yang lebih jitu. Pameran ini juga menjadi pelantar kepada para penyelidik untuk mengenengahkan hasil penyelidikan yang dijalankan dan penemuan baharu kepada umum. Di samping itu ianya juga menjadi penanda aras terhadap kualiti sesuatu projek penyelidikan bagi melayakkan para penyelidik UPM untuk menyertai pameran di peringkat kebangsaan dan seterusnya antarabangsa. Adalah diharapkan pelaksanaan PRPI 2009 ini akan dapat menyemarakkan budaya penyelidikan di kalangan staf dan juga pelajar UPM sekaligus menjadikan UPM sebagai Universiti Penyelidikan yang cemerlang di negara ini

    Essentials of Business Analytics

    Get PDF

    Fractional Calculus and the Future of Science

    Get PDF
    Newton foresaw the limitations of geometry’s description of planetary behavior and developed fluxions (differentials) as the new language for celestial mechanics and as the way to implement his laws of mechanics. Two hundred years later Mandelbrot introduced the notion of fractals into the scientific lexicon of geometry, dynamics, and statistics and in so doing suggested ways to see beyond the limitations of Newton’s laws. Mandelbrot’s mathematical essays suggest how fractals may lead to the understanding of turbulence, viscoelasticity, and ultimately to end of dominance of the Newton’s macroscopic world view.Fractional Calculus and the Future of Science examines the nexus of these two game-changing contributions to our scientific understanding of the world. It addresses how non-integer differential equations replace Newton’s laws to describe the many guises of complexity, most of which lay beyond Newton’s experience, and many had even eluded Mandelbrot’s powerful intuition. The book’s authors look behind the mathematics and examine what must be true about a phenomenon’s behavior to justify the replacement of an integer-order with a noninteger-order (fractional) derivative. This window into the future of specific science disciplines using the fractional calculus lens suggests how what is seen entails a difference in scientific thinking and understanding

    Learning Attention Mechanisms and Context: An Investigation into Vision and Emotion

    Get PDF
    Attention mechanisms for context modelling are becoming ubiquitous in neural architectures in machine learning. The attention mechanism is a technique that filters out information that is irrelevant to a given task and focuses on learning task-dependent fixation points or regions. Furthermore, attention mechanisms suggest a question about a given task, i.e. `what' to learn and `where/how' to learn for task-specific context modelling. The context is the conditional variables instrumental in deciding the categorical distribution for the given data. Also, why is learning task-specific context necessary? In order to answer these questions, context modelling with attention in the vision and emotion domains is explored in this thesis using attention mechanisms with different hierarchical structures. The three main goals of this thesis are building superior classifiers using attention-based deep neural networks~(DNNs), investigating the role of context modelling in the given tasks, and developing a framework for interpreting hierarchies and attention in deep attention networks. In the vision domain, gesture and posture recognition tasks in diverse environments, are chosen. In emotion, visual and speech emotion recognition tasks are chosen. These tasks are selected for their sequential properties for modelling a spatiotemporal context. One of the key challenges from a machine learning standpoint is to extract patterns which bear maximum correlation with the information encoded in its signal while being as insensitive as possible to other types of information carried by the signal. A possible way to overcome this problem is to learn task-dependent representations. In order to achieve that, novel spatiotemporal context modelling networks and the mixture of multi-view attention~(MOMA) networks are proposed using bidirectional long-short-term memory network (BLSTM), convolutional neural network~(CNN), Capsule and attention networks. A framework has been proposed to interpret the internal attention states with respect to the given task. The results of the classifiers in the assigned tasks are compared with the \textit{state-of-the-art} DNNs, and the proposed classifiers achieve superior results. The context in speech emotion recognition is explored deeply with the attention interpretation framework, and it shows that the proposed model can assign word importance based on acoustic context. Furthermore, it has been observed that the internal states of the attention bear correlation with human perception of acoustic cues for speech emotion recognition. Overall, the results demonstrate superior classifiers and context learning models with interpretable frameworks. The findings are very important for speech emotion recognition systems. In this thesis, not only better models are produced, but also the interpretability of those models are explored, and their internal states are analysed. The phones and words are aligned with the attention vectors, and it is seen that the vowel sounds are more important for defining emotion acoustic cues than the consonants, and the model can assign word importance based on acoustic context. Also, how these approaches for emotion recognition using word importance for predicting emotions are demonstrated by the attention weight visualisation over the words. In a broader perspective, the findings from the thesis about gesture, posture and emotion recognition may be helpful in tasks like human-robot interaction~(HRI) and conversational artificial agents (such as Siri, Alexa). The communication is grounded with the symbolic and sub-symbolic cues of intent either from visual, audio or haptics. The understanding of intent is much dependent on the reasoning about the situational context. Emotion, i.e.\ speech and visual emotion, provides context to a situation, and it is a deciding factor in the response generation. Emotional intelligence and information from vision, audio and other modalities are essential for making human-human and human-robot communication more natural and feedback-driven
    corecore