2,989 research outputs found

    Radar-based Feature Design and Multiclass Classification for Road User Recognition

    Full text link
    The classification of individual traffic participants is a complex task, especially for challenging scenarios with multiple road users or under bad weather conditions. Radar sensors provide an - with respect to well established camera systems - orthogonal way of measuring such scenes. In order to gain accurate classification results, 50 different features are extracted from the measurement data and tested on their performance. From these features a suitable subset is chosen and passed to random forest and long short-term memory (LSTM) classifiers to obtain class predictions for the radar input. Moreover, it is shown why data imbalance is an inherent problem in automotive radar classification when the dataset is not sufficiently large. To overcome this issue, classifier binarization is used among other techniques in order to better account for underrepresented classes. A new method to couple the resulting probabilities is proposed and compared to others with great success. Final results show substantial improvements when compared to ordinary multiclass classificationComment: 8 pages, 6 figure

    Swarm-based Descriptor Combination and its Application for Image Classification

    Get PDF
    In this paper, we deal with the descriptor combination problem in image classification tasks. This problem refers to the definition of an appropriate combination of image content descriptors that characterize different visual properties, such as color, shape and texture. In this paper, we propose to model the descriptor combination as a swarm-based optimization problem, which finds out the set of parameters that maximizes the classification accuracy of the Optimum-Path Forest (OPF) classifier. In our model, a descriptor is seen as a pair composed of a feature extraction algorithm and a suitable distance function. Our strategy here is to combine distance scores defined by different descriptors, as well as to employ them to weight OPF edges, which connect samples in the feature space. An extensive evaluation of several swarm-based optimization techniques was performed. Experimental results have demonstrated the robustness of the proposed combination approach

    Swarm-based Descriptor Combination and its Application for Image Classification

    Get PDF
    In this paper, we deal with the descriptor combination problem in image classification tasks. This problem refers to the definition of an appropriate combination of image content descriptors that characterize different visual properties, such as color, shape and texture. In this paper, we propose to model the descriptor combination as a swarm-based optimization problem, which finds out the set of parameters that maximizes the classification accuracy of the Optimum-Path Forest (OPF) classifier. In our model,  a descriptor is seen as a pair composed of a feature extraction algorithm and a suitable distance function. Our strategy here is to combine distance scores defined by different descriptors, as well as to employ them to weight OPF edges, which connect samples in the feature space. An extensive evaluation of several swarm-based optimization techniques was performed. Experimental results have demonstrated the robustness of the proposed combination approach

    An Overview on Application of Machine Learning Techniques in Optical Networks

    Get PDF
    Today's telecommunication networks have become sources of enormous amounts of widely heterogeneous data. This information can be retrieved from network traffic traces, network alarms, signal quality indicators, users' behavioral data, etc. Advanced mathematical tools are required to extract meaningful information from these data and take decisions pertaining to the proper functioning of the networks from the network-generated data. Among these mathematical tools, Machine Learning (ML) is regarded as one of the most promising methodological approaches to perform network-data analysis and enable automated network self-configuration and fault management. The adoption of ML techniques in the field of optical communication networks is motivated by the unprecedented growth of network complexity faced by optical networks in the last few years. Such complexity increase is due to the introduction of a huge number of adjustable and interdependent system parameters (e.g., routing configurations, modulation format, symbol rate, coding schemes, etc.) that are enabled by the usage of coherent transmission/reception technologies, advanced digital signal processing and compensation of nonlinear effects in optical fiber propagation. In this paper we provide an overview of the application of ML to optical communications and networking. We classify and survey relevant literature dealing with the topic, and we also provide an introductory tutorial on ML for researchers and practitioners interested in this field. Although a good number of research papers have recently appeared, the application of ML to optical networks is still in its infancy: to stimulate further work in this area, we conclude the paper proposing new possible research directions

    Automatic Quality Estimation for ASR System Combination

    Get PDF
    Recognizer Output Voting Error Reduction (ROVER) has been widely used for system combination in automatic speech recognition (ASR). In order to select the most appropriate words to insert at each position in the output transcriptions, some ROVER extensions rely on critical information such as confidence scores and other ASR decoder features. This information, which is not always available, highly depends on the decoding process and sometimes tends to over estimate the real quality of the recognized words. In this paper we propose a novel variant of ROVER that takes advantage of ASR quality estimation (QE) for ranking the transcriptions at "segment level" instead of: i) relying on confidence scores, or ii) feeding ROVER with randomly ordered hypotheses. We first introduce an effective set of features to compensate for the absence of ASR decoder information. Then, we apply QE techniques to perform accurate hypothesis ranking at segment-level before starting the fusion process. The evaluation is carried out on two different tasks, in which we respectively combine hypotheses coming from independent ASR systems and multi-microphone recordings. In both tasks, it is assumed that the ASR decoder information is not available. The proposed approach significantly outperforms standard ROVER and it is competitive with two strong oracles that e xploit prior knowledge about the real quality of the hypotheses to be combined. Compared to standard ROVER, the abs olute WER improvements in the two evaluation scenarios range from 0.5% to 7.3%
    corecore