Search CORE

423 research outputs found

Automatic Identity Recognition Using Speech Biometric

Author: Abushariah Mohammad A. M.
Alqudah Assal A. M.
Publication venue: 'European Scientific Institute, ESI'
Publication date: 28/04/2016
Field of study

Biometric technology refers to the automatic identification of a person using physical or behavioral traits associated with him/her. This technology can be an excellent candidate for developing intelligent systems such as speaker identification, facial recognition, signature verification...etc. Biometric technology can be used to design and develop automatic identity recognition systems, which are highly demanded and can be used in banking systems, employee identification, immigration, e-commerce…etc. The first phase of this research emphasizes on the development of automatic identity recognizer using speech biometric technology based on Artificial Intelligence (AI) techniques provided in MATLAB. For our phase one, speech data is collected from 20 (10 male and 10 female) participants in order to develop the recognizer. The speech data include utterances recorded for the English language digits (0 to 9), where each participant recorded each digit 3 times, which resulted in a total of 600 utterances for all participants. For our phase two, speech data is collected from 100 (50 male and 50 female) participants in order to develop the recognizer. The speech data is divided into text-dependent and text-independent data, whereby each participant selected his/her full name and recorded it 30 times, which makes up the text-independent data. On the other hand, the text-dependent data is represented by a short Arabic language story that contains 16 sentences, whereby every sentence was recorded by every participant 5 times. As a result, this new corpus contains 3000 (30 utterances * 100 speakers) sound files that represent the text-independent data using their full names and 8000 (16 sentences * 5 utterances * 100 speakers) sound files that represent the text-dependent data using the short story. For the purpose of our phase one of developing the automatic identity recognizer using speech, the 600 utterances have undergone the feature extraction and feature classification phases. The speech-based automatic identity recognition system is based on the most dominating feature extraction technique, which is known as the Mel-Frequency Cepstral Coefficient (MFCC). For feature classification phase, the system is based on the Vector Quantization (VQ) algorithm. Based on our experimental results, the highest accuracy achieved is 76%. The experimental results have shown acceptable performance, but can be improved further in our phase two using larger speech data size and better performance classification techniques such as the Hidden Markov Model (HMM)

European Scientific Journal, ESJ

European Scientific Journal (European Scientific Institute)

A system for learning statistical motion patterns

Author: Fu Z.
Hu W.
Maybank Stephen J.
Tan T.
Xiao X.
Xie D.
Publication venue: IEEE Computer Society
Publication date: 01/01/2006
Field of study

Analysis of motion patterns is an effective approach for anomaly detection and behavior prediction. Current approaches for the analysis of motion patterns depend on known scenes, where objects move in predefined ways. It is highly desirable to automatically construct object motion patterns which reflect the knowledge of the scene. In this paper, we present a system for automatically learning motion patterns for anomaly detection and behavior prediction based on a proposed algorithm for robustly tracking multiple objects. In the tracking algorithm, foreground pixels are clustered using a fast accurate fuzzy k-means algorithm. Growing and prediction of the cluster centroids of foreground pixels ensure that each cluster centroid is associated with a moving object in the scene. In the algorithm for learning motion patterns, trajectories are clustered hierarchically using spatial and temporal information and then each motion pattern is represented with a chain of Gaussian distributions. Based on the learned statistical motion patterns, statistical methods are used to detect anomalies and predict behaviors. Our system is tested using image sequences acquired, respectively, from a crowded real traffic scene and a model traffic scene. Experimental results show the robustness of the tracking algorithm, the efficiency of the algorithm for learning motion patterns, and the encouraging performance of algorithms for anomaly detection and behavior prediction

CiteSeerX

Crossref

Southampton (e-Prints Soton)

Birkbeck Institutional Research Online

A system for learning statistical motion patterns

Author: Fu Z.
Hu W.
Maybank Stephen J.
Tan T.
Xiao X.
Xie D.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2006
Field of study

CiteSeerX

Crossref

Birkbeck Institutional Research Online

Intrusion detection for in-vehicle communication networks: An unsupervised kohonen SOM approach

Author: Barletta V. S.
Caivano D.
Nannavecchia A.
Scalera M.
Publication venue: 'MDPI AG'
Publication date: 01/01/2020
Field of study

The diffusion of embedded and portable communication devices on modern vehicles entails new security risks since in-vehicle communication protocols are still insecure and vulnerable to attacks. Increasing interest is being given to the implementation of automotive cybersecurity systems. In this work we propose an efficient and high-performing intrusion detection system based on an unsupervised Kohonen Self-Organizing Map (SOM) network, to identify attack messages sent on a Controller Area Network (CAN) bus. The SOM network found a wide range of applications in intrusion detection because of its features of high detection rate, short training time, and high versatility. We propose to extend the SOM network to intrusion detection on in-vehicle CAN buses. Many hybrid approaches were proposed to combine the SOM network with other clustering methods, such as the k-means algorithm, in order to improve the accuracy of the model. We introduced a novel distance-based procedure to integrate the SOM network with the K-means algorithm and compared it with the traditional procedure. The models were tested on a car hacking dataset concerning traffic data messages sent on a CAN bus, characterized by a large volume of traffic with a low number of features and highly imbalanced data distribution. The experimentation showed that the proposed method greatly improved detection accuracy over the traditional approach

Archivio istituzionale della ricerca - Università di Bari

Semi-continuous hidden Markov models for speech recognition

Author: Huang Xuedong
Publication venue: The University of Edinburgh
Publication date: 01/01/1989
Field of study

Edinburgh Research Archive

Convergence of Smoothed Empirical Measures with Applications to Entropy Estimation

Author: Goldfeld Ziv
Greenewald Kristjan
Polyanskiy Yury
Weed Jonathan
Publication venue
Publication date: 01/05/2020
Field of study

This paper studies convergence of empirical measures smoothed by a Gaussian kernel. Specifically, consider approximating

P\ast\mathcal{N}_\sigma

, for

\mathcal{N}_\sigma\triangleq\mathcal{N}(0,\sigma^2 \mathrm{I}_d)

, by

\hat{P}_n\ast\mathcal{N}_\sigma

, where

\hat{P}_n

is the empirical measure, under different statistical distances. The convergence is examined in terms of the Wasserstein distance, total variation (TV), Kullback-Leibler (KL) divergence, and

\chi^2

-divergence. We show that the approximation error under the TV distance and 1-Wasserstein distance (

\mathsf{W}_1

) converges at rate

e^{O(d)}n^{-\frac{1}{2}}

in remarkable contrast to a typical

n^{-\frac{1}{d}}

rate for unsmoothed

\mathsf{W}_1

(and

d\ge 3

). For the KL divergence, squared 2-Wasserstein distance (

\mathsf{W}_2^2

), and

\chi^2

-divergence, the convergence rate is

e^{O(d)}n^{-1}

, but only if

P

achieves finite input-output

\chi^2

mutual information across the additive white Gaussian noise channel. If the latter condition is not met, the rate changes to

\omega(n^{-1})

for the KL divergence and

\mathsf{W}_2^2

, while the

\chi^2

-divergence becomes infinite - a curious dichotomy. As a main application we consider estimating the differential entropy

h(P\ast\mathcal{N}_\sigma)

in the high-dimensional regime. The distribution

P

is unknown but

n

i.i.d samples from it are available. We first show that any good estimator of

h(P\ast\mathcal{N}_\sigma)

must have sample complexity that is exponential in

d

. Using the empirical approximation results we then show that the absolute-error risk of the plug-in estimator converges at the parametric rate

e^{O(d)}n^{-\frac{1}{2}}

, thus establishing the minimax rate-optimality of the plug-in. Numerical results that demonstrate a significant empirical superiority of the plug-in approach to general-purpose differential entropy estimators are provided.Comment: arXiv admin note: substantial text overlap with arXiv:1810.1158

arXiv.org e-Print Archive

DSpace@MIT

DART: Distribution Aware Retinal Transform for Event-based Cameras

Author: Orchard Garrick
Ramesh Bharath
Thi Ngoc Anh Le
Xiang Cheng
Yang Hong
Zhang Shihao
Publication venue
Publication date: 14/11/2018
Field of study

We introduce a generic visual descriptor, termed as distribution aware retinal transform (DART), that encodes the structural context using log-polar grids for event cameras. The DART descriptor is applied to four different problems, namely object classification, tracking, detection and feature matching: (1) The DART features are directly employed as local descriptors in a bag-of-features classification framework and testing is carried out on four standard event-based object datasets (N-MNIST, MNIST-DVS, CIFAR10-DVS, NCaltech-101). (2) Extending the classification system, tracking is demonstrated using two key novelties: (i) For overcoming the low-sample problem for the one-shot learning of a binary classifier, statistical bootstrapping is leveraged with online learning; (ii) To achieve tracker robustness, the scale and rotation equivariance property of the DART descriptors is exploited for the one-shot learning. (3) To solve the long-term object tracking problem, an object detector is designed using the principle of cluster majority voting. The detection scheme is then combined with the tracker to result in a high intersection-over-union score with augmented ground truth annotations on the publicly available event camera dataset. (4) Finally, the event context encoded by DART greatly simplifies the feature correspondence problem, especially for spatio-temporal slices far apart in time, which has not been explicitly tackled in the event-based vision domain.Comment: 12 pages, revision submitted to TPAMI in Nov 201

arXiv.org e-Print Archive

ScholarBank@NUS

Computational Approaches to Drug Profiling and Drug-Protein Interactions

Author: Scott Oliver B.
Publication venue: UCL (University College London)
Publication date: 28/03/2023
Field of study

Despite substantial increases in R&D spending within the pharmaceutical industry, denovo drug design has become a time-consuming endeavour. High attrition rates led to a long period of stagnation in drug approvals. Due to the extreme costs associated with introducing a drug to the market, locating and understanding the reasons for clinical failure is key to future productivity. As part of this PhD, three main contributions were made in this respect. First, the web platform, LigNFam enables users to interactively explore similarity relationships between ‘drug like’ molecules and the proteins they bind. Secondly, two deep-learning-based binding site comparison tools were developed, competing with the state-of-the-art over benchmark datasets. The models have the ability to predict offtarget interactions and potential candidates for target-based drug repurposing. Finally, the open-source ScaffoldGraph software was presented for the analysis of hierarchical scaffold relationships and has already been used in multiple projects, including integration into a virtual screening pipeline to increase the tractability of ultra-large screening experiments. Together, and with existing tools, the contributions made will aid in the understanding of drug-protein relationships, particularly in the fields of off-target prediction and drug repurposing, helping to design better drugs faster

UCL Discovery