5 research outputs found
Data-Driven Supervised Learning for Life Science Data
Life science data are often encoded in a non-standard way by means of alpha-numeric sequences, graph representations, numerical vectors of variable length, or other formats. Domain-specific or data-driven similarity measures like alignment functions have been employed with great success. The vast majority of more complex data analysis algorithms require fixed-length vectorial input data, asking for substantial preprocessing of life science data. Data-driven measures are widely ignored in favor of simple encodings. These preprocessing steps are not always easy to perform nor particularly effective, with a potential loss of information and interpretability. We present some strategies and concepts of how to employ data-driven similarity measures in the life science context and other complex biological systems. In particular, we show how to use data-driven similarity measures effectively in standard learning algorithms
Feature Selection and Classifier Development for Radio Frequency Device Identification
The proliferation of simple and low-cost devices, such as IEEE 802.15.4 ZigBee and Z-Wave, in Critical Infrastructure (CI) increases security concerns. Radio Frequency Distinct Native Attribute (RF-DNA) Fingerprinting facilitates biometric-like identification of electronic devices emissions from variances in device hardware. Developing reliable classifier models using RF-DNA fingerprints is thus important for device discrimination to enable reliable Device Classification (a one-to-many looks most like assessment) and Device ID Verification (a one-to-one looks how much like assessment). AFITs prior RF-DNA work focused on Multiple Discriminant Analysis/Maximum Likelihood (MDA/ML) and Generalized Relevance Learning Vector Quantized Improved (GRLVQI) classifiers. This work 1) introduces a new GRLVQI-Distance (GRLVQI-D) classifier that extends prior GRLVQI work by supporting alternative distance measures, 2) formalizes a framework for selecting competing distance measures for GRLVQI-D, 3) introducing response surface methods for optimizing GRLVQI and GRLVQI-D algorithm settings, 4) develops an MDA-based Loadings Fusion (MLF) Dimensional Reduction Analysis (DRA) method for improved classifier-based feature selection, 5) introduces the F-test as a DRA method for RF-DNA fingerprints, 6) provides a phenomenological understanding of test statistics and p-values, with KS-test and F-test statistic values being superior to p-values for DRA, and 7) introduces quantitative dimensionality assessment methods for DRA subset selection
Metric Learning for Prototype-Based Classification
Biehl M, Hammer B, Schneider P, Villmann T. Metric learning for prototype based classification. In: Bianchini M, Maggini M, Scarselli F, eds. Innovations in Neural Information – Paradigms and Applications. Studies in Computational Intelligence, 247. Berlin: Springer; 2009: 183-199
Innovations in Neural Information Paradigms and Applications - Studies in Computational Intelligence.
This research book presents some of the most recent advances in neural information processing models including both theoretical concepts and practical applications. The contributions include: Advances in neural information processing paradigms; Self organising structures; Unsupervised and supervised learning of graph domains; Neural grammar networks; Model complexity in neural network learning; Regularization and suboptimal solutions in neural learning; Neural networks for the classification of vectors, sequences and graphs; Metric learning for prototype-based classification; Ensembles of neural networks; Fraud detection using machine learning; Computational modelling of neural multimodal integration.
This book is directed to the researchers, graduate students, professors and practitioner interested in recent advances in neural information processing paradigms and applications