Search CORE

1,922 research outputs found

Data Balancing for Efficient Training of Hybrid ANN/HMM Automatic Speech Recognition Systems

Author: Díaz de María Fernando
García-Moral Ana I.
Peláez Moreno Carmen
Solera Ureña R.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2011
Field of study

Hybrid speech recognizers, where the estimation of the emission pdf of the states of Hidden Markov Models (HMMs), usually carried out using Gaussian Mixture Models (GMMs), is substituted by Artificial Neural Networks (ANNs) have several advantages over the classical systems. However, to obtain performance improvements, the computational requirements are heavily increased because of the need to train the ANN. Departing from the observation of the remarkable skewness of speech data, this paper proposes sifting out the training set and balancing the amount of samples per class. With this method the training time has been reduced 18 times while obtaining performances similar to or even better than those with the whole database, especially in noisy environments. However, the application of these reduced sets is not straightforward. To avoid the mismatch between training and testing conditions created by the modification of the distribution of the training data, a proper scaling of the a posteriori probabilities obtained and a resizing of the context window need to be performed as demonstrated in the paper.This work was supported in part by the regional grant (Comunidad Autónoma de Madrid-UC3M) CCG06-UC3M/TIC-0812 and in part by a project funded by the Spanish Ministry of Science and Innovation (TEC 2008-06382).Publicad

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Carlos III de Madrid e-Archivo

How to shift bias: Lessons from the Baldwin effect

Author: Turney Peter D.
Publication venue
Publication date: 01/01/1996
Field of study

An inductive learning algorithm takes a set of data as input and generates a hypothesis as output. A set of data is typically consistent with an infinite number of hypotheses; therefore, there must be factors other than the data that determine the output of the learning algorithm. In machine learning, these other factors are called the bias of the learner. Classical learning algorithms have a fixed bias, implicit in their design. Recently developed learning algorithms dynamically adjust their bias as they search for a hypothesis. Algorithms that shift bias in this manner are not as well understood as classical algorithms. In this paper, we show that the Baldwin effect has implications for the design and analysis of bias shifting algorithms. The Baldwin effect was proposed in 1896, to explain how phenomena that might appear to require Lamarckian evolution (inheritance of acquired characteristics) can arise from purely Darwinian evolution. Hinton and Nowlan presented a computational model of the Baldwin effect in 1987. We explore a variation on their model, which we constructed explicitly to illustrate the lessons that the Baldwin effect has for research in bias shifting algorithms. The main lesson is that it appears that a good strategy for shift of bias in a learning algorithm is to begin with a weak bias and gradually shift to a strong bias

CiteSeerX

CogPrints Cognitive Sciences Eprint Archive

SMOTE for Learning from Imbalanced Data: Progress and Challenges, Marking the 15-year Anniversary

Author: Chawla Nitesh V.
Fernández Hilario Alberto Luis
García López Salvador
Herrera Triguero Francisco
Publication venue: 'AI Access Foundation'
Publication date: 01/01/2018
Field of study

The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is considered \de facto" standard in the framework of learning from imbalanced data. This is due to its simplicity in the design of the procedure, as well as its robustness when applied to di erent type of problems. Since its publication in 2002, SMOTE has proven successful in a variety of applications from several di erent domains. SMOTE has also inspired several approaches to counter the issue of class imbalance, and has also signi cantly contributed to new supervised learning paradigms, including multilabel classi cation, incremental learning, semi-supervised learning, multi-instance learning, among others. It is standard benchmark for learning from imbalanced data. It is also featured in a number of di erent software packages | from open source to commercial. In this paper, marking the fteen year anniversary of SMOTE, we re ect on the SMOTE journey, discuss the current state of a airs with SMOTE, its applications, and also identify the next set of challenges to extend SMOTE for Big Data problems.This work have been partially supported by the Spanish Ministry of Science and Technology under projects TIN2014-57251-P, TIN2015-68454-R and TIN2017-89517-P; the Project 887 BigDaP-TOOLS - Ayudas Fundaci on BBVA a Equipos de Investigaci on Cient ca 2016; and the National Science Foundation (NSF) Grant IIS-1447795

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositorio Institucional Universidad de Granada

Machine learning, medical diagnosis, and biomedical engineering research - commentary

Author: Foster Kenneth R.
Koprowski Robert
Skufca Joseph D.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

A large number of papers are appearing in the biomedical engineering literature that describe the use of machine learning techniques to develop classifiers for detection or diagnosis of disease. However, the usefulness of this approach in developing clinically validated diagnostic techniques so far has been limited and the methods are prone to overfitting and other problems which may not be immediately apparent to the investigators. This commentary is intended to help sensitize investigators as well as readers and reviewers of papers to some potential pitfalls in the development of classifiers, and suggests steps that researchers can take to help avoid these problems. Building classifiers should be viewed not simply as an add-on statistical analysis, but as part and parcel of the experimental process. Validation of classifiers for diagnostic applications should be considered as part of a much larger process of establishing the clinical validity of the diagnostic technique

Springer - Publisher Connector

PubMed Central

Repozytorium Uniwersytetu Śląskiego RE-BUŚ

TEACHING OLD CALIPERS NEW TRICKS: USING CRANIOMETRICS FOR ANCESTRY ADMIXTURE ESTIMATION VIA FUZZY MATH

Author: Carnahan Kristi
Publication venue: The Aquila Digital Community
Publication date: 01/05/2022
Field of study

Cranial measurements have been a cornerstone of physical anthropology since its formation as a discipline in the early 1900s. However, most other ancestry determination methods come with a significant epistemological issue: they differentiate individuals into discrete categories without accounting for the issue of admixture. Advances in data mining and analysis techniques can now be used to help resolve this issue through soft computing, also known as “fuzzy math”. This type of advanced computational math requires specialized knowledge in computer programming, statistics, and data analysis techniques unless one is using computer programs specially designed to run these analyses. This project compiled a database from multiple open-source craniometrics data and utilized prepared packages within the R statistical environment to find a valid soft computing method for fuzzy ancestry determination that does not require extensive knowledge in computer programming or data mining. Exploration of database demographics notes an excess of White-identified individuals, and when tested, this demographic skew impacts the ability of the given package to return valid results. The package chosen was valid using the compiled database. Exploration of causes for the invalid results, including a significant White skew in the underlying database due to accessibility of metric databases, overfitting, and the inherent issues of admixture on craniometric research, are explored, and future directions discussed

Aquila Digital Community

Detecting unknown attacks in wireless sensor networks that contain mobile nodes

Author: Banković
Banković
Cai
Campo
David Fraga
Ganeriwal
Greenberg
José M. Moya
Juan Carlos Vallejo
Loo
Muñoz
Rieck
Studený
Varadhan
Wallenta
Zhang
Zorana Banković
Publication venue: 'MDPI AG'
Publication date: 01/01/2012
Field of study

As wireless sensor networks are usually deployed in unattended areas, security policies cannot be updated in a timely fashion upon identification of new attacks. This gives enough time for attackers to cause significant damage. Thus, it is of great importance to provide protection from unknown attacks. However, existing solutions are mostly concentrated on known attacks. On the other hand, mobility can make the sensor network more resilient to failures, reactive to events, and able to support disparate missions with a common set of sensors, yet the problem of security becomes more complicated. In order to address the issue of security in networks with mobile nodes, we propose a machine learning solution for anomaly detection along with the feature extraction process that tries to detect temporal and spatial inconsistencies in the sequences of sensed values and the routing paths used to forward these values to the base station. We also propose a special way to treat mobile nodes, which is the main novelty of this work. The data produced in the presence of an attacker are treated as outliers, and detected using clustering techniques. These techniques are further coupled with a reputation system, in this way isolating compromised nodes in timely fashion. The proposal exhibits good performances at detecting and confining previously unseen attacks, including the cases when mobile nodes are compromised

CiteSeerX

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

PubMed Central

Archivo Digital UPM

Benelearn 2005: Annual Machine Learning Conference of Belgium and the Netherlands:CTIT Proceedings of the 14th annual Machine Learning Conference of Belgium and the Netherlands

Author
Publication venue: Centre for Telematics and Information Technology (CTIT)
Publication date: 01/02/2005
Field of study

University of Twente Research Information