557 research outputs found
Hybrid training approaches to Hidden Markov Model-based acoustic models for automatic speech recognition
Doctor of Philosoph
Hidden Markov Models
Hidden Markov Models (HMMs), although known for decades, have made a big career nowadays and are still in state of development. This book presents theoretical issues and a variety of HMMs applications in speech recognition and synthesis, medicine, neurosciences, computational biology, bioinformatics, seismology, environment protection and engineering. I hope that the reader will find this book useful and helpful for their own research
GA-BW based HMM in Brain Image Segmentation
Image segmentation is an important preprocessing step in a sophisticated and complex image processing algorithm. In segmenting real-world images, the cost of misclassification could depend on the true class. For example, in a two-class (negative or positive class) problem, the cost of misclassifying positive to negative class could not be equal to that of misclassifying negative to positive class. However, existing algorithms do not take into account the unequal misclassification cost. Here, we introduce a procedure to minimize the misclassification cost with class-dependent cost. The procedure assumes the hidden Markov model (HMM) which has been popularly used for image segmentation in recent years. In this proposed method  Baum-Welch (B-W) Algorithm is used to calculate the HMM model parameters. However, the B-W algorithm uses an initial random guess of the parameters, therefore, after convergence the output tends to be close to this initial value of the algorithm, which is not necessarily the global optimum of the model parameters. To achieve an optimum result  Genetic Algorithm (GA) combined with Baum-Welch (GA-BW) is proposed  and the idea is to use GA exploration ability to obtain the optimal parameters within the solution space. By using this proposed method, brain tumor region and non tumor region is segmented and classified within the state of art.
GA-BW based HMM in Brain Image Segmentation
Image segmentation is an important preprocessing step in a sophisticated and complex image processing algorithm. In segmenting real-world images, the cost of misclassification could depend on the true class. For example, in a two-class (negative or positive class) problem, the cost of misclassifying positive to negative class could not be equal to that of misclassifying negative to positive class. However, existing algorithms do not take into account the unequal misclassification cost. Here, we introduce a procedure to minimize the misclassification cost with class-dependent cost. The procedure assumes the hidden Markov model (HMM) which has been popularly used for image segmentation in recent years. In this proposed method  Baum-Welch (B-W) Algorithm is used to calculate the HMM model parameters. However, the B-W algorithm uses an initial random guess of the parameters, therefore, after convergence the output tends to be close to this initial value of the algorithm, which is not necessarily the global optimum of the model parameters. To achieve an optimum result  Genetic Algorithm (GA) combined with Baum-Welch (GA-BW) is proposed  and the idea is to use GA exploration ability to obtain the optimal parameters within the solution space. By using this proposed method, brain tumor region and non tumor region is segmented and classified within the state of art.
Automatic transcription of polyphonic music exploiting temporal evolution
PhDAutomatic music transcription is the process of converting an audio recording
into a symbolic representation using musical notation. It has numerous applications
in music information retrieval, computational musicology, and the
creation of interactive systems. Even for expert musicians, transcribing polyphonic
pieces of music is not a trivial task, and while the problem of automatic
pitch estimation for monophonic signals is considered to be solved, the creation
of an automated system able to transcribe polyphonic music without setting
restrictions on the degree of polyphony and the instrument type still remains
open.
In this thesis, research on automatic transcription is performed by explicitly
incorporating information on the temporal evolution of sounds. First efforts address
the problem by focusing on signal processing techniques and by proposing
audio features utilising temporal characteristics. Techniques for note onset and
offset detection are also utilised for improving transcription performance. Subsequent
approaches propose transcription models based on shift-invariant probabilistic
latent component analysis (SI-PLCA), modeling the temporal evolution
of notes in a multiple-instrument case and supporting frequency modulations in
produced notes. Datasets and annotations for transcription research have also
been created during this work. Proposed systems have been privately as well as
publicly evaluated within the Music Information Retrieval Evaluation eXchange
(MIREX) framework. Proposed systems have been shown to outperform several
state-of-the-art transcription approaches.
Developed techniques have also been employed for other tasks related to music
technology, such as for key modulation detection, temperament estimation,
and automatic piano tutoring. Finally, proposed music transcription models
have also been utilized in a wider context, namely for modeling acoustic scenes
Thirty Years of Machine Learning: The Road to Pareto-Optimal Wireless Networks
Future wireless networks have a substantial potential in terms of supporting
a broad range of complex compelling applications both in military and civilian
fields, where the users are able to enjoy high-rate, low-latency, low-cost and
reliable information services. Achieving this ambitious goal requires new radio
techniques for adaptive learning and intelligent decision making because of the
complex heterogeneous nature of the network structures and wireless services.
Machine learning (ML) algorithms have great success in supporting big data
analytics, efficient parameter estimation and interactive decision making.
Hence, in this article, we review the thirty-year history of ML by elaborating
on supervised learning, unsupervised learning, reinforcement learning and deep
learning. Furthermore, we investigate their employment in the compelling
applications of wireless networks, including heterogeneous networks (HetNets),
cognitive radios (CR), Internet of things (IoT), machine to machine networks
(M2M), and so on. This article aims for assisting the readers in clarifying the
motivation and methodology of the various ML algorithms, so as to invoke them
for hitherto unexplored services as well as scenarios of future wireless
networks.Comment: 46 pages, 22 fig
Recommended from our members
PATTERNA: transcriptome-wide search for functional RNA elements via structural data signatures.
Establishing a link between RNA structure and function remains a great challenge in RNA biology. The emergence of high-throughput structure profiling experiments is revolutionizing our ability to decipher structure, yet principled approaches for extracting information on structural elements directly from these data sets are lacking. We present PATTERNA, an unsupervised pattern recognition algorithm that rapidly mines RNA structure motifs from profiling data. We demonstrate that PATTERNA detects motifs with an accuracy comparable to commonly used thermodynamic models and highlight its utility in automating data-directed structure modeling from large data sets. PATTERNA is versatile and compatible with diverse profiling techniques and experimental conditions
- …