60 research outputs found

Kalman tracking of linear predictor and harmonic noise models for noisy speech enhancement

Author: Ben Milner
Boll
Chen
Deller
Ephraim
Esfandiar Zavarehei
Friedman
Griffin
Hansen
Ioannis Andrianakis
Jonathan Darch
Kalman
Lim
Paul White
Qin Yan
Rentzos
Saeed Vaseghi
Sameti
Secrest
Seltzer
Stylianou
Tucker
Turunen
Vaseghi
Weber
Yan
Publication venue: Elsevier BV
Publication date: 01/01/2008

This paper presents a speech enhancement method based on tracking and denoising the formants of a linear prediction (LP) model of the spectral envelope of speech and the parameters of a harmonic noise model (HNM) of its excitation. The main advantages of tracking and denoising the prominent energy contours of speech are the efficient use of the spectral and temporal structures of successive speech frames and the mitigation of the processing artefact known as ‘musical noise’ or ‘musical tones’. The formant-tracking linear prediction (FTLP) model estimation consists of three stages: (a) speech pre-cleaning based on spectral amplitude estimation, (b) formant tracking across successive speech frames using the Viterbi method, and (c) Kalman filtering of the formant trajectories across successive speech frames. The HNM parameters for the excitation signal comprise the voiced/unvoiced decision, the fundamental frequency, the harmonics’ amplitudes and the variance of the noise component of the excitation. A frequency-domain pitch extraction method is proposed that searches for the peak signal-to-noise ratios (SNRs) at the harmonics. For each speech frame, several pitch candidates are calculated. An estimate of the pitch trajectory across successive frames is obtained using a Viterbi decoder. The trajectories of the noisy excitation harmonics across successive speech frames are modelled and denoised using Kalman filters. The proposed method is used to deconstruct noisy speech, denoise its model parameters and then reconstitute speech from its cleaned parts. Experimental evaluations show the performance gains of the formant tracking, pitch extraction and noise reduction stages.
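
To illustrate the idea of Kalman filtering a parameter trajectory across frames, here is a minimal sketch in Python. It assumes a scalar random-walk state model with fixed noise variances; the function name `kalman_smooth_track` and the default variances are hypothetical, and the state model actually used in the paper is not given in this abstract.

```python
def kalman_smooth_track(observations, process_var=100.0, meas_var=2500.0):
    """Filter a noisy 1-D trajectory (e.g. one formant frequency per frame).

    Assumed model (illustrative only, not taken from the paper):
        state:       x[t] = x[t-1] + w,  w ~ N(0, process_var)
        observation: z[t] = x[t]   + v,  v ~ N(0, meas_var)
    """
    if not observations:
        return []

    estimate = observations[0]   # initialise the state with the first frame
    error_var = meas_var         # initial uncertainty of that estimate
    filtered = [estimate]

    for z in observations[1:]:
        # Predict: a random walk keeps the estimate, uncertainty grows.
        error_var += process_var
        # Update: blend prediction and new observation via the Kalman gain.
        gain = error_var / (error_var + meas_var)
        estimate += gain * (z - estimate)
        error_var *= (1.0 - gain)
        filtered.append(estimate)

    return filtered


if __name__ == "__main__":
    # Hypothetical per-frame formant estimates in Hz with one outlier frame.
    noisy = [500.0, 500.0, 900.0, 500.0]
    print(kalman_smooth_track(noisy))
```

With these assumed variances the filter trusts the smooth trajectory more than any single frame, so an isolated outlier is pulled back towards its neighbours rather than followed exactly.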

Crossref

Southampton (e-Prints Soton)

University of East Anglia digital repository

Pre-processing of Speech Signals for Robust Parameter Estimation

Author: Esquivel Jaramillo Alfredo
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2021

VBN

Digital Linearization of High Capacity and Spectrally Efficient Direct Detection Optical Transceivers

Author: Li Zhe
Publication venue: UCL (University College London)
Publication date: 28/12/2017

Metropolitan area networks are experiencing unprecedented traffic growth. The provision of information and entertainment supported by cloud services, broadband video and mobile technologies such as long-term evolution (LTE) and 5G is creating a rapidly increasing demand for bandwidth. Although wavelength division multiplexing (WDM) architectures have been introduced into metro transport networks to provide significant savings over single-channel systems, higher data rates (100 Gb/s and beyond) for each WDM channel are urgently required to cope with the ever-increasing traffic growth. In comparison to dual-polarization digital coherent transceivers, single-polarization and single photodiode-based direct-detection (DD) transceivers may be favourable for metropolitan, inter-data centre and access applications due to their use of a simple and low-cost optical hardware structure. Single sideband (SSB) quadrature amplitude modulation (QAM) subcarrier modulation (SCM) is a promising signal format to achieve high information spectral density (ISD). However, due to the nonlinear effect termed signal-signal beat interference (SSBI) caused by the square-law detection, the performance of such SSB SCM DD systems is severely degraded. Therefore, it is essential to develop effective and low-complexity linearization techniques to eliminate the SSBI penalty and improve the performance of such transceivers. Extensive studies on SSB SCM DD transceivers employing a number of novel digital linearization techniques to support high capacity (≥ 100 Gb/s per channel) and spectrally-efficient (net ISD > 2 b/s/Hz) WDM transmission covering metropolitan reach scenarios (up to 240 km) are described in detail in this thesis. Digital modulation formats that can be used in DD links and the corresponding transceiver configurations are first reviewed, from which the SSB SCM signalling format is identified as the most promising format to achieve high data rates and ISDs.
Following this, technical details of the digital linearization approaches considered in the thesis (iterative SSBI cancellation, single-stage linearization filter and simplified non-iterative SSBI cancellation, two-stage linearization filter, Kramers-Kronig scheme) are presented. Their compensation performance in a dispersion pre-compensated (Tx-EDC) 112 Gb/s per channel 35 GHz-spaced WDM SSB 16-QAM Nyquist-SCM DD system transmitting over up to 240 km of standard single-mode fibre (SSMF) is assessed. Net ISDs of up to 3.18 b/s/Hz are achieved. Moreover, we also show that, with the use of effective digital linearization techniques, further simplification of the DD transceivers can be realized by moving electronic dispersion compensation from the transmitter to the receiver without sacrificing performance. The optical ISD limit of the SSB SCM DD system is finally explored through experiments with higher-order modulation formats combined with effective digital linearization techniques. 168 Gb/s per channel WDM 64-QAM signals were successfully transmitted over 80 km, achieving a record net optical ISD of 4.54 b/s/Hz. Finally, areas for further research are identified.
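
The SSBI term named in this abstract follows directly from square-law detection. As a hedged sketch in generic notation (the symbols are assumptions, not taken from the thesis: a real optical carrier amplitude A, a complex SSB subcarrier signal s(t), and a detected photocurrent I(t), with noise and responsivity scaling ignored):

```latex
I(t) \;\propto\; \lvert A + s(t) \rvert^{2}
     \;=\; A^{2} \;+\; 2A\,\operatorname{Re}\{s(t)\} \;+\; \lvert s(t) \rvert^{2}
```

The first term is a DC offset, the second carries the wanted signal, and the third, |s(t)|^2, is the signal-signal beat interference. In this simplified view, a cancellation scheme forms an estimate of s(t), reconstructs its squared magnitude and subtracts it from I(t), optionally repeating the step to refine the estimate.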

UCL Discovery

Diffusion MRI tractography for oncological neurosurgery planning: Clinical research prototype

Author: Krahulec Daniel
Publication venue: Eindhoven University of Technology
Publication date: 04/04/2023

Pure OAI Repository

Heterogeneous LTE/Wi-Fi architecture for intelligent transportation systems

Author: Sadek Noha
Publication venue: AUC Knowledge Fountain
Publication date: 01/06/2015

Intelligent Transportation Systems (ITS) make use of advanced technologies to enhance road safety and improve traffic efficiency. It is anticipated that ITS will play a vital future role in improving traffic efficiency, safety and comfort and in reducing emissions. To help passengers travel safely, efficiently and conveniently, several application requirements have to be met simultaneously. In addition to the delivery of regular traffic and safety information, vehicular networks have recently been required to support infotainment services. Previous vehicular network designs and architectures do not satisfy this increasing traffic demand as they are set up for either voice or data traffic, which is not suitable for the transfer of vehicular traffic. This new requirement is one of the key drivers behind the need for new mobile wireless broadband architectures and technologies. For this purpose, this thesis proposes and investigates a heterogeneous IEEE 802.11 and LTE vehicular system that supports both infotainment and ITS traffic control data. IEEE 802.11g is used for V2V communications and as an on-board access network, while LTE is used for V2I communications. A simulation-based performance study is conducted to validate the feasibility of the proposed system in an urban vehicular environment. The system performance is evaluated in terms of data loss, data rate, delay and jitter. Several simulation scenarios are performed and evaluated. In the V2I-only scenario, the delay, jitter and data drops for both ITS and video traffic are within the acceptable limits, as defined by vehicular application requirements. Although video packet drops tend to increase during handover from one eNodeB to another, the attainable data loss rate is still below the defined benchmarks.
In the integrated V2V-V2I scenario, data loss in uplink ITS traffic was initially observed, so a burst communication technique is applied to prevent packet losses in the critical uplink ITS traffic. A quantitative analysis is performed to determine the number of packets per burst and the inter-packet and inter-burst intervals. It is found that a substantial improvement is achieved using a two-packet burst, where no packets are lost in the uplink direction. The delay, jitter and data drops for both uplink and downlink ITS traffic, and video traffic are below the benchmarks of vehicular applications. Thus, the results indicate that the proposed heterogeneous system offers acceptable performance that meets the requirements of the different vehicular applications. All simulations are conducted on OPNET Network Modeler and results are subjected to a 95% confidence analysis.

AUC Knowledge Fountain (American Univ. in Cairo)

Statistical single channel source separation

Author: Darsono Abd Majid
Publication venue: Newcastle University
Publication date: 01/01/2012

PhD Thesis. Single channel source separation (SCSS) is one of the challenging fields in signal processing and has various significant applications. Unlike conventional SCSS methods, which were based on a linear instantaneous model, this research sets out to investigate single channel separation in two types of mixture: nonlinear instantaneous mixtures and linear convolutive mixtures. For nonlinear SCSS in instantaneous mixtures, this research proposes a novel solution based on a two-stage process that consists of a Gaussianization transform, which efficiently compensates for the nonlinear distortion, followed by a maximum likelihood estimator to perform source separation. For linear SCSS in convolutive mixtures, this research proposes new methods based on nonnegative matrix factorization, which decomposes a mixture into two-dimensional convolution factor matrices that represent the spectral basis and temporal code. The proposed factorization considers the convolutive mixing in the decomposition by introducing frequency constrained parameters in the model. The method aims to separate the mixture into its constituent spectral-temporal source components while alleviating the effect of convolutive mixing. In addition, a family of Itakura-Saito divergences has been developed as a cost function, which brings the beneficial property of scale invariance. Two new statistical techniques are proposed, namely, an Expectation-Maximisation (EM) based algorithm framework which maximizes the log-likelihood of a mixed signal, and a maximum a posteriori approach which maximises the joint probability of a mixed signal using multiplicative update rules. To further improve this research work, a novel method that incorporates adaptive sparseness into the solution has been proposed to resolve the ambiguity and hence improve the algorithm performance. The theoretical foundation of the proposed solutions has been rigorously developed and discussed in detail.
Results have shown the effectiveness of all the proposed algorithms presented in this thesis in separating the mixed signals in a single channel, and they have outperformed other available methods. Universiti Teknikal Malaysia Melaka (UTeM), Ministry of Higher Education of Malaysia
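
The scale-invariance property mentioned in this abstract can be seen from the standard scalar Itakura-Saito divergence. The block below is a sketch using the textbook definition for positive scalars x and y; the specific family of divergences developed in the thesis is not given in the abstract, so this form is an assumption for illustration.

```latex
d_{\mathrm{IS}}(x \mid y) \;=\; \frac{x}{y} \;-\; \ln\frac{x}{y} \;-\; 1,
\qquad
d_{\mathrm{IS}}(\lambda x \mid \lambda y)
  \;=\; \frac{\lambda x}{\lambda y} \;-\; \ln\frac{\lambda x}{\lambda y} \;-\; 1
  \;=\; d_{\mathrm{IS}}(x \mid y)
\quad \text{for all } \lambda > 0
```

Because only the ratio x/y enters the cost, low-energy and high-energy spectrogram bins are penalised on the same relative footing, which is the usual motivation for this divergence with audio spectra.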

Newcastle University eTheses

Gold/graphene fractals as tunable plasmonic devices

Author: PUTHIYA PURAYIL NIKHIL SANTH
Publication venue: Università degli studi di Genova
Publication date: 01/08/2018

Graphene, an atomically thin sheet of carbon atoms arranged in a honeycomb geometry, is attracting considerable attention thanks to its extraordinary mechanical, electrical and optical properties. This thesis work concerns the realization of graphene-based nanoscale devices for novel plasmonic applications. We focus mainly on gold/graphene (Au/G) structures designed to display plasmonic multiresonances in the visible range thanks to the nanostructure geometry based on the Sierpinski carpet (SC) deterministic fractal.
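
For readers unfamiliar with the Sierpinski carpet geometry, the following Python sketch generates the pattern on a grid. It shows only the mathematical construction; the function name `sierpinski_carpet` is hypothetical, and how filled and removed cells map onto gold and graphene regions in the actual devices is not stated in this abstract.

```python
def sierpinski_carpet(order):
    """Return a (3**order x 3**order) grid of booleans.

    True marks a cell kept by the construction, False marks a cell removed
    because it lies in the central third of a square at some scale.
    """
    size = 3 ** order

    def kept(row, col):
        # A cell is removed if, at any scale, both base-3 digits equal 1.
        while row > 0 or col > 0:
            if row % 3 == 1 and col % 3 == 1:
                return False
            row //= 3
            col //= 3
        return True

    return [[kept(r, c) for c in range(size)] for r in range(size)]


if __name__ == "__main__":
    # Render an order-2 carpet as text: '#' for kept cells, ' ' for holes.
    for row in sierpinski_carpet(2):
        print("".join("#" if cell else " " for cell in row))
```

Each iteration keeps 8 of every 9 sub-squares, so an order-n carpet has 8**n kept cells and features at n different length scales, which is the kind of multi-scale structure a multiresonant design relies on.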

Archivio istituzionale della ricerca - Università di Genova

Extremely high data-rate, reliable network systems research

Author: Foudriat E. C.
Maly Kurt J.
Mukkamala R.
Murray Nicholas D.
Overstreet C. Michael

Significant progress was made over the year in the four focus areas of this research group: gigabit protocols, extensions of metropolitan protocols, parallel protocols, and distributed simulations. Two activities, a network management tool and the Carrier Sensed Multiple Access Collision Detection (CSMA/CD) protocol, have developed to the point that a patent application is planned for the next year; a tool set for distributed simulation using the language SIMSCRIPT also has commercial potential and is to be further refined. The year's results for each of these areas are summarized and next year's activities are described.

NASA Technical Reports Server