85 research outputs found
Gain optimization for cochlear implant systems
Cochlear implant systems need Automatic Gain Control (AGC) to compress the large dynamic range (~120 dB) of the acoustic environment into the small dynamic range (< 20 dB) of electrical stimulation. This thesis is concerned with the design, implementation and evaluation of AGC systems for cochlear implants. It investigated the effects of AGC on the speech intelligibility of cochlear implant recipients. Various AGC configurations were evaluated with sentences presented over a wide range of levels at different Signal-to-Noise Ratios (SNR) to identify important factors affecting the performance. Signal metrics were developed to quantify the effects of AGC on the channel envelopes. The goal was to improve speech intelligibility in adverse listening conditions.
The performance-intensity functions of cochlear implant recipients with no AGC and with a front-end compression limiter were measured in noise. With no AGC, the proportion of envelope clipping grew monotonically with presentation level. The front-end limiter substantially reduced envelope clipping yet gave little improvement in speech intelligibility. The recipients were highly tolerant of envelope clipping when the background noise was low. SNR degradation was identified as the main factor reducing speech intelligibility.
A front-end limiter cannot guarantee zero envelope clipping. In contrast, the proposed envelope profile limiter eliminated envelope clipping and hence preserved the spectral profile. The two AGCs were evaluated, with two release times (75 and 625 ms). The shorter release time gave worse speech intelligibility because it caused more waveform distortion and output SNR reduction. For a given release time, preserving spectral envelope profile gave additional benefits. In a take-home experiment, cochlear implant recipients rated a program with the envelope profile limiter equivalent to their everyday program.
A conventional cochlear implant signal path uses a predetermined input dynamic range, which is shifted up or down by the AGC. In contrast, the proposed Adaptive Loudness Growth Function (ALGF) continually optimized the input dynamic range by estimating the noise floor and peak level in each channel. The ALGF gave better Speech Reception Threshold (SRT) than the existing state-of-the-art AGC system at the high presentation level when evaluated with a newly developed roving-level SRT test at three presentation levels
Investigation of Consumer Grade EEG as a Fall Risk Assessment Tool
Fall prevention for geriatric populations is a growing concern among clinicians and researchers due to severe risk of morbidity and loss of independence. Emerging evidence has demonstrated that cognitive workload while walking influences gait stability and the risk of falling. Electroencephalography (EEG) presents a potential method to provide objective measures, via Theta (4-7 Hz) and Alpha (8-13 Hz) frequency band powers, of cognitive workload during daily activities. Consumer grade EEG headsets increase the accessibility of EEG signal measurement through lowered cost and easier setup protocols. The following thesis
presents a series of studies investigating the sensitivity of the Interaxon Muse headband, and the Emotiv Epoc+ to measure cognitive load changes under ambulatory conditions (i.e., while walking). While the Muse yielded no sensitivity to changes in neural activity associated with changes in cognitive load levels, the Emotiv Epoc+ yielded high sensitivity to cognitive load changes across all 14 electrodes. Further research concerning this thesis centered around the use of the Emotiv Epoc+ system to distinguish levels of cognitive load under ambulatory conditions.
To examine the impact of motion artifact on EEG signals measured by the Emotiv Epoc+, a swim cap paradigm was used to isolate noise associated with gait. Signal to noise ratio (SNR) estimates indicate that EEG signals are 8 to 20 times the power of gait-induced noise, supporting the Emotiv Epoc+ during ambulatory monitoring conditions. Applying time and spectral system identification techniques, the relationship between motion-induced artifacts and recorded inertial measurement unit (IMU) yielded a strong nonlinear response. The final study of this thesis evaluated the utility of the Emotiv Epoc+ to measure 3 levels of cognitive load using a working memory paradigm while walking on a treadmill. A quadratic support vector machine (SVM) classifier was able to classify three levels of cognitive load at an accuracy of 70.3 %. These promising initial results, coupled with the short measurement time (10 sec), support the long-term goal of assessing cognitive load in an ambulatory environment towards implementation in fall risk assessment systems
FPGA implementations for parallel multidimensional filtering algorithms
PhD ThesisOne and multi dimensional raw data collections introduce noise and artifacts, which need to be recovered from degradations by an automated filtering system before, further machine analysis. The need for automating wide-ranged filtering applications necessitates the design of generic filtering architectures, together with the development of multidimensional and extensive convolution operators. Consequently, the aim of this thesis is to investigate the problem of automated construction of a generic parallel filtering system. Serving this goal, performance-efficient FPGA implementation architectures are developed to realize parallel one/multi-dimensional filtering algorithms. The proposed generic architectures provide a mechanism for fast FPGA prototyping of high performance computations to obtain efficiently implemented performance indices of area, speed, dynamic power, throughput and computation rates, as a complete package. These parallel filtering algorithms and their automated generic architectures tackle the major bottlenecks and limitations of existing multiprocessor systems in wordlength, input data segmentation, boundary conditions as well as inter-processor communications, in order to support high data throughput real-time applications of low-power architectures using a Xilinx Virtex-6 FPGA board.
For one-dimensional raw signal filtering case, mathematical model and architectural development of the generalized parallel 1-D filtering algorithms are presented using the 1-D block filtering method. Five generic architectures are implemented on a Virtex-6 ML605 board, evaluated and compared. A complete set of results on area, speed, power, throughput and computation rates are obtained and discussed as performance indices for the 1-D convolution architectures. A successful application of parallel 1-D cross-correlation is demonstrated.
For two dimensional greyscale/colour image processing cases, new parallel 2-D/3-D filtering algorithms are presented and mathematically modelled using input decimation and output image reconstruction by interpolation. Ten generic architectures are implemented on the Virtex-6 ML605 board, evaluated and compared. Key results on area, speed, power, throughput and computation rate are obtained and discussed as performance indices for the 2-D convolution architectures. 2-D image reconfigurable processors are developed and implemented using single, dual and quad MAC FIR units. 3-D Colour image processors are devised to act as 3-D colour filtering engines. A 2-D cross-correlator parallel engine is successfully developed as a parallel 2-D matched filtering algorithm for locating any MRI slice within a MRI data stack library. Twelve 3-D MRI filtering operators are plugged in and adapted to be suitable for biomedical imaging, including 3-D edge operators and 3-D noise smoothing operators.
Since three dimensional greyscale/colour volumetric image applications are computationally intensive, a new parallel 3-D/4-D filtering algorithm is presented and mathematically modelled using volumetric data image segmentation by decimation and output reconstruction by interpolation, after simultaneously and independently performing 3-D filtering. Eight generic architectures are developed and implemented on the Virtex-6 board, including 3-D spatial and FFT convolution architectures. Fourteen 3-D MRI filtering operators are plugged and adapted for this particular biomedical imaging application, including 3-D edge operators and 3-D noise smoothing operators. Three successful applications are presented in 4-D colour MRI (fMRI) filtering processors, k-space MRI volume data filter and 3-D cross-correlator.IRAQI Government
Design and Implementation of Complexity Reduced Digital Signal Processors for Low Power Biomedical Applications
Wearable health monitoring systems can provide remote care with supervised, inde-pendent living which are capable of signal sensing, acquisition, local processing and transmission. A generic biopotential signal (such as Electrocardiogram (ECG), and Electroencephalogram (EEG)) processing platform consists of four main functional components. The signals acquired by the electrodes are amplified and preconditioned by the (1) Analog-Front-End (AFE) which are then digitized via the (2) Analog-to-Digital Converter (ADC) for further processing. The local digital signal processing is usually handled by a custom designed (3) Digital Signal Processor (DSP) which is responsible for either anyone or combination of signal processing algorithms such as noise detection, noise/artefact removal, feature extraction, classification and compres-sion. The digitally processed data is then transmitted via the (4) transmitter which is renown as the most power hungry block in the complete platform. All the afore-mentioned components of the wearable systems are required to be designed and fitted into an integrated system where the area and the power requirements are stringent. Therefore, hardware complexity and power dissipation of each functional component are crucial aspects while designing and implementing a wearable monitoring platform. The work undertaken focuses on reducing the hardware complexity of a biosignal DSP and presents low hardware complexity solutions that can be employed in the aforemen-tioned wearable platforms.
A typical state-of-the-art system utilizes Sigma Delta (Σ∆) ADCs incorporating a Σ∆ modulator and a decimation filter whereas the state-of-the-art decimation filters employ linear phase Finite-Impulse-Response (FIR) filters with high orders that in-crease the hardware complexity [1–5]. In this thesis, the novel use of minimum phase Infinite-Impulse-Response (IIR) decimators is proposed where the hardware complexity is massively reduced compared to the conventional FIR decimators. In addition, the non-linear phase effects of these filters are also investigated since phase non-linearity may distort the time domain representation of the signal being filtered which is un-desirable effect for biopotential signals especially when the fiducial characteristics carry diagnostic importance. In the case of ECG monitoring systems the effect of the IIR filter phase non-linearity is minimal which does not affect the diagnostic accuracy of the signals.
The work undertaken also proposes two methods for reducing the hardware complexity of the popular biosignal processing tool, Discrete Wavelet Transform (DWT). General purpose multipliers are known to be hardware and power hungry in terms of the number of addition operations or their underlying building blocks like full adders or half adders required. Higher number of adders leads to an increase in the power consumption which is directly proportional to the clock frequency, supply voltage, switching activity and the resources utilized. A typical Field-Programmable-Gate-Array’s (FPGA) resources are Look-up Tables (LUTs) whereas a custom Digital Signal Processor’s (DSP) are gate-level cells of standard cell libraries that are used to build adders [6]. One of the proposed methods is the replacement of the hardware and power hungry general pur-pose multipliers and the coefficient memories with reconfigurable multiplier blocks that are composed of simple shift-add networks and multiplexers. This method substantially reduces the resource utilization as well as the power consumption of the system. The second proposed method is the design and implementation of the DWT filter banks using IIR filters which employ less number of arithmetic operations compared to the state-of-the-art FIR wavelets. This reduces the hardware complexity of the analysis filter bank of the DWT and can be employed in applications where the reconstruction is not required. However, the synthesis filter bank for the IIR wavelet transform has a higher computational complexity compared to the conventional FIR wavelet synthesis filter banks since re-indexing of the filtered data sequence is required that can only be achieved via the use of extra registers. Therefore, this led to the proposal of a novel design which replaces the complex IIR based synthesis filter banks with FIR fil-ters which are the approximations of the associated IIR filters. Finally, a comparative study is presented where the hybrid IIR/FIR and FIR/FIR wavelet filter banks are de-ployed in a typical noise reduction scenario using the wavelet thresholding techniques. It is concluded that the proposed hybrid IIR/FIR wavelet filter banks provide better denoising performance, reduced computational complexity and power consumption in comparison to their IIR/IIR and FIR/FIR counterparts
Speech Recognition
Chapters in the first part of the book cover all the essential speech processing techniques for building robust, automatic speech recognition systems: the representation for speech signals and the methods for speech-features extraction, acoustic and language modeling, efficient algorithms for searching the hypothesis space, and multimodal approaches to speech recognition. The last part of the book is devoted to other speech processing applications that can use the information from automatic speech recognition for speaker identification and tracking, for prosody modeling in emotion-detection systems and in other speech processing applications that are able to operate in real-world environments, like mobile communication services and smart homes
A computational framework for sound segregation in music signals
Tese de doutoramento. Engenharia Electrotécnica e de Computadores. Faculdade de Engenharia. Universidade do Porto. 200
Recent Advances in Signal Processing
The signal processing task is a very critical issue in the majority of new technological inventions and challenges in a variety of applications in both science and engineering fields. Classical signal processing techniques have largely worked with mathematical models that are linear, local, stationary, and Gaussian. They have always favored closed-form tractability over real-world accuracy. These constraints were imposed by the lack of powerful computing tools. During the last few decades, signal processing theories, developments, and applications have matured rapidly and now include tools from many areas of mathematics, computer science, physics, and engineering. This book is targeted primarily toward both students and researchers who want to be exposed to a wide variety of signal processing techniques and algorithms. It includes 27 chapters that can be categorized into five different areas depending on the application at hand. These five categories are ordered to address image processing, speech processing, communication systems, time-series analysis, and educational packages respectively. The book has the advantage of providing a collection of applications that are completely independent and self-contained; thus, the interested reader can choose any chapter and skip to another without losing continuity
Adaptive Feedback Cancellation in Hearing Aids
Acoustic feedback is a well-known phenomenon in hearing aids and public address systems. Under certain conditions it causes the so-called howling effect, which is highly annoying for the hearing aid user and limits the maximum amplification of the hearing aid. The most common choice to prevent howling is the adaptive feedback cancellation algorithm, which is able to completely eliminate the feedback signal. However, standard adaptive feedback cancellation algorithms suffer from a biased adaptation if the input signal is spectrally colored, as it is for speech and music signals. Due to this bias distortion artifacts (entrainment) are generated and consequently, the sound quality is significantly reduced.
Most of the known methods to reduce the bias have focused on speech signals. However, those methods do not cope with music, since the tonality and correlation are much stronger for such signals. This leads to a higher bias and consequently, to stronger entrainment for music than for speech. Other methods, which deal with music signals, work only satisfactorily when using a very slow adaptation speed. This reduces the ability to react fast to feedback path changes. Hence, howling occurs for a longer time when the feedback path is changing.
In this thesis, a new sub-band adaptive feedback cancellation system for hearing aid applications is proposed. It combines decorrelation methods with a new realization of a non-parametric variable step size.
The adaptation is realized in sub-bands which decreases the computational complexity and increases the adaptation performance of the system simultaneously.
The applied decorrelation methods, prediction error filter and frequency shift, are well known approaches to reduce the bias. However, the combination of both is first proposed in this thesis.
To apply the proposed step size in the context of adaptive feedback cancellation, a method to estimate the signal power of the desired input signal, i.e., without feedback, also referred to as source signal power is necessary. This estimate is theoretically derived and it is demonstrated that it is a reliabe estimate if the decorrelation methods are additionally applied.
In order to further improve the performance of the system three additional control methods are derived: The first one is an impulse detection to detect wideband impulses, which could lead to misadaptation. Secondly, a modified estimate of the source signal power to stabilize the system in case of jarring voices is proposed. Lastly, a correlation detection, which is applied to improve the trade-off between adaptation stability and tracking behavior, is developed.
The complete system is optimized and evaluated for several speech and music signals as well as for different feedback scenarios in simulations with feedback paths measured under realistic situations. Additionally, the system is tested by real-time simulations with hearing aid dummies and a torso and head simulator. For both simulation setups hearing loss compensation methods as applied in realistic hearing aids are used. The performance is measured in terms of being able to prevent entrainment (adaptation stability) and reacting to feedback path changes (tracking behavior).
The complete adaptive feedback cancellation system shows an excellent performance. Furthermore, the system relies only on few parameters, shows a low computational complexity, and therefore has a strong practical relevance
Robust acoustic beamforming in the presence of channel propagation uncertainties
Beamforming is a popular multichannel signal processing technique used in conjunction with microphone arrays to spatially filter a sound field. Conventional optimal beamformers assume that the propagation channels between each source and microphone pair are a deterministic function of the source and microphone geometry. However in real acoustic environments, there are several mechanisms that give rise to unpredictable variations in the phase and amplitudes of the propagation channels. In the presence of these uncertainties the performance of beamformers degrade. Robust beamformers are designed to reduce this performance degradation. However, robust beamformers rely on tuning parameters that are not closely related to the array geometry.
By modeling the uncertainty in the acoustic channels explicitly we can derive more accurate expressions for the source-microphone channel variability. As such we are able to derive beamformers that are well suited to the application of acoustics in realistic environments. Through experiments we validate the acoustic channel models and through simulations we show the performance gains of the associated robust beamformer.
Furthermore, by modeling the speech short time Fourier transform coefficients we are able to design a beamformer framework in the power domain. By utilising spectral subtraction we are able to see performance benefits over ideal conventional beamformers. Including the channel uncertainties models into the weights design improves robustness.Open Acces
- …