7 research outputs found

Two-Channel Speech Enhancement and Implementation Considerations: Noise Reduction and Speech Quality

Authors: Hammer Martin; Kjeldsen Thomas Lynge; Kjærgaard Jacob Barsøe
Publication venue: Department of Electronic Systems
Publication date: 01/01/2007

Rake, Peel, Sketch: The Signal Processing Pipeline Revisited

Author: Scheibler Robin
Publication venue: Lausanne, EPFL
Publication date: 02/05/2017

The prototypical signal processing pipeline can be divided into four blocks: representation of the signal in a basis suitable for processing; enhancement of the meaningful part of the signal and noise reduction; estimation of important statistical properties of the signal; and adaptive processing to track and adapt to changes in the signal statistics. This thesis revisits each of these blocks and proposes new algorithms, borrowing ideas from information theory, theoretical computer science, or communications. First, we revisit the Walsh-Hadamard transform (WHT) for the case of a signal sparse in the transformed domain, namely one that has only K ≪ N non-zero coefficients. We show that an efficient algorithm exists that can compute these coefficients in O(K log2(K) log2(N/K)) time using only O(K log2(N/K)) samples. This algorithm relies on a fast hashing procedure that computes small linear combinations of transformed domain coefficients. A bipartite graph is formed with linear combinations on one side, and non-zero coefficients on the other. A peeling decoder is then used to recover the non-zero coefficients one by one. A detailed analysis of the algorithm based on error correcting codes over the binary erasure channel is given. The second chapter is about beamforming. Inspired by the rake receiver from wireless communications, we recognize that echoes in a room are an important source of extra signal diversity. We extend several classic beamforming algorithms to take advantage of echoes and also propose new optimal formulations. We explore formulations in both the time and frequency domains. We show theoretically and in numerical simulations that the signal-to-interference-and-noise ratio increases proportionally to the number of echoes used. Finally, beyond objective measures, we show that echoes also directly improve speech intelligibility as measured by the perceptual evaluation of speech quality (PESQ) metric.
Next, we attack the problem of direction of arrival of acoustic sources, to which we apply a robust finite rate of innovation reconstruction framework. FRIDA, the resulting algorithm, exploits wideband information coherently, works at very low signal-to-noise ratio, and can resolve very close sources. The algorithm can use either raw microphone signals or their cross-correlations. While the former lets us work with correlated sources, the latter creates a quadratic number of measurements that allows locating many sources with few microphones. Thorough experiments on simulated and recorded data show that FRIDA compares favorably with the state-of-the-art. We continue by revisiting the classic recursive least squares (RLS) adaptive filter with ideas borrowed from recent results on sketching least squares problems. The exact update of RLS is replaced by a few steps of conjugate gradient descent. We then propose two different preconditioners, obtained by sketching the data, to accelerate the convergence of the gradient descent. Experiments on artificial as well as natural signals show that the proposed algorithm has a performance very close to that of RLS at a lower computational burden. The fifth and final chapter is dedicated to the software and hardware tools developed for this thesis. We describe the pyroomacoustics Python package that contains routines for the evaluation of audio processing algorithms and reference implementations of popular algorithms. We then give an overview of the microphone arrays developed

Infoscience - École polytechnique fédérale de Lausanne

The Telecommunications and Data Acquisition Report

Author: Posner E. C.

This publication, one of a series formerly titled The Deep Space Network (DSN) Progress Report, documents DSN progress in flight project support, tracking and data acquisition research and technology, network engineering, hardware and software implementation, and operations. In addition, developments in Earth-based radio technology as applied to geodynamics, astrophysics, and the radio search for extraterrestrial intelligence are reported.

NASA Technical Reports Server

Speech Recognition

Publication venue: 'IntechOpen'
Publication date: 20/04/2021

Chapters in the first part of the book cover all the essential speech processing techniques for building robust, automatic speech recognition systems: the representation of speech signals and the methods for speech-feature extraction, acoustic and language modeling, efficient algorithms for searching the hypothesis space, and multimodal approaches to speech recognition. The last part of the book is devoted to other speech processing applications that can use the information from automatic speech recognition: speaker identification and tracking, prosody modeling in emotion-detection systems, and other applications that are able to operate in real-world environments, like mobile communication services and smart homes.

Directory of Open Access Books (DOAB)

Projections et distances discrètes

Author: Normand Nicolas
Publication venue: HAL CCSD
Publication date: 29/11/2012

This work lies in the field of discrete geometry. Discrete tomography is approached through its links with information theory, illustrated by the application of the Mojette transform and the "Finite Radon Transform" to the redundant coding of information for transmission and distributed storage. Discrete distances are presented from both a theoretical viewpoint (with a new class of distances constructed from paths with variable weights) and an algorithmic viewpoint (distance transform, medial axis, granulometry), in particular through single-scan ("streaming") image methods. The link with non-decreasing integer sequences and the Lambek-Moser inverse is highlighted.

Thèses en Ligne

Nonconvex optimization for improved exploitation of gradient sparsity in CT image reconstruction

Authors: Chartrand Rick; Jørgensen Jakob Sauer; Pan Xiaochuan; Sidky Emil Y.
Publication venue: University of Southern California
Publication date: 01/01/2013

Online Research Database In Technology

Abstracts on Radio Direction Finding (1899 - 1995)

Publication date: 01/01/2018

The files on this record represent the various databases that originally composed the CD-ROM issue of the "Abstracts on Radio Direction Finding" database, which is now part of the Dudley Knox Library's Abstracts and Selected Full Text Documents on Radio Direction Finding (1899 - 1995) Collection. (See Calhoun record https://calhoun.nps.edu/handle/10945/57364 for further information on this collection and the bibliography). Because technological obsolescence prevents current and future audiences from accessing the bibliography, DKL exported the various databases contained in the CD-ROM and converted them into the three files on this record. The contents of these files are: 1) RDFA_CompleteBibliography_xls.zip [RDFA_CompleteBibliography.xls: Metadata for the complete bibliography, in Excel 97-2003 Workbook format; RDFA_Glossary.xls: Glossary of terms, in Excel 97-2003 Workbook format; RDFA_Biographies.xls: Biographies of leading figures, in Excel 97-2003 Workbook format]; 2) RDFA_CompleteBibliography_csv.zip [RDFA_CompleteBibliography.TXT: Metadata for the complete bibliography, in CSV format; RDFA_Glossary.TXT: Glossary of terms, in CSV format; RDFA_Biographies.TXT: Biographies of leading figures, in CSV format]; 3) RDFA_CompleteBibliography.pdf: A human readable display of the bibliographic data, as a means of double-checking any possible deviations due to conversion.

Calhoun, Institutional Archive of the Naval Postgraduate School