
    The Microsoft 2017 Conversational Speech Recognition System

    We describe the 2017 version of Microsoft's conversational speech recognition system, in which we update our 2016 system with recent developments in neural-network-based acoustic and language modeling to further advance the state of the art on the Switchboard speech recognition task. The system adds a CNN-BLSTM acoustic model to the set of model architectures we combined previously, and includes character-based and dialog-session-aware LSTM language models in rescoring. For system combination we adopt a two-stage approach, whereby subsets of acoustic models are first combined at the senone/frame level, followed by word-level voting via confusion networks. We also added a confusion network rescoring step after system combination. The resulting system yields a 5.1% word error rate on the 2000 Switchboard evaluation set.

    Coded Modulation Assisted Radial Basis Function Aided Turbo Equalisation for Dispersive Rayleigh Fading Channels

    In this contribution a range of Coded Modulation (CM) assisted Radial Basis Function (RBF) based Turbo Equalisation (TEQ) schemes are investigated when communicating over dispersive Rayleigh fading channels. Specifically, 16QAM based Trellis Coded Modulation (TCM), Turbo TCM (TTCM), Bit-Interleaved Coded Modulation (BICM) and iteratively decoded BICM (BICM-ID) are evaluated in the context of an RBF based TEQ scheme and a reduced-complexity RBF based In-phase/Quadrature-phase (I/Q) TEQ scheme. The Least Mean Square (LMS) algorithm was employed for channel estimation, where the initial step-size was 0.05, which was reduced to 0.01 for the second and subsequent TEQ iterations. The achievable coding gain of the various CM schemes was significantly increased when employing the proposed RBF-TEQ or RBF-I/Q-TEQ rather than the conventional non-iterative Decision Feedback Equaliser (DFE). Explicitly, the reduced-complexity RBF-I/Q-TEQ-CM achieved a similar performance to the full-complexity RBF-TEQ-CM, while attaining a significant complexity reduction. The best overall performer was the RBF-I/Q-TEQ-TTCM scheme, requiring only 1.88 dB higher SNR at BER = 10^-5 than the identical-throughput 3 BPS uncoded 8PSK scheme communicating over an AWGN channel. The coding gain of the scheme was 16.78 dB.
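
The step-size schedule mentioned in the abstract above (0.05 on the first pass, 0.01 afterwards) can be illustrated with a minimal LMS channel-estimation sketch. This is not the authors' equaliser: the 3-tap real-valued channel, the BPSK symbols, the noiseless link and the names `lms_channel_estimate` and `simulate` are all assumptions made for illustration, and each "pass" here simply reuses known symbols rather than decoder feedback.

```python
import random


def lms_channel_estimate(symbols, received, num_taps, step_sizes):
    """Estimate channel taps with LMS, making one pass over the data per step size."""
    h_est = [0.0] * num_taps
    for mu in step_sizes:  # e.g. 0.05 for the first pass, then 0.01
        for n in range(num_taps - 1, len(symbols)):
            # Regressor: the current symbol and the num_taps - 1 symbols before it.
            x = [symbols[n - k] for k in range(num_taps)]
            y_hat = sum(h * s for h, s in zip(h_est, x))
            err = received[n] - y_hat
            # Standard LMS update: move each tap along the instantaneous gradient.
            h_est = [h + mu * err * s for h, s in zip(h_est, x)]
    return h_est


def simulate(num_symbols=200, seed=0):
    """Run the sketch on a hypothetical noiseless 3-tap channel (assumed values)."""
    rng = random.Random(seed)
    h_true = [0.8, 0.5, 0.3]  # hypothetical channel, not taken from the paper
    symbols = [rng.choice((-1.0, 1.0)) for _ in range(num_symbols)]
    received = [
        sum(h_true[k] * symbols[n - k] for k in range(len(h_true)) if n - k >= 0)
        for n in range(num_symbols)
    ]
    h_est = lms_channel_estimate(symbols, received, len(h_true), [0.05, 0.01, 0.01])
    return h_true, h_est
```

A larger first step size gives fast initial acquisition; the smaller value on later passes would reduce steady-state misadjustment once noise is present.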

    Transform domain filtering in incremental and diffusion strategies over distributed networks

    We analyse incremental and diffusion co-operative schemes in which nodes share information with neighbouring nodes in order to estimate a desired parameter of interest locally in the presence of noise. Each node works as an adaptive filter with its own learning ability. In the incremental co-operative scheme, a node takes information from the previous node and, after local estimation, sends it to the next node, whereas in the diffusion scheme the input is taken from several nodes, so that the behaviour of the distributed network is observed after each iteration. We employ the LMS structure for updating the estimates. The convergence performance and computational complexity of the LMS filter are important considerations from the point of view of speed and cost. The convergence performance of a filter depends on the eigenvalue spread of the covariance matrix of the input data; in other words, the convergence speed is inversely related to the eigenvalue spread. If the input data is de-correlated the eigenvalue spread is small, and if the input data is correlated the eigenvalue spread is large. A transform-domain filter exploits the data de-correlation properties of transforms such as the DCT and DFT. The de-correlation achieved by a unitary transform depends on the orthogonality of that transform. Hence we obtain improved convergence performance by applying the transform to the input data, followed by power normalization. If the input data is fully de-correlated, the covariance matrix of the input data is proportional to the identity matrix.
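
The transform-then-normalize idea in the abstract above can be sketched for a single node as follows. This is a minimal illustration, not the paper's distributed algorithm: the orthonormal DCT-II, the recursive power estimate with forgetting factor `beta`, the AR(1) test input and the names `dct_matrix`, `transform_domain_lms` and `demo` are assumptions.

```python
import math
import random


def dct_matrix(m):
    """Return the orthonormal DCT-II matrix T (m x m), so that T^T T = I."""
    rows = []
    for k in range(m):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / m)
        rows.append([scale * math.cos(math.pi * (2 * n + 1) * k / (2 * m))
                     for n in range(m)])
    return rows


def transform_domain_lms(x, d, m, mu=0.05, beta=0.9, eps=1e-8):
    """Single-node LMS on DCT-transformed, power-normalized regressors."""
    T = dct_matrix(m)
    w = [0.0] * m          # weights in the transform domain
    power = [1.0] * m      # running power estimate per transform bin
    for n in range(m - 1, len(x)):
        taps = [x[n - i] for i in range(m)]
        u = [sum(T[k][i] * taps[i] for i in range(m)) for k in range(m)]
        power = [beta * p + (1 - beta) * uk * uk for p, uk in zip(power, u)]
        err = d[n] - sum(wk * uk for wk, uk in zip(w, u))
        # Dividing by the bin power equalizes the convergence rate across bins.
        w = [wk + mu * err * uk / (p + eps) for wk, uk, p in zip(w, u, power)]
    # Map the weights back to the time domain: w_time = T^T w.
    return [sum(T[k][i] * w[k] for k in range(m)) for i in range(m)]


def demo(num_samples=5000, seed=0):
    """Identify a hypothetical 4-tap system driven by correlated AR(1) input."""
    rng = random.Random(seed)
    w_true = [1.0, -0.5, 0.25, 0.1]  # assumed system, for illustration only
    x, prev = [], 0.0
    for _ in range(num_samples):
        prev = 0.9 * prev + rng.gauss(0.0, 1.0)  # strongly correlated input
        x.append(prev)
    m = len(w_true)
    d = [sum(w_true[i] * x[n - i] for i in range(m) if n - i >= 0)
         for n in range(num_samples)]
    return w_true, transform_domain_lms(x, d, m)
```

Because the DCT is orthonormal, the time-domain solution is recovered exactly by applying the transpose; the normalization step is what compensates for the unequal power in each transform bin.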

    Distributed and Collaborative Processing of Audio Signals: Algorithms, Tools and Applications

    Thesis by compendium. This thesis fits into the field of Information and Communications Technology (ICT), especially in the area of digital signal processing. Nowadays, due to the rise of the Internet of Things (IoT), there is a growing interest in wireless sensor networks (WSN), that is, networks composed of different types of devices specifically distributed in some area to perform different signal processing tasks. These devices, also referred to as nodes, are usually equipped with electroacoustic transducers as well as powerful and efficient processors with communication capability. In the particular case of acoustic sensor networks (ASN), nodes are dedicated to solving different acoustic signal processing tasks. These audio signal processing applications have undergone major development in recent years, due in part to the advances made in computer hardware and software. The development of powerful centralized processing systems has allowed the number of audio channels to be increased, the control area to be extended or more complex algorithms to be implemented. In most cases, a distributed ASN topology can be desirable due to several factors, such as the limited number of channels used by the sound acquisition and reproduction devices, the convenience of a scalable system or the high computational demands of centralized systems. All these aspects may lead to the use of novel distributed signal processing techniques for application over ASNs. To this end, one of the main contributions of this dissertation is the development of adaptive filtering algorithms for multichannel sound systems over distributed networks. Note that, for sound field control (SFC) applications, such as active noise control (ANC) or active noise equalization (ANE), acoustic nodes must be equipped not only with sensors but also with actuators in order to control and modify the sound field. However, most of the adaptive distributed network approaches used to solve sound field control problems do not take into account that the nodes may interfere with or modify the behaviour of the rest. Therefore, another important contribution of this thesis is the analysis of how the acoustic system affects the behaviour of the nodes within an ASN. In cases where the acoustic environment adversely affects the system stability, several distributed strategies have been proposed for solving the acoustic interference problem with the aim of stabilizing ANC systems. These strategies are based on both collaborative and non-collaborative approaches. Implementation aspects such as hardware constraints, sensor locations, convergence rate and computational and communication burden have also been considered in the design of the distributed algorithms. Moreover, with the aim of creating independent-zone equalization profiles in the presence of multi-tonal noise, distributed narrowband and broadband ANE algorithms over an ASN with collaborative learning and composed of acoustic nodes have been presented. Experimental results are presented to validate the use of the proposed distributed algorithms in practical applications. For this purpose, acoustic simulation software has been specifically designed to analyze the performance of the developed algorithms. Finally, the performance of the proposed distributed algorithms for multichannel SFC applications has been evaluated by means of a practical implementation. To this end, a real-time prototype that controls both ANC and ANE applications by using collaborative acoustic nodes has been developed. The prototype consists of two personal audio control (PAC) systems, each composed of a car seat and an acoustic node, which is equipped with two loudspeakers, two microphones and a processor with communication capability. In this way, it is possible to create two independent noise control zones, improving the acoustic comfort of the user without the use of headphones.
    Antoñanzas Manuel, C. (2019). Distributed and Collaborative Processing of Audio Signals: Algorithms, Tools and Applications [Unpublished doctoral thesis]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/130209

    Block Least-Mean Squares Algorithm Over Distributed Wireless Sensor Network

    In a distributed parameter estimation problem, during each sampling instant, a typical sensor node communicates its estimate either by the diffusion algorithm or by the incremental algorithm. Both of these conventional distributed algorithms involve significant communication overheads and, consequently, defeat the basic purpose of wireless sensor networks. In the present paper, we therefore propose two new distributed algorithms, namely block diffusion least mean square (BDLMS) and block incremental least mean square (BILMS), by extending the concept of block adaptive filtering techniques to the distributed adaptation scenario. A performance analysis of the proposed BDLMS and BILMS algorithms has been carried out, and their performance is found to be similar to that of the conventional diffusion LMS and incremental LMS algorithms, respectively. The convergence behaviour of the proposed algorithms obtained from the simulation study is also found to be in agreement with the theoretical analysis. The remarkable and interesting aspect of the proposed block-based algorithms is that their communication overheads per node and latencies are less than those of the conventional algorithms by a factor as high as the block size used in the algorithms.
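
The saving described in the abstract above comes from updating (and hence exchanging) an estimate once per block rather than once per sample. A minimal single-node sketch of block LMS is given below; it is not BDLMS or BILMS, and the gradient averaging over the block, the white Gaussian regressors and the names `block_lms` and `demo` are assumptions. The returned update count stands in for the number of transmissions a node would make.

```python
import random


def block_lms(x_rows, d, block_size, mu=0.2):
    """Block LMS: accumulate the gradient over a block, then update once."""
    m = len(x_rows[0])
    w = [0.0] * m
    updates = 0
    for start in range(0, len(d) - block_size + 1, block_size):
        grad = [0.0] * m
        for n in range(start, start + block_size):
            # The weights are held fixed while the block is processed.
            err = d[n] - sum(wi * xi for wi, xi in zip(w, x_rows[n]))
            grad = [g + err * xi for g, xi in zip(grad, x_rows[n])]
        # One averaged update per block (assumed normalization by block size).
        w = [wi + mu * g / block_size for wi, g in zip(w, grad)]
        updates += 1
    return w, updates


def demo(num_samples=4000, block_size=8, seed=0):
    """Noiseless identification of a hypothetical 3-parameter vector."""
    rng = random.Random(seed)
    w_true = [0.5, -1.0, 2.0]  # assumed parameter vector, for illustration only
    x_rows = [[rng.gauss(0.0, 1.0) for _ in w_true] for _ in range(num_samples)]
    d = [sum(wi * xi for wi, xi in zip(w_true, row)) for row in x_rows]
    w_est, updates = block_lms(x_rows, d, block_size)
    return w_true, w_est, updates
```

With 4000 samples and a block size of 8, the estimate is updated 500 times instead of 4000, which is the factor-of-block-size reduction the abstract refers to.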

    Convergence-Optimal Quantizer Design of Distributed Contraction-based Iterative Algorithms with Quantized Message Passing

    In this paper, we study the convergence behavior of distributed iterative algorithms with quantized message passing. We first introduce general iterative function evaluation algorithms for solving fixed-point problems distributively. We then analyze the convergence of the distributed algorithms, e.g. the Jacobi scheme and the Gauss-Seidel scheme, under quantized message passing. Based on the closed-form convergence performance derived, we propose two quantizer designs, namely the time-invariant convergence-optimal quantizer (TICOQ) and the time-varying convergence-optimal quantizer (TVCOQ), to minimize the effect of the quantization error on the convergence. We also study the tradeoff between the convergence error and the message passing overhead for both TICOQ and TVCOQ. As an example, we apply the TICOQ and TVCOQ designs to the iterative waterfilling algorithm of the MIMO interference game.
    Comment: 17 pages, 9 figures, Transaction on Signal Processing, accepted
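
The effect of quantized messages on a contraction-based iteration can be seen in a small Jacobi-style sketch. It uses a plain uniform quantizer, not the TICOQ or TVCOQ designs from the paper, and the linear map `x = A x + b`, the matrix values and the names `quantize`, `quantized_jacobi` and `demo` are assumptions. For a map whose rows have absolute sums of at most `alpha < 1`, the steady-state error is bounded by `alpha * step / (2 * (1 - alpha))`, since each message is off by at most half a quantization step.

```python
def quantize(value, step):
    """Uniform quantizer: round to the nearest multiple of step."""
    return step * round(value / step)


def quantized_jacobi(A, b, step, iterations=60):
    """Jacobi-style iteration x <- A q(x) + b, where nodes exchange quantized values.

    A step of 0 disables quantization and gives the exact iteration.
    """
    x = [0.0] * len(b)
    for _ in range(iterations):
        # Every node broadcasts a quantized copy of its current value.
        msgs = [quantize(v, step) for v in x] if step > 0 else list(x)
        x = [sum(a * m for a, m in zip(row, msgs)) + bi for row, bi in zip(A, b)]
    return x


def demo(step=0.1):
    """Compare quantized and exact fixed points for a hypothetical 3-node system."""
    A = [[0.0, 0.3, 0.2],
         [0.25, 0.0, 0.25],
         [0.1, 0.4, 0.0]]  # assumed values; every absolute row sum is 0.5
    b = [1.0, -2.0, 0.5]
    alpha = max(sum(abs(a) for a in row) for row in A)
    x_exact = quantized_jacobi(A, b, 0, iterations=200)
    x_quant = quantized_jacobi(A, b, step)
    err = max(abs(p - q) for p, q in zip(x_exact, x_quant))
    bound = alpha * step / (2 * (1 - alpha))
    return err, bound
```

A finer quantizer tightens the bound but costs more bits per message, which is the convergence-error versus overhead tradeoff the abstract mentions.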