Search CORE

231 research outputs found

End-to-End Classification of Reverberant Rooms using DNNs

Author: Evers Christine
Naylor Patrick A.
Papayiannis Constantinos
Publication venue
Publication date: 21/12/2019
Field of study

Reverberation is present in our workplaces, our homes and even in places designed as auditoria, such as concert halls and theatres. This work investigates how deep learning can use the effect of reverberation on speech to classify a recording in terms of the room in which it was recorded in. Approaches previously taken in the literature for the task relied on handpicked acoustic parameters as features used by classifiers. Estimating the values of these parameters from reverberant speech involves estimation errors, inevitably impacting the classification accuracy. This paper shows how DNNs can perform the classification in an end-to-end fashion, therefore by operating directly on reverberant speech. Based on the above, a method for the training of generalisable DNN classifiers and a DNN architecture for the task are proposed. A study is also made on the relationship between feature-maps derived by DNNs and acoustic parameters that describe known properties of reverberation. In the experiments shown, AIRs are used that were measured in 7 real rooms. The classification accuracy of DNNs is compared between the case of having access to the AIRs and the case of having access only to the reverberant speech recorded in the same rooms. The experiments show that with access to the AIRs a DNN achieves an accuracy of 99.1% and with access only to reverberant speech, the proposed DNN achieves an accuracy of 86.9%. The experiments replicate the testing procedure used in previous work, which relied on handpicked acoustic parameters, allowing the direct evaluation of the benefit of using deep learning.Comment: Submitted to IEEE/ACM Transactions on Audio, Speech, and Language Processin

arXiv.org e-Print Archive

Southampton (e-Prints Soton)

Spiral - Imperial College Digital Repository

Balloon Atrial Septostomy as Initial Therapy in Pediatric Pulmonary Hypertension

Author: Cash Michelle
Critser Paul J.
Evers Patrick D.
Hirsch Russel
McGovern Eimear
Publication venue: UKnowledge
Publication date: 01/10/2020
Field of study

Balloon atrial septostomy is a palliative procedure currently used to bridge medically refractory pulmonary hypertension patients to lung transplantation. In the current report, we present balloon atrial septostomy as an initial therapy for high-risk pediatric pulmonary hypertension patients at our institution. Nineteen patients with median age of 4.3 years (range 0.1-14.3 years) underwent balloon atrial septostomy during initial admission for pulmonary hypertension. There were no procedural complications or deaths within 24 h of balloon atrial septostomy. Patients were followed for a median of 2.6 years (interquartile range 1.0-4.8 years). Three (16%) patients died, 3 (16%) underwent lung transplantation, and 1 (5%) underwent reverse Potts shunt. Transplant-free survival at 30 days, 1 year, and 3 years was 84%, 76%, and 67% respectively. This single-center experience suggests early-BAS in addition to pharmacotherapy is safe and warrants consideration in high-risk pediatric pulmonary hypertension patients

University of Kentucky

The mission of the James Blair Historical Review is to publish the College of William and Mary’s best undergraduate history research papers, and thereby showcase the talent of the College’s history students and the strength of her Department of History. The Historical Review seeks to provide a professional platform through which students can explore historically significant themes and issues.The Field of Cloth of Gold: Henry VIII’s Display of Princely Magnificence -Ami Limoncelli Sacrifice and Salvation: Religious Drama in Colonial Mexico -Andrew DiAntonio Insurrections and Independence: How the Gunpowder Incident Thrust British and Afro-Virginians into the American Revolution -Nicole Lidstrom “Black as an Indian and Dirty as a Pig” The Unexpected Perseverance of Female Hospital Workers during America’s Civil War -Anna Storm Australian Aboriginal Rights The 1967 Referendum -Lisa Keppl

DSpace at The College of William and Mary

Signal compaction using polynomial EVD for spherical array processing with applications

Author: Evers Christine
Naylor Patrick A.
Neo Vincent W.
Weiss Stephan
Publication venue
Publication date: 17/08/2023
Field of study

Multi-channel signals captured by spatially separated sensors often contain a high level of data redundancy. A compact signal representation enables more efficient storage and processing, which has been exploited for data compression, noise reduction, and speech and image coding. This paper focuses on the compact representation of speech signals acquired by spherical microphone arrays. A polynomial matrix eigenvalue decomposition (PEVD) can spatially decorrelate signals over a range of time lags and is known to achieve optimum multi-channel data compaction. However, the complexity of PEVD algorithms scales at best cubically with the number of channel signals, e.g., the number of microphones comprised in a spherical array used for processing. In contrast, the spherical harmonic transform (SHT) provides a compact spatial representation of the 3-dimensional sound field measured by spherical microphone arrays, referred to as eigenbeam signals, at a cost that rises only quadratically with the number of microphones. Yet, the SHT’s spatially orthogonal basis functions cannot completely decorrelate sound field components over a range of time lags. In this work, we propose to exploit the compact representation offered by the SHT to reduce the number of channels used for subsequent PEVD processing. In the proposed framework for signal representation, we show that the diagonality factor improves by up to 7 dB over the microphone signal representation with a significantly lower computation cost. Moreover, when applying this framework to speech enhancement and source separation, the proposed method improves metrics known as short-time objective intelligibility (STOI) and source-to-distortion ratio (SDR) by up to 0.2 and 20 dB, respectively

University of Strathclyde Institutional Repository

Spiral - Imperial College Digital Repository

Riau Vegetables for Singapore Consumers: A Collaborative Knowledge-Transfer Project across the Straits of Malacca

Author: Chua Sin Bin
EVERS Hans-Dieter
LOH Patrick H. M.
MENKHOFF Thomas
Publication venue
Publication date: 01/06/2007
Field of study

Institutional Knowledge at Singapore Management University

Design of a Robot-Based Emotion-Mirroring Game to Engage Autistic Children with Emotional Expressions

Author: Ainger Eloise
Alcorn Alyssa M.
Babovic Snezana
Charisi Vasiliki
Chevalier Pauline
Davison Daniel Patrick
Evers Vanessa
Heylen Dirk K.J.
Li Jamy Jue
Pellicano Elizabeth
Petrovic Suncica
Reidsma Dennis
Schadenberg Bob Rinse
Shen Jie
Publication venue
Publication date: 11/05/2018
Field of study

University of Twente Research Information

Semiconductor Bloch-equations formalism: Derivation and application to high-harmonic generation from Dirac fermions

Author: Crewse Jack
Evers Ferdinand
Groessing Patrick
Nitsch Maximilian
Schmid Christoph P.
Seith Adrian
Weigl Leonhard
Wilhelm Jan
Publication venue: 'American Physical Society (APS)'
Publication date: 18/03/2021
Field of study

We rederive the semiconductor Bloch equations emphasizing the close link to the Berry connection. Our rigorous derivation reveals the existence of two further contributions to the current, in addition to the frequently considered intraband and polarization-related interband terms. The extra contributions become sizable in situations with strong dephasing or when the dipole-matrix elements are strongly wave-number dependent. We apply the formalism to high-harmonic generation for a Dirac metal. The extra terms add to the frequency-dependent emission intensity (high-harmonic spectrum) significantly at certain frequencies changing the total signal up to a factor of 10

arXiv.org e-Print Archive

University of Regensburg Publication Server