Search CORE

14,404 research outputs found

Robust Speech Detection for Noisy Environments

Author: Hernández Luis A.
San Segundo Hernández Rubén
Varela Serrano Oscar
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2011
Field of study

This paper presents a robust voice activity detector (VAD) based on hidden Markov models (HMM) to improve speech recognition systems in stationary and non-stationary noise environments: inside motor vehicles (like cars or planes) or inside buildings close to high traffic places (like in a control tower for air traffic control (ATC)). In these environments, there is a high stationary noise level caused by vehicle motors and additionally, there could be people speaking at certain distance from the main speaker producing non-stationary noise. The VAD presented in this paper is characterized by a new front-end and a noise level adaptation process that increases significantly the VAD robustness for different signal to noise ratios (SNRs). The feature vector used by the VAD includes the most relevant Mel Frequency Cepstral Coefficients (MFCC), normalized log energy and delta log energy. The proposed VAD has been evaluated and compared to other well-known VADs using three databases containing different noise conditions: speech in clean environments (SNRs mayor que 20 dB), speech recorded in stationary noise environments (inside or close to motor vehicles), and finally, speech in non stationary environments (including noise from bars, television and far-field speakers). In the three cases, the detection error obtained with the proposed VAD is the lowest for all SNRs compared to Acero¿s VAD (reference of this work) and other well-known VADs like AMR, AURORA or G729 annex b

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM

Improvement of Text Dependent Speaker Identification System Using Neuro-Genetic Hybrid Algorithm in Office Environmental Conditions

Author: Islam Md. Rabiul
Rahman Md. Fayzur
Publication venue: International Journal of Computer Science Issues, IJCSI
Publication date: 01/08/2009
Field of study

In this paper, an improved strategy for automated text dependent speaker identification system has been proposed in noisy environment. The identification process incorporates the Neuro-Genetic hybrid algorithm with cepstral based features. To remove the background noise from the source utterance, wiener filter has been used. Different speech pre-processing techniques such as start-end point detection algorithm, pre-emphasis filtering, frame blocking and windowing have been used to process the speech utterances. RCC, MFCC, ?MFCC, ??MFCC, LPC and LPCC have been used to extract the features. After feature extraction of the speech, Neuro-Genetic hybrid algorithm has been used in the learning and identification purposes. Features are extracted by using different techniques to optimize the performance of the identification. According to the VALID speech database, the highest speaker identification rate of 100.000% for studio environment and 82.33% for office environmental conditions have been achieved in the close set text dependent speaker identification system

arXiv.org e-Print Archive

CogPrints Cognitive Sciences Eprint Archive

Beating the reaction limits of biosensor sensitivity with dynamic tracking of single binding events

Author: Sevenler Derin
Trueb Jacob
Unlu M. Selim
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 12/09/2018
Field of study

The clinical need for ultrasensitive molecular analysis has motivated the development of several endpoint-assay technologies capable of single-molecule readout. These endpoint assays are now primarily limited by the affinity and specificity of the molecular-recognition agents for the analyte of interest. In contrast, a kinetic assay with single-molecule readout could distinguish between low-abundance, high-affinity (specific analyte) and high-abundance, low-affinity (nonspecific background) binding by measuring the duration of individual binding events at equilibrium. Here, we describe such a kinetic assay, in which individual binding events are detected and monitored during sample incubation. This method uses plasmonic gold nanorods and interferometric reflectance imaging to detect thousands of individual binding events across a multiplex solid-phase sensor with a large area approaching that of leading bead-based endpoint-assay technologies. A dynamic tracking procedure is used to measure the duration of each event. From this, the total rates of binding and debinding as well as the distribution of binding-event durations are determined. We observe a limit of detection of 19 fM for a proof-of-concept synthetic DNA analyte in a 12-plex assay format.First author draf

arXiv.org e-Print Archive

Boston University Institutional Repository (OpenBU)

LiDAR mapping of tidal marshes for ecogeomorphological modelling in the TIDE project

Author: Belluco E
Feola A
Ferrari S
Katzenbeisser R
Lohani B
Marani M
Mason David Cecil
Menenti M
Paterson D.M.
Scott T.R.
Vardy S
Wang C
Wang H-J.
Publication venue
Publication date: 01/01/2005
Field of study

The European research project TIDE (Tidal Inlets Dynamics and Environment) is developing and validating coupled models describing the morphological, biological and ecological evolution of tidal environments. The interactions between the physical and biological processes occurring in these regions requires that the system be studied as a whole rather than as separate parts. Extensive use of remote sensing including LiDAR is being made to provide validation data for the modelling. This paper describes the different uses of LiDAR within the project and their relevance to the TIDE science objectives. LiDAR data have been acquired from three different environments, the Venice Lagoon in Italy, Morecambe Bay in England, and the Eden estuary in Scotland. LiDAR accuracy at each site has been evaluated using ground reference data acquired with differential GPS. A semi-automatic technique has been developed to extract tidal channel networks from LiDAR data either used alone or fused with aerial photography. While the resulting networks may require some correction, the procedure does allow network extraction over large areas using objective criteria and reduces fieldwork requirements. The networks extracted may subsequently be used in geomorphological analyses, for example to describe the drainage patterns induced by networks and to examine the rate of change of networks. Estimation of the heights of the low and sparse vegetation on marshes is being investigated by analysis of the statistical distribution of the measured LiDAR heights. Species having different mean heights may be separated using the first-order moments of the height distribution

Central Archive at the University of Reading

Sub-millimeter nuclear medical imaging with high sensitivity in positron emission tomography using beta-gamma coincidences

Author: Habs D.
Lang C.
Parodi K.
Thirolf P. G.
Publication venue: 'IOP Publishing'
Publication date: 13/01/2014
Field of study

We present a nuclear medical imaging technique, employing triple-gamma trajectory intersections from beta^+ - gamma coincidences, able to reach sub-millimeter spatial resolution in 3 dimensions with a reduced requirement of reconstructed intersections per voxel compared to a conventional PET reconstruction analysis. This '

\gamma

-PET' technique draws on specific beta^+ - decaying isotopes, simultaneously emitting an additional photon. Exploiting the triple coincidence between the positron annihilation and the third photon, it is possible to separate the reconstructed 'true' events from background. In order to characterize this technique, Monte-Carlo simulations and image reconstructions have been performed. The achievable spatial resolution has been found to reach ca. 0.4 mm (FWHM) in each direction for the visualization of a 22Na point source. Only 40 intersections are sufficient for a reliable sub-millimeter image reconstruction of a point source embedded in a scattering volume of water inside a voxel volume of about 1 mm^3 ('high-resolution mode'). Moreover, starting with an injected activity of 400 MBq for ^76Br, the same number of only about 40 reconstructed intersections are needed in case of a larger voxel volume of 2 x 2 x 3~mm^3 ('high-sensitivity mode'). Requiring such a low number of reconstructed events significantly reduces the required acquisition time for image reconstruction (in the above case to about 140 s) and thus may open up the perspective for a quasi real-time imaging.Comment: 17 pages, 5 figutes, 3 table

arXiv.org e-Print Archive

MPG.PuRe