5,529 research outputs found
Spectral analysis for nonstationary audio
A new approach for the analysis of nonstationary signals is proposed, with a
focus on audio applications. Following earlier contributions, nonstationarity
is modeled via stationarity-breaking operators acting on Gaussian stationary
random signals. The focus is on time warping and amplitude modulation, and an
approximate maximum-likelihood approach based on suitable approximations in the
wavelet transform domain is developed. This paper provides theoretical analysis
of the approximations, and introduces JEFAS, a corresponding estimation
algorithm. The latter is tested and validated on synthetic as well as real
audio signal.Comment: IEEE/ACM Transactions on Audio, Speech and Language Processing,
Institute of Electrical and Electronics Engineers, In pres
Encoding and processing of sensory information in neuronal spike trains
Recently, a statistical signal-processing technique has allowed the information carried by single spike trains of sensory neurons on time-varying stimuli to be characterized quantitatively in a variety of preparations. In weakly electric fish, its application to first-order sensory neurons encoding electric field amplitude (P-receptor afferents) showed that they convey accurate information on temporal modulations in a behaviorally relevant frequency range (<80 Hz). At the next stage of the electrosensory pathway (the electrosensory lateral line lobe, ELL), the information sampled by first-order neurons is used to extract upstrokes and downstrokes in the amplitude modulation waveform. By using signal-detection techniques, we determined that these temporal features are explicitly represented by short spike bursts of second-order neurons (ELL pyramidal cells). Our results suggest that the biophysical mechanism underlying this computation is of dendritic origin. We also investigated the accuracy with which upstrokes and downstrokes are encoded across two of the three somatotopic body maps of the ELL (centromedial and lateral). Pyramidal cells of the centromedial map, in particular I-cells, encode up- and downstrokes more reliably than those of the lateral map. This result correlates well with the significance of these temporal features for a particular behavior (the jamming avoidance response) as assessed by lesion experiments of the centromedial map
Modeling and frequency tracking of marine mammal whistle calls
Submitted in partial fulfillment of the requirements for the degree of Master of Science at the Massachusetts Institute of Technology and the Woods Hole Oceanographic Institution February 2009Marine mammal whistle calls present an attractive medium for covert underwater
communications. High quality models of the whistle calls are needed in order to synthesize
natural-sounding whistles with embedded information. Since the whistle calls
are composed of frequency modulated harmonic tones, they are best modeled as a
weighted superposition of harmonically related sinusoids. Previous research with bottlenose
dolphin whistle calls has produced synthetic whistles that sound too “clean”
for use in a covert communications system. Due to the sensitivity of the human auditory
system, watermarking schemes that slightly modify the fundamental frequency
contour have good potential for producing natural-sounding whistles embedded with
retrievable watermarks. Structured total least squares is used with linear prediction
analysis to track the time-varying fundamental frequency and harmonic amplitude
contours throughout a whistle call. Simulation and experimental results demonstrate
the capability to accurately model bottlenose dolphin whistle calls and retrieve embedded
information from watermarked synthetic whistle calls. Different fundamental
frequency watermarking schemes are proposed based on their ability to produce natural
sounding synthetic whistles and yield suitable watermark detection and retrieval
- …