22 research outputs found
Composing first species counterpoint with a variable neighbourhood search algorithm
In this article, a variable neighbourhood search (VNS) algorithm is developed that can generate musical fragments consisting of a melody for the cantus firmus and the first species counterpoint. The objective function of the algorithm is based on a quantification of existing rules for counterpoint. The VNS algorithm developed in this article is a local search algorithm that starts from a randomly generated melody and improves it by changing one or two notes at a time. A thorough parametric analysis of the VNS reveals the significance of the algorithm's parameters on the quality of the composed fragment, as well as their optimal settings. A comparison of the VNS algorithm with a developed genetic algorithm shows that the VNS is more efficient. The VNS algorithm has been implemented in a user-friendly software environment for composition, called Optimuse. Optimuse allows a user to specify a number of characteristics such as length, key and mode. Based on this information, Optimuse 'composes' both cantus firmus and first species counterpoint. Alternatively, the user may specify a cantus firmus, and let Optimuse compose the accompanying first species counterpoint. © 2012 Taylor & Francis
Multi-Channel Speech Enhancement and Amplitude Modulation Analysis for Noise Robust Automatic Speech Recognition
The paper describes a system for automatic speech recognition (ASR) that is benchmarked with data of the 3rd CHiME challenge, a dataset comprising distant microphone recordings of noisy acoustic scenes in public environments. The proposed ASR system employs various methods to increase recognition accuracy and noise robustness. Two different multi-channel speech enhancement techniques are used to eliminate interfering sounds in the audio stream. One speech enhancement method aims at separating the target speaker's voice from background sources based on non-negative matrix factorization (NMF) using variational Bayesian (VB) inference to estimate NMF parameters. The second technique is based on a time-varying minimum variance distortionless response (MVDR) beamformer that uses spatial information to suppress sound signals not arriving from a desired direction. Prior to speech enhancement, a microphone channel failure detector is applied that is based on cross-comparing cha nnels using a modulation-spectral representation of the speech signal. ASR feature extraction employs the amplitude modulation filter bank (AMFB) that implicates prior information of speech to analyze its temporal dynamics. AMFBs outperform the commonly used frame splicing technique of filter bank features in conjunction with a deep neural network (DNN) based ASR system, which denotes an equivalent data-driven approach to extract modulation-spectral information. In addition, features are speaker adapted, a recurrent neural network (RNN) is employed for language modeling, and hypotheses of different ASR systems are combined to further enhance the recognition accuracy. The proposed ASR system achieves an absolute word error rate (WER) of 5.67% on the real evaluation test data, which is 0.16% lower compared to the best score reported within the 3rd CHiME challenge
Physics-Based And Spike-Guided Tools For Sound Design
In this paper we present graphical tools and parameters search
algorithms for the timbre space exploration and design of complex
sounds generated by physical modeling synthesis. The tools
are built around a sparse representation of sounds based on Gammatone
functions and provide the designer with both a graphical
and an auditory insight. The auditory representation of a number
of reference sounds, located as landmarks in a 2D sound design
space, provides the designer with an effective aid to direct his
search for new sounds. The sonic landmarks can either be synthetic
sounds chosen by the user or be automatically derived by
using clever parameter search and clustering algorithms. The proposed
probabilistic method in this paper makes use of the sparse
representations to model the distance between sparsely represented
sounds. A subsequent optimization model minimizes those distances
to estimate the optimal parameters, which generate the landmark
sounds on the given auditory landscape
Auditory Representations as Landmarks in the Sound Design Space
A graphical tool for the timbre space exploration and interactive
design of complex sounds by physical modeling synthesis
is presented. It is built around an auditory representation
of sounds based on spike functions and provides the
designer with both a graphical and an auditory insight. The
auditory representation of a number of reference sounds, located
as landmarks in a 2D sound design space, provides the
designer with an effective aid to direct his search along the
paths that lie in the proximity of the most inspiring landmarks
Ectasia and Severe Atherosclerosis: Relationships with Chlamydia pneumoniae, Helicobacter pylori, and Inflammatory Markers
To date, there has been no convincing evidence for an association between Chlamydia pneumoniae or Helicobacter pylori and ectasia. In this case-control study, we have investigated the association of H. pylori and C. pneumoniae seropositivity with ectasia, severe coronary atherosclerosis, and normal vessels, which were so classified by coronary angiography. We have also evaluated the influence of these infections on inflammatory markers such as high-sensitive C-reactive protein (hsCRP) and interleukin 6 (IL-6). Of the 796 patients undergoing coronary angiography for suspected ischemic heart disease, 244 patients were recruited. Of these, 91 had normal vessels, 88 had 3 or more obstructed vessels, and 65 had ectatic vessels without atherosclerosis. Eighty-seven atherosclerotic patients (98.9%) were positive for C. pneumoniae IgG, as were 64 ectatic patients (98.5%) and 76 controls (83.5%) (P < 0.001). Forty-two atherosclerotic patients (47.7%) were positive for C. pneumoniae IgM, as were 43 ectatic patients (66.2%) and 43 controls (47.3%) (P = 0.036). Seventy-two atherosclerotic patients (81.8%) were positive for H. pylori IgA, as were 26 ectatic patients (40.0%) and 44 controls (48.4%) (P < 0.001). High-sensitive CRP levels were significantly higher in ectatic patients (5.639 mg/L) than in controls (4.390 mg/L) (P = 0.032), and IL-6 levels were significantly higher in atherosclerotic patients (33.92 U/L) than in controls (14.01 U/L) (P < 0.001). Interleukin-6 levels were higher in H. pylori seropositive patients, and hsCRP levels were higher in C. pneumoniae seropositive patients, when compared with seronegatives. We suggest that, as in atherosclerosis, C. pneumoniae infection is related to ectasia, with raised CRP levels