320 research outputs found

    Perception of attributes in real and synthetic string instrument sounds

    Get PDF
    This thesis explores the perceptual features of natural and synthetic string instrument sounds. The contributions are in formal listening experiments on a variety of features in musical sounds that have not been studied in detail previously. The effects of inharmonicity on timbre and pitch have been measured. The results indicate that the implementation of inharmonicity is not always necessary. The timbre effect is more salient in natural instruments, but for high tones a pitch difference may also be detected. Guidelines were given for compensation of the pitch effect. A perceptual study of the decaying parameters showed that large deviations from the reference value are tolerated perceptually. The studies on the audibility of initial pitch glides and dual-polarization effects provides practical knowledge that helps in the implementation of these features in digital sound synthesis. Related to expression rather than basic string behavior, the study on perception-based control of the vibrato parameters has a sligthly different background. However, all of the studied features are more or less player-controlled by different ways of plucking the string or pressing the key. The main objective of the thesis is to find answers to current problems in digital sound synthesis, such as parameter quantization. Another aim is to gain more general understanding of how we perceive musical sounds.reviewe

    Acoustic features of piano sounds

    No full text
    To date efforts of music transcription indicate the need for modelling the data signal in a more comprehensive manner in order to improve the transcription process of music performances. This research work is concerned with the investigation of two features associated with the reproduced sound of a piano; the inharmonicity factor of the piano strings and the double decay rate of the resulting sound. Firstly, a simple model of the inharmonicity is proposed and the factors that affect the modelled signal are identified, such as the magnitude of the inharmonicity, the number of harmonics, the time parameter, the phase characteristics and the harmonic amplitudes. A formation of a socalled “one-sided” effect appears in simulated signals, although this effect is obscured in real recordings potentially due to the non-uniformly varying amplitudes of the harmonic terms. This effect is also discussed through the use of the cepstrum by analysing real piano note recordings and synthesized signals. The cepstrum is further used to describe the effect of the coupled behaviour of two strings through digital waveguides. Secondly, the double decay rate effect is modelled through coupled oscillators and digital waveguides. A physical model of multiple strings is also presented as an extension to the simple model of coupled oscillators and various measurements on a real grand piano are carried out in order to investigate the coupling mechanism between the strings, the soundboard and the bridge. Finally, a model, with reduced dimensionality, is proposed to represent the signal model for single and multiple notes formulated around a Bayesian framework. The potential of such a model is illustrated with the transcription of simple examples of real monophonic and polyphonic piano recordings by implementing the Metropolis-Hastings algorithm and Gibbs sampler for multivariate parameter estimation

    The Temperament Police: The Truth, the Ground Truth, and Nothing but the Truth

    Get PDF
    The tuning system of a keyboard instrument is chosen so that frequently used musical intervals sound as consonant as possible. Temperament refers to the compromise arising from the fact that not all intervals can be maximally consonant simultaneously. Recent work showed that it is possible to estimate temperament from audio recordings with no prior knowledge of the musical score, using a conservative (high precision, low recall) automatic transcription algorithm followed by frequency estimation using quadratic interpolation and bias correction from the log magnitude spectrum. In this paper we develop a harpsichord-specific transcription system to analyse over 500 recordings of solo harpsichord music for which the temperament is specified on the CD sleeve notes. We compare the measured temperaments with the annotations and discuss the differences between temperament as a theoretical construct and as a practical issue for professional performers and tuners. The implications are that ground truth is not always scientific truth, and that content-based analysis has an important role in the study of historical performance practice. 1

    On inharmonicity in bass guitar strings with application to tapered and lumped constructions

    Get PDF
    In this study, the inharmonicity of bass guitar strings with and without areas of lowered and raised mass near the saddle is studied. Using a very high sample rate of over 900 kHz enabled finite difference time domain simulation to be applied for strings that simultaneously have nonzero stiffness and linear density which varies along the length of the string. Results are compared to experiments on specially constructed strings. Perturbation theory is demonstrated to be sufficiently accurate (and much more computationally efficient) for practical design purposes in reducing inharmonicity. The subject of inharmonicity is well known in pianos but has not been studied extensively in bass guitar strings. Here, the inharmonicity is found to be low in the lowest (open string) pitch on the five string bass guitar (B0) given typical standard construction. Conversely, the inharmonicity is high (around 100 cents at the 10th partial) when that string is sounded when stopped at the 12th fret and very high (around 100 cents at the 6th partial) when that string is stopped at the 21st fret. Bass guitar strings were constructed with three different constructions (standard, tapered and lumped) in order to demonstrate how incorporating a lump of raised mass near the saddle can achieve close to zero inharmonicity for the lower frequency partials. This also has potential in terms of improving the use of higher fret numbers for musical harmony (reducing beating) and also in controlling pitch glide that has, with some exceptions, often been attributed solely to nonlinear behaviour.Publisher PDFPeer reviewe

    DDSP-Piano: A Neural Sound Synthesizer Informed by Instrument Knowledge

    Get PDF
    Instrument sound synthesis using deep neural networks has received numerous improvements over the last couple of years. Among them, the Differentiable Digital Signal Processing (DDSP) framework has modernized the spectral modeling paradigm by including signal-based synthesizers and effects into fully differentiable architectures. The present work extends the applications of DDSP to the task of polyphonic sound synthesis, with the proposal of a differentiable piano synthesizer conditioned on MIDI inputs. The model architecture is motivated by high-level acoustic modeling knowledge of the instrument, which, along with the sound structure priors inherent to the DDSP components, makes for a lightweight, interpretable, and realistic-sounding piano model. A subjective listening test has revealed that the proposed approach achieves better sound quality than a state-of-the-art neural-based piano synthesizer, but physical-modeling-based models still hold the best quality. Leveraging its interpretability and modularity, a qualitative analysis of the model behavior was also conducted: it highlights where additional modeling knowledge and optimization procedures could be inserted in order to improve the synthesis quality and the manipulation of sound properties. Eventually, the proposed differentiable synthesizer can be further used with other deep learning models for alternative musical tasks handling polyphonic audio and symbolic data

    Inharmonicity in the natural mode frequencies of overwound strings

    Get PDF

    Audibility of the timbral effects of inharmonicity in stringed instrument tones

    Get PDF
    Abstract: Listening tests were conducted to find the audibility of inharmonicity in musical sounds produced by stringed instruments, such as the piano or the guitar. The audibility threshold of inharmonicity was measured at five fundamental frequencies. Results show that the detection of inharmonicity is strongly dependent on the fundamental frequency f 0 . A simple model is presented for estimating the threshold as a function of f 0 . The need to implement inharmonicity in digital sound synthesis is discussed

    Physical Models for Fast Estimation of Guitar String, Fret and Plucking Position

    Get PDF

    Physically Informed Subtraction of a String's Resonances from Monophonic, Discretely Attacked Tones : a Phase Vocoder Approach

    Get PDF
    A method for the subtraction of a string's oscillations from monophonic, plucked- or hit-string tones is presented. The remainder of the subtraction is the response of the instrument's body to the excitation, and potentially other sources, such as faint vibrations of other strings, background noises or recording artifacts. In some respects, this method is similar to a stochastic-deterministic decomposition based on Sinusoidal Modeling Synthesis [MQ86, IS87]. However, our method targets string partials expressly, according to a physical model of the string's vibrations described in this thesis. Also, the method sits on a Phase Vocoder scheme. This approach has the essential advantage that the subtraction of the partials can take place \instantly", on a frame-by-frame basis, avoiding the necessity of tracking the partials and therefore availing of the possibility of a real-time implementation. The subtraction takes place in the frequency domain, and a method is presented whereby the computational cost of this process can be reduced through the reduction of a partial's frequency-domain data to its main lobe. In each frame of the Phase Vocoder, the string is encoded as a set of partials, completely described by four constants of frequency, phase, magnitude and exponential decay. These parameters are obtained with a novel method, the Complex Exponential Phase Magnitude Evolution (CSPME), which is a generalisation of the CSPE [SG06] to signals with exponential envelopes and which surpasses the nite resolution of the Discrete Fourier Transform. The encoding obtained is an intuitive representation of the string, suitable to musical processing
    • …
    corecore