138 research outputs found

    A Novel Efficient Algorithm for Voice Gender Conversion

    Get PDF
    Realistic Voice Gender Conversion (VGC) requires independent scaling of the glottal (pitch) and vocal tract (formant) related features of the input speech signal. We present a VGC algorithm which has two novel features. Firstly, an efficient frequency scaling algorithm is presented. Secondly, we use this to scale all frequencies in the input signal by the desired formant scaling factor. We then deconvolve the glottal contribution using a standard linear predictive analysis and frequency scale it further such that the desired pitch scaling factor is equal to the product of the two frequency scaling factors. Finally, we resynthesize the converted speech. The female-to-male results were excellent while the male-to-female results sounded synthetic

    Using Podcasts to support Communication Skills Development: A Case Study for Content Preferences among Postgraduate Research Students

    Get PDF
    The need for the integration of generic skills training into structured PhD programmes is widely accepted. However, effective integration of such training requires flexible delivery mechanisms which facilitate self-paced and independent learning. A video recording was made of an eminent speaker delivering a 1-h live presentation to a group of 15 first-year science and engineering PhD research students. The topic of the presentation was inter-disciplinary professional communication skills. Following the presentation, the video recording was post-processed into seven alternative podcast formats. These podcast formats included a typed transcription, a full audio recording, a full video recording, presentation slides with embedded speech etc. The choice of podcast formats was based on ease-of-production by a typical computer-literate academic and ease-of-use by a typical computer-literate student. At a subsequent session, the seven podcast formats were shown to the 15 students and a survey to assess their reactions to the various formats was carried out. The survey results (quantitative and qualitative) were analysed to provide useful insight into the student preferences in relation to podcast formats. The students expressed a clear preference for summary key-point slides with explanatory voice-over by the original speaker

    A novel approach to Acoustic Echo cancellation

    Get PDF
    In this paper a novel approach to single microphone Acoustic Echo cancellation (AEC) is presented. This approach performs AEC by employing techniques developed for monaural sound source separation. It is shown that the AEC problem can be cast in a monaural sound source separation framework and through this framework significant echo suppression can be achieved. The new approach is evaluated through experiments on simulated data

    A novel approach to Acoustic Echo cancellation

    Get PDF
    In this paper a novel approach to single microphone Acoustic Echo cancellation (AEC) is presented. This approach performs AEC by employing techniques developed for monaural sound source separation. It is shown that the AEC problem can be cast in a monaural sound source separation framework and through this framework significant echo suppression can be achieved. The new approach is evaluated through experiments on simulated data

    Multi-Channel Audio Time-Scale Modification

    Get PDF
    Phase vecoder based approaches to audio time-scale modification introduce a reverberant artefact into the time scaled output. Recent techniques have been developed to reduce the presence of this artefact; however, these techniques have the effect of introducing additional issues relating to their application to multi-channel recordings. This paper addresses these issues by collectively analysing all channels prior to time-scaling each individual channel

    Sub-band Independent Subspace Analysis for Drum Transcription

    Get PDF
    While Independent Subspace Analysis provides a means of separating sound sources from a single channel signal, making it an effective tool for drum transcription, it does have a number of problems. Not least of these is that the amount of information required to allow separation of sound sources varies from signal to signal. To overcome this indeterminacy and improve the robustness of transcription an extension of Independent Subspace Analysis to include sub-band processing is proposed. The use of this approach is demonstrated by its application in a simple drum transcription algorithm

    Comparison of Signal Reconstruction Methods for the Azimuth Discrimination and Resynthesis Algorithm

    Get PDF
    The Azimuth Discrimination and Resynthesis algorithm, (ADRess), has been shown to produce high quality sound source separation results for intensity panned stereo recordings. There are however, artifacts such as phasiness which become apparent in the separated signals under certain conditions. This is largely due to the fact that only the magnitude spectra for the separated sources are estimated. Each source is then resynthesised using the phase information obtained from the original mixture. This paper describes the nature and origin of the associated artifacts and proposes alternative techniques for resynthesising the separated signals. A comparison of each technique is then presented

    Sound Source Separation: Azimuth Discrimination and Resynthesis

    Get PDF
    In this paper we present a novel sound source separation algorithm which requires no prior knowledge, no learning, assisted or otherwise, and performs the task of separation based purely on azimuth discrimination within the stereo field. The algorithm exploits the use of the pan pot as a means to achieve image localisation within stereophonic recordings. As such, only an interaural intensity difference exists between left and right channels for a single source. We use gain scaling and phase cancellation techniques to expose frequency dependent nulls across the azimuth domain, from which source separation and resynthesis is carried out. We present results obtained from real recordings, and show that for musical recordings, the algorithm improves upon the output quality of current source separation schemes

    Multi pitch estimation by using modified IIR Comb Filters

    Get PDF
    A technique for detecting the pitches of a polyphonic signal of presented. The system utilises modified IIR comb filters, which are generated to ensure that n null (stop band notches) exists at multiples of note frequencies, and that a very flat pass band is present in the remain of the spectrum. Thus, the signal spectrum is not distorted after applying the filters 60 the audio signal, which is the case when using FIR comb filters. The presented approach improves upon an existing multi pitch detection model bared on an FIR comb filter framework

    GSE1 Postgraduate Information Literacy and Communication Skills Training-Project Orientated Delivery

    Get PDF
    Module presented by the Faculty of Science and Engineering in cooperation with the Subject Librarian from Learning Teaching and Research Development in the Library. Original format was four three hour workshops plus 18 hrs self paced course work using interdisciplinary written and verbal skills training tasks . Translate a peer reviewed Journal paper to communicate it to a interdisciplinary audience and prepare a short presentation. It was enhanced with the introduction of a peer-learning component by dividing participants into four groups of eight. A project orientated and problem based learning (POPBL) approach has been shown to work well in the delivery of educational outcomes and also in the delivery of Information Literac
    • …
    corecore