2,348 research outputs found

    Electoral system reviews in New Zealand, Britain and Canada: a critical comparison

    Get PDF
    This article compares the use of people outside government to consider electoral reform in three countries using the single-member plurality electoral system. The composition of electoral reform bodies, ranging from commissions of experts (New Zealand) and ex- politicians (Britain) to assemblies of randomly selected citizens (British Columbia), appears to have influenced how well their recommendations were received by the public. Governments should be careful not to assume that they can retain control of the electoral reform process once they let it out of their hands, as the cases of New Zealand and British Columbia show, where majorities of the voters chose reform

    Reconstructing intelligible audio speech from visual speech features

    Get PDF
    This work describes an investigation into the feasibility of producing intelligible audio speech from only visual speech fea- tures. The proposed method aims to estimate a spectral enve- lope from visual features which is then combined with an arti- ficial excitation signal and used within a model of speech pro- duction to reconstruct an audio signal. Different combinations of audio and visual features are considered, along with both a statistical method of estimation and a deep neural network. The intelligibility of the reconstructed audio speech is measured by human listeners, and then compared to the intelligibility of the video signal only and when combined with the reconstructed audio

    Voicing classification of visual speech using convolutional neural networks

    Get PDF
    The application of neural network and convolutional neural net- work (CNN) architectures is explored for the tasks of voicing classification (classifying frames as being either non-speech, unvoiced, or voiced) and voice activity detection (VAD) of vi- sual speech. Experiments are conducted for both speaker de- pendent and speaker independent scenarios. A Gaussian mixture model (GMM) baseline system is de- veloped using standard image-based two-dimensional discrete cosine transform (2D-DCT) visual speech features, achieving speaker dependent accuracies of 79% and 94%, for voicing classification and VAD respectively. Additionally, a single- layer neural network system trained using the same visual fea- tures achieves accuracies of 86 % and 97 %. A novel technique using convolutional neural networks for visual speech feature extraction and classification is presented. The voicing classifi- cation and VAD results using the system are further improved to 88 % and 98 % respectively. The speaker independent results show the neural network system to outperform both the GMM and CNN systems, achiev- ing accuracies of 63 % for voicing classification, and 79 % for voice activity detection

    Objective measures for predicting the intelligibility of spectrally smoothed speech with artificial excitation

    Get PDF
    A study is presented on how well objective measures of speech quality and intelligibility can predict the subjective in- telligibility of speech that has undergone spectral envelope smoothing and simplification of its excitation. Speech modi- fications are made by resynthesising speech that has been spec- trally smoothed. Objective measures are applied to the mod- ified speech and include measures of speech quality, signal- to-noise ratio and intelligibility, as well as proposing the nor- malised frequency-weighted spectral distortion (NFD) measure. The measures are compared to subjective intelligibility scores where it is found that several have high correlation (|r| ≥ 0.7), with NFD achieving the highest correlation (r = −0.81

    Fintech: how and is it being taught in academia?

    Get PDF
    Merriam Webster defines Fintech as “products and companies that employ newly developed digital and online technologies in the banking and financial services industries” (Merriam-Webster). Fintech, short for financial technology, is not new; however, it has become a hot topic as of recently. During the pandemic, Fintech and the many apps and programs that make up financial technology began to surge. Applications that made applying for loans and moving money easily between friends and businesses were of utmost importance during this time. Instant financial results and access are only continuing in demand. The days of going into a brick-and-mortar building and sitting with a person for a lengthy period to discuss your finances are starting to become outdated. However, even with the demand for Fintech and its abilities to streamline monetary transactions, the trust in the security of these applications is still low. Cumulatively, there has been over $7 billion stolen in the crypto space, so it is also clear that the cybersecurity of crypto and public trust in adoption is also an important topic (Thapa, 2022). Our data collection comprises of a survey that is sent out to the faculty at various schools to gather data about the current state of how Fintech is being taught at universities. In addition, we examine a plethora of academic journals to gather information about what is being taught, what should be taught and how the current state of Fintech in academia can be improved

    Generating intelligible audio speech from visual speech

    Get PDF
    This work is concerned with generating intelligible audio speech from a video of a person talking. Regression and classification methods are proposed first to estimate static spectral envelope features from active appearance model (AAM) visual features. Two further methods are then developed to incorporate temporal information into the prediction - a feature-level method using multiple frames and a model-level method based on recurrent neural networks. Speech excitation information is not available from the visual signal, so methods to artificially generate aperiodicity and fundamental frequency are developed. These are combined within the STRAIGHT vocoder to produce a speech signal. The various systems are optimised through objective tests before applying subjective intelligibility tests that determine a word accuracy of 85% from a set of human listeners on the GRID audio-visual speech database. This compares favourably with a previous regression-based system that serves as a baseline which achieved a word accuracy of 33%
    corecore