2,568,160 research outputs found

    Speech Function and Speech Role in Carl Fredricksen's Dialogue on Up Movie

    Full text link
    One aim of this article is to show through a concrete example how speech function and speech role used in movie. The illustrative example is taken from the dialogue of Up movie. Central to the analysis proper form of dialogue on Up movie that contain of speech function and speech role; i.e. statement, offer, question, command, giving, and demanding. 269 dialogue were interpreted by actor, and it was found that the use of speech function and speech role

    Sampling-based speech parameter generation using moment-matching networks

    Full text link
    This paper presents sampling-based speech parameter generation using moment-matching networks for Deep Neural Network (DNN)-based speech synthesis. Although people never produce exactly the same speech even if we try to express the same linguistic and para-linguistic information, typical statistical speech synthesis produces completely the same speech, i.e., there is no inter-utterance variation in synthetic speech. To give synthetic speech natural inter-utterance variation, this paper builds DNN acoustic models that make it possible to randomly sample speech parameters. The DNNs are trained so that they make the moments of generated speech parameters close to those of natural speech parameters. Since the variation of speech parameters is compressed into a low-dimensional simple prior noise vector, our algorithm has lower computation cost than direct sampling of speech parameters. As the first step towards generating synthetic speech that has natural inter-utterance variation, this paper investigates whether or not the proposed sampling-based generation deteriorates synthetic speech quality. In evaluation, we compare speech quality of conventional maximum likelihood-based generation and proposed sampling-based generation. The result demonstrates the proposed generation causes no degradation in speech quality.Comment: Submitted to INTERSPEECH 201

    Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

    Get PDF
    In this paper we introduce a new cepstral coefficient extraction method based on an intelligibility measure for speech in noise, the Glimpse Proportion measure. This new method aims to increase the intelligibility of speech in noise by modifying the clean speech, and has applications in scenarios such as public announcement and car navigation systems. We first explain how the Glimpse Proportion measure operates and further show how we approximated it to integrate it into an existing spectral envelope parameter extraction method commonly used in the HMM-based speech synthesis framework. We then demonstrate how this new method changes the modelled spectrum according to the characteristics of the noise and show results for a listening test with vocoded and HMM-based synthetic speech. The test indicates that the proposed method can significantly improve intelligibility of synthetic speech in speech shaped noise. Index Terms — cepstral coefficient extraction, objective measure for speech intelligibility, Lombard speech, HMM-based speech synthesis 1

    Anonymity In Cyberspace: Judicial and Legislative Regulations

    Get PDF
    Historically, the scope of constitutional protections for fundamental rights has evolved to keep pace with new social norms and new technology. Internet speech is on the rise. The First Amendment protects an individual’s right to speak anonymously, but to what extent does it protect a right to anonymous online speech? This question is difficult because the government must balance the fundamental nature of speech rights with the potential dangers associated with anonymous online speech, including defamation, invasion of privacy, and intentional infliction of emotional distress. While lower courts have held that there is a right to anonymous online speech, they have not yet adopted a common standard. Meanwhile, to simplify the confusion and protect the rights of those who are injured by anonymous online speech, state legislatures are seeking to restrict some or all anonymous online-speech rights. This Note explores the history of speech regulation, with a special focus on the history of anonymous online speech, and the justifications for protecting speech rights. It then discusses the judicial standards under which courts require disclosure of anonymous speakers and the current legislative proposals to restrict speech rights. Next, this Note suggests that legislatures should not restrict speech rights, and should instead expand the remedies available to those injured by harmful speech. This Note also suggests that courts should adopt a summary judgment standard that requires plaintiffs to provide evidence demonstrating that the anonymous speaker has committed a tort before requiring the speaker to disclose his or her identity

    Mixture Density Networks, Human Articulatory Data and Acoustic-to-Articulatory Inversion of Continuous Speech

    Get PDF
    Researchers have been investigating methods for retrieving the articulation underlying an acoustic speech signal for more than three decades. A successful method would find many applications, for example: low bit-rate speech coding, helping individuals with speech and hearing disorders by providing visual feedback during speech training, and the possibility of improved automatic speech recognition

    Private Speech as Social Action

    Get PDF
    An important theoretical construct within the Vygotskian sociocultural approach to second language learning is private speech. Within a conversation-analytic framework, an agnostic stance is taken in this paper toward the possible intrapsychological function(s) of private speech in order to (1) illustrate how private speech can be identified within the details of talk-in-interaction and (2) how private speech can be understood as social action. It is argued that attention to the details of how private speech is produced is important in order to show how private speech has been identified as such; that viewing private speech as social action allows for a more emic perspective; and that, at least within interaction, private speech is social not just in origin, but each time that it is produced
    corecore