116 research outputs found

    The 2005 AMI system for the transcription of speech in meetings

    Get PDF
    In this paper we describe the 2005 AMI system for the transcription\ud of speech in meetings used for participation in the 2005 NIST\ud RT evaluations. The system was designed for participation in the speech\ud to text part of the evaluations, in particular for transcription of speech\ud recorded with multiple distant microphones and independent headset\ud microphones. System performance was tested on both conference room\ud and lecture style meetings. Although input sources are processed using\ud different front-ends, the recognition process is based on a unified system\ud architecture. The system operates in multiple passes and makes use\ud of state of the art technologies such as discriminative training, vocal\ud tract length normalisation, heteroscedastic linear discriminant analysis,\ud speaker adaptation with maximum likelihood linear regression and minimum\ud word error rate decoding. In this paper we describe the system performance\ud on the official development and test sets for the NIST RT05s\ud evaluations. The system was jointly developed in less than 10 months\ud by a multi-site team and was shown to achieve very competitive performance

    Rhetoric of direct and indirect notation of concepts: the procedure of transformation of an idea into an effect

    Get PDF
    The paper dwells on the textual ways direct and indirect references to concepts transform into particular effects. The topicality of the publication is brought about by the necessity to study the impact on the audience achieved by verbal formation of particular impressions. The goal of the article is to reveal how the direct reference to the democracy concept or its absence creates the desired effect in the inaugurals by American Presidents Obama and Biden. The paper applies the rhetorical methodology of idea-turned-impression to explain the way the concept named in the beginning of a speech transforms into a particular effect meant to influence the audience. It is found that the indirect reference to the democracy concept is implemented by the units of two groups: basic-level, indicating the people and the government, and axiological, naming freedom, equality, unity etc. The direct reference by the noun ā€˜democracyā€™ performs compositional and persuasive ā€“ existential and metaphorical ā€“ functions. The methodology of the article includes methods of analysis and synthesis, as well as special methods of knowledge, which are typical for philology

    Social Situatedness: Vygotsky and Beyond

    Get PDF
    The concept of ā€˜social situatednessā€™, i.e. the idea that the development of individual intelligence requires a social (and cultural) embedding, has recently received much attention in cognitive science and artificial intelligence research. The work of Lev Vygotsky who put forward this view already in the 1920s has influenced the discussion to some degree, but still remains far from well known. This paper therefore aims to give an overview of his cognitive development theory and discuss its relation to more recent work in primatology and socially situated artificial intelligence, in particular humanoid robotics

    How Linguistic Chickens Help Spot Spoken-Eggs: Phonological Constraints on Speech Identification

    Get PDF
    It has long been known that the identification of aural stimuli as speech is context-dependent (Remez et al., 1981). Here, we demonstrate that the discrimination of speech stimuli from their non-speech transforms is further modulated by their linguistic structure. We gauge the effect of phonological structure on discrimination across different manifestations of well-formedness in two distinct languages. One case examines the restrictions on English syllables (e.g., the well-formed melif vs. ill-formed mlif); another investigates the constraints on Hebrew stems by comparing ill-formed AAB stems (e.g., TiTuG) with well-formed ABB and ABC controls (e.g., GiTuT, MiGuS). In both cases, non-speech stimuli that conform to well-formed structures are harder to discriminate from speech than stimuli that conform to ill-formed structures. Auxiliary experiments rule out alternative acoustic explanations for this phenomenon. In English, we show that acoustic manipulations that mimic the mlifā€“melif contrast do not impair the classification of non-speech stimuli whose structure is well-formed (i.e., disyllables with phonetically short vs. long tonic vowels). Similarly, non-speech stimuli that are ill-formed in Hebrew present no difficulties to English speakers. Thus, non-speech stimuli are harder to classify only when they are well-formed in the participantsā€™ native language. We conclude that the classification of non-speech stimuli is modulated by their linguistic structure: inputs that support well-formed outputs are more readily classified as speech

    No Platforming

    Get PDF
    This paper explains how the practice of ā€˜no platformingā€™ can be reconciled with a liberal politics. While opponents say that no platforming flouts ideals of open public discourse, and defenders see it as a justifiable harm-prevention measure, both sides mistakenly treat the debate like a run-of-the-mill free speech conflict, rather than an issue of academic freedom specifically. Content-based restrictions on speech in universities are ubiquitous. And this is no affront to a liberal conception of academic freedom, whose purpose isnā€™t just to protect the speech of academics, but also to give them the prerogative to determine which views and speakers have sufficient disciplinary credentials to receive a hearing in academic contexts. No platforming should therefore be acceptable to liberals, in principle, in cases where it is used to support a university culture that maintains rigorous disciplinary standards, by denying attention and credibility to speakers without appropriate disciplinary credentials

    Lecture Notes on the Application of the Functions of Semantic Categories in Translation

    Get PDF
    The aim of this paper is to shed light on the contribution of semantic categories to translation To this end the paper uses both a theoretical and a descriptive approach Indeed the theories relating to the terms category and semantic categories are presented Then the functions of semantic categories are described in translated sentences Finally the paper has found that the kernels of English sentences together with the transformations that can be derived from every clause or phrase are key aspects of sentence comprehension and stylistic differences Equally important is the finding that translation becomes easier and more technical when the functions of semantic categories are applied in sentence constructio

    Bjƶrn E. Lindblom 80 Ʃves

    Get PDF

    Removing the Silencer : Coverage and Protection of Physician Speech Under the First Amendment

    Get PDF
    The physicianā€“patient relationship rests on a bedrock of trust. Without trust, patientsā€”and for that matter, physiciansā€”are less willing to divulge information critical to providing accurate medical diagnoses and treatments. The state of Florida seemingly ignored this when its legislature, with support from the National Rifle Association and other pro-gun advocates, enacted the Firearm Owners Privacy Act (FOPA), a statute that restricts physicians from questioning their patients about firearm ownership. In Wollschlaeger v. Governor of Florida , the United States Court of Appeals for the Eleventh Circuit held that FOPA did not regulate physician speech but, instead, regulated physician conduct. As such, the law was exempted from First Amendment scrutiny. But almost one year to the day after publishing its first Wollschlaeger opinion, the Eleventh Circuit sua sponte vacated its original opinion and substituted in its place a brand new opinionā€”one holding that FOPA was subject to First Amendment scrutiny, but nonetheless passed constitutional muster. This Note uses the diverging Wollschlaeger opinions as a vehicle to analyze the First Amendmentā€™s coverage and protection of physician speech. Specifically, it argues that an uninhibited line of communication is required to protect the trust necessary for an effective physicianā€“patient relationship. This logical underpinning leads to the conclusion that the First Amendment presumptively covers physician speech and, furthermore, that physician speech should be subject to intermediate scrutinyā€”a level of scrutiny that FOPA cannot meet

    The listening talker: A review of human and algorithmic context-induced modifications of speech

    Get PDF
    International audienceSpeech output technology is finding widespread application, including in scenarios where intelligibility might be compromised - at least for some listeners - by adverse conditions. Unlike most current algorithms, talkers continually adapt their speech patterns as a response to the immediate context of spoken communication, where the type of interlocutor and the environment are the dominant situational factors influencing speech production. Observations of talker behaviour can motivate the design of more robust speech output algorithms. Starting with a listener-oriented categorisation of possible goals for speech modification, this review article summarises the extensive set of behavioural findings related to human speech modification, identifies which factors appear to be beneficial, and goes on to examine previous computational attempts to improve intelligibility in noise. The review concludes by tabulating 46 speech modifications, many of which have yet to be perceptually or algorithmically evaluated. Consequently, the review provides a roadmap for future work in improving the robustness of speech output
    • ā€¦
    corecore