21 research outputs found

    Context-Dependent Acoustic Modelling for Speech Recognition

    Get PDF
    Ph.DDOCTOR OF PHILOSOPH

    Two uses for syllables in a speech recognition system

    Get PDF

    The stop-like modification of /ð/ : a case study in the analysis and handling of speech variation

    Get PDF
    Thesis (Ph. D.)--Harvard-MIT Division of Health Sciences and Technology, 2007.Includes bibliographical references (leaves 138-142).Phonetic variation is pervasive in everyday speech. Studying these variations is essential for building acoustic models and lexical representations that effectively capture the variability of speech. This thesis examines one of the commonly-occurring phonetic variations in English: the stop-like modification of the dental fricative /ð/. This variant exhibits a drastic change from the canonical /ð/; the manner of production is changed from one that is fricative to one that is stop-like. Furthermore, the place of articulation of stop-like /0/ has been a point of uncertainty, leading to the confusion between stop-like /ð/1 and /d/. This thesis aims to uncover the segmental context of stop-like /ð/, possible causes of the modification, whether the dental place of articulation is preserved despite modification, and if there are salient acoustic cues that distinguish between stop-like /ð/ and /d/. Word-initial /ð/ in the read speech of the TIMIT Database, the task-oriented spontaneous speech of the AEMT Corpus, and the non-task-oriented spontaneous speech of the Buckeye Corpus are examined acoustically. It is found that stop-like /ð/ occurs most often when it is preceded by silence or when preceded by a stop consonant. The occurrence is less frequent when /ð/ is preceded by a fricative or an affricate consonant. This modification rarely occurs when /ð/ is preceded by a vowel or liquid consonant. The findings suggest that possible factors that may contribute to the stop-like modification of /ð/include physiological mechanisms of speech production, prosody, and/or other aspects of speaking style and manner. Acoustic analysis indicates that stop-like /ð/ is significantly different from /d/ in burst amplitude, burst spectrum shape, burst peak frequency, and second formant at following- vowel onset.(cont.) Moreover, the acoustic differences indicate that the dental place of articulation is preserved for stop-like /ð/. Automatic classification experiments involving these acoustic measures suggest that they are robust in distinguishing stop-like /ð/ from /d/. Applications of these findings may lie in areas of automatic speech recognition, speech transcription, and development of acoustic measures for speech disorder diagnosis.by Sherry Y. Zhao.Ph.D

    Intertextual Readings of the Nyāyabhūṣaṇa on Buddhist Anti-Realism

    Get PDF
    This two-part dissertation has two goals: 1) a close philological reading of a 50-page section of a 10th-century Sanskrit philosophical work (Bhāsarvajña's Nyāyabhūṣaṇa), and 2) the creation and assessment of a novel intertextuality research system (Vātāyana) centered on the same work. The first half of the dissertation encompasses the philology project in four chapters: 1) background on the author, work, and key philosophical ideas in the passage; 2) descriptions of all known manuscript witnesses of this work and a new critical edition that substantially improves upon the editio princeps; 3) a word-for-word English translation richly annotated with both traditional explanatory material and novel digital links to not one but two interactive online research systems; and 4) a discussion of the Sanskrit author's dialectical strategy in the studied passage. The second half of the dissertation details the intertextuality research system in a further four chapters: 5) why it is needed and what can be learned from existing projects; 6) the creation of the system consisting of curated textual corpus, composite algorithm in natural language processing and information retrieval, and live web-app interface; 7) an evaluation of system performance measured against a small gold-standard dataset derived from traditional philological research; and 8) a discussion of the impact such new technology could have on humanistic research more broadly. System performance was assessed to be quite good, with a 'recall@5' of 80%, meaning that most previously known cases of mid-length quotation and even paraphrase could be automatically found and returned within the system's top five hits. Moreover, the system was also found to return a 34% surplus of additional significant parallels not found in the small benchmark. This assessment confirms that Vātāyana can be useful to researchers by aiding them in their collection and organization of intertextual observations, leaving them more time to focus on interpretation. Seventeen appendices illustrate both these efforts and a number of side projects, the latter of which span translation alignment, network visualization of an important database of South Asian prosopography (PANDiT), and a multi-functional Sanskrit text-processing web application (Skrutable).:Preface (i) Table of Contents (ii) Abbreviations (v) Terms and Symbols (v) Nyāyabhūṣaṇa Witnesses (v) Main Sanskrit Editions (vi) Introduction (vii) A Multi-Disciplinary Project in Intertextual Reading (vii) Main Object of Study: Nyāyabhūṣaṇa 104–154 (vii) Project Outline (ix) Part I: Close Reading (1) 1 Background (1) 1.1 Bhāsarvajña (1) 1.2 The Nyāyabhūṣaṇa (6) 1.2.1 Ts One of Several Commentaries on Bhāsarvajña's Nyāyasāra (6) 1.2.2 In Modern Scholarship, with Focus on NBhū 104–154 (8) 1.3 Philosophical Context (11) 1.3.1 Key Philosophical Concepts (12) 1.3.2 Intra-Textual Context within the Nyāyabhūṣaṇa (34) 1.3.3 Inter-Textual Context (36) 2 Edition of NBhū 104–154 (39) 2.1 Source Materials (39) 2.1.1 Edition of Yogīndrānanda 1968 (E) (40) 2.1.2 Manuscripts (P1, P2, V) (43) 2.1.3 Diplomatic Transcripts (59) 2.2 Notes on Using the Edition (60) 2.3 Critical Edition of NBhū 104–154 with Apparatuses (62) 3 Translation of NBhū 104–154 (108) 3.1 Notes on Translation Method (108) 3.2 Notes on Outline Headings (112) 3.3 Annotated Translation of NBhū 104–154 (114) 4 Discussion (216) 4.1 Internal Structure of NBhū 104–154 (216) 4.2 Critical Assessment of Bhāsarvajña's Argumentation (218)   Part II: Distant Reading with Digital Humanities (224) 5 Background in Intertextuality Detection (224) 5.1 Sanskrit Projects (225) 5.2 Non-Sanskrit Projects (228) 5.3 Operationalizing Intertextuality (233) 6 Building an Intertextuality Machine (239) 6.1 Corpus (Pramāṇa NLP) (239) 6.2 Algorithm (Vātāyana) (242) 6.3 User Interface (Vātāyana) (246) 7 Evaluating System Performance (255) 7.1 Previous Scholarship on NBhū 104–154 as Philological Benchmark (255) 7.2 System Performance Relative to Benchmark (257) 8 Discussion (262) Conclusion (266) Works Cited (269) Main Sanskrit Editions (269) Works Cited in Part I (271) Works Cited in Part II (281) Appendices (285) Appendix 1: Correspondence of Joshi 1986 to Yogīndrānanda 1968 (286) Appendix 1D: Full-Text Alignment of Joshi 1986 to Yogīndrānanda 1968 (287) Appendix 2: Prosopographical Relations Important for NBhū 104–154 (288) Appendix 2D: Command-Line Tool “Pandit Grapher” (290) Appendix 3: Previous Suggestions to Improve Text of NBhū 104–154 (291) Appendix 4D: Transcript and Collation Data for NBhū 104–154 (304) Appendix 5D: Command-Line Tool “cte2cex” for Transcript Data Conversion (305) Appendix 6D: Deployment of Brucheion for Interactive Transcript Data (306) Appendix 7: Highlighted Improvements to Text of NBhū 104–154 (307) Appendix 7D: Alternate Version of Edition With Highlighted Improvements (316) Appendix 8D: Digital Forms of Translation of NBhū 104–154 (317) Appendix 9: Analytic Outline of NBhū 104–154 by Shodo Yamakami (318) Appendix 10.1: New Analytic Outline of NBhū 104–154 (Overall) (324) Appendix 10.2: New Analytic Outline of NBhū 104–154 (Detailed) (325) Appendix 11D: Skrutable Text Processing Library and Web Application (328) Appendix 12D: Pramāṇa NLP Corpus, Metadata, and LDA Modeling Info (329) Appendix 13D: Vātāyana Intertextuality Research Web Application (330) Appendix 14: Sample of Yamakami Citation Benchmark for NBhū 104–154 (331) Appendix 14D: Full Yamakami Citation Benchmark for NBhū 104–154 (333) Appendix 15: Vātāyana Recall@5 Scores for NBhū 104–154 (334) Appendix 16: PVA, PVin, and PVSV Vātāyana Search Hits for Entire NBhū (338) Appendix 17: Sample Listing of Vātāyana Search Hits for Entire NBhū (349) Appendix 17D: Full Listing of Vātāyana Search Hits for Entire NBhū (355) Overview of Digital Appendices (356) Zusammenfassung (Thesen Zur Dissertation) (357) Summary of Results (361

    Boise State University Catalog: 1990-1991 (UP 4.4)

    Get PDF

    Treatise on Hearing: The Temporal Auditory Imaging Theory Inspired by Optics and Communication

    Full text link
    A new theory of mammalian hearing is presented, which accounts for the auditory image in the midbrain (inferior colliculus) of objects in the acoustical environment of the listener. It is shown that the ear is a temporal imaging system that comprises three transformations of the envelope functions: cochlear group-delay dispersion, cochlear time lensing, and neural group-delay dispersion. These elements are analogous to the optical transformations in vision of diffraction between the object and the eye, spatial lensing by the lens, and second diffraction between the lens and the retina. Unlike the eye, it is established that the human auditory system is naturally defocused, so that coherent stimuli do not react to the defocus, whereas completely incoherent stimuli are impacted by it and may be blurred by design. It is argued that the auditory system can use this differential focusing to enhance or degrade the images of real-world acoustical objects that are partially coherent. The theory is founded on coherence and temporal imaging theories that were adopted from optics. In addition to the imaging transformations, the corresponding inverse-domain modulation transfer functions are derived and interpreted with consideration to the nonuniform neural sampling operation of the auditory nerve. These ideas are used to rigorously initiate the concepts of sharpness and blur in auditory imaging, auditory aberrations, and auditory depth of field. In parallel, ideas from communication theory are used to show that the organ of Corti functions as a multichannel phase-locked loop (PLL) that constitutes the point of entry for auditory phase locking and hence conserves the signal coherence. It provides an anchor for a dual coherent and noncoherent auditory detection in the auditory brain that culminates in auditory accommodation. Implications on hearing impairments are discussed as well.Comment: 603 pages, 131 figures, 13 tables, 1570 reference

    A South Indian Digest of Commentaries on the Nyāyasūtra

    Get PDF
    The Nyāyasūtravivaraṇa, written in the first centuries of the 2nd millennium CE, provides the most accessible introduction to the core teachings of old Nyāya. Excerpting from the two earliest and most important treatises of this tradition—the Nyāyabhāṣya and Nyāyavārttika—Gambhīravaṃśaja created a comprehensive yet concise digest. The present work contains not only a critical edition of the first chapter based on all known textual sources but also a complete documentation of the variants, a comprehensive study of the parallel passages, a detailed discussion of the preparation and processing of the text-critical data, and a detailed documentation of the Grantha Tamil, Telugu and Kannada scripts.Das Nyāyasūtravivaraṇa, geschrieben in den ersten Jahrhunderten des 2. Jahrtausends u. Z., bietet die zugänglichste Einführung in die Kernlehren des alten Nyāya. Anhand von Auszügen aus den beiden frühesten und wichtigsten Abhandlungen dieser Tradition – dem Nyāyabhāṣya und dem Nyāyavārttika – schuf Gambhīravaṃśaja ein umfassendes und doch prägnantes Digest. Das vorliegende Werk enthält nicht nur eine kritische Ausgabe des ersten Kapitels basierend auf allen bekannten Textquellen, sondern auch eine vollständige Dokumentation der Varianten, eine umfassende Studie der Parallelstellen, eine detaillierte Erörterung der Aufbereitung und Verarbeitung von textkritischen Daten sowie eine ausführliche Dokumentation der Grantha Tamil-, Telugu- und Kannada-Schriften

    A South Indian Digest of Commentaries on the Nyāyasūtra

    Get PDF
    The Nyāyasūtravivaraṇa, written in the first centuries of the 2nd millennium CE, provides the most accessible introduction to the core teachings of old Nyāya. Excerpting from the two earliest and most important treatises of this tradition—the Nyāyabhāṣya and Nyāyavārttika—Gambhīravaṃśaja created a comprehensive yet concise digest. The present work contains not only a critical edition of the first chapter based on all known textual sources but also a complete documentation of the variants, a comprehensive study of the parallel passages, a detailed discussion of the preparation and processing of the text-critical data, and a detailed documentation of the Grantha Tamil, Telugu and Kannada scripts

    Agroecological Transitions: From Theory to Practice in Local Participatory Design

    Get PDF
    This Open Access book presents feedback from the ‘Territorial Agroecological Transition in Action’- TATA-BOX research project, which was devoted to these specific issues. The multidisciplinary and multi-organisation research team steered a four-year action-research process in two territories of France. It also presents: i) the key dimensions to be considered when dealing with agroecological transition: diversity of agriculture models, management of uncertainties, polycentric governance, autonomies, and role of actors’ networks; ii) an operational and original participatory process and associated boundary tools to support local stakeholders in shifting from a shared diagnosis to a shared action plan for transition, and in so doing developing mutual understanding and involvement; iii) an analysis of the main effects of the methodology on research organisation and on stakeholders’ development and application; iv) critical analysis and foresights on the main outcomes of TATA-BOX, provided by external researchers
    corecore