12 research outputs found

    The cross-linguistic performance of word segmentation models over time.

    Get PDF
    We select three word segmentation models with psycholinguistic foundations - transitional probabilities, the diphone-based segmenter, and PUDDLE - which track phoneme co-occurrence and positional frequencies in input strings, and in the case of PUDDLE build lexical and diphone inventories. The models are evaluated on caregiver utterances in 132 CHILDES corpora representing 28 languages and 11.9 m words. PUDDLE shows the best performance overall, albeit with wide cross-linguistic variation. We explore the reasons for this variation, fitting regression models to performance scores with linguistic properties which capture lexico-phonological characteristics of the input: word length, utterance length, diversity in the lexicon, the frequency of one-word utterances, the regularity of phoneme patterns at word boundaries, and the distribution of diphones in each language. These properties together explain four-tenths of the observed variation in segmentation performance, a strong outcome and a solid foundation for studying further variables which make the segmentation task difficult

    Research Developments in World Englishes

    Get PDF
    This book is available as open access through the Bloomsbury Open Access programme and is available on www.bloomsburycollections.com. It is funded by the University of Klagenfurt, Austria. Discussing key issues of current relevance and setting the tone for future research in world Englishes, this book provides new perspectives on the diverse realities of Englishes around the world. Written by an international team of established and renowned scholars, it is the inaugural volume in the new series Bloomsbury Advances in World Englishes, dedicated to advancing research in the field. Chapters discuss important topics in contemporary world Englishes research, including de-colonial approaches, emerging varieties in post-protectorates and international uses as communicative events to highlight the globalizing aspect of English as a semiotic code. The book also expands on cultural conceptualizations to investigate the connections between Englishes and localized cultural knowledge and ongoing changes and attitudes towards local forms in multilingual settings. Closing with an examination of how world Englishes and the use of English as a lingua franca could influence the future teaching of Englishes, Research Developments in World Englishes presents a detailed picture of contemporary research approaches and points the way towards exciting future directions

    Research Developments in World Englishes

    Get PDF
    This book is available as open access through the Bloomsbury Open Access programme and is available on www.bloomsburycollections.com. It is funded by the University of Klagenfurt, Austria. Discussing key issues of current relevance and setting the tone for future research in world Englishes, this book provides new perspectives on the diverse realities of Englishes around the world. Written by an international team of established and renowned scholars, it is the inaugural volume in the new series Bloomsbury Advances in World Englishes, dedicated to advancing research in the field. Chapters discuss important topics in contemporary world Englishes research, including de-colonial approaches, emerging varieties in post-protectorates and international uses as communicative events to highlight the globalizing aspect of English as a semiotic code. The book also expands on cultural conceptualizations to investigate the connections between Englishes and localized cultural knowledge and ongoing changes and attitudes towards local forms in multilingual settings. Closing with an examination of how world Englishes and the use of English as a lingua franca could influence the future teaching of Englishes, Research Developments in World Englishes presents a detailed picture of contemporary research approaches and points the way towards exciting future directions

    CLARIN

    Get PDF
    The book provides a comprehensive overview of the Common Language Resources and Technology Infrastructure – CLARIN – for the humanities. It covers a broad range of CLARIN language resources and services, its underlying technological infrastructure, the achievements of national consortia, and challenges that CLARIN will tackle in the future. The book is published 10 years after establishing CLARIN as an Europ. Research Infrastructure Consortium

    CLARIN. The infrastructure for language resources

    Get PDF
    CLARIN, the "Common Language Resources and Technology Infrastructure", has established itself as a major player in the field of research infrastructures for the humanities. This volume provides a comprehensive overview of the organization, its members, its goals and its functioning, as well as of the tools and resources hosted by the infrastructure. The many contributors representing various fields, from computer science to law to psychology, analyse a wide range of topics, such as the technology behind the CLARIN infrastructure, the use of CLARIN resources in diverse research projects, the achievements of selected national CLARIN consortia, and the challenges that CLARIN has faced and will face in the future. The book will be published in 2022, 10 years after the establishment of CLARIN as a European Research Infrastructure Consortium by the European Commission (Decision 2012/136/EU)

    CLARIN

    Get PDF
    The book provides a comprehensive overview of the Common Language Resources and Technology Infrastructure – CLARIN – for the humanities. It covers a broad range of CLARIN language resources and services, its underlying technological infrastructure, the achievements of national consortia, and challenges that CLARIN will tackle in the future. The book is published 10 years after establishing CLARIN as an Europ. Research Infrastructure Consortium

    Rhotics.New Data and Perspectives

    Get PDF
    This book provides an insight into the patterns of variation and change of rhotics in different languages and from a variety of perspectives. It sheds light on the phonetics, the phonology, the socio-linguistics and the acquisition of /r/-sounds in languages as diverse as Dutch, English, French, German, Greek, Hebrew, Italian, Kuikuro, Malayalam, Romanian, Slovak, Tyrolean and Washili Shingazidja thus contributing to the discussion on the unity and uniqueness of this group of sounds
    corecore