
    CREATION OF A DIGITAL CORPUS OF BULGARIAN DIALECTS

    The paper presents our considerations related to the creation of a digital corpus of Bulgarian dialects. The dialectological archive of the Bulgarian language consists of more than 250 audio tapes. All tapes were recorded between 1955 and 1965 in the course of regular dialectological expeditions throughout the country. The recordings typically contain interviews with inhabitants of small villages in Bulgaria. The topics covered usually relate to birth, everyday life, marriage, family relationships, death, etc. Only a few tapes contain folk songs from different regions of the country. In view of the progressive deterioration of the magnetic media and the real prospect of data loss, the Institute for Bulgarian Language at the Academy of Sciences launched a project in 1997 aimed at the restoration and digital preservation of the dialectological archive. Within the framework of this project, more than half of the recordings were digitized, de-noised and stored on digital recording media. Since then, restoration and digitization have been carried out at the Institute on a regular basis. As a result, a large collection of sound files has been gathered. Our further efforts are aimed at the creation of a digital corpus of Bulgarian dialects, which will be made available for phonological and linguistic research. Besides the sound files, such corpora typically include two basic elements: a transcription aligned with the sound file, and a set of standardized metadata that describes the corpus. In our work we present considerations on how these tasks could be realized in the case of the corpus of Bulgarian dialects. Our suggestions are based on a comparative analysis of existing methods and techniques for building such corpora, selecting the ones that best fit the particular needs. Our experience can be used by similar institutions that store folklore archives, history-related spoken records, etc.

    Digital Presentation of Bulgarian Lexical Heritage. Towards an Electronic Historical Dictionary

    The article presents the results of the project “ICT Tools for Historical Linguistic Studies”, funded by the European Social Fund, OP Human Resources. The main project goal was to develop electronic tools for creating a Historical Dictionary of Diachronic Type that should present the history of Bulgarian words from their first written occurrence until today. By the end of the project the team (Faculty of Slavic Studies at Sofia University, Institute for Bulgarian Language, BAS and PAM Publishing Company, Sofia) had at their disposal a set of Old Bulgarian Unicode fonts, meant for publishing medieval texts, and a converter that converts non-Unicode documents into the new standard. The converter allowed the participants to create in a relatively short time a Diachronic text corpus of Bulgarian medieval texts, which already contains more than 90 texts dated from the 10th to the 18th century. The corpus software enables editing of the texts and turned out to be an excellent tool for preparing electronic editions of the Old Bulgarian (OCS) manuscripts. In addition to the corpus, an electronic dictionary of Old Bulgarian is available, which contains the digitized version of Старобългарски речник, produced by IBL. Both tools are accessible on the project website at the address histdict.uni-sofia.bg. The Standard of the Historical Dictionary took shape in the course of the project, and the corresponding software for elaborating new dictionary entries was designed and tested. The article also includes screenshots that demonstrate the functionalities of both the corpus and the dictionary software.

    06491 Abstracts Collection -- Digital Historical Corpora- Architecture, Annotation, and Retrieval

    From 03.12.06 to 08.12.06, the Dagstuhl Seminar 06491 “Digital Historical Corpora - Architecture, Annotation, and Retrieval” was held in the International Conference and Research Center (IBFI), Schloss Dagstuhl. During the seminar, several participants presented their current research, and ongoing work and open problems were discussed. Abstracts of the presentations given during the seminar as well as abstracts of seminar results and ideas are put together in this paper. The first section describes the seminar topics and goals in general. Links to extended abstracts or full papers are provided, if available.

    Slavic Dialectology: A Survey of Research since 1989

    The last 25 years in Slavic dialectology mark the period not only of JSL’s founding but also of major and multiple political, social, and economic reorganizations in predominantly Slavic-speaking states. During this period research institutions and their priorities and projects have both continued and changed; technological innovation has meant moving towards electronic dissemination, “digital humanities,” and innovative modes of presenting research data and findings. In some cases major works (e.g., dialect atlases) have advanced during this period. Moreover, a new generation of scholars has had greater opportunities for mobility and therefore exposure to a variety of linguistic frameworks and approaches, which has fostered cross-border collaboration in the field. The present essay gives an overview of progress made on dialect projects, both institutional and individual, disseminated through both traditional (book, article) and new digital means.

    Use of Knowledge Technologies for Presentation of Bulgarian Folklore Heritage Semantics

    Preserving and presenting the Bulgarian folklore heritage is a long-term commitment of scholars and researchers working in many areas. This article presents an ontological model of Bulgarian folklore knowledge, exploring knowledge technologies for presenting the semantics of the phenomena of our traditional culture. This model is a step towards the development of the digital library for the “Bulgarian Folklore Heritage” virtual exposition, which is a part of the “Knowledge Technologies for Creation of Digital Presentation and Significant Repositories of Folklore Heritage” project.

    Proceedings of the Second Workshop on Annotation of Corpora for Research in the Humanities (ACRH-2). 29 November 2012, Lisbon, Portugal

    Proceedings of the Second Workshop on Annotation of Corpora for Research in the Humanities (ACRH-2), held in Lisbon, Portugal on 29 November 2012