29 research outputs found

    UWB @ DIACR-Ita: Lexical Semantic Change Detection with CCA and Orthogonal Transformation

    Get PDF
    In this paper, we describe our method for detection of lexical semantic change (i.e., word sense changes over time) for the DIACR-Ita shared task, where we ranked 1st. We examine semantic differences between specific words in two Italian corpora, chosen from different time periods. Our method is fully unsupervised and language independent. It consists of preparing a semantic vector space for each corpus, earlier and later. Then we compute a linear transformation between earlier and later spaces, using CCA and Orthogonal Transformation. Finally, we measure the cosines between the transformed vectors

    The Second Cross-Lingual Challenge on Recognition, Normalization, Classification, and Linking of Named Entities across Slavic Languages

    Get PDF
    We describe the Second Multilingual Named Entity Challenge in Slavic languages. The task is recognizing mentions of named entities in Web documents, their normalization, and cross-lingual linking The Challenge was organized as part of the 7th Balto-Slavic Natural Language Processing Workshop, co-located with the ACL-2019 conference. Eight teams participated in the competition, which covered four languages and five entity types. Performance for the named entity recognition task reached 90% F-measure, much higher than reported in the first edition of the Challenge. Seven teams covered all four languages, and five teams participated in the cross-lingual entity linking task. Detailed evaluation information is available on the shared task web page.Non peer reviewe

    The Second Cross-Lingual Challenge on Recognition, Normalization, Classification, and Linking of Named Entities across Slavic Languages

    Get PDF
    We describe the Second Multilingual Named Entity Challenge in Slavic languages. The task is recognizing mentions of named entities in Web documents, their normalization, and cross-lingual linking The Challenge was organized as part of the 7th Balto-Slavic Natural Language Processing Workshop, co-located with the ACL-2019 conference. Eight teams participated in the competition, which covered four languages and five entity types. Performance for the named entity recognition task reached 90% F-measure, much higher than reported in the first edition of the Challenge. Seven teams covered all four languages, and five teams participated in the cross-lingual entity linking task. Detailed evaluation information is available on the shared task web page.Non peer reviewe

    Perception of the municipality in the context of a historical heritage using the example of the cheb trusess

    Get PDF
    The paper deals with the perception and image of the town Cheb in the context of historical heritage. Since 2017, the town has been running a tour of historic trusses. The aim of the research was to determine the influence of this activity on the image of the town, or whether the historic roofs already form part of the image of the town at least for some Czech residents and citizens. So far, the image of the city has not been systematically investigated. Two surveys have been carried out, one among the city residents and one among the citizens of the Czech Republic. It was found that the city of Cheb is associated with historical heritage in the minds of the respondents. However, outside the town of Cheb the historical trusses are not yet sufficiently associated with the perception of it. The article also presents a semantic differential with citizens' perception of the town of Cheb

    EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020

    Get PDF
    Welcome to EVALITA 2020! EVALITA is the evaluation campaign of Natural Language Processing and Speech Tools for Italian. EVALITA is an initiative of the Italian Association for Computational Linguistics (AILC, http://www.ai-lc.it) and it is endorsed by the Italian Association for Artificial Intelligence (AIxIA, http://www.aixia.it) and the Italian Association for Speech Sciences (AISV, http://www.aisv.it)

    Mobile client for the gamebook publishing system

    No full text
    Cílem této bakalářské práce je vytvořit mobilního klienta (aplikaci) pro platformu Android, který umožňuje procházet gamebooky vytvořené ve vyvíjeném systému. Gamebook je kniha, ve které se čtenář rozhoduje jak se bude příběh vyvíjet. V první části práce jsou porovnány některé dostupné aplikace, které umožňují procházení gamebooků. Druhá část práce popisuje vyvíjený systém, ve kterém je možné gamebooky vytvářet a publikovat. Součástí tohoto systému je i mobilní klient. Poslední část obsahuje návrh klienta a popis implementace včetně ověření jeho funkcionality.ObhájenoThe purpose of this bachelor thesis is to create mobile client (application) for the Android platform, which can read gamebooks that are created in the system which is being developed. Gamebook is a book that allows reader to make decisions and partly influence the story of the book. In the first part of this work there is a comparison of applications that can read gamebooks. The second part describes the system for creating and publishing gamebooks which is being developed. Mobile client is part of this system. The last part contains design of the client and description of implementation including verification of functionality

    Advanced searching in data from news portals

    No full text
    Cílem této diplomové práce je realizovat jednoduché a rozšířené vyhledávání pro systém MediaGist v datech ze zpravodajských portálů. MediaGist je on-line systém pro kroslinguální analýzu agregovaných zpráv a komentářů založený na technologii sumarizace a analýze sentimentu. V první části této práce jsou popsány základní principy vyhledávání informací. Druhá část práce se věnuje porovnání nástrojů Elasticsearch a Apache Solr umožňujících textové vyhledávání. Dále je popsán návrh~a~implementace vyhledávání za pomoci nástroje Elasticsearch. Poslední část práce zahrnuje testování a vyhodnocení vytvořeného vyhledávání.ObhájenoThe purpose of this master thesis is to create simple and advanced searching for MediaGist system in data from news portals. MediaGist is an online system~for~crosslingual analysis of aggregated news and commentaries based on summarization and sentiment analysis technologies. In the first part of this work the basic principles of information retrieval are described. The second part deals with the comparison of Elasticsearch and Apache Solr, that allows text searching. Next is described design and implementation of the searching using the Elasticsearch tool. The last part contains testing and evaluation of the created searching
    corecore