33 research outputs found

    Treebanking user-generated content: a UD based overview of guidelines, corpora and unified recommendations

    This article presents a discussion on the main linguistic phenomena which cause difficulties in the analysis of user-generated texts found on the web and in social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework of syntactic analysis. Given on the one hand the increasing number of treebanks featuring user-generated content, and its somewhat inconsistent treatment in these resources on the other, the aim of this article is twofold: (1) to provide a condensed, though comprehensive, overview of such treebanks, based on available literature, along with their main features and a comparative analysis of their annotation criteria, and (2) to propose a set of tentative UD-based annotation guidelines, to promote consistent treatment of the particular phenomena found in these types of texts. The overarching goal of this article is to provide a common framework for researchers interested in developing similar resources in UD, thus promoting cross-linguistic consistency, which is a principle that has always been central to the spirit of UD.

    Engage students in news writing

    The evolution of technology affects how information is produced and consumed by users. However, as the volume of content available on online news platforms grows, misinformation and less credible content grow with it. In this context, the present research aims to develop a technological ecosystem to promote students’ writing ability. The system will help students search for credible content to create school newspapers. This article presents the architecture of a news writing tool for the Portuguese language and introduces a constructive approach in which that architecture supports the development of a news creation tool.