131 research outputs found
Research Articles in Simplified HTML: a Web-first format for HTML-based scholarly articles
Purpose. This paper introduces the Research Articles in Simplified HTML (or RASH), which is a Web-first format for writing HTML-based scholarly papers; it is accompanied by the RASH Framework, a set of tools for interacting with RASH-based articles. The paper also presents an evaluation that involved authors and reviewers of RASH articles submitted to the SAVE-SD 2015 and SAVE-SD 2016 workshops.
Design. RASH has been developed aiming to: be easy to learn and use; share scholarly documents (and embedded semantic annotations) through the Web; support its adoption within the existing publishing workflow.
Findings. The evaluation study confirmed that RASH is ready to be adopted in workshops, conferences, and journals and can be quickly learnt by researchers who are familiar with HTML.
Research Limitations. The evaluation study also highlighted some issues in the adoption of RASH, and in general of HTML formats, especially by less technically savvy users. Moreover, additional tools are needed, e.g., for enabling additional conversions from/to existing formats such as OpenXML.
Practical Implications. RASH (and its Framework) is another step towards enabling the definition of formal representations of the meaning of the content of an article, facilitating its automatic discovery, enabling its linking to semantically related articles, providing access to data within the article in actionable form, and allowing integration of data between papers.
Social Implications. RASH addresses the intrinsic needs related to the various users of a scholarly article: researchers (focussing on its content), readers (experiencing new ways for browsing it), citizen scientists (reusing available data formally defined within it through semantic annotations), publishers (using the advantages of new technologies as envisioned by the Semantic Publishing movement).
Value. RASH helps authors to focus on the organisation of their texts, supports them in the task of semantically enriching the content of articles, and leaves all the issues about validation, visualisation, conversion, and semantic data extraction to the various tools developed within its Framework
Wanted: standards for automatic reproducibility of computational experiments
Those seeking to reproduce a computational experiment often need to manually
look at the code to see how to build necessary libraries, configure parameters,
find data, and invoke the experiment; it is not automatic. Automatic
reproducibility is a more stringent goal, but working towards it would benefit
the community. This work discusses a machine-readable language for specifying
how to execute a computational experiment. We invite interested stakeholders to
discuss this language at https://github.com/charmoniumQ/execution-description .Comment: Submitted to SE4RS'23 Portland, O
A method for defining human-machine micro-task workflows for gathering legal information
Series : Lecture notes in computer science, ISSN 0302-9743, vol. 8929With the growing popularity of micro-task crowdsourcing platforms, new workflow-based micro-task crowdsourcing approaches are starting to emerge. Such workflows occur in legal, political and conflict resolution do-mains as well, presenting new challenges, namely in micro-task specification and human-machine interaction, which result mostly from the flow of unstruc-tured data. Domain ontologies provide the structure and semantics required to describe the data flowing throughout the workflow in a way understandable to both humans and machines. This paper presents a method for the construction of micro-task workflows from legal domain ontologies. The method is currently being employed in the context of the UMCourt project in order to formulate in-formation retrieval and conflict resolution workflows.This work is part-funded by FEDER Funds, by the ERDF (Eu-ropean Regional Development Fund) through the COMPETE Programme (operation-al programme for competitiveness) and by National Funds through the FCT (Portu-guese Foundation for Science and Technology) within the project FCOMP-01-0124-FEDER-028980 (PTDC/EEI-SII/1386/2012). The work of Nuno Luz is supported by the doctoral grant SFRH/BD/70302/2010
Document Automation Architectures: Updated Survey in Light of Large Language Models
This paper surveys the current state of the art in document automation (DA).
The objective of DA is to reduce the manual effort during the generation of
documents by automatically creating and integrating input from different
sources and assembling documents conforming to defined templates. There have
been reviews of commercial solutions of DA, particularly in the legal domain,
but to date there has been no comprehensive review of the academic research on
DA architectures and technologies. The current survey of DA reviews the
academic literature and provides a clearer definition and characterization of
DA and its features, identifies state-of-the-art DA architectures and
technologies in academic research, and provides ideas that can lead to new
research opportunities within the DA field in light of recent advances in
generative AI and large language models.Comment: The current paper is the updated version of an earlier survey on
document automation [Ahmadi Achachlouei et al. 2021]. Updates in the current
paper are as follows: We shortened almost all sections to reduce the size of
the main paper (without references) from 28 pages to 10 pages, added a review
of selected papers on large language models, removed certain sections and
most of diagrams. arXiv admin note: substantial text overlap with
arXiv:2109.1160
It ROCS! The RASH Online Conversion Service
In this poster paper we introduce the RASH Online Conversion Service, i.e., a Web application that allows the conversion of ODT documents into RASH, a HTML-based markup language for writing scholarly articles, and from RASH into LaTeX according to Springer LNCS and ACM ICPS
Exploring the Betrothed Lovers
We present the ongoing activities and the first results achieved in a research project concerning the understanding of narrative in the high school. Students and teachers experimented with new ways to learn linguistic and digital skills, by using a collaborative learning environment built around the novel I Promessi Sposi. We analyzed the literary text, extracting social networks of characters and other fundamental narrative elements (sequences, locations, etc.), in order to provide the students with appropriate tools and resources to conduct their own inquiries on the novel
- …