189 research outputs found
The Porter stemming algorithm: then and now
Purpose: In 1980, Porter presented a simple algorithm for stemming English language words. This paper summarises the main features of the algorithm, and highlights its role not just in modern information retrieval research, but also in a range of related subject domains.
Design: Review of literature and research involving use of the Porter algorithm.
Findings: The algorithm has been widely adopted and extended so that it has become the standard approach to word conflation for information retrieval in a wide range of languages.
Value: The 1980 paper in Program by Porter describing his algorithm has been highly cited. This paper provides a context for the original paper as well as an overview of its subsequent use
InsightiGen: a versatile tool to generate insight for an academic systematic literature review
A comprehensive literature review has always been an essential first step of
every meaningful research. In recent years, however, the availability of a vast
amount of information in both open-access and subscription-based literature in
every field has made it difficult, if not impossible, to be certain about the
comprehensiveness of one's survey. This subsequently can lead to reviewers'
questioning of the novelties of the research directions proposed, regardless of
the quality of the actual work presented. In this situation, statistics derived
from the published literature data can provide valuable quantitative and visual
information about research trends, knowledge gaps, and research networks and
hubs in different fields. Our tool provides an automatic and rapid way of
generating insight for systematic reviews in any research area.Comment: 15 pages, 5 figure
An approach to graph-based analysis of textual documents
In this paper a new graph-based model is proposed for the representation of textual documents. Graph-structures are obtained from textual documents by making use of the well-known Part-Of-Speech (POS) tagging technique. More specifically, a simple rule-based (re) classifier is used to map each tag onto graph vertices and edges. As a result, a decomposition of textual documents is obtained where tokens are automatically parsed and attached to either a vertex or an edge. It is shown how textual documents can be aggregated through their graph-structures and finally, it is shown how vertex-ranking methods can be used to find relevant tokens.(1)
Activismo Feminista 2.0
Cada vez más mujeres se conectan, navegan y contribuyen en la construcción de internet, haciendo uso de sus servicios interactivos y abriendo espacios de comunicación y acción en la red con el fin de transformar el mundo. Las mujeres se están apropiando de la tecnología a través de las redes para ponerla como base de una sociedad más inclusiva e igualitaria. Redes que posibilitan la denuncia de las desigualdades, la organización de campañas que apelan a mejorar sus condiciones de vida, la creación de espacios de temáticas no visibilizadas y de interés común
Automatic Stance Detection Using End-to-End Memory Networks
We present a novel end-to-end memory network for stance detection, which
jointly (i) predicts whether a document agrees, disagrees, discusses or is
unrelated with respect to a given target claim, and also (ii) extracts snippets
of evidence for that prediction. The network operates at the paragraph level
and integrates convolutional and recurrent neural networks, as well as a
similarity matrix as part of the overall architecture. The experimental
evaluation on the Fake News Challenge dataset shows state-of-the-art
performance.Comment: NAACL-2018; Stance detection; Fact-Checking; Veracity; Memory
networks; Neural Networks; Distributed Representation
- …