23 research outputs found
Fake news detection: a survey of evaluation datasets
Fake news detection has gained increasing importance among the research community due to the widespread diffusion of fake news through media platforms. Many dataset have been released in the last few years, aiming to assess the performance of fake news detection methods. In this survey, we systematically review twenty-seven popular datasets for fake news detection by providing insights into the characteristics of each dataset and comparative analysis among them. A fake news detection datasets characterization composed of eleven characteristics extracted from the surveyed datasets is provided, along with a set of requirements for comparing and building new datasets. Due to the ongoing interest in this research topic, the results of the analysis are valuable to many researchers to guide the selection or definition of suitable datasets for evaluating their fake news detection methods
Basic language learning in artificial animals
We explore a general architecture for artificial animals, or animats, that develops over time. The architecture combines reinforcementlearning, dynamic concept formation, and homeostatic decision-making aimed at need satisfaction. We show that thisarchitecture, which contains no ad hoc features for language processing, is capable of basic language learning of three kinds: (i)learning to reproduce phonemes that are perceived in the environment via motor babbling; (ii) learning to reproduce sequences ofphonemes corresponding to spoken words perceived in the environment; and (iii) learning to ground the semantics of spoken wordsin sensory experience by associating spoken words (e.g. the word “cold”) to sensory experience (e.g. the activity of a sensor forcold temperature) and vice versa
Maintaining regularity and generalization in data using the minimum description length principle and genetic algorithm: case of grammatical inference
In this paper, a genetic algorithm with minimum description length (GAWMDL) is proposed for grammatical inference. The primary challenge of identifying a language of infinite cardinality from a finite set of examples should know when to generalize and specialize the training data. The minimum description length principle that has been incorporated addresses this issue is discussed in this paper. Previously, the e-GRIDS learning model was proposed, which enjoyed the merits of the minimum description length principle, but it is limited to positive examples only. The proposed GAWMDL, which incorporates a traditional genetic algorithm and has a powerful global exploration capability that can exploit an optimum offspring. This is an effective approach to handle a problem which has a large search space such the grammatical inference problem. The computational capability, the genetic algorithm poses is not questionable, but it still suffers from premature convergence mainly arising due to lack of population diversity. The proposed GAWMDL incorporates a bit mask oriented data structure that performs the reproduction operations, creating the mask, then Boolean based procedure is applied to create an offspring in a generative manner. The Boolean based procedure is capable of introducing diversity into the population, hence alleviating premature convergence. The proposed GAWMDL is applied in the context free as well as regular languages of varying complexities. The computational experiments show that the GAWMDL finds an optimal or close-to-optimal grammar. Two fold performance analysis have been performed. First, the GAWMDL has been evaluated against the elite mating pool genetic algorithm which was proposed to introduce diversity and to address premature convergence. GAWMDL is also tested against the improved tabular representation algorithm. In addition, the authors evaluate the performance of the GAWMDL against a genetic algorithm not using the minimum description length principle. Statistical tests demonstrate the superiority of the proposed algorithm. Overall, the proposed GAWMDL algorithm greatly improves the performance in three main aspects: maintains regularity of the data, alleviates premature convergence and is capable in grammatical inference from both positive and negative corpora
Recommended from our members
Ontology-based end-user visual query formulation: Why, what, who, how, and which?
Value creation in an organisation is a time-sensitive and data-intensive process, yet it is often delayed and bounded by the reliance on IT experts extracting data for domain experts. Hence, there is a need for providing people who are not professional developers with the flexibility to pose relatively complex and ad hoc queries in an easy and intuitive way. In this respect, visual methods for query formulation undertake the challenge of making querying independent of users’ technical skills and the knowledge of the underlying textual query language and the structure of data. An ontology is more promising than the logical schema of the underlying data for guiding users in formulating queries, since it provides a richer vocabulary closer to the users’ understanding. However, on the one hand, today the most of world’s enterprise data reside in relational databases rather than triple stores, and on the other, visual query formulation has become more compelling due to ever-increasing data size and complexity—known as Big Data. This article presents and argues for ontology-based visual query formulation for end-users; discusses its feasibility in terms of ontology-based data access, which virtualises legacy relational databases as RDF, and the dimensions of Big Data; presents key conceptual aspects and dimensions, challenges, and requirements; and reviews, categorises, and discusses notable approaches and systems
Preserving Local Food Traditions: A Hybrid Participatory Approach for Stimulating Transgenerational Dialogue
Local food traditions are an essential part of culture and society, reflecting a community’s history, values, and beliefs. Elders play a major role in passing on local food knowledge to younger generations, ensuring local food traditions and cultural identity do not disappear over time. To preserve these traditions, it is essential to engage older and younger generations of a community in a transgenerational dialogue. From this perspective, the study utilizes a hybrid participatory approach, composed of design thinking and learning-by-doing. Results of the case study underline the effectiveness of the approach in stimulating both the transfer of knowledge, as well as the involvement of younger generations, in the preservation of local food traditions
Computational Models of Language Evolution: Challenges and Future Perspectives
This paper provides an analysis of the trends of the scientific production of language evolution models discussing the current developments and outlines the most promising future perspectives of this research field. A hybrid evaluation methodology has been applied in this study that integrates bibliometric and social research techniques to gain both quantitative and qualitative evidence of the research impact of language evolution models. Due to the ongoing interest in this research topic, the results of the analysis are valuable to many researchers to reveal the developments in the field and to plan future research directions
Query Processing of Geosocial Data in Location-Based Social Networks
The increasing use of social media and the recent advances in geo-positioning technologies have produced a great amount of geosocial data, consisting of spatial, textual, and social information, to be managed and queried. In this paper, we focus on the issue of query processing by providing a systematic literature review of geosocial data representations, query processing methods, and evaluation approaches published over the last two decades (2000–2020). The result of our analysis shows the categories of geosocial queries proposed by the surveyed studies, the query primitives and the kind of access method used to retrieve the result of the queries, the common evaluation metrics and datasets used to evaluate the performance of the query processing methods, and the main open challenges that should be faced in the near future. Due to the ongoing interest in this research topic, the results of this survey are valuable to many researchers and practitioners by gaining an in-depth understanding of the geosocial querying process and its applications and possible future perspectives
Detecting Deceptive Behaviours through Facial Cues from Videos: A Systematic Review
Interest in detecting deceptive behaviours by various application fields, such as security systems, political debates, advanced intelligent user interfaces, etc., makes automatic deception detection an active research topic. This interest has stimulated the development of many deception-detection methods in the literature in recent years. This work systematically reviews the literature focused on facial cues of deception. The most relevant methods applied in the literature of the last decade have been surveyed and classified according to the main steps of the facial-deception-detection process (video pre-processing, facial feature extraction, and decision making). Moreover, datasets used for the evaluation and future research directions have also been analysed