Search CORE

884 research outputs found

Efficient and Trustworthy Review/Opinion Spam Detection

Author: Sanketi P. Raut, Prof. Chitra Wasnik
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 30/04/2017
Field of study

The most common mode for consumers to express their level of satisfaction with their purchases is through online ratings, which we can refer as Online Review System. Network analysis has recently gained a lot of attention because of the arrival and the increasing attractiveness of social sites, such as blogs, social networking applications, micro blogging, or customer review sites. The reviews are used by potential customers to find opinions of existing users before purchasing the products. Online review systems plays an important part in affecting consumers' actions and decision making, and therefore attracting many spammers to insert fake feedback or reviews in order to manipulate review content and ratings. Malicious users misuse the review website and post untrustworthy, low quality, or sometimes fake opinions, which are referred as Spam Reviews. In this study, we aim at providing an efficient method to identify spam reviews and to filter out the spam content with the dataset of gsmarena.com. Experiments on the dataset collected from gsmarena.com show that the proposed system achieves higher accuracy than the standard na?ve bayes

International Journal on Recent and Innovation Trends in Computing and Communication

Detecting child grooming behaviour patterns on social media

Author: C.-C. Chang
D. Bogdanova
H. Nijman
H. Whittle
J.K. Chambers
L.N. Olson
M.F. Porter
S. Argamon
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Online paedophile activity in social media has become a major concern in society as Internet access is easily available to a broader younger population. One common form of online child exploitation is child grooming, where adults and minors exchange sexual text and media via social media platforms. Such behaviour involves a number of stages performed by a predator (adult) with the final goal of approaching a victim (minor) in person. This paper presents a study of such online grooming stages from a machine learning perspective. We propose to characterise such stages by a series of features covering sentiment polarity, content, and psycho-linguistic and discourse patterns. Our experiments with online chatroom conversations show good results in automatically classifying chatlines into various grooming stages. Such a deeper understanding and tracking of predatory behaviour is vital for building robust systems for detecting grooming conversations and potential predators on social media

Crossref

Open Research Online (The Open University)

A Novel Contrastive Learning Method for Clickbait Detection on RoCliCo: A Romanian Clickbait Corpus of News Articles

Author: Broscoteanu Daria-Mihaela
Ionescu Radu Tudor
Publication venue
Publication date: 10/10/2023
Field of study

To increase revenue, news websites often resort to using deceptive news titles, luring users into clicking on the title and reading the full news. Clickbait detection is the task that aims to automatically detect this form of false advertisement and avoid wasting the precious time of online users. Despite the importance of the task, to the best of our knowledge, there is no publicly available clickbait corpus for the Romanian language. To this end, we introduce a novel Romanian Clickbait Corpus (RoCliCo) comprising 8,313 news samples which are manually annotated with clickbait and non-clickbait labels. Furthermore, we conduct experiments with four machine learning methods, ranging from handcrafted models to recurrent and transformer-based neural networks, to establish a line-up of competitive baselines. We also carry out experiments with a weighted voting ensemble. Among the considered baselines, we propose a novel BERT-based contrastive learning model that learns to encode news titles and contents into a deep metric space such that titles and contents of non-clickbait news have high cosine similarity, while titles and contents of clickbait news have low cosine similarity. Our data set and code to reproduce the baselines are publicly available for download at https://github.com/dariabroscoteanu/RoCliCo.Comment: Accepted at EMNLP 202

arXiv.org e-Print Archive

Software Agents for Electronic Marketplaces: Current and Future Research Directions

Author: Kanellis Panagiotis
Kontolemakis George
Martakos Drakoulis
Publication venue: AIS Electronic Library (AISeL)
Publication date: 01/04/2004
Field of study

The premise of software agents to define the structural and operational models of the virtual marketplace of the future can account for the increased interest regarding their application in areas where they can add substantial value in terms of automation and functionality. At the heart of such a marketplace rests an ontology modeling the domain upon which a nucleus of agent-based services can be constructed. Negotiation services hold the dominant position in terms of the attention they have received in research. Complementary to them, but no less important, are the advising services representing support functionality that is required throughout the cycle of a deal; from the expressed intention of the two parties to eventual maturity and closure. In this paper we focus on research trends and on their possible future development for ontologies and the above service categories emphasizing on the role of software agents in this context. A review and analysis of past and present works helps to formulate sets of questions that future research will seek to address

AIS Electronic Library (AISeL)

Unleashing the Potential of Argument Mining for IS Research: A Systematic Review and Research Agenda

Author: Söllner Matthias
Wambsganss Thiemo
Weingart Pablo
Publication venue: AIS Electronic Library (AISeL)
Publication date: 12/12/2022
Field of study

Argument mining (AM) represents the unique use of natural language processing (NLP) techniques to extract arguments from unstructured data automatically. Despite expanding on commonly used NLP techniques, such as sentiment analysis, AM has hardly been applied in information systems (IS) research yet. Consequentially, knowledge about the potentials for the usage of AM on IS use cases appears to be still limited. First, we introduce AM and its current usage in fields beyond IS. To address this research gap, we conducted a systematic literature review on IS literature to identify IS use cases that can potentially be extended with AM. We develop eleven text-based IS research topics that provide structure and context to the use cases and their AM potentials. Finally, we formulate a novel research agenda to guide both researchers and practitioners to design, compare and evaluate the use of AM for text-based applications and research streams in IS

AIS Electronic Library (AISeL)

Deception

Author: Sarkadi Stefan
Publication venue
Publication date: 01/05/2021
Field of study

King's Research Portal

Detection of spam-posting accounts on Twitter

Author: Inuwa-Dutse Ise
Korkontzelos Yannis
Liptrott Mark
Publication venue: 'Elsevier BV'
Publication date: 08/08/2018
Field of study

Edge Hill University Research Information Repository

Let’s lie together:Co-presence effects on children’s deceptive skills

Author: Swerts M.G.J.
Publication venue: The Association for Computational Linguistics
Publication date: 01/01/2012
Field of study

Tilburg University Repository

A Qualitative Analysis of Common Practices in Annotations: A Taxonomy and Design Space

Author: Doppalapudi Bhavana
Quadri Ghulam Jilani
Rahman Md Dilshadur
Rosen Paul
Szafir Danielle Albers
Publication venue
Publication date: 09/06/2023
Field of study

Annotations are a vital component of data externalization and collaborative analysis, directing readers' attention to important visual elements. Therefore, it is crucial to understand their design space for effectively annotating visualizations. However, despite their widespread use in visualization, we have identified a lack of a design space for common practices for annotations. In this paper, we present two studies that explore how people annotate visualizations to support effective communication. In the first study, we evaluate how visualization students annotate bar charts when answering high-level questions about the data. Qualitative coding of the resulting annotations generates a taxonomy comprising enclosure, connector, text, mark, and color, revealing how people leverage different visual elements to communicate critical information. We then extend our taxonomy by performing thematic coding on a diverse range of real-world annotated charts, adding trend and geometric annotations to the taxonomy. We then combine the results of these studies into a design space of annotations that focuses on the key elements driving the design choices available when annotating a chart, providing a reference guide for using annotations to communicate insights from visualizations

arXiv.org e-Print Archive