49 research outputs found

    Topological Feature Based Classification

    Full text link
    There has been a lot of interest in developing algorithms to extract clusters or communities from networks. This work proposes a method, based on blockmodelling, for leveraging communities and other topological features for use in a predictive classification task. Motivated by the issues faced by the field of community detection and inspired by recent advances in Bayesian topic modelling, the presented model automatically discovers topological features relevant to a given classification task. In this way, rather than attempting to identify some universal best set of clusters for an undefined goal, the aim is to find the best set of clusters for a particular purpose. Using this method, topological features can be validated and assessed within a given context by their predictive performance. The proposed model differs from other relational and semi-supervised learning models as it identifies topological features to explain the classification decision. In a demonstration on a number of real networks the predictive capability of the topological features are shown to rival the performance of content based relational learners. Additionally, the model is shown to outperform graph-based semi-supervised methods on directed and approximately bipartite networks.Comment: Awarded 3rd Best Student Paper at 14th International Conference on Information Fusion 201

    A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces

    Get PDF
    Mutemi, A., & Bacao, F. (2023). A numeric-based machine learning design for detecting organized retail fraud in digital marketplaces. Scientific Reports, 13(1), 1-16. [12499]. https://doi.org/10.1038/s41598-023-38304-5Organized retail crime (ORC) is a significant issue for retailers, marketplace platforms, and consumers. Its prevalence and influence have increased fast in lockstep with the expansion of online commerce, digital devices, and communication platforms. Today, it is a costly affair, wreaking havoc on enterprises’ overall revenues and continually jeopardizing community security. These negative consequences are set to rocket to unprecedented heights as more people and devices connect to the Internet. Detecting and responding to these terrible acts as early as possible is critical for protecting consumers and businesses while also keeping an eye on rising patterns and fraud. The issue of detecting fraud in general has been studied widely, especially in financial services, but studies focusing on organized retail crimes are extremely rare in literature. To contribute to the knowledge base in this area, we present a scalable machine learning strategy for detecting and isolating ORC listings on a prominent marketplace platform by merchants committing organized retail crimes or fraud. We employ a supervised learning approach to classify postings as fraudulent or real based on past data from buyer and seller behaviors and transactions on the platform. The proposed framework combines bespoke data preprocessing procedures, feature selection methods, and state-of-the-art class asymmetry resolution techniques to search for aligned classification algorithms capable of discriminating between fraudulent and legitimate listings in this context. Our best detection model obtains a recall score of 0.97 on the holdout set and 0.94 on the out-of-sample testing data set. We achieve these results based on a select set of 45 features out of 58.publishersversionpublishe

    Detecting Fraudsters in Online Auction Using Variations of Neighbor Diversity

    Get PDF
    Inflated reputation fraud is a serious problem in online auction. Recent work suggested that neighbor diversity is an effective feature for discerning fraudsters from normal users. However, there exist many different methods to quantify diversity in the literature. This raises the problem of finding the most suitable method to calculate neighbor diversity for detecting fraudsters. We collect four different methods to quantify diversity, and apply them to calculate neighbor diversity. We then use these various neighbor diversities for fraudster detection. Experimental results on a real-world dataset demonstrate that, although these diversities were calculated differently, their performances on fraudster detection are similar. This finding reflects the robustness of neighbor diversity, regardless of how the diversity is calculated

    Developing Effective Fraud Detection Methods for Online Auction

    Get PDF
    [[abstract]]The past decade has witnessed the rapid growth of online auctions. However, the low cost and anonymity in joining online auctions provided an easy path for fraudsters. The simple binary reputation system promoted by the auction site is clearly not enough to protect consumers from fraud. In view of this, many fraud detection methods have been proposed. Nevertheless, there are still many weaknesses needed to be improved. To help secure the online trading environment, this study aims at developing more effective methods to identify the fraudsters in online auctions. First, a novel selection method is proposed for deriving a concise attribute set used to build efficient detection models, which allow a reduction in detection costs while improving detection accuracy. In addition, a two-stage detection procedure is proposed wherein multiple mutual-complement models are combined for promoting overall detection accuracy. To evaluate the proposed methods, actual auction transaction histories were collected for testing. The experimental results show that these methods can outperform those in the previous work.[[notice]]補正完
    corecore