356 research outputs found

Computing with Granular Words

Author: Hou Hailong
Publication venue: ScholarWorks @ Georgia State University
Publication date: 07/05/2011

Computational linguistics is a sub-field of artificial intelligence; it is an interdisciplinary field dealing with statistical and/or rule-based modeling of natural language from a computational perspective. Traditionally, fuzzy logic is used to deal with fuzziness among single linguistic terms in documents. However, linguistic terms may be related to other types of uncertainty. For instance, when different users search for ‘cheap hotel’ in a search engine, they may need distinct pieces of relevant hidden information such as shopping, transportation, weather, etc. Therefore, this research work focuses on studying granular words and developing new algorithms to process them to deal with uncertainty globally. To precisely describe the granular words, a new structure called Granular Information Hyper Tree (GIHT) is constructed. Furthermore, several technologies are developed to apply computing with granular words to spam filtering and query recommendation. Based on simulation results, the GIHT-Bayesian algorithm achieves more accurate spam filtering than the conventional Naive Bayesian and SVM methods; computing with granular words also generates better recommendation results, based on users’ assessment, when applied to a search engine.

ScholarWorks @ Georgia State University

Bibliometric Survey on Incremental Learning in Text Classification Algorithms for False Information Detection

Author: Barve Yashoda Narayanprasad, Mrs.
Mulay Preeti, Dr.
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 23/11/2020

False information, or misinformation, on the web has severe effects on people, business and society as a whole. Therefore, detection of misinformation has become a topic of research among many researchers. Detecting misinformation in textual articles is directly connected to the text classification problem. With the massive and dynamic generation of unstructured textual documents over the web, incremental learning in text classification has gained more popularity. This survey explores recent advancements in incremental learning in text classification, reviews the research publications of the area from the Scopus, Web of Science, Google Scholar, and IEEE databases, and performs quantitative analysis using methods such as publication statistics, collaboration degree, research network analysis, and citation analysis. This study gives researchers insight into the latest status of research on incremental learning in text classification through a literature survey, and helps them learn about the various applications and techniques used recently in the field.

DigitalCommons@University of Nebraska

A Comprehensive Survey of Data Mining-based Fraud Detection Research

Author: Agrawal
Au
Berry
Brentnall
Chen
Chiang Wang
David C. Yen
Feelders
Han
Hayhoe
Kirkosa
Ku
Leonard
Mitchell
Ngai
Quah
Rothman
Shaw
Shing-Han Li
Song
Sudjianto
Titus
Wen-Hui Lu
White
Publication venue: 'Elsevier BV'
Publication date: 30/09/2010

This survey paper categorises, compares, and summarises almost all published technical and review articles in automated fraud detection within the last 10 years. It defines the professional fraudster, formalises the main types and subtypes of known fraud, and presents the nature of data evidence collected within affected industries. Within the business context of mining the data to achieve higher cost savings, this research presents methods and techniques together with their problems. Compared to all related reviews on fraud detection, this survey covers many more technical articles and is the only one, to the best of our knowledge, which proposes alternative data and solutions from related domains. Comment: 14 pages

arXiv.org e-Print Archive

Crossref

Email Filtering Using Hybrid Feature Selection Model

Author: Alwada'n Tariq
Mohammad Adel Hamdan
Smadi Sami
Publication venue: 'Computers, Materials and Continua (Tech Science Press)'
Publication date: 23/03/2022

Teesside University's Research Repository

Automatic domain ontology extraction for context-sensitive opinion mining

Author: Lai Chapmann C.L.
Lau Raymond Y.K.
Li Yuefeng
Ma Jian
Publication venue
Publication date: 01/01/2009

Automated analysis of the sentiments presented in online consumer feedback can facilitate both organizations’ business strategy development and individual consumers’ comparison shopping. Nevertheless, existing opinion mining methods either adopt a context-free sentiment classification approach or rely on a large number of manually annotated training examples to perform context-sensitive sentiment classification. Guided by the design science research methodology, we illustrate the design, development, and evaluation of a novel fuzzy domain ontology based context-sensitive opinion mining system. Our novel ontology extraction mechanism, underpinned by a variant of Kullback-Leibler divergence, can automatically acquire contextual sentiment knowledge across various product domains to improve the sentiment analysis processes. Evaluated on a benchmark dataset and real consumer reviews collected from Amazon.com, our system shows remarkable performance improvement over the context-free baseline.

Queensland University of Technology ePrints Archive

AIS Electronic Library (AISeL)

Deficient data classification with fuzzy learning

Author: Liu Shigang
Publication venue: Deakin University, Faculty of Science, Engineering and Built Environment, School of Information Technology
Publication date: 01/02/2017

This thesis first proposes a novel algorithm for handling both missing values and imbalanced data classification problems. Then, algorithms for addressing the class imbalance problem in Twitter spam detection (a network security problem) are proposed. Finally, the security profile of SVM against deliberate attacks is simulated and analysed.

Deakin Research Online

Fuzzy Rough Positive Region based Nearest Neighbour Classification

Author: Cornelis Chris
Jensen Richard
Verbiest Nele
Publication venue: IEEE Press
Publication date: 01/01/2012

This paper proposes a classifier that uses fuzzy rough set theory to improve the Fuzzy Nearest Neighbour (FNN) classifier. We show that previous attempts to use fuzzy rough set theory to improve the FNN algorithm have some shortcomings and we overcome them by using the fuzzy positive region to measure the quality of the nearest neighbours in the FNN classifier. A preliminary experimental evaluation shows that the new approach generally improves upon existing methods.

CiteSeerX

Crossref

Aberystwyth Research Portal

ZENODO

Ghent University Academic Bibliography

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Smart Substation Network Fault Classification Based on a Hybrid Optimization Algorithm

Author: Liu Xiaofeng
Lou Jichao
Xia Xin
Publication venue: Electronics and Telecommunications Committee
Publication date: 01/01/2019

Accurate network fault diagnosis in smart substations is key to strengthening grid security. To solve fault classification problems and enhance classification accuracy, we propose a hybrid optimization algorithm consisting of three parts: anti-noise processing (ANP), an improved separation interval method (ISIM), and a genetic algorithm-particle swarm optimization (GA-PSO) method. ANP cleans out the outliers and noise in the dataset. ISIM uses a support vector machine (SVM) architecture to optimize SVM kernel parameters. Finally, we propose the GA-PSO algorithm, which combines the advantages of both genetic and particle swarm optimization algorithms to optimize the penalty parameter. The experimental results show that our proposed hybrid optimization algorithm enhances the classification accuracy of smart substation network faults and shows stronger performance compared with existing methods

Biblioteka Nauki - repozytorium artykułów

International Journal of Electronics and Telecommunications (Warsaw University of Technology)