Text classification supervised algorithms with term frequency inverse document frequency and global vectors for word representation: a comparative study

Authors: Bahassine Said
Benabbes Khalid
Hamou Aadi Fatima Zahrae Ait
Housni Khalid
Labd Zakia
Publication venue: Institute of Advanced Engineering and Science
Publication date: 01/02/2024

Over the past two decades, the quantity of text documents stored digitally has risen. Text categorization is the automated organization of those documents: they are classified into a set of predefined categories so that they can be preserved and sorted more efficiently. Identifying appropriate structures, architectures, and methods for text classification remains a challenge for researchers because of the significant impact the task has on content management, contextual search, opinion mining, product review analysis, spam filtering, and text sentiment mining. This study analyzes the generic categorization strategy and examines supervised machine learning approaches and their ability to learn complex models and nonlinear data interactions. These methods include k-nearest neighbors (KNN), support vector machine (SVM), and ensemble learning algorithms, assessed with various evaluation techniques. The study then evaluates the constraints of each technique and how each can be applied to real-life situations.
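
As a rough illustration of the kind of pipeline the abstract refers to, the sketch below pairs term frequency-inverse document frequency (TF-IDF) features with a k-nearest-neighbors vote, using only the Python standard library. It is not the authors' implementation: the tiny corpus, the labels, the whitespace tokenizer, the smoothed IDF variant, and all function names are assumptions made for the example.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase + whitespace split stands in for real preprocessing.
    return text.lower().split()


def fit_idf(docs):
    # Smoothed IDF, log((1 + N) / (1 + df)) + 1: one common variant among several.
    n_docs = len(docs)
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(tokenize(doc)))
    return {t: math.log((1 + n_docs) / (1 + df)) + 1.0 for t, df in doc_freq.items()}


def tfidf_vector(text, idf):
    # Sparse TF-IDF vector (dict), L2-normalized; unseen terms are ignored.
    counts = Counter(tokenize(text))
    vec = {t: c * idf[t] for t, c in counts.items() if t in idf}
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}


def cosine(a, b):
    # Both vectors are unit length, so the dot product is the cosine similarity.
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def knn_predict(text, train_vecs, train_labels, idf, k=3):
    # Majority vote among the k training documents most similar to the query.
    query = tfidf_vector(text, idf)
    ranked = sorted(
        zip(train_vecs, train_labels),
        key=lambda pair: cosine(query, pair[0]),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy corpus, invented for illustration only.
train_docs = [
    "win a free prize now",
    "free money claim your prize",
    "limited offer win cash now",
    "meeting agenda for monday",
    "please review the project report",
    "agenda and report for the team meeting",
]
train_labels = ["spam", "spam", "spam", "ham", "ham", "ham"]

idf = fit_idf(train_docs)
train_vecs = [tfidf_vector(doc, idf) for doc in train_docs]

print(knn_predict("claim your free cash prize", train_vecs, train_labels, idf))
print(knn_predict("team meeting report on monday", train_vecs, train_labels, idf))
```

In practice a library such as scikit-learn would replace the hand-written vectorizer and classifier, and the same TF-IDF features could be fed to an SVM or an ensemble model for comparison.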