Topic Classification for Short Texts

Boroianu, Mihai Augustin; Grec, Mihai; Neagu, Dan Claudiu; Rus, Andrei Bogdan; Silaghi, Gheorghe Cosmin

Publication date: 19 September 2022
Publisher: AIS Electronic Library (AISeL)

Abstract

In the context of TV and social media surveillance, constructing models to automate topic identification of short texts is a key task. This paper formalizes topic classification as a top-K multinomial classification problem and constructs models worth considering for practical use. We describe the full data processing pipeline, discussing dataset selection, text preprocessing, feature extraction, model selection and learning, including hyperparameter optimization. When computing time and resources are limited, we show that a classical model like SVM performs as well as an advanced deep neural network, but with shorter model training time.
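
The top-K formulation mentioned in the abstract can be illustrated with a minimal sketch. It assumes that a prediction counts as correct when the true topic appears among the K highest-scoring topics; the abstract does not give the paper's exact definition, so this is one plausible reading rather than the authors' metric. The function names and the topic labels in the usage note are hypothetical.

```python
from typing import Dict, List, Sequence


def top_k_labels(scores: Dict[str, float], k: int) -> List[str]:
    """Return the k topic labels with the highest scores, best first."""
    if k < 1:
        raise ValueError("k must be at least 1")
    # Sort by descending score; ties are broken alphabetically so the
    # result is deterministic.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [label for label, _ in ranked[:k]]


def top_k_accuracy(
    all_scores: Sequence[Dict[str, float]],
    true_labels: Sequence[str],
    k: int,
) -> float:
    """Fraction of texts whose true topic is among the k best-scored topics.

    Assumption: this is the evaluation criterion implied by "top-K
    multinomial classification"; the source abstract does not spell it out.
    """
    if len(all_scores) != len(true_labels):
        raise ValueError("scores and labels must have the same length")
    if not true_labels:
        raise ValueError("at least one example is required")
    hits = sum(
        1
        for scores, truth in zip(all_scores, true_labels)
        if truth in top_k_labels(scores, k)
    )
    return hits / len(true_labels)
```

For example, given per-topic scores for two short texts over the hypothetical topics "sports", "politics", and "health", calling `top_k_accuracy(scores, labels, k=2)` credits the classifier whenever the true topic is ranked first or second. The scores could come from any model that ranks topics, such as SVM decision values or neural network probabilities.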

