Using a Probabilistic Neural Network for a Large Multi-label Problem

Abstract

The automation of the categorization of economic activ-ities from business descriptions in free text format is a huge challenge for the Brazilian governmental administration in the present day. When this problem is tackled by humans, the subjectivity on their classification brings another prob-lem: different human classifiers can give different results when working on a set of same business descriptions. This can cause a serious distortion on the information for the planning and taxation of the governmental administrations on the three levels: County, State and Federal. Further-more, the number of possible categories considered is very large, more than 1000 in the Brazilian scenario. The large number of categories makes the problem even harder to be solved, as this is also a multi-labeled problem. In this work we compared the multi-label lazy learning technique, ML-KNN, to our Probabilistic Neural Network approach. Our implementation overcome the ML-KNN algorithm in four metrics typically used in the literature for multi-label cate-gorization problems.

Similar works

Full text

thumbnail-image

CiteSeerX

redirect
Last time updated on 28/10/2017

This paper was published in CiteSeerX.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.