Search CORE

6 research outputs found

Distance Functions and Normalization Under Stream Scenarios

Author: Barboza Eduardo V. L.
Britto Jr Alceu de Souza
Cruz Rafael M. O.
de Almeida Paulo R. Lisboa
Publication venue
Publication date: 04/07/2023
Field of study

Data normalization is an essential task when modeling a classification system. When dealing with data streams, data normalization becomes especially challenging since we may not know in advance the properties of the features, such as their minimum/maximum values, and these properties may change over time. We compare the accuracies generated by eight well-known distance functions in data streams without normalization, normalized considering the statistics of the first batch of data received, and considering the previous batch received. We argue that experimental protocols for streams that consider the full stream as normalized are unrealistic and can lead to biased and poor results. Our results indicate that using the original data stream without applying normalization, and the Canberra distance, can be a good combination when no information about the data stream is known beforehand.Comment: Paper accepted to the 2023 International Joint Conference on Neural Network

arXiv.org e-Print Archive

An Implicit Segmentation-based Method for Recognition of

Author: Alceu De
Flávio Bortolozzi
Luiz E Soares Oliveira
Paulo Rodrigo Cavalin
Robert Sabouriń
Souza Britto Jr
Publication venue
Publication date: 03/04/2020
Field of study

ABSTRACT This paper describes an implicit segmentation-based method for recognition of strings of characters (words or numerals). In a two-stage HMM-based method, an implicit segmentation is applied to segment either words or numeral strings, and in the verication stage, foreground and background features are combined to compensate the loss in terms of recognition rate when segmentation and recognition are performed in the same process. A rigorous experimental protocol shows the performance of the proposed method for isolated characters, numeral strings, and words

CiteSeerX