11,642 research outputs found
Certainty of outlier and boundary points processing in data mining
Data certainty is one of the issues in the real-world applications which is
caused by unwanted noise in data. Recently, more attentions have been paid to
overcome this problem. We proposed a new method based on neutrosophic set (NS)
theory to detect boundary and outlier points as challenging points in
clustering methods. Generally, firstly, a certainty value is assigned to data
points based on the proposed definition in NS. Then, certainty set is presented
for the proposed cost function in NS domain by considering a set of main
clusters and noise cluster. After that, the proposed cost function is minimized
by gradient descent method. Data points are clustered based on their membership
degrees. Outlier points are assigned to noise cluster and boundary points are
assigned to main clusters with almost same membership degrees. To show the
effectiveness of the proposed method, two types of datasets including 3
datasets in Scatter type and 4 datasets in UCI type are used. Results
demonstrate that the proposed cost function handles boundary and outlier points
with more accurate membership degrees and outperforms existing state of the art
clustering methods.Comment: Conference Paper, 6 page
Recommended from our members
A conceptual design tool: Sketch and fuzzy logic based system
A real time sketch and fuzzy logic based prototype system for conceptual design has been developed. This system comprises four phases. In the first one, the system accepts the input of on-line free-hand sketches, and segments them into meaningful parts by using fuzzy knowledge to detect corners and inflection points on the sketched curves. The fuzzy knowledge is applied to capture userâs drawing intention in terms of sketching position, direction, speed and acceleration. During the second phase, each segmented sub-part (curve) can be classified and identified as one of the following 2D primitives: straight lines, circles, circular arcs, ellipses, elliptical arcs or B-spline curves. Then, 2D topology information (connectivity, unitary constraints and pairwise constraints) is extracted dynamically from the identified 2D primitives. From the extracted information, a more accurate 2D geometry can be built up by a 2D geometric constraint solver. The 2D topology and geometry information is then employed to further interpretation of a 3D geometry. The system can not only accept sketched input, but also usersâ interactive input of 2D and 3D primitives.
This makes it friendly and easier to use, in comparison with âsketched input onlyâ, or âinteractive input onlyâ systems.
Finally, examples are given to illustrate the system
A Convolutional Neural Network model based on Neutrosophy for Noisy Speech Recognition
Convolutional neural networks are sensitive to unknown noisy condition in the
test phase and so their performance degrades for the noisy data classification
task including noisy speech recognition. In this research, a new convolutional
neural network (CNN) model with data uncertainty handling; referred as NCNN
(Neutrosophic Convolutional Neural Network); is proposed for classification
task. Here, speech signals are used as input data and their noise is modeled as
uncertainty. In this task, using speech spectrogram, a definition of
uncertainty is proposed in neutrosophic (NS) domain. Uncertainty is computed
for each Time-frequency point of speech spectrogram as like a pixel. Therefore,
uncertainty matrix with the same size of spectrogram is created in NS domain.
In the next step, a two parallel paths CNN classification model is proposed.
Speech spectrogram is used as input of the first path and uncertainty matrix
for the second path. The outputs of two paths are combined to compute the final
output of the classifier. To show the effectiveness of the proposed method, it
has been compared with conventional CNN on the isolated words of Aurora2
dataset. The proposed method achieves the average accuracy of 85.96 in noisy
train data. It is more robust against Car, Airport and Subway noises with
accuracies 90, 88 and 81 in test sets A, B and C, respectively. Results show
that the proposed method outperforms conventional CNN with the improvement of
6, 5 and 2 percentage in test set A, test set B and test sets C, respectively.
It means that the proposed method is more robust against noisy data and handle
these data effectively.Comment: International conference on Pattern Recognition and Image Analysis
(IPRIA 2019
- âŠ