35 research outputs found
Improving classifications for cardiac autonomic neuropathy using multi-level ensemble classifiers and feature selection based on random forest
This paper is devoted to empirical investigation of novel multi-level ensemble meta classifiers for the detection and monitoring of progression of cardiac autonomic neuropathy, CAN, in diabetes patients. Our experiments relied on an extensive database and concentrated on ensembles of ensembles, or multi-level meta classifiers, for the classification of cardiac autonomic neuropathy progression. First, we carried out a thorough investigation comparing the performance of various base classifiers for several known sets of the most essential features in this database and determined that Random Forest significantly and consistently outperforms all other base classifiers in this new application. Second, we used feature selection and ranking implemented in Random Forest. It was able to identify a new set of features, which has turned out better than all other sets considered for this large and well-known database previously. Random Forest remained the very best classier for the new set of features too. Third, we investigated meta classifiers and new multi-level meta classifiers based on Random Forest, which have improved its performance. The results obtained show that novel multi-level meta classifiers achieved further improvement and obtained new outcomes that are significantly better compared with the outcomes published in the literature previously for cardiac autonomic neuropathy
A survey of state-of-the-art methods for securing medical databases
This review article presents a survey of recent work devoted to advanced state-of-the-art methods for securing of medical databases. We concentrate on three main directions, which have received attention recently: attribute-based encryption for enabling secure access to confidential medical databases distributed among several data centers; homomorphic encryption for providing answers to confidential queries in a secure manner; and privacy-preserving data mining used to analyze data stored in medical databases for verifying hypotheses and discovering trends. Only the most recent and significant work has been included
Centroid sets with largest weight in Munn semirings for data mining applications
Our main results show that Munn semirings over idempotent semifields possess
more convenient properties than the Munn rings over fields. First, we describe all
centroid sets that can be generated as ideals of the largest weight in Munn semirings
over idempotent semifields. Second, we handle the more general case of all onesided
ideals too. The multiplication in the Munn semirings is not commutative and
the family of arbitrary one-sided ideals is larger than that of two-sided ideals. It is essential to consider all ideals not only in order
to develop theoretical foundations, but also since the larger set of ideals may lead
to design of classification and clustering systems with better properties. Our main
theorem describes all ideals and one-sided ideals with the largest weight in Munn
semirings over idempotent semifields
Classification systems based on combinatorial semigroups
The present article continues the investigation of constructions essential for applications of combinatorial semigroups to the design of multiple classification systems in data mining. Our main theorem gives a complete description of all optimal classification systems defined by one-sided ideals in a construction based on combinatorial Rees matrix semigroups. It strengthens and generalizes previous results, which handled the more narrow case of two-sided ideals. © 2012 Springer Science+Business Media New York