649 research outputs found
DYNAMICAL LINK METRIC ADJUSTMENT USING CLASSIFICATION AND REGRESSION TREE (CART) AND SOFTWARE DEFINED NETWORKING (SDN) TECHNOLOGIES
Presented herein are techniques to dynamically switch between different link metric algorithms based on Classification And Regression Tree (CART) and Software Defined Networking (SDN) technologies
Prediction model for coronary artery disease using neural networks and feature selection based on classification and regression tree
Background and aims: Risk of implementing invasive diagnostic procedures for coronary artery disease (CAD) such as angiography is considerable. On the other hand, Successful experience has been achieved in medical data mining approaches. Therefore this study has been done to produce a model based on data mining techniques of neural networks that can predict coronary artery disease. Methods: In this descriptive- analytical study, the data set includes nine risk factors of 13228 participants who were undergone angiography at Tehran Heart Center. (4059 participants were not suffering from CAD but 9169 were suffering from CAD). Producing model for predicting coronary artery disease was done based on multilayer perceptron neural networks and variable selection based on classification and regression tree (CART) using of Statistica software. For comparison and selection of best model, the ROC curve analysis was used. Results: After seven-time modeling and comparing the generated models, the final model consists of all existing risk factors obtained with the area under ROC curve of 0.754, accuracy of 74.19%, sensitivity of 92.41% and specificity of 33.25% .Also, variable selection results in producing a model consists of four risk factors with area under ROC curve of 0.737, accuracy of 74.19%, sensitivity of 93.34% and specificity of 31.17% was produced. Conclusion: The obtained model is produced based on neural networks. The model is able to identify both high risk patients and acceptable number of healthy subjects. Also, utilizing the feature selection in this study ends up in production of a model which consists of only four risk factors as: age, sex, diabetes and high blood pressure
Bridging the Gap Between the Least and the Most Influential Twitter Users
Social networks play an increasingly important role in shaping the behaviour of users of the Web. Conceivably Twitter stands out from the others, not only for the platform's simplicity but also for the great influence that the messages sent over the network can have. The impact of such messages determines the influence of a Twitter user and is what tools such as Klout, PeerIndex or TwitterGrader aim to calculate. Reducing all the factors that make a person influential into a single number is not an easy task, and the effort involved could become useless if the Twitter users do not know how to improve it. In this paper we identify what specific actions should be carried out for a Twitterer to increase their influence in each of above-mentioned tools applying, for this purpose, data mining techniques based on classification and regression algorithms to the information collected from a set of Twitter users.This work has been partially founded by the European Commission Project ”SiSOB: An Observatorium for Science
in Society based in Social Models” (http://sisob.lcc.uma.es) (Contract no.: FP7 266588), ”Sistemas Inalámbricos
de Gestión de Información Crítica” (with code number TIN2011-23795 and granted by the MEC, Spain) and ”3DTUTOR:
Sistema Interoperable de Asistencia y Tutoría Virtual e Inteligente 3D” (with code number IPT-2011-0889-
900000 and granted by the MINECO, Spain
Analiza środowiskowych uwarunkowań lokalizacji osadnictwa ze starszej i środkowej epoki kamienia na Pojezierzu Lubuskim. Zastosowanie drzew klasyfikacyjno-regresyjnych
The article presents results of the study of environmental variables influencing location of Palaeolithic and Mesolithic settlement in Pojezierze Lubuskie. A new method of predicting archaeological sites location has been suggested, based on classification and regression trees (CART), including nine natural variables. A detailed model has been designed illustrating dependency between these natural variables and intensity of prehistoric activity. The study, results have been also presented on the map, which systematisies and presents, in a coherent manner, a model of human presence in Palaeolithic and esolithic periods
Database Analysis to Support Nutrient Criteria Development (Phase II)
The intent of this publication of the Arkansas Water Resources Center is to provide a location whereby a final report on water research to a funding agency can be archived. The Texas Commission on Environmental Quality (TCEQ) contracted with University of Arkansas researchers for a multiple year project titled “Database Analysis to Support Nutrient Criteria Development”. This publication covers the second of three phases of that project and has maintained the original format of the report as submitted to TCEQ. This report can be cited either as an AWRC publication (see below) or directly as the final report to TCEQ
Learning Heterogeneous Similarity Measures for Hybrid-Recommendations in Meta-Mining
The notion of meta-mining has appeared recently and extends the traditional
meta-learning in two ways. First it does not learn meta-models that provide
support only for the learning algorithm selection task but ones that support
the whole data-mining process. In addition it abandons the so called black-box
approach to algorithm description followed in meta-learning. Now in addition to
the datasets, algorithms also have descriptors, workflows as well. For the
latter two these descriptions are semantic, describing properties of the
algorithms. With the availability of descriptors both for datasets and data
mining workflows the traditional modelling techniques followed in
meta-learning, typically based on classification and regression algorithms, are
no longer appropriate. Instead we are faced with a problem the nature of which
is much more similar to the problems that appear in recommendation systems. The
most important meta-mining requirements are that suggestions should use only
datasets and workflows descriptors and the cold-start problem, e.g. providing
workflow suggestions for new datasets.
In this paper we take a different view on the meta-mining modelling problem
and treat it as a recommender problem. In order to account for the meta-mining
specificities we derive a novel metric-based-learning recommender approach. Our
method learns two homogeneous metrics, one in the dataset and one in the
workflow space, and a heterogeneous one in the dataset-workflow space. All
learned metrics reflect similarities established from the dataset-workflow
preference matrix. We demonstrate our method on meta-mining over biological
(microarray datasets) problems. The application of our method is not limited to
the meta-mining problem, its formulations is general enough so that it can be
applied on problems with similar requirements
- …