8 research outputs found
A survey on utilization of data mining approaches for dermatological (skin) diseases prediction
Due to recent technology advances, large volumes of medical data is obtained. These data contain valuable information. Therefore data mining techniques can be used to extract useful patterns. This paper is intended to introduce data mining and its various techniques and a survey of the available literature on medical data mining. We emphasize mainly on the application of data mining on skin diseases. A categorization has been provided based on the different data mining techniques. The utility of the various data mining methodologies is highlighted. Generally association mining is suitable for extracting rules. It has been used especially in cancer diagnosis. Classification is a robust method in medical mining. In this paper, we have summarized the different uses of classification in dermatology. It is one of the most important methods for diagnosis of erythemato-squamous diseases. There are different methods like Neural Networks, Genetic Algorithms and fuzzy classifiaction in this topic. Clustering is a useful method in medical images mining. The purpose of clustering techniques is to find a structure for the given data by finding similarities between data according to data characteristics. Clustering has some applications in dermatology. Besides introducing different mining methods, we have investigated some challenges which exist in mining skin data
A survey of outlier detection methodologies
Outlier detection has been used for centuries to detect and, where appropriate, remove anomalous observations from data. Outliers arise due to mechanical faults, changes in system behaviour, fraudulent behaviour, human error, instrument error or simply through natural deviations in populations. Their detection can identify system faults and fraud before they escalate with potentially catastrophic consequences. It can identify errors and remove their contaminating effect on the data set and as such to purify the data for processing. The original outlier detection methods were arbitrary but now, principled and systematic techniques are used, drawn from the full gamut of Computer Science and Statistics. In this paper, we introduce a survey of contemporary techniques for outlier detection. We identify their respective motivations and distinguish their advantages and disadvantages in a comparative review
3D data visualization techniques and applications for visual multidimensional data mining
2012 - 2013Despite modern technology provide new tools to measure the world around
us, we are quickly generating massive amounts of high-dimensional, spatialtemporal
data. In this work, I deal with two types of datasets: one in which
the spatial characteristics are relatively dynamic and the data are sampled
at different periods of time, and the other where many dimensions prevail,
although the spatial characteristics are relatively static.
The first dataset refers to a peculiar aspect of uncertainty arising from
contractual relationships that regulate a project execution: the dispute
management. In recent years there has been a growth in size and complexity
of the projects managed by public or private organizations. This leads to
increased probability of project failures, frequently due to the difficulty and
the ability to achieve the objectives such as on-time delivery, cost containment,
expected quality achievement. In particular, one of the most common causes
of project failure is the very high degree of uncertainty that affects the
expected performance of the project, especially when different stakeholders
with divergent aims and goals are involved in the project...[edited by author]XII n.s
Deep Learning based Novel Anomaly Detection Methods for Diabetic Retinopathy Screening
Programa Oficial de Doutoramento en Computación. 5009V01[Abstract] Computer-Aided Screening (CAS) systems are getting popularity in disease diagnosis. Modern CAS systems exploit data driven machine learning algorithms including supervised and unsupervised methods.
In medical imaging, annotating pathological samples are much harder and time consuming work than healthy samples. Therefore, there is always an abundance of healthy samples and scarcity of annotated and labelled pathological samples. Unsupervised anomaly detection algorithms
can be implemented for the development of CAS system using the largely available healthy samples, especially when disease/nodisease decision is important for screening.
This thesis proposes unsupervised machine learning methodologies for anomaly detection in retinal fundus images. A novel patchbased image reconstructor architecture for DR detection is presented, that addresses the shortcomings of standard autoencoders-based reconstructors.
Furthermore, a full-size image based anomaly map generation methodology is presented, where the potential DR lesions can be visualized at the pixel-level. Afterwards, a novel methodology is proposed to extend the patch-based architecture to a fully-convolutional
architecture for one-shot full-size image reconstruction. Finally, a novel methodology for supervised DR classification is proposed that utilizes the anomaly maps
Advances in Robotics, Automation and Control
The book presents an excellent overview of the recent developments in the different areas of Robotics, Automation and Control. Through its 24 chapters, this book presents topics related to control and robot design; it also introduces new mathematical tools and techniques devoted to improve the system modeling and control. An important point is the use of rational agents and heuristic techniques to cope with the computational complexity required for controlling complex systems. Through this book, we also find navigation and vision algorithms, automatic handwritten comprehension and speech recognition systems that will be included in the next generation of productive systems developed by man
Bayesian Condition Monitoring in Neonatal Intensive Care
Institute for Adaptive and Neural ComputationThe observed physiological dynamics of an infant receiving intensive care contain a great deal of information about factors which cannot be examined directly, including the state of health of the infant and the operation of the monitoring equipment. This type of data tends to contain both common, recognisable patterns (e.g. as caused by certain clinical operations or artifacts) and some which are rare and harder to interpret. The problem of identifying the presence of these patterns using prior knowledge is clinically significant, and one which is naturally described in terms of statistical machine learning.
In this thesis I develop probabilistic dynamical models which are capable of making useful inferences from neonatal intensive care unit monitoring data. The Factorial Switching Kalman Filter (FSKF) in particular is adopted as a suitable framework for monitoring the condition of an infant. The main contributions are as follows: (1) the application of the FSKF for inferring common factors in physiological monitoring data, which includes finding parameterisations of linear dynamical models to represent common physiological and artifactual conditions, and adapting parameter estimation and inference techniques for the purpose; (2) the formulation of a model for novel physiological dynamics, used to infer the times in which something is happening which is not described by any of the known patterns. EM updates are derived for the latter model in order to estimate parameters. Experimental results are given which show the developed methods to be effective on genuine monitoring data