3,323 research outputs found

    Machine learning based data pre-processing for the purpose of medical data mining and decision support

    Get PDF
    Building an accurate and reliable model for prediction for different application domains, is one of the most significant challenges in knowledge discovery and data mining. Sometimes, improved data quality is itself the goal of the analysis, usually to improve processes in a production database and the designing of decision support. As medicine moves forward there is a need for sophisticated decision support systems that make use of data mining to support more orthodox knowledge engineering and Health Informatics practice. However, the real-life medical data rarely complies with the requirements of various data mining tools. It is often inconsistent, noisy, containing redundant attributes, in an unsuitable format, containing missing values and imbalanced with regards to the outcome class label.Many real-life data sets are incomplete, with missing values. In medical data mining the problem with missing values has become a challenging issue. In many clinical trials, the medical report pro-forma allow some attributes to be left blank, because they are inappropriate for some class of illness or the person providing the information feels that it is not appropriate to record the values for some attributes. The research reported in this thesis has explored the use of machine learning techniques as missing value imputation methods. The thesis also proposed a new way of imputing missing value by supervised learning. A classifier was used to learn the data patterns from a complete data sub-set and the model was later used to predict the missing values for the full dataset. The proposed machine learning based missing value imputation was applied on the thesis data and the results are compared with traditional Mean/Mode imputation. Experimental results show that all the machine learning methods which we explored outperformed the statistical method (Mean/Mode).The class imbalance problem has been found to hinder the performance of learning systems. In fact, most of the medical datasets are found to be highly imbalance in their class label. The solution to this problem is to reduce the gap between the minority class samples and the majority class samples. Over-sampling can be applied to increase the number of minority class sample to balance the data. The alternative to over-sampling is under-sampling where the size of majority class sample is reduced. The thesis proposed one cluster based under-sampling technique to reduce the gap between the majority and minority samples. Different under-sampling and over-sampling techniques were explored as ways to balance the data. The experimental results show that for the thesis data the new proposed modified cluster based under-sampling technique performed better than other class balancing techniques.In further research it is found that the class imbalance problem not only affects the classification performance but also has an adverse effect on feature selection. The thesis proposed a new framework for feature selection for class imbalanced datasets. The research found that, using the proposed framework the classifier needs less attributes to show high accuracy, and more attributes are needed if the data is highly imbalanced.The research described in the thesis contains the flowing four novel main contributions.a) Improved data mining methodology for mining medical datab) Machine learning based missing value imputation methodc) Cluster Based semi-supervised class balancing methodd) Feature selection framework for class imbalance datasetsThe performance analysis and comparative study show that the use of proposed method of missing value imputation, class balancing and feature selection framework can provide an effective approach to data preparation for building medical decision support

    Advances in Robotics, Automation and Control

    Get PDF
    The book presents an excellent overview of the recent developments in the different areas of Robotics, Automation and Control. Through its 24 chapters, this book presents topics related to control and robot design; it also introduces new mathematical tools and techniques devoted to improve the system modeling and control. An important point is the use of rational agents and heuristic techniques to cope with the computational complexity required for controlling complex systems. Through this book, we also find navigation and vision algorithms, automatic handwritten comprehension and speech recognition systems that will be included in the next generation of productive systems developed by man

    The state of the art development of AHP (1979-2017): A literature review with a social network analysis

    Get PDF
    Although many papers describe the evolution of the analytic hierarchy process (AHP), most adopt a subjective approach. This paper examines the pattern of development of the AHP research field using social network analysis and scientometrics, and identifies its intellectual structure. The objectives are: (i) to trace the pattern of development of AHP research; (ii) to identify the patterns of collaboration among authors; (iii) to identify the most important papers underpinning the development of AHP; and (iv) to discover recent areas of interest. We analyse two types of networks: social networks, that is, co-authorship networks, and cognitive mapping or the network of disciplines affected by AHP. Our analyses are based on 8441 papers published between 1979 and 2017, retrieved from the ISI Web of Science database. To provide a longitudinal perspective on the pattern of evolution of AHP, we analyse these two types of networks during the three periods 1979?1990, 1991?2001 and 2002?2017. We provide some basic statistics on AHP journals and researchers, review the main topics and applications of integrated AHPs and provide direction for future research by highlighting some open questions

    Human Factors in Agile Software Development

    Full text link
    Through our four years experiments on students' Scrum based agile software development (ASD) process, we have gained deep understanding into the human factors of agile methodology. We designed an agile project management tool - the HASE collaboration development platform to support more than 400 students self-organized into 80 teams to practice ASD. In this thesis, Based on our experiments, simulations and analysis, we contributed a series of solutions and insights in this researches, including 1) a Goal Net based method to enhance goal and requirement management for ASD process, 2) a novel Simple Multi-Agent Real-Time (SMART) approach to enhance intelligent task allocation for ASD process, 3) a Fuzzy Cognitive Maps (FCMs) based method to enhance emotion and morale management for ASD process, 4) the first large scale in-depth empirical insights on human factors in ASD process which have not yet been well studied by existing research, and 5) the first to identify ASD process as a human-computation system that exploit human efforts to perform tasks that computers are not good at solving. On the other hand, computers can assist human decision making in the ASD process.Comment: Book Draf

    S-TREE: Self-Organizing Trees for Data Clustering and Online Vector Quantization

    Full text link
    This paper introduces S-TREE (Self-Organizing Tree), a family of models that use unsupervised learning to construct hierarchical representations of data and online tree-structured vector quantizers. The S-TREE1 model, which features a new tree-building algorithm, can be implemented with various cost functions. An alternative implementation, S-TREE2, which uses a new double-path search procedure, is also developed. S-TREE2 implements an online procedure that approximates an optimal (unstructured) clustering solution while imposing a tree-structure constraint. The performance of the S-TREE algorithms is illustrated with data clustering and vector quantization examples, including a Gauss-Markov source benchmark and an image compression application. S-TREE performance on these tasks is compared with the standard tree-structured vector quantizer (TSVQ) and the generalized Lloyd algorithm (GLA). The image reconstruction quality with S-TREE2 approaches that of GLA while taking less than 10% of computer time. S-TREE1 and S-TREE2 also compare favorably with the standard TSVQ in both the time needed to create the codebook and the quality of image reconstruction.Office of Naval Research (N00014-95-10409, N00014-95-0G57

    The state of the art development of AHP (1979-2017): a literature review with a social network analysis

    Get PDF
    Although many papers describe the evolution of the analytic hierarchy process (AHP), most adopt a subjective approach. This paper examines the pattern of development of the AHP research field using social network analysis and scientometrics, and identifies its intellectual structure. The objectives are: (i) to trace the pattern of development of AHP research; (ii) to identify the patterns of collaboration among authors; (iii) to identify the most important papers underpinning the development of AHP; and (iv) to discover recent areas of interest. We analyse two types of networks: social networks, that is, co-authorship networks, and cognitive mapping or the network of disciplines affected by AHP. Our analyses are based on 8441 papers published between 1979 and 2017, retrieved from the ISI Web of Science database. To provide a longitudinal perspective on the pattern of evolution of AHP, we analyse these two types of networks during the three periods 1979–1990, 1991–2001 and 2002–2017. We provide some basic statistics on AHP journals and researchers, review the main topics and applications of integrated AHPs and provide direction for future research by highlighting some open questions

    Collective intelligence in self-organized industrial cyber-physical systems

    Get PDF
    Cyber-physical systems (CPS) play an important role in the implementation of new Industry 4.0 solutions, acting as the backbone infrastructure to host distributed intelligence capabilities and promote the collective intelligence that emerges from the interactions among individuals. This collective intelligence concept provides an alternative way to design complex systems with several benefits, such as modularity, flexibility, robustness, and reconfigurability to condition changes, but it also presents several challenges to be managed (e.g., non-linearity, self-organization, and myopia). With this in mind, this paper discusses the factors that characterize collective intelligence, particularly that associated with industrial CPS, analyzing the enabling concepts, technologies, and application sectors, and providing an illustrative example of its application in an automotive assembly line. The main contribution of the paper focuses on a comprehensive review and analysis of the main aspects, challenges, and research opportunities to be considered for implementing collective intelligence in industrial CPS. The identified challenges are clustered according to five different categories, namely decentralization, emergency, intelligent machines and products, infrastructures and methods, and human integration and ethics. Although the research indicates some potential benefits of using collective intelligence to achieve the desired levels of autonomy and dynamic adaptation of industrial CPS, such approaches are still in the early stages, with perspectives to increase in the coming years. Based on that, they need to be further developed considering some main aspects, for example, related to balancing the distribution of intelligence by the vertical and horizontal dimensions and controlling the nervousness in self-organized systems.info:eu-repo/semantics/publishedVersio
    • …
    corecore