3,323 research outputs found

Machine learning based data pre-processing for the purpose of medical data mining and decision support

Author: Rahman M. Mostafizur
Publication date: 01/03/2014

Building an accurate and reliable prediction model for different application domains is one of the most significant challenges in knowledge discovery and data mining. Sometimes, improved data quality is itself the goal of the analysis, usually to improve processes in a production database and the design of decision support. As medicine moves forward there is a need for sophisticated decision support systems that make use of data mining to support more orthodox knowledge engineering and Health Informatics practice. However, real-life medical data rarely complies with the requirements of various data mining tools. It is often inconsistent, noisy, in an unsuitable format, and imbalanced with regard to the outcome class label, and it often contains redundant attributes and missing values.
Many real-life data sets are incomplete, with missing values. In medical data mining the problem of missing values has become a challenging issue. In many clinical trials, the medical report pro-forma allows some attributes to be left blank, because they are inappropriate for some class of illness or the person providing the information feels that it is not appropriate to record the values for some attributes. The research reported in this thesis has explored the use of machine learning techniques as missing value imputation methods. The thesis also proposed a new way of imputing missing values by supervised learning. A classifier was used to learn the data patterns from a complete data sub-set and the model was later used to predict the missing values for the full dataset. The proposed machine learning based missing value imputation was applied to the thesis data and the results were compared with traditional Mean/Mode imputation. Experimental results show that all the machine learning methods which we explored outperformed the statistical method (Mean/Mode).
The class imbalance problem has been found to hinder the performance of learning systems. In fact, most medical datasets are found to be highly imbalanced in their class label. The solution to this problem is to reduce the gap between the minority class samples and the majority class samples. Over-sampling can be applied to increase the number of minority class samples to balance the data. The alternative to over-sampling is under-sampling, where the size of the majority class sample is reduced. The thesis proposed one cluster based under-sampling technique to reduce the gap between the majority and minority samples. Different under-sampling and over-sampling techniques were explored as ways to balance the data. The experimental results show that for the thesis data the newly proposed modified cluster based under-sampling technique performed better than other class balancing techniques.
In further research it was found that the class imbalance problem not only affects the classification performance but also has an adverse effect on feature selection. The thesis proposed a new framework for feature selection for class imbalanced datasets. The research found that, using the proposed framework, the classifier needs fewer attributes to show high accuracy, and more attributes are needed if the data is highly imbalanced.
The research described in the thesis contains the following four novel main contributions:
a) Improved data mining methodology for mining medical data
b) Machine learning based missing value imputation method
c) Cluster based semi-supervised class balancing method
d) Feature selection framework for class imbalanced datasets
The performance analysis and comparative study show that the use of the proposed method of missing value imputation, class balancing and feature selection framework can provide an effective approach to data preparation for building medical decision support
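The imputation idea in this abstract (learn from the complete sub-set, then predict the missing values) can be illustrated with a minimal sketch. It is not the thesis's implementation: the choice of a k-nearest-neighbour classifier, the function names `mode_impute` and `knn_impute`, and the toy data are all assumptions made for illustration, and the sketch assumes that only one categorical column has gaps while the remaining columns are numeric.

```python
from collections import Counter


def mode_impute(rows, col):
    """Baseline: fill missing entries in `col` with the most common observed value."""
    observed = [r[col] for r in rows if r[col] is not None]
    mode = Counter(observed).most_common(1)[0][0]
    return [r[:col] + [mode] + r[col + 1:] if r[col] is None else list(r)
            for r in rows]


def knn_impute(rows, col, k=3):
    """Fill missing entries in `col` by majority vote of the k nearest complete rows.

    Assumption: only `col` contains missing values; all other columns are numeric.
    """
    complete = [r for r in rows if all(v is not None for v in r)]
    features = [i for i in range(len(rows[0])) if i != col]

    def distance(a, b):
        # Squared Euclidean distance over the non-target columns.
        return sum((a[i] - b[i]) ** 2 for i in features)

    result = []
    for r in rows:
        if r[col] is None:
            neighbours = sorted(complete, key=lambda c: distance(r, c))[:k]
            vote = Counter(n[col] for n in neighbours).most_common(1)[0][0]
            r = r[:col] + [vote] + r[col + 1:]
        result.append(list(r))
    return result


# Hypothetical toy data: two numeric attributes and one categorical attribute.
rows = [
    [1.0, 1.0, "low"],
    [1.2, 0.9, "low"],
    [0.8, 1.1, "low"],
    [5.0, 5.0, "high"],
    [5.2, 4.8, "high"],
    [5.1, 5.1, None],
    [0.9, 1.0, None],
]

by_mode = mode_impute(rows, col=2)
by_knn = knn_impute(rows, col=2, k=3)
```

On this toy data the mode baseline fills both gaps with the globally most frequent label, whereas the learned model uses the other attributes of each incomplete row, which is the kind of difference the abstract's comparison with Mean/Mode imputation refers to.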

Repository@Hull - Worktribe

Creating groups with similar expected behavioural response in randomized controlled trials: a fuzzy cognitive map approach

Publication venue: BioMed Central

Springer - Publisher Connector

Proceedings of the 2017 International Conference on Decision Support System Technology: with a theme on data, information and knowledge visualisation in decision making

Publication date: 01/01/2017

Repository of the University of Namur

Advances in Robotics, Automation and Control

Publication venue: 'IntechOpen'
Publication date: 20/04/2021

The book presents an excellent overview of the recent developments in the different areas of Robotics, Automation and Control. Through its 24 chapters, this book presents topics related to control and robot design; it also introduces new mathematical tools and techniques devoted to improve the system modeling and control. An important point is the use of rational agents and heuristic techniques to cope with the computational complexity required for controlling complex systems. Through this book, we also find navigation and vision algorithms, automatic handwritten comprehension and speech recognition systems that will be included in the next generation of productive systems developed by man

Directory of Open Access Books (DOAB)

The state of the art development of AHP (1979-2017): A literature review with a social network analysis

Author: Ali Emrouznejad
Marianna Marra
Adebanjo D.
Cengiz Toklu M.
Dey B.
Kazibudzki P. T.
Kengpol A.
Knorr-Cetina K.
Medjoudj R.
Rotolo D.
Tiwari M.
Publication venue: 'Informa UK Limited'
Publication date: 14/06/2017

Although many papers describe the evolution of the analytic hierarchy process (AHP), most adopt a subjective approach. This paper examines the pattern of development of the AHP research field using social network analysis and scientometrics, and identifies its intellectual structure. The objectives are: (i) to trace the pattern of development of AHP research; (ii) to identify the patterns of collaboration among authors; (iii) to identify the most important papers underpinning the development of AHP; and (iv) to discover recent areas of interest. We analyse two types of networks: social networks, that is, co-authorship networks, and cognitive mapping or the network of disciplines affected by AHP. Our analyses are based on 8441 papers published between 1979 and 2017, retrieved from the ISI Web of Science database. To provide a longitudinal perspective on the pattern of evolution of AHP, we analyse these two types of networks during the three periods 1979–1990, 1991–2001 and 2002–2017. We provide some basic statistics on AHP journals and researchers, review the main topics and applications of integrated AHPs and provide direction for future research by highlighting some open questions

University of Essex Research Repository

Crossref

Biblioteca Digital de la Comunidad de Madrid

Aston Publications Explorer

Sussex Research Online

Human Factors in Agile Software Development

Author: Lin Jun
Publication date: 01/01/2015

Through four years of experiments on students' Scrum-based agile software development (ASD) process, we have gained a deep understanding of the human factors of agile methodology. We designed an agile project management tool, the HASE collaboration development platform, to support more than 400 students self-organized into 80 teams to practice ASD. In this thesis, based on our experiments, simulations and analysis, we contribute a series of solutions and insights, including 1) a Goal Net based method to enhance goal and requirement management for the ASD process, 2) a novel Simple Multi-Agent Real-Time (SMART) approach to enhance intelligent task allocation for the ASD process, 3) a Fuzzy Cognitive Maps (FCMs) based method to enhance emotion and morale management for the ASD process, 4) the first large-scale in-depth empirical insights on human factors in the ASD process, which have not yet been well studied by existing research, and 5) the first identification of the ASD process as a human-computation system that exploits human efforts to perform tasks that computers are not good at solving. On the other hand, computers can assist human decision making in the ASD process. Comment: Book Draf

arXiv.org e-Print Archive

DR-NTU (Digital Repository of NTU)

S-TREE: Self-Organizing Trees for Data Clustering and Online Vector Quantization

Author: Campos Marcos
Carpenter Gail
Publication venue: Boston University Center for Adaptive Systems and Department of Cognitive and Neural Systems
Publication date: 01/09/2000

This paper introduces S-TREE (Self-Organizing Tree), a family of models that use unsupervised learning to construct hierarchical representations of data and online tree-structured vector quantizers. The S-TREE1 model, which features a new tree-building algorithm, can be implemented with various cost functions. An alternative implementation, S-TREE2, which uses a new double-path search procedure, is also developed. S-TREE2 implements an online procedure that approximates an optimal (unstructured) clustering solution while imposing a tree-structure constraint. The performance of the S-TREE algorithms is illustrated with data clustering and vector quantization examples, including a Gauss-Markov source benchmark and an image compression application. S-TREE performance on these tasks is compared with the standard tree-structured vector quantizer (TSVQ) and the generalized Lloyd algorithm (GLA). The image reconstruction quality with S-TREE2 approaches that of GLA while taking less than 10% of computer time. S-TREE1 and S-TREE2 also compare favorably with the standard TSVQ in both the time needed to create the codebook and the quality of image reconstruction. Office of Naval Research (N00014-95-10409, N00014-95-0G57

Boston University Institutional Repository (OpenBU)

The state of the art development of AHP (1979-2017): a literature review with a social network analysis

Author: Adebanjo D.
Ali Emrouznejad
Cengiz Toklu M.
Dey B.
Kazibudzki P. T.
Kengpol A.
Knorr-Cetina K.
Marianna Marra
Medjoudj R.
Rotolo D.
Tiwari M.
Publication venue: 'Informa UK Limited'
Publication date: 14/06/2017

Although many papers describe the evolution of the analytic hierarchy process (AHP), most adopt a subjective approach. This paper examines the pattern of development of the AHP research field using social network analysis and scientometrics, and identifies its intellectual structure. The objectives are: (i) to trace the pattern of development of AHP research; (ii) to identify the patterns of collaboration among authors; (iii) to identify the most important papers underpinning the development of AHP; and (iv) to discover recent areas of interest. We analyse two types of networks: social networks, that is, co-authorship networks, and cognitive mapping or the network of disciplines affected by AHP. Our analyses are based on 8441 papers published between 1979 and 2017, retrieved from the ISI Web of Science database. To provide a longitudinal perspective on the pattern of evolution of AHP, we analyse these two types of networks during the three periods 1979–1990, 1991–2001 and 2002–2017. We provide some basic statistics on AHP journals and researchers, review the main topics and applications of integrated AHPs and provide direction for future research by highlighting some open questions

University of Essex Research Repository

Crossref

Aston Publications Explorer

Sussex Research Online

Collective intelligence in self-organized industrial cyber-physical systems

Author: Leitão Paulo
Queiroz Jonas
Sakurada Lucas
Publication venue: 'MDPI AG'
Publication date: 01/01/2022

Cyber-physical systems (CPS) play an important role in the implementation of new Industry 4.0 solutions, acting as the backbone infrastructure to host distributed intelligence capabilities and promote the collective intelligence that emerges from the interactions among individuals. This collective intelligence concept provides an alternative way to design complex systems with several benefits, such as modularity, flexibility, robustness, and reconfigurability to condition changes, but it also presents several challenges to be managed (e.g., non-linearity, self-organization, and myopia). With this in mind, this paper discusses the factors that characterize collective intelligence, particularly that associated with industrial CPS, analyzing the enabling concepts, technologies, and application sectors, and providing an illustrative example of its application in an automotive assembly line. The main contribution of the paper focuses on a comprehensive review and analysis of the main aspects, challenges, and research opportunities to be considered for implementing collective intelligence in industrial CPS. The identified challenges are clustered according to five different categories, namely decentralization, emergency, intelligent machines and products, infrastructures and methods, and human integration and ethics. Although the research indicates some potential benefits of using collective intelligence to achieve the desired levels of autonomy and dynamic adaptation of industrial CPS, such approaches are still in the early stages, with perspectives to increase in the coming years. Based on that, they need to be further developed considering some main aspects, for example, related to balancing the distribution of intelligence by the vertical and horizontal dimensions and controlling the nervousness in self-organized systems.

Directory of Open Access Journals

Biblioteca Digital do IPB