1,150 research outputs found

    Data mining in soft computing framework: a survey

    Get PDF
    The present article provides a survey of the available literature on data mining using soft computing. A categorization has been provided based on the different soft computing tools and their hybridizations used, the data mining function implemented, and the preference criterion selected by the model. The utility of the different soft computing methodologies is highlighted. Generally fuzzy sets are suitable for handling the issues related to understandability of patterns, incomplete/noisy data, mixed media information and human interaction, and can provide approximate solutions faster. Neural networks are nonparametric, robust, and exhibit good learning and generalization capabilities in data-rich environments. Genetic algorithms provide efficient search algorithms to select a model, from mixed media data, based on some preference criterion/objective function. Rough sets are suitable for handling different types of uncertainty in data. Some challenges to data mining and the application of soft computing methodologies are indicated. An extensive bibliography is also included

    A novel hybrid recommendation system for library book selection

    Get PDF
    Abstract. Increasing number of books published in a year and decreasing budgets have made collection development increasingly difficult in libraries. Despite the data to help decision making being available in the library systems, the librarians have little means to utilize the data. In addition, modern key technologies, such as machine learning, that generate more value out data have not yet been utilized in the field of libraries to their full extent. This study was set to discover a way to build a recommendation system that could help librarians who are struggling with book selection process. This thesis proposed a novel hybrid recommendation system for library book selection. The data used to build the system consisted of book metadata and book circulation data of books located in Joensuu City Libraryโ€™s adult fiction collection. The proposed system was based on both rule-based components and a machine learning model. The user interface for the system was build using web technologies so that the system could be used via using web browser. The proposed recommendation system was evaluated using two different methods: automated tests and focus group methodology. The system achieved an accuracy of 79.79% and F1 score of 0.86 in automated tests. Uncertainty rate of the system was 27.87%. With these results in automated tests, the proposed system outperformed baseline machine learning models. The main suggestions that were gathered from focus group evaluation were that while the proposed system was found interesting, librarians thought it would need more features and configurability in order to be usable in real world scenarios. Results indicate that making good quality recommendations using book metadata is challenging because the data is high dimensional categorical data by its nature. Main implications of the results are that recommendation systems in domain of library collection development should focus on data pre-processing and feature engineering. Further investigation is suggested to be carried out regarding knowledge representation

    ๊ณ„์ธต ๊ฐ•ํ™” ํ•™์Šต์—์„œ์˜ ํƒํ—˜์  ํ˜ผํ•ฉ ํƒ์ƒ‰

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ „๊ธฐยท์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€, 2020. 8. ๋ฌธ๋ณ‘๋กœ.Balancing exploitation and exploration is a great challenge in many optimization problems. Evolutionary algorithms, such as evolutionary strategies and genetic algorithms, are algorithms inspired by biological evolution. They have been used for various optimization problems, such as combinatorial optimization and continuous optimization. However, evolutionary algorithms lack fine-tuning near local optima; in other words, they lack exploitation power. This drawback can be overcome by hybridization. Hybrid genetic algorithms, or memetic algorithms, are successful examples of hybridization. Although the solution space is exponentially vast in some optimization problems, these algorithms successfully find satisfactory solutions. In the deep learning era, the problem of exploitation and exploration has been relatively neglected. In deep reinforcement learning problems, however, balancing exploitation and exploration is more crucial than that in problems with supervision. Many environments in the real world have an exponentially wide state space that must be explored by agents. Without sufficient exploration power, agents only reveal a small portion of the state space and end up with seeking only instant rewards. In this thesis, a hybridization method is proposed which contains both gradientbased policy optimization with strong exploitation power and evolutionary policy optimization with strong exploration power. First, the gradient-based policy optimization and evolutionary policy optimization are analyzed in various environments. The results demonstrate that evolutionary policy optimization is robust for sparse rewards but weak for instant rewards, whereas gradient-based policy optimization is effective for instant rewards but weak for sparse rewards. This difference between the two optimizations reveals the potential of hybridization in policy optimization. Then, a hybrid search is suggested in the framework of hierarchical reinforcement learning. The results demonstrate that the hybrid search finds an effective agent for complex environments with sparse rewards thanks to its balanced exploitation and exploration.๋งŽ์€ ์ตœ์ ํ™” ๋ฌธ์ œ์—์„œ ํƒ์‚ฌ์™€ ํƒํ—˜์˜ ๊ท ํ˜•์„ ๋งž์ถ”๋Š” ๊ฒƒ์€ ๋งค์šฐ ์ค‘์š”ํ•œ ๋ฌธ์ œ์ด๋‹ค. ์ง„ํ™” ์ „๋žต๊ณผ ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜๊ณผ ๊ฐ™์€ ์ง„ํ™” ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ์ž์—ฐ์—์„œ์˜ ์ง„ํ™”์—์„œ ์˜๊ฐ์„ ์–ป์€ ๋ฉ”ํƒ€ํœด๋ฆฌ์Šคํ‹ฑ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด๋‹ค. ์ด๋“ค์€ ์กฐํ•ฉ ์ตœ์ ํ™”, ์—ฐ์† ์ตœ์ ํ™”์™€ ๊ฐ™์€ ๋‹ค์–‘ํ•œ ์ตœ์ ํ™” ๋ฌธ์ œ๋ฅผ ํ’€๊ธฐ ์œ„ํ•ด ์‚ฌ์šฉ๋˜์—ˆ๋‹ค. ํ•˜์ง€๋งŒ ์ง„ํ™” ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ์ง€์—ญ ์ตœ์ ํ•ด ๊ทผ์ฒ˜์—์„œ์˜ ๋ฏธ์„ธ ์กฐ์ •, ์ฆ‰ ํƒ์‚ฌ์— ์•ฝํ•œ ํŠน์„ฑ์ด ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ๊ฒฐ์ ํ•จ์€ ํ˜ผํ•ฉํ™”๋ฅผ ํ†ตํ•ด ๊ทน๋ณตํ•  ์ˆ˜ ์žˆ๋‹ค. ํ˜ผํ•ฉ ์œ ์ „ ์•Œ๊ณ ๋ฆฌ์ฆ˜, ํ˜น์€ ๋ฏธ๋ฏธํ‹ฑ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด ์„ฑ๊ณต์ ์ธ ํ˜ผํ•ฉํ™”์˜ ์‚ฌ๋ก€์ด๋‹ค. ์ด๋Ÿฌํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ์ตœ์ ํ™” ๋ฌธ์ œ์˜ ํ•ด ๊ณต๊ฐ„์ด ๊ธฐํ•˜๊ธ‰์ˆ˜์ ์œผ๋กœ ๋„“๋”๋ผ๋„ ์„ฑ๊ณต์ ์œผ๋กœ ๋งŒ์กฑ์Šค๋Ÿฌ์šด ํ•ด๋ฅผ ์ฐพ์•„๋‚ธ๋‹ค. ํ•œํŽธ ์‹ฌ์ธต ํ•™์Šต์˜ ์‹œ๋Œ€์—์„œ, ํƒ์‚ฌ์™€ ํƒํ—˜์˜ ๊ท ํ˜•์„ ๋งž์ถ”๋Š” ๋ฌธ์ œ๋Š” ์ข…์ข… ๋ฌด์‹œ๋˜์—ˆ๋‹ค. ํ•˜์ง€๋งŒ ์‹ฌ์ธต ๊ฐ•ํ™”ํ•™์Šต์—์„œ๋Š” ํƒ์‚ฌ์™€ ํƒํ—˜์˜ ๊ท ํ˜•์„ ๋งž์ถ”๋Š” ์ผ์€ ์ง€๋„ํ•™์Šต์—์„œ๋ณด๋‹ค ํ›จ์”ฌ ๋” ์ค‘์š”ํ•˜๋‹ค. ๋งŽ์€ ์‹ค์ œ ์„ธ๊ณ„์˜ ํ™˜๊ฒฝ์€ ๊ธฐํ•˜๊ธ‰์ˆ˜์ ์œผ๋กœ ํฐ ์ƒํƒœ ๊ณต๊ฐ„์„ ๊ฐ€์ง€๊ณ  ์žˆ๊ณ  ์—์ด์ „ํŠธ๋Š” ์ด๋ฅผ ํƒํ—˜ํ•ด์•ผ๋งŒ ํ•œ๋‹ค. ์ถฉ๋ถ„ํ•œ ํƒํ—˜ ๋Šฅ๋ ฅ์ด ์—†์œผ๋ฉด ์—์ด์ „ํŠธ๋Š” ์ƒํƒœ ๊ณต๊ฐ„์˜ ๊ทนํžˆ ์ผ๋ถ€๋งŒ์„ ๋ฐํ˜€๋‚ด์–ด ๊ฒฐ๊ตญ ์ฆ‰๊ฐ์ ์ธ ๋ณด์ƒ๋งŒ ํƒํ•˜๊ฒŒ ๋  ๊ฒƒ์ด๋‹ค. ๋ณธ ํ•™์œ„๋…ผ๋ฌธ์—์„œ๋Š” ๊ฐ•ํ•œ ํƒ์‚ฌ ๋Šฅ๋ ฅ์„ ๊ฐ€์ง„ ๊ทธ๋ ˆ๋””์–ธํŠธ ๊ธฐ๋ฐ˜ ์ •์ฑ… ์ตœ์ ํ™”์™€ ๊ฐ•ํ•œ ํƒํ—˜ ๋Šฅ๋ ฅ์„ ๊ฐ€์ง„ ์ง„ํ™”์  ์ •์ฑ… ์ตœ์ ํ™”๋ฅผ ํ˜ผํ•ฉํ•˜๋Š” ๊ธฐ๋ฒ•์„ ์ œ์‹œํ•  ๊ฒƒ์ด๋‹ค. ์šฐ์„  ๊ทธ๋ ˆ๋””์–ธํŠธ ๊ธฐ๋ฐ˜ ์ •์ฑ… ์ตœ์ ํ™”์™€ ์ง„ํ™”์  ์ •์ฑ… ์ตœ์ ํ™”๋ฅผ ๋‹ค์–‘ํ•œ ํ™˜๊ฒฝ์—์„œ ๋ถ„์„ํ•œ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ ๊ทธ๋ ˆ๋””์–ธํŠธ ๊ธฐ๋ฐ˜ ์ •์ฑ… ์ตœ์ ํ™”๋Š” ์ฆ‰๊ฐ์  ๋ณด์ƒ์— ํšจ๊ณผ์ ์ด์ง€๋งŒ ๋ณด์ƒ์˜ ๋ฐ€๋„๊ฐ€ ๋‚ฎ์„๋•Œ ์ทจ์•ฝํ•œ ๋ฐ˜๋ฉด ์ง„ํ™”์  ์ •์ฑ… ์ตœ์ ํ™”๊ฐ€ ๋ฐ€๋„๊ฐ€ ๋‚ฎ์€ ๋ณด์ƒ์— ๋Œ€ํ•ด ๊ฐ•ํ•˜์ง€๋งŒ ์ฆ‰๊ฐ์ ์ธ ๋ณด์ƒ์— ๋Œ€ํ•ด ์ทจ์•ฝํ•˜๋‹ค๋Š” ๊ฒƒ์„ ์•Œ ์ˆ˜ ์žˆ๋‹ค. ๋‘ ๊ฐ€์ง€ ์ตœ์ ํ™”์˜ ํŠน์ง• ์ƒ ์ฐจ์ด์ ์ด ํ˜ผํ•ฉ์  ์ •์ฑ… ์ตœ์ ํ™”์˜ ๊ฐ€๋Šฅ์„ฑ์„ ๋ณด์—ฌ์ค€๋‹ค. ๊ทธ๋ฆฌ๊ณ  ๊ณ„์ธต์  ๊ฐ•ํ™” ํ•™์Šต ํ”„๋ ˆ์ž„์›Œํฌ์—์„œ์˜ ํ˜ผํ•ฉ ํƒ์ƒ‰ ๊ธฐ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. ๊ทธ ๊ฒฐ๊ณผ ํ˜ผํ•ฉ ํƒ์ƒ‰ ๊ธฐ๋ฒ•์ด ๊ท ํ˜•์žกํžŒ ํƒ์‚ฌ์™€ ํƒํ—˜ ๋•๋ถ„์— ๋ฐ€๋„๊ฐ€ ๋‚ฎ์€ ๋ณด์ƒ์„ ์ฃผ๋Š” ๋ณต์žกํ•œ ํ™˜๊ฒฝ์—์„œ ํšจ๊ณผ์ ์ธ ์—์ด์ „ํŠธ๋ฅผ ์ฐพ์•„๋‚ธ ๋‹ค๋Š” ๊ฒƒ์„ ๋ณด์—ฌ์ค€๋‹ค.I. Introduction 1 II. Background 6 2.1 Evolutionary Computations 6 2.1.1 Hybrid Genetic Algorithm 7 2.1.2 Evolutionary Strategy 9 2.2 Hybrid Genetic Algorithm Example: Brick Layout Problem 10 2.2.1 Problem Statement 11 2.2.2 Hybrid Genetic Algorithm 11 2.2.3 Experimental Results 14 2.2.4 Discussion 15 2.3 Reinforcement Learning 16 2.3.1 Policy Optimization 19 2.3.2 Proximal Policy Optimization 21 2.4 Neuroevolution for Reinforcement Learning 23 2.5 Hierarchical Reinforcement Learning 25 2.5.1 Option-based HRL 26 2.5.2 Goal-based HRL 27 2.5.3 Exploitation versus Exploration 27 III. Understanding Features of Evolutionary Policy Optimizations 29 3.1 Experimental Setup 31 3.2 Feature Analysis 32 3.2.1 Convolution Filter Inspection 32 3.2.2 Saliency Map 36 3.3 Discussion 40 3.3.1 Behavioral Characteristics 40 3.3.2 ES Agent without Inputs 42 IV. Hybrid Search for Hierarchical Reinforcement Learning 44 4.1 Method 45 4.2 Experimental Setup 47 4.2.1 Environment 47 4.2.2 Network Architectures 50 4.2.3 Training 50 4.3 Results 51 4.3.1 Comparison 51 4.3.2 Experimental Results 53 4.3.3 Behavior of Low-Level Policy 54 4.4 Conclusion 55 V. Conclusion 56 5.1 Summary 56 5.2 Future Work 57 Bibliography 58Docto

    Short-Term Rainfall Prediction Using Supervised Machine Learning

    Get PDF
    Floods and rain significantly impact the economy of many agricultural countries in the world. Early prediction of rain and floods can dramatically help prevent natural disaster damage. This paper presents a machine learning and data-driven method that can accurately predict short-term rainfall. Various machine learning classification algorithms have been implemented on an Australian weather dataset to train and develop an accurate and reliable model. To choose the best suitable prediction model, diverse machine learning algorithms have been applied for classification as well. Eventually, the performance of the models has been compared based on standard performance measurement metrics. The finding shows that the hist gradient boosting classifier has given the highest accuracy of 91%, with a good F1 value and receiver operating characteristic, the area under the curve score

    DPVis: Visual Analytics with Hidden Markov Models for Disease Progression Pathways

    Full text link
    Clinical researchers use disease progression models to understand patient status and characterize progression patterns from longitudinal health records. One approach for disease progression modeling is to describe patient status using a small number of states that represent distinctive distributions over a set of observed measures. Hidden Markov models (HMMs) and its variants are a class of models that both discover these states and make inferences of health states for patients. Despite the advantages of using the algorithms for discovering interesting patterns, it still remains challenging for medical experts to interpret model outputs, understand complex modeling parameters, and clinically make sense of the patterns. To tackle these problems, we conducted a design study with clinical scientists, statisticians, and visualization experts, with the goal to investigate disease progression pathways of chronic diseases, namely type 1 diabetes (T1D), Huntington's disease, Parkinson's disease, and chronic obstructive pulmonary disease (COPD). As a result, we introduce DPVis which seamlessly integrates model parameters and outcomes of HMMs into interpretable and interactive visualizations. In this study, we demonstrate that DPVis is successful in evaluating disease progression models, visually summarizing disease states, interactively exploring disease progression patterns, and building, analyzing, and comparing clinically relevant patient subgroups.Comment: to appear at IEEE Transactions on Visualization and Computer Graphic
    • โ€ฆ
    corecore