1,057 research outputs found
A systematic review of data quality issues in knowledge discovery tasks
Hay un gran crecimiento en el volumen de datos porque las organizaciones capturan permanentemente la cantidad colectiva de datos para lograr un mejor proceso de toma de decisiones. El desafĂo mas fundamental es la exploraciĂłn de los grandes volĂşmenes de datos y la extracciĂłn de conocimiento Ăştil para futuras acciones por medio de tareas para el descubrimiento del conocimiento; sin embargo, muchos datos presentan mala calidad. Presentamos una revisiĂłn sistemática de los asuntos de calidad de datos en las áreas del descubrimiento de conocimiento y un estudio de caso aplicado a la enfermedad agrĂcola conocida como la roya del cafĂ©.Large volume of data is growing because the organizations are continuously capturing the collective amount of data for better decision-making process. The most fundamental challenge is to explore the large volumes of data and extract useful knowledge for future actions through knowledge discovery tasks, nevertheless many data has poor quality. We presented a systematic review of the data quality issues in knowledge discovery tasks and a case study applied to agricultural disease named coffee rust
Thought and Behavior Contagion in Capital Markets
Prevailing models of capital markets capture a limited form of social influence and information transmission, in which the beliefs and behavior of an investor affects others only through market price, information transmission and processing is simple (without thoughts and feelings), and there is no localization in the influence of an investor on others. In reality, individuals often process verbal arguments obtained in conversation or from media presentations, and observe the behavior of others. We review here evidence concerning how these activities cause beliefs and behaviors to spread, affect financial decisions, and affect market prices; and theoretical models of social influence and its effects on capital markets. Social influence is central to how information and investor sentiment are transmitted, so thought and behavior contagion should be incorporated into the theory of capital markets.capital markets; thought contagion; behavioral contagion; herd behavior; information cascades; social learning; investor psychology; accounting regulation; disclosure policy; behavioral finance; market efficiency; popular models; memes
Memetic Pareto Evolutionary Artificial Neural Networks for the determination of growth limits of Listeria Monocytogenes
The main objective of this work is to automatically
design neural network models with sigmoidal basis
units for classification tasks, so that classifiers are
obtained in the most balanced way possible in terms of
CCR and Sensitivity (given by the lowest percentage of
examples correctly predicted to belong to each class).
We present a Memetic Pareto Evolutionary NSGA2
(MPENSGA2) approach based on the Pareto-NSGAII
evolution (PNSGAII) algorithm. We propose to
augmente it with a local search using the improved
Rprop—IRprop algorithm for the prediction of
growth/no growth of L. monocytogenes as a function of
the storage temperature, pH, citric (CA) and ascorbic
acid (AA). The results obtained show that the
generalization ability can be more efficiently improved
within a framework that is multi-objective instead of a
within a single-objective one
An Optimisation-Driven Prediction Method for Automated Diagnosis and Prognosis
open access articleThis article presents a novel hybrid classification paradigm for medical diagnoses and prognoses prediction. The core mechanism of the proposed method relies on a centroid classification algorithm whose logic is exploited to formulate the classification task as a real-valued optimisation problem. A novel metaheuristic combining the algorithmic structure of Swarm Intelligence optimisers with the probabilistic search models of Estimation of Distribution Algorithms is designed to optimise such a problem, thus leading to high-accuracy predictions. This method is tested over 11 medical datasets and compared against 14 cherry-picked classification algorithms. Results show that the proposed approach is competitive and superior to the state-of-the-art on several occasions
A Framework for Leveraging Artificial Intelligence in Project Management
Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Information Systems and Technologies ManagementThis dissertation aims to support the project manager in their daily tasks. As we use artificial
intelligence (AI) and machine learning (ML) in everyday life, it is necessary to include them in business
and change traditional ways of working. For the purpose of this study, it is essential to understand
challenges and areas of project management and how artificial intelligence can contribute to them. A
theoretical overview, applying the knowledge of project management, will show a holistic view of the
current situation in the enterprises. The research is about artificial intelligence applications in project
management, the common activities in project management, the biggest challenges, and how AI and
ML can support it. Understanding project managers help create a framework that will contribute to
optimizing their tasks. After designing and developing the framework for applying artificial intelligence
to project management, the project managers were asked to evaluate. This study is essential to
increase awareness among the stakeholders and enterprises on how automation of the processes can
be improved and how AI and ML can decrease the possibility of risk and cost along with improving the
happiness and efficiency of the employees
An Interval-based Multiobjective Approach to Feature Subset Selection Using Joint Modeling of Objectives and Variables
This paper studies feature subset selection in classification using a multiobjective estimation of distribution algorithm. We consider six functions, namely area under ROC curve, sensitivity, specificity, precision, F1 measure and Brier score, for evaluation of feature subsets and as the objectives of the problem. One of the characteristics of these objective functions is the existence of noise in their values that should be appropriately handled during optimization. Our proposed
algorithm consists of two major techniques which are specially designed for the feature subset selection problem. The first one is a solution ranking method based on interval values to handle the noise in the objectives of this problem. The second one is a model estimation method for learning a joint probabilistic model of objectives and variables which is used to generate new solutions and advance through the search space. To simplify model estimation, l1 regularized regression is used to select a
subset of problem variables before model learning. The proposed algorithm is compared with a well-known ranking method for interval-valued objectives and a standard multiobjective genetic algorithm. Particularly, the effects of the two new techniques are experimentally investigated. The experimental results show that the proposed algorithm is able to obtain comparable or better
performance on the tested datasets
Recommended from our members
A niching memetic algorithm for simultaneous clustering and feature selection
Clustering is inherently a difficult task, and is made even more difficult when the selection of relevant features is also an issue. In this paper we propose an approach for simultaneous clustering and feature selection using a niching memetic algorithm. Our approach (which we call NMA_CFS) makes feature selection an integral part of the global clustering search procedure and attempts to overcome the problem of identifying less promising locally optimal solutions in both clustering and feature selection, without making any a priori assumption about the number of clusters. Within the NMA_CFS procedure, a variable composite representation is devised to encode both feature selection and cluster centers with different numbers of clusters. Further, local search operations are introduced to refine feature selection and cluster centers encoded in the chromosomes. Finally, a niching method is integrated to preserve the population diversity and prevent premature convergence. In an experimental evaluation we demonstrate the effectiveness of the proposed approach and compare it with other related approaches, using both synthetic and real data
- …