1,089 research outputs found

Adaptive classification under computational budget constraints using sequential data gathering

Author: Couckuyt Ivo
Deschrijver Dirk
Dhaene Tom
van der Herten Joachim
Publication venue: 'Elsevier BV'
Publication date: 01/01/2016

Rock-burst occurrence prediction based on optimized naïve bayes models

Author: Armaghani Danial
Asteris Panagiotis
Ke Bo
Khandelwal Manoj
Mamou Anna
Skentou Athanasia
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2021

Rock-burst is a common failure in hard-rock projects in civil and mining construction; proper classification and prediction of this phenomenon is therefore of interest. This research presents the development of optimized naïve Bayes models for predicting rock-burst failures in underground projects. The naïve Bayes models were optimized using four weight optimization techniques: forward, backward, particle swarm optimization, and evolutionary. An evolutionary random forest model was developed to identify the most significant input parameters. The maximum tangential stress, elastic energy index, and uniaxial tensile stress were then selected by this feature selection technique (i.e., evolutionary random forest) to develop the optimized naïve Bayes models. The performance of the models was assessed using various criteria as well as a simple ranking system. The results showed that particle swarm optimization was the most effective technique for improving the accuracy of the naïve Bayes model for rock-burst prediction (cumulative ranking = 21), while the backward technique was the worst weight optimization technique (cumulative ranking = 11). All the optimized naïve Bayes models identified the maximum tangential stress as the most significant parameter in predicting rock-burst failures. These results demonstrate that the particle swarm optimization technique may improve the accuracy of naïve Bayes algorithms in predicting rock-burst occurrence.

Directory of Open Access Journals

Federation ResearchOnline

Comparison of CNN Classification Model using Machine Learning with Bayesian Optimizer

Author: Afitian M. Yahya Firza
Eni Arofah Dyiyah Kresna
Setyawan Anggi
Surono Sugiyarto
Thobirin Aris
Publication venue: Ital Publication
Publication date: 01/09/2023

One of the best-known and most frequently used areas of Deep Learning in image processing is the Convolutional Neural Network (CNN), which has architectural designs such as Inceptionv3, DenseNet201, Resnet50, and MobileNet used in image classification and pattern recognition. The CNN extracts features from the image according to the designed architecture and performs classification through the fully connected layer, which executes the Machine Learning (ML) algorithm tasks. Commonly used ML algorithms include Naive Bayes (NB), k-Nearest Neighbor (k-NN), Support Vector Machine (SVM), and Decision Tree (DT). This research was motivated by AI model development and the need for a system to diagnose COVID-19 quickly and accurately. The aim was to pair the aforementioned CNN models with ML classification algorithms and compare the models’ accuracy before and after Bayesian optimization, using a total of 2000 CXR lung images. The data were split into 80% for training and 20% for testing, and the features extracted by the CNN were assigned to four different ML models for classification, with Bayesian optimization used to seek the best accuracy. The best classification was produced by the MobileNetV2-SVM structure, with an accuracy of 93%. The accuracy obtained using the SVM algorithm is therefore higher than that of the other three ML algorithms. DOI: 10.28991/HIJ-2023-04-03-05

Directory of Open Access Journals

HighTech and Innovation Journal

Oil and Gas flow Anomaly Detection on offshore naturally flowing wells using Deep Neural Networks

Author: Bayazitova Guzel
Publication date: 23/10/2023

Dissertation presented as the partial requirement for obtaining a Master's degree in Data Science and Advanced Analytics, specialization in Data Science.
The Oil and Gas industry faces more challenges than ever before. It is criticized as dirty and polluting, which drives demand for green alternatives. Nevertheless, the world still has to rely heavily on hydrocarbons, since they are the most traditional and stable source of energy, as opposed to the extensively promoted hydro, solar or wind power. Major operators are challenged to produce oil more efficiently, to compete with newly arising energy sources, with a smaller climate footprint and more scrutinized expenditure, while facing high skepticism regarding the industry's future. The industry has to become greener, and hence to act in a manner not required previously. While most of the tools used by the hydrocarbon E&P industry are expensive and have been used for many years, it is paramount for the industry’s survival and prosperity to apply predictive maintenance technologies that foresee potential failures, making production safer, lowering downtime, increasing productivity and diminishing maintenance costs. Many efforts have been made to define the most accurate and effective predictive methods; however, data scarcity limits the speed and capacity of further experimentation. Whilst it would be highly beneficial for the industry to invest in Artificial Intelligence, this research aims at exploring, in depth, the subject of Anomaly Detection, using the open public data from Petrobras, which was developed by experts. For this research, Deep Learning Neural Networks, such as Recurrent Neural Networks with LSTM and GRU backbones, were implemented for multi-class classification of undesirable events on naturally flowing wells. Further, several hyperparameter optimization tools were explored, mainly focusing on Genetic Algorithms as being the most advanced methods for this kind of task.
The research concluded that the best-performing algorithm used 2 stacked GRUs and the following vector of hyperparameter values: [1, 47, 40, 14], which stand for timestep 1, number of hidden units 47, number of epochs 40 and batch size 14, producing F1 equal to 0.97%. As the world faces many issues, one of which is the detrimental effect of heavy industries on the environment and, as a result, adverse global climate change, this project is an attempt to contribute to the field of applying Artificial Intelligence in the Oil and Gas industry, with the intention to make it more efficient, transparent and sustainable
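
The hyperparameter vector reported in this abstract can be read as a fixed-order encoding of four settings. The following is a minimal sketch of how such a vector might be decoded into named fields; the class name `GruHyperparameters` and the function `decode_chromosome` are hypothetical names chosen for illustration and do not come from the dissertation, which does not describe its encoding beyond the field order.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class GruHyperparameters:
    """Named view of the four values in the reported vector (hypothetical type)."""
    timestep: int
    hidden_units: int
    epochs: int
    batch_size: int


def decode_chromosome(genes: Sequence[int]) -> GruHyperparameters:
    """Map a 4-element vector to named hyperparameters.

    The order (timestep, hidden units, epochs, batch size) follows the
    abstract; the validation rules below are assumptions of this sketch.
    """
    if len(genes) != 4:
        raise ValueError("expected exactly 4 genes")
    values = [int(g) for g in genes]
    if min(values) < 1:
        raise ValueError("all hyperparameters must be positive integers")
    timestep, hidden_units, epochs, batch_size = values
    return GruHyperparameters(timestep, hidden_units, epochs, batch_size)


# The vector quoted in the abstract.
best = decode_chromosome([1, 47, 40, 14])
print(best)
```

Keeping the decoding in one place makes the gene order explicit, which matters when a genetic algorithm mutates positions of the vector independently.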

Repositório da Universidade Nova de Lisboa

A Machine Learning Based Method for Customer Behavior Prediction

Author: Jing Li
Lei Huang
Shuxiao Pan
Xin Zhu
Publication venue: 'Mechanical Engineering Faculty in Slavonski Brod'
Publication date: 01/01/2019

In a data-driven environment, market competition is increasingly fierce. Enterprises are beginning to pay attention to precise marketing to reduce costs and improve marketing efficiency and competitiveness. E-mail marketing is widely used by enterprises because of its low cost and wide audience. This paper uses machine-learning techniques such as decision trees, cluster analysis and the Naive Bayes algorithm to analyze customer characteristics and attributes with historical purchase records, and further analyzes the key factors that affect potential customers' purchase behavior by selecting models with a high promotion degree through a promotion graph, to realize accurate marketing. The results show that the prediction effect of the decision tree is better than that of cluster analysis and the Naive Bayes algorithm, and that it has a higher promotion degree. Customers who are 45-55 years old and commute 1-2 kilometers are more likely to make purchases if they do not have a car or have a car at home

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Improving stacking methodology for combining classifiers: applications to cosmetic industry

Author: Gomes Charles
Nocairi Hisham
Saporta Gilbert
Thomas Marie
Publication venue: HAL CCSD
Publication date: 14/10/2016

Stacking (Wolpert (1992), Breiman (1996)) is known to be a successful way of linearly combining several models. We modify the usual stacking methodology when the response is binary and predictions are highly correlated, by combining predictions with PLS-Discriminant Analysis instead of ordinary least squares. For small data sets we develop a strategy based on repeated split samples in order to select relevant variables and ensure the robustness of the final model. Five base (or level-0) classifiers are combined in order to get an improved rule, which is applied to a classical benchmark from the UCI Machine Learning Repository. Our methodology is then applied to the prediction of dangerousness of 165 chemicals used in the cosmetic industry, described by 35 in vitro and in silico characteristics, since, faced with safety constraints, one cannot rely on a single prediction method, especially when the sample size is low

HAL Descartes

Hal-Diderot

Information Geometrically Generalized Covariate Shift Adaptation

Author: Hino Hideitsu
Kimura Masanari
Publication venue: 'MIT Press - Journals'
Publication date: 18/04/2023

Many machine learning methods assume that the training and test data follow the same distribution. However, in the real world, this assumption is very often violated. In particular, the phenomenon in which the marginal distribution of the data changes is called covariate shift, one of the most important research topics in machine learning. We show that the well-known family of covariate shift adaptation methods is unified in the framework of information geometry. Furthermore, we show that parameter search for the geometrically generalized covariate shift adaptation method can be achieved efficiently. Numerical experiments show that our generalization can achieve better performance than the existing methods it encompasses
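
For readers unfamiliar with covariate shift adaptation, the sketch below illustrates plain importance weighting, a standard baseline in which each training loss is reweighted by the ratio of test to training input densities. It is not the information-geometric generalization proposed in the paper. The Gaussian densities, the function names, and the flattening exponent `lam` are assumptions made for illustration; in practice the density ratio is usually unknown and has to be estimated.

```python
import math
from typing import Callable, List, Sequence

Density = Callable[[float], float]


def gaussian_pdf(mean: float, std: float) -> Density:
    """Return the density function of a 1-D Gaussian (assumed known here)."""
    def pdf(x: float) -> float:
        z = (x - mean) / std
        return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))
    return pdf


def importance_weights(xs: Sequence[float], p_train: Density,
                       p_test: Density, lam: float = 1.0) -> List[float]:
    """Compute (p_test(x) / p_train(x)) ** lam for each training input.

    lam = 1 gives full importance weighting; lam = 0 gives uniform weights
    (ordinary empirical risk). Values in between flatten the weights.
    """
    return [(p_test(x) / p_train(x)) ** lam for x in xs]


def weighted_risk(losses: Sequence[float], weights: Sequence[float]) -> float:
    """Importance-weighted empirical risk: the mean of weight * loss."""
    if len(losses) != len(weights):
        raise ValueError("losses and weights must have the same length")
    return sum(w * l for w, l in zip(weights, losses)) / len(losses)


# Hypothetical setting: training inputs ~ N(0, 1), test inputs ~ N(1, 1).
p_train = gaussian_pdf(0.0, 1.0)
p_test = gaussian_pdf(1.0, 1.0)
train_xs = [-1.0, 0.0, 0.5, 2.0]
weights = importance_weights(train_xs, p_train, p_test)
print(weights)
```

Training points that are more likely under the test distribution receive larger weights, so the weighted risk approximates the risk that would be measured on test data.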

arXiv.org e-Print Archive
