Search CORE

3 research outputs found

Large scale data analysis using MLlib

Author: Abdulghafoor Mohammed Mostafa
Hussein Ali Ahmed
Khamees Khaleel Mohammed
Nawaf Abbod Maan
Sutikno Tole
Publication venue: Universitas Ahmad Dahlan
Publication date: 01/10/2021

Recent advancements in the internet, social media, and internet of things (IoT) devices have significantly increased the amount of data generated in a variety of formats. The data must be converted into a format that is easily handled by data analysis techniques. Applying machine learning algorithms to big and complicated data sets is computationally expensive: it is a resource-intensive process that requires a large amount of logical and physical resources. Machine learning is a sophisticated data analytics technology that has gained importance as a result of the massive amount of data generated daily that needs to be examined. The Apache Spark machine learning library (MLlib) is one of the big data analysis platforms; it provides a variety of functions for machine learning tasks, spanning from classification to regression and dimension reduction. From a computational standpoint, this research investigated Apache Spark MLlib 2.0 as an open source, autonomous, scalable, and distributed learning library. Several real-world machine learning experiments are carried out to evaluate the properties of the platform on a qualitative and quantitative level. Some of the fundamental concepts and approaches for developing a scalable data model in a distributed environment are also discussed.

Journal of Education and Learning (EduLearn)

TELKOMNIKA (Telecommunication Computing Electronics and Control)

UAD Journal Management System

A new model for iris classification based on Naïve Bayes grid parameters optimization

Author: Ali Ahmed Hussein
Khaleel Mohammad Khamees
Mohammed Mostafa Abdulghfoor
Salih Al-Hakam Ayad
Salman Saba Abdul-baqi
Publication venue: International Journal of Sciences: Basic and Applied Research
Publication date: 11/08/2018

Data mining classification plays an important role in the prediction of outcomes. One of the outstanding classification methods in data mining is Naive Bayes Classification (NBC). It is capable of predicting results and is often more effective than other classification methods. However, many Naive Bayes classification methods perform poorly on classification and regression problems. One reason for this is the assumption of conditional independence among predictors, together with the choice of initial hyperparameters; this strong assumption leads to a loss of accuracy. In this study, a new method for boosting the accuracy of NBC was proposed. The proposed technique uses a grid search to improve the accuracy of Naïve Bayes classification.

GSSRR.ORG: International Journals: Publishing Research Papers in all Fields

Similarity paper report-Large scale data analysis using MLlib

Author: Abdulghafoor Mohammed Mostafa
Hussein Ali Ahmed
Khamees Khaleel Mohammed
Nawaf Abbod Maan
Sutikno Tole
Publication venue: Institute of Advanced Engineering and Science
Publication date

Universitas Ahmad Dahlan Repository