2 research outputs found

    Comparison of k-Nearest Neighbor and Naive Bayes Methods for SNP Data Classification

    Get PDF
    In an accident, sometimes the identity of a person who has an accident is hard to know, so it is necessary to use biological data such as Single Nucleotide Polymorphism (SNP) data to identify the person's origin. This research aims to compare the accuracy and the F1 score of the k-Nearest Neighbor method and the Naive Bayes method in classifying SNP data from 120 people who divide into groups, namely European (CEU) and Yoruba (YRI). Determination of the best method based on the average value of accuracy and the average value of F1 score from 1000 iterations with various percentage distributions of training datasets and testing datasets. In this research, the selection of SNP locations for the classification process was carried out by correlation analysis. The average accuracy obtained for the k-Nearest Neighbor method with the value of k=31 is 98.38% where the average F1 score is 98.39% while the Naive Bayes method obtained the average accuracy of 96.74% and the average F1 score of 96.63%. In this case, the k-Nearest Neighbor method is better than the Naive Bayes method in classifying SNP data to determine the origin of a person's ancestor tends to be from CEU or YRI

    Susceptible Vaccine Infected Removed (SVIR) Model for COVID-19 Cases in Indonesia

    No full text
    Analysis of data on COVID-19 cases in Indonesia is shown by using the Susceptible Vaccine Infected Removed (SVIR) in this article. In the previous research, cases in the period March-May 2021 were studied, and the reproduction number was computed based on the Susceptible Infected Removed (SIR) model. The prediction did not agree with the real data. Therefore the objective of this article is to improve the model by adding the vaccine variable leading to the new model called the SVIR model as the novelty of this article. The used data are collected from COVID-19 cases of the Indonesian population published by the Indonesian government from March 2020-April 2022. However, the vaccinated persons with COVID-19 cases have been recorded since January 2022. Therefore the models rely on the period January 2021-March 2022, where the parameters in the SIR and SVIR models are determined in this period. The method used is discretizing the models into linear systems, and these systems are solved by Ordinary Least Square (OLS) for time-dependent parameters. It is assumed that the birth rate and death rate in the considered period are constant. Additionally, individuals who have recovered from COVID-19 will not be infected again, and vaccination is not necessarily twice. Furthermore, individuals who have been vaccinated will not be infected with the COVID-19 virus. The SVIR model has captured 3 waves of COVID-19 cases that are appropriate to the real situation in Indonesia from January 2021-March 2022. Additionally, the reproduction numbers as functions of time have been generated. The fluctuations of reproduction numbers agree with the real data. For further research, different regions such as districts in Java and other islands will also be analyzed as the implication of this research
    corecore