189 research outputs found
Comparison the Error Rate of Autoregressive Distributed Lag (ARDL) and Vector Autoregressive (VAR) (Case Study: Forecast of Export Quantities in DIY)
Forecasting is estimating the size or number of something in the future. Regression model that enters current independent variable value, and lagged value is called distributed-lag model, if it enters one or more lagged value, it is called autoregressive. Koyck method is used for dynamic model which the lagged length is unknown, for the known lagged length it is used the Almon method. Vector Autoregressive (VAR) is a method that explains every variable in the model depend on the lag movement from the variable itself and all the others variable. This research aimed to explain the application of Autoregressive distributed-lag model and Vector Autoregressive (VAR) method for the forecasting for export amount in DIY. It takes export amount in DIY and inflation data, kurs, and Indonesias foreign exchange reserve. Forecasting formation: defining Koyck and Almon distributed-lag dynamic model, then the best model is chosen and distribution-lag dynamic forecasting is performed. After that it is performed stationary test, co-integration test, optimal lag examination, granger causality test, parameter estimation, VAR model stability, and performs forecasting with VAR method. The forecasting result shows MAPE value from ARDL method obtained is 0.475812%, while MAPE value from VAR method is 0.464473%. Thus it can be concluded that Vector Autoregressive (VAR) method is more effective to be used in case study of export amount in DIY forecasting
Distance Functions Study in Fuzzy C-Means Core and Reduct Clustering
Fuzzy C-Means is a distance-based clustering process which applied by fuzzy logic concept. Clustering process worked in linear to the iteration process to minimizing the objective function. The objective function is an addition of the multiplication between the coordinates distance towards their closest cluster centroid and their membership degree. The more the iteration process, the objective function should get lower and lower. The objective of this research is to observe whether the distances which usually applied are able to fulfill the aforementioned hypothesis for determining the most suitable distance for Fuzzy C-Means clustering application. Few distance function was applied in the same dataset. 5 standard datasets and 2 random datasets were used to test the fuzzy c-means clustering performance with the 7 different distance function. Accuracy, purity, and Rand Index also applied to measure the quality of the resulted cluster. The observation result depicted that the distance function which resulted in the best quality of clusters are Euclidean, Average, Manhattan, Minkowski, Minkowski-Chebisev, and Canberra distance. These 6 distances were able to fulfill the basic hypothesis of the objective function behavior on Fuzzy C-Means Clustering method. The only distance who were not able to fulfill the basic hypothesis is Chebisev distance
Comparasional of the M, MM and S Estimatory in Rabust Regression Analysis on Indonesian Literacy Index Data 2018
Regression analysis is a method used to determine the relationship between one dependent variable and one or more independent variables. However, the existence of outliers in the 2018 Community Literacy Development Index data led to the application of statistical methods not sensitive to pencils for analysis. This was the reason for adopting robust regression methods, including the M, S, and MM estimations. Therefore, this research aims to compare these three estimates and select the one with the best estimate based on the parameter estimation model associated with the RSE and R2 values. Descriptive and inferential analysis with robust regression was used due to several outlier data and to provide good regression model results with unbiased values. It was discovered that the S-estimator and MM-estimator are the best methods because they have the most minor Residual Standard Error (RSE) of 1.856 and R2 of 0.9778
Optimization of markov weighted fuzzy time series forecasting using genetic algorithm (GA) AND particle Swarm Optimization (PSO)
The Markov Weighted Fuzzy Time Series (MWFTS) is a method for making predictions based on
developing a fuzzy time series (FTS) algorithm. The MWTS has overcome certain limitations of
FTS, such as repetition of fuzzy logic relationships and weight considerations of fuzzy logic
relationships. The main challenge of the MWFTS method is the absence of standardized rules for
determining partition intervals. This study compares the MWFTS model to the partition methods
Genetic Algorithm-Fuzzy K-Medoids clustering (GA-FKM) and Fuzzy K-Medoids clusteringParticle Swarm Optimization (FKM-PSO) to solve the problem of determining the partition interval
and develop an algorithm. Optimal partition optimization. The GA optimization algorithm’s
performance on GA-FKM depends on optimizing the clustering of FKM to obtain the most
significant partition interval. Implementing the PSO optimization algorithm on FKM-PSO involves
maximizing the interval length following the FKM procedure. The proposed method was applied to
Anand Vihar, India’s air quality data. The MWFTS method combined with the GA-FKM
partitioning method reduced the mean absolute square error (MAPE) from 17.440 to 16.85%. While
the results of forecasting using the MWFTS method in conjunction with the FKM-PSO partition
method were able to reduce the MAPE percentage from 9.78% to 7.58%, the MAPE percentage was
still 9.78%. Initially, the root mean square error (RMSE) score for the GA-FKM partitioning
technique was 48,179 to 47,01. After applying the FKM-PSO method, the initial RMSE score of
30,638 was reduced to 24,863
An epidemic model with viral mutations and vaccine interventions
Prediction is a means of forecasting a future value by using and analyzing historical or current data. A popular neural network
architecture used as a prediction model is the Recurrent Neural Network (RNN) because of its wide application and very high
generalization performance. This study aims to improve the RNN prediction model’s accuracy using k-means grouping and
PCA dimension reduction methods by comparing the five distance functions. Data were processed using Python software
and the results obtained from the PCA calculation yielded three new variables or principal components out of the five
examined. This study used an optimized RNN prediction model with k-means clustering by comparing the Euclidean,
Manhattan, Canberra, Average, and Chebyshev distance functions as a measure of data grouping similarity to avoid being
trapped in the local optimal solution. In addition, PCA dimension reduction was also used in facilitating multivariate data
analysis. The k-means grouping showed that the most optimal distance is the average function producing a DBI value of
0.60855 and converging at the 9th iteration. The RNN prediction model results evaluated based on the number of RMSE
errors which was 0.83, while that of MAPE was 8.62%. Therefore, it was concluded that the K-means and PCA methods
generated a more optimal prediction model for the RNN metho
- …