149,507 research outputs found
Optimization of Support Vector Machine Method Using Feature Selection to Improve Classification Results
The performance of the organizations or companiesare based on the qualities possessed by their employee. Both of good or bad employee performance will have an impact on productivity and the impact of profits obtained by the company. Support Vector Machine (SVM) is a machine learning method based on statistical learning theory and can solve high non-linearity, regression, etc. In machine learning, the optimization model is a part for improving the accuracy of the model for data learning. Several techniques are used, one of which is feature selection, namely reducing data dimensions so that it can reduce computation in data modeling. This study aims to apply the method of machine learning to the employee data of the Bank Rakyat Indonesia (BRI) company. The method used is SVM method by increasing the accuracy of learning data by using a feature selection technique using a wrapper algorithm. From the results of the classification test, the average accuracy obtained is 72 percent with a precision value of 71 and the recall value is rounded off to 72 percent, with a combination of SVM and cross-validation. Data obtained from Kaggle data, which consists of training data and testing data. each consisting of 30 columns and 22005 rows in the training data and testing data consisting of 29 col-umns and 6000 rows. The results of this study get a classification score of 82 percent. The precision value obtained is rounded off to 82 percent, a recall of 86 percent and an f1-score of 81 percent
Discretized conformal prediction for efficient distribution-free inference
In regression problems where there is no known true underlying model,
conformal prediction methods enable prediction intervals to be constructed
without any assumptions on the distribution of the underlying data, except that
the training and test data are assumed to be exchangeable. However, these
methods bear a heavy computational cost-and, to be carried out exactly, the
regression algorithm would need to be fitted infinitely many times. In
practice, the conformal prediction method is run by simply considering only a
finite grid of finely spaced values for the response variable. This paper
develops discretized conformal prediction algorithms that are guaranteed to
cover the target value with the desired probability, and that offer a tradeoff
between computational cost and prediction accuracy
Recommended from our members
Out of School and Off Track: The Overuse of Suspensions in American Middle and High Schools
Analyzing data from over 26,000 U.S. middle and high schools, the report reveals profound disparities in suspension rates when disaggregating data by race/ethnicity, gender, and disability status. The report identifies districts with the largest number of "hotspot" schools (suspending 25 percent or more of their total student body), suggests alternatives that are already in use, and highlights civil rights concerns
- …