216,805 research outputs found

    Robust linear and support vector regression

    Full text link

    Comparing Machine Learning and Logistic Regression Methods for Predicting Hypertension Using a Combination of Gene Expression and Next-Generation Sequencing Data

    Get PDF
    Machine learning methods continue to show promise in the analysis of data from genetic association studies because of the high number of variables relative to the number of observations. However, few best practices exist for the application of these methods. We extend a recently proposed supervised machine learning approach for predicting disease risk by genotypes to be able to incorporate gene expression data and rare variants. We then apply 2 different versions of the approach (radial and linear support vector machines) to simulated data from Genetic Analysis Workshop 19 and compare performance to logistic regression. Method performance was not radically different across the 3 methods, although the linear support vector machine tended to show small gains in predictive ability relative to a radial support vector machine and logistic regression. Importantly, as the number of genes in the models was increased, even when those genes contained causal rare variants, model predictive ability showed a statistically significant decrease in performance for both the radial support vector machine and logistic regression. The linear support vector machine showed more robust performance to the inclusion of additional genes. Further work is needed to evaluate machine learning approaches on larger samples and to evaluate the relative improvement in model prediction from the incorporation of gene expression data

    Large Crowd Count Based on Improved SURF Algorithm

    Get PDF
    This paper uses an analysis of Speeded up Robust Feature (SURF), based on the method of Linear Interpolation for camera distortion calibration, for high-density crowd counting. The eigenvalues are built on the Gray Level Co-occurrence Matrix (GLCM) features and the SURF features. Though the method of linear interpolation, weight values are interpolated to reduce the error, which is caused by camera distortion calibration. The optimized crowd’s feature vector can be got then. Through the method of support vector regression, the crowd’s number can be forecast by training model. The experiment result shows that the method of this paper has a higher accuracy than the previous methods

    â„“

    Get PDF
    Linear multiple kernel learning model has been used for predicting financial time series. However, ℓ1-norm multiple support vector regression is rarely observed to outperform trivial baselines in practical applications. To allow for robust kernel mixtures that generalize well, we adopt ℓp-norm multiple kernel support vector regression (1≤p<∞) as a stock price prediction model. The optimization problem is decomposed into smaller subproblems, and the interleaved optimization strategy is employed to solve the regression model. The model is evaluated on forecasting the daily stock closing prices of Shanghai Stock Index in China. Experimental results show that our proposed model performs better than ℓ1-norm multiple support vector regression model

    Estimation of walking energy expenditure by using support vector regression

    Full text link
    This paper develops a new predictor of walking energy expenditure from wireless measurements of body movements using triaxial accelerometers. Reliable data were collected from repeated walking experiments in different conditions on a treadmill with simultaneous measurement of expired oxygen and carbon dioxide. Support vector regression, a powerful non-linear regression method, was used to process and model the data. This novel processing method sets this investigation apart from existing papers. Good results were achieved in the robust estimation of walking related energy expenditure from a number of variables derived from triaxial accelerometer and treadmill speed. ©2005 IEEE
    • …
    corecore