3,306 research outputs found
K-trivial, K-low and MLR-low sequences: a tutorial
A remarkable achievement in algorithmic randomness and algorithmic
information theory was the discovery of the notions of K-trivial, K-low and
Martin-Lof-random-low sets: three different definitions turns out to be
equivalent for very non-trivial reasons. This paper, based on the course taught
by one of the authors (L.B.) in Poncelet laboratory (CNRS, Moscow) in 2014,
provides an exposition of the proof of this equivalence and some related
results. We assume that the reader is familiar with basic notions of
algorithmic information theory.Comment: 25 page
Multilabel Classification with R Package mlr
We implemented several multilabel classification algorithms in the machine
learning package mlr. The implemented methods are binary relevance, classifier
chains, nested stacking, dependent binary relevance and stacking, which can be
used with any base learner that is accessible in mlr. Moreover, there is access
to the multilabel classification versions of randomForestSRC and rFerns. All
these methods can be easily compared by different implemented multilabel
performance measures and resampling methods in the standardized mlr framework.
In a benchmark experiment with several multilabel datasets, the performance of
the different methods is evaluated.Comment: 18 pages, 2 figures, to be published in R Journal; reference
correcte
A novel approach for prediction of vitamin D status using support vector regression
BACKGROUND Epidemiological evidence suggests that vitamin D deficiency is linked to various chronic diseases. However direct measurement of serum 25-hydroxyvitamin D (25(OH)D) concentration, the accepted biomarker of vitamin D status, may not be feasible in large epidemiological studies. An alternative approach is to estimate vitamin D status using a predictive model based on parameters derived from questionnaire data. In previous studies, models developed using Multiple Linear Regression (MLR) have explained a limited proportion of the variance and predicted values have correlated only modestly with measured values. Here, a new modelling approach, nonlinear radial basis function support vector regression (RBF SVR), was used in prediction of serum 25(OH)D concentration. Predicted scores were compared with those from a MLR model. METHODS Determinants of serum 25(OH)D in Caucasian adults (n = 494) that had been previously identified were modelled using MLR and RBF SVR to develop a 25(OH)D prediction score and then validated in an independent dataset. The correlation between actual and predicted serum 25(OH)D concentrations was analysed with a Pearson correlation coefficient. RESULTS Better correlation was observed between predicted scores and measured 25(OH)D concentrations using the RBF SVR model in comparison with MLR (Pearson correlation coefficient: 0.74 for RBF SVR; 0.51 for MLR). The RBF SVR model was more accurately able to identify individuals with lower 25(OH)D levels (<75 nmol/L). CONCLUSION Using identical determinants, the RBF SVR model provided improved prediction of serum 25(OH)D concentrations and vitamin D deficiency compared with a MLR model, in this dataset.Dr. Guo is funded by an Australian Postgraduate Award. Prof. Lucas is funded by a National Health and Medical Research (NHMRC) Career Development
Fellowship and receives research funding from Cancer Australia, NHMRC, and MS Research Australia. Prof. Ponsonby is funded by a NHMRC Research Fellowship
and receives research funding from NHMRC and MS Research Australia. The Ausimmune Study was funded by the US National Multiple Sclerosis Society, NHMRC,
and MS Research Australia
Non-destructive soluble solids content determination for ‘Rocha’ Pear Based on VIS-SWNIR spectroscopy under ‘Real World’ sorting facility conditions
In this paper we report a method to determine the soluble solids content (SSC) of 'Rocha' pear (Pyrus communis L. cv. Rocha) based on their short-wave NIR reflectance spectra (500-1100 nm) measured in conditions similar to those found in packinghouse fruit sorting facilities. We obtained 3300 reflectance spectra from pears acquired from different lots, producers and with diverse storage times and ripening stages. The macroscopic properties of the pears, such as size, temperature and SSC were measured under controlled laboratory conditions. For the spectral analysis, we implemented a computational pipeline that incorporates multiple pre-processing techniques including a feature selection procedure, various multivariate regression models and three different validation strategies. This benchmark allowed us to find the best model/preproccesing procedure for SSC prediction from our data. From the several calibration models tested, we have found that Support Vector Machines provides the best predictions metrics with an RMSEP of around 0.82 ∘ Brix and 1.09 ∘ Brix for internal and external validation strategies respectively. The latter validation was implemented to assess the prediction accuracy of this calibration method under more 'real world-like' conditions. We also show that incorporating information about the fruit temperature and size to the calibration models improves SSC predictability. Our results indicate that the methodology presented here could be implemented in existing packinghouse facilities for single fruit SSC characterization.Funding Agency
CEOT strategic project
UID/Multi/00631/2019
project OtiCalFrut
ALG-01-0247-FEDER-033652
Ideias em Caixa 2010, CAIXA GERAL DE DEPOSITOS
Fundacao para a Ciencia e a Tecnologia (Ciencia)info:eu-repo/semantics/publishedVersio
On the importance of nonlinear modeling in computer performance prediction
Computers are nonlinear dynamical systems that exhibit complex and sometimes
even chaotic behavior. The models used in the computer systems community,
however, are linear. This paper is an exploration of that disconnect: when
linear models are adequate for predicting computer performance and when they
are not. Specifically, we build linear and nonlinear models of the processor
load of an Intel i7-based computer as it executes a range of different
programs. We then use those models to predict the processor loads forward in
time and compare those forecasts to the true continuations of the time seriesComment: Appeared in "Proceedings of the 12th International Symposium on
Intelligent Data Analysis
- …