19,319 research outputs found
Use Case Point Approach Based Software Effort Estimation using Various Support Vector Regression Kernel Methods
The job of software effort estimation is a critical one in the early stages
of the software development life cycle when the details of requirements are
usually not clearly identified. Various optimization techniques help in
improving the accuracy of effort estimation. The Support Vector Regression
(SVR) is one of several different soft-computing techniques that help in
getting optimal estimated values. The idea of SVR is based upon the computation
of a linear regression function in a high dimensional feature space where the
input data are mapped via a nonlinear function. Further, the SVR kernel methods
can be applied in transforming the input data and then based on these
transformations, an optimal boundary between the possible outputs can be
obtained. The main objective of the research work carried out in this paper is
to estimate the software effort using use case point approach. The use case
point approach relies on the use case diagram to estimate the size and effort
of software projects. Then, an attempt has been made to optimize the results
obtained from use case point analysis using various SVR kernel methods to
achieve better prediction accuracy.Comment: 13 pages, 6 figures, 11 Tables, International Journal of Information
Processing (IJIP
Estimating Photometric Redshifts Using Support Vector Machines
We present a new approach to obtaining photometric redshifts using a kernel
learning technique called Support Vector Machines (SVMs). Unlike traditional
spectral energy distribution fitting, this technique requires a large and
representative training set. When one is available, however, it is likely to
produce results that are comparable to the best obtained using template fitting
and artificial neural networks. Additional photometric parameters such as
morphology, size and surface brightness can be easily incorporated. The
technique is demonstrated using samples of galaxies from the Sloan Digital Sky
Survey Data Release 2 and the hybrid galaxy formation code GalICS. The RMS
error in redshift estimation is for both samples. The strengths and
limitations of the technique are assessed.Comment: 10 pages, 3 figures, to appear in the PASP, minor typos fixed to make
consistent with published versio
Is One Hyperparameter Optimizer Enough?
Hyperparameter tuning is the black art of automatically finding a good
combination of control parameters for a data miner. While widely applied in
empirical Software Engineering, there has not been much discussion on which
hyperparameter tuner is best for software analytics. To address this gap in the
literature, this paper applied a range of hyperparameter optimizers (grid
search, random search, differential evolution, and Bayesian optimization) to
defect prediction problem. Surprisingly, no hyperparameter optimizer was
observed to be `best' and, for one of the two evaluation measures studied here
(F-measure), hyperparameter optimization, in 50\% cases, was no better than
using default configurations.
We conclude that hyperparameter optimization is more nuanced than previously
believed. While such optimization can certainly lead to large improvements in
the performance of classifiers used in software analytics, it remains to be
seen which specific optimizers should be applied to a new dataset.Comment: 7 pages, 2 columns, accepted for SWAN1
Forecasting of commercial sales with large scale Gaussian Processes
This paper argues that there has not been enough discussion in the field of
applications of Gaussian Process for the fast moving consumer goods industry.
Yet, this technique can be important as it e.g., can provide automatic feature
relevance determination and the posterior mean can unlock insights on the data.
Significant challenges are the large size and high dimensionality of commercial
data at a point of sale. The study reviews approaches in the Gaussian Processes
modeling for large data sets, evaluates their performance on commercial sales
and shows value of this type of models as a decision-making tool for
management.Comment: 1o pages, 5 figure
- …