Search CORE

24,166 research outputs found

Is One Hyperparameter Optimizer Enough?

Author: Bergstra J.
Bergstra J.
Lewis C.
Menzies T.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 02/10/2018
Field of study

Hyperparameter tuning is the black art of automatically finding a good combination of control parameters for a data miner. While widely applied in empirical Software Engineering, there has not been much discussion on which hyperparameter tuner is best for software analytics. To address this gap in the literature, this paper applied a range of hyperparameter optimizers (grid search, random search, differential evolution, and Bayesian optimization) to defect prediction problem. Surprisingly, no hyperparameter optimizer was observed to be `best' and, for one of the two evaluation measures studied here (F-measure), hyperparameter optimization, in 50\% cases, was no better than using default configurations. We conclude that hyperparameter optimization is more nuanced than previously believed. While such optimization can certainly lead to large improvements in the performance of classifiers used in software analytics, it remains to be seen which specific optimizers should be applied to a new dataset.Comment: 7 pages, 2 columns, accepted for SWAN1

arXiv.org e-Print Archive

Crossref

Pseudo-3D RSSI-based WSN localization algorithm using linear regression

Author: Cheung
Cleveland
Gao
Gu
Hashemi
Higgens
Koyuncu
Liu
Luo
McElduff
Nazareth
Neter
Pandey
Patwari
Pivato
Sarkar
Takashima
Vanheel
Publication venue: 'Wiley'
Publication date: 01/01/2015
Field of study

Crossref

Ghent University Academic Bibliography

Predicting and Evaluating Software Model Growth in the Automotive Industry

Author: Ali Mohammad
Berger Christian
Herpel Thomas
Knauss Alessia
Preenja Harri
Schroeder Jan
Staron Miroslaw
Publication venue
Publication date: 01/01/2017
Field of study

The size of a software artifact influences the software quality and impacts the development process. In industry, when software size exceeds certain thresholds, memory errors accumulate and development tools might not be able to cope anymore, resulting in a lengthy program start up times, failing builds, or memory problems at unpredictable times. Thus, foreseeing critical growth in software modules meets a high demand in industrial practice. Predicting the time when the size grows to the level where maintenance is needed prevents unexpected efforts and helps to spot problematic artifacts before they become critical. Although the amount of prediction approaches in literature is vast, it is unclear how well they fit with prerequisites and expectations from practice. In this paper, we perform an industrial case study at an automotive manufacturer to explore applicability and usability of prediction approaches in practice. In a first step, we collect the most relevant prediction approaches from literature, including both, approaches using statistics and machine learning. Furthermore, we elicit expectations towards predictions from practitioners using a survey and stakeholder workshops. At the same time, we measure software size of 48 software artifacts by mining four years of revision history, resulting in 4,547 data points. In the last step, we assess the applicability of state-of-the-art prediction approaches using the collected data by systematically analyzing how well they fulfill the practitioners' expectations. Our main contribution is a comparison of commonly used prediction approaches in a real world industrial setting while considering stakeholder expectations. We show that the approaches provide significantly different results regarding prediction accuracy and that the statistical approaches fit our data best

arXiv.org e-Print Archive

Crossref

Chalmers Research

Standardization of electroencephalography for multi-site, multi-platform and multi-investigator studies: Insights from the canadian biomarker integration network in depression

Author: Alonso Esther
Arnott Stephen R.
Atluri Sravya
Blumberger Daniel
Brenner Colleen A.
Daskalakis Zafiris J.
Dhami Prabhjot
Dharsee Moyez
Evans Kenneth R.
Farzan Faranak
Frehlich Matthew
Frey Benicio N.
Kennedy Sidney H.
Kleffner Killian
Lam Raymond W.
Liotti Mario
Mcandrews Mary Pat
Milev Roumen
Price Rae
Ravindran Arun
Rotzinger Susan
Vila-Rodriguez Fidel
Wong Willy
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Subsequent to global initiatives in mapping the human brain and investigations of neurobiological markers for brain disorders, the number of multi-site studies involving the collection and sharing of large volumes of brain data, including electroencephalography (EEG), has been increasing. Among the complexities of conducting multi-site studies and increasing the shelf life of biological data beyond the original study are timely standardization and documentation of relevant study parameters. We presentthe insights gained and guidelines established within the EEG working group of the Canadian Biomarker Integration Network in Depression (CAN-BIND). CAN-BIND is a multi-site, multi-investigator, and multiproject network supported by the Ontario Brain Institute with access to Brain-CODE, an informatics platform that hosts a multitude of biological data across a growing list of brain pathologies. We describe our approaches and insights on documenting and standardizing parameters across the study design, data collection, monitoring, analysis, integration, knowledge-translation, and data archiving phases of CAN-BIND projects. We introduce a custom-built EEG toolbox to track data preprocessing with open-access for the scientific community. We also evaluate the impact of variation in equipment setup on the accuracy of acquired data. Collectively, this work is intended to inspire establishing comprehensive and standardized guidelines for multi-site studies

Archivio istituzionale della ricerca - Università di Padova