
Multi-Instance Multi-Label Learning

Author: Zhi-Hua Zhou
Min-Ling Zhang
Sheng-Jun Huang
Yu-Feng Li
Publication venue: 'Elsevier BV'
Publication date: 23/10/2011

In this paper, we propose the MIML (Multi-Instance Multi-Label learning) framework where an example is described by multiple instances and associated with multiple class labels. Compared to traditional learning frameworks, the MIML framework is more convenient and natural for representing complicated objects which have multiple semantic meanings. To learn from MIML examples, we propose the MimlBoost and MimlSvm algorithms based on a simple degeneration strategy, and experiments show that solving problems involving complicated objects with multiple semantic meanings in the MIML framework can lead to good performance. Considering that the degeneration process may lose information, we propose the D-MimlSvm algorithm which tackles MIML problems directly in a regularization framework. Moreover, we show that even when we do not have access to the real objects and thus cannot capture more information from real objects by using the MIML representation, MIML is still useful. We propose the InsDif and SubCod algorithms. InsDif works by transforming single instances into the MIML representation for learning, while SubCod works by transforming single-label examples into the MIML representation for learning. Experiments show that in some tasks they are able to achieve better performance than learning the single instances or single-label examples directly. Comment: 64 pages, 10 figures; Artificial Intelligence, 201
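
The MIML representation and the idea of degenerating it to a simpler problem can be illustrated with a minimal sketch. The abstract does not spell out the degeneration steps, so the code below assumes one plausible reading: each (bag, label set) example is expanded into one binary multi-instance example per candidate label. The function name `degenerate_to_multi_instance` and the toy data are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Set, Tuple

Instance = Sequence[float]   # one feature vector, e.g. an image region
Bag = List[Instance]         # one object described by several instances


def degenerate_to_multi_instance(
    examples: List[Tuple[Bag, Set[str]]],
    label_space: Sequence[str],
) -> List[Tuple[Bag, str, int]]:
    """Expand each MIML example into one binary example per candidate label.

    Assumed reading of "degeneration": the output triple is
    (bag, label, +1 if the label applies to the bag, else -1).
    """
    out: List[Tuple[Bag, str, int]] = []
    for bag, labels in examples:
        for label in label_space:
            out.append((bag, label, 1 if label in labels else -1))
    return out


# Toy MIML example: one image with two regions and two labels.
image = [[0.1, 0.9], [0.7, 0.2]]
miml_data = [(image, {"lion", "grass"})]
pairs = degenerate_to_multi_instance(miml_data, ["lion", "grass", "tree"])
for _, label, target in pairs:
    print(label, target)
```

Each resulting triple could then be handed to an ordinary multi-instance learner, which is where information about correlations between labels may be lost.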

arXiv.org e-Print Archive

CiteSeerX

Elsevier - Publisher Connector

Crossref

Built to Last or Built Too Fast? Evaluating Prediction Models for Build Times

Author: Baysal Olga
Bisong Ekaba
Tran Eric
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 29/06/2017

Automated builds are integral to the Continuous Integration (CI) software development practice. In CI, developers are encouraged to integrate early and often. However, long build times can be an issue when integrations are frequent. This research focuses on finding a balance between integrating often and keeping developers productive. We propose and analyze models that can predict the build time of a job. Such models can help developers to better manage their time and tasks. Also, project managers can explore different factors to determine the best setup for a build job that will keep the build wait time to an acceptable level. Software organizations transitioning to CI practices can use the predictive models to anticipate build times before CI is implemented. The research community can modify our predictive models to further understand the factors and relationships affecting build times. Comment: 4-page version published in the Proceedings of the IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), pages 487-490. MSR 201

arXiv.org e-Print Archive

Crossref

Carleton University's Institutional Repository

A Feature Selection Method for Multivariate Performance Measures

Author: Mao Qi
Tsang Ivor W.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013

Feature selection with specific multivariate performance measures is the key to the success of many applications, such as image retrieval and text classification. The existing feature selection methods are usually designed for classification error. In this paper, we propose a generalized sparse regularizer. Based on the proposed regularizer, we present a unified feature selection framework for general loss functions. In particular, we study the novel feature selection paradigm by optimizing multivariate performance measures. The resultant formulation is a challenging problem for high-dimensional data. Hence, a two-layer cutting plane algorithm is proposed to solve this problem, and the convergence is presented. In addition, we adapt the proposed method to optimize multivariate measures for multiple instance learning problems. The analyses by comparing with the state-of-the-art feature selection methods show that the proposed method is superior to others. Extensive experiments on large-scale and high-dimensional real world datasets show that the proposed method outperforms $l_1$-SVM and SVM-RFE when choosing a small subset of features, and achieves significantly improved performances over SVM$^{perf}$ in terms of $F_1$-score.
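
As a brief illustration of why the F_1-score counts as a multivariate performance measure, the sketch below computes it from counts taken over the whole prediction set; unlike classification error, it cannot be written as a sum of per-example losses. This is the standard definition of F_1, not the authors' optimization method, and the helper name `f1_score` and the toy labels are chosen here for illustration only.

```python
from typing import Sequence


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1 = 2*TP / (2*TP + FP + FN), with labels in {0, 1}."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    # Convention assumed here: return 0.0 when there are no positives at all.
    return 2 * tp / denom if denom else 0.0


y_true = [1, 1, 0, 0, 1]
y_pred = [1, 0, 0, 1, 1]
score = f1_score(y_true, y_pred)  # TP=2, FP=1, FN=1
print(round(score, 4))
```

Because the counts couple all predictions together, changing a single prediction shifts the score by an amount that depends on every other prediction.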

arXiv.org e-Print Archive

CiteSeerX

Crossref

OPUS - University of Technology Sydney

DR-NTU (Digital Repository of NTU)

A review on applications of wavelet transform and artificial intelligence systems in fault diagnosis of rotating machinery

Author: Ali Saud Al Tobi Maamer
Bevan Geraint
Harrison David
Ramachandran K. P.
Wallace Peter
Publication date: 24/10/2016

ResearchOnline@GCU