2,451 research outputs found

Feature selection for microarray gene expression data using simulated annealing guided by the multivariate joint entropy

Authors: Belanche Muñoz Luis Antonio; González Navarro Félix Fernando
Publication date: 01/01/2013

In this work a new way to calculate the multivariate joint entropy is presented. This measure is the basis for a fast information-theoretic evaluation of gene relevance in a microarray gene expression data context. Its low complexity comes from reusing previous computations to calculate the relevance of the current feature. The mu-TAFS algorithm, named as such to differentiate it from previous TAFS algorithms, implements a simulated annealing technique specially designed for feature subset selection. The algorithm is applied to the maximization of gene subset relevance in several public-domain microarray data sets. The experimental results show notably high classification performance and small subsets formed by biologically meaningful genes. Postprint (published version)
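To make the idea in this abstract concrete, the following is a minimal sketch of simulated annealing over feature subsets with an entropy-based score. It is not the mu-TAFS algorithm: the score used here (mutual information between the selected discrete features and the class label, computed from joint entropies, minus a size penalty) and all names and parameters (`relevance`, `anneal_subset`, `penalty`, `cooling`) are assumptions made for illustration. Unlike the method described above, the sketch recomputes every entropy from scratch rather than reusing earlier computations.

```python
import math
import random
from collections import Counter


def joint_entropy(rows):
    """Shannon entropy (in bits) of the empirical distribution of hashable rows."""
    counts = Counter(rows)
    n = len(rows)
    return -sum(c / n * math.log2(c / n) for c in counts.values())


def relevance(X, y, subset):
    """Assumed score: I(X_S; Y) = H(X_S) + H(Y) - H(X_S, Y) on discrete data."""
    if not subset:
        return 0.0
    cols = sorted(subset)
    xs = [tuple(row[j] for j in cols) for row in X]
    xy = [x + (label,) for x, label in zip(xs, y)]
    return joint_entropy(xs) + joint_entropy(y) - joint_entropy(xy)


def anneal_subset(X, y, n_iter=2000, t0=1.0, cooling=0.995, penalty=0.01, seed=0):
    """Simulated annealing over subsets; each move flips one feature in or out."""
    rng = random.Random(seed)
    n_features = len(X[0])

    def score(subset):
        # Hypothetical objective: relevance minus a small per-feature penalty.
        return relevance(X, y, subset) - penalty * len(subset)

    current = set()
    cur_score = score(current)
    best, best_score = set(current), cur_score
    t = t0
    for _ in range(n_iter):
        j = rng.randrange(n_features)
        candidate = current ^ {j}  # toggle membership of feature j
        cand_score = score(candidate)
        delta = cand_score - cur_score
        # Always accept improvements; accept worse moves with Boltzmann probability.
        if delta >= 0 or rng.random() < math.exp(delta / t):
            current, cur_score = candidate, cand_score
            if cur_score > best_score:
                best, best_score = set(current), cur_score
        t *= cooling  # geometric cooling schedule
    return best, best_score
```

For example, on a full factorial design of four binary features where the label equals feature 0, `anneal_subset` should settle on the single-feature subset `{0}`, since the other features carry no information about the label and only add penalty.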

arXiv.org e-Print Archive

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Media planning by optimizing contact frequencies

Authors: Kapsenberg S.; Kloprogge P.; Piersma N.; Wagelmans A.P.M.

In this paper we study a model to estimate the probability that a target group of an advertising campaign is reached by a commercial message a given number of times. This contact frequency distribution is known to be computationally difficult to calculate because of dependence between the viewing probabilities of advertisements. Our model calculates good estimates of contact frequencies in a very short time based on data that is often available. A media planning model that optimizes effective reach as a function of contact frequencies demonstrates the usefulness of the model. Several local search procedures such as tabu search, simulated annealing and genetic algorithms are applied to find a good media schedule. The results show that local search methods are flexible, fast and accurate in finding media schedules for media planning models based on contact frequencies. The contact frequency model is a potentially useful new tool for media planners.
Keywords: optimization; contact frequency; effective reach; media planning
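As a point of reference for what a contact frequency distribution is, here is a minimal sketch of the simplified case in which viewings are independent. This is explicitly not the model from the paper, which addresses the dependence between viewing probabilities. Under the independence assumption the number of contacts follows a Poisson binomial distribution, which a short dynamic program computes exactly. The function names and the `view_probs` input are hypothetical.

```python
def contact_frequency_distribution(view_probs):
    """P(exactly k contacts) for k = 0..n, assuming independent viewings.

    view_probs[i] is the assumed probability that a target-group member
    sees advertisement i.
    """
    dist = [1.0]  # with no advertisements, zero contacts is certain
    for p in view_probs:
        new = [0.0] * (len(dist) + 1)
        for k, mass in enumerate(dist):
            new[k] += mass * (1 - p)   # advertisement not seen
            new[k + 1] += mass * p     # advertisement seen
        dist = new
    return dist


def effective_reach(view_probs, min_contacts):
    """Probability of at least min_contacts contacts under the same assumption."""
    return sum(contact_frequency_distribution(view_probs)[min_contacts:])
```

With two advertisements that are each seen with probability 0.5, the distribution is `[0.25, 0.5, 0.25]` and the effective reach for at least one contact is 0.75. A media schedule could then be scored by `effective_reach` inside a local search, though with dependent viewings this baseline would misestimate the true frequencies.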

Research Papers in Economics

Optimizing Product Line Designs: Efficient Methods and Comparisons

Authors: Belloni Alexandre; Freund Robert M.; Selove Matthew; Simester Duncan
Publication date: 01/01/2005

We compare a broad range of optimal product line design methods. The comparisons take advantage of recent advances that make it possible to identify the optimal solution to problems that are too large for complete enumeration. Several of the methods perform surprisingly well, including Simulated Annealing, Product-Swapping and Genetic Algorithms. The Product-Swapping heuristic is remarkable for its simplicity. The performance of this heuristic suggests that the optimal product line design problem may be far easier to solve in practice than indicated by complexity theory.

CiteSeerX

DSpace@MIT

Application of Global Optimization Methods for Feature Selection and Machine Learning

Authors: Shaohua Wu; Wanneng Shu; Wei Wang; Xinyong Feng; Yong Hu
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2013

The feature selection process constitutes a commonly encountered problem of global combinatorial optimization. The process reduces the number of features by removing irrelevant and redundant data. This paper proposes a novel immune clonal genetic algorithm, based on the immune clonal algorithm, designed to solve the feature selection problem. The proposed algorithm has stronger exploration and exploitation abilities due to the clonal selection theory, and each antibody in the search space specifies a subset of the possible features. Experimental results show that the proposed algorithm simplifies the feature selection process effectively and obtains higher classification accuracy than other feature selection algorithms.

Crossref

Directory of Open Access Journals

Media planning by optimizing contact frequencies

Authors: Kapsenberg S.; Kloprogge P.; Piersma N. (Nanda); Wagelmans A.P.M. (Albert)
Publication date: 01/01/1998

EUR Research Repository

Erasmus University Digital Repository

Interactive product catalogue with user preference tracking

Authors: Guan SU; Tay YS
Publication venue: 'Inderscience Publishers'
Publication date: 01/01/2007

In the context of m-commerce, small screen size makes it seriously difficult for users to browse effectively through a product catalogue, given the limited number of products that may be presented on-screen. Despite the availability of search engines, filters and recommender systems to aid users, these techniques focus on a narrow segment of the product offering. The users are thus denied the opportunity to explore the available products more expansively. This paper describes a novel approach to overcome the constraints of small screen size. Through integration of a product catalogue with a recommender system, an adaptive system has been created that guides users through the process of product browsing. An original technique has been developed to cluster similar positive examples together to identify a user's areas of interest. The performance of this technique has been evaluated and the results proved to be promising.

Brunel University Research Archive

Non-monotone Submodular Maximization with Nearly Optimal Adaptivity and Query Complexity

Authors: Fahrbach Matthew; Mirrokni Vahab; Zadimoghaddam Morteza
Publication date: 28/05/2019

Submodular maximization is a general optimization problem with a wide range of applications in machine learning (e.g., active learning, clustering, and feature selection). In large-scale optimization, the parallel running time of an algorithm is governed by its adaptivity, which measures the number of sequential rounds needed if the algorithm can execute polynomially-many independent oracle queries in parallel. While low adaptivity is ideal, it is not sufficient for an algorithm to be efficient in practice: there are many applications of distributed submodular optimization where the number of function evaluations becomes prohibitively expensive. Motivated by these applications, we study the adaptivity and query complexity of submodular maximization. In this paper, we give the first constant-factor approximation algorithm for maximizing a non-monotone submodular function subject to a cardinality constraint $k$ that runs in $O(\log(n))$ adaptive rounds and makes $O(n \log(k))$ oracle queries in expectation. In our empirical study, we use three real-world applications to compare our algorithm with several benchmarks for non-monotone submodular maximization. The results demonstrate that our algorithm finds competitive solutions using significantly fewer rounds and queries.
Comment: 12 pages, 8 figures

arXiv.org e-Print Archive

Feature selection in high dimensional regression problems for genomic

Authors: Dhaenens Clarisse; Even Gaël; Hamon Julie; Jacques Julien
Publication venue: HAL CCSD
Publication date: 20/06/2013

In the context of genomic selection in animal breeding, an important objective is to look for explanatory markers for a phenotype under study. To deal with a high number of markers, we propose to use combinatorial optimization to perform variable selection. Results show that our approach outperforms some classical and widely used methods on simulated and "close to real" datasets.

HAL - Lille 3

INRIA a CCSD electronic archive server

HAL Descartes