274 research outputs found
Compilers that learn to optimise: a probabilistic machine learning approach
Compiler optimisation is the process of making a compiler produce better code, i.e. code that,
for example, runs faster on a target architecture. Although numerous program transformations
for optimisation have been proposed in the literature, these transformations are not always beneficial and they can interact in very complex ways. Traditional approaches adopted by compiler
writers fix the order of the transformations and decide when and how these transformations
should be applied to a program by using hard-coded heuristics. However, these heuristics require a lot of time and effort to construct and may sacrifice performance on programs they have
not been tuned for.This thesis proposes a probabilistic machine learning solution to the compiler optimisation problem that automatically determines "good" optimisation strategies for programs. This
approach uses predictive modelling in order to search the space of compiler transformations.
Unlike most previous work that learns when/how to apply a single transformation in isolation or
a fixed-order set of transformations, the techniques proposed in this thesis are capable of tackling the general problem of predicting "good" sequences of compiler transformations. This is
achieved by exploiting transference across programs with two different techniques: Predictive
Search Distributions (PSD) and multi-task Gaussian process prediction (multi-task GP). While
the former directly addresses the problem of predicting "good" transformation sequences, the
latter learns regression models (or proxies) of the performance of the programs in order to
rapidly scan the space of transformation sequences.Both methods, PSD and multi-task GP, are formulated as general machine learning techniques. In particular, the PSD method is proposed in order to speed up search in combinatorial
optimisation problems by learning a distribution over good solutions on a set of problem in¬
stances and using that distribution to search the optimisation space of a problem that has not
been seen before. Likewise, multi-task GP is proposed as a general method for multi-task learning that directly models the correlation between several machine learning tasks, exploiting the
shared information across the tasks.Additionally, this thesis presents an extension to the well-known analysis of variance
(ANOVA) methodology in order to deal with sequence data. This extension is used to address the problem of optimisation space characterisation by identifying and quantifying the
main effects of program transformations and their interactions.Finally, the machine learning methods proposed are successfully applied to a data set that
has been generated as a result of the application of source-to-source transformations to 12 C
programs from the UTDSP benchmark suite
El análisis de gráficos en la conceptualización estadÃstica
A partir del uso de los gráficos dispuestos en la calculadora tales como los histogramas, las ojivas y las cajas, se proponen actividades encaminadas a desarrollar el uso de conceptos estadÃsticos fuertemente relacionados con el análisis de datos, en la toma de decisiones. También se aborda el papel de las diferentes representaciones en la argumentación desarrollada. Se trata sobre todo de mostrar un camino alterno (y hasta contrario al habitual), para abordar la enseñanza de la estadÃstica escolar, basado en el análisis de gráficos
Deuda pública soberana
El presente ensayo pretende responder a la pregunta ¿Cómo se está financiando el estado colombiano? Se podrá demostrar que Colombia está optando por endeudarse cada vez más, esto implica asumir una serie de riesgos que podrÃan desembocar en graves consecuencias económicas y sociales. El fin de esto, es abordar esta problemática, radica en encontrar una forma de limitar o disminuir el endeudamiento del paÃs y encontrar mejores instrumentos para la consecución de recursos tanto para los gastos como para la inversión
Contextual directed acyclic graphs
Estimating the structure of directed acyclic graphs (DAGs) from observational
data remains a significant challenge in machine learning. Most research in this
area concentrates on learning a single DAG for the entire population. This
paper considers an alternative setting where the graph structure varies across
individuals based on available "contextual" features. We tackle this contextual
DAG problem via a neural network that maps the contextual features to a DAG,
represented as a weighted adjacency matrix. The neural network is equipped with
a novel projection layer that ensures the output matrices are sparse and
satisfy a recently developed characterization of acyclicity. We devise a
scalable computational framework for learning contextual DAGs and provide a
convergence guarantee and an analytical gradient for backpropagating through
the projection layer. Our experiments suggest that the new approach can recover
the true context-specific graph where existing approaches fail
Gray-box inference for structured Gaussian process models
We develop an automated variational infer- ence method for Bayesian structured prediction problems with Gaussian process (gp) priors and linear-chain likelihoods. Our approach does not need to know the details of the structured likelihood model and can scale up to a large number of observations. Furthermore, we show that the required expected likelihood term and its gradients in the variational objective (ELBO) can be estimated efficiently by using expectations over very low-dimensional Gaussian distributions. Optimization of the ELBO is fully parallelizable over sequences and amenable to stochastic optimization, which we use along with control variate techniques to make our framework useful in practice. Results on a set of natural language processing tasks show that our method can be as good as (and sometimes better than, in particular with respect to expected log-likelihood) hard-coded approaches including svm-struct and crfs, and overcomes the scalability limitations of previous inference algorithms based on sampling. Overall, this is a fundamental step to developing automated inference methods for Bayesian structured prediction
- …