25 research outputs found

    Cross-Lingual Adaptation using Structural Correspondence Learning

    Cross-lingual adaptation, a special case of domain adaptation, refers to the transfer of classification knowledge between two languages. In this article we describe an extension of Structural Correspondence Learning (SCL), a recently proposed algorithm for domain adaptation, for cross-lingual adaptation. The proposed method uses unlabeled documents from both languages, along with a word translation oracle, to induce cross-lingual feature correspondences. From these correspondences a cross-lingual representation is created that enables the transfer of classification knowledge from the source to the target language. The main advantages of this approach over other approaches are its resource efficiency and task specificity. We conduct experiments in the area of cross-language topic and sentiment classification involving English as source language and German, French, and Japanese as target languages. The results show a significant improvement of the proposed method over a machine translation baseline, reducing the relative error due to cross-lingual adaptation by an average of 30% (topic classification) and 59% (sentiment classification). We further report on empirical analyses that reveal insights into the use of unlabeled data, the sensitivity with respect to important hyperparameters, and the nature of the induced cross-lingual correspondences.
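    The core idea of a shared representation induced from a translation oracle can be illustrated with a deliberately simplified sketch. This is not the paper's SCL algorithm (which learns correspondences from pivot predictors): the oracle entries, documents, and helper below are invented for illustration only.

    ```python
    # Toy sketch: a word translation oracle defines pivot pairs, and documents
    # from either language are mapped onto the same binary pivot space.
    oracle = {"good": "gut", "bad": "schlecht", "film": "film"}
    pivots = list(oracle.items())  # (source_word, target_word) pairs

    def to_pivot_vector(tokens):
        """Binary feature per pivot: does the document contain either side?"""
        token_set = set(tokens)
        return [int(src in token_set or tgt in token_set) for src, tgt in pivots]

    # An English and a German document land on the same representation, so a
    # classifier trained on source-language data can be applied to the target:
    en_vec = to_pivot_vector("the film is good".split())
    de_vec = to_pivot_vector("der film ist gut".split())
    ```

    Real SCL goes further: it trains linear predictors for each pivot on unlabeled data and applies an SVD to obtain a low-dimensional cross-lingual projection, rather than using raw pivot presence.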

    API design for machine learning software: experiences from the scikit-learn project

    Scikit-learn is an increasingly popular machine learning library. Written in Python, it is designed to be simple and efficient, accessible to non-experts, and reusable in various contexts. In this paper, we present and discuss our design choices for the application programming interface (API) of the project. In particular, we describe the simple and elegant interface shared by all learning and processing units in the library and then discuss its advantages in terms of composition and reusability. The paper also comments on implementation details specific to the Python ecosystem and analyzes obstacles faced by users and developers of the library.
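    The shared interface the paper describes can be sketched with a minimal custom estimator following the same contract: hyperparameters are set in `__init__`, learned state is set by `fit` (with a trailing underscore), and `fit` returns `self` so calls can be chained. `MeanPredictor` is a made-up example, not part of scikit-learn.

    ```python
    # A minimal estimator honoring the scikit-learn API conventions.
    class MeanPredictor:
        def __init__(self, offset=0.0):
            self.offset = offset          # hyperparameter, fixed at construction

        def fit(self, X, y):
            self.mean_ = sum(y) / len(y)  # learned attribute: underscore suffix
            return self                   # returning self enables chaining

        def predict(self, X):
            return [self.mean_ + self.offset for _ in X]

    preds = MeanPredictor().fit([[0], [1], [2]], [1.0, 2.0, 3.0]).predict([[5]])
    ```

    Because every estimator exposes the same `fit`/`predict` (or `fit`/`transform`) surface, such objects compose naturally into pipelines and model-selection utilities.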

    Scikit-learn: Machine Learning in Python

    Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing machine learning to non-specialists using a general-purpose high-level language. Emphasis is put on ease of use, performance, documentation, and API consistency. It has minimal dependencies and is distributed under the simplified BSD license, encouraging its use in both academic and commercial settings. Source code, binaries, and documentation can be downloaded from http://scikit-learn.sourceforge.net.
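    The ease of use the abstract emphasizes can be seen in a minimal sketch; the two-point dataset here is invented purely to show the fit-then-predict pattern.

    ```python
    # Minimal scikit-learn usage: construct, fit, predict.
    from sklearn.tree import DecisionTreeClassifier

    X = [[0], [1]]   # toy inputs
    y = [0, 1]       # toy labels
    clf = DecisionTreeClassifier().fit(X, y)
    out = clf.predict([[0], [1]])
    ```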

    On decomposing a deep neural network into modules

    Deep learning is being incorporated in many modern software systems. Deep learning approaches train a deep neural network (DNN) model using training examples, and then use the DNN model for prediction. While the structure of a DNN model as layers is observable, the model is treated in its entirety as a monolithic component. To change the logic implemented by the model, e.g. to add or remove logic that recognizes inputs belonging to a certain class, or to replace the logic with an alternative, the training examples need to be changed and the DNN needs to be retrained using the new set of examples. We argue that decomposing a DNN into DNN modules, akin to decomposing monolithic software code into modules, can bring the benefits of modularity to deep learning. In this work, we develop a methodology for decomposing DNNs for multi-class problems into DNN modules. For four canonical problems, namely MNIST, EMNIST, FMNIST, and KMNIST, we demonstrate that such decomposition enables reuse of DNN modules to create different DNNs, and enables replacement of one DNN module with another without retraining. The DNN models formed by composing DNN modules are at least as good as traditional monolithic DNNs in terms of test accuracy for our problems.
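    The modularity benefit being argued for can be conveyed with a deliberately abstract sketch (not the paper's decomposition methodology): each per-class "module" scores membership in its class, a composed model picks the highest-scoring module, and one module can be swapped without touching the others. The scorer and class names are invented for illustration.

    ```python
    # Toy per-class modules and a composed multi-class model.
    def make_module(center):
        """Made-up one-class scorer: higher when x is closer to the class center."""
        return lambda x: -abs(x - center)

    def compose(modules):
        """Composed classifier: predict the class whose module scores highest."""
        return lambda x: max(modules, key=lambda name: modules[name](x))

    modules = {"low": make_module(0.0), "high": make_module(10.0)}
    model = compose(modules)
    label_a = model(1.0)                 # nearest center is 0.0 -> "low"

    # Replace one module without retraining the others:
    modules["high"] = make_module(4.0)
    label_b = model(3.0)                 # nearest center is now 4.0 -> "high"
    ```

    In the paper the modules are genuine sub-networks carved out of a trained DNN, but the composition-and-replacement workflow is the same in spirit.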

    Forecasting Daily Solar Energy Production Using Robust Regression Techniques

    We describe a novel approach to forecast daily solar energy production based on the output of a numerical weather prediction (NWP) model using non-parametric robust regression techniques. Our approach comprises two steps: First, we use a non-linear interpolation technique, Gaussian Process regression (also known as Kriging in Geostatistics), to interpolate the coarse NWP grid to the location of the solar energy production facilities. Second, we use Gradient Boosted Regression Trees, a non-parametric regression technique, to predict the daily solar energy output based on the interpolated NWP model and additional spatio-temporal features. Experimental evidence suggests that two aspects of our approach are crucial for its effectiveness: a) the ability of Gaussian Process regression to incorporate both input and output uncertainty, which we leverage by deriving input uncertainty from an ensemble of 11 NWP models and including confidence intervals alongside the interpolated point estimates, and b) the ability of Gradient Boosted Regression Trees to handle outliers in the outputs by using robust loss functions, a property that is very important due to the volatile nature of solar energy output. We evaluated the approach on a dataset of daily solar energy measurements from 98 stations in Oklahoma. The results show a relative improvement of 17.17% and 46.19% over the baselines, Spline Interpolation and Gaussian Mixture Models, respectively.
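    The two-step pipeline can be sketched with scikit-learn stand-ins. This is a hedged illustration, not the paper's implementation: the grid coordinates, features, and targets are synthetic, and the real system derives input uncertainty from an 11-member NWP ensemble rather than from a single GP fit.

    ```python
    # Step 1: GP regression interpolates a coarse "NWP grid" to station sites.
    # Step 2: GBRT with a robust (Huber) loss maps interpolated values plus
    #         their uncertainty to daily output.
    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.ensemble import GradientBoostingRegressor

    rng = np.random.default_rng(0)

    grid_xy = rng.uniform(0, 1, size=(30, 2))            # coarse grid nodes
    grid_val = np.sin(3 * grid_xy[:, 0]) + grid_xy[:, 1] # synthetic NWP field
    station_xy = rng.uniform(0, 1, size=(10, 2))         # facility locations

    gp = GaussianProcessRegressor().fit(grid_xy, grid_val)
    interp, interp_std = gp.predict(station_xy, return_std=True)

    features = np.column_stack([interp, interp_std])     # point estimate + uncertainty
    target = 2.0 * interp + rng.normal(0, 0.05, size=10) # synthetic daily output
    gbrt = GradientBoostingRegressor(loss="huber", n_estimators=50).fit(features, target)
    preds = gbrt.predict(features)
    ```

    The Huber loss is what makes the second stage robust: beyond a threshold it penalizes residuals linearly rather than quadratically, limiting the pull of outlier days.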

    Gradient Boosted Regression Trees in Scikit-Learn

    This talk describes Gradient Boosted Regression Trees (GBRT), a powerful statistical learning technique with applications in a variety of areas, ranging from web page ranking to environmental niche modeling. GBRT is a key ingredient of many winning solutions in data-mining competitions such as the Netflix Prize, the GE Flight Quest, or the Heritage Health Prize. We give a brief introduction to the GBRT model and regression trees, focusing on intuition rather than mathematical formulas. The majority of the talk is dedicated to an in-depth discussion of how to apply GBRT in practice using scikit-learn. We cover important topics such as regularization, model tuning, and model interpretation that should significantly improve your score on Kaggle.
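    The regularization and interpretation topics the talk covers map directly onto `GradientBoostingRegressor` parameters and attributes. The dataset below is synthetic, invented so that only the first feature carries signal.

    ```python
    # Common GBRT regularization knobs in scikit-learn, plus feature importances.
    import numpy as np
    from sklearn.ensemble import GradientBoostingRegressor

    rng = np.random.default_rng(42)
    X = rng.uniform(-1, 1, size=(200, 3))
    y = X[:, 0] ** 2 + 0.1 * rng.normal(size=200)  # only feature 0 is informative

    gbrt = GradientBoostingRegressor(
        n_estimators=200,    # more trees, each contributing less ...
        learning_rate=0.05,  # ... via shrinkage
        max_depth=3,         # shallow trees limit interaction order
        subsample=0.8,       # stochastic gradient boosting
    ).fit(X, y)

    # Model interpretation: impurity-based importances, normalized to sum to 1.
    importances = gbrt.feature_importances_
    ```

    In practice `n_estimators` and `learning_rate` are tuned jointly (smaller learning rates need more trees), typically via cross-validated grid search.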

    Acquiring explicit user goals from search query logs
