Search CORE

17,652 research outputs found

Comparison of the CPU and memory performance of StatPatternRecognition (SPR) and Toolkit for MultiVariate Analysis (TMVA)

Author: Allwein
Bay
Breiman
Breiman
Breiman
Breiman
Breiman
Breiman
Breiman
Brun
Dasarathy
Duda
Fisher
Freund
Friedman
Friedman
Friedman
G. Palombo
Hastie
Haykin
Ihaka
Lam
McLachlan
Meyer
Narsky
Quinlan
Vapnik
Webb
Publication venue: 'Elsevier BV'
Publication date: 28/03/2011
Field of study

High Energy Physics data sets are often characterized by a huge number of events. Therefore, it is extremely important to use statistical packages able to efficiently analyze these unprecedented amounts of data. We compare the performance of the statistical packages StatPatternRecognition (SPR) and Toolkit for MultiVariate Analysis (TMVA). We focus on how CPU time and memory usage of the learning process scale versus data set size. As classifiers, we consider Random Forests, Boosted Decision Trees and Neural Networks. For our tests, we employ a data set widely used in the machine learning community, "Threenorm" data set, as well as data tailored for testing various edge cases. For each data set, we constantly increase its size and check CPU time and memory needed to build the classifiers implemented in SPR and TMVA. We show that SPR is often significantly faster and consumes significantly less memory. For example, the SPR implementation of Random Forest is by an order of magnitude faster and consumes an order of magnitude less memory than TMVA on Threenorm data

arXiv.org e-Print Archive

Crossref

Caltech Authors

Guinea pigs sublethally infected with aerosolized Legionella pneumophila develop humoral and cell-mediated immune responses and are protected against lethal aerosol challenge. A model for studying host defense against lung infections caused by intracellular pathogens.

Author: Breiman RF
Horwitz MA
Publication venue: eScholarship, University of California
Publication date: 01/03/1987
Field of study

We have employed the guinea pig model of L. pneumophila infection, which mimics Legionnaires' disease in humans both clinically and pathologically, to study humoral and cell-mediated immune responses to L. pneumophila and to examine protective immunity after aerosol exposure, the natural route of infection. Guinea pigs exposed to sublethal concentrations of L. pneumophila by aerosol developed strong humoral immune responses. By the indirect fluorescent antibody assay, exposed guinea pigs had a median serum antibody titer (expressed as the reciprocal of the highest positive dilution) of 32, whereas control guinea pigs had a median titer of less than 1. Sublethally infected (immunized) guinea pigs also developed strong cell-mediated immune responses. In response to L. pneumophila antigens, splenic lymphocytes from immunized but not control animals proliferated strongly in vitro, as measured by their capacity to incorporate [3H]thymidine. Moreover, immunized but not control guinea pigs developed strong cutaneous delayed-type hypersensitivity to intradermally injected L. pneumophila antigens. Sublethally infected (immunized) guinea pigs exhibited strong protective immunity to L. pneumophila. In two independent experiments, all 22 immunized guinea pigs survived aerosol challenge with one or three times the lethal dose of L. pneumophila whereas none of 16 sham-immunized control guinea pigs survived (p less than 0.0001 in each experiment). Immunized guinea pigs were not protected significantly from challenge with 10 times the lethal dose. Immunized but not control animals cleared the bacteria from their lungs. This study demonstrates that guinea pigs sublethally infected with L. pneumophila by the aerosol route develop strong humoral immune responses to this pathogen, develop strong cell-mediated immune responses and cutaneous delayed-type hypersensitivity to L. pneumophila antigens, are protected against subsequent lethal aerosol challenge, and are able to clear the bacteria from their lungs. The guinea pig model of L. pneumophila pulmonary infection is as an excellent one for studying general principles of host defense against pulmonary infections caused by intracellular pathogens

Crossref

eScholarship - University of California

Semi-supervised Learning for Photometric Supernova Classification

Author: Breiman
Breiman
Breiman
Breiman
Chad M. Schafer
Chapelle
Coifman
Darren Homrighausen
Dovi Poznanski
Dubath
Falck
Freeman
Gong
Grimmett
Hastie
Homeier
Johnson
Joseph W. Richards
Kessler
Kessler
Kunz
Kuznetsova
Lafon
Newling
Nugent
Peter E. Freeman
Poznanski
Poznanski
Poznanski
Richards
Richards
Richards
Rodney
Ruppert
Sullivan
Wasserman
Publication venue: 'Wiley'
Publication date: 27/09/2011
Field of study

We present a semi-supervised method for photometric supernova typing. Our approach is to first use the nonlinear dimension reduction technique diffusion map to detect structure in a database of supernova light curves and subsequently employ random forest classification on a spectroscopically confirmed training set to learn a model that can predict the type of each newly observed supernova. We demonstrate that this is an effective method for supernova typing. As supernova numbers increase, our semi-supervised method efficiently utilizes this information to improve classification, a property not enjoyed by template based methods. Applied to supernova data simulated by Kessler et al. (2010b) to mimic those of the Dark Energy Survey, our methods achieve (cross-validated) 95% Type Ia purity and 87% Type Ia efficiency on the spectroscopic sample, but only 50% Type Ia purity and 50% efficiency on the photometric sample due to their spectroscopic follow-up strategy. To improve the performance on the photometric sample, we search for better spectroscopic follow-up procedures by studying the sensitivity of our machine learned supernova classification on the specific strategy used to obtain training sets. With a fixed amount of spectroscopic follow-up time, we find that deeper magnitude-limited spectroscopic surveys are better for producing training sets. For supernova Ia (II-P) typing, we obtain a 44% (1%) increase in purity to 72% (87%) and 30% (162%) increase in efficiency to 65% (84%) of the sample using a 25th (24.5th) magnitude-limited survey instead of the shallower spectroscopic sample used in the original simulations. When redshift information is available, we incorporate it into our analysis using a novel method of altering the diffusion map representation of the supernovae. Incorporating host redshifts leads to a 5% improvement in Type Ia purity and 13% improvement in Type Ia efficiency.Comment: 16 pages, 11 figures, accepted for publication in MNRA

arXiv.org e-Print Archive

Crossref

A Methodology for the Diagnostic of Aircraft Engine Based on Indicators Aggregation

Author: D. Ruta
E. Côme
H. Peng
I. Guyon
J. Lacaille
L. Breiman
L. Breiman
L. Vasov
M. Basseville
N. Japkowicz
X. Flandrois
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Aircraft engine manufacturers collect large amount of engine related data during flights. These data are used to detect anomalies in the engines in order to help companies optimize their maintenance costs. This article introduces and studies a generic methodology that allows one to build automatic early signs of anomaly detection in a way that is understandable by human operators who make the final maintenance decision. The main idea of the method is to generate a very large number of binary indicators based on parametric anomaly scores designed by experts, complemented by simple aggregations of those scores. The best indicators are selected via a classical forward scheme, leading to a much reduced number of indicators that are tuned to a data set. We illustrate the interest of the method on simulated data which contain realistic early signs of anomalies.Comment: Proceedings of the 14th Industrial Conference, ICDM 2014, St. Petersburg : Russian Federation (2014

arXiv.org e-Print Archive

Crossref

HAL-Paris1

Texture-based crowd detection and localisation

Author: A.C. Davies
L. Breiman
R.M. Haralick
S.J. McKenna
Y. Freund
Publication venue: IEEE / Robotics and Automation Society
Publication date: 26/06/2012
Field of study

This paper presents a crowd detection system based on texture analysis. The state-of-the-art techniques based on co-occurrence matrix have been revisited and a novel set of features proposed. These features provide a richer description of the co-occurrence matrix, and can be exploited to obtain stronger classification results, especially when smaller portions of the image are considered. This is extremely useful for crowd localisation: acquired images are divided into smaller regions in order to perform a classification on each one. A thorough evaluation of the proposed system on a real world data set is also presented: this validates the improvements in reliability of the crowd detection and localisation

University of Lincoln Institutional Repository

Crossref

Archivio istituzionale della ricerca - Università di Padova

Machine learning with the hierarchy‐of‐hypotheses (HoH) approach discovers novel pattern in studies on biological invasions

Author: Barnett‐page E
Breiman L
Cooper H
Fidler F
Hackett EJ
Publication venue
Publication date: 01/01/2019
Field of study

Research synthesis on simple yet general hypotheses and ideas is challenging in scientific disciplines studying highly context‐dependent systems such as medical, social, and biological sciences. This study shows that machine learning, equation‐free statistical modeling of artificial intelligence, is a promising synthesis tool for discovering novel patterns and the source of controversy in a general hypothesis. We apply a decision tree algorithm, assuming that evidence from various contexts can be adequately integrated in a hierarchically nested structure. As a case study, we analyzed 163 articles that studied a prominent hypothesis in invasion biology, the enemy release hypothesis. We explored if any of the nine attributes that classify each study can differentiate conclusions as classification problem. Results corroborated that machine learning can be useful for research synthesis, as the algorithm could detect patterns that had been already focused in previous narrative reviews. Compared with the previous synthesis study that assessed the same evidence collection based on experts' judgement, the algorithm has newly proposed that the studies focusing on Asian regions mostly supported the hypothesis, suggesting that more detailed investigations in these regions can enhance our understanding of the hypothesis. We suggest that machine learning algorithms can be a promising synthesis tool especially where studies (a) reformulate a general hypothesis from different perspectives, (b) use different methods or variables, or (c) report insufficient information for conducting meta‐analyses

Institutional Repository of the Freie Universität Berlin

Crossref

ICA as a preprocessing technique for classification

Author: F. Breiman
Hyvärinen
L. Breiman
M. Mitchell
R.O. Duda
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004
Field of study

In this paper we propose the use of the independent component analysis (ICA) [1] technique for improving the classification rate of decision trees and multilayer perceptrons [2], [3]. The use of an ICA for the preprocessing stage, makes the structure of both classifiers simpler, and therefore improves the generalization properties. The hypothesis behind the proposed preprocessing is that an ICA analysis will transform the feature space into a space where the components are independent, and aligned to the axes and therefore will be more adapted to the way that a decision tree is constructed. Also the inference of the weights of a multilayer perceptron will be much easier because the gradient search in the weight space will follow independent trajectories. The result is that classifiers are less complex and on some databases the error rate is lower. This idea is also applicable to regressio

Crossref

Secretaría de Estado de Cultura

RIUVic

Improving access to Special Collections by automating descriptive metadata creation

Author: Breiman Jessica
Publication venue: University of Utah
Publication date: 05/05/2016
Field of study

presentationPresentation given at the Utah Library Association Annual Conference, Layton, UT

The University of Utah: J. Willard Marriott Digital Library

Predictive Maintenance on the Machining Process and Machine Tool

Author: Box
Breiman
Djeziri
Extrapolation
Farokhzad
Galar
Guyon
Jin
Maronna
Mills
Mobley
Tsymbal
Tyagi
Winkler
Publication venue: 'MDPI AG'
Publication date: 01/01/2020
Field of study

This paper presents the process required to implement a data driven Predictive Maintenance (PdM) not only in the machine decision making, but also in data acquisition and processing. A short review of the different approaches and techniques in maintenance is given. The main contribution of this paper is a solution for the predictive maintenance problem in a real machining process. Several steps are needed to reach the solution, which are carefully explained. The obtained results show that the Preventive Maintenance (PM), which was carried out in a real machining process, could be changed into a PdM approach. A decision making application was developed to provide a visual analysis of the Remaining Useful Life (RUL) of the machining tool. This work is a proof of concept of the methodology presented in one process, but replicable for most of the process for serial productions of pieces

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

TECNALIA Publications

Peer Observation as a Job-Embedded Professional Development Tool

Author: Breiman Daniel J.
Publication venue: eRepository @ Seton Hall
Publication date: 20/05/2019
Field of study

Teacher professional development is typically provided outside of the workplace, and is therefore disconnected to daily classroom practices. An alternative model of professional development is peer observation, which is contextualized through coaching and collaboration in the classroom. To date, research and investigation into the practice of peer observation is lacking. To fill that gap, this study examined the influence of peer observation on teacher practice, while identifying factors that were most beneficial and challenging about peer observation and its influence on workplace collegiality. This study used qualitative methods and action research that allowed teachers to be part of the research process. Three teams of teachers participated in the study at a suburban high school. Each team consisted of two teachers, pairing an experienced teacher with an inexperienced teacher. Participants in the study reported how peer observation provided professional development in the context of their workplace. Teachers in each team shared the same instructional content area which, according to findings, made the peer observation process more relevant. Peer observation was also found to build and strengthen collegiality, facilitate an exchange of instructional techniques between teachers, and break down isolating instructional practices. Participants also appreciated receiving feedback from a colleague in a non-threatening way