2,266 research outputs found
Feature selection for chemical sensor arrays using mutual information
We address the problem of feature selection for classifying a diverse set of chemicals using an array of metal oxide sensors. Our aim is to evaluate a filter approach to feature selection with reference to previous work, which used a wrapper approach on the same data set, and established best features and upper bounds on classification performance. We selected feature sets that exhibit the maximal mutual information with the identity of the chemicals. The selected features closely match those found to perform well in the previous study using a wrapper approach to conduct an exhaustive search of all permitted feature combinations. By comparing the classification performance of support vector machines (using features selected by mutual information) with the performance observed in the previous study, we found that while our approach does not always give the maximum possible classification performance, it always selects features that achieve classification performance approaching the optimum obtained by exhaustive search. We performed further classification using the selected feature set with some common classifiers and found that, for the selected features, Bayesian Networks gave the best performance. Finally, we compared the observed classification performances with the performance of classifiers using randomly selected features. We found that the selected features consistently outperformed randomly selected features for all tested classifiers. The mutual information filter approach is therefore a computationally efficient method for selecting near optimal features for chemical sensor arrays
High-Dimensional Feature Selection by Feature-Wise Kernelized Lasso
The goal of supervised feature selection is to find a subset of input
features that are responsible for predicting output values. The least absolute
shrinkage and selection operator (Lasso) allows computationally efficient
feature selection based on linear dependency between input features and output
values. In this paper, we consider a feature-wise kernelized Lasso for
capturing non-linear input-output dependency. We first show that, with
particular choices of kernel functions, non-redundant features with strong
statistical dependence on output values can be found in terms of kernel-based
independence measures. We then show that the globally optimal solution can be
efficiently computed; this makes the approach scalable to high-dimensional
problems. The effectiveness of the proposed method is demonstrated through
feature selection experiments with thousands of features.Comment: 18 page
Forward-Backward Asymmetry in Top Quark Production in ppbar Collisions at sqrt{s}=1.96 TeV
Reconstructable final state kinematics and charge assignment in the reaction
ppbar->ttbar allows tests of discrete strong interaction symmetries at high
energy. We define frame dependent forward-backward asymmetries for the outgoing
top quark in both the ppbar and ttbar rest frames, correct for experimental
distortions, and derive values at the parton-level. Using 1.9/fb of ppbar
collisions at sqrt{s}=1.96 TeV recorded with the CDF II detector at the
Fermilab Tevatron, we measure forward-backward top quark production asymmetries
in the ppbar and ttbar rest frames of A_{FB,pp} = 0.17 +- 0.08 and A_{FB,tt} =
0.24 +- 0.14.Comment: 7 pages, 2 figures, submitted to Phys.Rev.Lett, corrected references
and change of tex
Observation of Exclusive Gamma Gamma Production in p pbar Collisions at sqrt{s}=1.96 TeV
We have observed exclusive \gamma\gamma production in proton-antiproton
collisions at \sqrt{s}=1.96 TeV, using data from 1.11 \pm 0.07 fb^{-1}
integrated luminosity taken by the Run II Collider Detector at Fermilab. We
selected events with two electromagnetic showers, each with transverse energy
E_T > 2.5 GeV and pseudorapidity |\eta| < 1.0, with no other particles detected
in -7.4 < \eta < +7.4. The two showers have similar E_T and azimuthal angle
separation \Delta\phi \sim \pi; 34 events have two charged particle tracks,
consistent with the QED process p \bar{p} to p + e^+e^- + \bar{p} by two-photon
exchange, while 43 events have no charged tracks. The number of these events
that are exclusive \pi^0\pi^0 is consistent with zero and is < 15 at 95% C.L.
The cross section for p\bar{p} to p+\gamma\gamma+\bar{p} with |\eta(\gamma)| <
1.0 and E_T(\gamma) > 2.5$ GeV is
2.48^{+0.40}_{-0.35}(stat)^{+0.40}_{-0.51}(syst) pb.Comment: 7 pages, 4 figure
Evidence for t\bar{t}\gamma Production and Measurement of \sigma_t\bar{t}\gamma / \sigma_t\bar{t}
Using data corresponding to 6.0/fb of ppbar collisions at sqrt(s) = 1.96 TeV
collected by the CDF II detector, we present a cross section measurement of
top-quark pair production with an additional radiated photon. The events are
selected by looking for a lepton, a photon, significant transverse momentum
imbalance, large total transverse energy, and three or more jets, with at least
one identified as containing a b quark. The ttbar+photon sample requires the
photon to have 10 GeV or more of transverse energy, and to be in the central
region. Using an event selection optimized for the ttbar+photon candidate
sample we measure the production cross section of, and the ratio of cross
sections of the two samples. Control samples in the dilepton+photon and
lepton+photon+\met, channels are constructed to aid in decay product
identification and background measurements. We observe 30 ttbar+photon
candidate events compared to the standard model expectation of 26.9 +/- 3.4
events. We measure the ttbar+photon cross section to be 0.18+0.08 pb, and the
ratio of the cross section of ttbar+photon to ttbar to be 0.024 +/- 0.009.
Assuming no ttbar+photon production, we observe a probability of 0.0015 of the
background events alone producing 30 events or more, corresponding to 3.0
standard deviations.Comment: 9 pages, 3 figure
Combined search for the standard model Higgs boson decaying to a bb pair using the full CDF data set
We combine the results of searches for the standard model Higgs boson based
on the full CDF Run II data set obtained from sqrt(s) = 1.96 TeV p-pbar
collisions at the Fermilab Tevatron corresponding to an integrated luminosity
of 9.45/fb. The searches are conducted for Higgs bosons that are produced in
association with a W or Z boson, have masses in the range 90-150 GeV/c^2, and
decay into bb pairs. An excess of data is present that is inconsistent with the
background prediction at the level of 2.5 standard deviations (the most
significant local excess is 2.7 standard deviations).Comment: To be published in Phys. Rev. Lett (v2 contains minor updates based
on comments from PRL
Observation of the Baryonic Flavor-Changing Neutral Current Decay Lambda_b -> Lambda mu+ mu-
We report the first observation of the baryonic flavor-changing neutral
current decay Lambda_b -> Lambda mu+ mu- with 24 signal events and a
statistical significance of 5.8 Gaussian standard deviations. This measurement
uses ppbar collisions data sample corresponding to 6.8fb-1 at sqrt{s}=1.96TeV
collected by the CDF II detector at the Tevatron collider. The total and
differential branching ratios for Lambda_b -> Lambda mu+ mu- are measured. We
find B(Lambda_b -> Lambda mu+ mu-) = [1.73+-0.42(stat)+-0.55(syst)] x 10^{-6}.
We also report the first measurement of the differential branching ratio of B_s
-> phi mu+ mu- using 49 signal events. In addition, we report branching ratios
for B+ -> K+ mu+ mu-, B0 -> K0 mu+ mu-, and B -> K*(892) mu+ mu- decays.Comment: 8 pages, 2 figures, 4 tables. Submitted to Phys. Rev. Let
Precise measurement of the W-boson mass with the CDF II detector
We have measured the W-boson mass MW using data corresponding to 2.2/fb of
integrated luminosity collected in proton-antiproton collisions at 1.96 TeV
with the CDF II detector at the Fermilab Tevatron collider. Samples consisting
of 470126 W->enu candidates and 624708 W->munu candidates yield the measurement
MW = 80387 +- 12 (stat) +- 15 (syst) = 80387 +- 19 MeV. This is the most
precise measurement of the W-boson mass to date and significantly exceeds the
precision of all previous measurements combined
Precision Top-Quark Mass Measurements at CDF
We present a precision measurement of the top-quark mass using the full
sample of Tevatron TeV proton-antiproton collisions collected
by the CDF II detector, corresponding to an integrated luminosity of 8.7
. Using a sample of candidate events decaying into the
lepton+jets channel, we obtain distributions of the top-quark masses and the
invariant mass of two jets from the boson decays from data. We then compare
these distributions to templates derived from signal and background samples to
extract the top-quark mass and the energy scale of the calorimeter jets with
{\it in situ} calibration. The likelihood fit of the templates from signal and
background events to the data yields the single most-precise measurement of the
top-quark mass, \mtop = 172.85 \pm\pmComment: submitted to Phys. Rev. Let
- …