998 research outputs found

    A tight excess risk bound via a unified PAC-Bayesian-Rademacher-Shtarkov-MDL complexity

    Get PDF
    We present a novel notion of complexity that interpolates between and generalizes some classic existing complexity notions in learning theory: for estimators like empirical risk minimization (ERM) with arbitrary bounded losses, it is upper bounded in terms of data-independent Rademacher complexity; for generalized Bayesian estimators, it is upper bounded by the data-dependent information complexity (also known as stochastic or PAC-Bayesian, KL(posterior∥prior) complexity. For (penalized) ERM, the new complexity reduces to (generalized) normalized maximum likelihood (NML) complexity, i.e. a minimax log-loss individual-sequence regret. Our first main result bounds excess risk in terms of the new complexity. Our second main result links the new complexity via Rademacher complexity to L2(P) entropy, thereby generalizing earlier results of Opper, Haussler, Lugosi, and Cesa-Bianchi who did the log-loss case with L∞. Together, these results recover optimal bounds for VC- and large (polynomial entropy) classes, replacing localized Rademacher complexity by a simpler analysis which almost completely separates the two aspects that determine the achievable rates: 'easiness' (Bernstein) conditions and model complexity

    A Tight Excess Risk Bound via a Unified PAC-Bayesian-Rademacher-Shtarkov-MDL Complexity

    Get PDF
    We present a novel notion of complexity that interpolates between and generalizes some classic existing complexity notions in learning theory: for estimators like empirical risk minimization (ERM) with arbitrary bounded losses, it is upper bounded in terms of data-independent Rademacher complexity; for generalized Bayesian estimators, it is upper bounded by the data-dependent information complexity (also known as stochastic or PAC-Bayesian, KL(posterior∥prior) complexity. For (penalized) ERM, the new complexity reduces to (generalized) normalized maximum likelihood (NML) complexity, i.e. a minimax log-loss individual-sequence regret. Our first main result bounds excess risk in terms of the new complexity. Our second main result links the new complexity via Rademacher complexity to L2(P) entropy, thereby generalizing earlier results of Opper, Haussler, Lugosi, and Cesa-Bianchi who did the log-loss case with L∞. Together, these results recover optimal bounds for VC- and large (polynomial entropy) classes, replacing localized Rademacher complexity by a simpler analysis which almost completely separates the two aspects that determine the achievable rates: 'easiness' (Bernstein) conditions and model complexity

    Fast rates for general unbounded loss functions: From ERM to generalized bayes

    Get PDF
    We present new excess risk bounds for general unbounded loss functions including log loss and squared loss, where the distribution of the losses may be heavy-tailed. The bounds hold for general estimators, but they are optimized when applied to η-generalized Bayesian, MDL, and empirical risk minimization estimators. In the case of log loss, the bounds imply convergence rates for generalized Bayesian inference under misspecification in terms of a generalization of the Hellinger metric as long as the learning rate η is set correctly. For general loss functions, our bounds rely on two separate conditions: the v-GRIP (generalized reversed information projection) conditions, which control the lower tail of the excess loss; and the newly introduced witness condition, which controls the upper tail. The parameter v in the v-GRIP conditions determines the achievable rate and is akin to the exponent in the Tsybakov margin condition and the Bernstein condition for bounded losses, which the v-GRIP conditions generalize; favorable v in combination with small model complexity leads to Õ(1/n) rates. The witness condition allows us to connect the excess risk to an “annealed” version thereof, by which we generalize several previous results connecting Hellinger and Rényi divergence to KL divergence

    Safe-Bayesian Generalized Linear Regression

    Get PDF
    We study generalized Bayesian inference under misspecification, i.e. when the model is ‘wrong but useful’. Generalized Bayes equips the likelihood with a learning rate η. We show that for generalized linear models (GLMs), η-generalized Bayes concentrates around the best approximation of the truth within the model for specific ηeq1, even under severely misspecified noise, as long as the tails of the true distribution are exponential. We derive MCMC samplers for generalized Bayesian lasso and logistic regression and give examples of both simulated and real-world data in which generalized Bayes substantially outperforms standard Bayes

    Accuracy and Stability of Computing High-Order Derivatives of Analytic Functions by Cauchy Integrals

    Full text link
    High-order derivatives of analytic functions are expressible as Cauchy integrals over circular contours, which can very effectively be approximated, e.g., by trapezoidal sums. Whereas analytically each radius r up to the radius of convergence is equal, numerical stability strongly depends on r. We give a comprehensive study of this effect; in particular we show that there is a unique radius that minimizes the loss of accuracy caused by round-off errors. For large classes of functions, though not for all, this radius actually gives about full accuracy; a remarkable fact that we explain by the theory of Hardy spaces, by the Wiman-Valiron and Levin-Pfluger theory of entire functions, and by the saddle-point method of asymptotic analysis. Many examples and non-trivial applications are discussed in detail.Comment: Version 4 has some references and a discussion of other quadrature rules added; 57 pages, 7 figures, 6 tables; to appear in Found. Comput. Mat

    Alterations to nuclear architecture and genome behavior in senescent cells.

    Get PDF
    The organization of the genome within interphase nuclei, and how it interacts with nuclear structures is important for the regulation of nuclear functions. Many of the studies researching the importance of genome organization and nuclear structure are performed in young, proliferating, and often transformed cells. These studies do not reveal anything about the nucleus or genome in nonproliferating cells, which may be relevant for the regulation of both proliferation and replicative senescence. Here, we provide an overview of what is known about the genome and nuclear structure in senescent cells. We review the evidence that nuclear structures, such as the nuclear lamina, nucleoli, the nuclear matrix, nuclear bodies (such as promyelocytic leukemia bodies), and nuclear morphology all become altered within growth-arrested or senescent cells. Specific alterations to the genome in senescent cells, as compared to young proliferating cells, are described, including aneuploidy, chromatin modifications, chromosome positioning, relocation of heterochromatin, and changes to telomeres

    Forward pi^0 Production and Associated Transverse Energy Flow in Deep-Inelastic Scattering at HERA

    Full text link
    Deep-inelastic positron-proton interactions at low values of Bjorken-x down to x \approx 4.10^-5 which give rise to high transverse momentum pi^0 mesons are studied with the H1 experiment at HERA. The inclusive cross section for pi^0 mesons produced at small angles with respect to the proton remnant (the forward region) is presented as a function of the transverse momentum and energy of the pi^0 and of the four-momentum transfer Q^2 and Bjorken-x. Measurements are also presented of the transverse energy flow in events containing a forward pi^0 meson. Hadronic final state calculations based on QCD models implementing different parton evolution schemes are confronted with the data.Comment: 27 pages, 8 figures and 3 table

    Search for Doubly-Charged Higgs Boson Production at HERA

    Get PDF
    A search for the single production of doubly-charged Higgs bosons H^{\pm \pm} in ep collisions is presented. The signal is searched for via the Higgs decays into a high mass pair of same charge leptons, one of them being an electron. The analysis uses up to 118 pb^{-1} of ep data collected by the H1 experiment at HERA. No evidence for doubly-charged Higgs production is observed and mass dependent upper limits are derived on the Yukawa couplings h_{el} of the Higgs boson to an electron-lepton pair. Assuming that the doubly-charged Higgs only decays into an electron and a muon via a coupling of electromagnetic strength h_{e \mu} = \sqrt{4 \pi \alpha_{em}} = 0.3, a lower limit of 141 GeV on the H^{\pm\pm} mass is obtained at the 95% confidence level. For a doubly-charged Higgs decaying only into an electron and a tau and a coupling h_{e\tau} = 0.3, masses below 112 GeV are ruled out.Comment: 15 pages, 3 figures, 1 tabl

    Low Q^2 Jet Production at HERA and Virtual Photon Structure

    Get PDF
    The transition between photoproduction and deep-inelastic scattering is investigated in jet production at the HERA ep collider, using data collected by the H1 experiment. Measurements of the differential inclusive jet cross-sections dsigep/dEt* and dsigmep/deta*, where Et* and eta* are the transverse energy and the pseudorapidity of the jets in the virtual photon-proton centre of mass frame, are presented for 0 < Q2 < 49 GeV2 and 0.3 < y < 0.6. The interpretation of the results in terms of the structure of the virtual photon is discussed. The data are best described by QCD calculations which include a partonic structure of the virtual photon that evolves with Q2.Comment: 20 pages, 5 Figure

    Hadron Production in Diffractive Deep-Inelastic Scattering

    Get PDF
    Characteristics of hadron production in diffractive deep-inelastic positron-proton scattering are studied using data collected in 1994 by the H1 experiment at HERA. The following distributions are measured in the centre-of-mass frame of the photon dissociation system: the hadronic energy flow, the Feynman-x (x_F) variable for charged particles, the squared transverse momentum of charged particles (p_T^{*2}), and the mean p_T^{*2} as a function of x_F. These distributions are compared with results in the gamma^* p centre-of-mass frame from inclusive deep-inelastic scattering in the fixed-target experiment EMC, and also with the predictions of several Monte Carlo calculations. The data are consistent with a picture in which the partonic structure of the diffractive exchange is dominated at low Q^2 by hard gluons.Comment: 16 pages, 6 figures, submitted to Phys. Lett.
    corecore