736 research outputs found

    From Mutual Information to Expected Dynamics: New Generalization Bounds for Heavy-Tailed SGD

    Full text link
    Understanding the generalization abilities of modern machine learning algorithms has been a major research topic over the past decades. In recent years, the learning dynamics of Stochastic Gradient Descent (SGD) have been related to heavy-tailed dynamics. This has been successfully applied to generalization theory by exploiting the fractal properties of those dynamics. However, the derived bounds depend on mutual information (decoupling) terms that are beyond the reach of computability. In this work, we prove generalization bounds over the trajectory of a class of heavy-tailed dynamics, without those mutual information terms. Instead, we introduce a geometric decoupling term by comparing the learning dynamics (depending on the empirical risk) with an expected one (depending on the population risk). We further upper-bound this geometric term, by using techniques from the heavy-tailed and the fractal literature, making it fully computable. Moreover, as an attempt to tighten the bounds, we propose a PAC-Bayesian setting based on perturbed dynamics, in which the same geometric term plays a crucial role and can still be bounded using the techniques described above.Comment: Accepted in the NeurIPS 2023 Workshop Heavy Tails in Machine Learnin

    Learning via Wasserstein-Based High Probability Generalisation Bounds

    Full text link
    Minimising upper bounds on the population risk or the generalisation gap has been widely used in structural risk minimisation (SRM) -- this is in particular at the core of PAC-Bayesian learning. Despite its successes and unfailing surge of interest in recent years, a limitation of the PAC-Bayesian framework is that most bounds involve a Kullback-Leibler (KL) divergence term (or its variations), which might exhibit erratic behavior and fail to capture the underlying geometric structure of the learning problem -- hence restricting its use in practical applications. As a remedy, recent studies have attempted to replace the KL divergence in the PAC-Bayesian bounds with the Wasserstein distance. Even though these bounds alleviated the aforementioned issues to a certain extent, they either hold in expectation, are for bounded losses, or are nontrivial to minimize in an SRM framework. In this work, we contribute to this line of research and prove novel Wasserstein distance-based PAC-Bayesian generalisation bounds for both batch learning with independent and identically distributed (i.i.d.) data, and online learning with potentially non-i.i.d. data. Contrary to previous art, our bounds are stronger in the sense that (i) they hold with high probability, (ii) they apply to unbounded (potentially heavy-tailed) losses, and (iii) they lead to optimizable training objectives that can be used in SRM. As a result we derive novel Wasserstein-based PAC-Bayesian learning algorithms and we illustrate their empirical advantage on a variety of experiments.Comment: Accepted to NeurIPS 202

    The chemical properties of dissolved organic matter as a function of seasonal and microbiological factors

    Get PDF
    Dissolved organic matter (DOM) is the pool of molecules predominantly produced from cellular growth in both terrestrial and aquatic systems and forms a reservoir of 662 Pg of carbon in the ocean. With as many as 10⁔ to 10⁷ different chemicals held in a single sample, the chemical diversity typically outstrips the capability of analytical techniques and the human capacity to effectively monitor the effects of environmental factors on their individual abundance. To address this issue, we adopted a “fingerprinting” approach and performed two sets of experiments to monitor the behavior of co-clustered compounds. In the first experiment, the change of DOM under seasonal, spatial, and reactivity variables was delineated using a size-exclusion chromatography approach applying multiple detectors and a computing technique called PARAFAC. The model showed how the molar mass and fluorescent properties of DOM change with the impact of biological activity and photodegradation in terrestrial aquatic systems, as well as leaching from different soil and sediment profiles. The second experiment analyzed the production of DOM moieties during the growth of several mixed diatom assemblages. Various patterns in fluorescent molecules and NMR bands were observed characterizing better the deep biological imprint of primary producers on DOM in estuaries. These two experiments, both performed in boreal regions, were complementary both in processes (i.e., production vs degradation) and in techniques (e.g., mass spectrometry vs NMR). It also enabled us to map those effects across the aquatic gradient (i.e., rivers and coasts). Interesting findings were yielded, and each time a limited number of factors were able to explain most of the data variance which allowed me to situate and discuss my results within the context of various DOM studies

    MĂ©thodologie pour l’étude de l’évolution des comportements des voyageurs de transport collectif urbain

    Get PDF
    RÉSUMÉ : DĂ©mocratisĂ© depuis dĂ©jĂ  plusieurs annĂ©es les systĂšmes tarifaires automatisĂ©s, relatifs Ă  l’accĂšs aux transports en commun, gĂ©nĂšrent des masses de donnĂ©es encore trop peu exploitĂ©es. Ces donnĂ©es issues de cartes Ă  puce sont devenues si volumineuses que leur analyse reprĂ©sente un vĂ©ritable dĂ©fi pour l’homme, mais Ă©galement un immense potentiel pour la planification en transport en commun. Ce mĂ©moire s’inscrit dans le cadre de la valorisation de volumĂ©tries importantes de donnĂ©es quotidiennes. Ouvrant un projet commun avec des exploitants de transports, il s’agit de s’intĂ©resser Ă  l’analyse de la demande. L’ensemble des mĂ©thodes seront dĂ©veloppĂ©es Ă  partir de trois ans de donnĂ©es de transaction issues de l’utilisation du transport par bus Ă  Gatineau. L’objectif principal de la recherche est de prĂ©senter une mĂ©thodologie simple et complĂšte, relative Ă  l’étude longitudinale des comportements d’usage des cartes Ă  puces sur long terme en utilisant diffĂ©rentes techniques d’exploration de donnĂ©es. À terme, cette mĂ©thode d’analyse fournit des rĂ©sultats aidant le travail d’un planificateur de rĂ©seau. Les sous-objectifs de l’étude sont les suivants : - DĂ©velopper un algorithme permettant une analyse comportementale des usagers. - DĂ©velopper un algorithme expĂ©rimental amĂ©liorant la mĂ©thode prĂ©cĂ©dente, afin que l’analyste puisse suivre l’évolution des comportements des usagers Ă  travers le temps. - Proposer une mĂ©thode de prĂ©vision des Ă©volutions, enrichissant ainsi les connaissances apportĂ©es Ă  la planification. Ce mĂ©moire dĂ©bute par une revue de littĂ©rature prĂ©sentant l’intĂ©rĂȘt de l’utilisation des cartes Ă  puces en analyse. Il s’agit de s’intĂ©resser aux diverses Ă©tudes rĂ©alisĂ©es, notamment dans le cadre d’analyses comportementales. Une partie de la littĂ©rature s’intĂ©resse aux techniques d’exploration de donnĂ©es, particuliĂšrement dans le cas de segmentations et de prĂ©visions. La section mĂ©thodologie prĂ©sente les raisonnements rĂ©pondant aux trois sous-objectifs, et la derniĂšre partie les rĂ©sultats des diverses expĂ©rimentations effectuĂ©es sur les donnĂ©es fournies par la STO. Les contributions apportĂ©es par ce mĂ©moire sont : - La prĂ©sentation d’une mĂ©thode classique d’analyse comportementale des cartes Ă  puce Ă  partir de leurs utilisations. Un travail de segmentation est effectuĂ© sur l’ensemble des dĂ©placements hebdomadaires en transports en commun afin de repĂ©rer les similaritĂ©s entre comportements. - La conception et la critique d’une mĂ©thode expĂ©rimentale basĂ©e sur une segmentation hebdomadaire visant Ă  montrer l’évolution des comportements des cartes Ă  travers le temps. - DiffĂ©rents indicateurs de qualitĂ© et de stabilitĂ© de segmentation sont proposĂ©s afin de comparer les diverses mĂ©thodes engagĂ©es, et de caractĂ©riser la population de cartes Ă©tudiĂ©e. - Jouant sur une possible Ă©volution comportementale des cartes, une critique sur la fiabilitĂ© de l’utilisation de mĂ©thodes de prĂ©vision est rĂ©alisĂ©e. Les prĂ©visions sont appliquĂ©es sur l’évolution comportementale des groupes ainsi que l’évolution de la taille de leur population. En conclusion, ce projet prĂ©sente une mĂ©thode classique, fonctionnelle et applicable en industrie permettant l’analyse comportementale des usagers. Prenant comme entrĂ©e un jeu de donnĂ©es de cartes Ă  puce, la mĂ©thode exporte les rĂ©sultats de segmentation liĂ©s Ă  l’utilisation des transports en commun. Par cela, elle dĂ©finit 6 groupes d’identifiants aux comportements similaires dont les caractĂ©ristiques propres permettent l’aide Ă  la dĂ©cision en planification des transports. Il s’agit de trois groupes dont les dĂ©placements rĂ©currents en semaine ressemblent Ă  ceux de travailleurs Ă  temps plein et Ă  mi-temps. Deux groupes reprĂ©sentent les comportements de dĂ©placements occasionnels et le dernier contient l’ensemble des cartes qui produisent le plus de dĂ©placement. Le tout est rĂ©alisĂ© en un temps relativement long : 11 minutes pour la segmentation de 10 millions de dĂ©placements. La mĂ©thode expĂ©rimentale, quant Ă  elle, se concentre sur l’évolution comportementale possible des cartes. Sans pour autant ĂȘtre parfaite, elle admet un potentiel Ă©norme. En effet, elle permet une analyse des comportements des 6 groupes de cartes en un temps de calcul trĂšs court (38 secondes pour une qualitĂ© similaire). Il s’agit de groupes dont les caractĂ©ristiques sont trĂšs proches de ceux issus de la mĂ©thode classique, mais le principe incrĂ©mental de la segmentation rend possible l'Ă©tude de l’évolution comportementale, jugĂ©e fixe dans la mĂ©thode classique.----------ABSTRACT : For several years automated fare systems related to public transport access are generating an, not enough, exploited massive volume of data. These smart card data became so voluminous they represent a challenge for humans and a huge potential to public transport planning too. This work aims to value massive volumes of daily data. Opening a common project with transit services, the analysis is based on studying demand. All methods were developed thanks to the three years of transactions from the usage of Gatineau’s bus network. Presenting a simple and a complete methodology to apply a longitudinal analysis on smart card usage behavior using different data mining techniques, represent the main purpose of the research. At the end, the analysis methodology gives the results helping to do the transit planners job. The sub-objectives are the following ones: - Develop an algorithm allowing users’ behavior analysis - Develop an improved algorithm (experimental), allowing to follow the users’ behavior evolution through time. - Propose an evolution prevision methodology, enhancing transit planning knowledge. This works starts with a literature review presenting smart card data usage in analysis, particularly through different studies done in behavior analysis. The second part of the literature review is about data mining techniques like clustering and forecasting. The methodology section describes the three sub-objectives, and the final section presents the different applications on STO’s data. The main achievements of this project are: - The presentation of a classical methodology allowing to analyze smart cards’ behavior through their activities. A clustering technique is applied on all weekly usage of public transit to find similarity between behaviors. - The conception and critic of an experimental method based on week-to-week clustering aiming to show the users’ behavior evolution through time. - Different quality and stability indicators are proposed to compare the methods applied and to characterize the population. Knowing that users’ behavior can evolve, a critic on prevision technique viability is applied. Forecasts methods are used on clusters’ behavior evolution and their population size evolution. Finally, this project presents an industrially applicable methodology on transit users’ behavior. Taking smart card data as input, the algorithm exports the transit usage clustering results. This way it defines 6 groups of IDs with similar behavior which the proper characteristics help the transit planner to take decisions. There are three groups which the trips patterns look like full time and part-time workers trip patterns. Two of the groups represent occasional trip behavior and the last one holds the cards with the most trips. The computation time is relatively high: 11 minutes for the clustering of 10 million transactions. The experimental method focuses on the possible smart cards’ behavior evolution. Without being perfect, it shows a huge potential. Indeed, the method allows a behavior analysis of 6 groups with a shorter computation time (only 38 seconds for a similar quality). These groups present the same characteristics as those from the traditional method, but the way the algorithm works makes the behavior evolution analysis possible in this case

    Tighter Generalisation Bounds via Interpolation

    Full text link
    This paper contains a recipe for deriving new PAC-Bayes generalisation bounds based on the (f,Γ)(f, \Gamma)-divergence, and, in addition, presents PAC-Bayes generalisation bounds where we interpolate between a series of probability divergences (including but not limited to KL, Wasserstein, and total variation), making the best out of many worlds depending on the posterior distributions properties. We explore the tightness of these bounds and connect them to earlier results from statistical learning, which are specific cases. We also instantiate our bounds as training objectives, yielding non-trivial guarantees and practical performances

    Du conte Ă  la scĂšne, l'exemple de Blanche-Neige

    Full text link
    Le prĂ©sent mĂ©moire de recherche-crĂ©ation a pour objet les rĂ©Ă©critures thĂ©Ăątrales des contes. Si le conte merveilleux et le thĂ©Ăątre sont a priori deux genres littĂ©raires distincts de par leur origine, leur nature et leur mode d’expression, ils prĂ©sentent nĂ©anmoins des points communs qui facilitent la transgression des frontiĂšres gĂ©nĂ©riques. Aussi les dramaturges se sont-ils parfois risquĂ©s Ă  proposer des adaptations de contes merveilleux pour le thĂ©Ăątre. C’est par exemple le cas de JoĂ«l Pommerat, qui a rĂ©cemment proposĂ© des transpositions scĂ©niques de trois contes traditionnels, revisitant et actualisant thĂšmes et personnages. Dans cet essai, je m’intĂ©resse tout d’abord aux Ă©lĂ©ments qui favorisent la rĂ©Ă©criture thĂ©Ăątrale des contes. Je m’interroge ensuite sur la maniĂšre dont Pommerat, qui fonde sa rĂ©flexion sur les apports de la philosophie, de la sociologie et de la psychanalyse, prĂ©pare ses spectacles ; ce que j’illustre en mettant en lumiĂšre la mĂ©ditation menĂ©e par l’auteur sur le travail du deuil dans son spectacle Cendrillon. À l’instar de Pommerat, je prĂ©sente enfin une Ă©tude psychanalytique des personnages du conte Blanche-Neige des frĂšres Grimm. La crĂ©ation que je propose consiste en une transposition thĂ©Ăątrale du conte Blanche-Neige des frĂšres Grimm, qui dĂ©place situations et personnages dans le monde contemporain de l’entreprise.The purpose of this research-creation master thesis is to rewrite the theatrical aspects of the fairy tales. While storytelling and theatre are a priori two literary genres distinct in origin, nature and mode of expression, they nevertheless share common qualities that facilitate the transgression of generic boundaries. So playwrights have sometimes ventured to offer adaptations of fairy tales for the theatre. This is the case, for example, of JoĂ«l Pommerat, who recently proposed scenic transpositions of three traditional fairy tales, revisiting and updating themes and characters. In this essay, I am first interested in the elements that promote the theatrical rewriting of fairy tales. I then ask myself how Pommerat, who bases his reflection on the contributions of philosophy, sociology and psychoanalysis, prepares his shows ; what I illustrate by highlighting the author's reflections on the work of mourning in his play Cinderella. Like Pommerat, I finally present a psychoanalytical study of the characters in the Grimm brothers' Snow White tale. The creation I propose consists of a theatrical transposition of the Grimm brothers' Snow White tale, which moves situations and characters into the contemporary world of business

    Perturbations of the CD8+ T-cell repertoire in CVID patients with complications

    Get PDF
    AbstractA higher chronic expansion of effector cytotoxic CD8+DR+ T-lymphocytes has been reported in common variable immunodeficiency (CVID) patients with complications such as splenomegaly, autoimmune disease and/or granulomatous disease. In order to document the features associated with this T cell activation involving the CD8+ T-compartment, we examined the diversity of the alpha/beta TCR repertoire of the patient's CD8+ T-lymphocytes using the qualitative analysis of the CDR3 lengths (Immunoscope).Ten CIVD patients were enrolled in this study, four without complications (Group 1), six with complications (Group 2). All patients exhibited non-gaussian altered CDR3 length distributions, albeit to different extent within the different VÎČ families. CVID patients with activated CD8+ T-cells show a reduction of their TCR repertoire diversity which is more severe in patients with complications. Viral reactivations such as CMV are suspected to be part of the mechanisms underlying immunosenescence

    Learning Stochastic Majority Votes by Minimizing a PAC-Bayes Generalization Bound

    Get PDF
    We investigate a stochastic counterpart of majority votes over finite ensembles of classifiers, and study its generalization properties. While our approach holds for arbitrary distributions, we instantiate it with Dirichlet distributions: this allows for a closed-form and differentiable expression for the expected risk, which then turns the generalization bound into a tractable training objective.The resulting stochastic majority vote learning algorithm achieves state-of-the-art accuracy and benefits from (non-vacuous) tight generalization bounds, in a series of numerical experiments when compared to competing algorithms which also minimize PAC-Bayes objectives -- both with uninformed (data-independent) and informed (data-dependent) priors

    Immunological markers after long-term treatment interruption in chronically HIV-1 infected patients with CD4 cell count above 400 x 10(6) cells/l.

    Get PDF
    OBJECTIVE: To analyse immunological markers associated with CD4+ lymphocyte T-cell count (CD4+) evolution during 12-month follow-up after treatment discontinuation. METHOD: Prospective observational study of chronically HIV-1 infected patients with CD4+ above 400 x 10(6) cells/l. RESULTS: CD4+ changes took place in two phases: an initial rapid decrease in the first month (-142 x 10(6) cells/l on average), followed by a slow decline (-17 x 10(6) cells/l on average) The second slope of CD4+ decline was not correlated with the first and only baseline plasma HIV RNA was associated with it. The decline in CD4+ during the first month was steeper in patients with higher CD4+ and weaker plasma HIV RNA baseline levels. Moreover, the decline was less pronounced (P < 10(-4)) in patients with CD4+ nadir above 350 x 10(6) cells/l (-65 x 10(6) cells/l per month) in comparison with those below 350 x 10(6) cells/l (-200 x 10(6) cells/l per month). A high number of dendritic cells (DCs) whatever the type was associated with high CD4+ at the time of treatment interruption and its steeper decline over the first month. Moreover, the myeloid DC level was stable whereas the lymphoid DC count, which tended to decrease in association with decrease in CD4+, was negatively correlated with the HIV RNA load slope. CONCLUSIONS: The results support the use of the CD4+ nadir to predict the CD4+ dynamic after treatment interruption and consideration of the CD4+ count after 1-month of interruption merely reflects the 12-month level of CD4+. Although DCs seem to be associated with the CD4+ dynamic, the benefit of monitoring them has still to be defined
    • 

    corecore