736 research outputs found
From Mutual Information to Expected Dynamics: New Generalization Bounds for Heavy-Tailed SGD
Understanding the generalization abilities of modern machine learning
algorithms has been a major research topic over the past decades. In recent
years, the learning dynamics of Stochastic Gradient Descent (SGD) have been
related to heavy-tailed dynamics. This has been successfully applied to
generalization theory by exploiting the fractal properties of those dynamics.
However, the derived bounds depend on mutual information (decoupling) terms
that are beyond the reach of computability. In this work, we prove
generalization bounds over the trajectory of a class of heavy-tailed dynamics,
without those mutual information terms. Instead, we introduce a geometric
decoupling term by comparing the learning dynamics (depending on the empirical
risk) with an expected one (depending on the population risk). We further
upper-bound this geometric term, by using techniques from the heavy-tailed and
the fractal literature, making it fully computable. Moreover, as an attempt to
tighten the bounds, we propose a PAC-Bayesian setting based on perturbed
dynamics, in which the same geometric term plays a crucial role and can still
be bounded using the techniques described above.Comment: Accepted in the NeurIPS 2023 Workshop Heavy Tails in Machine Learnin
Learning via Wasserstein-Based High Probability Generalisation Bounds
Minimising upper bounds on the population risk or the generalisation gap has
been widely used in structural risk minimisation (SRM) -- this is in particular
at the core of PAC-Bayesian learning. Despite its successes and unfailing surge
of interest in recent years, a limitation of the PAC-Bayesian framework is that
most bounds involve a Kullback-Leibler (KL) divergence term (or its
variations), which might exhibit erratic behavior and fail to capture the
underlying geometric structure of the learning problem -- hence restricting its
use in practical applications. As a remedy, recent studies have attempted to
replace the KL divergence in the PAC-Bayesian bounds with the Wasserstein
distance. Even though these bounds alleviated the aforementioned issues to a
certain extent, they either hold in expectation, are for bounded losses, or are
nontrivial to minimize in an SRM framework. In this work, we contribute to this
line of research and prove novel Wasserstein distance-based PAC-Bayesian
generalisation bounds for both batch learning with independent and identically
distributed (i.i.d.) data, and online learning with potentially non-i.i.d.
data. Contrary to previous art, our bounds are stronger in the sense that (i)
they hold with high probability, (ii) they apply to unbounded (potentially
heavy-tailed) losses, and (iii) they lead to optimizable training objectives
that can be used in SRM. As a result we derive novel Wasserstein-based
PAC-Bayesian learning algorithms and we illustrate their empirical advantage on
a variety of experiments.Comment: Accepted to NeurIPS 202
The chemical properties of dissolved organic matter as a function of seasonal and microbiological factors
Dissolved organic matter (DOM) is the pool of molecules predominantly produced from cellular growth in both terrestrial and aquatic systems and forms a reservoir of 662 Pg of carbon in the ocean. With as many as 10â” to 10â· different chemicals held in a single sample, the chemical diversity typically outstrips the capability of analytical techniques and the human capacity to effectively monitor the effects of environmental factors on their individual abundance. To address this issue, we adopted a âfingerprintingâ approach and performed two sets of experiments to monitor the behavior of co-clustered compounds.
In the first experiment, the change of DOM under seasonal, spatial, and reactivity variables was delineated using a size-exclusion chromatography approach applying multiple detectors and a computing technique called PARAFAC. The model showed how the molar mass and fluorescent properties of DOM change with the impact of biological activity and photodegradation in terrestrial aquatic systems, as well as leaching from different soil and sediment profiles. The second experiment analyzed the production of DOM moieties during the growth of several mixed diatom assemblages. Various patterns in fluorescent molecules and NMR bands were observed characterizing better the deep biological imprint of primary producers on DOM in estuaries.
These two experiments, both performed in boreal regions, were complementary both in processes (i.e., production vs degradation) and in techniques (e.g., mass spectrometry vs NMR). It also enabled us to map those effects across the aquatic gradient (i.e., rivers and coasts). Interesting findings were yielded, and each time a limited number of factors were able to explain most of the data variance which allowed me to situate and discuss my results within the context of various DOM studies
MĂ©thodologie pour lâĂ©tude de lâĂ©volution des comportements des voyageurs de transport collectif urbain
RĂSUMĂ : DĂ©mocratisĂ© depuis dĂ©jĂ plusieurs annĂ©es les systĂšmes tarifaires automatisĂ©s, relatifs Ă lâaccĂšs aux transports en commun, gĂ©nĂšrent des masses de donnĂ©es encore trop peu exploitĂ©es. Ces donnĂ©es issues de cartes Ă puce sont devenues si volumineuses que leur analyse reprĂ©sente un vĂ©ritable dĂ©fi pour lâhomme, mais Ă©galement un immense potentiel pour la planification en transport en commun.
Ce mĂ©moire sâinscrit dans le cadre de la valorisation de volumĂ©tries importantes de donnĂ©es quotidiennes. Ouvrant un projet commun avec des exploitants de transports, il sâagit de sâintĂ©resser Ă lâanalyse de la demande. Lâensemble des mĂ©thodes seront dĂ©veloppĂ©es Ă partir de trois ans de donnĂ©es de transaction issues de lâutilisation du transport par bus Ă Gatineau. Lâobjectif principal de la recherche est de prĂ©senter une mĂ©thodologie simple et complĂšte, relative Ă lâĂ©tude longitudinale des comportements dâusage des cartes Ă puces sur long terme en utilisant diffĂ©rentes techniques dâexploration de donnĂ©es. Ă terme, cette mĂ©thode dâanalyse fournit des rĂ©sultats aidant le travail dâun planificateur de rĂ©seau. Les sous-objectifs de lâĂ©tude sont les suivants :
- DĂ©velopper un algorithme permettant une analyse comportementale des usagers. - DĂ©velopper un algorithme expĂ©rimental amĂ©liorant la mĂ©thode prĂ©cĂ©dente, afin que lâanalyste puisse suivre lâĂ©volution des comportements des usagers Ă travers le temps. - Proposer une mĂ©thode de prĂ©vision des Ă©volutions, enrichissant ainsi les connaissances apportĂ©es Ă la planification. Ce mĂ©moire dĂ©bute par une revue de littĂ©rature prĂ©sentant lâintĂ©rĂȘt de lâutilisation des cartes Ă puces en analyse. Il sâagit de sâintĂ©resser aux diverses Ă©tudes rĂ©alisĂ©es, notamment dans le cadre dâanalyses comportementales. Une partie de la littĂ©rature sâintĂ©resse aux techniques dâexploration de donnĂ©es, particuliĂšrement dans le cas de segmentations et de prĂ©visions. La section mĂ©thodologie prĂ©sente les raisonnements rĂ©pondant aux trois sous-objectifs, et la derniĂšre partie les rĂ©sultats des diverses expĂ©rimentations effectuĂ©es sur les donnĂ©es fournies par la STO. Les contributions apportĂ©es par ce mĂ©moire sont :
- La prĂ©sentation dâune mĂ©thode classique dâanalyse comportementale des cartes Ă puce Ă partir de leurs utilisations. Un travail de segmentation est effectuĂ© sur lâensemble des dĂ©placements hebdomadaires en transports en commun afin de repĂ©rer les similaritĂ©s entre comportements.
- La conception et la critique dâune mĂ©thode expĂ©rimentale basĂ©e sur une segmentation hebdomadaire visant Ă montrer lâĂ©volution des comportements des cartes Ă travers le temps.
- DiffĂ©rents indicateurs de qualitĂ© et de stabilitĂ© de segmentation sont proposĂ©s afin de comparer les diverses mĂ©thodes engagĂ©es, et de caractĂ©riser la population de cartes Ă©tudiĂ©e. - Jouant sur une possible Ă©volution comportementale des cartes, une critique sur la fiabilitĂ© de lâutilisation de mĂ©thodes de prĂ©vision est rĂ©alisĂ©e. Les prĂ©visions sont appliquĂ©es sur lâĂ©volution comportementale des groupes ainsi que lâĂ©volution de la taille de leur population. En conclusion, ce projet prĂ©sente une mĂ©thode classique, fonctionnelle et applicable en industrie permettant lâanalyse comportementale des usagers. Prenant comme entrĂ©e un jeu de donnĂ©es de cartes Ă puce, la mĂ©thode exporte les rĂ©sultats de segmentation liĂ©s Ă lâutilisation des transports en commun. Par cela, elle dĂ©finit 6 groupes dâidentifiants aux comportements similaires dont les caractĂ©ristiques propres permettent lâaide Ă la dĂ©cision en planification des transports. Il sâagit de trois groupes dont les dĂ©placements rĂ©currents en semaine ressemblent Ă ceux de travailleurs Ă temps plein et Ă mi-temps. Deux groupes reprĂ©sentent les comportements de dĂ©placements occasionnels et le dernier contient lâensemble des cartes qui produisent le plus de dĂ©placement. Le tout est rĂ©alisĂ© en un temps relativement long : 11 minutes pour la segmentation de 10 millions de dĂ©placements. La mĂ©thode expĂ©rimentale, quant Ă elle, se concentre sur lâĂ©volution comportementale possible des cartes. Sans pour autant ĂȘtre parfaite, elle admet un potentiel Ă©norme. En effet, elle permet une analyse des comportements des 6 groupes de cartes en un temps de calcul trĂšs court (38 secondes pour une qualitĂ© similaire). Il sâagit de groupes dont les caractĂ©ristiques sont trĂšs proches de ceux issus de la mĂ©thode classique, mais le principe incrĂ©mental de la segmentation rend possible l'Ă©tude de lâĂ©volution comportementale, jugĂ©e fixe dans la mĂ©thode classique.----------ABSTRACT : For several years automated fare systems related to public transport access are generating an, not enough, exploited massive volume of data. These smart card data became so voluminous they represent a challenge for humans and a huge potential to public transport planning too. This work aims to value massive volumes of daily data. Opening a common project with transit services, the analysis is based on studying demand. All methods were developed thanks to the three years of transactions from the usage of Gatineauâs bus network.
Presenting a simple and a complete methodology to apply a longitudinal analysis on smart card usage behavior using different data mining techniques, represent the main purpose of the research. At the end, the analysis methodology gives the results helping to do the transit planners job.
The sub-objectives are the following ones: - Develop an algorithm allowing usersâ behavior analysis - Develop an improved algorithm (experimental), allowing to follow the usersâ behavior evolution through time. - Propose an evolution prevision methodology, enhancing transit planning knowledge. This works starts with a literature review presenting smart card data usage in analysis, particularly through different studies done in behavior analysis. The second part of the literature review is about data mining techniques like clustering and forecasting. The methodology section describes the three sub-objectives, and the final section presents the different applications on STOâs data.
The main achievements of this project are: - The presentation of a classical methodology allowing to analyze smart cardsâ behavior through their activities. A clustering technique is applied on all weekly usage of public transit to find similarity between behaviors. - The conception and critic of an experimental method based on week-to-week clustering aiming to show the usersâ behavior evolution through time. - Different quality and stability indicators are proposed to compare the methods applied and to characterize the population. Knowing that usersâ behavior can evolve, a critic on prevision technique viability is applied. Forecasts methods are used on clustersâ behavior evolution and their population size evolution. Finally, this project presents an industrially applicable methodology on transit usersâ behavior. Taking smart card data as input, the algorithm exports the transit usage clustering results. This way it defines 6 groups of IDs with similar behavior which the proper characteristics help the transit planner to take decisions. There are three groups which the trips patterns look like full time and part-time workers trip patterns. Two of the groups represent occasional trip behavior and the last one holds the cards with the most trips. The computation time is relatively high: 11 minutes for the clustering of 10 million transactions. The experimental method focuses on the possible smart cardsâ behavior evolution. Without being perfect, it shows a huge potential. Indeed, the method allows a behavior analysis of 6 groups with a shorter computation time (only 38 seconds for a similar quality). These groups present the same characteristics as those from the traditional method, but the way the algorithm works makes the behavior evolution analysis possible in this case
Tighter Generalisation Bounds via Interpolation
This paper contains a recipe for deriving new PAC-Bayes generalisation bounds
based on the -divergence, and, in addition, presents PAC-Bayes
generalisation bounds where we interpolate between a series of probability
divergences (including but not limited to KL, Wasserstein, and total
variation), making the best out of many worlds depending on the posterior
distributions properties. We explore the tightness of these bounds and connect
them to earlier results from statistical learning, which are specific cases. We
also instantiate our bounds as training objectives, yielding non-trivial
guarantees and practical performances
Du conte Ă la scĂšne, l'exemple de Blanche-Neige
Le prĂ©sent mĂ©moire de recherche-crĂ©ation a pour objet les rĂ©Ă©critures thĂ©Ăątrales des contes. Si le conte merveilleux et le thĂ©Ăątre sont a priori deux genres littĂ©raires distincts de par leur origine, leur nature et leur mode dâexpression, ils prĂ©sentent nĂ©anmoins des points communs qui facilitent la transgression des frontiĂšres gĂ©nĂ©riques. Aussi les dramaturges se sont-ils parfois risquĂ©s Ă proposer des adaptations de contes merveilleux pour le thĂ©Ăątre. Câest par exemple le cas de JoĂ«l Pommerat, qui a rĂ©cemment proposĂ© des transpositions scĂ©niques de trois contes traditionnels, revisitant et actualisant thĂšmes et personnages.
Dans cet essai, je mâintĂ©resse tout dâabord aux Ă©lĂ©ments qui favorisent la rĂ©Ă©criture thĂ©Ăątrale des contes. Je mâinterroge ensuite sur la maniĂšre dont Pommerat, qui fonde sa rĂ©flexion sur les apports de la philosophie, de la sociologie et de la psychanalyse, prĂ©pare ses spectacles ; ce que jâillustre en mettant en lumiĂšre la mĂ©ditation menĂ©e par lâauteur sur le travail du deuil dans son spectacle Cendrillon. Ă lâinstar de Pommerat, je prĂ©sente enfin une Ă©tude psychanalytique des personnages du conte Blanche-Neige des frĂšres Grimm.
La crĂ©ation que je propose consiste en une transposition thĂ©Ăątrale du conte Blanche-Neige des frĂšres Grimm, qui dĂ©place situations et personnages dans le monde contemporain de lâentreprise.The purpose of this research-creation master thesis is to rewrite the theatrical aspects of the fairy tales. While storytelling and theatre are a priori two literary genres distinct in origin, nature and mode of expression, they nevertheless share common qualities that facilitate the transgression of generic boundaries. So playwrights have sometimes ventured to offer adaptations of fairy tales for the theatre. This is the case, for example, of JoĂ«l Pommerat, who recently proposed scenic transpositions of three traditional fairy tales, revisiting and updating themes and characters.
In this essay, I am first interested in the elements that promote the theatrical rewriting of fairy tales. I then ask myself how Pommerat, who bases his reflection on the contributions of philosophy, sociology and psychoanalysis, prepares his shows ; what I illustrate by highlighting the author's reflections on the work of mourning in his play Cinderella. Like Pommerat, I finally present a psychoanalytical study of the characters in the Grimm brothers' Snow White tale.
The creation I propose consists of a theatrical transposition of the Grimm brothers' Snow White tale, which moves situations and characters into the contemporary world of business
Perturbations of the CD8+ T-cell repertoire in CVID patients with complications
AbstractA higher chronic expansion of effector cytotoxic CD8+DR+ T-lymphocytes has been reported in common variable immunodeficiency (CVID) patients with complications such as splenomegaly, autoimmune disease and/or granulomatous disease. In order to document the features associated with this T cell activation involving the CD8+ T-compartment, we examined the diversity of the alpha/beta TCR repertoire of the patient's CD8+ T-lymphocytes using the qualitative analysis of the CDR3 lengths (Immunoscope).Ten CIVD patients were enrolled in this study, four without complications (Group 1), six with complications (Group 2). All patients exhibited non-gaussian altered CDR3 length distributions, albeit to different extent within the different VÎČ families. CVID patients with activated CD8+ T-cells show a reduction of their TCR repertoire diversity which is more severe in patients with complications. Viral reactivations such as CMV are suspected to be part of the mechanisms underlying immunosenescence
Learning Stochastic Majority Votes by Minimizing a PAC-Bayes Generalization Bound
We investigate a stochastic counterpart of majority votes over finite ensembles of classifiers, and study its generalization properties. While our approach holds for arbitrary distributions, we instantiate it with Dirichlet distributions: this allows for a closed-form and differentiable expression for the expected risk, which then turns the generalization bound into a tractable training objective.The resulting stochastic majority vote learning algorithm achieves state-of-the-art accuracy and benefits from (non-vacuous) tight generalization bounds, in a series of numerical experiments when compared to competing algorithms which also minimize PAC-Bayes objectives -- both with uninformed (data-independent) and informed (data-dependent) priors
Immunological markers after long-term treatment interruption in chronically HIV-1 infected patients with CD4 cell count above 400 x 10(6) cells/l.
OBJECTIVE: To analyse immunological markers associated with CD4+ lymphocyte T-cell count (CD4+) evolution during 12-month follow-up after treatment discontinuation. METHOD: Prospective observational study of chronically HIV-1 infected patients with CD4+ above 400 x 10(6) cells/l. RESULTS: CD4+ changes took place in two phases: an initial rapid decrease in the first month (-142 x 10(6) cells/l on average), followed by a slow decline (-17 x 10(6) cells/l on average) The second slope of CD4+ decline was not correlated with the first and only baseline plasma HIV RNA was associated with it. The decline in CD4+ during the first month was steeper in patients with higher CD4+ and weaker plasma HIV RNA baseline levels. Moreover, the decline was less pronounced (P < 10(-4)) in patients with CD4+ nadir above 350 x 10(6) cells/l (-65 x 10(6) cells/l per month) in comparison with those below 350 x 10(6) cells/l (-200 x 10(6) cells/l per month). A high number of dendritic cells (DCs) whatever the type was associated with high CD4+ at the time of treatment interruption and its steeper decline over the first month. Moreover, the myeloid DC level was stable whereas the lymphoid DC count, which tended to decrease in association with decrease in CD4+, was negatively correlated with the HIV RNA load slope. CONCLUSIONS: The results support the use of the CD4+ nadir to predict the CD4+ dynamic after treatment interruption and consideration of the CD4+ count after 1-month of interruption merely reflects the 12-month level of CD4+. Although DCs seem to be associated with the CD4+ dynamic, the benefit of monitoring them has still to be defined
- âŠ