
    Complexity of pattern classes and Lipschitz property

    Rademacher and Gaussian complexities are successfully used in learning theory to measure the capacity of the class of functions to be learned. One of the most important properties of these complexities is the Lipschitz property: composing a class of functions with a fixed Lipschitz function may increase its complexity by at most a factor of twice the Lipschitz constant. The proof of this property is non-trivial (in contrast to the other properties), and it is believed that the proof in the Gaussian case is conceptually more difficult than the one for the Rademacher case. In this paper we give a detailed proof of the Lipschitz property for the Rademacher case and generalize the same idea to an arbitrary complexity (including the Gaussian). We also discuss a related topic: the Rademacher complexity of the class of all Lipschitz functions with a given Lipschitz constant. We show that this complexity is surprisingly low in the one-dimensional case. The question for higher dimensions remains open.
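
    For reference, the contraction property discussed above is usually stated as follows (a standard Ledoux-Talagrand-style formulation, sketched from the definitions rather than quoted from the paper). The empirical Rademacher complexity of a class \mathcal{F} on a sample x_1, ..., x_n is

        \hat{\mathcal{R}}_n(\mathcal{F}) = \mathbb{E}_{\sigma} \Big[ \sup_{f \in \mathcal{F}} \frac{1}{n} \sum_{i=1}^{n} \sigma_i f(x_i) \Big], \qquad \sigma_1, ..., \sigma_n \ \text{i.i.d. uniform on } \{-1, +1\},

    and the Lipschitz property says that for any L-Lipschitz \varphi : \mathbb{R} \to \mathbb{R} with \varphi(0) = 0,

        \hat{\mathcal{R}}_n(\varphi \circ \mathcal{F}) \le 2L \, \hat{\mathcal{R}}_n(\mathcal{F}), \qquad \text{where } \varphi \circ \mathcal{F} := \{ \varphi \circ f : f \in \mathcal{F} \}.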

    Predicting T Cell Receptor Antigen Specificity From Structural Features Derived From Homology Models of Receptor-Peptide-Major Histocompatibility Complexes

    The physical interaction between the T cell receptor (TCR) and its cognate antigen causes T cells to activate and participate in the immune response. Understanding this physical interaction is important for predicting TCR binding to a target epitope, as well as potential cross-reactivity. Here, we propose a way of collecting informative features of the binding interface from homology models of T cell receptor-peptide-major histocompatibility complex (TCR-pMHC) complexes. The information collected from these structures is sufficient to discriminate binding from non-binding TCR-pMHC pairs in multiple independent datasets. The classifier is limited by the number of crystal structures available for homology modelling and by the size of the training set; however, it shows performance comparable to sequence-based classifiers that require much larger training sets.
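
    As an illustration only (not the authors' pipeline), a binder/non-binder classifier over interface features derived from homology models might look like the following Python sketch; the feature matrix X and labels y are assumed to be precomputed, and the random placeholder data and all names are hypothetical.

        import numpy as np
        from sklearn.ensemble import RandomForestClassifier
        from sklearn.model_selection import cross_val_score

        # Hypothetical inputs: each row of X holds interface features computed from
        # one modelled TCR-pMHC complex (e.g. contact counts, buried surface area,
        # interaction energy terms); y[i] = 1 if the pair binds, 0 otherwise.
        rng = np.random.default_rng(0)
        X = rng.normal(size=(200, 16))    # placeholder for real structural features
        y = rng.integers(0, 2, size=200)  # placeholder for real binding labels

        clf = RandomForestClassifier(n_estimators=300, random_state=0)
        scores = cross_val_score(clf, X, y, cv=5, scoring="roc_auc")
        print(f"cross-validated AUC: {scores.mean():.3f} +/- {scores.std():.3f}")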

    Computational modelling of COVID-19: A study of compliance and superspreaders

    Background: The success of social distancing measures against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) depends heavily on population compliance. Mathematical modelling has been used extensively to assess how behavioural responses affect the rate of viral transmission. Previous coronavirus epidemics have been characterised by superspreaders, a small number of individuals who transmit a disease to a large group of individuals and who make transmission more stochastic (random) than for pathogens such as influenza. This growing evidence makes it urgent to understand transmission routes in order to target and combat outbreaks.
    Objective: To investigate the role of superspreaders in the rate of viral transmission under various levels of compliance.
    Method: A SEIRS-inspired social network model is adapted and calibrated to observe the infected links of a general population with and without superspreaders at four compliance levels. Local and global connection parameters are adjusted to simulate close-contact networks and travel restrictions, respectively, and each configuration is assessed. The mean and standard deviation of infections with and without superspreaders were calculated for each compliance level.
    Results: Increased compliance among superspreaders produces a significant reduction in infections. Assuming long-lasting immunity, superspreaders could even slow down the spread due to their high connectivity.
    Discussion: The main advantage of the network model is that it captures the heterogeneity and locality of social networks, including the role of superspreaders in epidemic dynamics. The main challenge for future empirical work is to give immediate attention to social settings, with targeted interventions that tackle superspreaders.
    Conclusion: Superspreaders who follow compliance guidelines play a central role in slowing the spread of infection. It is crucial to adjust social distancing measures, accompanied by population-wide testing and effective contact tracing, to prevent future outbreaks.
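
    To make the modelling setup concrete, the following is a heavily simplified toy simulation in Python, not the paper's calibrated model: an SEIRS-style process on a small-world contact network in which a few "superspreader" nodes receive extra long-range links, and a compliance parameter scales down the per-contact transmission probability. All parameter values (incubation, recovery and immunity-waning times, transmission rate) are illustrative assumptions.

        import random
        import networkx as nx

        def simulate(n=1000, steps=200, compliance=0.5, n_super=20, seed=1):
            """Toy SEIRS on a small-world graph; returns how many nodes were ever infected."""
            random.seed(seed)
            G = nx.watts_strogatz_graph(n, k=6, p=0.05, seed=seed)  # local contacts
            # Superspreaders get extra random long-range ("global") links.
            for s in random.sample(range(n), n_super):
                for t in random.sample(range(n), 30):
                    if s != t:
                        G.add_edge(s, t)
            state = {v: "S" for v in G}           # S, E, I, R compartments
            timer = {v: 0 for v in G}             # time spent in current state
            for v in random.sample(range(n), 5):  # initial exposed seeds
                state[v] = "E"
            beta = 0.06 * (1.0 - compliance)      # compliance damps transmission
            ever_infected = set()
            for _ in range(steps):
                updates = {}
                for v in G:                       # infectious nodes expose neighbours
                    if state[v] == "I":
                        for u in G.neighbors(v):
                            if state[u] == "S" and random.random() < beta:
                                updates[u] = ("E", 0)
                for v in G:                       # timed state transitions
                    timer[v] += 1
                    if state[v] == "E" and timer[v] >= 3:     # incubation ends
                        updates[v] = ("I", 0)
                        ever_infected.add(v)
                    elif state[v] == "I" and timer[v] >= 7:   # recovery
                        updates[v] = ("R", 0)
                    elif state[v] == "R" and timer[v] >= 60:  # waning immunity (SEIRS)
                        updates[v] = ("S", 0)
                for v, (s_new, t_new) in updates.items():
                    state[v], timer[v] = s_new, t_new
            return len(ever_infected)

        for c in (0.0, 0.3, 0.6, 0.9):
            print(f"compliance={c:.1f}: {simulate(compliance=c)} nodes ever infected")

    Comparing runs with and without the extra superspreader links, across repeated seeds and compliance levels, mirrors the kind of experiment the abstract describes.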

    Randomized learning and generalization of fair and private classifiers: From PAC-Bayes to stability and differential privacy

    We address the problem of randomized learning and generalization of fair and private classifiers. On the one hand, we want to ensure that sensitive information does not unfairly influence the outcome of a classifier; on the other hand, we have to learn from data while preserving the privacy of individual observations. We first address this issue in the PAC-Bayes framework, presenting an approach that trades off and bounds the risk and the fairness of the randomized (Gibbs) classifier. Our new approach is able to handle several different state-of-the-art fairness measures. For this purpose, we further develop the idea that the PAC-Bayes prior can be defined based on the data-generating distribution without actually knowing it. In particular, we define a prior and a posterior that give more weight to functions with good generalization and fairness properties. Furthermore, we show that this randomized classifier possesses interesting stability properties, using the algorithmic distribution stability theory. Finally, we show that the new posterior can be exploited to define a randomized, accurate and fair algorithm. Differential privacy theory allows us to show that the latter algorithm has interesting privacy-preserving properties, ensuring our threefold goal of good generalization, fairness, and privacy of the final model.
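
    For orientation, the classical PAC-Bayes bound that this line of work builds on has roughly the following shape (a standard McAllester-style statement, not the paper's exact bound): with probability at least 1 - \delta over an i.i.d. sample of size n, simultaneously for all posteriors Q,

        R(Q) \le \hat{R}(Q) + \sqrt{ \frac{ \mathrm{KL}(Q \,\|\, P) + \ln(2\sqrt{n}/\delta) }{ 2n } },

    where R(Q) and \hat{R}(Q) are the true and empirical risks of the Gibbs classifier drawn from Q, and P is the prior. Schematically, a fairness-aware objective of the kind described above can then be thought of as minimising \hat{R}(Q) + \lambda \hat{F}(Q) plus the same complexity term, with \hat{F}(Q) an empirical fairness-violation measure and \lambda a trade-off weight; this penalised form and \lambda are illustrative assumptions here, not the paper's exact objective.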

    Sigmoid Neural Transfer Function Realised by Percolation

    An experiment using the phenomenon of percolation has been conducted to demonstrate the implementation of neural functionality (summing and a sigmoid transfer function). A simple analogue approximation to digital percolation is implemented. The device consists of a piece of amorphous silicon with stochastic bit-stream optical inputs, in which a current percolating from one end to the other defines the neuron output, also in the form of a stochastic bit stream. Preliminary experimental results are presented.
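
    As a purely numerical illustration of why percolation yields a sigmoid-like transfer function (a toy Monte Carlo sketch, not a model of the amorphous-silicon device): in site percolation, the probability that an occupied cluster spans the lattice rises sharply from near 0 to near 1 as the site-occupation probability (here standing in for the summed input bit-stream rates) crosses a threshold, tracing out a sigmoid-shaped curve.

        import numpy as np
        from scipy.ndimage import label

        def conducts(p, size=30, rng=None):
            """One trial: does an occupied cluster span the grid left to right?"""
            rng = rng if rng is not None else np.random.default_rng()
            grid = rng.random((size, size)) < p  # occupy each site with probability p
            labels, _ = label(grid)              # 4-connected occupied clusters
            spanning = set(labels[:, 0]) & set(labels[:, -1])
            return len(spanning - {0}) > 0       # ignore background label 0

        rng = np.random.default_rng(0)
        for p in np.linspace(0.45, 0.75, 7):
            rate = np.mean([conducts(p, rng=rng) for _ in range(200)])
            print(f"site occupation p={p:.2f} -> conduction rate {rate:.2f}")

    The conduction rate printed per occupation probability plays the role of the output bit-stream rate, and its sharp but smooth rise around the percolation threshold is the sigmoid-like transfer.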

    Tighter risk certificates for neural networks

    This paper presents an empirical study of training probabilistic neural networks with training objectives derived from PAC-Bayes bounds. In the context of probabilistic neural networks, the output of training is a probability distribution over network weights. We present two training objectives, used here for the first time in connection with training neural networks, both derived from tight PAC-Bayes bounds. We also re-implement a previously used training objective based on a classical PAC-Bayes bound, in order to compare the properties of the predictors learned using the different training objectives. We compute risk certificates for the learnt predictors, based on part of the data used to learn them. We further experiment with different types of priors on the weights (both data-free and data-dependent) and different neural network architectures. Our experiments on MNIST and CIFAR-10 show that our training methods produce competitive test set errors and non-vacuous risk bounds with much tighter values than previous results in the literature, showing promise not only for guiding the learning algorithm by bounding the risk but also for model selection. These observations suggest that the methods studied here might be good candidates for self-certified learning, in the sense of using the whole data set to learn a predictor and certify its risk on any unseen data (from the same distribution as the training data), potentially without the need to hold out test data.
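
    To illustrate how a numerical risk certificate can be computed from a PAC-Bayes bound, a common recipe in this line of work (sketched here under the standard kl-form bound; the paper's exact bounds may differ) is to invert the binary KL divergence: given the empirical risk of the randomized predictor and the bound's complexity budget, bisection finds the largest true risk consistent with the bound.

        import math

        def kl_binary(q, p):
            """Binary KL divergence kl(q || p), clipped away from 0 and 1."""
            eps = 1e-12
            q = min(max(q, eps), 1 - eps)
            p = min(max(p, eps), 1 - eps)
            return q * math.log(q / p) + (1 - q) * math.log((1 - q) / (1 - p))

        def kl_inverse_upper(emp_risk, budget, tol=1e-9):
            """Largest p with kl(emp_risk || p) <= budget, found by bisection."""
            lo, hi = emp_risk, 1.0
            while hi - lo > tol:
                mid = (lo + hi) / 2
                if kl_binary(emp_risk, mid) <= budget:
                    lo = mid
                else:
                    hi = mid
            return lo

        # Illustrative numbers only: empirical Gibbs risk 0.05, KL(Q||P) = 5000 nats,
        # n = 60000 training examples, confidence parameter delta = 0.025.
        n, delta, kl_qp, emp = 60_000, 0.025, 5_000.0, 0.05
        budget = (kl_qp + math.log(2 * math.sqrt(n) / delta)) / n
        print(f"risk certificate: {kl_inverse_upper(emp, budget):.4f}")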

    TrueLearn: A family of Bayesian algorithms to match lifelong learners to open educational resources

    The recent advances in computer-assisted learning systems and the availability of open educational resources today promise a pathway to providing cost-efficient, high-quality education to large masses of learners. One of the most ambitious use cases of computer-assisted learning is to build a lifelong learning recommendation system. Unlike short-term courses, lifelong learning presents unique challenges, requiring sophisticated recommendation models that account for a wide range of factors, such as the background knowledge of learners or the novelty of the material, while effectively maintaining the knowledge states of masses of learners for significantly longer periods of time (ideally, a lifetime). This work presents the foundations towards building a dynamic, scalable and transparent recommendation system for education, modelling learners' knowledge from implicit data in the form of engagement with open educational resources. We (i) use a text ontology based on Wikipedia to automatically extract knowledge components of educational resources and (ii) propose a set of online Bayesian strategies inspired by the well-known areas of item response theory and knowledge tracing. Our proposal, TrueLearn, focuses on recommendations for which the learner has enough background knowledge (so they are able to understand and learn from the material), while the material has enough novelty to help the learner improve their knowledge about the subject and keep them engaged. We further construct a large open educational video lectures dataset and test the performance of the proposed algorithms, which show clear promise towards building an effective educational recommendation system.
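
    As a rough illustration of the kind of online Bayesian update such a system might use (a simplified sketch inspired by item response theory, not TrueLearn's actual update rules): a learner's knowledge of one topic is kept as a Gaussian belief, each engagement event with a resource of a given difficulty is a binary observation under a logistic likelihood, and the belief is updated with an approximate Laplace-style step. All names and constants here are hypothetical.

        import math

        def sigmoid(x):
            return 1.0 / (1.0 + math.exp(-x))

        class TopicSkill:
            """Gaussian belief over a learner's knowledge of one topic (hypothetical)."""
            def __init__(self, mean=0.0, var=1.0):
                self.mean, self.var = mean, var

            def update(self, engaged, difficulty):
                """One engagement observation against a resource of given difficulty."""
                p = sigmoid(self.mean - difficulty)   # predicted P(engaged)
                y = 1.0 if engaged else 0.0
                # Move the mean along the log-likelihood gradient, scaled by the
                # variance, then shrink the variance (Laplace-style approximation).
                self.mean += self.var * (y - p)
                self.var *= 1.0 / (1.0 + self.var * p * (1.0 - p))

        skill = TopicSkill()
        for engaged, diff in [(True, -0.5), (True, 0.2), (False, 1.5), (True, 0.8)]:
            skill.update(engaged, diff)
            print(f"mean={skill.mean:+.3f}, var={skill.var:.3f}")

    Keeping one such belief per (learner, knowledge component) pair is what allows the knowledge states of many learners to be maintained online over long periods, as the abstract describes.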