Statistical classification techniques for photometric supernova typing
Future photometric supernova surveys will produce vastly more candidates than can be followed up spectroscopically, highlighting the need for effective classification methods based on light curves alone. Here we introduce boosting and kernel density estimation techniques which have minimal astrophysical input, and compare their performance on 20 000 simulated Dark Energy Survey light curves. We demonstrate that these methods perform very well provided a representative sample of the full population is used for training. Interestingly, we find that they do not require the redshift of the host galaxy or candidate supernova. However, training on the types of spectroscopic subsamples currently produced by supernova surveys leads to poor performance due to the resulting bias in training, and we recommend that special attention be given to the creation of representative training samples. We show that given a typical non-representative training sample, S, one can expect to pull out a representative subsample of about 10 per cent of the size of S, which is large enough to outperform the methods trained on all of
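As a toy illustration of the kernel-density side of this approach (not the authors' pipeline), the sketch below builds a per-class Gaussian kernel density estimate over simulated two-dimensional light-curve features and assigns a candidate to whichever class has the higher estimated density; the feature values, bandwidth, and class names are invented for the example:

```python
import numpy as np

def kde_density(x, samples, bandwidth=0.5):
    """Average of isotropic Gaussian kernels centred on each training sample."""
    sq = np.sum((samples - x) ** 2, axis=1)
    d = samples.shape[1]
    norm = (2 * np.pi * bandwidth ** 2) ** (d / 2)
    return np.mean(np.exp(-sq / (2 * bandwidth ** 2))) / norm

def kde_classify(x, class_samples):
    """Assign x to the class whose estimated density at x is highest."""
    labels = list(class_samples)
    densities = [kde_density(x, class_samples[c]) for c in labels]
    return labels[int(np.argmax(densities))]

rng = np.random.default_rng(0)
# Two hypothetical clusters of light-curve features ("Ia" vs "non-Ia").
train = {
    "Ia": rng.normal(loc=[0.0, 0.0], scale=0.5, size=(200, 2)),
    "non-Ia": rng.normal(loc=[3.0, 3.0], scale=0.5, size=(200, 2)),
}
print(kde_classify(np.array([0.2, -0.1]), train))  # → Ia
print(kde_classify(np.array([2.8, 3.1]), train))   # → non-Ia
```

The representativeness caveat from the abstract shows up directly here: if the "Ia" training sample were drawn from a biased region of feature space, the per-class densities, and hence the decision boundary, would be wrong even though the estimator itself is unchanged.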
Deep Neural Networks and Data for Automated Driving
This open access book brings together the latest developments from industry and research on automated driving and artificial intelligence. Environment perception for highly automated driving heavily employs deep neural networks, facing many challenges. How much data do we need for training and testing? How to use synthetic data to save labeling costs for training? How do we increase robustness and decrease memory usage? For inevitably poor conditions: How do we know that the network is uncertain about its decisions? Can we understand a bit more about what actually happens inside neural networks? This leads to a very practical problem particularly for DNNs employed in automated driving: What are useful validation techniques and what about safety? This book unites the views from both academia and industry, where computer vision and machine learning meet environment perception for highly automated driving. Naturally, aspects of data, robustness, uncertainty quantification, and, last but not least, safety are at the core of it. This book is unique: In its first part, an extended survey of all the relevant aspects is provided. The second part contains the detailed technical elaboration of the various questions mentioned above
Neural Models for Information Retrieval without Labeled Data
Recent developments of machine learning models, and in particular deep neural networks, have yielded significant improvements on several computer vision, natural language processing, and speech recognition tasks. Progress with information retrieval (IR) tasks has been slower, however, due to the lack of large-scale training data as well as neural network models specifically designed for effective information retrieval. In this dissertation, we address these two issues by introducing task-specific neural network architectures for a set of IR tasks and proposing novel unsupervised or weakly supervised solutions for training the models. The proposed learning solutions do not require labeled training data. Instead, in our weak supervision approach, neural models are trained on a large set of noisy and biased training data obtained from external resources, existing models, or heuristics.
We first introduce relevance-based embedding models that learn distributed representations for words and queries. We show that the learned representations can be effectively employed for a set of IR tasks, including query expansion, pseudo-relevance feedback, and query classification.
We further propose a standalone learning to rank model based on deep neural networks. Our model learns a sparse representation for queries and documents. This enables us to perform efficient retrieval by constructing an inverted index in the learned semantic space. Our model outperforms state-of-the-art retrieval models, while performing as efficiently as term matching retrieval models.
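The inverted-index idea can be sketched independently of any particular neural model. Assuming each document and query has already been mapped to a sparse vector over latent dimensions (the vectors below are made up), retrieval reduces to merging posting lists exactly as in term matching, except the "terms" are learned dimensions:

```python
from collections import defaultdict

# Hypothetical sparse semantic vectors: {latent_dim_id: weight}.
docs = {
    "d1": {3: 0.8, 17: 0.4},
    "d2": {3: 0.2, 42: 0.9},
    "d3": {17: 0.7, 42: 0.1},
}

# Build an inverted index over latent dimensions instead of vocabulary terms.
index = defaultdict(list)
for doc_id, vec in docs.items():
    for dim, weight in vec.items():
        index[dim].append((doc_id, weight))

def retrieve(query_vec, index):
    """Score documents by dot product, touching only the posting lists
    for dimensions the query actually activates."""
    scores = defaultdict(float)
    for dim, q_weight in query_vec.items():
        for doc_id, d_weight in index.get(dim, []):
            scores[doc_id] += q_weight * d_weight
    return sorted(scores.items(), key=lambda kv: -kv[1])

# Query active on dims 3 and 17: d1 scores 1.0, d3 0.35, d2 0.2.
print(retrieve({3: 1.0, 17: 0.5}, index))
```

The efficiency claim in the abstract corresponds to the inner loop here: cost scales with the number of non-zero dimensions and their posting lists, not with the collection size.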
We additionally propose a neural network framework for predicting the performance of a retrieval model for a given query. Inspired by existing query performance prediction models, our framework integrates several information sources, such as retrieval score distribution and term distribution in the top retrieved documents. This leads to state-of-the-art results for the performance prediction task on various standard collections.
We finally bridge the gap between retrieval and recommendation models, as the two key components in most information systems. Search and recommendation often share the same goal: helping people get the information they need at the right time. Therefore, joint modeling and optimization of search engines and recommender systems could potentially benefit both systems. In more detail, we introduce a retrieval model that is trained on user-item interaction data (e.g., recommendation data), with no need for query-document relevance information for training.
Our solutions and findings in this dissertation smooth the path towards learning efficient and effective models for various information retrieval and related tasks, especially when large-scale training data is not available
Practical Network Acceleration with Tiny Sets: Hypothesis, Theory, and Algorithm
Due to data privacy issues, accelerating networks with tiny training sets has
become a critical need in practice. Previous methods achieved promising results
empirically by filter-level pruning. In this paper, we both study this problem
theoretically and propose an effective algorithm aligning well with our
theoretical results. First, we propose the finetune convexity hypothesis to
explain why recent few-shot compression algorithms do not suffer from
overfitting problems. Based on it, a theory is further established to explain
these methods for the first time. Compared to naively finetuning a pruned
network, feature mimicking is proved to achieve a lower variance of parameters
and hence enjoys easier optimization. With our theoretical conclusions, we
claim dropping blocks is a fundamentally superior few-shot compression scheme
in terms of more convex optimization and a higher acceleration ratio. To choose
which blocks to drop, we propose a new metric, recoverability, to effectively
measure the difficulty of recovering the compressed network. Finally, we
propose an algorithm named PRACTISE to accelerate networks using only tiny
training sets. PRACTISE outperforms previous methods by a significant margin.
For 22% latency reduction, it surpasses previous methods by on average 7
percentage points on ImageNet-1k. It also works well under data-free or
out-of-domain data settings. Our code is at
https://github.com/DoctorKey/Practise
Comment: under review for TPAMI. arXiv admin note: substantial text overlap
with arXiv:2202.0786
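A minimal linear-algebra sketch of the feature-mimicking idea (not PRACTISE itself, and with a toy linear network in place of a real CNN): drop a middle "block" and fit the merged replacement layer by regressing onto the teacher's intermediate features. With linear blocks this is an exactly convex least-squares problem, which is the flavor of optimization the finetune convexity hypothesis argues for:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(256, 16))            # tiny unlabeled training set

# Hypothetical teacher: three stacked linear "blocks".
B1, B2, B3 = (rng.normal(size=(16, 16)) for _ in range(3))
teacher_feats = X @ B1 @ B2 @ B3          # features after the final block

# Drop the middle block; a single merged layer A stands in for B1 @ B2.
# Feature mimicking: fit A by regressing onto the teacher's intermediate
# features, a convex least-squares problem with a closed-form solution.
target = X @ B1 @ B2
A, *_ = np.linalg.lstsq(X, target, rcond=None)

student_feats = X @ A @ B3
err = np.max(np.abs(student_feats - teacher_feats))
print(f"max feature-mimicking error: {err:.2e}")
```

In the linear toy the teacher's features are recovered essentially exactly; with nonlinear blocks the regression becomes an approximation, which is where the recoverability metric for choosing which block to drop comes in.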
Intra-Camera Supervised Person Re-Identification
Existing person re-identification (re-id) methods mostly exploit a large set of cross-camera identity labelled training data. This requires a tedious data collection and annotation process, leading to poor scalability in practical re-id applications. On the other hand, unsupervised re-id methods do not need identity labels, but they usually suffer from much inferior model performance. To overcome these fundamental limitations, we propose a novel person re-identification paradigm based on the idea of independent per-camera identity annotation. This eliminates the most time-consuming and tedious inter-camera identity labelling process, significantly reducing the amount of human annotation effort. Consequently, it gives rise to a more scalable and more feasible setting, which we call Intra-Camera Supervised (ICS) person re-id, for which we formulate a Multi-tAsk mulTi-labEl (MATE) deep learning method. Specifically, MATE is designed for self-discovering the cross-camera identity correspondence in a per-camera multi-task inference framework. Extensive experiments demonstrate the superior cost-effectiveness of our method over alternative approaches on three large person re-id datasets. For example, MATE yields 88.7% rank-1 score on Market-1501 in the proposed ICS person re-id setting, significantly outperforming unsupervised learning models and closely approaching conventional fully supervised learning competitors
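The core correspondence-discovery step can be sketched outside of the MATE architecture. Under the ICS assumption, each camera has correct identity labels but the mapping between cameras is unknown; a deliberately simplified stand-in for the paper's multi-task inference is to match each camera's identity centroids to the other camera's by cosine similarity (all embeddings below are synthetic):

```python
import numpy as np

rng = np.random.default_rng(1)
# Synthetic embeddings: 4 identities, 2 cameras, per-camera labels only.
true_ids = rng.integers(0, 4, size=200)
feats = rng.normal(size=(4, 8))[true_ids] + 0.1 * rng.normal(size=(200, 8))
cam = rng.integers(0, 2, size=200)        # which camera saw each image

def centroids(c):
    """Mean embedding of each identity as labelled within camera c."""
    ids = np.unique(true_ids[cam == c])
    cen = np.stack([feats[(cam == c) & (true_ids == i)].mean(0) for i in ids])
    return ids, cen

ids_a, cen_a = centroids(0)
ids_b, cen_b = centroids(1)

# Self-discover the cross-camera correspondence: pair each camera-0
# identity with its most cosine-similar camera-1 identity centroid.
sim = cen_a @ cen_b.T / (
    np.linalg.norm(cen_a, axis=1, keepdims=True) * np.linalg.norm(cen_b, axis=1))
match = ids_b[sim.argmax(axis=1)]
print(dict(zip(ids_a.tolist(), match.tolist())))  # recovered correspondence
```

With well-separated synthetic clusters the nearest-centroid rule recovers the correspondence exactly; the paper's contribution is making this discovery robust when the embedding itself must be learned jointly from the per-camera tasks.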
Delivering Better Housing and Employment Outcomes for Offenders on Probation
The report ‘Delivering better housing and employment outcomes for offenders on probation’ presents the findings of qualitative research which included fieldwork in six probation areas with professionals involved in the delivery of housing, employment, training and education
Machine Learning Models that Remember Too Much
Machine learning (ML) is becoming a commodity. Numerous ML frameworks and
services are available to data holders who are not ML experts but want to train
predictive models on their data. It is important that ML models trained on
sensitive inputs (e.g., personal images or documents) not leak too much
information about the training data.
We consider a malicious ML provider who supplies model-training code to the
data holder, does not observe the training, but then obtains white- or
black-box access to the resulting model. In this setting, we design and
implement practical algorithms, some of them very similar to standard ML
techniques such as regularization and data augmentation, that "memorize"
information about the training dataset in the model, while the model remains as
accurate and predictive as a conventionally trained model. We then explain how
the adversary can extract memorized information from the model.
We evaluate our techniques on standard ML tasks for image classification
(CIFAR10), face recognition (LFW and FaceScrub), and text analysis (20
Newsgroups and IMDB). In all cases, we show how our algorithms create models
that have high predictive power yet allow accurate extraction of subsets of
their training data
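One of the simplest channels of this kind, described in the paper as LSB encoding, can be sketched in a few lines: after ordinary training, a malicious provider overwrites the low-order mantissa bits of each float64 parameter with secret data, leaving predictions essentially unchanged while permitting exact white-box extraction. The toy model and payload below are invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# "Train" an ordinary least-squares model on benign data.
X = rng.normal(size=(200, 8))
y = X @ rng.normal(size=8) + 0.01 * rng.normal(size=200)
w, *_ = np.linalg.lstsq(X, y, rcond=None)

# Secret training data the provider wants to exfiltrate, chopped into
# 16-bit chunks, one chunk per model parameter.
payload = rng.integers(0, 2 ** 16, size=w.size).astype(np.uint64)

# LSB encoding: overwrite the 16 least significant mantissa bits of each
# float64 parameter. The value shifts by at most ~2^-36 relative, so the
# model's predictions are effectively unchanged.
mask = np.uint64(0xFFFF)
raw = w.view(np.uint64).copy()
leaky = ((raw & ~mask) | payload).view(np.float64)

extracted = leaky.view(np.uint64) & mask       # white-box extraction
print(bool((extracted == payload).all()))      # → True
print(float(np.max(np.abs(leaky - w))))        # tiny parameter drift
```

With 16 bits per parameter, even a modestly sized network hides kilobytes to megabytes of training data, which is why the abstract treats white-box access to an outsourced-training model as a serious leak.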