Search CORE

3,346 research outputs found

On statistics, computation and scalability

Author: Jordan Michael I.
Publication venue: 'Bernoulli Society for Mathematical Statistics and Probability'
Publication date: 30/09/2013
Field of study

How should statistical procedures be designed so as to be scalable computationally to the massive datasets that are increasingly the norm? When coupled with the requirement that an answer to an inferential question be delivered within a certain time budget, this question has significant repercussions for the field of statistics. With the goal of identifying "time-data tradeoffs," we investigate some of the statistical consequences of computational perspectives on scability, in particular divide-and-conquer methodology and hierarchies of convex relaxations.Comment: Published in at http://dx.doi.org/10.3150/12-BEJSP17 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm

arXiv.org e-Print Archive

CiteSeerX

Curriculum Guidelines for Undergraduate Programs in Data Science

Author: Agarwal Mahesh
Averett Maia
Baumer Benjamin
Bray Andrew
Bressoud Thomas
Bryant Lance
Cheng Lei
De Veaux Richard
Francis Amanda
Gould Robert
Kim Albert Y.
Kretchmar Matt
Lu Qin
Moskol Ann
Nolan Deborah
Pelayo Roberto
Raleigh Sean
Sethi Ricky J.
Sondjaja Mutiara
Tiruviluamala Neelesh
Uhlig Paul
Washington Talitha
Wesley Curtis
White David
Ye Ping
Publication venue: 'Annual Reviews'
Publication date: 01/01/2017
Field of study

The Park City Math Institute (PCMI) 2016 Summer Undergraduate Faculty Program met for the purpose of composing guidelines for undergraduate programs in Data Science. The group consisted of 25 undergraduate faculty from a variety of institutions in the U.S., primarily from the disciplines of mathematics, statistics and computer science. These guidelines are meant to provide some structure for institutions planning for or revising a major in Data Science

arXiv.org e-Print Archive

Smith College: Smith ScholarWorks

Jorge A. Swieca's contributions to quantum field theory in the 60s and 70s and their relevance in present research

Author: B. Bakalov
B. Bakalov
B. Schroer
B. Schroer
B. Schroer
B. Schroer
B. Schroer
B. Schroer
D. Buchholz
D. Buchholz
D. Buchholz
D. Buchholz
D. Buchholz
D. Kastler
D. Yenni
D.C. Brydges
E. Abdalla
E. Marino
E. Witten
F. Bloch
F. Coester
G. Lechner
G.C. Marques
H. Babujian
H. Babujian
H. Epstein
H. Ezawa
H.J. Borchers
J. Goldstone
J. Maldacena
J. Mund
J. Mund
J. Schwinger
J.A. Swieca
J.A. Swieca
J.A. Swieca
J.A. Swieca
J.D. Dollard
J.E. Lowenstein
J.H. Lowenstein
K.-H. Rehren
K.-H. Rehren
L.V. Belvedere
M. Dütsch
M. Hortacsu
M. Karowski
M. Karowski
M. Karowski
N.D. Mermin
P. Di Vecchia
P.D. Hislop
P.W. Anderson
P.W. Higgs
R. Brout
R. Brunetti
R. Haag
R. Koberle
R. Koberle
R.F. Dashen
R.J. Jost
S. Coleman
V. Kurak
V. Kurak
W. Heisenberg
W.H. Furry
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/05/2010
Field of study

After revisiting some high points of particle physics and QFT of the two decades from 1960 to 1980, I comment on the work by Jorge Andre Swieca. I explain how it fits into the quantum field theory during these two decades and draw attention to its relevance to the ongoing particle physics research. A particular aim of this article is to direct thr readers mindfulness to the relevance of what at the time of Swieca was called "the Schwinger Higgs screening mechanism". which, together with recent ideas which generalize the concept of gauge theories, has all the ingredients to revolutionize the issue of gauge theories and the standard model.Comment: 49 pages, expansion and actualization of text, improvement of formulations and addition of many references to be published in EPJH - Historical Perspectives on Contemporary Physic

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Benchmarking in cluster analysis: A white paper

Author: Boulesteix Anne-Laure
Dangl Rainer
Dean Nema
Guyon Isabelle
Hennig Christian
Leisch Friedrich
Steinley Douglas
Van Mechelen Iven
Publication venue
Publication date: 01/10/2018
Field of study

To achieve scientific progress in terms of building a cumulative body of knowledge, careful attention to benchmarking is of the utmost importance. This means that proposals of new methods of data pre-processing, new data-analytic techniques, and new methods of output post-processing, should be extensively and carefully compared with existing alternatives, and that existing methods should be subjected to neutral comparison studies. To date, benchmarking and recommendations for benchmarking have been frequently seen in the context of supervised learning. Unfortunately, there has been a dearth of guidelines for benchmarking in an unsupervised setting, with the area of clustering as an important subdomain. To address this problem, discussion is given to the theoretical conceptual underpinnings of benchmarking in the field of cluster analysis by means of simulated as well as empirical data. Subsequently, the practicalities of how to address benchmarking questions in clustering are dealt with, and foundational recommendations are made

arXiv.org e-Print Archive

Proceedings - University of Groningen

ARTS repository - University of Groningen

Enlighten

Dissertations of the University of Groningen

Unsupervised Deep Hashing for Large-scale Visual Search

Author: Feng Xiaoyi
Hadid Abdenour
Peng Jinye
Xia Zhaoqiang
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 31/01/2016
Field of study

Learning based hashing plays a pivotal role in large-scale visual search. However, most existing hashing algorithms tend to learn shallow models that do not seek representative binary codes. In this paper, we propose a novel hashing approach based on unsupervised deep learning to hierarchically transform features into hash codes. Within the heterogeneous deep hashing framework, the autoencoder layers with specific constraints are considered to model the nonlinear mapping between features and binary codes. Then, a Restricted Boltzmann Machine (RBM) layer with constraints is utilized to reduce the dimension in the hamming space. Extensive experiments on the problem of visual search demonstrate the competitiveness of our proposed approach compared to state-of-the-art

arXiv.org e-Print Archive

Crossref