Trading query complexity for sample-based testing and multi-testing scalability

Author: Fischer E.
Lachish Oded
Vasudev Y.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015

We show that every non-adaptive property testing algorithm making a constant number of queries, over a fixed alphabet, can be converted to a sample-based (as per [Goldreich and Ron, 2015]) testing algorithm whose average number of queries is a fixed, smaller than $1$, power of $n$. Since the query distribution of the sample-based algorithm is not dependent at all on the property, or the original algorithm, this has many implications in scenarios where there are many properties that need to be tested for concurrently, such as testing (relatively large) unions of properties, or converting a Merlin-Arthur Proximity proof (as per [Gur and Rothblum, 2013]) to a proper testing algorithm. The proof method involves preparing the original testing algorithm for a combinatorial analysis. For the analysis we develop a structural lemma for hypergraphs that may be of independent interest. When analyzing a hypergraph that was extracted from a $2$-sided test, it allows for finding generalized sunflowers that provide for a large-deviation type analysis. For $1$-sided tests the bounds can be improved further by applying Janson's inequality directly over our structures.
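The point that the query distribution does not depend on the property can be illustrated with a toy sketch. This is not the conversion described in the abstract: it only shows the sample-based setting, where one property-independent sample of expected size $n^{\alpha}$ (with $\alpha < 1$) is drawn once and reused for several properties. The function names, the two toy properties, and the choice $\alpha = 0.5$ are all assumptions made for illustration.

```python
import random

def draw_sample(n, alpha, rng):
    """Pick each index independently with probability n**(alpha - 1),
    so the expected sample size is n**alpha (alpha < 1)."""
    p = n ** (alpha - 1)
    return [i for i in range(n) if rng.random() < p]

def run_sample_based_tests(x, alpha, checkers, rng):
    """Draw ONE sample, independent of any property, and reuse it
    for every checker. Returns (verdicts, number of positions read)."""
    indices = draw_sample(len(x), alpha, rng)
    revealed = {i: x[i] for i in indices}
    verdicts = {name: check(revealed) for name, check in checkers.items()}
    return verdicts, len(indices)

# Hypothetical toy properties: each checker accepts unless the sample
# contains a witness against the property (a one-sided style check).
def all_zeros(revealed):
    return all(v == 0 for v in revealed.values())

def all_ones(revealed):
    return all(v == 1 for v in revealed.values())

CHECKERS = {"all_zeros": all_zeros, "all_ones": all_ones}

if __name__ == "__main__":
    rng = random.Random(0)
    x = [0] * 10_000
    verdicts, used = run_sample_based_tests(x, 0.5, CHECKERS, rng)
    print(verdicts, "positions read:", used)
```

Because the sample is drawn before any checker is consulted, adding more properties to `CHECKERS` does not increase the number of positions read.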

arXiv.org e-Print Archive

Crossref

Birkbeck Institutional Research Online

Towards a Scalable Dynamic Spatial Database System

Author: Diaconu Raluca
Keller Joaquín
Valero Mathieu
Publication date: 16/11/2012

With the rise of GPS-enabled smartphones and similar mobile devices, massive amounts of location data are available. However, no scalable solutions for soft real-time spatial queries on large sets of moving objects have yet emerged. In this paper we explore and measure the limits of current algorithms and implementations in different application scenarios. Finally, we propose a novel distributed architecture to solve the scalability issues.

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Hal-Diderot

Jeeva: Enterprise Grid-enabled Web Portal for Protein Secondary Structure Prediction

Author: Buyya Rajkumar
Gubbi Jayavardhana
Jin Chao
Palaniswami Marimuthu
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2008

This paper presents a Grid portal for protein secondary structure prediction developed by using services of Aneka, a .NET-based enterprise Grid technology. The portal is used by research scientists to discover new prediction structures in a parallel manner. An SVM (Support Vector Machine)-based prediction algorithm is used with 64 sample protein sequences as a case study to demonstrate the potential of enterprise Grids. Comment: 7 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

University of Melbourne Institutional Repository

Challenges for the comprehensive management of cloud services in a PaaS framework

Author: Andrikopoulos Vasilios
Biro József
García-Gómez Sergio
Jiménez Gañán Miguel
Junker Frederic
Menychtas Andreas
Momm Christof
Strauch Steve
Taher Yehia
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2012

The 4CaaSt project aims at developing a PaaS framework that enables flexible definition, marketing, deployment and management of Cloud-based services and applications. The major innovations proposed by 4CaaSt are the blueprint and its lifecycle management, a one-stop shop for Cloud services, and PaaS-level resource management featuring elasticity. 4CaaSt also provides a portfolio of ready-to-use Cloud-native services and Cloud-aware immigrant technologies.

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM

Tilburg University Repository

String and Membrane Gaussian Processes

Author: Roberts Stephen
Samo Yves-Laurent Kom
Publication date: 01/01/2016

In this paper we introduce a novel framework for making exact nonparametric Bayesian inference on latent functions that is particularly suitable for Big Data tasks. Firstly, we introduce a class of stochastic processes we refer to as string Gaussian processes (string GPs), which are not to be mistaken for Gaussian processes operating on text. We construct string GPs so that their finite-dimensional marginals exhibit suitable local conditional independence structures, which allow for scalable, distributed, and flexible nonparametric Bayesian inference, without resorting to approximations, and while ensuring some mild global regularity constraints. Furthermore, string GP priors naturally cope with heterogeneous input data, and the gradient of the learned latent function is readily available for explanatory analysis. Secondly, we provide some theoretical results relating our approach to the standard GP paradigm. In particular, we prove that some string GPs are Gaussian processes, which provides a complementary global perspective on our framework. Finally, we derive a scalable and distributed MCMC scheme for supervised learning tasks under string GP priors. The proposed MCMC scheme has computational time complexity $\mathcal{O}(N)$ and memory requirement $\mathcal{O}(dN)$, where $N$ is the data size and $d$ the dimension of the input space. We illustrate the efficacy of the proposed approach on several synthetic and real-world datasets, including a dataset with $6$ million input points and $8$ attributes. Comment: To appear in the Journal of Machine Learning Research (JMLR), Volume 1

arXiv.org e-Print Archive

Oxford University Research Archive

The state of peer-to-peer network simulators

Author: Agosti M.
Anirban Basu
Annapureddy S.
Barcellos M.
Baumgart I.
Boufkhad Y.
Cheng B.
Clarke I.
Dabek F.
de Vogeleer K.
Doulkeridis C.
Ghinita G.
Haridasan M.
Huebsch R. J.
Ian Wakeman
Iliofotou M.
James Stanier
Johansson B.
Leonini L.
Likert R.
Mavlankar A.
Maymounkov P.
Naicken S.
Ren D.
Rosenberg J.
Rowstron A. I. T.
Simon Fleming
Stephen Naicken
Stingl D.
Urban P.
Vijay K. Gurbani
Wang S.
Webb S.
Zantout B.
Zhang D.
Zhao B.
Zhou Y.
Zhu W.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/08/2013

Networking research often relies on simulation to test and evaluate new ideas. An important requirement of this process is that results must be reproducible so that other researchers can replicate, validate and extend existing work. We look at the landscape of simulators for research in peer-to-peer (P2P) networks by conducting a survey of a combined total of over 280 papers from before and after 2007 (the year of the last survey in this area), and comment on the large quantity of research using bespoke, closed-source simulators. We propose a set of criteria that P2P simulators should meet, and poll the P2P research community for their agreement. We aim to drive the community towards performing their experiments on simulators that allow others to validate their results.

Crossref

Sussex Research Online

Improving the Scalability of DPWS-Based Networked Infrastructures

Author: Campos Filipe
Pereira José
Publication date: 31/07/2014

The Devices Profile for Web Services (DPWS) specification enables seamless discovery, configuration, and interoperability of networked devices in various settings, ranging from home automation and multimedia to manufacturing equipment and data centers. Unfortunately, the sheer simplicity of its event notification mechanisms, which makes it fit for resource-constrained devices, also makes it hard to scale to large infrastructures with more stringent dependability requirements, which is, ironically, where self-configuration would be most useful. In this report, we address this challenge with a proposal to integrate gossip-based dissemination in DPWS, thus maintaining compatibility with the original assumptions of the specification, and avoiding a centralized configuration server or custom black-box middleware components. In detail, we show how our approach provides an evolutionary and non-intrusive solution to the scalability limitations of DPWS and experimentally evaluate it with an implementation based on the Web Services for Devices (WS4D) Java Multi Edition DPWS Stack (JMEDS). Comment: 28 pages, Technical Repor
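As a rough illustration of why gossip-based dissemination scales, the sketch below simulates a generic push-gossip protocol: in each round, every node that already holds a notification forwards it to a few peers chosen uniformly at random. It is not the protocol from the report and uses no DPWS or WS4D JMEDS API; the function name, the fanout of 3, and the 1,000-node network are assumptions made for illustration.

```python
import random

def push_gossip(n, fanout, rng, max_rounds=1000):
    """Simulate push gossip among n nodes, starting from node 0.

    Each round, every informed node sends the notification to `fanout`
    peers picked uniformly at random (a peer may be picked twice, and a
    node may pick itself). Returns (rounds used, nodes informed).
    """
    informed = {0}
    rounds = 0
    while len(informed) < n and rounds < max_rounds:
        newly_reached = set()
        for _sender in informed:
            for _ in range(fanout):
                newly_reached.add(rng.randrange(n))
        informed |= newly_reached
        rounds += 1
    return rounds, len(informed)

if __name__ == "__main__":
    rounds, reached = push_gossip(n=1000, fanout=3, rng=random.Random(1))
    print(f"{reached} nodes informed after {rounds} rounds")
```

No node needs a global view or a central server: the number of rounds grows roughly logarithmically with the number of nodes, while each node sends only `fanout` messages per round.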

arXiv.org e-Print Archive

CiteSeerX

Analysis of current middleware used in peer-to-peer and grid implementations for enhancement by catallactic mechanisms

Author: Chacin Pablo
Chao Isaac
Freitag Felix

This deliverable describes the work done in task 3.1, Middleware analysis: Analysis of current middleware used in peer-to-peer and grid implementations for enhancement by catallactic mechanisms, from work package 3, Middleware Implementation. The document is divided into four parts: an introduction with application scenarios and middleware requirements, the Catnets middleware architecture, an evaluation of existing middleware toolkits, and conclusions. -- This work defines requirements for grid and peer-to-peer middleware architectures and analyzes their suitability for a prototype implementation of catallaxy. A middleware architecture for implementing catallaxy in application-layer networks is presented. Grid Computing

Research Papers in Economics