Search CORE

8,126 research outputs found

TAPER: query-aware, partition-enhancement for large, heterogenous, graphs

Author: Firth Hugo
Missier Paolo
Publication venue
Publication date: 23/06/2016
Field of study

Graph partitioning has long been seen as a viable approach to address Graph DBMS scalability. A partitioning, however, may introduce extra query processing latency unless it is sensitive to a specific query workload, and optimised to minimise inter-partition traversals for that workload. Additionally, it should also be possible to incrementally adjust the partitioning in reaction to changes in the graph topology, the query workload, or both. Because of their complexity, current partitioning algorithms fall short of one or both of these requirements, as they are designed for offline use and as one-off operations. The TAPER system aims to address both requirements, whilst leveraging existing partitioning algorithms. TAPER takes any given initial partitioning as a starting point, and iteratively adjusts it by swapping chosen vertices across partitions, heuristically reducing the probability of inter-partition traversals for a given pattern matching queries workload. Iterations are inexpensive thanks to time and space optimisations in the underlying support data structures. We evaluate TAPER on two different large test graphs and over realistic query workloads. Our results indicate that, given a hash-based partitioning, TAPER reduces the number of inter-partition traversals by around 80%; given an unweighted METIS partitioning, by around 30%. These reductions are achieved within 8 iterations and with the additional advantage of being workload-aware and usable online.Comment: 12 pages, 11 figures, unpublishe

arXiv.org e-Print Archive

University of Birmingham Research Portal

Newcastle University E-Prints

Can social microblogging be used to forecast intraday exchange rates?

Author: Papaioannnou Panagiotis
Papaioannou George
Russo Lucia
Siettos Constantinos
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

The Efficient Market Hypothesis (EMH) is widely accepted to hold true under certain assumptions. One of its implications is that the prediction of stock prices at least in the short run cannot outperform the random walk model. Yet, recently many studies stressing the psychological and social dimension of financial behavior have challenged the validity of the EMH. Towards this aim, over the last few years, internet-based communication platforms and search engines have been used to extract early indicators of social and economic trends. Here, we used Twitter's social networking platform to model and forecast the EUR/USD exchange rate in a high-frequency intradaily trading scale. Using time series and trading simulations analysis, we provide some evidence that the information provided in social microblogging platforms such as Twitter can in certain cases enhance the forecasting efficiency regarding the very short (intradaily) forex.Comment: This is a prior version of the paper published at NETNOMICS. The final publication is available at http://www.springer.com/economics/economic+theory/journal/1106

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II

Recommended from our members

HARD: Hybrid Adaptive Resource Discovery for Jungle Computing

Author: Aguiar R. L.
Barraca J. P.
Zarrin J.
Publication venue: 'Elsevier BV'
Publication date: 15/07/2017
Field of study

In recent years, Jungle Computing has emerged as a distributed computing paradigm based on simultaneous combination of various hierarchical and distributed computing environments which are composed by large number of heterogeneous resources. In such a computing environment, the resources and the underlying computation and communication infrastructures are highly-hierarchical and heterogeneous. This creates a lot of difficulty and complexity for finding the proper resources in a precise way in order to run a particular job on the system efficiently. This paper proposes Hybrid Adaptive Resource Discovery (HARD), a novel efficient and highly scalable resource-discovery approach which is built upon a virtual hierarchical overlay based on self-organization and self-adaptation of processing resources in the system, where the computing resources are organized into distributed hierarchies according to a proposed hierarchical multi-layered resource description model. The proposed approach supports distributed query processing within and across hierarchical layers by deploying various distributed resource discovery services and functionalities in the system which are implemented using different adapted algorithms and mechanisms in each level of hierarchy. The proposed approach addresses the requirements for resource discovery in Jungle Computing environments such as high-hierarchy, high-heterogeneity, high-scalability and dynamicity. Simulation results show significant scalability and efficiency of the proposed approach over highly heterogeneous, hierarchical and dynamic computing environments

City Research Online

Repositório Institucional da Universidade de Aveiro

Anglia Ruskin Research

Cycle-Consistent Deep Generative Hashing for Cross-Modal Retrieval

Author: Shao Ling
Wang Yang
Wu Lin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 28/10/2018
Field of study

In this paper, we propose a novel deep generative approach to cross-modal retrieval to learn hash functions in the absence of paired training samples through the cycle consistency loss. Our proposed approach employs adversarial training scheme to lean a couple of hash functions enabling translation between modalities while assuming the underlying semantic relationship. To induce the hash codes with semantics to the input-output pair, cycle consistency loss is further proposed upon the adversarial training to strengthen the correlations between inputs and corresponding outputs. Our approach is generative to learn hash functions such that the learned hash codes can maximally correlate each input-output correspondence, meanwhile can also regenerate the inputs so as to minimize the information loss. The learning to hash embedding is thus performed to jointly optimize the parameters of the hash functions across modalities as well as the associated generative models. Extensive experiments on a variety of large-scale cross-modal data sets demonstrate that our proposed method achieves better retrieval results than the state-of-the-arts.Comment: To appeared on IEEE Trans. Image Processing. arXiv admin note: text overlap with arXiv:1703.10593 by other author

arXiv.org e-Print Archive

University of Queensland eSpace

GRIDKIT: Pluggable overlay networks for Grid computing

Author: A. El-Sayed
A. Grimshaw
A. Rowstron
B. Li
F. Dabek
F. Kon
G. Coulson
H. Balakrishnan
K. Czajkowski
L. Mathy
M. Castro
M. Clark
N. Furmento
N. Parlavantzas
P. Grace
S. Floyd
S. Pallickara
Publication venue: SPRINGER-VERLAG BERLIN
Publication date: 01/01/2004
Field of study

A `second generation' approach to the provision of Grid middleware is now emerging which is built on service-oriented architecture and web services standards and technologies. However, advanced Grid applications have significant demands that are not addressed by present-day web services platforms. As one prime example, current platforms do not support the rich diversity of communication `interaction types' that are demanded by advanced applications (e.g. publish-subscribe, media streaming, peer-to-peer interaction). In the paper we describe the Gridkit middleware which augments the basic service-oriented architecture to address this particular deficiency. We particularly focus on the communications infrastructure support required to support multiple interaction types in a unified, principled and extensible manner-which we present in terms of the novel concept of pluggable overlay networks

CiteSeerX

Crossref

Lancaster E-Prints