Search CORE

15 research outputs found

MLGWSC-1: The first Machine Learning Gravitational-Wave Search Mock Data Challenge

Author: Brügmann B.
Cao Z.
Cuoco E.
Guo Z.
Huerta E.
Iosif P.
Klimenko S.
Koloniari A.
Messenger C.
Mishra T.
Nitz A.
Nousi P.
Ohme F.
Passalis N.
Ren Z.
Salemi F.
Schäfer M.
Stergioulas N.
Tefas A.
Vedovato G.
Wang H.
Wu S.
Zelenka O.
Publication venue
Publication date
Field of study

We present the results of the first Machine Learning Gravitational-Wave Search Mock Data Challenge (MLGWSC-1). For this challenge, participating groups had to identify gravitational-wave signals from binary black hole mergers of increasing complexity and duration embedded in progressively more realistic noise. The final of the 4 provided datasets contained real noise from the O3a observing run and signals up to a duration of 20 seconds with the inclusion of precession effects and higher order modes. We present the average sensitivity distance and runtime for the 6 entered algorithms derived from 1 month of test data unknown to the participants prior to submission. Of these, 4 are machine learning algorithms. We find that the best machine learning based algorithms are able to achieve up to 95% of the sensitive distance of matched-filtering based production analyses for simulated Gaussian noise at a false-alarm rate (FAR) of one per month. In contrast, for real noise, the leading machine learning search achieved 70%. For higher FARs the differences in sensitive distance shrink to the point where select machine learning submissions outperform traditional search algorithms at FARs

\geq 200

per month on some datasets. Our results show that current machine learning search algorithms may already be sensitive enough in limited parameter regions to be useful for some production settings. To improve the state-of-the-art, machine learning algorithms need to reduce the false-alarm rates at which they are capable of detecting signals and extend their validity to regions of parameter space where modeled searches are computationally expensive to run. Based on our findings we compile a list of research areas that we believe are the most important to elevate machine learning searches to an invaluable tool in gravitational-wave signal detection

MPG.PuRe

The graph tempo framework for exploring the evolution of a graph through pattern aggregation

Author: Koloniari G.
Pitoura E.
Triantafillou Peter
Tsoukanara E.
Publication venue: IEEE Computer Society
Publication date: 01/01/2024
Field of study

Warwick Research Archives Portal Repository

One is enough: distributed filtering for duplicate elimination

Author: Koloniari G.
Ntarmos N.
Pitoura E.
Souravlias D.
Publication venue
Publication date: 01/01/2011
Field of study

The growth of online services has created the need for duplicate elimination in high-volume streams of events. The sheer volume of data in applications such as pay-per-click clickstream processing, RSS feed syndication and notiﬁcation services in social sites such Twitter and Facebook makes traditional centralized solutions hard to scale. In this paper, we propose an approach based on distributed ﬁltering. To this end, we introduce a suite of distributed Bloom ﬁlters that exploit diﬀerent ways of partitioning the event space. To address the continuous nature of event delivery, the ﬁlters are extended to support sliding window semantics. Moreover, we examine locality-related tradeoﬀs and propose a tree-based architecture to allow for duplicate elimination across geographic locations. We cast the design space and present experimental results that demonstrate the pros and cons of our various solutions in diﬀerent settings

Enlighten

Design of PeerSum: A Summary Service for P2P Applications

Author: A. Chakravarti
A. Oser
A. Voglozin
G. Koloniari
G. Raschia
L. Zadeh
T. Ozsu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Crossref

Retrieving Arbitrary XML Fragments from Structured Peer-to-Peer Networks

Author: A. Rowstron
B. Catania
C. Sartiani
G. Koloniari
M. Yoshikawa
W.-X. Rao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Crossref

Robust and Efficient Large-Large Table Outer Joins on Distributed Infrastructures

Author: C. Galindo-Legaria
C. Kim
D. Kossmann
D. Witt De
G. Bhargava
G. Hill
G. Koloniari
P.Å. Larson
Publication venue: Springer Verlag
Publication date: 01/01/2014
Field of study

Outer joins are ubiquitous in many workloads but are sensitive to load-balancing problems. Current approaches mitigate such problems caused by data skew by using (partial) replication. However, contemporary replication-based approaches (1) introduce overhead, since they usually result in redundant data movement, (2) are sensitive to parameter tuning and value of data skew and (3) typically require that one side is small. In this paper, we propose a novel parallel algorithm, Redistribution and Efficient Query with Counters (REQC), aimed at robustness in terms of size of join sides, variation in skew and parameter tuning. Experimental results demonstrate that our algorithm is faster, more robust and less demanding in terms of network bandwidth, compared to the state-of-the-art

Durham Research Online

Crossref

Abstract. Non-trivial search predicates beyond mere equality are at the current focus of P2P research. Structured queries, as an important type of non-trivial search, have been studied extensively mainly for unstructured P2P systems so far. As unstructured P2P systems do not use indexing, structured queries are very easy to implement since they can be treated equally to any other type of query. However, this comes at the expense of very high bandwidth consumption and limitations in terms of guarantees and expressiveness that can be provided. Structured P2P systems are an efficient alternative as they typically offer logarithmic search complexity in the number of peers. Though the use of a distributed index (typically a distributed hash table) makes the implementation of structured queries more efficient, it also introduces considerable complexity, and thus only a few approaches exist so far. In this paper we present a first solution for efficiently supporting structured queries, more specifically, XPath queries, in structured P2P systems. For the moment we focus on supporting queries with descendant axes (“//”) and wildcards (“*”) and do not address joins. The results presented in this paper provide foundational basic functionalities to be used by higher-level query engines for more efficient, complex query support.

Infoscience - École polytechnique fédérale de Lausanne

CiteSeerX

Crossref

Sharable file searching in unstructured peer-to-peer systems

Author: AB Templeman
CE Shannon
ET Joynes
G Koloniari
J Risson
K Li
K Li
K Li
K Yang
Masaru Kitsuregawa
Wanlei Zhou
Wenyu Qu
Y Tao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2010
Field of study

The existing sharable file searching methods have at least one of the following disadvantages: (1) they are applicable only to certain topology patterns, (2) suffer single point failure, or (3) incur prohibitive maintenance cost. These drawbacks prevent their effective application in unstructured Peer-to-peer (P2P) systems (where the system topologies are changed time to time due to peers\u27 frequently entering and leaving the systems), despite the considerable success of sharing file search in conventional peer-to-peer systems. Motivated by this, we develop several fully dynamic algorithms for searching sharing files in unstructured peer to peer systems. Our solutions can handle any topology pattern with small search time and computational overhead. We also present an in-depth analysis that provides valuable insight into the characteristics of alternative effective search strategies and leads to precision guarantees. Extensive experiments validate our theoretical findings and demonstrate the efficiency of our techniques in practice

Deakin Research Online

Crossref