Multivariate Hawkes Processes for Large-scale Inference

Author: Kalogeratos Argyris
Lemonnier Rémi
Scaman Kevin
Publication date: 26/02/2016

In this paper, we present a framework for fitting multivariate Hawkes processes for large-scale problems, both in the number of events in the observed history $n$ and the number of event types $d$ (i.e. dimensions). The proposed Low-Rank Hawkes Process (LRHP) framework introduces a low-rank approximation of the kernel matrix that allows the nonparametric learning of the $d^2$ triggering kernels to be performed using at most $O(ndr^2)$ operations, where $r$ is the rank of the approximation ($r \ll d, n$). This is a major improvement over the existing state-of-the-art inference algorithms, which are in $O(nd^2)$. Furthermore, the low-rank approximation allows LRHP to learn representative patterns of interaction between event types, which may be valuable for the analysis of such complex processes in real-world datasets. The efficiency and scalability of our approach are illustrated with numerical experiments on simulated as well as real datasets.

Comment: 16 pages, 5 figures
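To make the two complexity bounds in this abstract concrete, the following minimal sketch compares their leading-order operation counts. The function names and the problem sizes are hypothetical, constants hidden by the O-notation are ignored, and nothing here reproduces the LRHP algorithm itself.

```python
def lrhp_ops(n: int, d: int, r: int) -> int:
    """Leading-order count n * d * r^2 quoted for LRHP (constants ignored)."""
    return n * d * r**2


def baseline_ops(n: int, d: int) -> int:
    """Leading-order count n * d^2 quoted for prior methods (constants ignored)."""
    return n * d**2


# Hypothetical problem sizes, chosen only for illustration.
n, d, r = 1_000_000, 10_000, 10

# The ratio simplifies to d / r**2, so the bound only improves when r**2 < d.
speedup = baseline_ops(n, d) / lrhp_ops(n, d, r)
print(speedup)
```

Because the ratio of the two bounds is d / r^2, the stated improvement is largest when the rank is small relative to the square root of the number of event types.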

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Efficient Non-parametric Bayesian Hawkes Processes

Author: Rizoiu Marian-Andrei
Walder Christian
Xie Lexing
Zhang Rui
Publication date: 25/05/2019

In this paper, we develop an efficient nonparametric Bayesian estimation of the kernel function of Hawkes processes. The nonparametric Bayesian approach is important because it provides flexible Hawkes kernels and quantifies their uncertainty. Our method is based on the cluster representation of Hawkes processes. Utilizing the stationarity of the Hawkes process, we efficiently sample random branching structures and thus split the Hawkes process into clusters of Poisson processes. We derive two algorithms, a block Gibbs sampler and a maximum a posteriori estimator based on expectation maximization, and we show that our methods have linear time complexity, both theoretically and empirically. On synthetic data, we show that our methods can infer flexible Hawkes triggering kernels. On two large-scale Twitter diffusion datasets, we show that our methods outperform the current state of the art in goodness of fit and that the time complexity is linear in the size of the dataset. We also observe that, on diffusions related to online videos, the learned kernels reflect the perceived longevity of different content types, such as music or pet videos.
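The cluster representation mentioned in this abstract can be illustrated with a forward simulation: immigrant events arrive as a homogeneous Poisson process, and every event independently triggers a Poisson number of children. The sketch below is not the authors' sampler, which infers branching structures from observed data; it only generates data with a known branching structure. It assumes a univariate process with an exponential kernel alpha * beta * exp(-beta * t), and all names and parameter values are hypothetical.

```python
import random


def poisson_draw(rng: random.Random, mean: float) -> int:
    """Draw a Poisson(mean) count by counting unit-rate arrivals in [0, mean]."""
    count, t = 0, rng.expovariate(1.0)
    while t < mean:
        count += 1
        t += rng.expovariate(1.0)
    return count


def simulate_hawkes_clusters(mu, alpha, beta, horizon, seed=0):
    """Simulate a univariate Hawkes process on [0, horizon) via clusters.

    mu > 0 is the immigrant rate, alpha the expected number of children per
    event (alpha < 1 for stationarity), and beta > 0 the decay rate of the
    assumed exponential kernel. Returns a time-sorted list of
    (time, parent_time) pairs, with parent_time None for immigrants.
    """
    rng = random.Random(seed)

    # Immigrants: homogeneous Poisson process with rate mu.
    pending = []
    t = rng.expovariate(mu)
    while t < horizon:
        pending.append((t, None))
        t += rng.expovariate(mu)

    # Each event spawns Poisson(alpha) children at Exp(beta) delays.
    events = []
    while pending:
        time, parent = pending.pop()
        events.append((time, parent))
        for _ in range(poisson_draw(rng, alpha)):
            child = time + rng.expovariate(beta)
            if child < horizon:
                pending.append((child, time))

    events.sort(key=lambda e: e[0])
    return events


events = simulate_hawkes_clusters(mu=0.5, alpha=0.5, beta=1.0, horizon=100.0, seed=1)
print(len(events))
```

Grouping the simulated events by their root immigrant recovers the clusters; an inference method has to sample this parent assignment because it is not observed in real data.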

arXiv.org e-Print Archive

OPUS - University of Technology Sydney

Multivariate Spatiotemporal Hawkes Processes and Network Reconstruction

Author: Bertozzi Andrea L.
Brantingham P. Jeffrey
Li Hao
Porter Mason A.
Yuan Baichuan
Publication date: 15/11/2018

There is often latent network structure in spatial and temporal data, and the tools of network analysis can yield fascinating insights into such data. In this paper, we develop a nonparametric method for network reconstruction from spatiotemporal data sets using multivariate Hawkes processes. In contrast to prior work on network reconstruction with point-process models, which has often focused exclusively on temporal information, our approach uses both temporal and spatial information and does not assume a specific parametric form of network dynamics. This leads to an effective way of recovering an underlying network. We illustrate our approach using both synthetic networks and networks constructed from real-world data sets (a location-based social media network, a narrative of crime events, and violent gang crimes). Our results demonstrate that, in comparison to using only temporal data, our spatiotemporal approach yields improved network reconstruction, providing a basis for meaningful subsequent analysis, such as community structure and motif analysis, of the reconstructed networks.

arXiv.org e-Print Archive

eScholarship - University of California

Uncovering Causality from Multivariate Hawkes Integrated Cumulants

Author: Achab Massil
Bacry Emmanuel
Gaïffas Stéphane
Mastromatteo Iacopo
Muzy Jean-Francois
Publication date: 29/05/2017

We design a new nonparametric method that allows one to estimate the matrix of integrated kernels of a multivariate Hawkes process. This matrix not only encodes the mutual influences of each node of the process, but also disentangles the causality relationships between them. Our approach is the first that leads to an estimation of this matrix without any parametric modeling and estimation of the kernels themselves. A consequence is that it can give an estimation of causality relationships between nodes (or users), based on their activity timestamps (on a social network, for instance), without knowing or estimating the shape of the activities' lifetime. For that purpose, we introduce a moment matching method that fits the third-order integrated cumulants of the process. We show in numerical experiments that our approach is indeed very robust to the shape of the kernels, and gives appealing results on the MemeTracker database.
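As background for the moment-matching idea, the sketch below shows the standard relations between the matrix of integrated kernels G of a stationary Hawkes process and its first- and second-order integrated cumulants, using R = (I - G)^{-1}. It assumes NumPy is available, the function names and example values are hypothetical, and the third-order cumulants that the abstract says are fitted are omitted, so this is not the authors' estimator.

```python
import numpy as np


def integrated_cumulants(G, mu):
    """First- and second-order integrated cumulants of a stationary Hawkes process.

    G[i, j] is the integral of the kernel carrying influence from node j to
    node i, and mu is the vector of baseline rates. The spectral radius of G
    is assumed to be below 1.
    """
    d = G.shape[0]
    R = np.linalg.inv(np.eye(d) - G)
    mean_intensity = R @ mu                         # Lambda = R mu
    covariance = R @ np.diag(mean_intensity) @ R.T  # C = R diag(Lambda) R^T
    return R, mean_intensity, covariance


def kernel_integrals_from_R(R):
    """Invert the relation R = (I - G)^{-1} to recover G."""
    return np.eye(R.shape[0]) - np.linalg.inv(R)


# Hypothetical two-node example: node 2 influences node 1, but not vice versa.
G = np.array([[0.2, 0.1],
              [0.0, 0.3]])
mu = np.array([1.0, 2.0])

R, mean_intensity, covariance = integrated_cumulants(G, mu)
print(np.allclose(kernel_integrals_from_R(R), G))
```

A moment-matching estimator works in the opposite direction: it searches for an R whose implied cumulants agree with empirical ones computed from timestamps, then reads off G without ever estimating the kernel shapes.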

arXiv.org e-Print Archive

HAL-Polytechnique

Statistical Inference for Networks of High-Dimensional Point Processes

Author: Kolar Mladen
Shojaie Ali
Wang Xu
Publication venue: Collection of Biostatistics Research Archive
Publication date: 20/12/2019

Fueled in part by recent applications in neuroscience, high-dimensional Hawkes processes have become a popular tool for modeling the network of interactions among multivariate point process data. While evaluating the uncertainty of the network estimates is critical in scientific applications, existing methodological and theoretical work has focused only on estimation. To bridge this gap, this paper proposes a high-dimensional statistical inference procedure with theoretical guarantees for multivariate Hawkes processes. Key to this inference procedure is a new concentration inequality on the first- and second-order statistics for integrated stochastic processes, which summarize the entire history of the process. We apply this concentration inequality, combined with a recent result on martingale central limit theory, to give an upper bound for the convergence rate of the test statistics. We verify our theoretical results with extensive simulations and an application to a neuron spike train data set.

arXiv.org e-Print Archive

Collection Of Biostatistics Research Archive