Search CORE

15 research outputs found

Ensemble Kalman Filters with Resampling

Author: Bao Jiajun
Ghattas Omar Al
Sanz-Alonso Daniel
Publication venue
Publication date: 16/08/2023
Field of study

Filtering is concerned with online estimation of the state of a dynamical system from partial and noisy observations. In applications where the state of the system is high dimensional, ensemble Kalman filters are often the method of choice. These algorithms rely on an ensemble of interacting particles to sequentially estimate the state as new observations become available. Despite the practical success of ensemble Kalman filters, theoretical understanding is hindered by the intricate dependence structure of the interacting particles. This paper investigates ensemble Kalman filters that incorporate an additional resampling step to break the dependency between particles. The new algorithm is amenable to a theoretical analysis that extends and improves upon those available for filters without resampling, while also performing well in numerical examples.Comment: 32 pages, 5 figure

arXiv.org e-Print Archive

Covariance Operator Estimation: Sparsity, Lengthscale, and Ensemble Kalman Filters

Author: Al-Ghattas Omar
Chen Jiaheng
Sanz-Alonso Daniel
Waniorek Nathan
Publication venue
Publication date: 25/10/2023
Field of study

This paper investigates covariance operator estimation via thresholding. For Gaussian random fields with approximately sparse covariance operators, we establish non-asymptotic bounds on the estimation error in terms of the sparsity level of the covariance and the expected supremum of the field. We prove that thresholded estimators enjoy an exponential improvement in sample complexity compared with the standard sample covariance estimator if the field has a small correlation lengthscale. As an application of the theory, we study thresholded estimation of covariance operators within ensemble Kalman filters.Comment: 25 pages, 1 figur

arXiv.org e-Print Archive

Limits of use of social media for monitoring biosecurity events.

Author: Marijke Welvaert
Mark Cameron
Omar Al-Ghattas
Peter Caley
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2017
Field of study

Compared to applications that trigger massive information streams, like earthquakes and human disease epidemics, the data input for agricultural and environmental biosecurity events (ie. the introduction of unwanted exotic pests and pathogens), is expected to be sparse and less frequent. To investigate if Twitter data can be useful for the detection and monitoring of biosecurity events, we adopted a three-step process. First, we confirmed that sightings of two migratory species, the Bogong moth (Agrotis infusa) and the Common Koel (Eudynamys scolopaceus) are reported on Twitter. Second, we developed search queries to extract the relevant tweets for these species. The queries were based on either the taxonomic name, common name or keywords that are frequently used to describe the species (symptomatic or syndromic). Third, we validated the results using ground truth data. Our results indicate that the common name queries provided a reasonable number of tweets that were related to the ground truth data. The taxonomic query resulted in too small datasets, while the symptomatic queries resulted in large datasets, but with highly variable signal-to-noise ratios. No clear relationship was observed between the tweets from the symptomatic queries and the ground truth data. Comparing the results for the two species showed that the level of familiarity with the species plays a major role. The more familiar the species, the more stable and reliable the Twitter data. This clearly presents a problem for using social media to detect the arrival of an exotic organism of biosecurity concern for which public is unfamiliar

Directory of Open Access Journals

PubMed Central

University of Canberra Research Repository

Welvaert et al. Bogong moth and Common Koel surveillance

Author: Al-Ghattas Omar
Caley Peter
Cameron Mark
Welvaert Marijke
Publication venue
Publication date: 24/02/2017
Field of study

The EXCEL datasheets "Common Moth", "Common Koel 1", "Common Koel 2", Common Koel 3", "Symp Moth 1", "Symp Moth2", "Symp Koel 1", "Symp Koel 2", "Symp Koel 3", and "Symp Koel 4" are relevance summaries (0=non-relevant, 1=relevant) of de-identified tweets (from Twitter). Tweets were produced using the Commonwealth Scientific & Industrial Research Organisations Emergency Situation Awareness (ESA) system. They are derived by the searches defined within the associated manuscript. The survey data of Bogong Moth data are field data collected from the summit ridge of Mount Gingera, Brindabella Ranges, Australia. Please contact the correspondence author, Peter Caley ([email protected]), for further information

Dryad Digital Repository (Duke University)

CUSUM charts for the Koel Common name queries.

Author: Marijke Welvaert (480915)
Mark Cameron (407123)
Omar Al-Ghattas (3775813)
Peter Caley (23190)
Publication venue
Publication date
Field of study

Red dots indicate a deviation of the number of tweets, in particular the upper part of the chart points to an increase in tweets. UDB: Upper Decision Boundary; LDB: Lower Decision Boundary.</p

FigShare

Overview of the queries used in this study.

Author: Marijke Welvaert (480915)
Mark Cameron (407123)
Omar Al-Ghattas (3775813)
Peter Caley (23190)
Publication venue
Publication date
Field of study

Three types of queries are distinguished: (1) a taxonomic query using the taxonomic classification, (2) a common name query using the common name of the species, and (3) a symptomatic query that searches for tweets that indicate the presence of the species without mentioning either the taxonomic or common name.</p

FigShare

Lag parameters used to fit AR models for the Twitter time series.

Author: Marijke Welvaert (480915)
Mark Cameron (407123)
Omar Al-Ghattas (3775813)
Peter Caley (23190)
Publication venue
Publication date
Field of study

Lag parameters used to fit AR models for the Twitter time series.</p

FigShare

Validation of Bogong moth Twitter data against ground truth data collected in surveys.

Author: Marijke Welvaert (480915)
Mark Cameron (407123)
Omar Al-Ghattas (3775813)
Peter Caley (23190)
Publication venue
Publication date
Field of study

The Twitter data are represented as time series of the weekly counts, while the survey data are shown as bar charts. The grey shaded area delimits the time period in which tweets couldn’t be reliably captured.</p

FigShare

Validation of Koel Twitter data against historical monthly sightings.

Author: Marijke Welvaert (480915)
Mark Cameron (407123)
Omar Al-Ghattas (3775813)
Peter Caley (23190)
Publication venue
Publication date
Field of study

The Twitter data are represented as time series of the weekly counts, while the historical data are shown as monthly bars that are replicated for each migration season. The grey shaded area delimits the time period in which tweets couldn’t be reliably captured.</p

FigShare

CUSUM charts for the Koel symptomatic queries.

Author: Marijke Welvaert (480915)
Mark Cameron (407123)
Omar Al-Ghattas (3775813)
Peter Caley (23190)
Publication venue
Publication date
Field of study

Red dots indicate a deviation of the number of tweets, in particular the upper part of the chart points to an increase in tweets. UDB: Upper Decision Boundary; LDB: Lower Decision Boundary.</p

FigShare