Social Sensing of Floods in the UK

Author: Rudy Arthur
Chris A. Boulton
Humphrey Shotton
Hywel T. P. Williams
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 13/11/2017

"Social sensing" is a form of crowd-sourcing that involves systematic analysis of digital communications to detect real-world events. Here we consider the use of social sensing for observing natural hazards. In particular, we present a case study that uses data from a popular social media platform (Twitter) to detect and locate flood events in the UK. In order to improve data quality we apply a number of filters (timezone, simple text filters and a naive Bayes `relevance' filter) to the data. We then use place names in the user profile and message text to infer the location of the tweets. These two steps remove most of the irrelevant tweets and yield orders of magnitude more located tweets than we have by relying on geo-tagged data. We demonstrate that high resolution social sensing of floods is feasible and we can produce high-quality historical and real-time maps of floods using Twitter.Comment: 24 pages, 6 figure

arXiv.org e-Print Archive

Crossref

Open Research Exeter

FigShare
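
The abstract above mentions a naive Bayes "relevance" filter but does not describe how it was built. As a rough illustration only, the sketch below shows a generic multinomial naive Bayes text classifier with add-one smoothing. The class name `NaiveBayesRelevance`, the `tokenize` helper, the label strings and the four training sentences are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase and split on whitespace; a real pipeline would normalise more.
    return text.lower().split()


class NaiveBayesRelevance:
    """Multinomial naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        # Vocabulary is the union of words seen in any class.
        self.vocab = set()
        for counts in self.word_counts.values():
            self.vocab.update(counts)
        return self

    def log_score(self, text, c):
        # Log prior plus the sum of smoothed log likelihoods.
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[c] / total_docs)
        denom = sum(self.word_counts[c].values()) + len(self.vocab)
        for tok in tokenize(text):
            if tok in self.vocab:  # words never seen in training are skipped
                score += math.log((self.word_counts[c][tok] + 1) / denom)
        return score

    def predict(self, text):
        return max(self.classes, key=lambda c: self.log_score(text, c))


# Tiny hand-written training set (hypothetical, for illustration only).
train = [
    ("river burst its banks road flooded", "relevant"),
    ("flood warning homes under water", "relevant"),
    ("flooded with emails today", "irrelevant"),
    ("a flood of tears at the cinema", "irrelevant"),
]
model = NaiveBayesRelevance().fit([t for t, _ in train], [y for _, y in train])
print(model.predict("road flooded near the river"))
print(model.predict("flood of emails at work"))
```

With this toy data the first message is classified as relevant and the second as irrelevant. A filter trained on real labelled tweets would need a far larger training set and more careful text normalisation.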

Number of tweets collected per day during the whole collection period (22/12/2015 to 04/01/2016) at each filter level.

Author: Chris A. Boulton (4811043)
Humphrey Shotton (4811040)
Hywel T. P. Williams (4811049)
Rudy Arthur (4811046)

FigShare

Flood map generated from Twitter data, converted into FFC format for validation.

Author: Chris A. Boulton (4811043)
Humphrey Shotton (4811040)
Hywel T. P. Williams (4811049)
Rudy Arthur (4811046)

White indicates no tweets. Colour bar units are relative floodiness. Top Left: Floodiness grid (64 × 64) over England and Wales on 28/10/2015 using (r, α) = (1.0, 0.15). Top Right: Showing only grid squares above threshold 0.1. Bottom Left: Counties with floods on 28/10/2015 according to Twitter. Bottom Right: Counties with floods on 28/10/2015 according to the FFC, with gh set to 1 for flooded counties.

FigShare

Floodiness grid, 64 × 64, over the North East on 5/12/2015 between 4pm and 5pm using (r, α, T) = (1.0, 0.15, 0.1).

Author: Chris A. Boulton (4811043)
Humphrey Shotton (4811040)
Hywel T. P. Williams (4811049)
Rudy Arthur (4811046)

White indicates no tweets or zero population. Colour bar indicates floodiness relative to daily max.

FigShare

Number of tweets collected per day during the whole collection period (22/10/2015 to 25/11/2016).

Author: Chris A. Boulton (4811043)
Humphrey Shotton (4811040)
Hywel T. P. Williams (4811049)
Rudy Arthur (4811046)

FigShare

Tuning relative floodiness threshold T by varying text versus location weighting r and population scaling exponent α.

Author: Chris A. Boulton (4811043)
Humphrey Shotton (4811040)
Hywel T. P. Williams (4811049)
Rudy Arthur (4811046)

Each point corresponds to the average precision and recall over 15 days for a different triple of r, α, T.

FigShare

Total number of tweets remaining after each filter is applied and correlation of the number of tweets per day with FFC data.

Author: Chris A. Boulton (4811043)
Humphrey Shotton (4811040)
Hywel T. P. Williams (4811049)
Rudy Arthur (4811046)

FigShare

Number of relevant tweets collected with location info in each field: GPS-tagged tweets, location field GPS coordinates, location field toponyms, message text toponyms.

Author: Chris A. Boulton (4811043)
Humphrey Shotton (4811040)
Hywel T. P. Williams (4811049)
Rudy Arthur (4811046)

FigShare

Total number of tweets with each kind of location information.

Author: Chris A. Boulton (4811043)
Humphrey Shotton (4811040)
Hywel T. P. Williams (4811049)
Rudy Arthur (4811046)

FigShare

Precision, recall and parameter set obtained by maximising Fβ scores using absolute and normalised floodiness.

Author: Chris A. Boulton (4811043)
Humphrey Shotton (4811040)
Hywel T. P. Williams (4811049)
Rudy Arthur (4811046)

FigShare
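
The record above refers to choosing a parameter set by maximising Fβ scores. As a hedged illustration, the sketch below uses the standard definition Fβ = (1 + β²)·P·R / (β²·P + R) and picks the best-scoring entry from a list of candidates. The function names `f_beta` and `best_parameters` are hypothetical, and the precision and recall values in `results` are made-up numbers, not results from the paper.

```python
def f_beta(precision, recall, beta=1.0):
    # Standard F-beta score; beta > 1 favours recall, beta < 1 favours precision.
    if precision == 0 and recall == 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def best_parameters(results, beta=1.0):
    # results: list of (params, precision, recall) tuples.
    # Returns the tuple whose precision and recall give the highest F-beta.
    return max(results, key=lambda row: f_beta(row[1], row[2], beta))


# Hypothetical (r, alpha, T) triples with invented precision and recall values.
results = [
    ((1.0, 0.15, 0.1), 0.6, 0.8),
    ((0.5, 0.15, 0.2), 0.9, 0.4),
]
print(best_parameters(results, beta=1.0)[0])
print(best_parameters(results, beta=0.5)[0])
```

In this toy example the first triple wins at β = 1, while β = 0.5, which weights precision more heavily, selects the second.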
