15 research outputs found
Social Sensing of Floods in the UK
"Social sensing" is a form of crowd-sourcing that involves systematic
analysis of digital communications to detect real-world events. Here we
consider the use of social sensing for observing natural hazards. In
particular, we present a case study that uses data from a popular social media
platform (Twitter) to detect and locate flood events in the UK. In order to
improve data quality we apply a number of filters (timezone, simple text
filters and a naive Bayes `relevance' filter) to the data. We then use place
names in the user profile and message text to infer the location of the tweets.
These two steps remove most of the irrelevant tweets and yield orders of
magnitude more located tweets than we have by relying on geo-tagged data. We
demonstrate that high resolution social sensing of floods is feasible and we
can produce high-quality historical and real-time maps of floods using Twitter.Comment: 24 pages, 6 figure
Number of tweets collected per day during the whole collection period 22/12/2015 and 04/01/2016 at each filter level.
<p>Number of tweets collected per day during the whole collection period 22/12/2015 and 04/01/2016 at each filter level.</p
Flood map generated by twitter converted into FFC format for validation.
<p>White indicated no tweets. Colour bar units are relative floodiness. Top Left: Floodiness grid (64 × 64) over England and Wales on 28/10/2015 using
(<i>r</i>, <i>α</i>) = (1.0, 0.15). Top Right: Showing only grid squares above threshold 0.1. Bottom Left: Counties with floods on 28/10/2015 according to Twitter. Bottom Right: Counties with floods on 28/10/2015 according to the FFC, with <i>g<sub>h</sub></i> set to 1 for flooded counties.</p
Floodiness grid, 64 × 64, over the North East on 5/12/2015 between 4pm and 5pm using (<i>r</i>, <i>α</i>, <i>T</i>) = (1.0, 0.15, 0.1).
<p>White indicates no tweets or zero population. Colour bar indicates floodiness relative to daily max.</p
Number of tweets collected per day during the whole collection period 22/10/2015 and 25/11/2016.
<p>Number of tweets collected per day during the whole collection period 22/10/2015 and 25/11/2016.</p
Tuning relative floodiness threshold <i>T</i> by varying text versus location weighting <i>r</i> and population scaling exponent α.
<p>Each point corresponds to the average precision and recall over 15 days for a different triple of <i>r</i>, <i>α</i>, <i>T</i>.</p
Total number of tweets remaining after each filter is applied and correlation of the number of tweets per day with FFC data.
<p>Total number of tweets remaining after each filter is applied and correlation of the number of tweets per day with FFC data.</p
Number of relevant tweets collected with location info in each field: GPS-tagged tweets, location field GPS coordinates, location field toponyms, message text toponyms.
<p>Number of relevant tweets collected with location info in each field: GPS-tagged tweets, location field GPS coordinates, location field toponyms, message text toponyms.</p
Total number of tweets with each kind of location information.
<p>Total number of tweets with each kind of location information.</p
Precision, recall and parameter set obtained by maximising <i>F</i><sub><i>β</i></sub> scores using absolute and normalised floodiness.
<p>Precision, recall and parameter set obtained by maximising <i>F</i><sub><i>β</i></sub> scores using absolute and normalised floodiness.</p