33,717 research outputs found
empathi: An ontology for Emergency Managing and Planning about Hazard Crisis
In the domain of emergency management during hazard crises, having sufficient
situational awareness information is critical. It requires capturing and
integrating information from sources such as satellite images, local sensors
and social media content generated by local people. A bold obstacle to
capturing, representing and integrating such heterogeneous and diverse
information is lack of a proper ontology which properly conceptualizes this
domain, aggregates and unifies datasets. Thus, in this paper, we introduce
empathi ontology which conceptualizes the core concepts concerning with the
domain of emergency managing and planning of hazard crises. Although empathi
has a coarse-grained view, it considers the necessary concepts and relations
being essential in this domain. This ontology is available at
https://w3id.org/empathi/
Social media mining for identification and exploration of health-related information from pregnant women
Widespread use of social media has led to the generation of substantial
amounts of information about individuals, including health-related information.
Social media provides the opportunity to study health-related information about
selected population groups who may be of interest for a particular study. In
this paper, we explore the possibility of utilizing social media to perform
targeted data collection and analysis from a particular population group --
pregnant women. We hypothesize that we can use social media to identify cohorts
of pregnant women and follow them over time to analyze crucial health-related
information. To identify potentially pregnant women, we employ simple
rule-based searches that attempt to detect pregnancy announcements with
moderate precision. To further filter out false positives and noise, we employ
a supervised classifier using a small number of hand-annotated data. We then
collect their posts over time to create longitudinal health timelines and
attempt to divide the timelines into different pregnancy trimesters. Finally,
we assess the usefulness of the timelines by performing a preliminary analysis
to estimate drug intake patterns of our cohort at different trimesters. Our
rule-based cohort identification technique collected 53,820 users over thirty
months from Twitter. Our pregnancy announcement classification technique
achieved an F-measure of 0.81 for the pregnancy class, resulting in 34,895 user
timelines. Analysis of the timelines revealed that pertinent health-related
information, such as drug-intake and adverse reactions can be mined from the
data. Our approach to using user timelines in this fashion has produced very
encouraging results and can be employed for other important tasks where
cohorts, for which health-related information may not be available from other
sources, are required to be followed over time to derive population-based
estimates.Comment: 9 page
Understanding Psycholinguistic Behavior of predominant drunk texters in Social Media
In the last decade, social media has evolved as one of the leading platform
to create, share, or exchange information; it is commonly used as a way for
individuals to maintain social connections. In this online digital world,
people use to post texts or pictures to express their views socially and create
user-user engagement through discussions and conversations. Thus, social media
has established itself to bear signals relating to human behavior. One can
easily design user characteristic network by scraping through someone's social
media profiles. In this paper, we investigate the potential of social media in
characterizing and understanding predominant drunk texters from the perspective
of their social, psychological and linguistic behavior as evident from the
content generated by them. Our research aims to analyze the behavior of drunk
texters on social media and to contrast this with non-drunk texters. We use
Twitter social media to obtain the set of drunk texters and non-drunk texters
and show that we can classify users into these two respective sets using
various psycholinguistic features with an overall average accuracy of 96.78%
with very high precision and recall. Note that such an automatic classification
can have far-reaching impact - (i) on health research related to addiction
prevention and control, and (ii) in eliminating abusive and vulgar contents
from Twitter, borne by the tweets of drunk texters.Comment: 6 pages, 8 Figures, ISCC 2018 Workshops - ICTS4eHealth 201
Graphs in machine learning: an introduction
Graphs are commonly used to characterise interactions between objects of
interest. Because they are based on a straightforward formalism, they are used
in many scientific fields from computer science to historical sciences. In this
paper, we give an introduction to some methods relying on graphs for learning.
This includes both unsupervised and supervised methods. Unsupervised learning
algorithms usually aim at visualising graphs in latent spaces and/or clustering
the nodes. Both focus on extracting knowledge from graph topologies. While most
existing techniques are only applicable to static graphs, where edges do not
evolve through time, recent developments have shown that they could be extended
to deal with evolving networks. In a supervised context, one generally aims at
inferring labels or numerical values attached to nodes using both the graph
and, when they are available, node characteristics. Balancing the two sources
of information can be challenging, especially as they can disagree locally or
globally. In both contexts, supervised and un-supervised, data can be
relational (augmented with one or several global graphs) as described above, or
graph valued. In this latter case, each object of interest is given as a full
graph (possibly completed by other characteristics). In this context, natural
tasks include graph clustering (as in producing clusters of graphs rather than
clusters of nodes in a single graph), graph classification, etc. 1 Real
networks One of the first practical studies on graphs can be dated back to the
original work of Moreno [51] in the 30s. Since then, there has been a growing
interest in graph analysis associated with strong developments in the modelling
and the processing of these data. Graphs are now used in many scientific
fields. In Biology [54, 2, 7], for instance, metabolic networks can describe
pathways of biochemical reactions [41], while in social sciences networks are
used to represent relation ties between actors [66, 56, 36, 34]. Other examples
include powergrids [71] and the web [75]. Recently, networks have also been
considered in other areas such as geography [22] and history [59, 39]. In
machine learning, networks are seen as powerful tools to model problems in
order to extract information from data and for prediction purposes. This is the
object of this paper. For more complete surveys, we refer to [28, 62, 49, 45].
In this section, we introduce notations and highlight properties shared by most
real networks. In Section 2, we then consider methods aiming at extracting
information from a unique network. We will particularly focus on clustering
methods where the goal is to find clusters of vertices. Finally, in Section 3,
techniques that take a series of networks into account, where each network i
Using Twitter to Understand Public Interest in Climate Change: The case of Qatar
Climate change has received an extensive attention from public opinion in the
last couple of years, after being considered for decades as an exclusive
scientific debate. Governments and world-wide organizations such as the United
Nations are working more than ever on raising and maintaining public awareness
toward this global issue. In the present study, we examine and analyze Climate
Change conversations in Qatar's Twittersphere, and sense public awareness
towards this global and shared problem in general, and its various related
topics in particular. Such topics include but are not limited to politics,
economy, disasters, energy and sandstorms. To address this concern, we collect
and analyze a large dataset of 109 million tweets posted by 98K distinct users
living in Qatar -- one of the largest emitters of CO2 worldwide. We use a
taxonomy of climate change topics created as part of the United Nations Pulse
project to capture the climate change discourse in more than 36K tweets. We
also examine which topics people refer to when they discuss climate change, and
perform different analysis to understand the temporal dynamics of public
interest toward these topics.Comment: Will appear in the proceedings of the International Workshop on
Social Media for Environment and Ecological Monitoring (SWEEM'16
- …