empathi: An ontology for Emergency Managing and Planning about Hazard Crisis
In the domain of emergency management during hazard crises, having sufficient
situational awareness information is critical. It requires capturing and
integrating information from sources such as satellite images, local sensors
and social media content generated by local people. A major obstacle to
capturing, representing and integrating such heterogeneous and diverse
information is the lack of an ontology that properly conceptualizes this
domain and aggregates and unifies datasets. Thus, in this paper, we introduce
the empathi ontology, which conceptualizes the core concepts of the
domain of emergency managing and planning of hazard crises. Although empathi
takes a coarse-grained view, it covers the concepts and relations
essential to this domain. This ontology is available at
https://w3id.org/empathi/
Hiding in Plain Sight: A Longitudinal Study of Combosquatting Abuse
Domain squatting is a common adversarial practice where attackers register
domain names that are purposefully similar to popular domains. In this work, we
study a specific type of domain squatting called "combosquatting," in which
attackers register domains that combine a popular trademark with one or more
phrases (e.g., betterfacebook[.]com, youtube-live[.]com). We perform the first
large-scale, empirical study of combosquatting by analyzing more than 468
billion DNS records---collected from passive and active DNS data sources over
almost six years. We find that almost 60% of abusive combosquatting domains
live for more than 1,000 days, and even worse, we observe increased activity
associated with combosquatting year over year. Moreover, we show that
combosquatting is used to perform a spectrum of different types of abuse
including phishing, social engineering, affiliate abuse, trademark abuse, and
even advanced persistent threats. Our results suggest that combosquatting is a
real problem that requires increased scrutiny by the security community.
Comment: ACM CCS 1
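The definition of combosquatting given in the abstract above (a popular trademark combined with one or more extra phrases) can be illustrated with a small check. This is only a minimal sketch under stated assumptions, not the authors' detection method, which the abstract does not describe. The function names `second_level_label` and `is_combosquatting_candidate` are hypothetical, and the label extraction naively assumes a single-label top-level domain such as `.com`.

```python
def second_level_label(domain: str) -> str:
    """Return the label just left of the top-level domain.

    Assumption: the TLD is a single label (e.g. ".com"); multi-label
    suffixes such as ".co.uk" are not handled in this sketch.
    """
    # Undo the "[.]" defanging used in the abstract's examples.
    cleaned = domain.lower().replace("[.]", ".").strip(".")
    labels = cleaned.split(".")
    return labels[-2] if len(labels) >= 2 else labels[0]


def is_combosquatting_candidate(domain: str, trademark: str) -> bool:
    """True if the label contains the trademark plus extra characters."""
    label = second_level_label(domain)
    mark = trademark.lower()
    # The trademark must appear intact, and the label must not be the
    # trademark itself (that would be the legitimate domain).
    if mark not in label or label == mark:
        return False
    # Something other than hyphens must remain once the mark is removed.
    remainder = label.replace(mark, "", 1).strip("-")
    return len(remainder) > 0


print(is_combosquatting_candidate("betterfacebook[.]com", "facebook"))  # True
print(is_combosquatting_candidate("youtube-live[.]com", "youtube"))     # True
print(is_combosquatting_candidate("facebook.com", "facebook"))          # False
```

A string match like this only flags candidates. Deciding whether a flagged domain is actually abusive needs further evidence, which is outside the scope of this sketch.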
Information needs after stroke: What to include and how to structure it on a website. A qualitative study using focus groups and card sorting
Background: Use of the Internet to obtain health and other information is increasing. Previous studies have identified the specific information needs of people with stroke but not in relation to the Internet. People with aphasia (PwA) may face barriers in accessing the Internet: Navigating websites requires an ability to categorise information and this ability is often impaired in PwA. The website categorisation preferences of people with stroke and with aphasia have not yet been reported.
Aims: This study aimed: (a) to determine what information people who have had a stroke would like to see on a website about living with stroke; (b) to determine the most effective means of structuring information on the website so that it is accessible to people with stroke; and (c) to identify any differences between people with and without aphasia in terms of preferences for structuring information on the website.
Methods & Procedures: Participants were recruited from a hospital's Stroke Database. Focus groups were used to elicit what information participants wanted on a website about living with stroke. The themes raised were depicted on 133 cards. To determine the most effective way of structuring information on the website, and whether there were any differences in preferences between PwA and PwoA, participants used a modified closed card-sorting technique to sort the cards under website categories.
Outcomes & Results: A total of 48 people were invited, and 12 (25%) agreed to take part. We ran three focus groups: one with PwA (n = 5) and two with people without aphasia (PwoA) (n = 3, n = 4). Participants wanted more information about stroke causes and effects (particularly emotional issues), roles of local agencies, and returning to previous activities (driving, going out). All participants completed the card-sorting exercise. Few cards (6%) were categorised identically by everyone. Cards relating to local agencies and groups were not consistently categorised together. Cards relating to emotions were segregated. The categorisation preferences for PwA were more fragmented than those for PwoA: 60% of PwA agreed on the categorisation of 51% of the cards, whereas 60% of PwoA agreed on the categorisation of 76% of the cards.
Conclusions: Information needs covered all stages of the stroke journey. The card sorting was accessible to everyone, and provided evidence of structuring preferences and of some of the categorisation difficulties faced by PwA. More research is needed on what an accessible website looks like for PwA.
Museums as disseminators of niche knowledge: Universality in accessibility for all
Accessibility has faced several challenges within Audiovisual Translation Studies and gained great opportunities for its establishment as a methodologically and theoretically well-founded discipline. Initially conceived as a set of services and practices that provides access to audiovisual media content for persons with sensory impairment, today accessibility can be viewed as a concept involving more and more universality thanks to its contribution to the dissemination of audiovisual products on the topic of marginalisation. Against this theoretical backdrop, accessibility is scrutinised from the perspective of aesthetics of migration and minorities within the field of the visual arts in museum settings. These aesthetic narrative forms act as modalities that encourage the diffusion of ‘niche’ knowledge, where processes of translation and interpretation provide access to all knowledge as counter discourse. Within this framework, the ways in which language is used can be considered the beginning of a type of local grammar in English as lingua franca for interlingual translation and subtitling, both of which ensure access to knowledge for all citizens as a human rights principle and regardless of cultural and social differences. Accessibility is thus gaining momentum as an agent for the democratisation and transparency of information against media discourse distortions and oversimplifications.
Identifying Purpose Behind Electoral Tweets
Tweets pertaining to a single event, such as a national election, can number
in the hundreds of millions. Automatically analyzing them is beneficial in many
downstream natural language applications such as question answering and
summarization. In this paper, we propose a new task: identifying the purpose
behind electoral tweets--why do people post election-oriented tweets? We show
that identifying purpose is correlated with the related phenomenon of sentiment
and emotion detection, yet significantly different. Detecting purpose has a
number of applications including detecting the mood of the electorate,
estimating the popularity of policies, identifying key issues of contention,
and predicting the course of events. We create a large dataset of electoral
tweets and annotate a few thousand tweets for purpose. We develop a system that
automatically classifies electoral tweets as per their purpose, obtaining an
accuracy of 43.56% on an 11-class task and an accuracy of 73.91% on a 3-class
task (both accuracies well above the most-frequent-class baseline). Finally, we
show that resources developed for emotion detection are also helpful for
detecting purpose.
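The most-frequent-class baseline mentioned in the abstract above is the accuracy obtained by always predicting the commonest label. A minimal sketch follows. The function name and the toy labels are hypothetical illustrations, not data or code from the paper.

```python
from collections import Counter


def most_frequent_class_accuracy(labels):
    """Accuracy of a classifier that always predicts the commonest label."""
    if not labels:
        raise ValueError("labels must be non-empty")
    counts = Counter(labels)
    # most_common(1) returns a one-element list: [(label, count)].
    _, top_count = counts.most_common(1)[0]
    return top_count / len(labels)


# Hypothetical toy labels, not drawn from the paper's dataset.
toy_labels = ["support", "oppose", "support", "other"]
print(most_frequent_class_accuracy(toy_labels))  # 0.5
```

A reported accuracy is informative only in relation to this figure. With heavily skewed classes the baseline alone can be high, which is why the abstract states that both results exceed it.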
Methodologies for the Automatic Location of Academic and Educational Texts on the Internet
Traditionally, online databases of web resources have been compiled by a human editor, or through the submissions of authors or interested parties. Considerable resources are needed to maintain a constant level of input and relevance in the face of increasing material quantity and quality, and much of what is in databases is of an ephemeral nature. These pressures dictate that many databases stagnate after an initial period of enthusiastic data entry. The solution to this problem would seem to be the automatic harvesting of resources. However, this process necessitates the automatic classification of resources as ‘appropriate’ to a given database, a problem only solved by complex text content analysis.
This paper outlines the component methodologies necessary to construct such an automated harvesting system, including a number of novel approaches. In particular, this paper looks at the specific problems of automatically identifying academic research work and Higher Education pedagogic materials. Where appropriate, experimental data is presented from searches in the field of Geography as well as the Earth and Environmental Sciences. In addition, appropriate software is reviewed where it exists, and future directions are outlined.
Computational Sociolinguistics: A Survey
Language is a social phenomenon and variation is inherent to its social
nature. Recently, there has been a surge of interest within the computational
linguistics (CL) community in the social dimension of language. In this article
we present a survey of the emerging field of "Computational Sociolinguistics"
that reflects this increased interest. We aim to provide a comprehensive
overview of CL research on sociolinguistic themes, featuring topics such as the
relation between language and social identity, language use in social
interaction and multilingual communication. Moreover, we demonstrate the
potential for synergy between the research communities involved, by showing how
the large-scale data-driven methods that are widely used in CL can complement
existing sociolinguistic studies, and how sociolinguistics can inform and
challenge the methods and assumptions employed in CL studies. We hope to convey
the possible benefits of a closer collaboration between the two communities and
conclude with a discussion of open challenges.
Comment: To appear in Computational Linguistics. Accepted for publication:
18th February, 201
A Survey of Volunteered Open Geo-Knowledge Bases in the Semantic Web
Over the past decade, rapid advances in web technologies, coupled with
innovative models of spatial data collection and consumption, have generated a
robust growth in geo-referenced information, resulting in spatial information
overload. Increasing 'geographic intelligence' in traditional text-based
information retrieval has become a prominent approach to respond to this issue
and to fulfill users' spatial information needs. Numerous efforts in the
Semantic Geospatial Web, Volunteered Geographic Information (VGI), and the
Linking Open Data initiative have converged in a constellation of open
knowledge bases, freely available online. In this article, we survey these open
knowledge bases, focusing on their geospatial dimension. Particular attention
is devoted to the crucial issue of the quality of geo-knowledge bases, as well
as of crowdsourced data. A new knowledge base, the OpenStreetMap Semantic
Network, is outlined as our contribution to this area. Research directions in
information integration and Geographic Information Retrieval (GIR) are then
reviewed, with a critical discussion of their current limitations and future
prospects.