9,418 research outputs found
Web Data Extraction, Applications and Techniques: A Survey
Web Data Extraction is an important problem that has been studied by means of
different scientific tools and in a broad range of applications. Many
approaches to extracting data from the Web have been designed to solve specific
problems and operate in ad-hoc domains. Other approaches, instead, heavily
reuse techniques and algorithms developed in the field of Information
Extraction.
This survey aims at providing a structured and comprehensive overview of the
literature in the field of Web Data Extraction. We provided a simple
classification framework in which existing Web Data Extraction applications are
grouped into two main classes, namely applications at the Enterprise level and
at the Social Web level. At the Enterprise level, Web Data Extraction
techniques emerge as a key tool to perform data analysis in Business and
Competitive Intelligence systems as well as for business process
re-engineering. At the Social Web level, Web Data Extraction techniques allow
to gather a large amount of structured data continuously generated and
disseminated by Web 2.0, Social Media and Online Social Network users and this
offers unprecedented opportunities to analyze human behavior at a very large
scale. We discuss also the potential of cross-fertilization, i.e., on the
possibility of re-using Web Data Extraction techniques originally designed to
work in a given domain, in other domains.Comment: Knowledge-based System
You are What you Eat (and Drink): Identifying Cultural Boundaries by Analyzing Food & Drink Habits in Foursquare
Food and drink are two of the most basic needs of human beings. However, as
society evolved, food and drink became also a strong cultural aspect, being
able to describe strong differences among people. Traditional methods used to
analyze cross-cultural differences are mainly based on surveys and, for this
reason, they are very difficult to represent a significant statistical sample
at a global scale. In this paper, we propose a new methodology to identify
cultural boundaries and similarities across populations at different scales
based on the analysis of Foursquare check-ins. This approach might be useful
not only for economic purposes, but also to support existing and novel
marketing and social applications. Our methodology consists of the following
steps. First, we map food and drink related check-ins extracted from Foursquare
into users' cultural preferences. Second, we identify particular individual
preferences, such as the taste for a certain type of food or drink, e.g., pizza
or sake, as well as temporal habits, such as the time and day of the week when
an individual goes to a restaurant or a bar. Third, we show how to analyze this
information to assess the cultural distance between two countries, cities or
even areas of a city. Fourth, we apply a simple clustering technique, using
this cultural distance measure, to draw cultural boundaries across countries,
cities and regions.Comment: 10 pages, 10 figures, 1 table. Proceedings of 8th AAAI Intl. Conf. on
Weblogs and Social Media (ICWSM 2014
Recherche d'Information Sociale et Recommandation: Etat d'art et travaux futurs
International audienceThe explosion of web 2.0 and social networks has created an enormous and rewarding source of information that has motivated researchers in different fields to exploit it. Our work revolves around the issue of access and identification of social information and their use in building a user profile enriched with a social dimension, and operating in a process of personalization and recommendation. We study several approaches of Social IR (Information Retrieval), distinguished by the type of incorporated social information. We also study various social recommendation approaches classified by the type of recommendation. We then present a study of techniques for modeling the social user profile dimension, followed by a critical discussion. Thus, we propose our social recommendation approach integrating an advanced social user profile model.L’explosion du web 2.0 et des réseaux sociaux a crée une source d’information énorme et enrichissante qui a motivé les chercheurs dans différents domaines à l’exploiter. Notre travail s’articule autour de la problématique d’accès et d’identification des informations sociales et leur exploitation dans la construction d’un profil utilisateur enrichi d’une dimension sociale, et son exploitation dans un processus de personnalisation et de recommandation. Nous étudions différentes approches sociales de RI (Recherche d’Information), distinguées par le type d’informations sociales incorporées. Nous étudions également diverses approches de recommandation sociale classées par le type de recommandation. Nous exposons ensuite une étude des techniques de modélisation de la dimension sociale du profil utilisateur, suivie par une discussion critique. Ainsi, nous présentons notre approche de recommandation sociale proposée intégrant un modèle avancé de profil utilisateur social
- …