9,418 research outputs found

    Web Data Extraction, Applications and Techniques: A Survey

    Full text link
    Web Data Extraction is an important problem that has been studied by means of different scientific tools and in a broad range of applications. Many approaches to extracting data from the Web have been designed to solve specific problems and operate in ad-hoc domains. Other approaches, instead, heavily reuse techniques and algorithms developed in the field of Information Extraction. This survey aims at providing a structured and comprehensive overview of the literature in the field of Web Data Extraction. We provided a simple classification framework in which existing Web Data Extraction applications are grouped into two main classes, namely applications at the Enterprise level and at the Social Web level. At the Enterprise level, Web Data Extraction techniques emerge as a key tool to perform data analysis in Business and Competitive Intelligence systems as well as for business process re-engineering. At the Social Web level, Web Data Extraction techniques allow to gather a large amount of structured data continuously generated and disseminated by Web 2.0, Social Media and Online Social Network users and this offers unprecedented opportunities to analyze human behavior at a very large scale. We discuss also the potential of cross-fertilization, i.e., on the possibility of re-using Web Data Extraction techniques originally designed to work in a given domain, in other domains.Comment: Knowledge-based System

    You are What you Eat (and Drink): Identifying Cultural Boundaries by Analyzing Food & Drink Habits in Foursquare

    Full text link
    Food and drink are two of the most basic needs of human beings. However, as society evolved, food and drink became also a strong cultural aspect, being able to describe strong differences among people. Traditional methods used to analyze cross-cultural differences are mainly based on surveys and, for this reason, they are very difficult to represent a significant statistical sample at a global scale. In this paper, we propose a new methodology to identify cultural boundaries and similarities across populations at different scales based on the analysis of Foursquare check-ins. This approach might be useful not only for economic purposes, but also to support existing and novel marketing and social applications. Our methodology consists of the following steps. First, we map food and drink related check-ins extracted from Foursquare into users' cultural preferences. Second, we identify particular individual preferences, such as the taste for a certain type of food or drink, e.g., pizza or sake, as well as temporal habits, such as the time and day of the week when an individual goes to a restaurant or a bar. Third, we show how to analyze this information to assess the cultural distance between two countries, cities or even areas of a city. Fourth, we apply a simple clustering technique, using this cultural distance measure, to draw cultural boundaries across countries, cities and regions.Comment: 10 pages, 10 figures, 1 table. Proceedings of 8th AAAI Intl. Conf. on Weblogs and Social Media (ICWSM 2014

    Recherche d'Information Sociale et Recommandation: Etat d'art et travaux futurs

    Get PDF
    International audienceThe explosion of web 2.0 and social networks has created an enormous and rewarding source of information that has motivated researchers in different fields to exploit it. Our work revolves around the issue of access and identification of social information and their use in building a user profile enriched with a social dimension, and operating in a process of personalization and recommendation. We study several approaches of Social IR (Information Retrieval), distinguished by the type of incorporated social information. We also study various social recommendation approaches classified by the type of recommendation. We then present a study of techniques for modeling the social user profile dimension, followed by a critical discussion. Thus, we propose our social recommendation approach integrating an advanced social user profile model.L’explosion du web 2.0 et des réseaux sociaux a crée une source d’information énorme et enrichissante qui a motivé les chercheurs dans différents domaines à l’exploiter. Notre travail s’articule autour de la problématique d’accès et d’identification des informations sociales et leur exploitation dans la construction d’un profil utilisateur enrichi d’une dimension sociale, et son exploitation dans un processus de personnalisation et de recommandation. Nous étudions différentes approches sociales de RI (Recherche d’Information), distinguées par le type d’informations sociales incorporées. Nous étudions également diverses approches de recommandation sociale classées par le type de recommandation. Nous exposons ensuite une étude des techniques de modélisation de la dimension sociale du profil utilisateur, suivie par une discussion critique. Ainsi, nous présentons notre approche de recommandation sociale proposée intégrant un modèle avancé de profil utilisateur social
    • …
    corecore