Search CORE

1,479 research outputs found

Folksonomies and clustering in the collaborative system CiteULike

Author: Andrea Capocci
Berendt B
Caldarelli G
Cattuto C Loreto V
Ferrer i Cancho R
Guido Caldarelli
Heyman P Garcia-Molina H
Hotho A
Lambiotte R
Santos-Neto E Ripeanu M Iamnitchi A
Schmitz C Grahl M Hotho A Stumme G Cattuto C Baldassarri A Loreto V Servedio V D P
Simon H A
Zipf G K
Publication venue: 'IOP Publishing'
Publication date: 16/10/2007
Field of study

We analyze CiteULike, an online collaborative tagging system where users bookmark and annotate scientific papers. Such a system can be naturally represented as a tripartite graph whose nodes represent papers, users and tags connected by individual tag assignments. The semantics of tags is studied here, in order to uncover the hidden relationships between tags. We find that the clustering coefficient reflects the semantical patterns among tags, providing useful ideas for the designing of more efficient methods of data classification and spam detection.Comment: 9 pages, 5 figures, iop style; corrected typo

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari

Archivio della ricerca della Scuola IMT Alti Studi Lucca

IMT Institutional Repository

Measuring Similarity in Large-Scale Folksonomies

Author: Capra Licia
De Meo Pasquale
Ferrara Emilio
Quattrone Giovanni
Publication venue
Publication date: 01/01/2011
Field of study

Social (or folksonomic) tagging has become a very popular way to describe content within Web 2.0 websites. Unlike\ud taxonomies, which overimpose a hierarchical categorisation of content, folksonomies enable end-users to freely create and choose the categories (in this case, tags) that best\ud describe some content. However, as tags are informally de-\ud ﬁned, continually changing, and ungoverned, social tagging\ud has often been criticised for lowering, rather than increasing, the efﬁciency of searching, due to the number of synonyms, homonyms, polysemy, as well as the heterogeneity of\ud users and the noise they introduce. To address this issue, a\ud variety of approaches have been proposed that recommend\ud users what tags to use, both when labelling and when looking for resources. As we illustrate in this paper, real world\ud folksonomies are characterized by power law distributions\ud of tags, over which commonly used similarity metrics, including the Jaccard coefﬁcient and the cosine similarity, fail\ud to compute. We thus propose a novel metric, speciﬁcally\ud developed to capture similarity in large-scale folksonomies,\ud that is based on a mutual reinforcement principle: that is,\ud two tags are deemed similar if they have been associated to\ud similar resources, and vice-versa two resources are deemed\ud similar if they have been labelled by similar tags. We offer an efﬁcient realisation of this similarity metric, and assess its quality experimentally, by comparing it against cosine similarity, on three large-scale datasets, namely Bibsonomy, MovieLens and CiteULike

arXiv.org e-Print Archive

CiteSeerX

UCL Discovery

CogPrints Cognitive Sciences Eprint Archive

Exploring The Value Of Folksonomies For Creating Semantic Metadata

Author: Al-Khalifa Hend S.
Davis Hugh C.
Publication venue
Publication date: 01/01/2007
Field of study

Finding good keywords to describe resources is an on-going problem: typically we select such words manually from a thesaurus of terms, or they are created using automatic keyword extraction techniques. Folksonomies are an increasingly well populated source of unstructured tags describing web resources. This paper explores the value of the folksonomy tags as potential source of keyword metadata by examining the relationship between folksonomies, community produced annotations, and keywords extracted by machines. The experiment has been carried-out in two ways: subjectively, by asking two human indexers to evaluate the quality of the generated keywords from both systems; and automatically, by measuring the percentage of overlap between the folksonomy set and machine generated keywords set. The results of this experiment show that the folksonomy tags agree more closely with the human generated keywords than those automatically generated. The results also showed that the trained indexers preferred the semantics of folksonomy tags compared to keywords extracted automatically. These results can be considered as evidence for the strong relationship of folksonomies to the human indexer’s mindset, demonstrating that folksonomies used in the del.icio.us bookmarking service are a potential source for generating semantic metadata to annotate web resources

CiteSeerX

Southampton (e-Prints Soton)

Crossref

Tag-Aware Recommender Systems: A State-of-the-art Survey

Author: A Capocci
A Clauset
A Gunawardana
A Hotho
AE Gelfand
AP Dempster
B Pittel
C Cattuto
C Cattuto
C Cattuto
C Liu
DM Blei
G Adomavicius
G Cimini
G Ghoshal
G Koutrika
G Linden
G Salton
GQ Zhang
J Scott
JA Hanley
JB Schafer
JL Herlocker
JM Kleinberg
JW Wang
K Tso
L Lathauwer De
L Lü
L Spiteri
LdaF Costa
M Dubinko
M Girvan
M Medo
MEJ Newman
MJ Pazzani
MS Shang
MS Shang
MS Shang
O Nov
P Kazienko
P Mika
P Resnick
P Resnick
P Wu
R Albert
R Lambiotte
S Boccaletti
S Brin
S Deerwester
SN Dorogovtsev
T Zhou
T Zhou
T Zhou
Tao Zhou
TG Kolda
V Zlatić
X Si
Y Ding
YC Zhang
Yi-Cheng Zhang
Z Huang
Zi-Ke Zhang
ZK Zhang
ZK Zhang
ZK Zhang
ZK Zhang
ZK Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/02/2012
Field of study

In the past decade, Social Tagging Systems have attracted increasing attention from both physical and computer science communities. Besides the underlying structure and dynamics of tagging systems, many efforts have been addressed to unify tagging information to reveal user behaviors and preferences, extract the latent semantic relations among items, make recommendations, and so on. Specifically, this article summarizes recent progress about tag-aware recommender systems, emphasizing on the contributions from three mainstream perspectives and approaches: network-based methods, tensor-based methods, and the topic-based methods. Finally, we outline some other tag-related works and future challenges of tag-aware recommendation algorithms.Comment: 19 pages, 3 figure

arXiv.org e-Print Archive

Crossref

RERO DOC Digital Library

A scalable mining of frequent quadratic concepts in d-folksonomies

Author: Jelassi Mohamed Nader
Nguifo Engelbert Mephu
Yahia Sadok Ben
Publication venue
Publication date: 01/01/2012
Field of study

Folksonomy mining is grasping the interest of web 2.0 community since it represents the core data of social resource sharing systems. However, a scrutiny of the related works interested in mining folksonomies unveils that the time stamp dimension has not been considered. For example, the wealthy number of works dedicated to mining tri-concepts from folksonomies did not take into account time dimension. In this paper, we will consider a folksonomy commonly composed of triples and we shall consider the time as a new dimension. We motivate our approach by highlighting the battery of potential applications. Then, we present the foundations for mining quadri-concepts, provide a formal definition of the problem and introduce a new efficient algorithm, called QUADRICONS for its solution to allow for mining folksonomies in time, i.e., d-folksonomies. We also introduce a new closure operator that splits the induced search space into equivalence classes whose smallest elements are the quadri-minimal generators. Carried out experiments on large-scale real-world datasets highlight good performances of our algorithm

arXiv.org e-Print Archive

HAL Clermont Université

Hal-Diderot

Hypergraph model of social tagging networks

Author: Blattner M
Cattuto C
Cattuto C
Chuang Liu
Dellschaft K Staab S
Halpin H Robu V Shepherd H
Karypis G Aggarwal R Kumar V Shekhar S
Palla G
Sen S Lam S K Rashid A M Cosley D Frankowski D Osterhouse J Harper F M Riedl J
Shang M-S
Zi-Ke Zhang
Publication venue: 'IOP Publishing'
Publication date: 09/03/2010
Field of study

The past few years have witnessed the great success of a new family of paradigms, so-called folksonomy, which allows users to freely associate tags to resources and efficiently manage them. In order to uncover the underlying structures and user behaviors in folksonomy, in this paper, we propose an evolutionary hypergrah model to explain the emerging statistical properties. The present model introduces a novel mechanism that one can not only assign tags to resources, but also retrieve resources via collaborative tags. We then compare the model with a real-world dataset: \emph{Del.icio.us}. Indeed, the present model shows considerable agreement with the empirical data in following aspects: power-law hyperdegree distributions, negtive correlation between clustering coefficients and hyperdegrees, and small average distances. Furthermore, the model indicates that most tagging behaviors are motivated by labeling tags to resources, and tags play a significant role in effectively retrieving interesting resources and making acquaintance with congenial friends. The proposed model may shed some light on the in-depth understanding of the structure and function of folksonomy.Comment: 7 pages,7 figures, 32 reference

arXiv.org e-Print Archive

Crossref

Tagging, Folksonomy & Co - Renaissance of Manual Indexing?

Author: Voss Jakob
Publication venue
Publication date: 01/01/2007
Field of study

This paper gives an overview of current trends in manual indexing on the Web. Along with a general rise of user generated content there are more and more tagging systems that allow users to annotate digital resources with tags (keywords) and share their annotations with other users. Tagging is frequently seen in contrast to traditional knowledge organization systems or as something completely new. This paper shows that tagging should better be seen as a popular form of manual indexing on the Web. Difference between controlled and free indexing blurs with sufficient feedback mechanisms. A revised typology of tagging systems is presented that includes different user roles and knowledge organization systems with hierarchical relationships and vocabulary control. A detailed bibliography of current research in collaborative tagging is included.Comment: Preprint. 12 pages, 1 figure, 54 reference

arXiv.org e-Print Archive

E-LIS

Bridging the gap between folksonomies and the semantic web: an experience report

Author: Angeletou Sofia
Motta Enrico
Sabou Marta
Specia Lucia
Publication venue
Publication date: 01/01/2007
Field of study

Abstract. While folksonomies allow tagging of similar resources with a variety of tags, their content retrieval mechanisms are severely hampered by being agnostic to the relations that exist between these tags. To overcome this limitation, several methods have been proposed to find groups of implicitly inter-related tags. We believe that content retrieval can be further improved by making the relations between tags explicit. In this paper we propose the semantic enrichment of folksonomy tags with explicit relations by harvesting the Semantic Web, i.e., dynamically selecting and combining relevant bits of knowledge from online ontologies. Our experimental results show that, while semantic enrichment needs to be aware of the particular characteristics of folksonomies and the Semantic Web, it is beneficial for both.

CiteSeerX

Open Research Online