194 research outputs found

Analyzing Tag Semantics Across Collaborative Tagging Systems

Author: Benz Dominik
Grobelnik Marko
Hotho Andreas
Jaschke Robert
Mladenic Dunja
Servedio Vito D. P.
Sizov Sergej
Szomszor Martin
Publication date: 01/01/2008

The objective of our group was to exploit state-of-the-art Information Retrieval methods for finding associations and dependencies between tags, and for capturing and representing differences in the tagging behavior and vocabulary of various folksonomies, with the overall aim of better understanding the semantics of tags and the tagging process. To this end, we analyze the semantic content of tags in the Flickr and Delicious folksonomies. We find that: tag context similarity leads to meaningful results in Flickr, despite its narrow folksonomy character; the comparison of tags across Flickr and Delicious shows little semantic overlap, with tags in Flickr associated more with visual aspects than with technological ones, as seems to be the case in Delicious; there are regions in the tag-tag space, equipped with the cosine similarity metric, that are characterized by high density; the order of tags inside a post has semantic relevance.
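As an illustration of the tag context similarity with a cosine metric mentioned in this abstract, here is a minimal sketch. It assumes that a tag's context is the count of other tags co-occurring with it in the same post; the abstract does not specify the exact construction, so this is not the authors' implementation. The function names and the toy posts are hypothetical.

```python
from collections import Counter, defaultdict
from math import sqrt


def context_vectors(posts):
    """Map each tag to a Counter of the tags it co-occurs with (assumed context definition)."""
    vectors = defaultdict(Counter)
    for tags in posts:
        unique = set(tags)  # count each tag at most once per post
        for tag in unique:
            for other in unique:
                if other != tag:
                    vectors[tag][other] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy posts, each a list of tags.
posts = [
    ["sunset", "beach", "sea"],
    ["sunrise", "beach", "sea"],
    ["python", "code"],
]
vectors = context_vectors(posts)
similarity_close = cosine(vectors["sunset"], vectors["sunrise"])
similarity_far = cosine(vectors["sunset"], vectors["python"])
```

In this toy data, "sunset" and "sunrise" never appear in the same post but share the same context tags, so their context similarity is high, while "sunset" and "python" share no context.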

Southampton (e-Prints Soton)

Posted, Visited, Exported: Altmetrics in the Social Tagging System BibSonomy

Author: Doerfel S.
Hotho A.
Jäschke R.
Stumme G.
Zoller D.
Publication venue: 'Elsevier BV'
Publication date: 11/06/2016

In social tagging systems, like Mendeley, CiteULike, and BibSonomy, users can post, tag, visit, or export scholarly publications. In this paper, we compare citations with metrics derived from users’ activities (altmetrics) in the popular social bookmarking system BibSonomy. Our analysis, using a corpus of more than 250,000 publications published before 2010, reveals that overall, citations and altmetrics in BibSonomy are mildly correlated. Furthermore, grouping publications by user-generated tags results in topic-homogeneous subsets that exhibit higher correlations with citations than the full corpus. We find that posts, exports, and visits of publications are correlated with citations and even bear predictive power over future impact. Machine learning classifiers predict whether the number of citations that a publication receives in a year exceeds the median number of citations in that year, based on the usage counts of the preceding year. In that setup, a Random Forest predictor outperforms the baseline on average by seven percentage points

Crossref

White Rose Research Online

Folksonomies and clustering in the collaborative system CiteULike

Author: Andrea Capocci
Berendt B
Caldarelli G
Cattuto C Loreto V
Ferrer i Cancho R
Guido Caldarelli
Heyman P Garcia-Molina H
Hotho A
Lambiotte R
Santos-Neto E Ripeanu M Iamnitchi A
Schmitz C Grahl M Hotho A Stumme G Cattuto C Baldassarri A Loreto V Servedio V D P
Simon H A
Zipf G K
Publication venue: 'IOP Publishing'
Publication date: 16/10/2007

We analyze CiteULike, an online collaborative tagging system where users bookmark and annotate scientific papers. Such a system can be naturally represented as a tripartite graph whose nodes represent papers, users and tags connected by individual tag assignments. The semantics of tags is studied here in order to uncover the hidden relationships between tags. We find that the clustering coefficient reflects the semantic patterns among tags, providing useful ideas for designing more efficient methods of data classification and spam detection. Comment: 9 pages, 5 figures, iop style; corrected typo
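The tripartite representation and the clustering coefficient described in this abstract can be sketched as follows. This sketch assumes that tag assignments are (user, paper, tag) triples and that two tags are linked when they were assigned to the same paper; the abstract does not state which projection or which clustering coefficient definition the paper uses, so the standard local clustering coefficient is used here. All names and data are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def tag_graph(assignments):
    """Project (user, paper, tag) triples onto a tag-tag graph (assumed projection)."""
    tags_by_paper = defaultdict(set)
    for _user, paper, tag in assignments:
        tags_by_paper[paper].add(tag)
    neighbours = defaultdict(set)
    for tags in tags_by_paper.values():
        # Link every pair of tags attached to the same paper.
        for a, b in combinations(sorted(tags), 2):
            neighbours[a].add(b)
            neighbours[b].add(a)
    return neighbours


def clustering_coefficient(neighbours, node):
    """Fraction of a node's neighbour pairs that are themselves linked."""
    nbrs = neighbours.get(node, set())
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(1 for a, b in combinations(sorted(nbrs), 2) if b in neighbours[a])
    return 2.0 * links / (k * (k - 1))


# Hypothetical tag assignments: (user, paper, tag).
assignments = [
    ("u1", "p1", "network"),
    ("u1", "p1", "graph"),
    ("u2", "p1", "clustering"),
    ("u2", "p2", "network"),
    ("u3", "p2", "biology"),
]
graph = tag_graph(assignments)
cc_network = clustering_coefficient(graph, "network")
cc_graph = clustering_coefficient(graph, "graph")
cc_biology = clustering_coefficient(graph, "biology")
```

Here "graph" sits inside a fully connected group of tags and gets the maximum coefficient, while "network" also bridges to the unrelated tag "biology" and scores lower.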

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari

Archivio della ricerca della Scuola IMT Alti Studi Lucca

IMT Institutional Repository

Participatory Patterns in an International Air Quality Monitoring Initiative

Author: Becker Martin
Bossche Joris Van den
Caminiti Saverio
De Baets Bernard
Elen Bart
Francis Louise
Gravino Pietro
Hotho Andreas
Ingarra Stefano
Loreto Vittorio
Molino Andrea
Mueller Juergen
Peters Jan
Ricchiuti Ferdinando
Saracino Fabio
Servedio Vito D. P.
Stumme Gerd
Sîrbu Alina
Theunis Jan
Tria Francesca
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2015

The issue of sustainability is at the top of the political and societal agenda, being considered of extreme importance and urgency. Human individual action impacts the environment both locally (e.g., local air/water quality, noise disturbance) and globally (e.g., climate change, resource use). Urban environments represent a crucial example, with an increasing realization that the most effective way of producing a change is involving the citizens themselves in monitoring campaigns (a citizen science bottom-up approach). This is possible by developing novel technologies and IT infrastructures enabling large citizen participation. Here, in the wider framework of one of the first such projects, we show results from an international competition where citizens were involved in mobile air pollution monitoring using low cost sensing devices, combined with a web-based game to monitor perceived levels of pollution. Measures of shift in perceptions over the course of the campaign are provided, together with insights into participatory patterns emerging from this study. Interesting effects related to inertia and to direct involvement in measurement activities rather than indirect information exposure are also highlighted, indicating that direct involvement can enhance learning and environmental awareness. In the future, this could result in better adoption of policies towards decreasing pollution. Comment: 17 pages, 6 figures, 1 supplementary fil

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Ghent University Academic Bibliography

UCL Discovery

PubMed Central

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Institutionelles Repositorium der Leibniz Universität Hannover

Online-Publikations-Server der Universität Würzburg

Archivio della ricerca - Università di Roma La Sapienza

FigShare

TaxoFolk: a hybrid taxonomy–folksonomy classification for enhanced knowledge navigation

Author: Ching-Chieh Kiu
Dunn G
Eric Tsui
Fichter D
Ganter B
Hotho A
Kiu CC
Mitchell RL
Stock WG
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Using Semantic Technologies in Digital Libraries - A Roadmap to Quality Evaluation

Author: A. Hotho
D. Vrandečić
H. Halpin
J. Diederich
K. Bischoff
K. Razikin
M. Sanderson
N. Fuhr
P. Cimiano
R. Krestel
S. Bao
S. Chan
S.R. Kruk
Publication date: 01/01/2009

In digital libraries, semantic techniques are often deployed to reduce the expensive manual overhead for indexing documents, maintaining metadata, or caching for future search. However, using such techniques may cause a decrease in a collection’s quality due to their statistical nature. Since data quality is a major concern in digital libraries, it is important to be able to measure the (loss of) quality of metadata automatically generated by semantic techniques. In this paper we present a user study based on a typical semantic technique use

CiteSeerX

LEKYTHOS

Crossref

Erratum to: Participatory Sensing, Opinions and Collective Awareness

Author: Haklay Muki
Hotho Andreas
Loreto Vittorio
Servedio Vito D. P.
Stumme Gerd
Theunis Jan
Tria Francesca
Publication date: 08/08/2016

Crossref

Open Access Repository

Evaluation of ontology enhancement tools

Author: A. Hotho
C. Holsapple
D. Faure
M. Kavalec
M. Schaal
M. Vazirgiannis
P. Haase
S. Dill
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006

Mining algorithms can enhance the task of ontology establishment but methods are needed to assess the quality of their findings. Ontology establishment is a long-term interactive process, so it is important to evaluate the contribution of a mining tool at an early phase of this process so that only appropriate tools are used in later phases. We propose a method for the evaluation of such tools on their impact on ontology enhancement. We model impact as quality perceived by the expert and as statistical quality computed by an objective function. We further provide a mechanism that juxtaposes the two forms of quality. We have applied our method on an ontology enhancement tool and gained some interesting insights on the interplay between perceived impact and statistical quality. © 2006 Springer-Verlag

Crossref

Bilkent University Institutional Repository

Niche as a determinant of word fate in online groups

Author: A Baronchelli
A Dijksterhuis
A Hotho
Adilson E. Motter
C Cattuto
C Eble
CD Manning
D Crystal
D Fisher
D Jablonski
D Nettle
D Sornette
D Watts
DJ Hruschka
DM Abrams
DW Nickerson
E Lieberman
Eduardo G. Altmann
EG Altmann
EM Rogers
Enrico Scalas
EV Clark
G Hardin
G Lupyan
G Smitherman
G Szabo
HP Grice
I Trestian
J Kleinberg
J Munat
J-B Michel
J-P Onnela
Janet B. Pierrehumbert
JF Fontanari
K Kuiper
K Lerman
KW Church
L Milroy
L Steels
M Foote
M Pagel
M Seshadri
MA Serrano
MC González
MH Davis
ML Salganik
NL Komarova
P Chesley
P Eckert
P Wexler
Q Lu
R Crane
R Schifanella
R Torres Cacoullos
RA Blythe
RD Malmgren
RK Colwell
RV Solé
S Fortunato
S Kirby
S Wasserman
S Wichmann
W Kruskal
W Labov
Y Neuman
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011

Patterns of word use both reflect and influence a myriad of human activities and interactions. Like other entities that are reproduced and evolve, words rise or decline depending upon a complex interplay between their intrinsic properties and the environments in which they function. Using Internet discussion communities as model systems, we define the concept of a word niche as the relationship between the word and the characteristic features of the environments in which it is used. We develop a method to quantify two important aspects of the size of the word niche: the range of individuals using the word and the range of topics it is used to discuss. Controlling for word frequency, we show that these aspects of the word niche are strong determinants of changes in word frequency. Previous studies have already indicated that word frequency itself is a correlate of word success at historical time scales. Our analysis of changes in word frequencies over time reveals that the relative sizes of word niches are far more important than word frequencies in the dynamics of the entire vocabulary at shorter time scales, as the language adapts to new concepts and social groupings. We also distinguish endogenous versus exogenous factors as additional contributors to the fates of words, and demonstrate the force of this distinction in the rise of novel words. Our results indicate that short-term nonstationarity in word statistics is strongly driven by individual proclivities, including inclinations to provide novel information and to project a distinctive social identity. Comment: Supporting Information is available here: http://www.plosone.org/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pone.0019009.s00

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Semantic contextualisation of social tag-based profiles and item recommendations

Author: A. Hotho
A. Shepitsen
B. Markines
C.M. Au Yeung
D. Vallet
G. Adomavicius
I. Cantador
J. Gemmell
K.Q. Weinberger
L. Specia
M.E.J. Newman
M.G. Noll
S. Angeletou
S. Niwa
S. Sen
S. Xu
S.A. Golder
T. Bogers
V. Zanardi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011

Proceedings of the 12th International Conference, EC-Web 2011, Toulouse, France, August 30 - September 1, 2011. The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-23014-1_9 We present an approach that efficiently identifies the semantic meanings and contexts of social tags within a particular folksonomy, and exploits them to build contextualised tag-based user and item profiles. We apply our approach to a dataset obtained from the Delicious social bookmarking system, and evaluate it through two experiments: a user study consisting of manual judgements of tag disambiguation and contextualisation cases, and an offline study measuring the performance of several tag-powered item recommendation algorithms by using contextualised profiles. The results obtained show that our approach is able to accurately determine the actual semantic meanings and contexts of tag annotations, and allows item recommenders to achieve better precision and recall on their predictions. This work was supported by the Spanish Ministry of Science and Innovation (TIN2008-06566-C04-02), and the Community of Madrid (CCG10-UAM/TIC-5877

CiteSeerX

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Biblos-e Archivo