271 research outputs found
Personalizing and Improving Resource Recommendation by Analyzing Users Preferences in Social Tagging Activities
Collaborative tagging which is the keystone of the social practices of web 2.0 has been highly developed in the last few years. In this paper, we propose a new method to analyze user profiles according to their tagging activity in order to improve resource recommendation. We base upon association rules which is a powerful method to discover interesting relationships among large datasets on the web. Focusing on association rules we can find correlations between tags in a social network. Our aim is to recommend resources annotated with tags suggested by association rules, in order to enrich user profiles. The effectiveness of the recommendation depends on the resolution of social tagging drawbacks. In our recommender process, we demonstrate how we can reduce tag ambiguity and spelling variations problems by taking into account social similarities calculated on folksonomies, in order to personalize resource recommendation. We surmount also the lack of semantic links between tags during the recommendation process. Experiments are carried out with two different scenarios: the first one is a proof of concept over two baseline datasets and the second one is a real world application for diabetes disease
Dynamic Tagging for Enterprise Knowledge Sharing and Representation
The development of Web 2.0 technology provides an easy way for people to transfer and share knowledge. As a collaborative tagging tool, folksonomy is an efficient indexing method in Web 2.0 environments. After analyzing the pros and cons of current knowledge management systems, this research proposes a dynamic Tagging system that combines folksonomy technologies with other approaches including automatic schema enrichment and training. The proposed system improves access to a large, growing collection by supporting users collaboratively contribute to the building of tags. In addition, the proposed system provides an efficient way for firms to represent knowledge and share knowledge with customers and other firms
Recommended from our members
Reusing Ontologies to Enrich Semantically User Content in Web2.0: A Case Study on Folksonomies
Semantic Web and Web2.0 emerged during the past decade promising to achieve new frontiers for the Web. On the one hand, the Semantic Web is an interlinked web of data, supported by ontological semantics and allowing for intelligent applications such as semantic search and integration of heterogeneous content across systems and applications. On the other hand, Web2.0 represents the new technologies and paradigms that revolutionised the user engagement in content creation and introduced novel means towards social interaction. Bridging the gap between Web2.0 and the Semantic Web has been proposed as a means to better manage and interact with the large amounts of user contributed content, which is a new challenge for Web2.0. This thesis focuses on a popular paradigm of Web2.0, folksonomies. In particular, we investigate the semantic enrichment of folksonomy tagspaces by reusing ontologies available in the Semantic Web. We identify the need for methods that automatically apply semantic descriptions to user generated content without requiring user intervention or alteration of the current tagging paradigm. We use an iterative approach in order to identify the characteristics of folksonomies and the attributes of knowledge sources that influence the semantic enrichment of tagspaces. We build on the results of our experimental studies to implement a folksonomy enrichment algorithm, that given an input tagspace, automatically creates a semantic structure that describes the meaning and relations of tags. We introduce measures for the evaluation of enriched tagspaces and finally, we propose a search algorithm that exploits the semantic structures to improve folksonomy search
Recommended from our members
Classification design : understanding the decisions between theory and consequence
Classification systems are systems of terms and term relationships intended to sort and gather like concepts and documents. These systems are ubiquitous as the substrate of our interactions with library collections, retail websites, and bureaucracies. Through their design and impact, classification systems share with other technologies an unavoidable though often ignored relationship to politics, power, and authority (Fleischmann & Wallace, 2007). Despite concern among scholars that classification systems embody values and bias, there is little work examining how these qualities are built into a classification system. Specifically, we do not adequately understand classification construction, in which classification designers make decisions by applying classification theory to the specific context of a project (Park, 2008). If systems embody values— particularly values that might either cause harm (Berman, 1971) or provide an additional means of communicating the creator’s position (Feinberg, 2007)— we must understand how and when the system takes on these qualities. This dissertation bridges critical classification theory with design-oriented classification theory. Where critical classification theory is concerned with the outcomes of classification system design, design-oriented classification theory is concerned with the correct processes by which to build a classification system. To connect the consequences of classification system design to designers’ methods and intentions, I use the research lens of infrastructure studies, particularly infrastructural inversion (Star & Ruhleder, 1996) or making visible the work behind infrastructures such as classification systems. Accordingly, my research focuses on designers’ decisions and rethinks our assumptions regarding the factors that classification designers consider in making their design decisions. I adopted an ethnographic approach to the study of classification design that would make visible design decisions and designers’ consideration of factors. Using this approach, I studied the daily design work of volunteer classification designers who maintain a curated folksonomy. Using the grounded theory method (Strauss & Corbin, 1998), I analyzed the designers’ decisions. My analysis identified the implications of the designers’ convergences and divergences from established classification methods for the character of the system and for the connection between classification theory and classification methods. I show how the factors—and the prioritization of factors—that these designers considered in making their decisions were consistent with the values and needs of the community. Therefore, I argue that classification designers have an important role in creating the values or bias of a classification system. In particular, designers’ divergence from universal guidelines and designers’ choices among sources of evidence represent opportunities to align a classification system to its community. I recommend that classification research focus on such instances of divergence and choice to understand the connection between classification design and the values of classification systems. The Introduction motivates the problem space around values in classification systems and outlines my approach in focusing on classification design. The Literature Review outlines the dominant theories in classification scholarship according to three elements of classification design: what decisions designers make, what information designers use in their decisions, and what skills designers apply to their decisions. In the Methods chapter, I introduce the site of my ethnographic research (The Fanwork Repository), detail my ethnographic methods, summarize the types of data I collected, and describe my grounded analysis. Three findings chapters examine one type of complex decision each: Names, Works, and Guidelines, respectively. In the fourth findings chapter, Synthesis, I define 10 factors designers considered across these complex design decisions. I then discuss how the factors figured into complex design decisions, how the factors overlapped and conflicted in design decisions, and how designers understood their role in making complex design decisions. In the Discussion chapter I connect the findings from the site of my ethnography to classification scholarship. In the Conclusion, I consider the contribution of examining classification systems as infrastructure, highlight the differences in accounts of classification design decisions made visible through classification theory and infrastructure studies approaches, and present suggestions for future research in classification design and the study of classification systems as infrastructure.Informatio
Using Data Mining for Facilitating User Contributions in the Social Semantic Web
This thesis utilizes recommender systems to aid the user in contributing to the Social Semantic Web. In this work, we propose a framework that maps domain properties to recommendation technologies. Next, we develop novel recommendation algorithms for improving personalized tag recommendation and for recommendation of semantic relations. Finally, we introduce a framework to analyze different types of potential attacks against social tagging systems and evaluate their impact on those systems
A Generic architecture for semantic enhanced tagging systems
The Social Web, or Web 2.0, has recently gained popularity because of its low cost and ease of use. Social tagging sites (e.g. Flickr and YouTube) offer new principles for end-users to publish and classify their content (data). Tagging systems contain free-keywords (tags) generated by end-users to annotate and categorise data. Lack of semantics is the main drawback in social tagging due to the use of unstructured vocabulary. Therefore, tagging systems suffer from shortcomings such as low precision, lack of collocation, synonymy, multilinguality, and use of shorthands. Consequently, relevant contents are not visible, and thus not retrievable while searching in tag-based systems.
On the other hand, the Semantic Web, so-called Web 3.0, provides a rich semantic infrastructure. Ontologies are the key enabling technology for the Semantic Web. Ontologies can be integrated with the Social Web to overcome the lack of semantics in tagging systems.
In the work presented in this thesis, we build an architecture to address a number of tagging systems drawbacks. In particular, we make use of the controlled vocabularies presented by ontologies to improve the information retrieval in tag-based systems. Based on the tags provided by the end-users, we introduce the idea of adding “system tags” from semantic, as well as social, resources. The “system tags” are comprehensive and wide-ranging in comparison with the limited “user tags”. The system tags are used to fill the gap between the user tags and the search terms used for searching in the tag-based systems. We restricted the scope of our work to tackle the following tagging systems shortcomings:
- The lack of semantic relations between user tags and search terms (e.g. synonymy, hypernymy),
- The lack of translation mediums between user tags and search terms (multilinguality),
- The lack of context to define the emergent shorthand writing user tags.
To address the first shortcoming, we use the WordNet ontology as a semantic lingual resource from where system tags are extracted. For the second shortcoming, we use the MultiWordNet ontology to recognise the cross-languages linkages between different languages. Finally, to address the third shortcoming, we use tag clusters that are obtained from the Social Web to create a context for defining the meaning of shorthand writing tags.
A prototype for our architecture was implemented. In the prototype system, we built our own database to host videos that we imported from real tag-based system (YouTube). The user tags associated with these videos were also imported and stored in the database. For each user tag, our algorithm adds a number of system tags that came from either semantic ontologies (WordNet or MultiWordNet), or from tag clusters that are imported from the Flickr website. Therefore, each system tag added to annotate the imported videos has a relationship with one of the user tags on that video. The relationship might be one of the following: synonymy, hypernymy, similar term, related term, translation, or clustering relation.
To evaluate the suitability of our proposed system tags, we developed an online environment where participants submit search terms and retrieve two groups of videos to be evaluated. Each group is produced from one distinct type of tags; user tags or system tags. The videos in the two groups are produced from the same database and are evaluated by the same participants in order to have a consistent and reliable evaluation. Since the user tags are used nowadays for searching the real tag-based systems, we consider its efficiency as a criterion (reference) to which we compare the efficiency of the new system tags.
In order to compare the relevancy between the search terms and each group of retrieved videos, we carried out a statistical approach. According to Wilcoxon Signed-Rank test, there was no significant difference between using either system tags or user tags. The findings revealed that the use of the system tags in the search is as efficient as the use of the user tags; both types of tags produce different results, but at the same level of relevance to the submitted search terms
A Generic architecture for semantic enhanced tagging systems
The Social Web, or Web 2.0, has recently gained popularity because of its low cost and ease of use. Social tagging sites (e.g. Flickr and YouTube) offer new principles for end-users to publish and classify their content (data). Tagging systems contain free-keywords (tags) generated by end-users to annotate and categorise data. Lack of semantics is the main drawback in social tagging due to the use of unstructured vocabulary. Therefore, tagging systems suffer from shortcomings such as low precision, lack of collocation, synonymy, multilinguality, and use of shorthands. Consequently, relevant contents are not visible, and thus not retrievable while searching in tag-based systems.
On the other hand, the Semantic Web, so-called Web 3.0, provides a rich semantic infrastructure. Ontologies are the key enabling technology for the Semantic Web. Ontologies can be integrated with the Social Web to overcome the lack of semantics in tagging systems.
In the work presented in this thesis, we build an architecture to address a number of tagging systems drawbacks. In particular, we make use of the controlled vocabularies presented by ontologies to improve the information retrieval in tag-based systems. Based on the tags provided by the end-users, we introduce the idea of adding “system tags” from semantic, as well as social, resources. The “system tags” are comprehensive and wide-ranging in comparison with the limited “user tags”. The system tags are used to fill the gap between the user tags and the search terms used for searching in the tag-based systems. We restricted the scope of our work to tackle the following tagging systems shortcomings:
- The lack of semantic relations between user tags and search terms (e.g. synonymy, hypernymy),
- The lack of translation mediums between user tags and search terms (multilinguality),
- The lack of context to define the emergent shorthand writing user tags.
To address the first shortcoming, we use the WordNet ontology as a semantic lingual resource from where system tags are extracted. For the second shortcoming, we use the MultiWordNet ontology to recognise the cross-languages linkages between different languages. Finally, to address the third shortcoming, we use tag clusters that are obtained from the Social Web to create a context for defining the meaning of shorthand writing tags.
A prototype for our architecture was implemented. In the prototype system, we built our own database to host videos that we imported from real tag-based system (YouTube). The user tags associated with these videos were also imported and stored in the database. For each user tag, our algorithm adds a number of system tags that came from either semantic ontologies (WordNet or MultiWordNet), or from tag clusters that are imported from the Flickr website. Therefore, each system tag added to annotate the imported videos has a relationship with one of the user tags on that video. The relationship might be one of the following: synonymy, hypernymy, similar term, related term, translation, or clustering relation.
To evaluate the suitability of our proposed system tags, we developed an online environment where participants submit search terms and retrieve two groups of videos to be evaluated. Each group is produced from one distinct type of tags; user tags or system tags. The videos in the two groups are produced from the same database and are evaluated by the same participants in order to have a consistent and reliable evaluation. Since the user tags are used nowadays for searching the real tag-based systems, we consider its efficiency as a criterion (reference) to which we compare the efficiency of the new system tags.
In order to compare the relevancy between the search terms and each group of retrieved videos, we carried out a statistical approach. According to Wilcoxon Signed-Rank test, there was no significant difference between using either system tags or user tags. The findings revealed that the use of the system tags in the search is as efficient as the use of the user tags; both types of tags produce different results, but at the same level of relevance to the submitted search terms
- …