11 research outputs found

    Discoveries of Contextualized Research Areas for Scientific Community

    Get PDF
    Abstract: In scientific community searching the relevant paper is a time critical task and often leads to misdirection. This is mainly due to the unstructured or semi-structured information indexed by search engines. However, with the evolution of social web (Web 2.0), the social community is being engaged in a system where they provide useful annotations (termed as tags) to the electronic resources. The current research makes use of social web to discover the most relevant papers from the Web. In our previous work, we compared author's keywords in a publication with the tags provided by the social community for serendipitous discoveries of related concepts and papers. However, in the current work, we focused to find a hierarchy of concepts related to focused publication. This hierarchy is discovered from Wordnet ontology, producing the convergent and divergent concepts for the author's keywords in a publication. These extended sets of keywords are matched with tags in social Web to find related concepts. In this way, the system is able to not only find the concepts in social Web that are directly related to the topics of the paper, instead, the system also discover the divergent and convergent concepts related to the topic of the paper. In this way, the users will be able to explore a topic hierarchy related to the paper's topics. The user can follow a concept to see the associated resources that have been annotated by the social community. Introduction Since the advent of the Word Wide Web, we see the information flux has been extending at the phenomenal speed. The huge amount of information at the WWW is being indexed by the search engines like Google, Yahoo etc The focus of current research is in utilizing rich metadata available in terms of tags in social bookmarking Aust. J. Basic & Appl. Sci., 5(6): 1641-1647, 2011 1642 systems for serendipitous discoveries of related research areas and papers in scientific domain. For example when a user is reading a particular research paper, the user is interested to find related papers and emerging fields in the context of the current paper. This is applicable to different types of users such as a user who wants to get new research directions for the focused area, a user who wants to see the community trends for the focused research area, and a user who is interested to discover most relevant recent papers in the focused domain. In our previous work, we have recommended newly evolving concepts, trends, and papers for the topic of the paper being read by the user (Afzal, 2010) using CiteULike tagging system. For the discovery of relevant and evolving concepts from CiteULIke, we utilized tags and compared them (directly and partially) with the keywords of the focused paper. The discovery was very useful, however, it lacks in recommending the hierarchy of the topics, for example converged and diverged concepts (topics) for the focused topics were not recommended. For example if an author' keyword is "Wiki", the system was able to find the evolving concepts from CiteULike such as "semantic wikis", "wikification", "Wikipedia" etc. However, the system was unable to find that the concept wiki is converging from the concepts (such as blogs, knowledge management, Wikipedia etc), and diverging to concepts (such as social networking sites, collaborative content etc). To give a broader picture of the related and emerging research areas for the focused research area, we have proposed a framework to utilize the tagging information to enhance the semantic context of a paper by extending the keyword dataset of research publication using the senses of WordNet ontology (Christiane Fellbaum, 1998). The convergent and divergent concepts for a concept defined in a paper (in terms of a keyword) are retrieved from Wordnet. Subsequently, the extended set of concept is mapped on the tags available in tagging application. The framework focus on indentifying all the convergent and divergent concepts of vocabulary for the keywords data set of the papers. The extended dataset of keywords have further been matched directly and partially with CiteULike tags for serendipitous discoveries of related concepts. The prototype has been implemented for an online journal such as Journal of Universal Computer Science. All of the journal's papers have been acquired along with the associated keywords. The keyword dataset has been extended using Wordnet ontology to find convergent and divergent concepts, subsequently; the extended set of keywords is searched within CiteULike dataset. The discovered concepts are pushed to users's local context. For example a user is reading a research paper in journal's environment; the user is able to see how the research topics of the focused paper are evolving and what are other related concepts from where the research topics of the focused paper have evolved. This broad picture helps users to set their directions. Related Work: This section provides a brief literature review about the related areas of the research. The paper focus on utilizing social bookmarking systems, enhance the system discoveries using Wordnet, embed the developed system in "Links into the Future" system. The following text explains the above mentioned areas briefly. Social Bookmarking: As the social bookmarking services have build up a parallel and influencing social based community with huge amount of data extending at phenomenal speed. There are some popular social bookmarking applications running across the web. CiteUlike is one of the bookmarking application which offers the services of sharing and discovering the information regarding the academics and research papers. Bibsonomy, another socialbookmarking application which offers its users in sharing bookmarks and publication references. Similarly, Delicious allows its users to bookmark/tag the web resources , its storage,sharing and viewing the bookmarks of other people. The collaborative tags have been used in proposing the recommendations for the scientific papers Similarly, the importance of tagging based systems have been realized in the knowledge discovery domain, where discovering the relevant information for contents stored at digital journal using the other alternative sources of social bookmarking data repositories. The potentials of Tags have also been exploited as a secondary measure for the knowledge diffusion, The results showed that there exist positive correlation between tags and citations and tags have potential to be used as one of the factor for the knowledge diffusion. Aust. J. Basic & Appl. Sci., 5(6): 1641-1647, 2011 1643 To prove the experiment, three datasets from the popular social bookmarking applications (i.e CitUlike, Bipsonomy, Delicious) were used. The above findings in the academic domain provide an opportunity to exploit and identify the potential advantages of the socially bookmarked data for extending the research scholar's context when he is going to read the scientific manuscript. As keyword tag is an important element in the taxonomy of a research paper and helps the reader in broadening his dimensions about the domain of the paper. The purpose of this research work is to broaden the user's context using the collaborative tagging. Wordnet: Wordnet is a semantically rich database which comprises of language semantics like noun,adjective,verb,adverb etc (Christiane Fellbaum, 1998). It include the hypernym and hyponym relations which allow in associating the concepts semantically. The synset (Hypernyms and hyponyms) offered by the WordNet has been used in enriching the bookmark collections, correlation among tags on the basis of semantics, semantics relatedness of searching query terms, etc In the proposed methodology is to extend the research publication's keyword dataset using the semantic based WordNet ontology (especially with the help of semantic relations offered by Hypernym and hyponym). This is done through adding the two data sets of concept vocabulary, which will be populated on the basis of keywords dataset of the research publication. One dataset is representing the 'Convergent to' means generalized concept list and will be built on the basis of Hypernyms for the paper's keyword using Word Net. Similarly the other table is recognized as 'Divergent from' means specialized concept list and be generated on the basis of Hyponyms of WordNet. Proposed Architecture: The proposed architecture of broadening the overall user's context on the basis of semantically extending the paper's keyword dataset is shown in Content Discovery Component: This component is responsible of selecting all the papers from the digital library. In this case we have selected only JUCS dataset. These papers are in the PDF format. So first these papers are converted into the text stream and then sent to the keyword selection module

    Which Hashtag to use? Building a Hashtag recommender system and understanding the textual features surrounding Hashtags

    Get PDF
    Hashtags are community-based tags on twitter that are used to annotate tweets and make them findable. To make a user's participation on the social platform more relevant, recommending a hashtag would help a user participate better. This study is an attempt to build a recommender system for hashtag recommendation, and to further study the textual features around hashtags, which assist in their retrieval. The suggested system performs better for tweets with longer text; those with a URL, with multiple hashtags and those that have user mentions.Master of Science in Information Scienc

    A probabilistic approach to personalized tag recommendation

    Get PDF

    Modeling User Expertise in Folksonomies by Fusing Multi-type Features

    Get PDF
    Abstract. The folksonomy refers to the online collaborative tagging system which offers a new open platform for content annotation with uncontrolled vocabulary. As folksonomies are gaining in popularity, the expert search and spammer detection in folksonomies attract more and more attention. However, most of previous work are limited on some folksonomy features. In this paper, we introduce a generic and flexible user expertise model for expert search and spammer detection. We first investigate a comprehensive set of expertise evidences related to users, objects and tags in folksonomies. Then we discuss the rich interactions between them and propose a unified Continuous CRF model to integrate these features and interactions. This model's applications for expert recommendation and spammer detection are also exploited. Extensive experiments are conducted on a real tagging dataset and demonstrate the model's advantages over previous methods, both in performance and coverage

    Reverse k-Ranks Queries on Large Graphs

    Get PDF
    published_or_final_versio

    Infer user interests via link structure regularization

    Get PDF
    Learning user interests from online social networks helps to better understand user behaviors and provides useful guidance to design user-centric applications. Apart from analyzing users' online content, it is also important to consider users' social connections in the social Web. Graph regularization methods have been widely used in various text mining tasks, which can leverage the graph structure information extracted from data. Previously, graph regularization methods operate under the cluster assumption that nearby nodes are more similar and nodes on the same structure (typically referred to as a cluster or a manifold) are likely to be similar. We argue that learning user interests from complex, sparse, and dynamic social networks should be based on the link structure assumption under which node similarities are evaluated based on the local link structures instead of explicit links between two nodes. We propose a regularization framework based on the relation bipartite graph, which can be constructed from any type of relations. Using Twitter as our case study, we evaluate our proposed framework from social networks built from retweet relations. Both quantitative and qualitative experiments show that our proposed method outperforms a few competitive baselines in learning user interests over a set of predefined topics. It also gives superior results compared to the baselines on retweet prediction and topical authority identification

    Personalized tag recommendation using graph-based ranking on multi-type interrelated objects

    No full text

    Document recommendation in social tagging services

    Full text link
    Social tagging services allow users to annotate various on-line resources with freely chosen keywords (tags). They not only facilitate the users in finding and organizing online re-sources, but also provide meaningful collaborative semantic data which can potentially be exploited by recommender systems. Traditional studies on recommender systems fo-cused on user rating data, while recently social tagging data is becoming more and more prevalent. How to perform re-source recommendation based on tagging data is an emerg-ing research topic. In this paper we consider the problem of document (e.g. Web pages, research papers) recommen-dation using purely tagging data. That is, we only have data containing users, tags, documents and the relation-ships among them. We propose a novel graph-based rep-resentation learning algorithm for this purpose. The users, tags and documents are represented in the same semantic space in which two related objects are close to each other. For a given user, we recommend those documents that are sufficiently close to him/her. Experimental results on two data sets crawled from Del.icio.us and CiteULike show that our algorithm can generate promising recommendations and outperforms traditional recommendation algorithms

    Pertanika Journal of Science & Technology

    Get PDF
    corecore