20 research outputs found

    Modeling and Analysis of Scholar Mobility on Scientific Landscape

    Full text link
    Scientific literature till date can be thought of as a partially revealed landscape, where scholars continue to unveil hidden knowledge by exploring novel research topics. How do scholars explore the scientific landscape , i.e., choose research topics to work on? We propose an agent-based model of topic mobility behavior where scholars migrate across research topics on the space of science following different strategies, seeking different utilities. We use this model to study whether strategies widely used in current scientific community can provide a balance between individual scientific success and the efficiency and diversity of the whole academic society. Through extensive simulations, we provide insights into the roles of different strategies, such as choosing topics according to research potential or the popularity. Our model provides a conceptual framework and a computational approach to analyze scholars' behavior and its impact on scientific production. We also discuss how such an agent-based modeling approach can be integrated with big real-world scholarly data.Comment: To appear in BigScholar, WWW 201

    Enabling the Discovery of Digital Cultural Heritage Objects through Wikipedia

    Get PDF
    Over the past years large digital cultural heritage collections have become increasingly available. While these provide adequate search functionality for the expert user, this may not offer the best support for non-expert or novice users. In this paper we propose a novel mechanism for introducing new users to the items in a collection by allowing them to browse Wikipedia articles, which are augmented with items from the cultural heritage collection. Using Europeana as a case-study we demonstrate the effectiveness of our approach for encouraging users to spend longer exploring items in Europeana compared with the existing search provision

    A text mining and topic modelling perspective of ethnic marketing research

    Get PDF
    This study presents an enhanced automated approach based on literature analysis and synthesis for establishing the dimensions of the ethnic marketing literature, covering a set of 239 journal articles published by nine major publishers. The approach reported is enhanced by two novel procedures to address previously identified limitations, namely: definition of a relevant dictionary based on both a sufficient lexicon extracted from a definition of the core theme and a conditional dictionary, with related but non-core terms; and a visually appealing pictorial representation to summarize the discovered topics. The application of the method to ethnic marketing indicates that ethnic marketing research is characterized by high conceptual heterogeneity, although a clear definition of "ethnic marketing" is imperative for research development. Overall, the paper advances an approach with considerable scalability advantages when compared with extant approaches, an important issue to consider when textual sources become big data.- (undefined

    Revealing Hidden Community Structures and Identifying Bridges in Complex Networks: An Application to Analyzing Contents of Web Pages for Browsing

    Get PDF
    International audienceThe emergence of scale free and small world properties in real world complex networks has stimulated lots of activity in the field of network analysis. An example of such a network comes from the field of Content Analysis (CA) and Text Mining where the goal is to analyze the contents of a set of web pages. The Network can be represented by the words appearing in the web pages as nodes and the edges representing a relation between two words if they appear in a document together. In this paper we present a CA system that helps users analyze these networks representing the textual contents of a set of web pages visually. Major contributions include a methodology to cluster complex networks based on duplication of nodes and identification of bridges i.e. words that might be of user interest but have a low frequency in the document corpus. We have tested this system with a number of data sets and users have found it very useful for the exploration of data. One of the case studies is presented in detail which is based on browsing a collection of web pages on Wikipedia (http://en.wikipedia.org/wiki/Main_Page)

    Fracture Mechanics Method for Word Embedding Generation of Neural Probabilistic Linguistic Model

    Get PDF
    Word embedding, a lexical vector representation generated via the neural linguistic model (NLM), is empirically demonstrated to be appropriate for improvement of the performance of traditional language model. However, the supreme dimensionality that is inherent in NLM contributes to the problems of hyperparameters and long-time training in modeling. Here, we propose a force-directed method to improve such problems for simplifying the generation of word embedding. In this framework, each word is assumed as a point in the real world; thus it can approximately simulate the physical movement following certain mechanics. To simulate the variation of meaning in phrases, we use the fracture mechanics to do the formation and breakdown of meaning combined by a 2-gram word group. With the experiments on the natural linguistic tasks of part-of-speech tagging, named entity recognition and semantic role labeling, the result demonstrated that the 2-dimensional word embedding can rival the word embeddings generated by classic NLMs, in terms of accuracy, recall, and text visualization
    corecore