Search CORE

20 research outputs found

Modeling and Analysis of Scholar Mobility on Scientific Landscape

Author: Chiu Dah Ming
Venkatramanan Srinivasan
Ying Qiu Fang
Publication venue
Publication date: 10/03/2015
Field of study

Scientific literature till date can be thought of as a partially revealed landscape, where scholars continue to unveil hidden knowledge by exploring novel research topics. How do scholars explore the scientific landscape , i.e., choose research topics to work on? We propose an agent-based model of topic mobility behavior where scholars migrate across research topics on the space of science following different strategies, seeking different utilities. We use this model to study whether strategies widely used in current scientific community can provide a balance between individual scientific success and the efficiency and diversity of the whole academic society. Through extensive simulations, we provide insights into the roles of different strategies, such as choosing topics according to research potential or the popularity. Our model provides a conceptual framework and a computational approach to analyze scholars' behavior and its impact on scientific production. We also discuss how such an agent-based modeling approach can be integrated with big real-world scholarly data.Comment: To appear in BigScholar, WWW 201

arXiv.org e-Print Archive

Crossref

Enabling the Discovery of Digital Cultural Heritage Objects through Wikipedia

Author: Agirre Eneko
Clough Paul
Hall Mark Michael
Lopez de Lacalle Oier
Soroa Aitor
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2012
Field of study

Over the past years large digital cultural heritage collections have become increasingly available. While these provide adequate search functionality for the expert user, this may not offer the best support for non-expert or novice users. In this paper we propose a novel mechanism for introducing new users to the items in a collection by allowing them to browse Wikipedia articles, which are augmented with items from the cultural heritage collection. Using Europeana as a case-study we demonstrate the effectiveness of our approach for encouraging users to spend longer exploring items in Europeana compared with the existing search provision

CiteSeerX

Open Research Online (The Open University)

Edge Hill University Research Information Repository

Bisociative Exploration of Biological and Financial Literature Using Clustering

Author: Igor Mozetič
Janez Kranjc
Oliver Schmidt
Paul Thompson
Werner Dubitzky
Publication venue: Springer Berlin Heidelberg
Publication date: 01/01/2012
Field of study

Springer - Publisher Connector

Bisociative exploration of biological and financial literature with using clustering

Author: Dubitzky Werner
Kranjc Janez
Mozetic Igor
Schmidt Oliver
Thompson Paul
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2012
Field of study

Ulster University's Research Portal

A text mining and topic modelling perspective of ethnic marketing research

Author: Cortez Paulo
Moro Sérgio
Pires Guilherme
Rita Paulo
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

This study presents an enhanced automated approach based on literature analysis and synthesis for establishing the dimensions of the ethnic marketing literature, covering a set of 239 journal articles published by nine major publishers. The approach reported is enhanced by two novel procedures to address previously identified limitations, namely: definition of a relevant dictionary based on both a sufficient lexicon extracted from a definition of the core theme and a conditional dictionary, with related but non-core terms; and a visually appealing pictorial representation to summarize the discovered topics. The application of the method to ethnic marketing indicates that ethnic marketing research is characterized by high conceptual heterogeneity, although a clear definition of "ethnic marketing" is imperative for research development. Overall, the paper advances an approach with considerable scalability advantages when compared with extant approaches, an important issue to consider when textual sources become big data.- (undefined

Universidade do Minho: RepositoriUM

Repositório Institucional do ISCTE-IUL

Revealing Hidden Community Structures and Identifying Bridges in Complex Networks: An Application to Analyzing Contents of Web Pages for Browsing

Author: Melançon Guy
Sallaberry Arnaud
Zaidi Faraz
Publication venue: HAL CCSD
Publication date: 01/01/2009
Field of study

International audienceThe emergence of scale free and small world properties in real world complex networks has stimulated lots of activity in the field of network analysis. An example of such a network comes from the field of Content Analysis (CA) and Text Mining where the goal is to analyze the contents of a set of web pages. The Network can be represented by the words appearing in the web pages as nodes and the edges representing a relation between two words if they appear in a document together. In this paper we present a CA system that helps users analyze these networks representing the textual contents of a set of web pages visually. Major contributions include a methodology to cluster complex networks based on duplication of nodes and identification of bridges i.e. words that might be of user interest but have a low frequency in the document corpus. We have tested this system with a number of data sets and users have found it very useful for the exploration of data. One of the case studies is presented in detail which is based on browsing a collection of web pages on Wikipedia (http://en.wikipedia.org/wiki/Main_Page)

INRIA a CCSD electronic archive server

Fracture Mechanics Method for Word Embedding Generation of Neural Probabilistic Linguistic Model

Author: Size Bi
Ting-lei Huang
Xiao Liang
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2016
Field of study

Word embedding, a lexical vector representation generated via the neural linguistic model (NLM), is empirically demonstrated to be appropriate for improvement of the performance of traditional language model. However, the supreme dimensionality that is inherent in NLM contributes to the problems of hyperparameters and long-time training in modeling. Here, we propose a force-directed method to improve such problems for simplifying the generation of word embedding. In this framework, each word is assumed as a point in the real world; thus it can approximately simulate the physical movement following certain mechanics. To simulate the variation of meaning in phrases, we use the fracture mechanics to do the formation and breakdown of meaning combined by a 2-gram word group. With the experiments on the natural linguistic tasks of part-of-speech tagging, named entity recognition and semantic role labeling, the result demonstrated that the 2-dimensional word embedding can rival the word embeddings generated by classic NLMs, in terms of accuracy, recall, and text visualization

Crossref

Directory of Open Access Journals