Search CORE

44,863 research outputs found

A similarity-based community detection method with multiple prototype representation

Author: Martin Arnaud
Pan Quan
Zhou Kuang
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

Communities are of great importance for understanding graph structures in social networks. Some existing community detection algorithms use a single prototype to represent each group. In real applications, this may not adequately model the different types of communities and hence limits the clustering performance on social networks. To address this problem, a Similarity-based Multi-Prototype (SMP) community detection approach is proposed in this paper. In SMP, vertices in each community carry various weights to describe their degree of representativeness. This mechanism enables each community to be represented by more than one node. The centrality of nodes is used to calculate prototype weights, while similarity is utilized to guide us to partitioning the graph. Experimental results on computer generated and real-world networks clearly show that SMP performs well for detecting communities. Moreover, the method could provide richer information for the inner structure of the detected communities with the help of prototype weights compared with the existing community detection models

arXiv.org e-Print Archive

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL-Rennes 1

Mapping Big Data into Knowledge Space with Cognitive Cyber-Infrastructure

Author: Zhuge Hai
Publication venue
Publication date: 18/07/2015
Field of study

Big data research has attracted great attention in science, technology, industry and society. It is developing with the evolving scientific paradigm, the fourth industrial revolution, and the transformational innovation of technologies. However, its nature and fundamental challenge have not been recognized, and its own methodology has not been formed. This paper explores and answers the following questions: What is big data? What are the basic methods for representing, managing and analyzing big data? What is the relationship between big data and knowledge? Can we find a mapping from big data into knowledge space? What kind of infrastructure is required to support not only big data management and analysis but also knowledge discovery, sharing and management? What is the relationship between big data and science paradigm? What is the nature and fundamental challenge of big data computing? A multi-dimensional perspective is presented toward a methodology of big data computing.Comment: 59 page

arXiv.org e-Print Archive

CiteSeerX

Prediction of Emerging Technologies Based on Analysis of the U.S. Patent Citation Network

Author: A. Hargadon
A. Jaffe
A. Pyka
A. Sood
A. Usher
A. Verbeek
A. Vespignani
B. Milman
C. Chen
C. Sternitzke
C. Weng
D. Harhoff
E. Duguet
E. Garfield
E. Garfield
F. Murray
F. Narin
G. McMillanm
G. Palla
H. Moed
H. Small
H. Small
J. Alcacer
J. Hagedoorn
J. Lanjouw
J. Podolny
J. Podolny
J. Schumpeter
J. Ward
Jan Tobochnik
K. Debackere
K. Lai
K. OuYang
K. Strandburg
K. Strandburg
Katherine Strandburg
Kinga Makovi
L. Fleming
L. Fleming
L. Fleming
L. Leydesdorff
László Zalányi
M. Girvan
M. Meyer
M. Meyer
M. Mogee
M. Mogee
M. Mogee
M. Newman
M. Newman
M. Wallace
M. Weitzman
N. Shibata
N. Shibata
N. Shibata
O. Sorenson
P. Almeida
P. Pons
P. Saviotti
P. Saviotti
P. Saviotti
P. Érdi
P.C. Lee
Péter Volf
Péter Érdi
R. Fontana
R. Henderson
R. Kostoff
R. Kostoff
R. Kostoff
R. Tijssen
S. Chang
Y. Kajikawa
Y. Kajikawa
Z. Huang
Z. Huang
Zoltán Somogyvári
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/04/2013
Field of study

The network of patents connected by citations is an evolving graph, which provides a representation of the innovation process. A patent citing another implies that the cited patent reflects a piece of previously existing knowledge that the citing patent builds upon. A methodology presented here (i) identifies actual clusters of patents: i.e. technological branches, and (ii) gives predictions about the temporal changes of the structure of the clusters. A predictor, called the {citation vector}, is defined for characterizing technological development to show how a patent cited by other patents belongs to various industrial fields. The clustering technique adopted is able to detect the new emerging recombinations, and predicts emerging new technology clusters. The predictive ability of our new method is illustrated on the example of USPTO subcategory 11, Agriculture, Food, Textiles. A cluster of patents is determined based on citation data up to 1991, which shows significant overlap of the class 442 formed at the beginning of 1997. These new tools of predictive analytics could support policy decision making processes in science and technology, and help formulate recommendations for action

arXiv.org e-Print Archive

Crossref