Competitive dynamics of lexical innovations in multi-layer networks

Author: Javarone Marco Alberto
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 01/01/2014

We study the introduction of lexical innovations into a community of language users. Lexical innovations, i.e., new terms added to people's vocabulary, play an important role in the process of language evolution. Nowadays, information is spread through a variety of networks, including, among others, online and offline social networks and the World Wide Web. The entire system, comprising networks of different kinds, can be represented as a multi-layer network. In this context, the diffusion of lexical innovations occurs in a peculiar fashion. In particular, a lexical innovation can undergo three different processes: its original meaning is accepted; its meaning is changed or misunderstood (e.g., when not properly explained), so that more than one meaning can emerge in the population; lastly, in the case of a loan word, it is translated into the population's language (i.e., by defining a new lexical innovation or using a synonym) or into a dialect spoken by part of the population. Therefore, lexical innovations cannot be considered simply as information. We develop a model for analyzing this scenario using a multi-layer network comprising a social network and a media network. The latter represents the set of all information systems of a society, e.g., television, the World Wide Web and radio. Furthermore, we identify temporal directed edges between the nodes of these two networks. In particular, at each time step, nodes of the media network can be connected to randomly chosen nodes of the social network and vice versa. In so doing, information spreads through the whole system and people can share a lexical innovation with their neighbors or, in the event they work as reporters, by using media nodes. Lastly, we use the concept of "linguistic sign" to model lexical innovations, showing its fundamental role in the study of these dynamics. Many numerical simulations have been performed. Comment: 23 pages, 19 figures, 1 table
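The temporal media/social edges described in this abstract can be illustrated with a minimal sketch. It is not the paper's model: the ring-shaped social layer, the parameters `links_per_media` and `p_share`, and the function name `simulate` are all assumptions made for illustration, and the sketch treats the innovation as plain information, leaving out meaning change, translation and the "linguistic sign".

```python
import random

def simulate(n_social=50, n_media=3, steps=20, links_per_media=2,
             p_share=0.3, reporters=frozenset({0}), seed=0):
    """Toy spread of one lexical innovation on a two-layer network."""
    rng = random.Random(seed)
    # Social layer (assumption): a ring, each person knows two adjacent people.
    neighbors = {i: [(i - 1) % n_social, (i + 1) % n_social]
                 for i in range(n_social)}
    social_knows = {0}   # person 0 introduces the innovation
    media_knows = set()  # media nodes that have picked it up
    history = []
    for _ in range(steps):
        new_social, new_media = set(), set()
        # Temporal edge social -> media: an informed reporter reaches
        # one randomly chosen media node at this time step.
        for _person in social_knows & reporters:
            new_media.add(rng.randrange(n_media))
        # Temporal edges media -> social: each informed media node reaches
        # a few randomly chosen people at this time step.
        for _m in media_knows:
            new_social.update(rng.sample(range(n_social), links_per_media))
        # Static social edges: informed people share with each neighbor
        # with probability p_share.
        for person in social_knows:
            for nb in neighbors[person]:
                if rng.random() < p_share:
                    new_social.add(nb)
        social_knows |= new_social
        media_knows |= new_media
        history.append(len(social_knows))
    return history

if __name__ == "__main__":
    print(simulate())  # number of informed people after each step
```

Because the media-to-social edges are redrawn at every step, the innovation can jump to distant parts of the ring instead of spreading only between adjacent people.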

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Cagliari

The Best Trail Algorithm for Assisted Navigation of Web Sites

Author: Levene Mark
Wheeldon Richard
Publication date: 01/01/2003

We present an algorithm called the Best Trail Algorithm, which helps solve the hypertext navigation problem by automating the construction of memex-like trails through the corpus. The algorithm performs a probabilistic best-first expansion of a set of navigation trees to find relevant and compact trails. We describe the implementation of the algorithm, scoring methods for trails, filtering algorithms and a new metric called "potential gain", which measures the potential of a page for future navigation opportunities. Comment: 11 pages, 11 figures
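The idea of best-first expansion of trails can be sketched as follows. This is a deterministic simplification, not the Best Trail Algorithm itself: the paper's expansion is probabilistic and works on a set of navigation trees. The scoring rule (mean page relevance), the expansion budget and the names `best_trails`, `links` and `relevance` are assumptions for illustration.

```python
import heapq

def best_trails(links, relevance, start, max_len=4, top_k=3,
                max_expansions=100):
    """Expand the highest-scoring trail first; return the top_k trails seen."""
    def score(trail):
        # Hypothetical score: mean relevance of the pages on the trail,
        # which favors trails that are both relevant and compact.
        return sum(relevance.get(p, 0.0) for p in trail) / len(trail)

    # heapq is a min-heap, so scores are stored negated.
    frontier = [(-score([start]), [start])]
    seen = []
    expansions = 0
    while frontier and expansions < max_expansions:
        neg_score, trail = heapq.heappop(frontier)
        seen.append((-neg_score, trail))
        expansions += 1
        if len(trail) < max_len:
            for nxt in links.get(trail[-1], []):
                if nxt not in trail:  # avoid revisiting pages
                    extended = trail + [nxt]
                    heapq.heappush(frontier, (-score(extended), extended))
    seen.sort(key=lambda item: item[0], reverse=True)
    return seen[:top_k]

if __name__ == "__main__":
    links = {"home": ["a", "b"], "a": ["c"], "b": ["c"], "c": []}
    relevance = {"home": 0.1, "a": 0.9, "b": 0.2, "c": 0.8}
    for s, trail in best_trails(links, relevance, "home"):
        print(round(s, 3), " -> ".join(trail))
```

On this toy site the trail through the highly relevant pages `a` and `c` is ranked first; the expansion budget caps the work when the link graph is large.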

arXiv.org e-Print Archive

CiteSeerX

Birkbeck Institutional Research Online

GraphVite: A High-Performance CPU-GPU Hybrid System for Node Embedding

Author: Qu Meng
Tang Jian
Xu Shizhen
Zhu Zhaocheng
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2019

Learning continuous representations of nodes has recently attracted growing interest in both academia and industry, due to their simplicity and effectiveness in a variety of applications. Most existing node embedding algorithms and systems can process networks with hundreds of thousands or a few million nodes. However, how to scale them to networks that have tens of millions or even hundreds of millions of nodes remains a challenging problem. In this paper, we propose GraphVite, a high-performance CPU-GPU hybrid system for training node embeddings, by co-optimizing the algorithm and the system. On the CPU end, augmented edge samples are generated in parallel by online random walks on the network and serve as the training data. On the GPU end, a novel parallel negative sampling is proposed to leverage multiple GPUs to train node embeddings simultaneously, without much data transfer and synchronization. Moreover, an efficient collaboration strategy is proposed to further reduce the synchronization cost between CPUs and GPUs. Experiments on multiple real-world networks show that GraphVite is highly efficient. It takes only about one minute for a network with 1 million nodes and 5 million edges on a single machine with 4 GPUs, and takes around 20 hours for a network with 66 million nodes and 1.8 billion edges. Compared to the current fastest system, GraphVite is about 50 times faster without any sacrifice in performance. Comment: accepted at WWW 201
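The two kinds of training samples mentioned in this abstract, edge samples derived from random walks and negative samples, can be illustrated with a small single-threaded sketch. It does not reproduce GraphVite's parallel CPU-GPU design or its API: the function names, the `window` parameter and the uniform choice of negatives are assumptions for illustration only.

```python
import random

def random_walk(adj, start, length, rng):
    """Walk up to `length` nodes, moving to a random neighbor each step."""
    walk = [start]
    while len(walk) < length:
        nbrs = adj[walk[-1]]
        if not nbrs:
            break  # dead end: stop the walk early
        walk.append(rng.choice(nbrs))
    return walk

def augmented_edges(walk, window):
    """Pair each node with the nodes up to `window` steps ahead on the walk."""
    last = len(walk) - 1
    return [(walk[i], walk[j])
            for i in range(len(walk))
            for j in range(i + 1, min(i + window, last) + 1)]

def negative_samples(nodes, positive, k, rng):
    """Draw k corrupted edges by replacing the target with a random node.

    Uniform sampling is an assumption of this sketch.
    """
    src, dst = positive
    candidates = [n for n in nodes if n != src and n != dst]
    return [(src, rng.choice(candidates)) for _ in range(k)]

if __name__ == "__main__":
    adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
    rng = random.Random(0)
    walk = random_walk(adj, 0, 5, rng)
    for edge in augmented_edges(walk, window=2):
        print(edge, negative_samples(list(adj), edge, k=2, rng=rng))
```

Pairing nodes that are several steps apart on a walk yields more positive samples than the original edge list alone, which is the sense of "augmented" assumed here.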

arXiv.org e-Print Archive

Crossref

Structuring visual exploratory analysis of skill demand

Author: A.-S. Dadzie
Brooke
Cunningham
Davenport
E.M. Sibarani
Einsfeld
Elia
Ellis
Gehl
Granville
Harper
Heer
Hervás
I. Novalija
Inselberg
Keim
Khobreh
Kulyk
Liu
Munzner
Nielsen
S. Scerri
Sacha
Sibarani
Strawn
Terblanche
Tominski
Turkay
Ware
Welter
Wowczko
Publication venue: 'Elsevier BV'
Publication date: 01/01/2018

The analysis of increasingly large and diverse data for meaningful interpretation and question answering is handicapped by human cognitive limitations. Consequently, semi-automatic abstraction of complex data within structured information spaces becomes increasingly important, if its knowledge content is to support intuitive, exploratory discovery. Exploration of skill demand is an area where regularly updated, multi-dimensional data may be exploited to assess capability within the workforce to manage the demands of the modern, technology- and data-driven economy. The knowledge derived may be employed by skilled practitioners in defining career pathways, to identify where, when and how to update their skillsets in line with advancing technology and changing work demands. This same knowledge may also be used to identify the combination of skills essential in recruiting for new roles. To address the challenges inherent in exploring the complex, heterogeneous, dynamic data that feeds into such applications, we investigate the use of an ontology to guide structuring of the information space, to allow individuals and institutions to interactively explore and interpret the dynamic skill demand landscape for their specific needs. As a test case we consider the relatively new and highly dynamic field of Data Science, where insightful, exploratory data analysis and knowledge discovery are critical. We employ context-driven and task-centred scenarios to explore our research questions and guide iterative design, development and formative evaluation of our ontology-driven, visual exploratory discovery and analysis approach, to measure where it adds value to users’ analytical activity. Our findings reinforce the potential in our approach, and point us to future paths to build on

Crossref

Fraunhofer-ePrints

Open Research Online (The Open University)

A Survey of Volunteered Open Geo-Knowledge Bases in the Semantic Web

Author: A. Ballatore
A. Buccella
A. Burton-Jones
A. Gangemi
A. Gore
A. Gómez-Pérez
A. Polleres
A. Schwering
A. Turner
B. Smith
C. Bizer
C. Jones
C. Keßler
C. Manning
C.B. Jones
D. Buscaldi
D. Coleman
D. Nadeau
D. Strasunskas
D. Sui
F. Baader
F. Fonseca
F. Giunchiglia
F. Harvey
F. Gey
F.J. Lopez-Pellicer
G. Bordogna
G. Fu
G. De Tré
G. Weikum
J. Giles
J. Goodwin
J. Howe
J. Leveling
K. Janowicz
L. Vaccari
L.L. Hill
M. Egenhofer
M. Goodchild
M. Grassi
M. Haklay
M. Kitsuregawa
M. Lutz
N. Choi
N. Guarino
P. Burrough
P. Magnus
P. Roget
P. Singh
P.D. Smart
R. Fouad
R. Rada
S. Auer
S. Freitas
S. Hahmann
S. Overell
S. Schade
S. Staab
S. Vaid
S. Winter
T. Berners-Lee
T. Mandl
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

Over the past decade, rapid advances in web technologies, coupled with innovative models of spatial data collection and consumption, have generated robust growth in geo-referenced information, resulting in spatial information overload. Increasing 'geographic intelligence' in traditional text-based information retrieval has become a prominent approach to respond to this issue and to fulfill users' spatial information needs. Numerous efforts in the Semantic Geospatial Web, Volunteered Geographic Information (VGI), and the Linking Open Data initiative have converged in a constellation of open knowledge bases, freely available online. In this article, we survey these open knowledge bases, focusing on their geospatial dimension. Particular attention is devoted to the crucial issue of the quality of geo-knowledge bases, as well as of crowdsourced data. A new knowledge base, the OpenStreetMap Semantic Network, is outlined as our contribution to this area. Research directions in information integration and Geographic Information Retrieval (GIR) are then reviewed, with a critical discussion of their current limitations and future prospects.

arXiv.org e-Print Archive

Crossref

Drawing OWL 2 ontologies with Eddy the editor

Author: Lembo Domenico
Pantaleone Daniele
Santarelli Valerio
Savo Domenico Fabio
Publication venue: 'IOS Press'
Publication date: 01/01/2018

In this paper we introduce Eddy, a new open-source tool for the graphical editing of OWL 2 ontologies. Eddy is specifically designed for creating ontologies in Graphol, a completely visual ontology language that is equivalent to OWL 2. Thus, in Eddy ontologies are easily drawn as diagrams, rather than written as sets of formulas, as commonly happens in popular ontology design and engineering environments. This makes Eddy particularly suited for use by people who are more familiar with diagrammatic languages for conceptual modeling than with typical ontology formalisms, as is often the case in non-academic and industrial contexts. Eddy provides intuitive functionalities for specifying Graphol diagrams, guarantees their syntactic correctness, and allows for exporting them in standard OWL 2 syntax. A user evaluation study we conducted shows that Eddy is perceived as an easy and intuitive tool for ontology specification.

Archivio della ricerca- Università di Roma La Sapienza