240 research outputs found

Computational intelligent methods for trusting in social networks

Author: Núñez González José David
Publication date: 26/02/2016

104 p. This thesis covers three research lines in social networks. The first research line concerns trust: several feature extraction approaches are proposed for trust prediction, and their results are compared with those of classic methods. The problem of imbalanced datasets is also addressed. The second research line concerns recommendation systems, with two experiments: the first is about recipe generation with a bread machine, and the second is about product generation based on ratings given by users. The third research line concerns influence maximization: a new heuristic method is proposed to find the minimal set of nodes that maximizes influence in the network

Archivo Digital para la Docencia y la Investigación

Doctor of Philosophy

Author: Koop David Allen
Publication venue: University of Utah
Publication date: 01/05/2012

Dissertation. Serving as a record of what happened during a scientific process, often computational, provenance has become an important piece of computing. The importance of archiving not only data and results but also the lineage of these entities has led to a variety of systems that capture provenance as well as models and schemas for this information. Despite significant work focused on obtaining and modeling provenance, there has been little work on managing and using this information. Using the provenance from past work, it is possible to mine common computational structure or determine differences between executions. Such information can be used to suggest possible completions for partial workflows, summarize a set of approaches, or extend past work in new directions. These applications require infrastructure to support efficient queries and accessible reuse. In order to support knowledge discovery and reuse from provenance information, the management of those data is important. One component of provenance is the specification of the computations; workflows provide structured abstractions of code and are commonly used for complex tasks. Using change-based provenance, it is possible to store large numbers of similar workflows compactly. This storage also allows efficient computation of differences between specifications. However, querying for specific structure across a large collection of workflows is difficult because comparing graphs depends on computing subgraph isomorphism, which is NP-complete. Graph indexing methods identify features that help distinguish graphs of a collection to filter results for a subgraph containment query and reduce the number of subgraph isomorphism computations. For provenance, this work extends these methods to work for more exploratory queries and collections with significant overlap.
However, comparing workflow or provenance graphs may not require exact equality; a match between two graphs may allow paired nodes to be similar yet not equivalent. This work presents techniques to better correlate graphs to help summarize collections. Using this infrastructure, provenance can be reused so that users can learn from their own and others' history. Just as textual search has been augmented with suggested completions based on past or common queries, provenance can be used to suggest how computations can be completed or which steps might connect to a given subworkflow. In addition, provenance can help further science by accelerating publication and reuse. By incorporating provenance into publications, authors can more easily integrate their results, and readers can more easily verify and repeat results. However, reusing past computations requires maintaining stronger associations with any input data and underlying code as well as providing paths for migrating old work to new hardware or algorithms. This work presents a framework for maintaining data and code as well as supporting upgrades for workflow computations

The University of Utah: J. Willard Marriott Digital Library

User Interfaces to the Web of Data based on Natural Language Generation

Author: Ell Basil
Publication venue: KIT Scientific Publishing, Karlsruhe
Publication date: 01/01/2017

We explore how Virtual Research Environments based on Semantic Web technologies support research interactions with RDF data in various stages of corpus-based analysis, analyze the Web of Data in terms of human readability, derive labels from variables in SPARQL queries, apply Natural Language Generation to improve user interfaces to the Web of Data by verbalizing SPARQL queries and RDF graphs, and present a method to automatically induce RDF graph verbalization templates via distant supervision

KITopen

Directory of Open Access Books (DOAB)

Community detection in graphs

Author: Fortunato Santo
Publication venue: 'Elsevier BV'
Publication date: 01/01/2009

The modern science of networks has brought significant advances to our understanding of complex systems. One of the most relevant features of graphs representing real systems is community structure, or clustering, i.e., the organization of vertices in clusters, with many edges joining vertices of the same cluster and comparatively few edges joining vertices of different clusters. Such clusters, or communities, can be considered as fairly independent compartments of a graph, playing a role similar to that of, e.g., the tissues or the organs in the human body. Detecting communities is of great importance in sociology, biology and computer science, disciplines where systems are often represented as graphs. This problem is very hard and not yet satisfactorily solved, despite the huge effort of a large interdisciplinary community of scientists working on it over the past few years. We will attempt a thorough exposition of the topic, from the definition of the main elements of the problem, to the presentation of most methods developed, with a special focus on techniques designed by statistical physicists, from the discussion of crucial issues like the significance of clustering and how methods should be tested and compared against each other, to the description of applications to real networks. Comment: Review article. 103 pages, 42 figures, 2 tables. Two sections expanded + minor modifications. Three figures + one table + references added. Final version published in Physics Reports

arXiv.org e-Print Archive

CiteSeerX

Crossref

Semantic chunking

Author: Muszynska Ewa
Publication venue: University of Cambridge
Publication date: 01/10/2020

Long sentences pose a challenge for natural language processing (NLP) applications. They are associated with a complex information structure leading to increased requirements for processing resources. Although the issue is present in many areas of research, there is little uniformity in the solutions used by research communities dedicated to individual NLP applications. Different aspects of the problem are addressed by different tasks, such as sentence simplification or shallow chunking. The main contribution of this thesis is the introduction of the task of semantic chunking as a general approach to reducing the cost of processing long sentences. The goal of semantic chunking is to find semantically contained fragments of a sentence representation that can be processed independently and recombined without loss of information. We anchor its principles in established concepts of semantic theory, in particular event and situation semantics. Most of the experiments in this thesis focus on semantic chunking defined on complex semantic representations in Dependency Minimal Recursion Semantics (DMRS), but we also demonstrate that the task can be performed on sentence strings. We present three chunking models: a) a rule-based proof-of-concept DMRS chunking system; b) a semi-supervised sequence labelling neural model for surface semantic chunking; c) a system capable of finding semantic chunk boundaries based on the inherent structure of DMRS graphs, generalisable in the form of descriptive templates. We show how semantic chunking can be applied within a divide-and-conquer processing paradigm, using as an example the task of realization from DMRS. The application of semantic chunking yields noticeable efficiency gains without decreasing the quality of results

Apollo (Cambridge)

On the Evolution of Knowledge Graphs: A Survey and Perspective

Author: Chen Zhongwu
Guo Jian
Jiang Xuhui
Shen Yinghan
Sun Xun
Tang Lumingyuan
Wang Saizhuo
Wang Yuanzhuo
Xu Chengjin
Publication date: 10/10/2023

Knowledge graphs (KGs) are structured representations of diversified knowledge. They are widely used in various intelligent applications. In this article, we provide a comprehensive survey on the evolution of various types of knowledge graphs (i.e., static KGs, dynamic KGs, temporal KGs, and event KGs) and techniques for knowledge extraction and reasoning. Furthermore, we introduce the practical applications of different types of KGs, including a case study in financial analysis. Finally, we propose our perspective on the future directions of knowledge engineering, including the potential of combining the power of knowledge graphs and large language models (LLMs), and the evolution of knowledge extraction, reasoning, and representation

arXiv.org e-Print Archive

Subgroup discovery for structured target concepts

Author: Kalofolias Janis
Publication venue: Saarländische Universitäts- und Landesbibliothek
Publication date: 01/01/2023

The main object of study in this thesis is subgroup discovery, a theoretical framework for finding subgroups in data (i.e., named sub-populations) whose behaviour with respect to a specified target concept is exceptional when compared to the rest of the dataset. This is a powerful tool that conveys crucial information to a human audience, but despite past advances it has been limited to simple target concepts. In this work we propose algorithms that bring this framework to novel application domains. We introduce the concept of representative subgroups, which we use not only to ensure the fairness of a sub-population with regard to a sensitive trait, such as race or gender, but also to go beyond known trends in the data. For entities with additional relational information that can be encoded as a graph, we introduce a novel measure of robust connectedness which improves on established alternative measures of density; we then provide a method that uses this measure to discover which named sub-populations are better connected. Our contributions within subgroup discovery culminate in the introduction of kernelised subgroup discovery: a novel framework that enables the discovery of subgroups on i.i.d. target concepts with virtually any kind of structure. Importantly, our framework additionally provides a concrete and efficient tool that works out-of-the-box without any modification, apart from specifying the Gramian of a positive definite kernel. For use within kernelised subgroup discovery, but also in any other kind of kernel method, we additionally introduce a novel random walk graph kernel. Our kernel allows the fine tuning of the alignment between the vertices of the two compared graphs during the count of the random walks, and we also propose meaningful structure-aware vertex labels to utilise this new capability. With these contributions we thoroughly extend the applicability of subgroup discovery and ultimately re-define it as a kernel method

Universaar

Acronym

Semantic approaches to domain template construction and opinion mining from natural language

Author: Trampuš Mitja
Publication date: 05/06/2015

Most of the text mining algorithms in use today are based on lexical representation of input texts, for example bag of words. A possible alternative is to first convert text into a semantic representation, one that captures the text content in a structured way and using only a set of pre-agreed labels. This thesis explores the feasibility of such an approach to two tasks on collections of documents: identifying common structure in input documents (»domain template construction«), and helping users find differing opinions in input documents (»opinion mining«). We first discuss ways of converting natural text to a semantic representation. We propose and compare two new methods with varying degrees of target representation complexity. The first method, showing more promise, is based on dependency parser output which it converts to lightweight semantic frames, with role fillers aligned to WordNet. The second method structures text using Semantic Role Labeling techniques and aligns the output to the Cyc ontology. Based on the first of the above representations, we next propose and evaluate two methods for constructing frame-based templates for documents from a given domain (e.g. bombing attack news reports). A template is the set of all salient attributes (e.g. attacker, number of casualties, ...). The idea of both methods is to construct abstract frames for which more specific instances (according to the WordNet hierarchy) can be found in the input documents. Fragments of these abstract frames represent the sought-for attributes. We achieve state-of-the-art performance and additionally provide detailed type constraints for the attributes, something not possible with competing methods. Finally, we propose a software system for exposing differing opinions in the news. For any given event, we present the user with all known articles on the topic and let them navigate them by three semantic properties simultaneously: sentiment, topical focus and geography of origin. The result is a dynamically reranked set of relevant articles and a near-real-time focused summary of those articles. The summary, too, is computed from the semantic text representation discussed above. We conducted a user study of the whole system with very positive results

ePrints.FRI