9,346 research outputs found
Porqpine: a peer-to-peer search engine
In this paper, we present a fully distributed and collaborative search
engine for web pages: Porqpine. This system uses a novel query-based model
and collaborative filtering techniques in order to obtain user-customized
results. All knowledge about users and profiles is stored in each user
node?s application. Overall the system is a multi-agent system that runs on
the computers of the user community. The nodes interact in a peer-to-peer
fashion in order to create a real distributed search engine where
information is completely distributed among all the nodes in the network.
Moreover, the system preserves the privacy of user queries and results by
maintaining the anonymity of the queries? consumers and results? producers.
The knowledge required by the system to work is implicitly caught through
the monitoring of users actions, not only within the system?s interface but
also within one of the most popular web browsers. Thus, users are not
required to explicitly feed knowledge about their interests into the system
since this process is done automatically. In this manner, users obtain the
benefits of a personalized search engine just by installing the application
on their computer. Porqpine does not intend to shun completely conventional
centralized search engines but to complement them by issuing more accurate
and personalized results.Postprint (published version
Contextualized B2B Registries
Abstract. Service discovery is a fundamental concept underpinning the move towards dynamic service-oriented business partnerships. The business process for integrating service discovery and underlying registry technologies into business relationships, procurement and project management functions has not been examined and hence existing Web Service registries lack capabilities required by business today. In this paper we present a novel contextualized B2B registry that supports dynamic registration and discovery of resources within management contexts to ensure that the search space is constrained to the scope of authorized and legitimate resources only. We describe how the registry has been deployed in three case studies from important economic sectors (aerospace, automotive, pharmaceutical) showing how contextualized discovery can support distributed product development processes
A Distributed Method for Trust-Aware Recommendation in Social Networks
This paper contains the details of a distributed trust-aware recommendation
system. Trust-base recommenders have received a lot of attention recently. The
main aim of trust-based recommendation is to deal the problems in traditional
Collaborative Filtering recommenders. These problems include cold start users,
vulnerability to attacks, etc.. Our proposed method is a distributed approach
and can be easily deployed on social networks or real life networks such as
sensor networks or peer to peer networks
Effective Retrieval of Resources in Folksonomies Using a New Tag Similarity Measure
Social (or folksonomic) tagging has become a very popular way to describe
content within Web 2.0 websites. However, as tags are informally defined,
continually changing, and ungoverned, it has often been criticised for
lowering, rather than increasing, the efficiency of searching. To address this
issue, a variety of approaches have been proposed that recommend users what
tags to use, both when labeling and when looking for resources. These
techniques work well in dense folksonomies, but they fail to do so when tag
usage exhibits a power law distribution, as it often happens in real-life
folksonomies. To tackle this issue, we propose an approach that induces the
creation of a dense folksonomy, in a fully automatic and transparent way: when
users label resources, an innovative tag similarity metric is deployed, so to
enrich the chosen tag set with related tags already present in the folksonomy.
The proposed metric, which represents the core of our approach, is based on the
mutual reinforcement principle. Our experimental evaluation proves that the
accuracy and coverage of searches guaranteed by our metric are higher than
those achieved by applying classical metrics.Comment: 6 pages, 2 figures, CIKM 2011: 20th ACM Conference on Information and
Knowledge Managemen
Tangos: the agile numerical galaxy organization system
We present Tangos, a Python framework and web interface for database-driven
analysis of numerical structure formation simulations. To understand the role
that such a tool can play, consider constructing a history for the absolute
magnitude of each galaxy within a simulation. The magnitudes must first be
calculated for all halos at all timesteps and then linked using a merger tree;
folding the required information into a final analysis can entail significant
effort. Tangos is a generic solution to this information organization problem,
aiming to free users from the details of data management. At the querying
stage, our example of gathering properties over history is reduced to a few
clicks or a simple, single-line Python command. The framework is highly
extensible; in particular, users are expected to define their own properties
which tangos will write into the database. A variety of parallelization options
are available and the raw simulation data can be read using existing libraries
such as pynbody or yt. Finally, tangos-based databases and analysis pipelines
can easily be shared with collaborators or the broader community to ensure
reproducibility. User documentation is provided separately.Comment: Clarified various points and further improved code performance;
accepted for publication in ApJS. Tutorials (including video) at
http://tiny.cc/tango
Social media in the Global South: A Network Dataset of the Malian Twittersphere
With the expansion of mobile communications infrastructure, social media
usage in the Global South is surging. Compared to the Global North, populations
of the Global South have had less prior experience with social media from
stationary computers and wired Internet. Many countries are experiencing
violent conflicts that have a profound effect on their societies. As a result,
social networks develop under different conditions than elsewhere, and our goal
is to provide data for studying this phenomenon. In this dataset paper, we
present a data collection of a national Twittersphere in a West African country
of conflict. While not the largest social network in terms of users, Twitter is
an important platform where people engage in public discussion. The focus is on
Mali, a country beset by conflict since 2012 that has recently had a relatively
precarious media ecology. The dataset consists of tweets and Twitter users in
Mali and was collected in June 2022, when the Malian conflict became more
violent internally both towards external and international actors. In a
preliminary analysis, we assume that the conflictual context influences how
people access social media and, therefore, the shape of the Twittersphere and
its characteristics. The aim of this paper is to primarily invite researchers
from various disciplines including complex networks and social sciences
scholars to explore the data at hand further. We collected the dataset using a
scraping strategy of the follower network and the identification of
characteristics of a Malian Twitter user. The given snapshot of the Malian
Twitter follower network contains around seven million accounts, of which
56,000 are clearly identifiable as Malian. In addition, we present the tweets.
The dataset is available at:
https://osf.io/mj2q/?view_only=460f5daef1024f05a0d45e082d26059f (peer review
version).Comment: 17 pages, 6 figure
- …