31,968 research outputs found
Network Sampling: From Static to Streaming Graphs
Network sampling is integral to the analysis of social, information, and
biological networks. Since many real-world networks are massive in size,
continuously evolving, and/or distributed in nature, the network structure is
often sampled in order to facilitate study. For these reasons, a more thorough
and complete understanding of network sampling is critical to support the field
of network science. In this paper, we outline a framework for the general
problem of network sampling, by highlighting the different objectives,
population and units of interest, and classes of network sampling methods. In
addition, we propose a spectrum of computational models for network sampling
methods, ranging from the traditionally studied model based on the assumption
of a static domain to a more challenging model that is appropriate for
streaming domains. We design a family of sampling methods based on the concept
of graph induction that generalize across the full spectrum of computational
models (from static to streaming) while efficiently preserving many of the
topological properties of the input graphs. Furthermore, we demonstrate how
traditional static sampling algorithms can be modified for graph streams for
each of the three main classes of sampling methods: node, edge, and
topology-based sampling. Our experimental results indicate that our proposed
family of sampling methods more accurately preserves the underlying properties
of the graph for both static and streaming graphs. Finally, we study the impact
of network sampling algorithms on the parameter estimation and performance
evaluation of relational classification algorithms
Efficient Opportunistic Sensing using Mobile Collaborative Platform MOSDEN
Mobile devices are rapidly becoming the primary computing device in people's
lives. Application delivery platforms like Google Play, Apple App Store have
transformed mobile phones into intelligent computing devices by the means of
applications that can be downloaded and installed instantly. Many of these
applications take advantage of the plethora of sensors installed on the mobile
device to deliver enhanced user experience. The sensors on the smartphone
provide the opportunity to develop innovative mobile opportunistic sensing
applications in many sectors including healthcare, environmental monitoring and
transportation. In this paper, we present a collaborative mobile sensing
framework namely Mobile Sensor Data EngiNe (MOSDEN) that can operate on
smartphones capturing and sharing sensed data between multiple distributed
applications and users. MOSDEN follows a component-based design philosophy
promoting reuse for easy and quick opportunistic sensing application
deployments. MOSDEN separates the application-specific processing from the
sensing, storing and sharing. MOSDEN is scalable and requires minimal
development effort from the application developer. We have implemented our
framework on Android-based mobile platforms and evaluate its performance to
validate the feasibility and efficiency of MOSDEN to operate collaboratively in
mobile opportunistic sensing applications. Experimental outcomes and lessons
learnt conclude the paper
MOSDEN: A Scalable Mobile Collaborative Platform for Opportunistic Sensing Applications
Mobile smartphones along with embedded sensors have become an efficient
enabler for various mobile applications including opportunistic sensing. The
hi-tech advances in smartphones are opening up a world of possibilities. This
paper proposes a mobile collaborative platform called MOSDEN that enables and
supports opportunistic sensing at run time. MOSDEN captures and shares sensor
data across multiple apps, smartphones and users. MOSDEN supports the emerging
trend of separating sensors from application-specific processing, storing and
sharing. MOSDEN promotes reuse and re-purposing of sensor data hence reducing
the efforts in developing novel opportunistic sensing applications. MOSDEN has
been implemented on Android-based smartphones and tablets. Experimental
evaluations validate the scalability and energy efficiency of MOSDEN and its
suitability towards real world applications. The results of evaluation and
lessons learned are presented and discussed in this paper.Comment: Accepted to be published in Transactions on Collaborative Computing,
2014. arXiv admin note: substantial text overlap with arXiv:1310.405
Modeling and Evaluation of Multisource Streaming Strategies in P2P VoD Systems
In recent years, multimedia content distribution has largely been moved to the Internet, inducing broadcasters, operators and service providers to upgrade with large expenses their infrastructures. In this context, streaming solutions that rely on user devices such as set-top boxes (STBs) to offload dedicated streaming servers are particularly appropriate. In these systems, contents are usually replicated and scattered over the network established by STBs placed at users' home, and the video-on-demand (VoD) service is provisioned through streaming sessions established among neighboring STBs following a Peer-to-Peer fashion. Up to now the majority of research works have focused on the design and optimization of content replicas mechanisms to minimize server costs. The optimization of replicas mechanisms has been typically performed either considering very crude system performance indicators or analyzing asymptotic behavior. In this work, instead, we propose an analytical model that complements previous works providing fairly accurate predictions of system performance (i.e., blocking probability). Our model turns out to be a highly scalable, flexible, and extensible tool that may be helpful both for designers and developers to efficiently predict the effect of system design choices in large scale STB-VoD system
Single-Board-Computer Clusters for Cloudlet Computing in Internet of Things
The number of connected sensors and devices is expected to increase to billions in the near
future. However, centralised cloud-computing data centres present various challenges to meet the
requirements inherent to Internet of Things (IoT) workloads, such as low latency, high throughput
and bandwidth constraints. Edge computing is becoming the standard computing paradigm for
latency-sensitive real-time IoT workloads, since it addresses the aforementioned limitations related
to centralised cloud-computing models. Such a paradigm relies on bringing computation close to
the source of data, which presents serious operational challenges for large-scale cloud-computing
providers. In this work, we present an architecture composed of low-cost Single-Board-Computer
clusters near to data sources, and centralised cloud-computing data centres. The proposed
cost-efficient model may be employed as an alternative to fog computing to meet real-time IoT
workload requirements while keeping scalability. We include an extensive empirical analysis to
assess the suitability of single-board-computer clusters as cost-effective edge-computing micro data
centres. Additionally, we compare the proposed architecture with traditional cloudlet and cloud
architectures, and evaluate them through extensive simulation. We finally show that acquisition costs
can be drastically reduced while keeping performance levels in data-intensive IoT use cases.Ministerio de EconomĂa y Competitividad TIN2017-82113-C2-1-RMinisterio de EconomĂa y Competitividad RTI2018-098062-A-I00European Unionâs Horizon 2020 No. 754489Science Foundation Ireland grant 13/RC/209
Dynamic Graph Stream Algorithms in Space
In this paper we study graph problems in dynamic streaming model, where the
input is defined by a sequence of edge insertions and deletions. As many
natural problems require space, where is the number of
vertices, existing works mainly focused on designing space
algorithms. Although sublinear in the number of edges for dense graphs, it
could still be too large for many applications (e.g. is huge or the graph
is sparse). In this work, we give single-pass algorithms beating this space
barrier for two classes of problems.
We present space algorithms for estimating the number of connected
components with additive error and
-approximating the weight of minimum spanning tree, for any
small constant . The latter improves previous
space algorithm given by Ahn et al. (SODA 2012) for connected graphs with
bounded edge weights.
We initiate the study of approximate graph property testing in the dynamic
streaming model, where we want to distinguish graphs satisfying the property
from graphs that are -far from having the property. We consider
the problem of testing -edge connectivity, -vertex connectivity,
cycle-freeness and bipartiteness (of planar graphs), for which, we provide
algorithms using roughly space, which is
for any constant .
To complement our algorithms, we present space
lower bounds for these problems, which show that such a dependence on
is necessary.Comment: ICALP 201
Lightweight Asynchronous Snapshots for Distributed Dataflows
Distributed stateful stream processing enables the deployment and execution
of large scale continuous computations in the cloud, targeting both low latency
and high throughput. One of the most fundamental challenges of this paradigm is
providing processing guarantees under potential failures. Existing approaches
rely on periodic global state snapshots that can be used for failure recovery.
Those approaches suffer from two main drawbacks. First, they often stall the
overall computation which impacts ingestion. Second, they eagerly persist all
records in transit along with the operation states which results in larger
snapshots than required. In this work we propose Asynchronous Barrier
Snapshotting (ABS), a lightweight algorithm suited for modern dataflow
execution engines that minimises space requirements. ABS persists only operator
states on acyclic execution topologies while keeping a minimal record log on
cyclic dataflows. We implemented ABS on Apache Flink, a distributed analytics
engine that supports stateful stream processing. Our evaluation shows that our
algorithm does not have a heavy impact on the execution, maintaining linear
scalability and performing well with frequent snapshots.Comment: 8 pages, 7 figure
- âŠ