CORE
🇺🇦
make metadata, not war
Services
Services overview
Explore all CORE services
Access to raw data
API
Dataset
FastSync
Content discovery
Recommender
Discovery
OAI identifiers
OAI Resolver
Managing content
Dashboard
Bespoke contracts
Consultancy services
Support us
Support us
Membership
Sponsorship
Community governance
Advisory Board
Board of supporters
Research network
About
About us
Our mission
Team
Blog
FAQs
Contact us
research
MapReduce analysis for cloud-archived data
Authors
G Alatorre
L Liu
+3 more
N Mandagere
B Palanisamy
A Singh
Publication date
1 January 2014
Publisher
'Institute of Electrical and Electronics Engineers (IEEE)'
Doi
Cite
Abstract
Public storage clouds have become a popular choice for archiving certain classes of enterprise data - for example, application and infrastructure logs. These logs contain sensitive information like IP addresses or user logins due to which regulatory and security requirements often require data to be encrypted before moved to the cloud. In order to leverage such data for any business value, analytics systems (e.g. Hadoop/MapReduce) first download data from these public clouds, decrypt it and then process it at the secure enterprise site. We propose VNCache: an efficient solution for MapReduceanalysis of such cloud-archived log data without requiring an apriori data transfer and loading into the local Hadoop cluster. VNcache dynamically integrates cloud-archived data into a virtual namespace at the enterprise Hadoop cluster. Through a seamless data streaming and prefetching model, Hadoop jobs can begin execution as soon as they are launched without requiring any apriori downloading. With VNcache's accurate pre-fetching and caching, jobs often run on a local cached copy of the data block significantly improving performance. When no longer needed, data is safely evicted from the enterprise cluster reducing the total storage footprint. Uniquely, VNcache is implemented with NO changes to the Hadoop application stack. © 2014 IEEE
Similar works
Full text
Open in the Core reader
Download PDF
Available Versions
CiteSeerX
See this paper in CORE
Go to the repository landing page
Download from data provider
oai:CiteSeerX.psu:10.1.1.591.4...
Last time updated on 29/10/2017
Name not available
See this paper in CORE
Go to the repository landing page
Download from data provider
oai:d-scholarship.pitt.edu:220...
Last time updated on 23/11/2016
Crossref
See this paper in CORE
Go to the repository landing page
Download from data provider
info:doi/10.1109%2Fccgrid.2014...
Last time updated on 01/04/2019
CiteSeerX
See this paper in CORE
Go to the repository landing page
Download from data provider
oai:CiteSeerX.psu:10.1.1.699.1...
Last time updated on 29/10/2017
D-Scholarship@Pitt
See this paper in CORE
Go to the repository landing page
Download from data provider
oai:d-scholarship.pitt.edu:220...
Last time updated on 17/07/2014
Name not available
See this paper in CORE
Go to the repository landing page
Download from data provider
oai:d-scholarship.pitt.edu:220...
Last time updated on 15/12/2016