29 research outputs found

    Routing and Caching Strategy in Information-Centric Network (ICN)

    The main usage of the Internet today is content distribution and retrieval. In today's Internet, connections and data exchange can only take place between hosts, a model known as host-centric end-to-end communication. As the number of network users and the demand for content grow rapidly, the current network paradigm is becoming increasingly complex and can barely meet future needs. Recently, the information/content-centric networking (ICN) architecture has been proposed and is expected to replace the current communication model. Because ICN is an in-network caching system, its cache management scheme is a key factor in its performance, and many researchers have worked on improving it. In this paper, a new content caching and routing strategy is introduced. The results show that the new scheme performs well: it needs fewer hops than the regular LRU caching strategy, which means requested objects are found at nearer routers and network traffic is therefore reduced.
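
    The baseline in the comparison is plain per-router LRU caching, evaluated by hop count. A minimal sketch of that baseline (hypothetical names, a simple linear path of routers, and a leave-copy-everywhere policy on the way back) might look like this; it is an illustration, not the paper's scheme:

```python
from collections import OrderedDict

class LRUCache:
    """Per-router content store with least-recently-used eviction."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.store = OrderedDict()          # content name -> cached object

    def get(self, name):
        if name in self.store:
            self.store.move_to_end(name)    # mark as recently used
            return True
        return False

    def put(self, name):
        self.store[name] = True
        if len(self.store) > self.capacity:
            self.store.popitem(last=False)  # evict least recently used

def fetch(path_caches, name):
    """Walk the routers toward the producer; return hops until a copy is found."""
    for hops, cache in enumerate(path_caches, start=1):
        if cache.get(name):
            # leave-copy-everywhere: cache the object on the way back
            for c in path_caches[:hops - 1]:
                c.put(name)
            return hops
    # not cached anywhere: fetched from the producer behind the last router
    for c in path_caches:
        c.put(name)
    return len(path_caches) + 1

if __name__ == "__main__":
    routers = [LRUCache(capacity=4) for _ in range(5)]
    requests = ["a", "b", "a", "c", "a", "b", "d", "a"]
    hops = [fetch(routers, r) for r in requests]
    print("average hops:", sum(hops) / len(hops))
```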

    CRAID: Online RAID upgrades using dynamic hot data reorganization

    Current algorithms used to upgrade RAID arrays typically require large amounts of data to be migrated, even those that move only the minimum amount of data required to keep a balanced data load. This paper presents CRAID, a self-optimizing RAID array that performs an online block reorganization of frequently used, long-term accessed data in order to reduce this migration even further. To achieve this objective, CRAID tracks frequently used, long-term data blocks and copies them to a dedicated partition spread across all the disks in the array. When new disks are added, CRAID only needs to extend this process to the new devices to redistribute this partition, thus greatly reducing the overhead of the upgrade process. In addition, the reorganized access patterns within this partition improve the array's performance, amortizing the copy overhead and allowing CRAID to offer performance competitive with traditional RAIDs. We describe CRAID's motivation and design and evaluate it by replaying seven real-world workloads, including a file server, a web server and a user share. Our experiments show that CRAID can successfully detect hot-data variations and begin using new disks as soon as they are added to the array. Also, the use of a dedicated partition improves the sequentiality of relevant data accesses, which amortizes the cost of reorganizations. Finally, we show that a full-HDD CRAID array with a small distributed partition (<1.28% per disk) can compete in performance with an ideally restriped RAID-5 and a hybrid RAID-5 with a small SSD cache.
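
    As a rough illustration of the idea described above (not the authors' implementation), the sketch below tracks block access frequencies, keeps copies of the hottest blocks in a small partition striped over all disks, and restripes only that partition when a disk is added; all names are hypothetical:

```python
from collections import Counter

class CRAIDSketch:
    """Toy model: track hot blocks and keep copies of them in a small
    partition striped over all disks; adding a disk only restripes that
    partition, not the whole array."""

    def __init__(self, num_disks, hot_capacity):
        self.num_disks = num_disks
        self.hot_capacity = hot_capacity    # blocks kept in the hot partition
        self.access_count = Counter()       # block id -> access frequency
        self.hot_partition = {}             # block id -> disk holding the copy

    def access(self, block):
        self.access_count[block] += 1
        self._update_hot_set()

    def _update_hot_set(self):
        hot = {b for b, _ in self.access_count.most_common(self.hot_capacity)}
        # drop blocks that fell out of the hot set
        for b in list(self.hot_partition):
            if b not in hot:
                del self.hot_partition[b]
        # stripe newly hot blocks round-robin over all disks
        for b in hot:
            if b not in self.hot_partition:
                self.hot_partition[b] = len(self.hot_partition) % self.num_disks

    def add_disk(self):
        """Upgrade: only the hot partition is redistributed over n+1 disks."""
        self.num_disks += 1
        for i, b in enumerate(sorted(self.hot_partition)):
            self.hot_partition[b] = i % self.num_disks
```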

    Evaluation of Load Scheduling Strategies for Real-Time Data Warehouse Environments

    The demand for so-called living or real-time data warehouses is increasing in many application areas, including manufacturing, event monitoring and telecommunications. In fields like these, users expect short response times for their queries and high freshness for the requested data. However, it is truly challenging to meet both requirements at the same time because of the continuous flow of write-only updates and read-only queries, as well as the latency introduced by arbitrarily complex ETL processes. To optimize the update flow in terms of data-freshness maximization and load minimization, we propose two scheduling algorithms, local and global, that operate on different kinds of system information. We discuss the benefits and drawbacks of both approaches in detail and derive recommendations regarding the optimal scheduling strategy for a given system setup and workload.
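
    As a loose sketch of the trade-off studied here, and not of either proposed algorithm, the toy scheduler below applies pending updates for a partition only when a query's freshness requirement demands it; the names and the staleness model are assumptions:

```python
from collections import defaultdict, deque

class FreshnessScheduler:
    """Toy model of scheduling a mixed update/query stream: each query
    states how much staleness it tolerates, and pending updates for its
    partition are applied first only when needed."""

    def __init__(self):
        self.pending_updates = defaultdict(deque)   # partition -> queued updates

    def enqueue_update(self, partition, update):
        self.pending_updates[partition].append(update)

    def run_query(self, partition, max_stale_updates, execute):
        queue = self.pending_updates[partition]
        # apply just enough updates to meet the query's freshness requirement
        while len(queue) > max_stale_updates:
            execute(queue.popleft())
        return execute(f"SELECT ... FROM {partition}")

if __name__ == "__main__":
    sched = FreshnessScheduler()
    for i in range(5):
        sched.enqueue_update("sales", f"UPDATE sales batch {i}")
    # a query that tolerates at most one outstanding update batch
    sched.run_query("sales", max_stale_updates=1, execute=print)
```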

    Multi-level Hybrid Cache: Impact and Feasibility

    Storage-class memories, including flash, have been attracting much attention as promising candidates for today's enterprise storage systems. In particular, since the cost and performance characteristics of flash lie between those of DRAM and hard disks, many studies have considered it as a secondary caching layer underneath the main-memory cache. However, the correlation and interdependency between DRAM and flash caching have received little study. This paper views the problem as a special form of multi-level caching and tries to understand the benefits of this multi-level hybrid cache hierarchy. We show that significant cost can be saved by using flash to reduce the size of the DRAM cache while maintaining the same performance. We also discuss the design challenges of using flash in the caching hierarchy and present potential solutions.
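
    A minimal sketch of the hierarchy discussed here, assuming a simple LRU policy at both tiers and treating flash as a victim cache for DRAM evictions (the class and method names are hypothetical, not the paper's design):

```python
from collections import OrderedDict

class TwoLevelCache:
    """Toy DRAM + flash hierarchy: blocks evicted from the small DRAM tier
    fall into a larger flash tier instead of going straight to disk."""

    def __init__(self, dram_blocks, flash_blocks):
        self.dram = OrderedDict()
        self.flash = OrderedDict()
        self.dram_blocks = dram_blocks
        self.flash_blocks = flash_blocks

    def get(self, block):
        if block in self.dram:
            self.dram.move_to_end(block)
            return "dram hit"
        if block in self.flash:
            del self.flash[block]          # promote to DRAM on a flash hit
            self._insert_dram(block)
            return "flash hit"
        self._insert_dram(block)           # miss: fill from disk into DRAM
        return "disk"

    def _insert_dram(self, block):
        self.dram[block] = True
        if len(self.dram) > self.dram_blocks:
            victim, _ = self.dram.popitem(last=False)
            self._insert_flash(victim)     # demote the DRAM victim to flash

    def _insert_flash(self, block):
        self.flash[block] = True
        if len(self.flash) > self.flash_blocks:
            self.flash.popitem(last=False) # flash victim goes back to disk
```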

    KSM++: Using I/O-based hints to make memory-deduplication scanners more efficient

    Memory-scanning deduplication techniques, as implemented in Linux' Kernel Samepage Merging (KSM), work very well for deduplicating fairly static, anonymous pages with equal content across different virtual machines. However, scanners need very aggressive scan rates to identify sharing opportunities with a short life span of up to about 5 minutes; otherwise, the scan process is not fast enough to catch those short-lived pages. Our approach generates I/O-based hints in the host to make the memory-scanning process more efficient, enabling it to find and exploit short-lived sharing opportunities without raising the scan rate. Experience with similar techniques for paravirtualized guests has shown that pages in a guest's unified buffer cache are good sharing candidates. We already identify such pages in the host while carrying out I/O operations on behalf of the guest, and the target or source pages of those operations can safely be assumed to be part of the guest's unified buffer cache. In this way, we can derive good sharing hints for the memory scanner without modifying the guest. We have implemented our approach in Linux. By modifying the KSM scanning mechanism to process these hints preferentially, we move the associated sharing opportunities earlier into the merging stage and thereby deduplicate more pages than the baseline system. In our evaluation, we identify sharing opportunities faster and with less overhead than the traditional linear scanning policy: KSM needs to follow about seven times as many pages as we do to find a sharing opportunity.
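
    A toy model of the hinting idea, not the authors' KSM patch: pages touched by guest I/O are queued as hints and checked for duplicates before the regular linear scan advances. All names are hypothetical.

```python
from collections import deque

class HintedScanner:
    """Toy dedup scanner that prefers I/O-hinted pages over a linear scan."""

    def __init__(self, memory):
        self.memory = memory              # page number -> page content
        self.hints = deque()              # pages recently involved in guest I/O
        self.by_content = {}              # content -> first page seen with it
        self.cursor = 0                   # position of the linear scan

    def record_io(self, page):
        self.hints.append(page)           # called from the host I/O path

    def scan_step(self):
        # prefer hinted pages; fall back to the linear scan otherwise
        if self.hints:
            page = self.hints.popleft()
        else:
            page = sorted(self.memory)[self.cursor % len(self.memory)]
            self.cursor += 1
        content = self.memory[page]
        if content in self.by_content and self.by_content[content] != page:
            return (self.by_content[content], page)   # sharing opportunity
        self.by_content[content] = page
        return None

if __name__ == "__main__":
    scanner = HintedScanner({0: "A", 1: "B", 2: "A", 3: "C"})
    scanner.record_io(2)                  # host saw guest I/O touch page 2
    print([scanner.scan_step() for _ in range(3)])
```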

    ๋ฐ์ดํ„ฐ ์ง‘์•ฝ์  ์‘์šฉ์„ ์œ„ํ•œ ํ”„๋กœ๊ทธ๋žจ ์ปจํ…์ŠคํŠธ ๊ธฐ๋ฐ˜์˜ I/O ์ตœ์ ํ™”

    Master's thesis, Department of Computer Science and Engineering, College of Engineering, Seoul National University Graduate School, August 2019.
    Many kinds of data-intensive applications are in broad use today. These applications generate a large amount of I/O, for example when analyzing large data sets or when structuring data and writing it to storage, so their performance depends heavily on how fast the system performs that I/O. The operating system allocates a portion of main memory to the page cache to maximize file I/O performance by minimizing accesses to the storage device, which is far slower than main memory. However, since memory is much smaller than the storage device, improving file I/O performance requires managing the cache efficiently: keeping the data that will be referenced again and evicting the data that will not. It is impossible for the system alone to predict perfectly which data will be referenced in the future and which will not, so without optimization effort above the system level there is a clear limit to I/O optimization. This thesis proposes a technique that automatically detects and analyzes when I/O occurs and what its patterns are, based on the program context in which the application issues the I/O, and a technique that uses this analysis to automate the recommendation of optimizations to apply to each program context. In this way, the application can provide the system in advance with hints that the system cannot derive by itself, and the system can actively exploit this information to perform I/O faster and use resources more efficiently than before.
    Table of contents: Introduction; Related work (buffer caching and data-separation techniques using program contexts); Program-context-based application I/O analysis (definition and extraction of program contexts, PCStat, context extraction for I/O-thread environments); Applying program-context-based I/O optimization (page-cache hints, fadvise-based optimization, PCAdvisor); Evaluation; Conclusion.
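
    The optimization vehicle named in the abstract is fadvise. As a small illustration of the kind of hint that could be attached to a program context that streams a file exactly once, using Python's os.posix_fadvise (a sketch under that assumption, not the thesis's PCStat/PCAdvisor tooling):

```python
import os

def read_once_sequentially(path, chunk=1 << 20):
    """Hint the page cache for a context that reads a file front-to-back
    exactly once: enable aggressive read-ahead, then drop the cached pages."""
    fd = os.open(path, os.O_RDONLY)
    try:
        os.posix_fadvise(fd, 0, 0, os.POSIX_FADV_SEQUENTIAL)  # length 0 = whole file
        while os.read(fd, chunk):
            pass
        os.posix_fadvise(fd, 0, 0, os.POSIX_FADV_DONTNEED)    # data will not be reused
    finally:
        os.close(fd)
```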

    My cache or yours? Making storage more exclusive

    Modern high-end disk arrays often have several gigabytes of cache RAM. Unfortunately, most array caches use management policies that duplicate the same data blocks at both the client and array levels of the cache hierarchy: they are inclusive. Thus, the aggregate cache behaves as if it were only as large as the larger of the client and array caches, instead of as large as the sum of the two. Inclusiveness is wasteful: cache RAM is expensive. We explore the benefits of a simple scheme to achieve exclusive caching, in which a data block is cached at either a client or the disk array, but not both. Exclusiveness helps to create the effect of a single, large unified cache. We introduce a DEMOTE operation to transfer data ejected from the client to the array, and explore its effectiveness with simulation studies. We quantify the benefits and overheads of demotions across both synthetic and real-life workloads. The results show that we can obtain useful, and sometimes substantial, speedups. During our investigations, we also developed some new cache-insertion algorithms that show promise for multi-client systems, and we report on some of their properties.
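
    A minimal sketch of the DEMOTE idea under simple LRU assumptions (hypothetical class names, not the paper's array implementation): a block lives in the client or the array cache but never both, and blocks ejected from the client are demoted into the array.

```python
from collections import OrderedDict

class Cache:
    """Simple LRU cache that reports the victim it evicts on insert."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.blocks = OrderedDict()

    def hit(self, block):
        if block in self.blocks:
            self.blocks.move_to_end(block)
            return True
        return False

    def insert(self, block):
        self.blocks[block] = True
        self.blocks.move_to_end(block)
        if len(self.blocks) > self.capacity:
            victim, _ = self.blocks.popitem(last=False)
            return victim
        return None

    def remove(self, block):
        self.blocks.pop(block, None)

class ExclusiveHierarchy:
    """Toy DEMOTE model: a block is cached at the client or the array,
    never both; client evictions are demoted down to the array."""

    def __init__(self, client_size, array_size):
        self.client = Cache(client_size)
        self.array = Cache(array_size)

    def read(self, block):
        if self.client.hit(block):
            return "client hit"
        where = "array hit" if self.array.hit(block) else "disk"
        self.array.remove(block)          # keep the hierarchy exclusive
        victim = self.client.insert(block)
        if victim is not None:
            self.array.insert(victim)     # DEMOTE the ejected block to the array
        return where
```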
