Skip to main content
Article thumbnail
Location of Repository

Exploiting the performance gains of modern disk drives by enhancing data locality.

By Yuhui Deng


Due to the widening performance gap between RAM and disk drives, a large number of I/O optimization methods have been proposed and designed to alleviate the impact of this gap. One of the most effective approaches of improving disk access performance is enhancing data locality. This is because the method could increase the hit ratio of disk cache and reduce the seek time and rotational latency. Disk drives have experienced dramatic development since the first disk drive was announced in 1956. This paper investigates some important characteristics of modern disk drives. Based on the characteristics and the observation that data access on disk drives is highly skewed, the frequently accessed data blocks and the correlated data blocks are clustered into objects and moved to the outer zones of a modern disk drive. The idea attempts to enhance spatial locality, improve the efficiency of aggressive sequential prefetch, and take advantage of Zoned Bit Recording (ZBR). An experimental simulation is employed to investigate the performance gains generated by the enhanced data locality. The performance gains are analyzed by breaking down the disk access time into seek time, rotational latency, data transfer time, and hit ratio of the disk cache. Experimental results provide useful insights into the performance behaviours of a modern disk drive with enhanced data locality

Topics: Disk drive, Data locality, Data access pattern, Block correlation, Data migration, Performance
Publisher: Elsevier
Year: 2009
DOI identifier: 10.1016/j.ins.2009.02.002
OAI identifier:
Provided by: Cranfield CERES

Suggested articles


  1. (2000). A comparison of file system workloads, in:
  2. (1984). A fast file system for UNIX, doi
  3. (1995). Adaptive block rearrangement, doi
  4. (1998). Advances in disk technology: performance issues, doi
  5. (2007). An efficient algorithm for mining frequent inter-transaction patterns, doi
  6. (2002). Aqueduct: online data migration with performance guarantees, in:
  7. (2001). Blurring the line between oses and storage devices,
  8. (1994). Caching strategies to improve disk system performance, doi
  9. (2003). Characteristics of I/O traffic in personal computer and server workloads, doi
  10. Clustering active disk data to improve disk performance,
  11. (2000). Designing computer systems with MEMS-based storage, in: doi
  12. (2006). Disk drive level workload characterization, in: doi
  13. (1991). Disk shuffling, doi
  14. (1993). Disk subsystem load balancing: disk striping vs. conventional data placement, in: doi
  15. (2008). EED: energy efficient disk drive architecture, doi
  16. (1997). Embedded inodes and explicit grouping: exploiting disk bandwidth for small files, in:
  17. File size distribution on UNIX systems: then and now, doi
  18. (2002). Freeblock scheduling outside of disk firmware, in:
  19. (2005). FS2: dynamic data replication in free disk space for improving disk performance and energy consumption, in: doi
  20. (2007). Hierarchical clustering of mixed data based on distance hierarchy, doi
  21. (2008). Kernel class-wise locality preserving projection, doi
  22. Log-structured File System, doi
  23. (1991). Measurements of a distributed file system, in: doi
  24. (2005). Mining block correlations to improve storage performance, doi
  25. (2002). My Cache or yours? making storage more exclusive, in:
  26. (2003). Object-based storage, doi
  27. (1997). Observing the effects of multi-zone disks, in:
  28. (2006). Optimal clustering size of small file access in network attached storage device, Parallel Processing Letters, doi
  29. (1996). Reshaping access patterns for improving data locality, in:
  30. Seek distance dependent variable max VCM seek current to control thermal rise in VCM’s.
  31. (1971). Storage hierarchies: Gaps, cliffs, and trends, doi
  32. (2005). The automatic improvement of locality in storage systems, doi
  33. (2003). The DiskSim simulation environment version 3.0 reference manual,
  34. (2004). The performance impact of I/O optimizations and disk improvements, doi
  35. (1999). The processor-memory bottleneck: problems and solutions, doi
  36. Towards higher disk head utilization: extracting free bandwidth from busy disk drives, in:
  37. (2002). Track aligned extents: matching access patterns to disk drive characteristics,
  38. (2006). Understanding the performance-temperature interactions in disk I/O of server workloads, in: doi
  39. (1993). Unix disk access patterns, in:
  40. (2007). Zoned-RAID, doi

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.