Abstract – As we enter the era of large-scale Chip Multi-Processing (CMP) systems, evaluating architectures and projecting performance for commercial workloads on such systems is becoming increasingly important. One of the major areas of concern for Multi-Socket SMP systems has been the detrimental effects of Locking and Synchronization (L&S) overheads. However, the lower on-die interconnect latency and higher available bandwidth in CMP systems can change the effects of L&S dramatically. We wanted to analyze and study these effects. Towards this goal, we built a flexible, fast and accurate platform simulation framework called ManySim, and used this to study effects of locking and synchronization. We demonstrate that the CMP architecture outperforms the multi-socket architecture due to reduced L&S overheads, making the CMP architecture highly scalable and allowing it to almost reach the limits of Amdahl’s law. I
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.