4 research outputs found

    Gem5-X: A Gem5-Based System Level Simulation Framework to Optimize Many-Core Platforms

    Get PDF
    The rapid expansion of online-based services requires novel energy and performance efficient architectures to meet power and latency constraints. Fast architectural exploration has become a key enabler in the proposal of architectural innovation. In this paper, we present gem5-X, a gem5-based system level simulation framework, and a methodology to optimize many-core systems for performance and power. As real-life case studies of many-core server workloads, we use real-time video transcoding and image classification using convolutional neural networks (CNNs). Gem5-X allows us to identify bottlenecks and evaluate the potential benefits of architectural extensions such as in-cache computing and 3D stacked High Bandwidth Memory. For real-time video transcoding, we achieve 15% speed-up using in-order cores with in-cache computing when compared to a baseline in-order system and 76% energy savings when compared to an Out-of-Order system. When using HBM, we further accelerate real-time transcoding and CNNs by up to 7% and 8% respectively

    A selective approach for energy-aware video content adaptation decision-taking engine in android based smartphone

    Get PDF
    Rapid advancement of technology and their increasing affordability have transformed mobile devices from a means of communication to tools for socialization, entertainment, work and learning. However, advancement of battery technology and capacity is slow compared to energy need. Viewing content with high quality of experience will consume high power. In limited available energy, normal content adaptation system will decrease the content quality, hence reducing quality of experience. However, there is a need for optimizing content quality of experience (QoE) in a limited available energy. With modification and improvement, content adaptation may solve this issue. The key objective of this research is to propose a framework for energy-aware video content adaptation system to enable video delivery over the Internet. To optimise the QoE while viewing streaming video on a limited available smartphone energy, an algorithm for energy-aware video content adaptation decision-taking engine named EnVADE is proposed. The EnVADE algorithm uses selective mechanism. Selective mechanism means the video segmented into scenes and adaptation process is done based on the selected scenes. Thus, QoE can be improved. To evaluate EnVADE algorithm in term of energy efficiency, an experimental evaluation has been done. Subjective evaluation by selected respondents are also has been made using Absolute Category Rating method as recommended by ITU to evaluate EnVADE algorithm in term of QoE. In both evaluation, comparison with other methods has been made. The results show that the proposed solution is able to increase the viewing time of about 14% compared to MPEG-DASH which is an official international standard and widely used streaming method. In term of QoE subjective test, EnVADE algorithm score surpasses the score of other video streaming method. Therefore, EnVADE framework and algorithm has proven its capability as an alternative technique to stream video content with higher QoE and lower energy consumption

    Full-System Simulation of big.LITTLE Multicore Architecture for Performance and Energy Exploration

    No full text
    International audienceSingle-ISA heterogeneous multicore processors have gained increasing popularity with the introduction of recent technologies such as ARM big.LITTLE. These processors offer increased energy efficiency through combining low power in- order cores with high performance out-of-order cores. Efficiently exploiting this attractive feature requires careful management so as to meet the demands of targeted applications. In this paper, we explore the design of those architectures based on the ARM big.LITTLE technology by modeling performance and power in gem5 and McPAT frameworks. Our models are validated w.r.t. the Samsung Exynos 5 Octa (5422) chip. We show average errors of 20% in execution time, 13% for power consumption and 24% for energy-to-solution
    corecore