327 research outputs found

    A Survey of Techniques for Architecting TLBs

    Get PDF
    โ€œTranslation lookaside bufferโ€ (TLB) caches virtual to physical address translation information and is used in systems ranging from embedded devices to high-end servers. Since TLB is accessed very frequently and a TLB miss is extremely costly, prudent management of TLB is important for improving performance and energy efficiency of processors. In this paper, we present a survey of techniques for architecting and managing TLBs. We characterize the techniques across several dimensions to highlight their similarities and distinctions. We believe that this paper will be useful for chip designers, computer architects and system engineers

    ๋ฉ€ํ‹ฐ์ฝ”์–ด ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ์—์„œ์˜ ์ž๋™์ฐจ ์‹œ์Šคํ…œ์„ ์œ„ํ•œ ๊ธฐ๋Šฅ์ /์‹œ๊ฐ„์  ์ •ํ™•์„ฑ ๋ณด์žฅ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ธฐ๋ฒ•

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (์„์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€, 2019. 2. ์ด์ฐฝ๊ฑด.This dissertation presents functionally and temporally correct simulation method for cyber-side of an automotive system on multicore simulator. To overcome the limitations of the existing simulation methods which do not correctly model temporal behaviours such as varying execution times and task preemptions, the novel simulation technique assuming single core simulator was proposed. In this work, we extend the single core simulator to the multicore while keeping all of the proposed key ideas to guarantee correct simulation. We introduce heuristic task partitioning algorithm based on memory usages and approximated task-wise blockings of simulated tasks. As a result, we could improve up to 97%p, 15%p of simulation capacity compared to the single core, and other task partitioning algorithms, respectively.๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ๋ฉ€ํ‹ฐ ์ฝ”์–ด ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ์ž๋™์ฐจ ์‚ฌ์ด๋ฒ„-๋ฌผ๋ฆฌ ์‹œ์Šคํ…œ์˜ ์‚ฌ์ด๋ฒ„ ์‹œ์Šคํ…œ์„ ๊ธฐ๋Šฅ์ /์‹œ๊ฐ„์ ์œผ๋กœ ์ •ํ™•ํ•˜๊ฒŒ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ํ•˜๊ธฐ ์œ„ํ•œ ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. ์•ž์„  ์—ฐ๊ตฌ์—์„œ๋Š” ์‹œ์Šคํ…œ์˜ ๊ธฐ๋Šฅ์  ํ–‰ํƒœ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ ํƒœ์Šคํฌ์˜ ๊ฐ€๋ณ€ ์ˆ˜ํ–‰ ์‹œ๊ฐ„, ์ž์› ์„ ์  ๋“ฑ๊ณผ ๊ฐ™์€ ์‹œ๊ฐ„์  ํ–‰ํƒœ ์—ญ์‹œ ํ•จ๊ป˜ ์ •ํ™•ํžˆ ๋ชจ์‚ฌํ•˜๊ธฐ ์œ„ํ•œ ์ƒˆ๋กœ์šด ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ธฐ๋ฒ•๋“ค์ด ์ œ์•ˆ๋˜์—ˆ๋‹ค. ์•ž์„  ์—ฐ๊ตฌ์—์„œ ์ œ์•ˆ๋œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๊ธฐ๋ฒ•๋“ค์ด ์‹ฑ๊ธ€์ฝ”์–ด ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ๋งŒ์„ ๊ฐ€์ •ํ•˜๊ณ  ์žˆ๋‹ค๋Š” ์ ์— ์ฐฉ์•ˆํ•˜์—ฌ ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์ •ํ™•ํ•œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์„ ๋ณด์žฅํ•˜๊ธฐ ์œ„ํ•ด ์ œ์•ˆ๋˜์—ˆ๋˜ ๊ธฐ์กด ์—ฐ๊ตฌ์˜ ์ฃผ์š” ์•„์ด๋””์–ด๋ฅผ ๋ชจ๋‘ ์œ ์ง€ํ•˜๋ฉด์„œ ์‹ฑ๊ธ€์ฝ”์–ด ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ๋ฅผ ๋ฉ€ํ‹ฐ์ฝ”์–ด ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ๋กœ ํ™•์žฅํ•œ๋‹ค. ์ œ์•ˆํ•˜๋Š” ๋ฐฉ๋ฒ•์—์„œ๋Š” ๊ฐ ํƒœ์Šคํฌ์˜ ๋ฉ”๋ชจ๋ฆฌ ์‚ฌ์šฉ๋Ÿ‰๊ณผ ๊ทผ์‚ฌํ™” ๋œ ํƒœ์Šคํฌ ๊ฐ„ ๋ธ”๋กœํ‚น ๊ฐ’์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ๋Œ€์ƒ ํƒœ์Šคํฌ์— ๋Œ€ํ•œ ํœด๋ฆฌ์Šคํ‹ฑ ํƒœ์Šคํฌ ๋ถ„ํ•  ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์„ค๊ณ„ํ•œ๋‹ค. ๋˜ํ•œ, ์ž„์˜์ ์œผ๋กœ ์ƒ์„ฑํ•œ ๋‹ค์ˆ˜์˜ ์›Œํฌ๋กœ๋“œ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์„ฑ๋Šฅ์„ ์ธก์ •ํ•˜๊ณ , ์ด๋ฅผ ํ†ตํ•ด ์ œ์•ˆํ•˜๋Š” ๋ฐฉ๋ฒ•์ด ์‹ฑ๊ธ€์ฝ”์–ด ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ ๋ฐ ๋‹ค๋ฅธ ํƒœ์Šคํฌ ๋ถ„ํ•  ์•Œ๊ณ ๋ฆฌ์ฆ˜์— ๋น„ํ•ด ๊ฐ๊ฐ ์ตœ๋Œ€ 97%p, 15%p์˜ ํ–ฅ์ƒ๋œ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์šฉ๋Ÿ‰์„ ๊ฐ–๋Š” ๊ฒƒ์„ ๋ณด์ธ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ ์ œ์•ˆํ•˜๋Š” ๋ฉ€ํ‹ฐ์ฝ”์–ด ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ๋Š” ์•ž์„  ์—ฐ๊ตฌ์—์„œ ์ œ์•ˆ๋˜์—ˆ๋˜ ๊ธฐ๋Šฅ์ /์‹œ๊ฐ„์  ์ •ํ™•์„ฑ์„ ๋™์ผํ•˜๊ฒŒ ๋ณด์žฅํ•จ๊ณผ ๋™์‹œ์— ๋ณด๋‹ค ๋†’์€ ์‹œ๋ฎฌ๋ ˆ์ด์…˜ ์šฉ๋Ÿ‰์„ ์ œ๊ณตํ•จ์œผ๋กœ์จ ์ „์ฒด ์ž๋™์ฐจ ์‹œ์Šคํ…œ์˜ ์‹œ๋ฎฌ๋ ˆ์ด์…˜์— ํšจ๊ณผ์ ์œผ๋กœ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค.1 Introduction 1 1.1 Motivation and Objective 1 1.2 Approach 1 1.3 Organization 2 2 Related Work 3 2.1 Model-Based Simulations 3 2.2 Real-time Execution Platforms 3 2.3 Functionally and Temporally Correct Simulations 3 3 Background 5 3.1 Description on the real cyber-system 5 3.2 Description on the simulated cyber-system 7 3.3 Idea of Functionally and Temporally Correct Simulation 9 4 Problem Description 12 4.1 Keeping the key ideas of the single core simulator 12 4.2 Maximally utilizing the multicore 13 5 Proposed Approach 15 5.1 Memory constraint 15 5.2 The Smallest-blocking-first heuristic 18 5.2.1 Intuition of Smallest-blocking-first algorithm 19 5.2.2 Finding the Expected Earliest Start Time 20 5.2.3 Finding the Expected Latest Finish Time 22 5.2.4 Weighting the [EEST, ELFT] intervals 25 6 Evaluation 28 6.1 Simulatability according to the number of cores 28 6.2 Simulatability according to the partitioning method 30 6.3 Simulatability according to the physical read/write task ratio 31 7 Conclusion 35 References 37Maste

    Dynamic Energy and Thermal Management of Multi-Core Mobile Platforms: A Survey

    Get PDF
    Multi-core mobile platforms are on rise as they enable efficient parallel processing to meet ever-increasing performance requirements. However, since these platforms need to cater for increasingly dynamic workloads, efficient dynamic resource management is desired mainly to enhance the energy and thermal efficiency for better user experience with increased operational time and lifetime of mobile devices. This article provides a survey of dynamic energy and thermal management approaches for multi-core mobile platforms. These approaches do either proactive or reactive management. The upcoming trends and open challenges are also discussed

    High-Performance and Time-Predictable Embedded Computing

    Get PDF
    Nowadays, the prevalence of computing systems in our lives is so ubiquitous that we live in a cyber-physical world dominated by computer systems, from pacemakers to cars and airplanes. These systems demand for more computational performance to process large amounts of data from multiple data sources with guaranteed processing times. Actuating outside of the required timing bounds may cause the failure of the system, being vital for systems like planes, cars, business monitoring, e-trading, etc. High-Performance and Time-Predictable Embedded Computing presents recent advances in software architecture and tools to support such complex systems, enabling the design of embedded computing devices which are able to deliver high-performance whilst guaranteeing the application required timing bounds. Technical topics discussed in the book include: Parallel embedded platforms Programming models Mapping and scheduling of parallel computations Timing and schedulability analysis Runtimes and operating systems The work reflected in this book was done in the scope of the European project P SOCRATES, funded under the FP7 framework program of the European Commission. High-performance and time-predictable embedded computing is ideal for personnel in computer/communication/embedded industries as well as academic staff and master/research students in computer science, embedded systems, cyber-physical systems and internet-of-things.info:eu-repo/semantics/publishedVersio
    • โ€ฆ
    corecore