
    Optimal Eviction Policies for Stochastic Address Traces

    The eviction problem for memory hierarchies is studied under the Hidden Markov Reference Model (HMRM) of the memory trace, showing how miss minimization can be naturally formulated in the optimal control setting. In addition to the traditional version assuming a buffer of fixed capacity, a relaxed version is also considered, in which buffer occupancy can vary and only its average is constrained. Resorting to multiobjective optimization, viewing occupancy as a cost rather than as a constraint, the optimal eviction policy is obtained by composing solutions for the individual addressable items. This approach is then specialized to the Least Recently Used Stack Model (LRUSM), a type of HMRM often considered for traces, which has V-1 parameters, where V is the size of the virtual space. A gain-optimal policy for any target average occupancy is obtained which (i) is computable in time O(V) from the model parameters, (ii) is optimal also for the fixed-capacity case, and (iii) is characterized in terms of priorities, and is hence termed the Least Profit Rate (LPR) policy. An O(log C) upper bound (where C is the buffer capacity) is derived for the ratio between the expected miss rate of LPR and that of OPT, the optimal offline policy; the bound tightens to O(1) under reasonable constraints on the LRUSM parameters. Using the stack-distance framework, an algorithm is developed to compute the number of misses incurred by LPR on a given input trace, simultaneously for all buffer capacities, in time O(log V) per access. Finally, some results are provided for miss minimization over a finite horizon and over an infinite horizon under bias optimality, a criterion more stringent than gain optimality. Comment: 37 pages, 3 figures
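
    The stack-distance framework mentioned above can be illustrated with a minimal Mattson-style simulation: one pass over the trace yields miss counts for all buffer capacities at once. This naive sketch handles the classical LRU policy at O(V) cost per access; the paper's algorithm extends the idea to LPR and reaches O(log V) per access with an augmented search tree, which this sketch does not attempt.

```python
# Naive Mattson-style stack simulation: a single pass over the trace
# yields miss counts for ALL buffer capacities. Illustrative only: it
# simulates LRU (not LPR) and costs O(V) per access, not O(log V).
from collections import Counter

def miss_counts_all_capacities(trace):
    stack = []             # most recently used item sits at index 0
    dist_hist = Counter()  # histogram of stack distances (inf = cold miss)
    for x in trace:
        if x in stack:
            d = stack.index(x)   # 0-based stack distance
            stack.remove(x)
        else:
            d = float('inf')     # first reference: always a miss
        stack.insert(0, x)
        dist_hist[d] += 1
    # A buffer of capacity C misses exactly on accesses with distance >= C.
    return {c: sum(n for d, n in dist_hist.items() if d >= c)
            for c in range(1, len(stack) + 1)}

print(miss_counts_all_capacities("abcabdab"))  # {1: 8, 2: 8, 3: 4, 4: 4}
```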

    Virtual Memory on a Many-Core NoC

    Many-core devices are likely to become increasingly common in real-time and embedded systems as computational demands grow and as expectations for higher performance can generally only be met by increasing core numbers rather than relying on higher clock speeds. Network-on-chip (NoC) devices, where multiple cores share a single slice of silicon and employ packetised communications, are a widely-deployed many-core option for system designers. As NoCs are expected to run larger and more complex programs, the small amount of fast, on-chip memory available to each core is unlikely to be sufficient for all but the simplest of tasks, and it is necessary to find an efficient, effective, and time-bounded means of accessing resources stored in off-chip memory, such as DRAM or Flash storage. The abstraction of paged virtual memory is a familiar technique for managing similar tasks in general computing, but it has often been shunned by real-time developers because of concerns about time predictability. We show it can be a poor choice for a many-core NoC system as, unmodified, it typically uses page sizes optimised for interaction with spinning disks rather than solid-state media, and transports significant volumes of subsequently unused data across already congested links. In this work we outline and simulate an efficient partial paging algorithm in which only those memory resources that are locally accessed are transported between global and local storage. We further show that smaller page sizes add to efficiency. We examine the factors that lead to timing delays in such systems, and show that we can predict worst-case execution times, even at safety-critical thresholds, by using statistical methods from extreme value theory. We also show these results are applicable to systems with a variety of connections to memory.
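
    The extreme-value step can be sketched as follows: fit a Generalized Extreme Value (GEV) distribution to block maxima of measured execution times and read a high quantile off the fitted tail as a probabilistic WCET estimate. The timing samples, block size, and exceedance probability below are illustrative assumptions, not the thesis's measurement campaign.

```python
# Hedged sketch of EVT-based probabilistic WCET estimation: fit a GEV
# to block maxima of timing samples, then take a high quantile.
# The samples are synthetic stand-ins for real measurements.
import numpy as np
from scipy.stats import genextreme

rng = np.random.default_rng(0)
samples = rng.gamma(shape=9.0, scale=10.0, size=100_000)  # fake exec times (us)

block = 100  # illustrative block size for block-maxima extraction
maxima = samples[: len(samples) // block * block].reshape(-1, block).max(axis=1)

shape, loc, scale = genextreme.fit(maxima)

# pWCET at an (assumed) exceedance probability of 1e-6 per block of runs.
p_exceed = 1e-6
pwcet = genextreme.ppf(1.0 - p_exceed, shape, loc=loc, scale=scale)
print(f"GEV shape={shape:.3f}, pWCET estimate ~ {pwcet:.1f} us")
```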

    Multiobjective Optimization of the Power Flow Control of Hybrid Electric Power Train Systems within Simulation and Experimental Emulation Applications

    In this thesis, the power flow control of hybrid electric power train systems is discussed with a focus on multiobjective optimization; goals and related algorithms, based on different control optimization methods, are developed and applied within simulation and experimental environments. Based on the basic relations of hybrid power train systems, an improved technique for the experimental realization and evaluation of these systems is developed, and the related Hardware-in-the-Loop (HiL) hybrid electric power train emulation system is demonstrated. It is shown that this emulation technique is suitable for a more generalized view of power train structures (considering the components as power sources, sinks, transmission elements, storage elements, etc.) and their power flow control. The principal applicability of the system is demonstrated using the example of a hybrid electric vehicle as well as other system technologies such as hybrid hydraulic power trains and wind energy conversion systems. The core of the thesis is the discussion, development, application, and evaluation of power flow control optimization algorithms. The considered power flow control techniques are realized within a multiobjective framework, using drivability, fuel economy, and component lifetime as the system requirements to be optimized during operation. From these requirements results a multiobjective control optimization problem, consisting of a suitable combination of the known control goals of power management, energy management, and lifetime management. After a discussion of the principal influences of the power flow control on the different performance properties, the application of different control optimization techniques is discussed, using the example of a fuel cell/supercapacitor-based hybrid electric power train system including braking energy recovery. As control optimization methods, parameter optimization techniques are applied first: an embedded online optimization based on a Golden Section search and an offline optimization based on global optimization methods are discussed and applied. Furthermore, direct optimization techniques based on Dynamic Programming (DP) and Model Predictive Control (MPC) are realized. Subsequently, an Instantaneous Optimality (IO)-based technique, consisting of a lookup-table-based time-invariant feedback controller, is developed. All methods lead to suitable results and significant improvement of the control performance. A concluding overview of the methods and their application-dependent strengths and weaknesses is provided.
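
    The Golden Section search behind the embedded online optimization is a standard derivative-free line search over a unimodal cost; a minimal sketch follows, with a hypothetical fuel-cost curve over the fuel-cell/supercapacitor power-split ratio standing in for the thesis's actual plant model.

```python
# Golden Section search sketch for tuning a scalar control parameter.
# The cost function is a hypothetical stand-in, not the thesis's model.
import math

INV_PHI = (math.sqrt(5) - 1) / 2  # 1/phi ~ 0.618

def golden_section_min(cost, lo, hi, tol=1e-4):
    a, b = lo, hi
    c = b - INV_PHI * (b - a)  # left probe
    d = a + INV_PHI * (b - a)  # right probe
    while abs(b - a) > tol:
        if cost(c) < cost(d):
            b, d = d, c                 # minimum lies in [a, d]
            c = b - INV_PHI * (b - a)
        else:
            a, c = c, d                 # minimum lies in [c, b]
            d = a + INV_PHI * (b - a)
    return (a + b) / 2

# Hypothetical cost: fuel consumption penalized for straying from an
# efficient split between fuel cell and supercapacitor power.
fuel_cost = lambda split: (split - 0.63) ** 2 + 0.1 * split
print(f"optimal power-split ratio ~ {golden_section_min(fuel_cost, 0.0, 1.0):.3f}")
```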

    Cellular networks for smart grid communication

    The next-generation electric power system, known as the smart grid, relies on a robust and reliable underlying communication infrastructure to improve the efficiency of electricity distribution. Cellular networks, e.g., LTE/LTE-A systems, appear to be a promising technology to facilitate the smart grid evolution. Their inherent performance characteristics and well-established ecosystem could potentially unlock unprecedented use cases, enabling real-time and autonomous distribution grid operations. However, cellular technology was not originally intended for smart grid communication, which is associated with highly reliable message exchange and massive device connectivity requirements. The fundamental differences between smart grid and human-type communication challenge the classical design of cellular networks and introduce important research questions that have not been sufficiently addressed so far. Motivated by these challenges, this doctoral thesis investigates novel radio access network (RAN) design principles and performance analysis for the seamless integration of smart grid traffic in future cellular networks. Specifically, we focus on the fundamental RAN problems of network scalability in massive smart grid deployments and radio resource management for smart grid and human-type traffic. The main objective of the thesis lies in the design, analysis, and performance evaluation of RAN mechanisms that would render cellular networks the key enabler for emerging smart grid applications. The first part of the thesis addresses the radio access limitations of LTE-based networks for reliable and scalable smart grid communication. We first identify the congestion problem in LTE random access that arises in large-scale smart grid deployments. To overcome this, a novel random access mechanism is proposed that can efficiently support real-time distribution automation services with negligible impact on the background traffic. Motivated by the stringent reliability requirements of various smart grid operations, we then develop an analytical model of the LTE random access procedure that allows us to assess the performance of event-based monitoring traffic under various load conditions and network configurations. We further extend our analysis to include the relation between the cell size and the availability of orthogonal random access resources, and we identify an additional challenge for reliable smart grid connectivity. To this end, we devise an interference- and load-aware cell planning mechanism that enhances reliability in substation automation services. Finally, we couple the problem of state estimation in wide-area monitoring systems with the reliability challenges in information acquisition. Using our analytical framework, we quantify the impact of imperfect communication reliability on state estimation accuracy and provide useful insights for the design of reliability-aware state estimators. The second part of the thesis builds on the first and focuses on the RAN problem of resource scheduling and sharing for smart grid and human-type traffic. We introduce a novel scheduler that achieves low latency for distribution automation traffic while performing resource allocation in a way that keeps the degradation of cellular users at a minimum level. In addition, we investigate the benefits of Device-to-Device (D2D) transmission mode for event-based message exchange in substation automation scenarios. We design a joint mode selection and resource allocation mechanism that results in higher data rates with respect to the conventional transmission mode via the base station. An orthogonal resource partitioning scheme between cellular and D2D links is further proposed to prevent underutilization of the scarce cellular spectrum. The research findings of this thesis aim to deliver novel solutions to important RAN performance issues that arise when cellular networks support smart grid communication.
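
    The random access congestion problem can be sketched numerically: if N devices contend in the same random access opportunity and each picks one of M orthogonal preambles uniformly at random, a tagged device succeeds only when no other device picks its preamble. This back-of-the-envelope model (M = 54 contention preambles is a typical LTE figure, assumed here) is not the thesis's analytical framework, but it shows why large-scale deployments congest the channel.

```python
# Simple collision model for LTE contention-based random access.
# M = 54 contention preambles is an assumed, typical LTE value.
def preamble_success_prob(n_devices, n_preambles=54):
    # P(no other device picks the tagged device's preamble) = (1 - 1/M)^(N-1)
    return (1 - 1 / n_preambles) ** (n_devices - 1)

for n in (10, 50, 100, 500, 1000):
    print(f"N={n:5d} devices -> success prob {preamble_success_prob(n):.3f}")
```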

    Virtualization techniques for memory resource exploitation

    Cloud infrastructures have become indispensable in our daily lives with the rise of cloud-based services offered by companies like Facebook, Google, Amazon, and many others. These cloud infrastructures use a large number of servers provisioned with their own computing resources. Each of these servers uses a piece of software, called the hypervisor (HV), that allows it to create multiple virtual instances of the server's physical computing resources and abstract them into "Virtual Machines" (VMs). A VM runs an Operating System, which in turn runs the applications. The VMs within the servers generate varying memory demand behavior. When the demand increases, costly operations such as (virtual) disk accesses and/or VM migrations can occur. As a result, it is necessary to optimize the utilization of the local memory resources within a single computing server. However, pressure on the memory resources can still increase, making it necessary to migrate the VM to a different server with more memory or to add memory to the same server. At this point, it is important to consider that some of the servers in the cloud infrastructure might have memory resources they are not using. Considering the possibility of making this memory available to the servers that need it, new architectures have been introduced that provide hardware support enabling servers to share their memory capacity. This thesis presents multiple contributions to the memory management problem. First, it addresses the problem of optimizing memory resources in a virtualized server through different types of memory abstractions. Two full contributions, called SmarTmem and CARLEMM, are presented for managing memory within a single server. In this respect, a third contribution, called CAVMem, is also presented, which works as the foundation for CARLEMM. Second, this thesis presents two contributions for memory capacity aggregation across multiple servers, offering two mechanisms called GV-Tmem and vMCA, the latter based on GV-Tmem but with significant enhancements. These mechanisms distribute the server's total memory within a single server and globally across computing servers, using a user-space process with high-level memory management policies.
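
    As a rough illustration of the kind of high-level policy such a user-space memory manager could apply, the sketch below grants each VM a baseline reservation and splits the remaining memory in proportion to unmet demand. It is a hypothetical stand-in, not the actual SmarTmem, CARLEMM, GV-Tmem, or vMCA policy.

```python
# Hypothetical user-space policy: baseline reservation per VM, then
# proportional sharing of the spare capacity by unmet demand.
def distribute_memory(total_mb, demands_mb, baseline_mb=256):
    vms = list(demands_mb)
    # Every VM gets its baseline (or its full demand, if smaller).
    grants = {vm: min(baseline_mb, demands_mb[vm]) for vm in vms}
    spare = total_mb - sum(grants.values())
    unmet = {vm: demands_mb[vm] - grants[vm] for vm in vms}
    total_unmet = sum(unmet.values())
    if total_unmet > 0 and spare > 0:
        for vm in vms:  # proportional share of the spare capacity
            grants[vm] += min(unmet[vm], spare * unmet[vm] // total_unmet)
    return grants

print(distribute_memory(8192, {"vm1": 4096, "vm2": 1024, "vm3": 512}))
```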
