6,771 research outputs found

    Eliminating the call stack to save RAM

    Most programming languages support a call stack in the programming model and also in the runtime system. We show that for applications targeting low-power embedded microcontrollers (MCUs), RAM usage can be significantly decreased by partially or completely eliminating the runtime call stack. We present flattening, a transformation that absorbs a function into its caller, replacing function invocations and returns with jumps. Unlike inlining, flattening does not duplicate the bodies of functions that have multiple call sites. Applied aggressively, flattening results in stack elimination. Flattening is most useful in conjunction with a lifting transformation that moves global variables into a local scope. Flattening and lifting can save RAM. However, even more benefit can be obtained by adapting the compiler to cope with properties of flattened code. First, we show that flattening adds false paths that confuse a standard live variables analysis. The resulting problems can be mitigated by breaking spurious live-range conflicts between variables using information from the unflattened callgraph. Second, we show that the impact of high register pressure due to flattened and lifted code, and consequent spills out of the register allocator, can be mitigated by improving a compiler's stack layout optimizations. We have implemented both of these improvements in GCC, and have implemented flattening and lifting as source-to-source transformations. On a collection of applications for the AVR family of 8-bit MCUs, we show that total RAM usage can be reduced by 20% by compiling flattened and lifted programs with our improved GCC.
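
    As a rough illustration of the transformation described above (not the paper's actual tool, which is implemented as a source-to-source transformation and inside GCC), the following C sketch shows one way a small callee can be absorbed into its caller: each call becomes a jump into a single shared copy of the body, and the return becomes a jump back to the call site selected by a small tag variable rather than by a return address on the stack. All names here are invented for illustration.

        #include <stdint.h>

        static volatile uint8_t sink;            /* stands in for real output */
        static void use(uint8_t v) { sink = v; }

        /* Original form: two calls to the same small helper. */
        static uint8_t scale(uint8_t x) { return (uint8_t)(x * 3); }

        void task(void) {
            uint8_t a = scale(2);
            uint8_t b = scale(5);
            use((uint8_t)(a + b));
        }

        /* Flattened form: scale() is absorbed into its caller.  Each call is
         * a jump into the single shared copy of the body, and the "return" is
         * a jump back to the correct call site, chosen by a small tag
         * variable instead of a return address pushed on the call stack. */
        void task_flattened(void) {
            uint8_t a, b, arg, result;
            uint8_t ret_site;

            arg = 2; ret_site = 0; goto scale_body;
        ret0:
            a = result;

            arg = 5; ret_site = 1; goto scale_body;
        ret1:
            b = result;

            use((uint8_t)(a + b));
            return;

        scale_body:                              /* one copy, unlike inlining */
            result = (uint8_t)(arg * 3);
            if (ret_site == 0) goto ret0;
            goto ret1;
        }

    Because the body is not duplicated, this differs from inlining, and because no return address is pushed, the call contributes nothing to the runtime stack.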

    Investigation of fast initialization of spacecraft bubble memory systems

    Bubble domain technology offers significant improvement in reliability and functionality for spacecraft onboard memory applications. In potential memory system organizations, minimizing power in high-capacity bubble memory systems requires activating only the desired portions of the memory. Power strobing arbitrary memory segments therefore requires a fast turn-on capability. Bubble device architectures that provide redundant loop coding in the bubble devices limit the initialization speed. Alternate initialization techniques are investigated to overcome this design limitation. An initialization technique using a small amount of external storage is demonstrated.
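
    The abstract gives no implementation details, but one plausible reading is that the map of good minor loops, normally recovered slowly from the device's own redundant-loop (boot) coding, is cached in a small external store so a power-strobed segment can be activated quickly. The C sketch below is only a hypothetical illustration of that idea; the constants and device hooks are invented.

        #include <stdint.h>

        #define NUM_SEGMENTS   8                 /* hypothetical segment count          */
        #define LOOPS_PER_SEG  272               /* hypothetical minor loops per segment */
        #define MASK_BYTES     (LOOPS_PER_SEG / 8)

        /* Good-loop masks cached in a small external store, so a power-strobed
         * segment can be brought up without re-reading the device's redundant
         * loop coding every time it is activated. */
        static uint8_t ext_loop_mask[NUM_SEGMENTS][MASK_BYTES];

        /* Stand-ins for real hardware access; an actual system would talk to
         * the bubble memory controller here. */
        static void bubble_power_on(int seg)                         { (void)seg; }
        static void bubble_load_loop_mask(int seg, const uint8_t *m) { (void)seg; (void)m; }

        /* Fast initialization of one segment: apply power and load the cached
         * mask instead of reading the boot loop from the bubble device itself. */
        void bubble_fast_init(int seg)
        {
            bubble_power_on(seg);
            bubble_load_loop_mask(seg, ext_loop_mask[seg]);
        }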

    Control-flow Integrity for Real-time Embedded Systems

    As embedded systems become more connected and more ubiquitous in mission- and safety-critical systems, embedded devices have become a high-value target for hackers and security researchers. Attacks on real-time embedded systems software can put lives in danger and put our critical infrastructure at risk. Despite this, security techniques for embedded systems have not been widely studied. Many existing software security techniques for general purpose computers rely on assumptions that do not hold in the embedded case. This thesis focuses on one such technique, control-flow integrity (CFI), which has been vetted as an effective countermeasure against control-flow hijacking attacks on general purpose computing systems. Without the process isolation and fine-grained memory protections provided by a general purpose computer with a rich operating system, CFI cannot provide any security guarantees. This thesis explores a way to use CFI on ARM Cortex-R devices running minimal real-time operating systems. We provide techniques for protecting runtime structures, isolating processes, and instrumenting compiled ARM binaries with CFI protection.
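
    As a sketch of the kind of check CFI instrumentation inserts (not the thesis's actual ARM Cortex-R instrumentation, which operates on compiled binaries), the C fragment below validates the target of an indirect call against a statically known set of allowed entry points before transferring control. All names are invented for illustration.

        #include <stdint.h>
        #include <stddef.h>

        typedef void (*handler_t)(void);

        static void handler_a(void) { /* ... */ }
        static void handler_b(void) { /* ... */ }

        /* Allowed targets for this particular indirect call site. */
        static handler_t const allowed_targets[] = { handler_a, handler_b };

        static void cfi_violation(void)
        {
            for (;;) { /* halt or reset: a hijacked control transfer was detected */ }
        }

        static void dispatch(handler_t fn)
        {
            /* Forward-edge CFI check: only proceed if fn is in the allowed set. */
            for (size_t i = 0; i < sizeof allowed_targets / sizeof allowed_targets[0]; i++) {
                if (fn == allowed_targets[i]) {
                    fn();                        /* target verified, transfer control */
                    return;
                }
            }
            cfi_violation();
        }

        void isr_entry(uint8_t which)            /* example caller of the checked call */
        {
            dispatch(which ? handler_b : handler_a);
        }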

    Selection from read-only memory with limited workspace

    Given an unordered array of $N$ elements drawn from a totally ordered set and an integer $k$ in the range from $1$ to $N$, in the classic selection problem the task is to find the $k$-th smallest element in the array. We study the complexity of this problem in the space-restricted random-access model: the input array is stored on read-only memory, and the algorithm has access to a limited amount of workspace. We prove that the linear-time prune-and-search algorithm (presented in most textbooks on algorithms) can be modified to use $\Theta(N)$ bits instead of $\Theta(N)$ words of extra space. Prior to our work, the best known algorithm by Frederickson could perform the task with $\Theta(N)$ bits of extra space in $O(N \lg^{*} N)$ time. Our result separates the space-restricted random-access model and the multi-pass streaming model, since we can surpass the $\Omega(N \lg^{*} N)$ lower bound known for the latter model. We also generalize our algorithm for the case when the size of the workspace is $\Theta(S)$ bits, where $\lg^3 N \leq S \leq N$. The running time of our generalized algorithm is $O(N \lg^{*}(N/S) + N (\lg N) / \lg S)$, slightly improving over the $O(N \lg^{*}(N (\lg N)/S) + N (\lg N) / \lg S)$ bound of Frederickson's algorithm. To obtain the improvements mentioned above, we developed a new data structure, called the wavelet stack, that we use for repeated pruning. We expect the wavelet stack to be a useful tool in other applications as well. Comment: 16 pages, 1 figure, Preliminary version appeared in COCOON-201
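
    For orientation, the sketch below is the classic in-place selection baseline: quickselect, with a random pivot standing in for the textbook median-of-medians rule. It is not the paper's algorithm; it freely permutes its input, whereas the paper achieves selection with the input held in read-only memory and roughly $\Theta(N)$ bits of workspace, using the wavelet stack for repeated pruning.

        #include <stddef.h>
        #include <stdlib.h>

        static void swap(int *a, int *b) { int t = *a; *a = *b; *b = t; }

        /* Returns the k-th smallest element (k is 1-based, 1 <= k <= n) of
         * v[0..n-1].  The array is permuted in place, which the read-only
         * model of the paper does not allow. */
        int quickselect(int *v, size_t n, size_t k)
        {
            size_t lo = 0, hi = n - 1;
            for (;;) {
                if (lo == hi)
                    return v[lo];

                /* Pivot: a random element; the deterministic textbook variant
                 * uses the median-of-medians instead. */
                size_t p = lo + (size_t)rand() % (hi - lo + 1);
                swap(&v[p], &v[hi]);

                /* Partition around the pivot. */
                size_t store = lo;
                for (size_t i = lo; i < hi; i++)
                    if (v[i] < v[hi])
                        swap(&v[i], &v[store++]);
                swap(&v[store], &v[hi]);         /* pivot now at index 'store' */

                size_t rank = store - lo + 1;    /* pivot's rank within [lo, hi] */
                if (k == rank)
                    return v[store];
                if (k < rank) {
                    hi = store - 1;              /* prune the right part */
                } else {
                    k -= rank;                   /* prune the left part and the pivot */
                    lo = store + 1;
                }
            }
        }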