Abstract-There is an increasing concern to reduce the cost and overheads during the development of reliable systems. Selective protection of most critical parts of the systems represents a viable solution to obtain a high level of reliability at a fraction of the cost. In particular to design a selective fault mitigation strategy for processor-based systems, it is mandatory to identify and prioritize the most vulnerable registers in the register file as best candidates to be protected (hardened).This paper presents an application-based metric to estimate the criticality of each register from the microprocessor register file in microprocessor-based systems. The proposed metric relies on the combination of three different criteria based on common features of executed applications. The applicability and accuracy of our proposal have been evaluated in a set of applications running in different microprocessors. Results show a significant improvement in accuracy comparing to previous approaches and regardless the underlying architecture.
INTRODUCTION
Technological scaling is posing major challenges on the development of reliable systems, e.g.: voltage and temperature variability, sensitivity to soft errors and electromagnetic interferences, accelerated degradation as aging [1] . These challenges may cause timing faults and Single Event Effects (SEEs), which provoke permanent or temporary effects over the electronic components operation, increasingly affecttingreliability.Therefore, fault tolerant design has become a mandatory issue for an increasing number of application domains, including: space, avionics, automotive, defense, medicine, and communications [2] .
A wide spectrum of design techniques has emerged to overcome these problems. Traditionally, fault tolerant design has relied mainly on expensive and power costly approaches based on hardware redundancy [3] . More recently, thanks to the proliferation of processor-based systems and the need for reliable low-cost solutions, a large number of techniques based on redundant software have been proposed [4, 25] . However, they cause non-negligible overheads in terms of code size, execution time, and data that designers have to cope with [5] .
To reduce the costs and overheads inherent to the protection, whether hardware or software, recent approaches propose to use a selective protection strategy (selective hardening). It consists of protecting only the most critical parts of the designs. This partial protection can be achieved by means of selective redundancy applied to: hardware [6] [7] , software [8] [9] , or by means of hybrid hardware/software approaches [10] [11] .
In particular for processor-based systems, a well-known strategy consists of protecting only the most critical registers in the microprocessor register file [8] [9] [10] [11] .Thus, it is evident the necessity to identify the most critical registers properly during an early design stage in order to facilitate the selection of the hardened set of registers. A proper selection of registers determines to achieve good efficiency/cost tradeoffs for the designed solution and, at the same time, permits to explore the design space effectively, avoiding costly explorations guided by brute force strategies [24] .
Furthermore, early estimation of the register file criticality is crucial in order to quickly obtain an evaluation before the system is fully implemented and fault injections can be performed. These estimations are strongly needed during the design phase in order to properly develop a hardening strategy that better fulfills the reliability requirements and system constraints.
In this context, our work presents a metric to estimate the ---------------- criticality of each register from the microprocessor register file, which allows a fine-grained analysis. It is based on dynamic code analysis (profiling), using the low-level (assembly) source code of the program. In addition, the proposed metric facilitates the selection (and prioritization) of registers to be hardened when a selective approach is required.
As in the original work of Bergaoui et al. [12] , our metric is based on the combination of three different criteria: lifetime, weight in conditional branches, and functional dependencies. The novelty and contributions of our proposal can be summarized as follows:  Dynamic code analysis: unlike the compilation-time method used in [12] (static code analysis), we propose to use dynamic measurements for the computation algorithms during run-time (simulation-time). This kind of assessment does not represent any inconvenience in the usual design-flow of embedded systems, and alternatively, improves significantly the accuracy of the estimations.  Effective lifetime: the modulation of the role of the register lifetime using diverse considerations not taken into account in previous works. These include the consideration of the total lifetime as the sum of several lifetime intervals, and the correct calculation of those intervals (Section 3.1).  Normalization: all criteria have to be normalized with respect to the total number of executed instructions. For instance, in the case of the weight in conditional branches criterion, this consideration permits to assign a real contribution to the criticality estimation.  Direct descendants: counting dynamically only its direct descendants during the program execution to avoid counting erroneously n-levels descendants. Two case studies have been explored and discussed. To validate the applicability of the proposed metric, each case study targets a different microprocessor and benchmark. To corroborate the accuracy of the results, criticality estimations have been analyzed and compared with the results obtained during fault injection campaigns. The experimental outcomes show that the proposal improves related works accuracy results.
This paper is organized as follows. Next section provides background information about related works. Section 3 presents the criticality criteria and defines the proposed metric. Section 4 validates the proposal by means of a comprehensive experimental study. Finally, Section 5 summarizes the concluding remarks of this work.
RELATED WORKS
The most commonly used vulnerability metric is the Architectural Vulnerability Factor (AVF) [13] . The AVF of a hardware structure is defined as the probability that a fault in that structure will result in a visible error in the final output of a program. AVF-derived works evaluate the vulnerability based on the micro-architecture features of processors; however, they do not take into account detailed characteristics of the executed programs.
In addition, it is known [14] [15] that different programs and even different functions in an application are not equally critical due to different data and control flow properties, internal error masking effects, etc. These programs/functions exhibit distinct resilience to hardware-level faults. In this work, therefore, we focus on the criticality of the registers in the register file for a given executed program. Among proposals analyzing specific features of the executed programs to estimate criticality, two groups can be found: proposals based on static code analysis, and those based on dynamic measurements.
Static analysis does not require simulation or actual usage of the system. Instead, it relies on statically-determined criteria (e.g., lifetime, functional dependencies between variables) [12, [16] [17] . Static code analysis is a very important task in modern compilers because it makes possible to find bugs and perform live variable analysis in order to allocate registers. However, it is not very useful in the context of this work, since no compiler can statically know all the program dynamic properties to estimate criticality. Therefore, we propose to compute the criticality based on dynamic code analysis during simulation-time.
Dynamic techniques use information gathered during simulation or operation of the system to estimate criticality factors [18] [19] . Unlike approaches based on static code analysis, dynamic analysis takes into account conditional branches, loop iterations, recursive functions, values only known in run-time, and in general, the execution progress of the application.
In addition, authors in [26] [27] have proposed a control flow protection technique known as Optimized Embedded Signature Monitoring (OESM). The first step in this approach consists on the application profiling (dynamic code analysis) to optimize the number of checkpoints introduced into the application code, which are inserted in the second step by means of the application of a well-known control flow technique such as CFCSS [28] . Unlike this controlflow protection technique, our metric is aimed at identifying the most critical registers from the microprocessor register file for a further application of data protection schemes (out of the scope of the present work).
As summarized in the introduction, the present work extends the proposal presented in [12] by re-defining the calculation of the criticality criteria improving their accuracy in the estimation, by means of considering new parameters and performing dynamic analyses of the executed programs, instead of the static code analysis originally proposed.
APPLICATION-BASED CRITICALITY METRIC
We focus on the criticality of each individual register in the microprocessor register file for a given program (assembly code). Criticality can be expressed in terms of 3 criteria: effective lifetime, weight in conditional branches, and functional dependencies.
Effective Lifetime
Register lifetime represents the time when useful data is present in the register. Any fault occurring to the register during that time destroys data integrity. Therefore, the higher the lifetime is, the longer the register is prone to faults.
Register lifetime is expressed as the sum of clock cycles of all the register living intervals during the program simulation/execution. A living interval starts with a generic write operation and ends with the last read operation, which precedes the next write operation or the end of the program execution.
However, it is important considering that a new interval is created every time there is a write operation to the register, and during the execution of that instruction (n clock cycles), there are k clock cycles (k<n) at the beginning of the write execution in which the register has not yet stored the value, and any fault affecting it during that k time will be overwritten when it finally stores the written value. The remaining time in the interval (n -k) is called partial effective lifetime (pelt). Hence, the register effective lifetime is the sum of all the partial effective lifetimes. These terms are illustrated in Fig. 1 .
In the simplest case, these cycles are due to the number of stages in the instruction pipeline, and in case of more complex processorsthey are consequence of using different techniques that delay the effective write in the register file, these include: forwarding, speculative execution, reorder buffers, etc.
This consideration is very important because it implies that in cases of registers having the same/similar lifetime, criticality is lower for those registers whose lifetime presents a larger number of living intervals, i.e., their effective lifetime is lower.
According to this, we propose to adjust the lifetime (measured in clock cycles) by subtracting the non-effective lifetime (k) of each write operation from the total lifetime of a register, i.e., the effective lifetime. Depending on the microprocessor complexity, the k number of cycles can be either calculated or estimated. Finally, for estimating the register criticality the effective lifetime is normalized with respect to the duration of the program (clock cycles). 
Weight in Conditional Branches
The second criterion is weight in conditional branches. As it was first proposed in [12] , it should be given more attention to the registers taking part in branch conditions. Erroneous data stored in these registers may lead the program control flow to take an incorrect path. We introduce two improvements to this criterion based on the dynamic analysis.
Firstly, real influence of registers on criticality is taken into account depending on the statement where the conditional branch is located. For instance, it is more critical a register involved in the evaluation of a loop condition, whose condition is repeated n times, than another that participates in a single conditional statement (executed once). Dynamic counting of registers that take part in branch conditions is able to capture this influence. Static code analysis, in contrast, may incorrectly indicate the same level of criticality for both registers in the previous example. Fig. 2 presents an example of different variations that can occur during the dynamic counting of the registers involved in conditional branches: r1 is present in a single conditional statement executed once; r2 is firstly present in a loop condition with a static loop limit (executed 10 times) and then it also participates in a second dynamic loop (executed N times); r3 is involved in the last loop statement (executed N times). In terms of weight in conditional branches for this example, the most critical register is r2 and the least critical r1.
The second improvement to this criterion consists on the normalization procedure. The dynamic measurement for each register taking part in conditional branches has to be normalized with respect the total number of executed instructions of the program, instead of the total number of effectively conditional branches. Otherwise, a register involved in conditional branches might have a high value for this criterion, regardless that number of conditional branches did not represent a significant portion of the executed instructions. For instance, a register participating in most conditional branches may be seen as critic for this criterion if only effectively conditional branches are taken into account on the normalization; however, let's suppose that those branches only represent the 1-2% of the executed instructions of the program, in which case the real register vulnerability is considerably lower. This consideration not only permits to assign a real contribution to the conditional branch criterion, but also facilitates to combine this criterion with other criticality criteria.
Functional Dependencies
The third criterion is the count of functional dependencies between registers. A register depends on a given register r (or is a direct descendant of r) if it takes a value that is the result of an expression involving the value of r. This criteri- on remarks the criticality of those registers having a lot of descendants, as erroneous values will be propagated widely. We propose to measure functional dependencies for each register by counting dynamically only its direct descendants during the program execution. Other descendants different to the register direct descendants (n-level descendants where n ≥ 2) are not taken into account. Considering other levels without including dependency time intervals could lead to consider some n-level dependencies erroneously.
To compute functional dependencies, it is built an N×N matrix M (where N is the number of registers in the register file). A cell Mi,j means that register i is descendant of register j. For each instruction simulated, matrix M is updated with the dynamic count of direct descendants. Once the matrix M is completely calculated, each cell is normalized in the same way as for the conditional branches criterion, i.e., with respect to the total number of executed instructions.
Criticality Metric
Criticality C of a register r can be estimated as expressed in (1).
Where: Wlt, Wcb, and Wfd are weight coefficients assigned to each criterion; Clt(r) is the normalized effective lifetime; Ccb(r) is the normalized conditional branch value; and Cfd(r) is the normalized functional dependencies value, which can be calculated using (2) .
Where: M is the dynamic functional dependencies matrix, and N is the number of registers in the register file. Notice that (2) includes the effective lifetime and conditional branches criteria of all direct descendants of r in the calculation of Cfd(r).
EXPERIMENTAL RESULTS AND DISCUSSION

Experimental setup
The weight coefficients used in (1) and (2) for the register criticality calculation were equal to 0.33 each. This configures an equal weight for each criterion. These values were chosen for demonstration, and at the same time, for comparison purposes, as these were the same weights used by authors in [12] . However, these coefficients could have been modified according to each application dependability requirements.
To validate the applicability of the proposal, several target applications have been studied. The experimental setup is divided in two different case studies that permit to demonstrate that the metric is hardware-agnostic. The first set of target applicationsis based on the PicoBlaze microprocessor [20] (case study 1), whereas the second group targets the miniMIPS microprocessor [21] (case study 2).
To corroborate the accuracy of the results in both case studies, criticality estimations have been analyzed and compared with results obtained during fault injection test. We focus on the type of transient fault known as Single Event Upset (SEU), which is characterized by the logic state alteration of a single memory element in the system. For each studied application in the case studies, a fault injection campaign has been carried out against each one of the registers in the register file. Each campaign consisted in 10,000 faults injected (one fault per run). Each fault consisted of a bit-flip in a randomly selected bit from the target register in a randomly selected clock cycle from all the workload duration.
Besides to validate the accuracy of the estimated criticality results comparing with those obtained by means of fault injection tests, the case studies were aimed to verify that our proposal improves related works accuracy results. Therefore, we have also implemented the criticality metric based on static code analysis proposed in [12] to compare the results using the same processor and the same test programs.
To evaluate the goodness of the estimations, we check how well the expected rank of critical registers (based on the error rate obtained in the fault injection tests) matches to the estimated rank, i.e., match level. Ranks are expressed in ascending order from the most critical to the least critical register. The match level determines the distance between each value of the estimated criticality rank compared to its respective expected value. A match level equals to 0 means an exact match between them, which is the desirable value.
Moreover, the analysis of the code necessary for the criticality estimation is performed previously to the final deployment. To do so, specific features of the simulator or code instrumentation can be used. In case of code instrumentation, it is removed from the program after the estimation. Therefore, there is no memory overhead or performance penalty introduced to the deployed program in any case.
Case Study 1: PicoBlaze
PicoBlaze is a widely used IP (intellectual property) core. The main features of this processor are: 16 byte-wide general-purpose data registers, 1K instructions of programmable on-chip program store, byte-wide Arithmetic Logic Unit (ALU) with Carry and Zero indicator flags, 64-byte scratchpad RAM, 256 input and 256 output ports, 31-location Call/Return stack.
The benchmark software suite used for PicoBlaze is made up of 11 example programs. Some of them are representative programs used in embedded systems: proportionalintegral-derivative controller (PID), finite impulse response filter (FIR), and advanced encryption standard (AES) or Rijndael. The rest of them consist of typical small computations: bubble sort (BUB), Fibonacci (FIB), greatest common divisor (GCD), matrix addition (MADD), matrix multiplication (MM), scalar multiplication (MULT), exponentiation (POW), and quick sort (QSORT). In this case, fault injection tests were performed using the software-based simulation tool presented and validated in [22] . Table 1 presents the obtained results for the PicoBlaze case study. For each studied application and for each register (RX) used within the program source code, it is presented the register error rate obtained in the fault injection tests, the estimated criticality calculated using the static criticality metric, and the estimated criticality calculated using our proposal. In all the cases only 5 (or less) registers were used to code the applications.
Notice that the estimations of criticality ranks obtained by the dynamic criticality metric are quite approximate to the ranks obtained by fault injection. An 82.0% of the estimated positions in the rank correspond to the respective expected value in the criticality rank (match level = 0).
Furthermore, in case of giving an acceptable error margin to the estimations, i.e., considering match levels equal to 0 and 1 as well, this percentage goes up to 98.0%. That is, the estimated positions in the criticality rank can be at a maximum distance of one position to its expected value based on the fault injection results.
Comparing these results with the originally published results (from Tables VII and VIII in [12] ) using the match level, it can be seen that only 25.0% of the estimations match to the respective expected value in the criticality rank (match level = 0). Including results from match levels equal to 0 and 1, this percentage only increases to 62.5%.
Nevertheless, for the sake of comparison, it is necessary to analyze the estimated results obtained by the static criticality metric for the same microprocessor and the same benchmark (Table 1) . In this case, from Table 1 one can see that only a 46.0% of the estimated positions in the rank match the expected value in the criticality rank. This percentage is increased to 78.0% when considering as acceptable match levels equal to 0 and 1. These results are 36.0% and 20% less accurate than our proposal, respectively. Fig. 3 represents the match level results presented in Table 1 . This premits to compare, at a glance, estimation results obtained by the state-of-the-art static approach [12] with results achieved using our proposal. As commented above, the experimental outcomes show that our proposal improves significantly the related work accuracy results. Fig. 3 . Proposal Vs. Static metric match levels in PicoBlaze   Fig. 4 represents the correlation coefficient between the estimations (static metric and dynamic metric) and the error rate obtained in the fault injection campaigns for each test program, which shows the strength of the linear association between the variables. All the correlation coefficients from the proposed metric are greater than the coefficients obtained in the static metric (closer to 1.0). This means that the criticality estimations obtained using our metric represent the expected criticality in a much better way than the static metric. The only two programs with significant discrepancies for our proposal were PID and MULT, i.e., their correlation coefficients were less than 0.8. One possible reason for this divergence is that the criticality estimations in these cases are under the sensitivity of the proposed metric.In both cases the standard deviation of the results is less than 0.060, which means that the estimations are very close to each other and the criticality rank could vary slightly. The standard deviation for the rest of the test programs is greater than 0.060.
Case Study 2: miniMIPS
MiniMIPS is a 32 bits core based on MIPS I architecture. It has a pipeline of 5 stages and 32 general purpose registers ($0-$28, $sp, $fp and $31). All miniMIPS instructions take five cycles to be executed and the peak throughput is 1 instruction per cycle. Register $0 is constant, so it is not considered in the analysis.
Five case-study applications are selected as benchmak for miniMIPS: a bubble sort (BS), a matrix multiplication (MM), a non-recursive Depth-Fist Search (nDFS), a recursive Depth-First Search (rDFS) and the Tower of Hanoi (TH). The number of used registers ranges from 7 to 16 depending of the application.
A Register Transfer Level (RTL) description of the microprocessor is submited to a fault injection campaign performed by means of a VHDL simulator [23] . The faults are injected by forcing a bit-flip in the registers' signals. The duration of a fault is set to one clock cycle to increase the probability of the fault causes an error. The fault injection results are then compared to the results obtained by the proposed metric. Table 2 presents the obtained results for the miniMIPS case study. For each studied application and for each register (RX) used within the program source code, it is presented the register error rate obtained in the fault injection tests, the estimated criticality calculated using the static criticality metric, and the estimated criticality calculated using our proposal.
Only 33.3% of the estimated positions in the rank correspond to the expected criticality rank (match level = 0), but it is still better than the 29.8% presented by the static criticality metric. The low correspondence for the miniMIPS is due to the higher number of registers used by the programs and also to the close error rates presented by many registers. That makes the probability of ranking them correctly more unlikely.
Furthermore, in case of giving an acceptable error margin to the estimations, i.e., considering match levels equal to 0 and 1 as well, the percentage goes up to 66.7% for the proposed metric. It is an improvement when compared with the state-of-the-art because the static criticality metric reaches 47.4%. Match level results obtained by the state-of-the-art approach [12] and the results achieved using our proposal are presented in Fig. 5 .
One fact that must be pointed out is about the register 6 in the matrix multiplication. Our approach says it is the second most critical when it is not that much (match level = 9). The reason for this is that register 6 has actually no much effect in the program outputs, but as it has a long lifetime, our metric see it as critical. There is a loop (whose limits are unknown in compilation-time) between the write and read of register 6. In this way, the lifetime of this register in the static approach is short because the positions of write and read are close but due the loop between such positions our dynamic approach sees a long lifetime, and consequently, a high criticality is assigned to this register. Anyway, as this kind of situation is not common, the average match level for all case-study applications is smaller in our proposal than in the static approach as shown in Table 3 , which clearly shows the improvements of our approach in the state-of-the-art. However, the influence of the registers in the outputs is a topic for a future work to avoid counting these dummy registers within the resultant criticality estimation. Fig. 6 shows the correlation coefficient between the estimations and the error rate obtained in by fault injection. Four of five correlation coefficients from the proposed metric are greater than the coefficients obtained in the static metric, and the other one differs only by 3% from the static metric. The miniMIPS results support that the criticality estimations obtained using our metric represent better the expected criticality. 
CONCLUSION
Given that providing early estimations of the register file criticality is a key task within the fault tolerant design of embedded systems, this paper presents an application-based metric to estimate the criticality of each register from the register file in microprocessorbased systems. This metric facilitates the selection of the hardened set of registers when a selective hardening strategy is required in software, avoiding costly design space explorations guided by costly brute force strategies. Furthermore, early estimation of the register file criticality permits to perform preliminary reliability assessments before the system is fully implemented and fault injections can be carried out.
Experimental results demonstrate the applicability and accuracy of the proposed metric. Criticality estimations obtained using our metric represent the expected criticality (based on fault injection campaigns). Moreover, results showed that criticality estimations based on dynamic code analysis improve significantly the accuracy of results compared to metrics based on static code analysis.
This work opens up interesting new boundaries in the criticality analysis of resources in the design of fault tolerant embedded systems. Beside applicationbased criticality criteria, we will investigate new metrics taking into account more factors for the proper selection of the hardened set of registers, e.g., the influence of the registers in the output, expected time execution overheads. Dynamic criticality metric (proposed)
