Abstract To improve the reliability of spaceborne electronic systems, a fault-tolerant strategy of selective triple modular redundancy (STMR) based on multi-objective optimization and evolvable hardware (EHW) against single-event upsets (SEUs) for circuits implemented on field programmable gate arrays (FPGAs) based on static random access memory (SRAM) is presented in this paper. Various topologies of circuit with the same functionality are evolved using EHW firstly. Then the SEU-sensitive gates of each circuit are identified using signal probabilities of all the lines in it, and each circuit is hardened against SEUs by selectively applying triple modular redundancy (TMR) to these SEU-sensitive gates. Afterward, each circuit hardened has been evaluated by SEU Simulation, and the multi-objective optimization technology is introduced to optimize the area overhead and the number of functional errors of all the circuits. The proposed fault-tolerant strategy is tested on four circuits from microelectronics center of North Carolina (MCNC) benchmark suite. The experimental results show that it can generate innovative trade-off solutions to compromise between hardware resource consumption and system reliability. The maximum savings in the area overhead of the STMR circuit over the full TMR design is 58% with the same SEU immunity.
Introduction
With the fast development of science and technology, the reliability of electronics in space and avionics has become crucial due to the increased complexity of the architecture and function. Field programmable gate arrays (FPGAs) based on static random access memory (SRAM) have gained a steadily increasing interest for such applications because of their short time to market, good reconfigurability and low cost. Unfortunately, along with these advantages this technology has a high susceptibility to the so-called single event upsets (SEUs). 1 An SEU stands for the inversion of a memory bit caused by heavy ions, protons and/or ground level radiation. SEU is the largest contributor to device soft failure, 2 which may even lead to failure in the mission. Hence, aerospace industry will benefit significantly from SEU mitigation technologies for SRAM-based FPGAs.
Triple modular redundancy (TMR) 3, 4 is the most widely adopted one for hardening circuits implemented on SRAM-based FPGAs. For digital circuits mapped on FPGAs, not only the flip-flops (FFs) to form the feedback path of sequential circuits, but also the logic gates in combinational and sequential circuits, need to be hardened. The reason for this is that the logic gates are mapped on the FPGA using look up tables (LUTs), which consist of SRAM cells. Even the interconnection is also controlled using the data stored in SRAM cells.TMR can be applied based on different granularities, such as device redundancy, system redundancy, module redundancy or logic element redundancy. The finer the granularity is, the higher the probability is. For example, in the system level TMR system, the original system is replicated three times and the output extracted from a majority voter. Each replica of the system works independently and is named domain. If an SEU occurs in one domain, TMR masks the fault by majority voting and thus propagates the correct output. This method provides the TMR system with resistance against SEUs, and can harden the system without affecting its normal operation. However, TMR system can withstand only single upset at any instant of time. If two out of three domains give faulty results, the system will produce wrong answers. To enable the system to mask multiple faults, logic element level TMR can be applied. In the logic element level TMR system, each logic element, including the logic gate and the FF, is hardened by TMR, so it allows every logic element to tolerate one failure. Obviously, the anti-SEU ability of logic element level TMR is better than system level TMR, but the area overhead of voter insertion is also significant.
To reduce the area overhead of TMR system, a special kind of TMR named reduced triple modular redundancy (RTMR) for specific very long instruction word VLIW processors is proposed in Ref. 5 . The key idea is to employ the redundancy of operators in the data path of a VLIW processor. I.e., every operation is executed twice by two different operators during normal program execution. Only in case a mismatch between both computed results occurs, the operation is executed by a third operator and its result is used for voting. Therefore, during most of the execution time, the area overhead of (RTMR) is only 100%. However, RTMR is somewhat a system level TMR, and it is only suitable for application to specific VLIW processors. Moreover, the VLIW architecture must be modified in order to detect a mismatch in computed results, and necessary program transformations must be introduced to obtain an internal representation for fault tolerant programs that can be scheduled to the proposed VLIW architecture. So if it is used for logic element level TMR, many additional hardware logic and complex scheduling mechanism should be added.
To reduce the hardware resource consumption of logic element level full TMR system, a fault-tolerant method of selective TMR (STMR) for circuit mapped on FPGAs is proposed in Ref. 6 . In this method, only the SEU sensitive gates, i.e., gates that are prone to upset in case of SEU, in the circuits are detected using the signal probabilities of the line and are further hardened with TMR; while those non-sensitive to SEU are not hardened. Because only part of the gates are selectively hardened by TMR, so the STMR method can significantly reduce the area overhead of the hardened circuit compared to full TMR; moreover, since the gates not hardened are not prone to upset by SEU, the loss of SEU immunity is small. However, the area overhead and SEU immunity (namely reliability, which is inversely proportional to functional errors in case of SEU) of a circuit conflict with each other, i.e., if the former increases then the latter increases too; vice versa. If we want to increase the reliability of a circuit, more gates need to be hardened by STMR, so the area overheard increases too; otherwise, if we want to decrease the area overhead, the number of gates to be hardened by TMR must decrease, so the reliability decreases too in the case of SEU. Therefore, a compromise between the area overhead and the number of functional errors is required. Moreover, faulty domains in the STMR system cannot be repaired.
In this paper, the multi-objective evolutionary design of STMR system against SEUs is presented, which combines the novel design and self-repairing capabilities of evolvable hardware (EHW) with the less area overhead of the STMR technique. Moreover, this strategy can result in a tradeoff between reliability and resource consumption by using the multi-objective optimization technology. In general, the procedure of this strategy can be divided into three steps. Firstly, various topologies of circuit with the same functionality are evolved using EHW. Then these circuits are hardened against SEUs by introducing the STMR technique to greatly reduce the area overhead with a slight loss of reliability. Lastly, through introducing multi-objective optimization algorithm 7 based on weighted summation, the number of functional errors and the area overhead of the circuits with different topologies are optimized simultaneously. The proposed fault-tolerant strategy is tested on four circuits taken from microelectronics center of North Carolina (MCNC) benchmark library. Not only is the area overhead of the STMR circuit decreased significantly over that required for the full TMR design of the same circuit with a small loss of reliability, but also a tradeoff between the number of functional errors and the area overhead is achieved.
STMR method based on multi-objective optimization and EHW

Evolvable hardware
EHW is a novel kind of bio-inspired smart hardware, which is capable of self-assembly, self-repairing and self-adaptation. It is an integration of evolutionary computation and reconfigurable hardware devices. 8, 9 It applies evolutionary algorithms, particularly genetic algorithm (GA), as the global search engine, and in-situ configurable devices as the physical medium. The goal of EHW is to obtain expected circuits and topologies through evolution without human intervention or designers' knowledge, 10 and then adapt to the new environment by reconfiguring its own internal structure dynamically and autonomously according to changing environment.
Extrinsic or intrinsic evolution 11 is applied to EHW in which circuit architecture as well as property parameters are encoded into chromosomes, then each candidate circuit can either be simulated or implemented physically on reconfigurable devices to evaluate using evaluation function. An evaluation function, known as the fitness function, is used to evaluate each chromosome in terms of being a good solution to the problem and is the optimization objective of GA. Offspring individuals are thus derived from operators like selection, crossover, and mutation according to their fitness. The evolution cycle is then repeated until a satisfying solution (a circuit providing the desired behavior) is found.
Extrinsic evolution is performed using a circuit simulator. So it is suitable for discussing new evolution mechanisms and exploring new architecture models for EHW. For example, Carlos et al. 12 performed a comparative study of several heuristics with respect to a traditional GA in the design of combinational logic circuits. Sushil 13 combined GA with a case-based memory to improve the performance of a series of similar design problems. Phillip and Ganesh 14 presented an algorithm inspired from quantum evolution and particle swarm to evolve combinational logic circuits; a multi-objective fitness function is used to obtain feasible circuits with minimal number of gates. Vijayakumari and Mythili 15 proposed a faster 2 dimensional (2D) technique for the design of combinational digital circuits using GA, in which the combinational digital circuits is represented as 2D chromosomes, and suitable 2D crossover and mutation techniques have been proposed in order to increase the convergence speed of GA. However, extrinsic evolution is computation-intensive and time-consuming, and there is considerable difference between the performance obtained by simulation and that of the real-world application.
In intrinsic evolution, all the candidate circuits are evaluated in a physical reconfigurable device. So it can not only accelerate the evaluation speed, but also make full use of the real characteristic (e.g. temperature, power consumption, local failure and so on) of the device. Intrinsic evolution is capable of online self-adaption and self-repairing, and is the foundation for EHW's real-world application. For example, Isamu Kajitani et al. 16 proposed the variable-length chromosome GA for the evolution of pattern recognition system to recognize hand-written symbols on an EHW-board including four Xilinx LCX XC 4025 FPGAs. Zhang et al. 17 presented a reconfigurable architecture inspired by Cartesian genetic programming and dedicated for implementing high-performance digital image filters on a custom Xilinx Virtex FPGA xcv1000. L. Sekanina and S. Friedl 18 introduced a complete implementation of a digital evolvable hardware system in which a virtual reconfigurable circuit and evolutionary algorithm were implemented independently as soft IP cores. And the COMBO6 card is employed for the evolutionary design of small combinational circuits, such as 3 · 3-bit multipliers.
Intrinsic EHW can be classified into three different classes according to the location of the reconfigurable hardware and the evolutionary algorithm (EA), i.e., PC-based inter-board intrinsic EHW that consists of one or more boards hosting reconfigurable devices and a PC on which EA is run, inter-chip intrinsic EHW in which the reconfigurable devices and the embedded controller that runs EA are located on the same board, and complete intrinsic EHW in which the reconfigurable hardware and EA are implemented on the same chip. Since complete intrinsic EHW yields the best performance as the communication delays are due to intra-chip wires, in this paper, a complete intrinsic EHW is implemented on an Xilinx Virtex-5 FPGA evaluation platform, and STMR is introduced to different topologies of a given circuit evolved by using EHW.
STMR for combinational circuits
With a small loss of SEU immunity, the STMR strategy can greatly reduce the area overhead of the hardened circuit compared to full TMR by selectively applying TMR to SEU sensitive gates. The basic concept is as follows: (1) a set of the primary input probabilities has been generated and propagated through the combinational circuit; (2) the output signal probabilities of all the lines in the circuit are calculated; (3) SEU sensitive gates are identified; (4) TMR is introduced to these sensitive gates.
Output signal probabilities computation of a Boolean gate
Typically, the input environment can be summarized either in terms of input signal probabilities or in the form of ''representative'' input sequences. 19, 20 Referring to Ref. 6 , in this paper we characterize the input environment in terms of input signal probabilities and model an SEU upset as a single event upset (SET) on a given combinational circuit to design the STMR system. The signal probabilities of the output of an n-input gate are calculated as shown in Table 1 , which is highly dependent on the nature of the circuit and the environment it will be subject to. In other words, given the input signal probabilities of a combinational circuit, according to Table 1 , the signal probabilities of all the nets in the circuit can be determined through propagating the signal probabilities of the primary inputs level by level until the primary outputs of the circuit are reached.
Definition and identification of SEU-sensitive gates
If a fault on one of the inputs of a gate can propagate to the output of the gate, we say that the gate is sensitive to SEUs. In order to detect the SEU sensitive gates of the circuit after input and output signal probabilities' calculation, a sensitive input whose concept is introduced by the critical path tracing algorithm 21 is determined as follows:
Definition 1. The input of a gate is considered to be sensitive only if the changes of its value results in the change of the gate output value.
The input sensitivity of a gate with two or more inputs is identified as follows:
(1) The input is considered to be sensitive only if its value is dominant over other inputs. (2) All inputs are considered to be sensitive if none of the inputs has dominant value.
According to Definition 1, the dominant value for AND gate as well as NAND gate is ''0'' and the dominant value for OR as well as NOR gate is ''1''. 
In order to use Definition 1, a threshold probability is identified as follows: Definition 2. If the signal probability by a line is more than the threshold probability, then its logic value assumed is ''1''; otherwise its logic value assumed is ''0''. Thus, given a threshold probability and a set of input probabilities, we can assign the input logic values of a gate and then detect the gate's SEU sensitivity according to Definitions 1-3, as shown below.
Definition 3. If a gate has one or more sensitive inputs, then the gate is sensitive to SEUs. We take the circuit in Fig. 1 as an example to illustrate how to find a gate's SEU sensitivity. Let the threshold probability be 0.5. The 3-input OR gate with the signal probabilities of the inputs A, B, and C are equal to 0.7, 0.2, and 0.2 as shown in Fig. 1(a) . Then input ''A'' is at logic ''1'', and input ''B'' and ''C'' are at logic ''0''. Assume a fault due to SEU on inputs ''A'' at some instant of time, then the fault will propagate to the output of the gate. This is because all other signals are at non-dominant values at that instant. In other words, a fault on one of the inputs can upset its output only if this input has dominant value whereas all the other inputs have non-dominant values. We can explain this in the form of probabilities as follows: an SEU on one of the inputs of a gate may propagate through the gate only when the signal probability of all other inputs is smaller than the threshold probability to a large extent. Therefore the gate is considered as SEU sensitive gate. As shown in Fig. 1(b) when the signal probabilities of the 3-input OR gate are equal to 0.7, 0.2, and 0.8, a fault on the input signal ''A'' would not upset its output. This is because input A is not the only input which has dominant value at that time. Hence, this gate is insensitive to SEUs on its inputs.
STMR method
In real applications, a combinational circuit consists of multiple Boolean gates. The design flow of applying STMR to a combinational circuit is as follows:
(1) Signal probabilities assignment. Assign a set of randomly generated probabilities to all the inputs of the given combinational circuit. (2) Input logic value ascertainment. Propagate the primary input probabilities through the circuit, calculate the inputs and outputs probability of each gate in the circuit and determine the input logic values of each gate. (3) Gate sensitivity identification. Propagate through the input logic value of each gate in the circuit from the first level to the last level. If only one or less inputs of the gate has the dominant value, then the gate is sensitive. XOR, XNOR, and NOT gates propagate faults no matter what the signal probabilities of the inputs are, so these gates are always assumed to be sensitive. The gates in the last level of the circuit are also considered as SEU sensitive because a heavy ion bombarding the gate has a higher probability of upsetting the final output. (4) STMR insertion. Apply TMR to all the SEU sensitive gates in the circuit.
If the output of a sensitive gate is connected to only sensitive gates, then the outputs of the triplicates can be directly connected to the inputs of the triplicates of the next level. This is illustrated in Fig. 2(a) . Both of the two gates, Gate 1 and Gate 2, are assumed to be sensitive to SEUs (marked by dotted circles), and Gate 1 is only connected to Gate 2. Therefore, as presented in Fig. 2(b) , the triplicated structure of the circuit does not need to have a voter.
If the output of the sensitive gate is connected to an insensitive gate, then a voter should be used between the sensitive gate and the insensitive gate. For example, as shown in Fig. 3(a) , Gate 1 is assumed to be sensitive, whereas Gate 2 is insensitive. Hence, the triplicated structure for the circuit is shown in Fig. 3(b) , in which the outputs of the triplicated structure D_1, D_2, and D_3 are mitigated using a voter as an output, D. Then D is fed to Gate 2.
The normal full TMR structure of the original circuit in Figs. 2 and 3 is the same, as shown in Fig. 2(c) . It can be seen from Figs. 3 and 2 that STMR structure can obtain a significant reduction of area overhead over normal full TMR structure: where there is one gate insensitive, there is a saving of two gates and one voter; even if all the gates are sensitive, there is still a saving of the quantity of voters that is equal to the number of the gates. 
STMR for sequential circuits
It is well-known that a sequential circuit can be viewed as a combinational circuit with a feedback path of FFs. Combinational circuits can be hardened by applying STMR as mentioned before. Since FFs are crucial to the entire circuit, they can be hardened by full TMR or be replaced with any SEU hardened latches reported in the literature.
Multi-objective optimization
In science and engineering area, most problems actually have two or more objectives to be optimized and these objects cannot be normally compared but conflict with each other. A particular optimization solution with respect to a single objective can lead to unacceptable results with respect to other objectives. Therefore, an appropriate solution to a multi-objective problem is to investigate a set of solutions which satisfy the objectives at an acceptable level without being dominated by any other solutions. 22 This is so-called multi-objective optimization problem (MOP) (also called multiple objectives and vector optimization) 23 in which the goal is to minimize or maximize several conflicting objective functions simultaneously.
In general, problem of minimizing multiple objectives can be described as follows: where x ¼ ½x 1 ; x 2 ; . . . ; x n 2 X indicates an n dimensional decision vector composed of decision variables x 1 ; x 2 ; . . . ; x n in the decision space; y represents the objective space composed of conflicting objective functions y 1 ; y 2 ; . . . ; y q ; gðxÞ represents the constraint condition determining the domain of feasible solution for decision space. A set of all the decision vectors x fulfilling constraint condition is named as feasible solution set x f . If x among x f is non-inferior, then x is Pareto optimal solution or non-dominated solution. In multi-objective optimization, there are usually several optimal solutions with different tradeoffs, called Pareto optimal solutions. The set of Pareto optimal solutions in the objective space is known as Pareto frontier.
A traditional approach for solving an MOP is to translate it into a single-objective problem and use the solutions of a single objective optimization problem to approximate Pareto optimal solutions of MOP. 24 To do so, a group of weights must be assigned to each normalized objective function so as to translate the MOP to a single objective problem with a scalar objective function. The problem must be solved multiple times with different weighted vectors to obtain multiple Pareto optimal solutions, resulting in Pareto optimal solution set.
Weighted summation approaches based on uniform weight are then introduced, after applying STMR to different topologies of a given circuit evolved by EHW. This summation is to optimize the two objectives in the system, i.e., overhead area and the number of times the circuit is functionally upset. Here the former (in terms of the number of gates used) and the latter are denoted by f 1 and f 2 respectively. Then the function to be optimized is as follows:
where w 1 and w 2 are the weights of f 1 and f 2 respectively, whose ranges are from 0 to1 uniformly with a certain step size. 25 In this paper, the step size is set to 0.1, so there are 11 weight vectors, i.e., 11 pairs of [w 1 
Experimental setup and results
Experimental environment
The Xilinx Virtex-5 evaluation platform, ML507, is used as the hardware platform in this paper. The experimental flows are as follows:
(1) Obtaining the required topologies using EHW. The complete intrinsic EHW platform is built on ML507 as follows: firstly, virtual reconfigurable circuits (VRCs) modeled in very-high-speed integrated circuit hardware description language (VHDL) using ISE design suite are customized to an evolvable IP core attached to the processor local bus (PLB) bus of a MicroBlaze processor system created using embedded development kit (EDK) to form the hardware platform; secondly, the cells in the VRC are encoded into chromosome, the matching degree of a circuit's actual output response to its ideal output response is viewed as fitness function, and the standard GA (in order to reduce the computational complexity, there is only even variation operator, no crossover operator. The population size is 128, mutation rate 2, tournament selection size 5, and the maximum number of generation is 20,000.) modeled in ''C'' language is implemented on the MicroBlaze processor using software development kit (SDK); finally, the construction of the EHW platform is finished, the host computer is connected to the target board, and the initial bitstream of the EHW platform is configured and downloaded to Fig. 3 Connections between triplicated module and non-triplicated module which needs to introduce a voter.
the FPGA. Then different topologies of a given combinational circuit can be evolved by mapping its truth table, test vectors and fitness function in the standard GA implemented on the MicroBlaze processor, and the optimal chromosomes displayed on the hyper terminal are decoded to obtain novel topologies. (2) Constructing a new platform for STMR circuit design.
Each optimal chromosome evolved in (1) is translated into VHDL format to design a new VRC structure capable of implementing STMR. The new VRC is customized to construct the new hardware platforms. Then the STMR algorithm described in Section 2.2 is implemented on the MicroBlaze processor using SDK. (3) Designing STMR circuits. Sensitive gates are identified using the STMR algorithm running on the MicroBlaze processor as follows: a group of primary input probabilities are generated randomly, then the signal probabilities of each line in the circuit are calculated, and the SEU sensitive gates are identified. After that, the gates to be hardened are fed into IP core to reconfigure the corresponding VRCs into their TMR version dynamically and autonomously. Thus, an STMR circuit has been designed completely. Multiple alternatives of STMR circuits can be obtained by repeating this process as different groups of primary input probabilities may result in deferent STMR circuits. Because STMR is adopted only on logic gates of both combinational and sequential circuits, whereas the FFs of sequential circuits are hardened by full TMR, all the instances in this section are combinational circuits.
Comparison between STMR and TMR techniques
STMR technique and full TMR technique were tested on Alu4, Parity16e, Add03, and Con1, respectively. The experimental results obtained from the two methods for three sets of threshold probabilities (0.3, 0.4, and 0.5) are shown in Table 2 . The circuits Alu4, Parity16e, Add03 and Con1 are taken from MCNC benchmark library. The columns corresponding to ''STMR'' represent the synthesized STMR circuits; whereas the ones marked by ''Full TMR'' are the full TMR design of the same circuit.
The column marked as ''S'' denotes the average percentage savings in the area overhead of the STMR circuit over the full TMR design. The columns marked as ''N'' show the average number of times that an induced SEU affects the correct operation of the circuit. The columns marked as ''A'' represent the average area of the circuit in terms of gates. The columns marked as ''FR'' show the average fault rate (i.e., the ratio of ''N'' to A) of the circuit. The STMR circuits are simulated with the set of 200 test vectors for each set of threshold probability. The input test vectors randomly generated adhere to the appropriate probabilities of the inputs that is employed in generating the corresponding STMR circuits.
As seen from Table 2 (the threshold probabilities are 0.3, 0.4, and 0.5), the maximum average percentage savings in the area overhead of the STMR circuit over full TMR design of Alu4 and Parity16e are 58% and 21% with the same SEU immunity. The reason for this is that all the gates of the TMR circuit utilize the triplicated structure, whereas the STMR circuits selectively apply TMR to the gates upsetting one or more primary outputs of the circuit. This shows that the area of the STMR design is significantly less than that required for full TMR design in the context of guaranteeing 100% immunity of the circuits against SEUs. The reason why Alu4 and Parity16e have zero FRs may be that the circuits of them all consist of many sub-circuits that are similar or symmetric. So after the SEU-sensitive gates, as well as all the gates in the first and last level of the circuits being hardened by their triplicates, when there is only one SEU affecting one gate each time, the outputs of the circuits would not be affected.
As for Con1, when the sets of threshold probabilities are 0.3, 0.4, and 0.5, at the cost of a slight loss of SEU immunity (the average fault rate are about 3.8%, 3.8% and 4.2% respectively), the average area savings of STMR over the TMR design are 21%, 21%, and 27%, respectively. As for Add03, when the sets of threshold probabilities are 0.3, 0.4, and 0.5, the average area savings of STMR over the full TMR design are 19%, 46%, and 39% and the numbers of times an induced SEU affecting the correct operation of the circuit are 5, 13, and 11 respectively; the fault rates are about 7.6%, 30.2%, and 22.4% respectively. Therefore, although STMR circuit would propagate some errors to the final output of the circuit, the STMR method achieves a balance between the area overhead and the number of functional errors.
Various topologies of full adder and results of its STMR circuits
It is well-known that various topologies of a combinational circuit can be evolved using EHW technology. In general, each topology is quite different from others. Thus multiple diversity solutions can be obtained after applying STMR to each topology. And we can choose the most suitable solution among them for a certain application. Two typical topologies of the full adder are shown in Fig. 4 , in which Fig. 4(a) represents a full adder consisting of only NAND gates, whereas Fig. 4(b) shows a full adder that contains AND gates, OR gates and XOR gates. Fig. 5 illustrates the average area overhead and the number of average functional errors in the STMR circuit of the full adder made of only AND gates. Fig. 5(a) indicates the area overhead's changing with the threshold probability Fig. 5(b) represents the variation of functional errors with threshold probability.
As seen from Fig. 5 , for the circuit consists of only AND and/or NAND gates, with the increase in the threshold probability, the area overhead of the STMR circuits decreases and the number of error times increases. This is because, as mentioned before, the non-dominant value for AND as well as NAND gate is ''1''. Thus, with the increase of the threshold probability, less logic values of lines are considered ''1'' and hence the number of sensitive gates and the area of the STMR circuits decrease. Consequently, more number of functional errors in the STMR circuit affects the final output.
Similarly, if the circuit consists of only OR and/or NOR gates, as the threshold probability increases, more numbers of gates are marked sensitive and then the area overhead of the STMR circuits increases. This leads to the decrease in the number of errors upsetting the outputs.
Whereas if a circuit contains not only AND and/or NAND gates but also OR and/or NOR gates, then a change in the threshold probability may not directly cause a decrease or increase in the area overhead and functional errors of the STMR circuit. It means that the area overhead and the number of functional errors of the STMR circuits may be independent on the threshold probability.
XOR, XNOR, and NOT gates propagate faults no matter what the signal probabilities of the inputs are, so these gates are always considered SEU sensitive. Therefore, the area of the STMR circuit is highly dependent on the number of XOR, NOT, and XNOR gates. Likewise, because the last level output gates are also considered sensitive no matter what the primary signal probabilities of the inputs are, as the last level gates numbers increases, the area overhead of the STMR circuit increases.
3.4.
Results of multi-objective optimization in fault-tolerant system of STMR Fig. 6 indicates the results of multi-objective optimization for the STMR circuits of Alu4, Parity16e, Add03 and Con1.
As seen from Fig. 6 , for circuits Alu4 and Parity16e, the Pareto-optimal solutions for 11 weight vectors are all located at the vertical axes. The reason for this is that when STMR strategy is applied to Alu4 and Parity16e, the numbers of functional errors are almost 0, so the corresponding area overhead minimums in terms of gates are the best solutions.
As for Add03 and Con1, it can be seen from Fig. 6 (c) and (d) that multiple alternatives are offered to decision-makers; if small number of times the circuit is functionally upset is needed, data points on the upper left of the graph can be selected; if little area overhead is preferred, data points on the lower right of the graph can be selected; if both of them is concerned, the middle ones can be selected. Therefore, decision-makers can select satisfied solutions depending on their own requirements. In consequence, it is flexible to apply multi-objective optimization algorithms to fault-tolerant system of STMR in order to meet different application requirements. Note that in Fig. 6(d) , some suboptimal solutions, such as the data points (1, 29), (2, 25) and (3, 21) , have been optimized away because of the existence of the optimal solution, data point (0, 21); therefore, alternatives offered are relatively less.
Conclusions
(1) To improve the reliability of electronics in space, the multi-objective evolutionary design of STMR system against SEUs is proposed in this paper. In our method, the EHW technique is employed to synthesize circuits with various topologies and the same functionality. Then the SEU sensitive gates in the circuits are detected and hardened against SEUs by applying STMR technique. Afterwards the multi-objective optimization techniques are brought into optimize the number of functional errors and the area overhead of the STMR circuits simultaneity. To the best of our knowledge, the combinations of EHW, STMR and multi-objective optimization techniques, as well as the design flow, have not been reported yet. (2) In our method, the lower area overhead of STMR, the novel design capability of EHW and the tradeoff advantages between reliability and resource consumption of multi-objective optimization technique are combined. Compared to TMR method, STMR method offers huge area reduction, with a small loss of reliability. EHW techniques may find innovative topologies of a given circuit that distinguish from the traditional topologies, and then by introducing STMR to each topology, alternative solutions with multiple diversities can be obtained, among them even the best solution may exist. Multi- Fig. 6 Results of multi-objective optimization for Alu4, Parity16e, Add03 and Con1.
objective optimization technique provides plenty of alternative options for user to choose in certain situations. In addition, due to the self-repairing potential of EHW, the proposed fault-tolerant strategy can effectively deal with fault modules in the STMR circuits. (3) In the future, we will investigate how to use abundant tri-state buffers to construct SEU-immune majority voter circuits, as well as find out how to repair faults using the online self-repairing capacity of EHW.
Yao Rui is currently an associate professor at Nanjing University of Aeronautics and Astronautics in 2008. Her main research interests are bio-inspired hardware and intelligent circuits, computer control systems and intelligent information processing.
Chen Qinqin is a Master Degree Candidate at Nanjing University of Aeronautics and Astronautics. Her research interest includes computer control systems.
Li Zengwu is a Master Degree Candidate at Nanjing University of Aeronautics and Astronautics. Her research interest is computer control systems.
Sun Yanmei is a Master Degree Candidate at Nanjing University of Aeronautics and Astronautics. Her research interest is computer control systems.
