Abstract
Introduction
The emergence of nano-scale circuits accompanies the advantage of employing more gates in a single chip. Utilizing more components reminds the necessity of employing superior connection infrastructure in place of traditional System-on-Chip (SoC). The Network-on-Chip (NoC) infrastructure has been considered to accommodate better modularity, scalability, and higher bandwidth compared to busbased infrastructure. Controlling the physical parameters in the fabrication process is unexpected due to the shrinkage of the technology size [1] . Fault occurrence such as crosstalk, single event upset (SEU), and single event transient (SET) caused by electromagnetic interferences (EMI), alpha particles' strikes, and cosmic radiation can affect the functionality of NoC router or, eventually lead it to failure [2] . Fault-tolerant design (FTD) has been considered to reduce or mask such phenomena. Rerouting algorithms have been proposed to update the routing table [3] [4] . The idea of bypassing faulty data paths within failed routers has been proposed as light weight fault tolerant method [5] . Only SEU and cross talk fault effects have been evaluated on a synchronous NoC router [6] [7] . In some cases the reliability of NoC router has been assessed by the help of analytical method imposing some assumptions which might not be accepted. In [8] , an analytical model of an NoC architecture has been presented with the assumption of not being the faulty and destination router at same columns in a mesh topology. An exhaustive research seems necessary to find the most tenuous components in an NoC router while keeping the cost of overhead low. Such experimental results can help the designer to utilize well-chosen fault-tolerant methods based on the application. This experiment evaluates the reliability of NoC router against various fault models through simulation-based method with the help of SINJECT. SINJECT is a fault injection tool which examines the robustness of modeled system in Verilog or VHDL [9] . This paper is organized as follows: Section 2 introduces the implemented NoC router in this experiment. Part 3 includes the fault injection characteristics. The experimental results have been considered in different aspects in fourth portion. Finally, section 5 delivers some conclusion remarks.
Synchronous router architecture
The architecture of the synchronous NoC router applying in this experiment has been indicated in Figure 1 . It is apt to be employed in both mesh and torus topology supporting five bidirectional ports. It has been synthesized with the help of Leonardo, which results in 38560 gates. It consists of three main components including input buffer, routing unit, and switch component. Two state diagrams in the buffer component are responsible to take and to transmit a flit in a single clock. The upper state diagram is triggered in positive edge of clock and the lower one is stimulated in negative edge of clock. Employing both edges of clock increases the performance of NoC router. The routing unit component is the central unit including sub components such as buffer management, header processor, arbitration unit, and channel allocator. The buffer management unit is in charge of controlling the buffer space to preclude losing older flits before their transmission. The buffer is implemented as a circular queue to optimize of applying buffer efficiently. New routing decision is made by implementing XY routing algorithm once a new header flit has been detected and the previous 
Fault injection characteristics
Transient faults are much more common than permanent ones in upcoming digital systems [10] . Such experiment examines the effects of transient faults by activating each FIS for a short period of time which a flit needs to pass a router. The injected faults treat as a permanent if the selected life time equals the total running time of experiment. 112 FISes have been injected into the VHDL model of the NoC. The employed NoC router is simulated with a 10ns clock period. According to the part 2 a flit needs one clock period to reach its destination in normal condition. The faults have been injected randomly by the average duration of 20ns (The fault duration is exponentially distributed). The FISes are activated with a uniform distribution between 200ns to 8000ns, in normal condition. The fault injection of each FIS has been repeated 100 times. The FISes are distributed through main components based on their complexity. This experiment considers single fault effects. In other words, the FISes are activated and their effects are observed separately. Table 1 specifies the infected points and injected fault models in more detail. Five different fault models, including dead clause, stuckthen, micro-operation, crosstalk, and SEU have been injected into the VHDL model of NoC router. Switch component is infected by fewer FISes in contrast to the other components. SEU fault model affects sequential circuits. Based on the router specification in part 2, the switch consists of a combinational circuit without any decision making functionality. SEU fault models are injected into routing unit and buffer components. Both of such components consist of sequential circuits and decision making subcomponents. Table 2 shows the number and percentage of overwritten faults, latent errors, failure experiments, and the average of fault latency for each of component separately and in total. The results are based on the comparison of golden run and faulty run of simulation. According to the result, 69.76%, 31.17%, and 35.44% of real injected faults are overwritten (replaced by new Dead-clause  6  10  6  Stuck-then  2  8  6  Micro-operation  3  5  ----------Crosstalk  9  12  12  SEU  13  20  ----------Total  33  55  24 values before being detected), in buffer, routing unit, and switch components, respectively. Switch component got the second rank of failure rate although it is infected less than the others. That might because of the high activity of this component which is involved per each flit's transmission. The highest failure results are related to the routing unit due to its decision making role. Figure 2 shows the percentage of overwritten faults in progressive time. The highest ratio of overwritten faults can be found in buffer component and the lowest one relates to the routing component. That might be behind the fact that an eight flit buffer has been employed to implement a wormhole switching in this architecture. Fault occurrence in input buffer might be overwritten at most each eight clock whenever the new flit is written to the buffer. On the other hand, the routing unit's values are updated whenever the incoming flit is header. The fault occurrence on data-payload flits are not detected in this experiment since these types of failures must be distinguished in the destination IP core. Only fault effect on header flit in buffer component can result in miss routing and consequently a failure.
Experimental results

Conclusion
In this experiment a synchronous NoC router architecture has been introduced that is capable of receiving and transmitting a flit in a single clock. The fault-tolerant behavior of the router has been evaluated to keep the redundancy overheads low in upcoming designs. Routing unit has been detected as the most tenuous component among the all. The overwritten fault percentage of this component is reported as the lowest one. An information redundancy as an error correction remedy might be best choice to impose lower hardware penalty while achieving the robustness. Some error detection techniques which need retransmission packets dramatically influence the performance of the NoC router. Employing some fault masking method to reduce the fault propagation might be another approach of strengthening the NoC router.
