An SR Flip-Flop based Physical Unclonable Functions for Hardware
  Security by Challa, Rohith Prasad et al.
arXiv:1909.06357v1 [cs.CR] 11 Sep 2019
IEEE Copyright Notice
©2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any
current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new
collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Accepted to be Published in: Proceedings of the 2019 IEEE International Midwest Symposium on Circuits
and Systems (MWSCAS), Aug. 4-7, 2019, Dallas, TX, USA.
An SR Flip-Flop based Physical Unclonable
Functions for Hardware Security
Rohith Prasad Challa, Sheikh Ariful Islam and Srinivas Katkoori
Department of Computer Science and Engineering
University of South Florida
Tampa, FL 33620
Email: {challa1, sheikhariful, katkoori}@mail.usf.edu
Abstract—Physical Unclonable Functions (PUFs) have emerged
as a promising solution to identify and authenticate Integrated
Circuits (ICs). In this paper, we propose a novel NAND-based
Set-Reset (SR) Flip-flop (FF) PUF design for security enclosures
of the area- and power-constrained Internet-of-Things (IoT) edge
node. The SR-FF based PUF exploits a race condition that is
normally avoided because it leaves the outputs inconsistent. We
show that when both inputs (S and R) are driven to logic high (‘1’)
and then to logic low (‘0’), the outputs Q and Q̄ settle to either
‘0’ and ‘1’ or ‘1’ and ‘0’, depending on statistical delay variations
in the cross-coupled paths. We incorporate process variations
during SPICE-level simulations to leverage the capability of SR-FF
in generating the unique identifier of an IC. Experimental results
for 90nm, 45nm, and 32nm process nodes show the robustness
of SR-FF PUF responses in terms of uniqueness, randomness,
uniformity, and bit(s) biases. Furthermore, we perform physical
synthesis to evaluate the applicability of SR FF PUF on five designs
from OpenCores in three design corners. The estimated overhead
for power, timing, and area in all three design corners is negligible.
I. INTRODUCTION
Due to the horizontal business model and vertical disintegration
of IC design, most manufacturing and testing for fabless design
houses is performed in foreign foundries. At the heart of this
design ecosystem, original IP owners face several security
challenges, including overproduction, counterfeiting,
authentication, and trust in manufactured products. Among the
available security solutions, a Physical Unclonable Function (PUF)
acts as a one-way function that maps certain stable inputs
(challenges) to pre-specified outputs (responses). Although
cryptographic algorithms have been put into practice to perform
authentication, they are difficult to uphold in the face of recent
attacks [1], [2]. Moreover, the computational intensity of
cryptographic algorithms limits their wide adoption in
resource-constrained IoT devices. In contrast, a PUF utilizes
inherent silicon variations. If the same design is manufactured on
two different dies, process variations act differently within and
across both dies, and this forms the basis for a PUF. Ideally, a PUF
implementation should be low-cost, tamper-evident, unclonable, and
reproducible. The PUF response also needs to be invariant to
environmental variations.
In recent years, a wide variety of PUF architectures have been
investigated that transform device properties (e.g., threshold
voltage, temperature, gate length, oxide thickness, edge roughness)
into a unique identifier of a certain length. Metastability in
cross-coupled paths has been exploited to design PUFs with SR
latches [3]–[5] and Ring Oscillators (RO) [6]. Although latch-based
PUF designs offer unique signatures to ICs, they suffer from signal
skew and delay imbalance in signal routing paths, and Error
Correction Code (ECC) circuitry is commonly employed to post-process
the unstable PUF responses [7]. The RO-PUF in [6], in turn, incurs
significant area overhead that includes a counter, an accumulator,
and a shift register. These drawbacks motivate us to harvest deep
metastability in a bi-stable memory element, the SR FF, to design a
low-cost PUF with high-quality challenge-response pairs (CRPs).
In this paper, we design and analyze a novel SR FF based PUF. For
a NAND gate based SR FF, the input condition S (Set) = ‘1’ and
R (Reset) = ‘1’ must be avoided as it produces an inconsistent
condition. When S=R=‘1’ is applied followed by S=R=‘0’, the outputs
Q and Q̄ undergo a race condition. Due to manufacturing variations,
the state after the race condition settles to either ‘0’ or ‘1’.
Further, due to intra-chip process variations, some flip-flops in a
chip will settle in the ‘0’ state, while others will settle in the
‘1’ state. In addition, due to inter-chip variations, this signature
will differ across chips. We investigate the delay variations in
the NAND gates of the feedback path that most affect the gate delay.
We validate the proposed idea with SPICE-level simulations for 90nm,
45nm, and 32nm process nodes to establish the robustness of the
proposed PUF for 16-, 32-, 64-, and 128-bit responses. We also
perform layout-level simulation with foundry data on five designs
that incorporate SR-FFs and present their figures of merit (power,
timing, and area). In summary, we present the following novel
contributions:
• It utilizes SR-FFs already present in the registers of a design
without any ECC or helper data. The responses do not require
multiple key-establishment rounds, which can thwart
reliability-based attacks.
• It produces an input-dependent, random yet stable binary sequence
aided by unpredictable manufacturing variability. Depending on
the input challenges, only a fraction of the SR-FFs is utilized
to create the unique device signature. This increases the
attacker’s reverse-engineering effort to determine the exact
location of the SR-FFs that participate in PUF response
generation.
• It uses a centroid architecture such that only the surrounding
transistor variations affect the PUF responses, and we evaluate
the associated overhead through layout-based synthesis.
The rest of the paper is organized as follows: Section II
provides background on the types of PUFs using metastability.
Section III describes the construction of SR flip-flop based PUF
design. Section IV reports in detail the experimental results.
Finally, Section V concludes the paper and outlines future work.
II. BACKGROUND AND RELATED WORK
A PUF is a digital fingerprint that serves as a unique identity
for a silicon IC and is characterized by inter-chip and intra-chip
variations. Inter-chip variation determines the uniqueness of a
PUF, i.e., that the key produced for one die differs from the keys
of other dies. Intra-chip variation determines the reliability of
the key, which should not change over multiple evaluations on the
same die. For a signal, metastability occurs when the setup and
hold time specifications are not met and an unpredictable random
value appears at the output. Although metastability is an unstable
condition, process variations cause it to resolve to a stable but
random state (either ‘0’ or ‘1’), which is not known a priori.
Transient Effect Ring Oscillator (TERO) PUF [6] utilizes
metastability to generate the responses with a binary counter, an
accumulator, and a shift register. Although the architecture is
scalable, it requires large hardware resources. Su et al. [5], [8]
presented cross-coupled logic gates to create a digital ID based
on threshold voltage. The architecture is composed of a latch
followed by a quantizer and a readout circuit to produce the PUF
ID. However, the readout circuit is generally expensive, which
limits its application in low-power devices. FPGA-based SR-latch
PUFs have been presented in [3], [4]. Due to temporal operating
conditions, ECC is employed in both approaches to reliably map
one-to-one challenge-response pairs. To remove the reliance on
power-up values in memory-based PUFs, registers based on
edge-triggered D-FFs are proposed in [5], [9]. The authors
suggested including an expensive synchronizer on Clock Domain
Crossing (CDC) signals to obtain a stable PUF response. A framework
that adjusts SR-latch timing through ‘body-bias’ voltage in FD-SOI
technology is presented in [10]. To obtain a correct PUF response,
the authors employed buffers along the top and bottom tracks of
the latches, which suffer from response bias.
The majority of works utilizing metastability to design PUFs
employ additional hardware to count the oscillation frequency.
Our work is unlike these previous studies in that it (a) employs
SR-FFs to construct a low-cost PUF and (b) reuses the SR-FFs
already in the original IP, varying channel length and
temperature to account for intra- and inter-chip variations.
III. PROPOSED SR FLIP FLOP PUF
Our approach presents a PUF design that relies on the
cross-coupled path in an SR-FF configuration. Each bit of a PUF
response is extracted from the metastability-induced random value
at the output (Q) caused by a particular input sequence at the
SR-FF. This random value eventually resolves to a stable logic
level determined by process variability. A clock-enabled,
cross-coupled NAND-based SR-FF construction is shown in Fig. 1. It
does not require an additional synchronizer to control the input
conditions. The Set-Reset (SR) latch has a forbidden input
combination, namely S=R=1, which results in both Q and Q̄ being
equal to 1. If, after the S=R=1 input, we lower both inputs
(S=R=0), there is a race condition between the two cross-coupled
NAND gates (ND1 and ND2), making Q and Q̄ linger around
$V_{dd}/2$. Although such a race condition is prohibited during
normal circuit operation, it drives the output to a state
determined by the mismatch in the underlying device parameters
(transistor length, threshold voltage). The race behavior appears
to depend on the precise phase relation between clock and input
data. We exploit this input-referred event sequence to generate
the PUF response.
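The race described above can be illustrated with a toy behavioral model. The sketch below is not the SPICE-level simulation used in this paper. It assumes that ND1 drives Q and ND2 drives Q̄, that each gate has a single propagation delay drawn from a Gaussian distribution, and that the faster gate wins the race. The names `settle` and `chip_response`, the nominal delay, and the 5% sigma are hypothetical choices made for illustration.

```python
import random


def settle(delay_nd1, delay_nd2):
    """Resolve the race after S=R=1 is followed by S=R=0.

    Assumption: ND1 drives Q and ND2 drives Q'. Both outputs start at 1,
    and the faster gate pulls its own output low first, which holds the
    other output at 1.
    """
    return 0 if delay_nd1 < delay_nd2 else 1


def chip_response(n_bits, seed, nominal=1.0, sigma=0.05):
    """Generate one n-bit response; the seed stands in for one die."""
    rng = random.Random(seed)
    bits = []
    for _ in range(n_bits):
        # Independent delay samples model local (intra-die) mismatch.
        d1 = rng.gauss(nominal, sigma)
        d2 = rng.gauss(nominal, sigma)
        bits.append(settle(d1, d2))
    return bits


if __name__ == "__main__":
    for die in range(3):
        print(die, "".join(map(str, chip_response(16, seed=die))))
```

Re-running with the same seed reproduces the same bits, which mirrors the expectation that a given die settles to a stable response, while different seeds stand in for different dies.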
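[Figure 1: schematic of the clock-enabled SR flip-flop with inputs S and R, enable EN=1, cross-coupled NAND gates ND1 and ND2, and outputs Q and Q’.]
Fig. 1: SR Flip Flop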
[Figure 2: block diagram of n SR flip-flops, SR FF [0] to SR FF [n-1], fed by a selector that chooses between PUF mode and regular mode and producing ID [0] to ID [n-1].]
Fig. 2: Architecture of dual-mode n-bit array SR FFs
Fig. 2 shows the architecture of an n-bit array of SR-FFs with
an input multiplexer (MUX) to select either PUF or regular
mode. As each SR-FF generates a single key bit, we can obtain a
PUF signature as long as the number of FF instances. However, the
multiplexer output wire has to be long enough to reach all SR-FF
instances. This also increases the delay to produce the random
output at Q, depending on the longest distance from the MUX output
to an SR-FF instance. As a result, both the longer wire from the
MUX output and the longer transition time due to metastability
decrease the timing performance of an SR-FF based PUF during
regular operation. Furthermore, such an architecture may be
susceptible to a key-guessing attack under a single clock pulse.
The architecture in Fig. 2 would also be biased towards variations
in the length and width of the connecting wires, which in turn
reduces the impact of the transistors’ local variation. In short,
the greater the depth of the PUF timing paths, the less the
response depends on transistor behavior.
[Figure 3: 4x4 grid of 16 SR flip-flops, SR FF [0] to SR FF [15], divided into four groups SR [3-0], SR [7-4], SR [11-8], and SR [15-12]. Each group is driven by one of MUX 1 to MUX 4, which has a 2-bit SR-FF select input and a mode select input choosing between PUF mode and regular mode.]
Fig. 3: Centroid of 16-bit SR Flip-Flops
TABLE I: Specifications of wireload model in three design
corners (cap. unit, fF and res. unit, KΩ)
Wireload    Best                  Typical               Worst
Model       Cap.      Res.        Cap.      Res.        Cap.      Res.
8000        0.00028   1.42E-03    0.000312  1.57E-03    0.000343  1.73E-03
16000       0.000512  1.15E-03    0.000569  1.28E-03    0.000625  1.41E-03
35000       0.000243  1.07E-03    0.00027   1.19E-03    0.000297  1.31E-03
70000       0.000128  9.00E-03    0.000143  1.00E-02    0.000157  1.10E-02
TABLE II: Wire specifications across all wireload models in
three design corners
Wire width     (0.45, 0.9, 1.35, 1.8, 2.25)
Wire spacing   (0.45, 0.9, 1.35, 1.8, 2.25, 2.7, 3.15, 3.6,
                4.05, 4.5, 4.95, 5.4, 5.85, 6.3, 6.75, 7.2)
Fig. 3 shows a centroid architecture (4x4 grid) of 16-bit
SR-FFs built upon Fig. 2 with additional MUXs to improve
the delay and thwart the key-guessing attack. It also results
in improved bit distribution by preventing edge effects [11].
Each multiplexer has three selector bits, of which two are
used to select an SR-FF in a grid and the remaining one
determines the mode (PUF or normal) selection. A simple controller
is embedded in the original architecture to aid in the signal
extraction process. Depending on the number of controllable
MUXs, the size of the partitions can increase or decrease.
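The selector scheme of the centroid architecture in Fig. 3 (three selector bits per MUX, two choosing an SR-FF within a group of four and one choosing the mode) can be sketched as a small decoder. The bit ordering (mode bit as the most significant bit), the encoding of 1 as PUF mode, and the function name `decode_select` are assumptions made for illustration; the paper does not specify them.

```python
def decode_select(mux_index, selector):
    """Map a MUX and its 3-bit selector to (SR-FF index, mode).

    Assumptions (not given in the paper):
      - mux_index 0..3 corresponds to MUX 1..MUX 4, which serve the
        groups SR[3-0], SR[7-4], SR[11-8], and SR[15-12] respectively.
      - selector bit 2 is the mode bit (1 = PUF, 0 = regular).
      - selector bits 1..0 pick one of the four SR-FFs in the group.
    """
    if not 0 <= mux_index < 4:
        raise ValueError("mux_index must be in 0..3")
    if not 0 <= selector < 8:
        raise ValueError("selector must be a 3-bit value")
    mode = "PUF" if (selector >> 2) & 1 else "regular"
    ff_in_group = selector & 0b11
    return mux_index * 4 + ff_in_group, mode


if __name__ == "__main__":
    print(decode_select(2, 0b101))
```

With these assumptions, MUX 3 with selector `101` addresses SR FF [9] in PUF mode. Changing the number of MUXs or the group size changes the partitioning, as noted above.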
IV. EXPERIMENTAL RESULTS
We perform Monte Carlo (MC) simulations of the SR-FF PUF
at SPICE level using Synopsys HSPICE for three CMOS
processes (90nm, 45nm, and 32nm) [12]. MC can perform
device variability analysis within the six-sigma limit, hence the
Challenge-Response Pairs (CRPs) collected using MC are
comparable to CRPs from manufactured ICs. We simulate the PUF
structure for 1000 iterations, analogous to 1000 different dies
on a 300mm wafer, at nominal voltage (1V). Several works [2],
[13], [14] in the literature have validated PUF designs through
SPICE-level simulations. We then evaluate the PUF responses
according to the parameters proposed by Maiti et al. [15].
To model the impact of process variations on the channel length,
we keep length variability within 15% (intra-die) and 33%
(inter-die) of the nominal value to generate CRPs [16]. We
also report the performance overhead of physical synthesis for
five RTL designs [17] with the centroid architecture.
Uniqueness: Uniqueness measures inter-chip variation. We measure
it by calculating the pair-wise Hamming Distance (HD) between the
responses of two dies. Ideally, two dies (chips) show a
distinguishable response (HD ∼ 50%) to a common challenge.
Fig. 4(a-c) shows the inter-chip HD for four different key
lengths. For all keys, we made two thousand comparisons to verify
uniqueness. The average HD for all key lengths is close to 49%.
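As a minimal sketch of the uniqueness metric described above, the following computes the average pair-wise inter-chip HD as a percentage of the response length. The function names and the sample responses are hypothetical and are not taken from the simulation data.

```python
from itertools import combinations


def hamming_distance(a, b):
    """Number of bit positions in which two equal-length responses differ."""
    if len(a) != len(b):
        raise ValueError("responses must have the same length")
    return sum(x != y for x, y in zip(a, b))


def uniqueness_percent(responses):
    """Average pair-wise HD over all die pairs, as a percentage of bits."""
    n_bits = len(responses[0])
    pairs = list(combinations(responses, 2))
    total = sum(hamming_distance(a, b) for a, b in pairs)
    return 100.0 * total / (len(pairs) * n_bits)


if __name__ == "__main__":
    # Made-up 4-bit responses from three dies to the same challenge.
    dies = [[0, 0, 0, 0], [1, 1, 0, 0], [1, 1, 1, 1]]
    print(round(uniqueness_percent(dies), 2))
```

A value near 50% indicates that, on average, half of the bits differ between any two dies.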
Reliability: We measure reliability from the Bit Error Rate (BER)
of PUF responses under intra-chip variation. Ideally, a PUF should
maintain the same response (100% reliable or 0% variation) to the
same challenge under different environmental conditions (supply
voltage, temperature). Fig. 4(d-f) shows the intra-die HD for five
key lengths in three process nodes at different temperatures
(0°C to 80°C). The reliability (HD = 0) for 16-, 32-, 64-, and
128-bit registers is 92.3%, 92.2%, 90.7%, and 92.7%, respectively.
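The intra-chip measurement can be sketched as follows. Under one reading of the metric above, reliability is the share of re-measurements whose HD to the reference response is zero; the sketch also reports the average BER. The function name `intra_chip_stats` and the sample data are hypothetical.

```python
def intra_chip_stats(reference, remeasurements):
    """Return (average BER in %, share of re-measurements with HD = 0 in %).

    `reference` is the response of one die at nominal conditions and
    `remeasurements` are responses of the same die to the same challenge
    under other conditions (for example, different temperatures).
    """
    n_bits = len(reference)
    hds = [
        sum(r != m for r, m in zip(reference, resp))
        for resp in remeasurements
    ]
    ber = 100.0 * sum(hds) / (len(hds) * n_bits)
    stable = 100.0 * sum(hd == 0 for hd in hds) / len(hds)
    return ber, stable


if __name__ == "__main__":
    ref = [0, 1, 1, 0]
    # Made-up re-measurements; one of the four has a single flipped bit.
    runs = [[0, 1, 1, 0], [0, 1, 1, 1], [0, 1, 1, 0], [0, 1, 1, 0]]
    print(intra_chip_stats(ref, runs))
```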
Uniformity/Randomness: Uniformity measures the ability
of a PUF to generate uncorrelated ‘0’s and ‘1’s in the response.
Ideally, a PUF should generate ‘0’s and ‘1’s with equal
probability in a response. This ensures resilience against
guessing the PUF response from a known challenge. The probability
of zero is bounded between 0.5 and 0.7 for the four key lengths in
Fig. 5. Although the ideal value of the probability is 0.5,
variability in gate delay due to process variability skews the
distribution of ‘0’s and ‘1’s.
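The per-position probability of zero plotted in Fig. 5 can be computed along the following lines. This is a minimal sketch; the function name `prob_zero_per_bit` and the sample responses are hypothetical.

```python
def prob_zero_per_bit(responses):
    """For each bit position, the fraction of responses that hold a 0."""
    n = len(responses)
    n_bits = len(responses[0])
    return [
        sum(1 for r in responses if r[i] == 0) / n
        for i in range(n_bits)
    ]


if __name__ == "__main__":
    # Made-up 3-bit responses from four dies.
    dies = [[0, 1, 0], [0, 0, 1], [1, 1, 0], [0, 1, 0]]
    print(prob_zero_per_bit(dies))
```

A value of 0.5 at every position corresponds to the ideal case; values between 0.5 and 0.7 indicate a bias towards ‘0’.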
Bit aliasing/Response collision: To evaluate bit aliasing,
we use the same set of responses used for uniqueness. The
average probability of collision is less than 30%, as shown
in Fig. 6. As the reference response is chosen randomly and
compared to the rest of the responses, an adversary can guess,
on average, less than 30% of the correct responses. Hence, the
generated responses are resistant to the key-guessing attack.
Physical Synthesis Analysis: We report the physical synthesis
results of designs from OpenCores [17]. We perform logic
synthesis with Synopsys Design Compiler and the layout
(floor planning, placement, and routing) of the mapped netlist
using Synopsys IC Compiler. We evaluate the area, power, and
timing overhead for SAED 90nm technology in three design
corners, namely, best, typical, and worst.
Table I lists the resistance and capacitance values required
during cell characterization to achieve the metastable state; the
capacitance values include both routing and parasitic
capacitance. The inter-transistor routing across all wireload
models is presented in Table II. We vary input voltages
(0.7V-1.32V) with (on_chip_variation) enabled during synthesis.
This confirms that the responses are not biased towards a
particular input voltage value and that an adversary cannot
further tamper with the device responses through aggressive
supply voltage changes. We maintain a 4x4 grid across all
designs to implement the centroid architecture and distribute it
randomly. Depending on the dimension of the grid, the total
number of grids would grow or shrink. Following that, we
report the overhead after physical synthesis in Table III. The
number of bits in Table III represents the possible key length of
each design. Across different wireload models of a particular
design corner, we observe more delay and power variations due to
variable resistance and capacitance. For the 8-bit uP, the
centroid architecture is adjacent to high-activity logic, hence we
see increased power, performance, and area (PPA) overhead. In the
remaining designs, the best case minimizes the area and delay
overhead, and in the worst case we see a reduction in power
overhead.
V. CONCLUSION
In this work, we have proposed to reuse the existing SR
flip-flops in a design and to harness their race condition for PUF
implementation. We embed a centroid architecture with SR FFs
so that PUF responses conform to local transistor variations
only. The generated responses exhibit better uniqueness,
randomness, and reliability, and reduced bit-aliasing, compared to
other metastability-based PUFs. For future work, we will evaluate
the uniqueness of SR-FF PUF responses using transient-noise-based
simulation and their resilience against adversarial machine
learning attacks.
REFERENCES
[1] M. Fyrbiak et al. Hardware reverse engineering: Overview and open
challenges. In 2017 IEEE IVSW, pages 88–94, July 2017.
[2] D. Lim et al. Extracting secret keys from integrated circuits. IEEE
TVLSI, 13(10):1200–1205, Oct 2005.
[3] B. Habib et al. Implementation of efficient SR-Latch PUF on FPGA and SoC
devices. Microprocessors and Microsystems, 53:92–105, 2017.
[Figure 4: overlay histograms of frequency versus Hamming distance. Panels (a)-(c): inter-chip HD for the 32nm, 45nm, and 90nm process nodes (legend: 128b, 64b, 32b, 16b). Panels (d)-(f): intra-chip HD for the 32nm, 45nm, and 90nm process nodes (legend: 256b, 128b, 64b, 32b, 16b).]
Fig. 4: Overlay Histogram of inter-chip HD (a-c) and intra-chip HD (d-f) for four key lengths in the three process nodes.
[Figure 5: probability of zero (Pr 0) versus bit position for the (a) 32nm, (b) 45nm, and (c) 90nm process nodes (legend: 128b, 64b, 32b, 16b).]
Fig. 5: Probability of zero in each bit position for four (16-, 32-, 64-, and 128-bit) key lengths in three process nodes.
[Figure 6: probability of bit collision versus bit position for the (a) 32nm, (b) 45nm, and (c) 90nm process nodes (legend: 128b, 64b, 32b, 16b).]
Fig. 6: Average probability of collision in each bit position.
TABLE III: Overhead comparison of design attributes in SR-FF PUF with centroid architecture
                                  Best-case (%)           Typical-case (%)        Worst-case (%)
Design               No. of bits  Area    Power   Delay   Area    Power   Delay   Area    Power   Delay
AES128               1072         0.009   4.065   2.622   0.017   1.301   3.836   0.320   0.473   6.671
DES                  1827         0.022   0.963   0.824   0.037   0.39    1.923   0.604   0.058   3.968
Triple DES           2083         0.010   0.698   0.781   0.035   0.510   1.858   0.711   0.067   3.095
8-bit uP             386          0.584   4.884   0       2.51    2.658   0.175   4.185   0.021   6.096
Canny Edge Detector  2027         0.109   1.587   1.354   2.487   0.164   5.812   3.585   0.101   6.676
Average                           0.146   2.439   1.116   1.017   1.004   2.720   1.881   0.144   5.301
[4] A. Ardakani and S. B. Shokouhi. A secure and area-efficient FPGA-based
SR-Latch PUF. In 2016 IST, pages 94–99, Sept 2016.
[5] S. Wang et al. Register PUF with No Power-Up Restrictions. In 2018
IEEE ISCAS, pages 1–5, May 2018.
[6] L. Bossuet et al. A PUF Based on a Transient Effect Ring Oscillator
and Insensitive to Locking Phenomenon. IEEE TETC, 2(1):30–36, 2014.
[7] S. S. Zalivaka et al. Multi-valued Arbiters for quality enhancement of
PUF responses on FPGA implementation. In 2016 ASP-DAC, pages 533–538,
Jan 2016.
[8] S. A. Islam and S. Katkoori. High-level synthesis of key based obfuscated
RTL datapaths. In 2018 19th International Symposium on Quality
Electronic Design (ISQED), pages 407–412, March 2018.
[9] S. A. Islam et al. Empirical Word-Level Analysis of Arithmetic
Module Architectures for Hardware Trojan Susceptibility. In 2018 Asian
Hardware Oriented Security and Trust Symposium (AsianHOST), pages
109–114, Dec 2018.
[10] J. Danger et al. Analysis of Mixed PUF-TRNG Circuit Based on
SR-Latches in FD-SOI Technology. In 2018 DSD, pages 508–515, Aug 2018.
[11] Y. Su, J. Holleman, and B. P. Otis. A Digital 1.6 pJ/bit Chip Identification
Circuit Using Process Variations. IEEE Journal of Solid-State Circuits,
43(1):69–77, Jan 2008.
[12] Predictive Technology Model. http://ptm.asu.edu/.
[13] G. E. Suh and S. Devadas. Physical Unclonable Functions for Device
Authentication and Secret Key Generation. In 2007 44th ACM/IEEE DAC,
pages 9–14, June 2007.
[14] U. Rührmair et al. PUF Modeling Attacks on Simulated and Silicon Data.
IEEE TIFS, 8(11):1876–1891, Nov 2013.
[15] A. Maiti et al. A Systematic Method to Evaluate and Compare the
Performance of Physical Unclonable Functions, 2011.
[16] S. Nassif. Delay variability: sources, impacts and trends. In 2000 IEEE
ISSCC, pages 368–369, Feb 2000.
[17] OpenCores. https://opencores.org/.
