Area Efficient Floating Point Addition Unit With Error Detection Logic  by Aswani, T.S. & Premanand, B.
 Procedia Technology  24 ( 2016 )  1149 – 1154 
Available online at www.sciencedirect.com
ScienceDirect
2212-0173 © 2016 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license 
(http://creativecommons.org/licenses/by-nc-nd/4.0/).
Peer-review under responsibility of the organizing committee of ICETEST – 2015
doi: 10.1016/j.protcy.2016.05.068 
International Conference on Emerging Trends in Engineering, Science and Technology 
(ICETEST - 2015) 
Area EŧŧŪcient Floating Point Addition Unit With Error Detection 
Logic 
Aswani T. S.a, Premanand B.b 
aPG Scholar, Department of ECE, Government Engineering College Idukki, Kerala 685606, India 
bAssistant Professor, Department of ECE, Government Engineering College Idukki, Kerala 685606, India 
 
 
Abstract 
Applications that involve large dynamic range make use of the floating point operations. Addition is one of the complex operation 
in a floating point unit. This paper proposes an area eŧŧŪcient floating-point addition unit with error detection logic. Existing 
Leading Zero Anticipators (LZA) and error detection logics helps to reduce the delay of the general floating point unit, but are not 
area eŧŧŪcient. Here a single precision area eŧŧŪcient floating point addition unit is designed using an eŧŧŪcient Carry Select Adder 
together with the error detection logic. EŧŧŪcient Carry Select Adder is developed using Binary to Excess-1 Converter instead of 
Ripple Carry Adder for cinľ‘1’. The proposed design is simulated using ModelSim and is tested on Xilinx. 
§c  2016 The Authors. Published by Elsevier Ltd. 
Peer-review under responsibility of the organizing committee of ICETEST - 2015. 
Keywords:  FPGA; IEEE 754; VHDL; ModelSim; LZA; Xilinx 
 
 
 
1. Introduction 
 
Numerical computation involving floating point operands is fundamental in many scientific applications. Because 
of the high dynamic range and good precision, floating point operations are used in several fields. Sign, exponent and 
mantissa are the three values by which a real number is represented in a standard floating point format(IEEE 754). 
IEEE 754 single precision binary format representation is shown in fig.1. The first field represents the one bit sign, 
second field consists of 8 bit exponent and the last field comprises of the 23 bit mantissa part. It’s value is calculated 
using the formula : 
r ľ(-1)sign  x 1.mantissa x 2exponent (1) 
The first bit is the most significand bit and it is hidden for a real number as it is always 1. Exponent field of the two 
numbers must be the same for direct addition or subtraction. In order to make it so, smaller one should be shifted right 
 
 
∗  Corresponding author. Tel.: Ĭ91-8289957498. 
E-mail address: t.s.aswani@gmail.com 
 
©   t r . li   l i r t . This is an open access article under the CC BY-NC-ND license 
(http://creativecommons.org/licenses/by-nc- d/4.0/).
Peer-review under responsibility of the organizing committee of ICETEST – 2015
1150   T.S. Aswani and B. Premanand /  Procedia Technology  24 ( 2016 )  1149 – 1154 
 
 
Fig. 1. IEEE 754 single precision binary format representation 
 
 
Fig. 2. General Floating point Addition unit with LZA 
 
 
corresponding to the diŧŧerence value. After that it can be added or subtracted. The result of the operation may not be 
in the normalized form. So the result must be shifted left in order to make the MSB 1. A Leading Zero Anticipator 
(LZA) is required to calculate the shift amount that is needed for the normalization operation. A simple floating point 
addition unit with LZA is illustrated in Fig. 2. LZA is a vital block to increase the performance of FP processors [3]. 
The LZA anticipates the leading zeros count of the result directly from the input operands(chance for a one bit error) 
and is obtained prior to the adder’s output. It works in parallel with the adder, thus it reduces the delay introduced by 
the normalization process. An error detection logic is needed to compensate the one bit error that may be introduced, 
due to LZA. 
 
 
 
2. Related Works on LZA and Error Detection Logics 
 
Regarding the error detection circuit, many algorithms have been proposed [4-6] as it plays a major role in LZAs. 
Fig.3 (a) shows a design that uses a compensation shifter for error correction. It’s major disadvantage is that, it lies in 
the critical path. Another design which make use of the carry bits from adder is shown in Fig. 3 (b). Here, a coarse 
shifter is utilized for shifting the adder output by the amount given by LZA. After that with the help of a carry select 
circuit, error detection logic check for carry at leading digits position and the error in LZA output is detected. At last 
fine shifter is used for the one bit error correction. 
The existing floating point unit with error detection logics reduces the delay, but they are not area eŧŧŪcient.      So 
there is a scope for designing an area eŧŧŪcient floating point unit with error detection logic. The proposed architecture 
make use of an area eŧŧŪcient carry select adder. 
The rest of the paper is organized as follows :  Section 3 describes the existing and modified carry select adder.  
In Section 4, the proposed floating point unit design with a novel approach for area reduction together with error 
detection is presented. Simulation results and analysis reports are presented in Section 5, followed by conclusion in 
Section 6. 
1151 T.S. Aswani and B. Premanand /  Procedia Technology  24 ( 2016 )  1149 – 1154 
 
 
 
Fig. 3. a) Error correction using compensator; (b) Error correction using carry select 
 
 
3. Existing and Modified Carry Select Adder 
 
 Existing carry select adder 
 
The carry select adder(CSLA) generally consists of two ripple carry adders and a multiplexer. Fig. 4 shows the 
regular structure of 16-bit Linear CSLA. Area consumption is more in this structure due to the dual Ripple- Carry 
Adder (RCA) and it is evident from the gate count of CSLA which is obtained using the gate count of basic blocks of 
CSLA. Table 1 shows the area count of basic blocks of CSLA. 
 
Table 1. Delay and area count of basic blocks of CSLA  
Adder Blocks Delay (ns) Area count 
XOR 3 5 
2:1 Mux 3 3 
Half Adder 3 6 
Full adder 6 13 
 
The evaluation steps that determine the gate count is given here. In the existing structure, the group2 consists of 
two sets of 4-bit RCA. Group2 has a gate count of 117 that is determined as follows : 
Gate count ľġ117 (No. of Full Adders ĬġNo. of Half Adders ĬġNo. of Multiplexers) 
No. of Full Adders ľġ91 (7 * 13) 
No. of Half Adders ľġ6 (1 * 6) 
No. of Multiplexers ľġ20 (5 * 4) 
Group2, 3 and 4 have the same gate count as they are uniform. Group1 have a gate count of 52 as it contains 4 full 
adders. Area count of all groups in the regular 16-bit linear CSLA are evaluated and listed in Table 2. 
 
Table 2. Area count of 16-bit regular linear CSLA groups 
Group Areacount 
Group1 52 
Group2 117 
Group3 117 
Group4 117 
1152   T.S. Aswani and B. Premanand /  Procedia Technology  24 ( 2016 )  1149 – 1154 
 
 
Fig. 4. 16-bit Conventional Carry Select Adder 
 
 Modified carry select adder 
 
Simple and eŧŧŪcient gate level modification is used in order to reduce the area of CSLA. For reducing area, the 
CSLA can be implemented by using a single RCA and an add-one circuit instead of using dual RCA. Structure of a 
4-bit BEC and the modified CSLA architecture developed using Binary to Excess-1 converter (BEC) is shown in Fig. 
5(a) and (b) respectively[7]. 
The 16-bit modified Linear CSLA has 4 groups of same size RCA and BEC. Each group contains one RCA, one 
BEC and Mux. In the modified Linear CSLA, the group2 has one 4-bit RCA which has 3 FA and 1 HA for carry- 
in(Cin) ľġ0. Instead of another 4-bit RCA with Cin ľġ1, a 5-bit BEC is used which adds one to the output from  4-bit 
RCA. Then the area count of group2 is determined as follows : 
Gate count ľġ89 (FA ĬġHA ĬġMux ĬġġBEC) 
FA ľġ39 (3 * 13) 
HA ľġ6 (1 * 6) 
Mux ľġ20 (5 * 4) 
NOT ľġ1 
AND ľġ3 (3 * 1) 
XOR ľġ20 (4 * 5) 
Group2,3 and 4 have the same gate count. Group1 have a gate count of 52 as it contains 4 full adders. All the gate 
count are determined and is listed in Table 3. Comparing the gate count of existing and modified CSLA, it is evident 
that the modified architecture is area eŧŧŪcient. 
 
Table 3. Area count of 16-bit modified linear CSLA groups 
Group Area count 
Group1 52 
Group2 89 
Group3 89 
Group4 89 
4. Proposed Floating Point Unit with Error Detection Logic 
The proposed architecture with LZA and error detection logic make use of the modified carry select adder. Among 
the various error detection logics, the one that uses a carry select circuit to check for carry at leading digits position 
and detects the error in LZA output is chosen for the modification. First, the adder output is shifted by shift amount, 
given by LZA, using a coarse shifter.  Error detection logic uses modified carry select adder using BEC to check for 
1153 T.S. Aswani and B. Premanand /  Procedia Technology  24 ( 2016 )  1149 – 1154 
 
Fig. 5. (a) 4-bit BEC (b) Block diagram of BEC eŧŧŪcient CSLA 
 
Fig. 6. Error correction logic using modified carry select adder 
 
carry at leading digits position and detects the error in LZA output. This one bit error correction is done using a fine 
shifter. The existing logic with regular carry select adder is replaced with carry select adder using BEC. Modified 
carry select adder using BEC is discussed in the previous section and it is evident that the area is reduced to a great 
extent which in turn helps to reduce the power. Thus area and power eŧŧŪcient logic is obtained. 
Fig. 6 shows the logic which uses the carry select adder using binary to excess-1 converter. This logic is used 
instead of the normal leading zero counter in the general floating point unit. LZA replaces LZC and the modified 
error detection logic replaces the existing one to get an eŧŧŪcient floating point addition unit in terms of area, delay and 
power. 
 
 
5. Simulation Results 
 
The eŧŧŪcient single precision floating point addition unit is implemented in VHDL language and it is simulated 
using Modelsim and synthesized in Xilinx ISE 14.2. The VHDL code is verified using MATLAB[8-10]. Fig. 7 shows 
the output waveform of single precision floating point adder. Table 4 shows the comparison of general floating point 
unit, floating point unit with lza, and the proposed architecture in terms of area, delay and power. Area is evaluated in 
terms of no. of LUT’s and occupied slices. 
1154   T.S. Aswani and B. Premanand /  Procedia Technology  24 ( 2016 )  1149 – 1154 
 
 
Fig. 7. Output Waveform 
Table 4. Comparison of area, power and delay of diŧŧerent architectures 
 
 
 
 
 
 
 
6. Conclusion 
 
An eŧŧŪcient approach is proposed in this paper to reduce the area and power of Floating point addition unit with 
error detection logic by using eŧŧŪcient carry select adder using BEC. The outputs of conventional and the modified 
designs has been checked and getting the correct and the same result. With the help of ISE Design Suite 14.2, design 
is synthesized and results are obtained. 
The compared results shows that the architecture using LZA is eŧŧŪcient in terms of delay. But they are not area 
eŧŧŪcient. The proposed architecture has a reduced area and power compared to architecture using LZA with a slight 
increase in delay. Comparing with general structure it has reduced delay. So in eŧŧect the proposed architecture is an 
eŧŧŪcient architecture in terms of delay and area, thus leads to a good alternative for adder implementation for many 
data processing processors to perform fast Arithmetic functions. 
 
References 
 
[1] Institute of Electrical and Electronics. Standard for Binary floating point arithmetic.IEEE Std 754-1985 
[2]  B. Parhami. Computer Arithmetic: Algorithms and Hardware Designs. Oxford University Press, 2000. 
[3] Giorgos Dimitrakopoulos, Kostas Galanopoulos, Christos Mavrokefalidis, Dimitris Nikolos. Low Power Leading-Zero Counting and Antici- 
pation Logic for High-Speed Floating Point Units. In Proc. of, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 16, 
No. 7, pages 837-850, July 2008. 
[4]     Ali malik, Dongdong chenand Soek bum ko. Design tradeoŧŧġanalysis of floating point adders in FPGAs. Can. J. elect. Comput. Eng., 2008 
IEEE. 
[5] Loucas Louca, Todd A cook and William H. Johnson. Implementation of IEEE single precision floating point addition and multiplication on 
FPGAs.  1996 IEEE. 
[6] Z. Luo and M. Martonosi. Accelerating pipelined integer and floating-point accumulations in configurable hardware with delayed addition 
techniques. IEEE Transactions on Computers, vol. 49, no. 3, pp. 208-218, 2000. 
[7]     B.Ramkumar, and Harish M Kittur. Low Power and Area EŧŧŪcient Carry Select Adder. IEEE Transactions on Very Large Scale Integration 
(VLSI) Systems, pp. I-S (2012). 
[8] Simulink HDL Coder 1. User’s Guide. 2006-2010 by the MathWorks, Inc. 
[9] Hikmat N. Abdullah and Hussein A. Hadi. Design and Implementation of FPGA Based Software Defined Radio Using Simulink HDL Coder. 
Engineering and Technology Journal, Iraq, ISSN 1681-6900 01İ2010; Vol.28 (No.23):pp.6750-6767. 
[10]  Naresh Grover, M.K.Soni. Design of FPGA based 32-bit Floating Point Arithmetic Unit and verification of its VHDL code using  MATLAB. 
I.J. Information Engineering and Electronic Business, 2014, 1, 1-14 Published Online February 2014 in MECS 
Floating Point Adder Topology General Floating Point Unit Floating Point Unit Using LZA Prposed Architecture 
No. of 4 input LUT’s 703 854 760 
Occupied Slices 367 471 415 
Delay (ns) 64.93 31.989 39.522 
Total on chip power(W) .063 .07 .069 
