Applications such as address generation, encoding, decoding, data shifting, etc are of primary importance in many computing and processing applications. Design of Barrel shifters therefore demands more attention and the advent of quantum computation and reversible logic, design and implementation of all sub-systems in reversible logic has received more attention. Moore"s law in VLSI designs today is no more a simple reality, the device dimensions are shrinking exponentially and the circuit complexity is growing exponentially. Various low power design techniques are proposed and successfully achieved. Device scaling is limited by the power dissipation; and demands better power optimizations methods. Techniques like Energy recovery, Reversible Logic are becoming more and more prominent special optimization techniques in Low Power VLSI designs. Reversible logic opens tremendous avenues for power optimizations in the areas such as Quantum Computing, Nanotechnology, Sprintronics and Optical Computing. Reversibility plays an important role when energy efficient computations are to be designed. The objective of this work is to design a Universal Reversible Barrel Shifter that performs shifting left, right, rotates left and right. The performance characteristics of the existing design and the proposed design are compared with respect to transistor cost, Garbage outputs and Quantum Cost. The performance characteristics analysis is carried out in cadence digital design environment and CMOS implementation in cadence virtuoso.
INTRODUCTION 1.1 Motivation

Need for Low-power, area-efficient design
The need for low power, area-efficient and design is motivated by several factors, such as the emergence of portable systems, thermal considerations, reliability issues, and, finally, environmental concerns. The evolution of portable or mobile communication devices such as laptops, cellular phones, video games, etc. is the most important factor driving the need for low power design. The main reason behind the development of low power circuits is that many portable devices and their applications require low power dissipation and high throughput. The commercial success of portable or mobile devices depends significantly on their weight, cost, and battery life. In most cases, the cost and weight of batteries become a bottleneck that prevents the reduction of system cost and weight. Moreover, for most portable systems, the IC components consume a significant portion of the total system power. Portable devices have a strict demand for power consumption since they have limited battery capacity. Low power design also plays a significant role in high-performance integrated circuits such as microprocessors and other high-speed digital computational circuits. Due to the increase in clock frequency, there is a proportional increase in power dissipation. The power consumed by the integrated circuit is dissipated in the form of heat. This may lead to problems such as circuit degradation and operating failures.
The power consumption in microprocessors is projected to grow linearly in proportion to their die size and clock frequency. Various cooling systems have been introduced to reduce the heat from power dissipation and keep the chip temperature at an admissible level. This in turn has increased the packaging cost, which results in large revenue.
Reversible computing
In recent years, reversible computing system design is attracting a lot of attention. Reversible computing is based on two concepts: logic reversibility and physical reversibility. A computational operation is said to be logically reversible if the logical state of the computational device before the operation of the device can be determined by its state after the operation i.e., the input of the system can be retrieved from the output obtained from it. Irreversible erasure of a bit in a system leads to generation of energy in the form of heat. An operation is said to be physically reversible if it converts no energy to heat and produces no entropy. Landauer [1] has shown that for every bit of information lost in logic computations that are not reversible, kTlog2 joules of heat energy is generated, where k is Boltzmann"s constant and T the absolute temperature at which computation is performed. The amount of energy dissipation in a system increases in direct proportion to the number of bits that are erased during computation. Bennett showed that kTln2 energy dissipation would not occur, if a computation were carried out in a reversible way. Reversible computation in a system can be performed if the system is composed of reversible gates. The amount of energy dissipated in a system bears a direct relationship to the number of bits erased during computation and Reversible circuits do not lose information. In reversible logic there is one-to-one mapping between the input and output vectors and vice-versa. Reversible logic has applications in several technologies such as nanotechnology, DNA computing, low power design, optical computing, Spintronics and quantum computing.
Barrel shifters
In DSP, where multiple shifts are required to do the computations such as FFT, circular convolution etc. barrel shifters are used to produce multiple shifts in only on clock cycle. For a reversible DSP system, barrel shifter also needs to be designed using reversible gates.
LITERATURE SURVEY
Physical limitations placed on computation by heat dissipation were studied for many years. The usual digital computer program frequently performs operations that seem to throw away information about the computer's history, leaving the machine in a state whose immediate predecessor is ambiguous [1] . Such operations include erasure or overwriting of data, and entry into a portion of the program addressed by several different transfer instructions. In other words, the typical computer is logically irreversible -its transition function (the partial function that maps each whole-machine state onto its successor, if the state has a successor) lacks a single-valued inverse.
Landauer [1] has raised the question of whether logical irreversibility is an unavoidable feature of useful computers, arguing that it is, and has demonstrated the physical and philosophical importance of this question by showing that whenever a physical computer throws away information about its previous state it must generate a corresponding amount of entropy. Therefore, a computer must dissipate at least kTln2 of energy (about 3 X 10-21 joule at room temperature) for each bit of information it erases or otherwise throws away.
At this point of time Bennett [2] showed: An irreversible computer can always be made reversible by having it save all the information it would otherwise throw away. For example, the machine might be given an extra tape (initially blank) on which it could record each operation as it was being performed, in sufficient detail that the preceding state would be uniquely determined by the present state and the last record on the tape. However, as Landauer pointed out, this would merely postpone the problem of throwing away unwanted information, since the tape would have to be erased before it could be reused. It is therefore reasonable to demand of a useful reversible computer that, if it halts, it should have erased all its intermediate results, leaving behind only the desired output and the originally furnished input. (The machine must be allowed to save its input-otherwise it could not be reversible and still carry out computations in which the input was not uniquely determined by the output.) Generalpurpose reversible computers (Turing machines) satisfying these requirements indeed exist, and they need not be much more complicated than the irreversible computers on which they are patterned. Computations on a reversible computer take about twice as many steps as on an ordinary one and may require a large amount of temporary storage.
While designing reversible system, the designer has to keep track of constraints [3] : only one fan-out is allowed and loops are not permitted.
A circuit is said to be reversible if the input vector can be uniquely recovered from the output vector and there is a one to one correspondence between its input and output assignments [8] .Thus, the number of inputs and outputs in reversible logic circuits are equal. Reversible circuits allow the reproduction of the inputs from the observed [2] . Data shifting and rotating is often used operation , in this regard, barrel shifters which are capable of performing n-bit shifting and rotating of data in a single cycle, are normally used in embedded processors such as: digital signal processors [4] and high performance processors [5] , high-speed/low-power applications [6] etc. The paper [7] shows the implementation of a reversible unidirectional barrel shifter and the second paper [9] shows the optimization of the same.
DESIGN METHODOLOGY 3.1 Basic reversible gates
Reversible gate: For a gate to be reversible the input and output should be one to one mapped (bijective) [2] . For instance Feynman gate has two inputs and two outputs; which are one to one mapped as in the Figure 1 .
Figure 1: Feynman Gate and its input and output relation.
In the process of designing a reversible system, using reversible gates, certain unused outputs are produced which are called as Garbage Outputs. Garbage outputs are unavoidable but can be reduced.
Feynman Gate: Reversible 2*2 Feynman Gate (FE), its input and output logic relation is in the Figure 1 . Feynman Gate is used in the scenario when it"s required to produce multiple copies of an input signal, since only one fan-out is allowed in reversible logic gate [3] .
Fredkin gate: Reversible 3*3 Fredkin gate (FR), its input and output logic relation is in the Figure 2 . FG can be used as a 2*2 multiplexer. 
RLM gate:
The input and output relation of new reversible 3*3 RLM gate is defined in the Figure 3 .the function of this gate is direction control, left of right shift and also the number of shifts of the reversible barrel shifter. 
Shifting using logarithmic shifters.
A barrel shifter is a digital circuit that can shift a data word by a specified number of bits in one clock cycle. It can be implemented as multiple stages of multiplexers (mux.), and in such an implementation the output of one mux is connected to the input of the next mux in a way that depends on number of shifts. The number of shift control bits is given by n = log N, where N is number of bits and D is shift direction control. Table 1 . 
Existing left shift/rotate barrel shifter
The existing shifters in [4] and [5] are limited in terms of their incomplete universality and the unidirectional logarithmic shifter use multiplexers (Fredkin gate) and Feynman Gate for replicating the inputs. 
Proposed Universal Barrel Shifter
The proposed barrel shifter shown in Figure 6 can shift and rotate in both the directions. This enhancement is due to the new additional gate named as RLM gate (instantiated as JB in Figure 7 ) which manipulates the control bits "s0" and "s11" depending upon the value of direction control bit "d". 
IMPLEMENTATION AND RESULTS
The proposed design is functionally verified and the results are compared with existing [4] , [5] (as indicated in Figures 8  through 14 ).
The area, average power and timing report was obtained after obtaining the netlist for the structural model of the digital implementation. The transistor cost was found in analog flow. Functionality was verified in both analog and digital flow. The universal barrel shifter is built by adding the RLM reversible gate and the verification results are as follows. 
Design implementation and simulation in cadence virtuoso.
The schematic shown in the Figure 7 is a test circuit for input combination S=00, a square wave is applied to"d" to test for both the directions. Similar test circuits were built for other combinations and the results were verified for its functionality.
Design implementation and simulation in cadence digital design
For the HDL structural design, the test vectors for excitation has been provided, and the response is as shown in Figure 8 .
Here the input reference vector is i=1101. 
Synthesis
The block diagram after the synthesis clearly shows the RLM gate with an instance name jb1 block in Figure 9 , which forms the control logic. 
International Journal of Computer Applications (0975 -8887) Volume 40-No.15, February 2012
Area report
The Area report for the existing design [5] and the proposed designs are generated and listed in Tables 2 and 3 . 
Area Report of existing design.
Power report
The power report for the existing design [5] and the proposed designs are generated and listed in Tables 4 and 5 . 
Power Report of existing design
Timing report
The Timing report for the existing design [5] and the proposed designs are generated and shown in Figures 10 and 11 . The maximum, which is the worst case delay from input to the output pin, is shown in the Figures 10 and 11 . The Figure 10 and 11 show only the pins in the critical path.
Summary of the results
The performance contrast of the prior and the proposed barrel shifter design is tabulated as in table 6 . From the table 6 it is evident that that the proposed design offers additional functionality of bidirectional shifting and rotation with 16.55% increased delay, 24.47% increased power consumption and with 11.34% increased area. The reversible parameter contrast of the prior and the proposed barrel shifter designs is shown in Table 7 . Summarizing, the bi-directional functionality has been realized with reduced costs. Only at a marginal increase in the costs, both performance and universality are realized; this is indicated in Figure 12 . Design in [4] Design in [5] Proposed In order to optimize the performance of the circuit, the number of garbage outputs need to be reduced; this in turn reduces the area, power dissipation of the circuit.
ii. The number of garbage outputs may be reduced to 1 always irrespective of the size of the Shifter. In order to achieve this, a new control gate called as RLMn gate is proposed as shown in Figure 14 . Design the RLMn Gate reversible control gate, by adding another control signal "D" for direction control (left/right). The RLMn Gate maps the inputs d, s n-1 ... The garbage output G is given by Eq. 5.2 for all the sizes of shift register. The garbage output is used to make the RLMn Gate bijective (one to one mapping). RLMn Gate hence produces only one garbage output for all the cases irrespective of size of the shift register. Table 8 . Hence, determine the input output relationship by using the CAD tool or manually using Eq. 5. 
CONCLUSIONS AND FUTURE WORK
The existing unidirectional and new bi-directional reversible barrel shifter designs have been designed and verified for its functionality in cadence digital flow (using gscl45nm technology library) and also in cadence analog flow (using gpdk180 technology library). The proposed barrel shifter can shift and rotate in both left and right directions. The additional new RLM gate offers bi-directional functionality with only a marginal increase in the overheads such as quantum cost, garbage outputs, gate count and delay. The present design is of 4-bit size, similar design methodology can be used to design 8-bit and higher reversible barrel shifters. Many barrel shifter designs have been produced but nevertheless the designs are not impeccable. Hence there is always a scope to enhance the design; further optimization can be achieved by reducing the overheads.
