International Journal of Research in Advent Technology, Vol.2, No.6, June 2014 E-ISSN: 2321-9637
SQRT CSLA with Less Delay and Reduced Area Using FPGA Shrishti khurana1, Dinesh Kumar Verma2 Electronics and Communication P.D.M College of Engineering Shrishti.khurana16@gmail.com, er.dineshverma@gmail.com Abstract: Carry select adder (CSLA) is used to perform fast arithmetic operations in many data processing processors. It is also used in many computational systems to alleviate the problem of carry propagation delay by independently generating multiple carries and the select a carry to generate sum. Carry Select Adder consists of two ripple carry adder and multiplexer. This paper outlines the method to reduce the area and delay in the SQRT CSLA. Area and delay has been reduced by implying an efficient gate level modification. This paper has shown the comparison between 16, 32, 64 bit regular SQRT CSLA and 16, 32, 64 bit modified SQRT CSLA. The regular SQRT CSLA uses multiple pairs of Ripple carry adders to generate partial sum and carry by considering carry input Cin=0 and Cin=1, then the final sum and carry are selected by multiplexers and hence is not area efficient. In modified SQRT CSLA, Binary to Excess converter is used instead of RCA with cin=1 to achieve lower area and lower delay. Also, the performance of proposed designed is measured in terms of area, delay and synthesis are implemented in Xilinx ISE. The results analysis shows that the modified SQRT CSLA structure is better than the regular SQRT CSLA. Index terms:-field programmable logic device (FPGA), area efficient, CSLA, low delay. 1.
INTRODUCTION
Reduced area and high speed data path logic systems are the main area of research in VLSI system design. In digital adder, the speed of addition is limited by the time require to propagate a carry through the adder. The sum for each bit position in an elementary adder is generated sequentially only after the previous bit position has be summed and carry propagate into next position.
multiplexer (MUX). Adding two n-bit number with a carry select adder is done with two ripple carry adders in order to perform calculation twice one time with the assumption of carry being zero and other assuming carry one, then the final sum and carry are selected by the multiplexer(MUX). However CSLA is not area efficient because it uses multiple pair of ripple carry adder (RCA).
Carry select adder (CSLA) is the fastest adder used in data processing process to perform arithmetic function. The carry select adder is classified as linear CSLA and square root (SQRT) CSLA .Linear CSLA is by chaining a number of equal length adder stages. For n-bit adder, it could be implemented with equal length of carry select adder. Linear CSLA does always have the best performance. SQRT CSLA is also known as nonlinear CSLA. It is constructed by equalizing the delay through two carry chains and the block multiplexer signal from previous stage. A CSLA has good performance in propagation delay especially the non-linear one, however it compensate with large area. The SQRT CSLA can be implemented in different length.
The basic idea of the work is to use binary to excess-1 converter (BEC) instead of RCA with cin=1 in regular CSLA to achieve lower area and lower delay[2]-[4]. The main advantage of this BEC logic comes from lesser number of logic gates.
The CSLA is used in many computational system to alleviate the problem of carry propagation delay by independently generating multiple carries and then select a carry to generate the sum [1]. It consists of two ripple carry adder (RCA) and a
This work in brief is structured as follows. Section II deals with the delay and area equivalent methodologies of the basic adder blocks. Section III deal with structures and functions of BEC logic. Section IV presents the architecture of the regular and modified SQRT CSLA. Section V implementation methodology and finally work is concluded section IV. 2.
DELAY AND AREA EQUIVALENT METHODOLOGY OF THE BASIC ADDER BLOCKS The AND, OR and Inverter (AOI) implement on the XOR gate. The delay area methodology all gates to be made up of AOI, each delay equal to 1 unit and area equal to 1 unit[5]. Add up the 94