1,891 research outputs found

    New approaches for the design of low-complexity radix-based FFT and FHT algorithms

    Get PDF
    The discrete Fourier transform (DFT) and discrete Hartley transform (DHT) play a crucial role in one- and multi-dimensional digital signal processing applications. Traditionally, the main concern in the design of fast Fourier transform (FFT) and fast Hartley transform (FHT) algorithms has been the reduction of the arithmetic complexity. However, with the recent advances in the digital technology and the present demands of such transforms in low-power high-performance real-time applications, a more comprehensive treatment of the computational and structural complexities must be considered in the design of the algorithms. The objective of this thesis is to design one- and multi-dimensional FFT and FHT algorithms that address the problem of reducing the number of arithmetic operations, data transfers, address generations, and twiddle factor evaluations or accesses to the lookup table, while possessing features such as simplicity, regularity, modularity, easy indexing scheme, and butterfly-style and in-place computations that are highly desirable characteristics for software or hardware implementations of the algorithms. To achieve these objectives, radix-based algorithms are proposed by introducing new decomposition strategies, efficient index mappings, and by an appropriate use of the Kronecker product. A general decomposition method, which is based on the radix-2 approach, valid for any dimension and applicable to both the DHT and DFT, and which significantly reduces the complexity of the FHT algorithms, is proposed. This method enables us to develop multidimensional FHT and FFT algorithms. A new approach for computing the DFT and DHT using a unified structure is proposed by establishing a close relationship, valid for any dimension, between the radix-2 based FHT and FFT algorithms. An efficient method, based on the radix-2 approach, for pruning output samples of a 1-D or 2-D DFT is proposed by grouping in its 1-D or 2-D FFT algorithm all the stages that involve unnecessary operations into a single stage and by introducing a new recursive technique for the computations required in the resulting stage. A technique is presented to improve the performance of the radix-4, radix-8 and radix-16 FFT algorithms in terms of the number of twiddle factor evaluations or accesses to the lookup table without any increase in the computational or structural complexities of the algorithms. In order to take advantage of the lowest structural complexity provided by the radix-2 approach and reduced computational complexity offered by the radix-4 approach, a technique suitable for combining these two approaches is introduced in order to develop efficient 3-D FFT and FHT algorithms. A radix-2/8 approach for reducing the complexity in the computation of the 1-D DFT and DHT of lengths N = q {604} 2 m is proposed by appropriately mixing the radix-2 and radix-8 index maps. This approach is extended to 2-D and 3-D DFTs. It is shown that the proposed radix-2/8 approach is superior to all the other existing radix-based approaches in providing low-complexity 1-D, 2-D and 3-D FFT, and 1-D FHT algorithms

    New approaches for the design of low-complexity radix-based FFT and FHT algorithms

    Get PDF
    The discrete Fourier transform (DFT) and discrete Hartley transform (DHT) play a crucial role in one- and multi-dimensional digital signal processing applications. Traditionally, the main concern in the design of fast Fourier transform (FFT) and fast Hartley transform (FHT) algorithms has been the reduction of the arithmetic complexity. However, with the recent advances in the digital technology and the present demands of such transforms in low-power high-performance real-time applications, a more comprehensive treatment of the computational and structural complexities must be considered in the design of the algorithms. The objective of this thesis is to design one- and multi-dimensional FFT and FHT algorithms that address the problem of reducing the number of arithmetic operations, data transfers, address generations, and twiddle factor evaluations or accesses to the lookup table, while possessing features such as simplicity, regularity, modularity, easy indexing scheme, and butterfly-style and in-place computations that are highly desirable characteristics for software or hardware implementations of the algorithms. To achieve these objectives, radix-based algorithms are proposed by introducing new decomposition strategies, efficient index mappings, and by an appropriate use of the Kronecker product. A general decomposition method, which is based on the radix-2 approach, valid for any dimension and applicable to both the DHT and DFT, and which significantly reduces the complexity of the FHT algorithms, is proposed. This method enables us to develop multidimensional FHT and FFT algorithms. A new approach for computing the DFT and DHT using a unified structure is proposed by establishing a close relationship, valid for any dimension, between the radix-2 based FHT and FFT algorithms. An efficient method, based on the radix-2 approach, for pruning output samples of a 1-D or 2-D DFT is proposed by grouping in its 1-D or 2-D FFT algorithm all the stages that involve unnecessary operations into a single stage and by introducing a new recursive technique for the computations required in the resulting stage. A technique is presented to improve the performance of the radix-4, radix-8 and radix-16 FFT algorithms in terms of the number of twiddle factor evaluations or accesses to the lookup table without any increase in the computational or structural complexities of the algorithms. In order to take advantage of the lowest structural complexity provided by the radix-2 approach and reduced computational complexity offered by the radix-4 approach, a technique suitable for combining these two approaches is introduced in order to develop efficient 3-D FFT and FHT algorithms. A radix-2/8 approach for reducing the complexity in the computation of the 1-D DFT and DHT of lengths N = q {604} 2 m is proposed by appropriately mixing the radix-2 and radix-8 index maps. This approach is extended to 2-D and 3-D DFTs. It is shown that the proposed radix-2/8 approach is superior to all the other existing radix-based approaches in providing low-complexity 1-D, 2-D and 3-D FFT, and 1-D FHT algorithms

    Low-power Programmable Processor for Fast Fourier Transform Based on Transport Triggered Architecture

    Get PDF
    This paper describes a low-power processor tailored for fast Fourier transform computations where transport triggering template is exploited. The processor is software-programmable while retaining an energy-efficiency comparable to existing fixed-function implementations. The power savings are achieved by compressing the computation kernel into one instruction word. The word is stored in an instruction loop buffer, which is more power-efficient than regular instruction memory storage. The processor supports all power-of-two FFT sizes from 64 to 16384 and given 1 mJ of energy, it can compute 20916 transforms of size 1024.Comment: 5 pages, 4 figures, 1 table, ICASSP 2019 conferenc

    Efficient FPGA implementation of high-throughput mixed radix multipath delay commutator FFT processor for MIMO-OFDM

    Get PDF
    This article presents and evaluates pipelined architecture designs for an improved high-frequency Fast Fourier Transform (FFT) processor implemented on Field Programmable Gate Arrays (FPGA) for Multiple Input Multiple Output Orthogonal Frequency Division Multiplexing (MIMO-OFDM). The architecture presented is a Mixed-Radix Multipath Delay Commutator. The presented parallel architecture utilizes fewer hardware resources compared to Radix-2 architecture, while maintaining simple control and butterfly structures inherent to Radix-2 implementations. The high-frequency design presented allows enhancing system throughput without requiring additional parallel data paths common in other current approaches, the presented design can process two and four independent data streams in parallel and is suitable for scaling to any power of two FFT size N. FPGA implementation of the architecture demonstrated significant resource efficiency and high-throughput in comparison to relevant current approaches within literature. The proposed architecture designs were realized with Xilinx System Generator (XSG) and evaluated on both Virtex-5 and Virtex-7 FPGA devices. Post place and route results demonstrated maximum frequency values over 400 MHz and 470 MHz for Virtex-5 and Virtex-7 FPGA devices respectively

    A class of AM-QFT algorithms for power-of-two FFT

    Full text link
    This paper proposes a class of power-of-two FFT (Fast Fourier Transform) algorithms, called AM-QFT algorithms, that contains the improved QFT (Quick Fourier Transform), an algorithm recently published, as a special case. The main idea is to apply the Amplitude Modulation Double Sideband - Suppressed Carrier (AM DSB-SC) to convert odd-indices signals into even-indices signals, and to insert this elaboration into the improved QFT algorithm, substituting the multiplication by secant function. The 8 variants of this class are obtained by re-elaboration of the AM DSB-SC idea, and by means of duality. As a result the 8 variants have both the same computational cost and the same memory requirements than improved QFT. Differently, comparing this class of 8 variants of AM-QFT algorithm with the split-radix 3add/3mul (one of the most performing FFT approach appeared in the literature), we obtain the same number of additions and multiplications, but employing half of the trigonometric constants. This makes the proposed FFT algorithms interesting and useful for fixed-point implementations. Some of these variants show advantages versus the improved QFT. In fact one of this variant slightly enhances the numerical accuracy of improved QFT, while other four variants use trigonometric constants that are faster to compute in `on the fly' implementations

    Non-power-of-Two FFTs: Exploring the Flexibility of the Montium TP

    Get PDF
    Coarse-grain reconfigurable architectures, like the Montium TP, have proven to be a very successful approach for low-power and high-performance computation of regular digital signal processing algorithms. This paper presents the implementation of a class of non-power-of-two FFTs to discover the limitations and Flexibility of the Montium TP for less regular algorithms. A non-power-of-two FFT is less regular compared to a traditional power-of-two FFT. The results of the implementation show the processing time, accuracy, energy consumption and Flexibility of the implementation

    Radix-2 x 2 x 2 algorithm for the 3-D discrete hartley transform

    Get PDF
    The discrete Hartley transform (DHT) has proved to be a valuable tool in digital signal/image processing and communications and has also attracted research interests in many multidimensional applications. Although many fast algorithms have been developed for the calculation of one- and two-dimensional (1-D and 2-D) DHT, the development of multidimensional algorithms in three and more dimensions is still unexplored and has not been given similar attention; hence, the multidimensional Hartley transform is usually calculated through the row-column approach. However, proper multidimensional algorithms can be more efficient than the row-column method and need to be developed. Therefore, it is the aim of this paper to introduce the concept and derivation of the three-dimensional (3-D) radix-2 2X 2X algorithm for fast calculation of the 3-D discrete Hartley transform. The proposed algorithm is based on the principles of the divide-and-conquer approach applied directly in 3-D. It has a simple butterfly structure and has been found to offer significant savings in arithmetic operations compared with the row-column approach based on similar algorithms

    A new Truncated Fourier Transform algorithm

    Full text link
    Truncated Fourier Transforms (TFTs), first introduced by Van der Hoeven, refer to a family of algorithms that attempt to smooth "jumps" in complexity exhibited by FFT algorithms. We present an in-place TFT whose time complexity, measured in terms of ring operations, is comparable to existing not-in-place TFT methods. We also describe a transformation that maps between two families of TFT algorithms that use different sets of evaluation points.Comment: 8 pages, submitted to the 38th International Symposium on Symbolic and Algebraic Computation (ISSAC 2013
    corecore