As the channel frequency selectivity becomes severer, the bit error rate (BER) performance of direct sequence spread spectrum (DSSS) signal transmission with rake combining degrades due to an increasing inter-path interference (IPI). Frequency-domain equalization (FDE) can replace rake combining with much improved BER performance in a severe frequency-selective fading channel. For FDE, accurate estimation of the channel transfer function is required. In this paper, we propose an iterative channel estimation that uses pilot chips which are timemultiplexed within each chip block for fast Fourier transform (FFT). The pilot acts as a cyclic-prefix of FFT block as well. The achievable BER performance is evaluated by computer simulation. It is shown that the proposed channel estimation has a very good tracking ability against fast fading.
Introduction
In direct sequence spread spectrum (DSSS) wireless transmissions, a coherent rake combining can be employed to exploit the channel frequency-selectivity for improving the transmission performance. Wideband DSSS technique has been adopted in the 3rd generation mobile communication systems for data transmissions of up to a few Mbps [1] . In the next generation mobile communication systems, much higher speed data transmission (e.g., higher than several 10 Mbps) is required. However, for such high-speed data transmission, the channel becomes severely frequencyselective [2] and too many rake fingers (or correlators) are required in DSSS and the transmission performance significantly degrades due to large inter-path interference (IPI) even if coherent rake combining is used.
Recently, orthogonal frequency division multiplexing (OFDM) and multi-carrier code division multiple access (MC-CDMA) [3] - [6] have been attracting much attention. However, OFDM and MC-CDMA signals have large peakto-average power ratio (PAPR) and thus, a linear transmit power amplifier with large peak power is required. More recently, single-carrier (SC) transmission using onetap frequency-domain equalization (FDE) has been gaining popularity [7] . We have also shown that the use of FDE can significantly improve the bit error rate (BER) performance of DSSS compared to rake combining [8] - [11] . The DSSS signal transmission has advantages that the problem of high PAPR can be alleviated and that the computational complexity of FDE does not depend on the degree of the channel frequency-selectivity. Accurate estimation of the channel transfer function (or the channel gain at each frequency) is necessary for FDE. Many works concerning channel estimation can be found in [11] - [19] . The pilot-assisted channel estimation schemes for DSSS (or DS-CDMA) using rake combining, proposed in [17] - [19] , can be applied to DSSS signal transmission using FDE. However, in Refs. [17] - [19] , the pilot chip block needs to be transmitted frequently to track against fast fading, and as a result, the transmission efficiency (or the data rate) decreases. Recently, a data-dependent pilot structure has been proposed in [20] for the DS-CDMA downlink with FDE. In a data-dependent pilot structure proposed in [20] , pilot chip sequence is inserted into each data block for channel estimation. Then, the cyclic prefix (CP) is added to avoid inter-block interference.
In this paper, we propose a new channel estimation scheme using a guard interval (GI) as a pilot which is suitable for FDE and can achieve a very good tracking ability against fast fading without loss of the transmission efficiency. Although the idea of channel estimation using the GI as the pilot has been suggested for the SC transmission with FDE in [7] , [21] , [22] , the implementation of the channel estimation scheme based on the idea was not presented in [7] , [21] . The channel estimation proposed in [22] uses a short pilot chip sequence of N g /2 chips and repeats it twice to form a CP of N g chips. The first pilot chip sequence of N g /2 chips plays a role of the CP of the second pilot chip sequence of N g /2 chips. Hence, channel estimation is possible only if the maximum time delay difference between the propagation paths is less than N g /2 chips. However, our proposed scheme can be used for a channel having maximum time delay difference up to N g chips.
The objective of this paper is to propose a new pilotassisted channel estimation scheme and to show how close is the BER performance of our proposed scheme to the ideal channel estimation performance. Therefore, only the uncoded BER performance is presented.
The remainder of this paper is organized as follows. The transmission system model of DSSS transmission with FDE is presented in Sect. 2. The proposed iterative channel estimation scheme is described in Sect. 3. In Sect. 4, the computer simulation results for the BER performance of DSSS transmission with FDE using the proposed channel Copyright c 2007 The Institute of Electronics, Information and Communication Engineers estimation scheme are presented. The paper is concluded in Sect. 5. Figure 1 shows the transmitter/receiver structure. In [8] - [10] , the transmit data chip sequence is divided into blocks of N c chips each and the last N g chips in each chip block are copied and inserted as the CP into the GI placed at the beginning of each block. However, in this paper, the pilot chip sequence of N g -chip length for channel estimation is inserted at the end of each chip block and the previous pilot chip sequence is used as the GI for the present block. The chip block structure is shown in Fig. 2 . The fast Fourier transform (FFT) window size is N c chips and the number N d of data chips in each block is N d = N c − N g . The transmission efficiency ζ is given by
Transmission System Model of DSSS with FDE
where N is the pilot block insertion period for the conventional block structure (i.e., a pilot block of N c chips is inserted after every N data chip blocks). The transmission efficiency for the block structure used in this paper is higher than the conventional one if N < (N c /N g ) 2 − 1. Moreover, the proposed channel estimation has a higher tracking ability than the pilot-assisted decision feedback channel estimation (PA-DFCE) [11] . This will be discussed later.
Chip-spaced discrete time representation is used throughout the paper. N d /SF data-modulated symbols are transmitted in each block, where SF is the spreading factor. Without loss of generality, we consider the transmission of one chip block of N c chips. The data symbol sequence and represents the ensemble average operation. The pilot-inserted (or the GIinserted) DSSS signal {s(t); t = 0∼N c −1}, to be transmitted, can be expressed using the equivalent baseband representation as
where E c and T c represent the chip energy and the chip length, respectively, and x represents the largest integer smaller than or equal to x and p(t) represents the pilot chip sequence of N g chips with |p(t)|=1. The transmitted signal is received by N r receive antennas at the receiver. Assuming that the channel has L independent propagation paths with T c -spaced distinct time delays {τ l ; l=0∼L−1}, the discrete-time impulse response h m (t) of the multipath channel experienced by the mth antenna is expressed as
where h m,l is the lth path gain with
The received signal r m (t), on the mth antenna, m=0∼N r − 1, can be expressed as where η m (t) represents a zero-mean additive white Gaussian noise (AWGN) process having variance equal to 2N 0 /T c with N 0 representing the single sided power spectrum density. Here, we have assumed block fading for the sake of brevity, where path gains remain constant over one chip block; however in the computer simulation, we assume continuous fading. At the receiver, the received signal r m (t) is decomposed into N c frequency components
where P(k) and D(k) are the kth frequency components of the transmitted pilot and data chip sequences, respectively, and H m (k) and Π m (k) are the channel gain and noise component due to AWGN at the kth frequency, respectively. They are given by
Since the pilot chip sequence p(t) and the spreading chip sequence c(t) are assumed to be random,
The iterative channel estimation will be described in Sect. 3. The channel estimate for H m (k) obtained after the ith iteration is denoted byĤ 
where w
is the MMSE weight. We assume that the residual interference is modeled by a zero-mean complex Gaussian process and the sum of residual interference and noise is treated as a new Gaussian noise. w
where 2σ 
Despreading is performed ons (i) (t) to obtain the decision variablẽ
based on which symbol decision is performed. The recovered data symbol is denoted byd (i) (n). In the proposed iterative channel estimation, a series of FDE operation, despreading, symbol decision, respreading and channel estimation is repeated a sufficient number of times. Finally, data-demodulation is carried out.
Iterative Channel Estimation
For computing the MMSE weight,Ĥ Figure 3 illustrates a detailed structure of the channel estimation block in Fig. 1 . Only the pilot is used for the initial channel estimation (i=0) and both the pilot and recovered data chips are used as new pilot chips for the first iteration onwards (i ≥ 1).
In the proposed scheme, the GI is used as the pilot so as not to reduce the transmission efficiency while achieving a good tracking ability against fading. Although the idea of channel estimation using the GI as the pilot has been suggested for the SC transmission with FDE in [7] , [21] , [22] , the implementation of the channel estimation scheme based on the idea was not presented in [7] , [21] . The channel estimation proposed in Ref. [22] uses a short pilot chip sequence (whose length is half the GI) and repeats it twice in the GI as shown in Fig. 4 . This scheme can only be applied to a channel having a maximum time delay difference less than N g /2 chips. On the other hand, our proposed channel estimation Fig. 4 Chip block structure of the channel estimation scheme proposed in Ref. [22] .
scheme can be applied to a channel having a maximum time delay difference up to N g chips. However, in a frequencyselective fading channel, the received pilot chip block suffers from a large interference from the data chip part in a block and hence, the channel estimation accuracy significantly degrades. Therefore, in the proposed scheme, the initial channel estimation is carried out after the data chip interference is suppressed by applying the rectangular windowing. Then, the channel estimation is carried out again using both the pilot and decision-feedback data sequence as a new pilot. This is repeated a sufficient number of times.
In this paper, we refer to [11] as the conventional channel estimation scheme to compare with the proposed iterative channel estimation. For the channel estimation for DSSS signal transmission with rake combining [17] - [19] , the channel impulse response is estimated using the timedomain correlation method and then, the channel transfer function (or the channel gain at each frequency) is estimated by applying the Fourier transform. Another way to directly estimate the channel transfer function is to use the reverse modulation (or removing the pilot modulation) in the frequency-domain as used in the channel estimation for DSSS signal transmission using FDE [11] and also for OFDM signal transmission [12] , [13] . Since the channel transfer function is the Fourier transform of the channel impulse response, the channel estimation proposed in [11] is equivalent to those in [17] - [19] .
Initial Channel Estimation Using Pilot Only (i = 0)
We assume that the maximum time delay difference ∆τ max between the propagation paths is less than the GI, i.e., ∆τ max < N g . To reduce the interference from the data chip sequence and the noise, the received signal r m (t) is replaced with zeros over the time interval of t = N g ∼N d − 1, where N d = N c − N g (or rectangular windowing), as shown in Fig. 5 . Then, N c -point FFT is applied to decompose the zero-replaced received signal into N c frequency components: 
where
m (k) are the interference components from the data chip sequence and the noise component, respectively, and are given by
We want to estimate H m (k). First, the instantaneous estimate of the channel gain H m (k) is obtained by removing the pilot modulation as
However,H
m (k) is perturbed by the interference and noise. To reduce the interference and noise and then improve the channel estimation, frequency-domain filtering [14] and delay time-domain windowing [11] - [13] can be applied. In this paper, the delay time-domain windowing method is used, as shown in Fig. 6 . First, the channel impulse response estimate {h 
where the 2nd and 3rd terms are the interference and noise components, respectively. The interference and noise components are uniformly distributed over the entire range of the delay time (τ = 0∼N c − 1). Assuming that the actual channel impulse response is present only within the GI, the impulse response beyond the GI can be replaced with zeros (or rectangular windowing):
Then, N c -point FFT is applied to {ĥ (0) m (τ)} to obtain the improved channel gain estimates {Ĥ (0) m (k)} as follows:
The variance 2σ 2 0 of the sum of residual interference and noise is necessary for computing the MMSE weight of Eq. (8), which can be estimated as
Channel Estimation Using Pilot and Data Chips (i ≥ 1)
The recovered data symbol sequence {d (i−1) (n)} obtained at the (i − 1)th iteration is re-spread and the pilot chip sequence is inserted to generate the transmitted signal replicaŝ (i) (t):
which is decomposed into N c frequency components 
is the kth frequency component of the recovered data symbol sequence {d (i−1) (n)} and is given bŷ
The substitution of Eqs. (5) and (20) into Eq. (19) gives
If the symbol decision in the (i − 1)th iteration is correct, the first term in Eq. (22) becomes
Comparing this with Eq. (13) shows that the channel estimation accuracy can be much improved.
To further improve the estimation accuracy, the delay time-domain windowing is applied, similar to the i=0 case. Applying N c -point IFFT to {H 
Complexity Consideration
The computational complexity of the proposed channel estimation is compared with the conventional one [11] , based on the number of FFT and IFFT operations. The complexity of the despreading operation is neglected since it is much less than those of FFT/IFFT operations. In the conventional pilot-assisted channel estimation scheme, the instantaneous channel gain estimate is obtained by reverse modulation of each frequency component of the received pilot block after performing FFT. Then, the IFFT operation and the delay time-domain windowing are applied to obtain the noise-suppressed instantaneous channel impulse response. Finally, the improved channel transfer function (or the channel gain at each frequency) is obtained by applying FFT. Therefore, the total number of FFT/IFFT operations is 3 per pilot block for the conventional channel estimation. If a decision feedback channel estimation is used as in [11] , the number of FFT/IFFT operations per block is 3.
Next, we consider the complexity of our proposed scheme. In the initial estimation, the received signal is replaced with zeros over the time interval t = N g ∼N d − 1 as shown in Sect. 3.1 and then decomposed into frequency components by applying FFT. The estimate of instantaneous channel transfer function is obtained by reverse modulation. Then, IFFT is applied to obtain the instantaneous channel impulse response. After the noise is suppressed by the delay time-domain windowing, the noise-suppressed channel transfer function to be used for FDE is obtained by applying FFT. After performing FDE, IFFT is applied to bring the frequency-domain signal into a time-domain chip sequence to obtain the despreading and tentative symbol decision. Therefore, the total number of FFT/IFFT operations required for the initial channel estimation is 4. The tentatively recovered data symbol sequence is fed back and decomposed into frequency components by applying FFT and the channel transfer function is estimated again using both the pilot and data. Therefore, the total number of FFT/IFFT operations per iteration is 3. In the proposed iterative channel estimation, the above procedure is repeated. However, as shown in Sect. 4, only two iterations are sufficient. As a result, the total number of FFT/IFFT operations for the pro-posed channel estimation is 10 per block.
The proposed scheme requires 7 more FFT/IFFT operations than the conventional one. Therefore, the proposed scheme provides better tracking ability against fast fading at the cost of increased complexity.
3.4
Comparison with the Channel Estimation Scheme Proposed in [22] How the channel estimation process of our scheme is different from that of [22] is described below. The channel estimation proposed in [22] uses a short pilot chip sequence (whose length is half the GI) and repeats it twice in the GI as shown in Fig. 4 . At a receiver, the received pilot chip sequence is decomposed into N g /2 orthogonal frequency components by applying N g /2-point FFT and the pilot modulation is removed from each component. Channel gains at N c frequencies need to be estimated for performing FDE. However, the channel estimation scheme in [22] provides channel gains at only N g /2 frequencies. Therefore, some interpolation techniques are needed. In [22] , the zero padding is used for interpolation. N g /2-point IFFT is first applied to the channel gain estimates of N g /2 frequencies to get the impulse response over a time delay interval of N g /2 chips. Using zero-padding and N cpoint FFT, the improved channel gain estimates of N c frequencies are obtained. On the other hand, our proposed scheme uses an N g -sample pilot, which is affected by the interference from the data chip sequence, and therefore, we first apply the zero replacement to the received chip block in order to suppress the noise and the interference.
In [22] , since the first pilot chip sequence of N g /2 chips plays a role of the GI for the second pilot chip sequence of N g /2 chips, no interference is produced from the data chip sequence if the maximum time delay difference between the propagation paths is less than N g /2 chips. However, our proposed channel estimation scheme can be used for a channel having a maximum time delay difference up to N g chips. Table 1 shows the computer simulation conditions. We assume block length (=FFT window) of N c =1024 chips, pilot chip length (or GI length) of N g =64 chips, and quadrature-phase shift keying (QPSK) data modulation. A chip-spaced L=16-path frequency-selective Rayleigh fading channel having an exponential power delay profile with a decay factor of α is assumed. Each path gain h m,l is generated based on the Jake's model [2] assuming 64 plane waves coming from all directions with the same amplitude. In the computer simulation, a sequence of 10 million bits is transmitted to compute the average BER. In the next generation mobile communication systems, much higher speed data transmissions than the present systems are required. To achieve high speed data transmissions for the given chip rate (or bandwidth), the spreading factor SF must be small. In high speed downlink packet access (HSDPA), SF=16 is used [23] . Therefore, SF=1∼16 is considered. Ideal sampling timing is also assumed at a receiver.
Computer Simulation Results
First, we examine the effect of iterative channel estimation (CE) for no antenna diversity (N r =1) case. Figure 7 shows the average BER performance as a function of the average received signal energy per bit-to-the AWGN power spectrum density ratio
with the number of iterations as a parameter when f D T c =10
−5 , decay factor α=0 dB, N r =1, and τ l = l (i.e., the maximum time delay difference between the propagation paths is less than N g /2). For comparison, ideal CE and the channel estimation scheme of [22] are also plotted (since the maximum time delay difference of the channel is 15 chips which is shorter than half the GI and therefore, the channel estimation of [22] can be applied). In our proposed scheme, the channel estimation accuracy is degraded due to the interference from the data chips and the performance severely degrades for no spreading case (SF=1), i.e., SC transmission. However, since the interference can be suppressed by the despreading process in the DSSS transmission case, our proposed channel estimation scheme provides better BER performances than the channel estimation scheme proposed in [22] when SF ≥ 4. Even with two iterations (i=2), the BER performance is significantly improved and the E b /N 0 degradation for BER= 10 −3 , from ideal CE, is only about 1.3 (1.6) dB (including a pilot insertion loss of 0.28 dB) when SF=4 (16) . The reason for the superiority of our channel estimation scheme to the one proposed in [22] is as follows. At the initial channel estimation (i=0), the channel estimation accuracy is poor due to the interference from the data chip sequence. However, since the channel estimation is repeated using both the pilot and the data chip replica obtained from the decision-feedback as a new pilot, the channel estimation accuracy improves. For the perfect decision feedback, the signal-to-noise power ratio (SNR) of the channel estimate is given from Eq. (22) as Fig. 7 Effect of iterative CE.
On the other hand, the average SNR for the channel estimation scheme proposed in [22] is given as
Comparison of Eqs. (24) and (25) shows that the proposed channel estimation in this paper provides N c /N g times larger SNR and hence our channel estimation accuracy is better.
The channel estimation scheme of [22] can only be applied to a channel having a maximum time delay difference less than N g /2 chips. However, our proposed channel estimation can be applied to a channel having 2-times longer maximum time delay difference. Furthermore, the proposed scheme can provide better BER performance at the cost of increased computational complexity.
We can see that almost the same E b /N 0 degradation from ideal CE is seen for both SF=4 and 16. Additional BER performance improvement obtained with i=3 is very small. Therefore, in what follows, the BER performance is evaluated with i=2 assuming SF=4.
We examine the impact of the delay factor α. Figure 8 shows the average BER performance with the proposed CE as a function of the average received E b /N 0 with α as a parameter when f D T c =10 −5 . The proposed CE provides better BER performance than the channel estimation scheme of [22] . The BER performance degrades as α becomes larger (or the channel frequency-selectivety becomes weaker) because of less frequency diversity gain, obtained by FDE. However, the E b /N 0 degradation from ideal CE for BER=10 −3 is almost insensitive to α and only about 1.3 dB. To properly perform FFT, the channel gains must be constant over the FFT window (i.e., a block of N c chips). However, the channel gains vary within one block for a fast fading. This distorts the signal frequency components obtained by FFT. Here, we examine the impact of the fading rate. Figure 9(a) shows the average BER performance using the proposed CE, as a function of f D T c . For comparison, the BER performances with PA-DFCE [11] and channel estimation scheme of [22] are also plotted. For PA-DFCE, we assumed that a pilot block of 1024 chips is inserted after every N data chip blocks of 1024 chips each (see Fig. 10 ). For the same pilot-to-data chip ratio as our proposed CE (i.e., the same data rate for the given chip rate), we have N=255 from Eq. (1). In PA-DFCE, the first order filter with the forgetting factor β (0 ≤ β ≤ 1) using decision feedback of the previous blocks is employed [11] . It can be seen that the proposed CE provides a slightly worse BER performance than PA-DFCE when f D T c ≤ 10 −5 . However, as the fading becomes faster (or f D T c becomes larger), the tracking ability of PA-DFCE tends to be lost, thereby degrading the achievable BER performance. It can also be seen that the proposed CE provides better BER performance than the channel estimation scheme of [22] . To clearly show that our proposed scheme is superior to the channel estimation scheme of [22] , Fig. 11 plots the BER dependency on the time delay when L=16. The lth path time delay τ l is set to τ l = l, 2l, 3l and 4l chips (l=0∼15). Accordingly, the maximum time delay difference ∆τ max becomes ∆τ max =15, 30, 45 and 60 chips, respectively. It can be seen that our proposed channel estimation scheme performs better than the channel estimation scheme of [22] . The BER performance of the proposed scheme is almost insensitive to τ l as far as the maximum time delay difference ∆τ max is shorter than the CP (N g =64). However, the BER performance of the channel estimation scheme of [22] degrades when τ l = 3l and 4l chips. This is because channel estima- tion is possible only if the maximum time delay difference ∆τ max is shorter than N g /2 chips.
Finally, we examine the BER performance with antenna diversity. It is seen from Fig. 12 that the use of receive antenna diversity is always beneficial. As the fading becomes faster (e.g., f D T c =10 −4 ), the achievable BER performance without antenna diversity (N r =1) degrades and a BER floor appears. However, with antenna diversity reception (N r =2 and 4), almost no performance degradation is seen even in a very fast fading environment with f D T c =10 −4 .
Conclusion
In this paper, an iterative channel estimation scheme suitable for DSSS signal transmission with FDE was proposed and the BER performance was evaluated by computer simulation. The proposed channel estimation scheme uses a pilot, which is time-multiplexed within the FFT block and therefore, a very good tracking ability against fast fading is achieved. Moreover, since the pilot also acts as the cyclicprefix, high transmission efficiency is achieved. The simulation results obtained in the paper can be summarized as follows:
(a) Since the pilot is time-multiplexed within the FFT block, channel estimation accuracy without iteration is very poor due to a large interference from the data chips. However, even two iterations provide sufficient estimation accuracy and the performance loss from the ideal channel estimation in E b /N 0 is as small as 1.3 dB when SF=4. (b) Even for a very fast fading (e.g., f D T c =10 −4 ), a good tracking ability against fading is obtained and the E b /N 0 degradation for BER=10 −4 is as small as 2.0 dB when two-antenna receive diversity is used.
The objective of this paper is to propose a new pilotassisted channel estimation scheme and to show how close is the BER performance of our proposed scheme to the ideal channel estimation performance. Therefore, in this paper, only the uncoded BER performance was presented. The evaluation of the coded BER performance using the proposed channel estimation scheme is left as a future study.
