433 research outputs found
Efficient Fast-Convolution-Based Waveform Processing for 5G Physical Layer
This paper investigates the application of fast-convolution (FC) filtering
schemes for flexible and effective waveform generation and processing in the
fifth generation (5G) systems. FC-based filtering is presented as a generic
multimode waveform processing engine while, following the progress of 5G new
radio standardization in the Third-Generation Partnership Project, the main
focus is on efficient generation and processing of subband-filtered cyclic
prefix orthogonal frequency-division multiplexing (CP-OFDM) signals. First, a
matrix model for analyzing FC filter processing responses is presented and used
for designing optimized multiplexing of filtered groups of CP-OFDM physical
resource blocks (PRBs) in a spectrally well-localized manner, i.e., with narrow
guardbands. Subband filtering is able to suppress interference leakage between
adjacent subbands, thus supporting independent waveform parametrization and
different numerologies for different groups of PRBs, as well as asynchronous
multiuser operation in uplink. These are central ingredients in the 5G waveform
developments, particularly at sub-6-GHz bands. The FC filter optimization
criterion is passband error vector magnitude minimization subject to a given
subband band-limitation constraint. Optimized designs with different guardband
widths, PRB group sizes, and essential design parameters are compared in terms
of interference levels and implementation complexity. Finally, extensive coded
5G radio link simulation results are presented to compare the proposed approach
with other subband-filtered CP-OFDM schemes and time-domain windowing methods,
considering cases with different numerologies or asynchronous transmissions in
adjacent subbands. Also the feasibility of using independent transmitter and
receiver processing for CP-OFDM spectrum control is demonstrated
On the Comparison of Various Overhead Arrangements for Massive MIMO-OFDM Channel Estimation
Massive multi input multi output (MIMO) systems incorporate orthogonal frequency division multiplexing (OFDM) technology to render high data rate services for future wireless communication applications. The channel estimator (CE) employed by a reliable massive MIMO-OFDM system requires huge amount of overhead in the form of known and null data transmissions, hence limiting the system spectral efficiency (SE). Often, CE design is a tradeoff between SE and system reliability. In this paper, CE with three different overhead arrangements, namely time domain synchronous (TDS), comb type with cyclic prefix (CTCP), grid type with cyclic prefix (GTCP) are investigated and a GTCP based CE is proposed which offers both high SE and improved system reliability. The proposed CE uses autocorrelation based denoising threshold for channel impulse response (CIR) estimation and does not require any knowledge of channel statistics (KCS). A MIMO-OFDM system is simulated in a rayleigh fading channel environment with U-shaped doppler spectrum. From the bit error rate (BER) performance results in WiMax SUI-, Advanced Television Technology Center (ATTC) and Brazil A channel environments, it is verified that the proposed CE with GTCP overhead and proposed denoising scheme, indeed improves both SE and system reliability. Hence it is suitable for application in all massive MIMO-OFDM systems
Algorithm Development and VLSI Implementation of Energy Efficient Decoders of Polar Codes
With its low error-floor performance, polar codes attract significant attention as the potential standard error correction code (ECC) for future communication and data storage. However, the VLSI implementation complexity of polar codes decoders is largely influenced by its nature of in-series decoding. This dissertation is dedicated to presenting optimal decoder architectures for polar codes. This dissertation addresses several structural properties of polar codes and key properties of decoding algorithms that are not dealt with in the prior researches. The underlying concept of the proposed architectures is a paradigm that simplifies and schedules the computations such that hardware is simplified, latency is minimized and bandwidth is maximized.
In pursuit of the above, throughput centric successive cancellation (TCSC) and overlapping path list successive cancellation (OPLSC) VLSI architectures and express journey BP (XJBP) decoders for the polar codes are presented.
An arbitrary polar code can be decomposed by a set of shorter polar codes with special characteristics, those shorter polar codes are referred to as constituent polar codes. By exploiting the homogeneousness between decoding processes of different constituent polar codes, TCSC reduces the decoding latency of the SC decoder by 60% for codes with length n = 1024. The error correction performance of SC decoding is inferior to that of list successive cancellation decoding. The LSC decoding algorithm delivers the most reliable decoding results; however, it consumes most hardware resources and decoding cycles. Instead of using multiple instances of decoding cores in the LSC decoders, a single SC decoder is used in the OPLSC architecture. The computations of each path in the LSC are arranged to occupy the decoder hardware stages serially in a streamlined fashion. This yields a significant reduction of hardware complexity. The OPLSC decoder has achieved about 1.4 times hardware efficiency improvement compared with traditional LSC decoders. The hardware efficient VLSI architectures for TCSC and OPLSC polar codes decoders are also introduced.
Decoders based on SC or LSC algorithms suffer from high latency and limited throughput due to their serial decoding natures. An alternative approach to decode the polar codes is belief propagation (BP) based algorithm. In BP algorithm, a graph is set up to guide the beliefs propagated and refined, which is usually referred to as factor graph. BP decoding algorithm allows decoding in parallel to achieve much higher throughput. XJBP decoder facilitates belief propagation by utilizing the specific constituent codes that exist in the conventional factor graph, which results in an express journey (XJ) decoder. Compared with the conventional BP decoding algorithm for polar codes, the proposed decoder reduces the computational complexity by about 40.6%. This enables an energy-efficient hardware implementation. To further explore the hardware consumption of the proposed XJBP decoder, the computations scheduling is modeled and analyzed in this dissertation. With discussions on different hardware scenarios, the optimal scheduling plans are developed. A novel memory-distributed micro-architecture of the XJBP decoder is proposed and analyzed to solve the potential memory access problems of the proposed scheduling strategy. The register-transfer level (RTL) models of the XJBP decoder are set up for comparisons with other state-of-the-art BP decoders. The results show that the power efficiency of BP decoders is improved by about 3 times
Algorithm Development and VLSI Implementation of Energy Efficient Decoders of Polar Codes
With its low error-floor performance, polar codes attract significant attention as the potential standard error correction code (ECC) for future communication and data storage. However, the VLSI implementation complexity of polar codes decoders is largely influenced by its nature of in-series decoding. This dissertation is dedicated to presenting optimal decoder architectures for polar codes. This dissertation addresses several structural properties of polar codes and key properties of decoding algorithms that are not dealt with in the prior researches. The underlying concept of the proposed architectures is a paradigm that simplifies and schedules the computations such that hardware is simplified, latency is minimized and bandwidth is maximized.
In pursuit of the above, throughput centric successive cancellation (TCSC) and overlapping path list successive cancellation (OPLSC) VLSI architectures and express journey BP (XJBP) decoders for the polar codes are presented.
An arbitrary polar code can be decomposed by a set of shorter polar codes with special characteristics, those shorter polar codes are referred to as constituent polar codes. By exploiting the homogeneousness between decoding processes of different constituent polar codes, TCSC reduces the decoding latency of the SC decoder by 60% for codes with length n = 1024. The error correction performance of SC decoding is inferior to that of list successive cancellation decoding. The LSC decoding algorithm delivers the most reliable decoding results; however, it consumes most hardware resources and decoding cycles. Instead of using multiple instances of decoding cores in the LSC decoders, a single SC decoder is used in the OPLSC architecture. The computations of each path in the LSC are arranged to occupy the decoder hardware stages serially in a streamlined fashion. This yields a significant reduction of hardware complexity. The OPLSC decoder has achieved about 1.4 times hardware efficiency improvement compared with traditional LSC decoders. The hardware efficient VLSI architectures for TCSC and OPLSC polar codes decoders are also introduced.
Decoders based on SC or LSC algorithms suffer from high latency and limited throughput due to their serial decoding natures. An alternative approach to decode the polar codes is belief propagation (BP) based algorithm. In BP algorithm, a graph is set up to guide the beliefs propagated and refined, which is usually referred to as factor graph. BP decoding algorithm allows decoding in parallel to achieve much higher throughput. XJBP decoder facilitates belief propagation by utilizing the specific constituent codes that exist in the conventional factor graph, which results in an express journey (XJ) decoder. Compared with the conventional BP decoding algorithm for polar codes, the proposed decoder reduces the computational complexity by about 40.6%. This enables an energy-efficient hardware implementation. To further explore the hardware consumption of the proposed XJBP decoder, the computations scheduling is modeled and analyzed in this dissertation. With discussions on different hardware scenarios, the optimal scheduling plans are developed. A novel memory-distributed micro-architecture of the XJBP decoder is proposed and analyzed to solve the potential memory access problems of the proposed scheduling strategy. The register-transfer level (RTL) models of the XJBP decoder are set up for comparisons with other state-of-the-art BP decoders. The results show that the power efficiency of BP decoders is improved by about 3 times
Recommended from our members
Space-time-frequency methods for interference-limited communication systems
textTraditionally, noise in communication systems has been modeled as an additive, white Gaussian noise process with independent, identically distributed samples. Although this model accurately reflects thermal noise present in communication system electronics, it fails to capture the statistics of interference and other sources of noise, e.g. in unlicensed communication bands. Modern communication system designers must take into account interference and non-Gaussian noise to maximize efficiencies and capacities of current and future communication networks. In this work, I develop new multi-dimensional signal processing methods to improve performance of communication systems in three applications areas: (i) underwater acoustic, (ii) powerline, and (iii) multi-antenna cellular. In underwater acoustic communications, I address impairments caused by strong, time-varying and Doppler-spread reverberations (self-interference) using adaptive space-time signal processing methods. I apply these methods to array receivers with a large number of elements. In powerline communications, I address impairments caused by non-Gaussian noise arising from devices sharing the powerline. I develop and apply a cyclic adaptive modulation and coding scheme and a factor-graph-based impulsive noise mitigation method to improve signal quality and boost link throughput and robustness. In cellular communications, I develop a low-latency, high-throughput space-time-frequency processing framework used for large scale (up to 128 antenna) MIMO. This framework is used in the world's first 100-antenna MIMO system and processes up to 492 Gbps raw baseband samples in the uplink and downlink directions. My methods prove that multi-dimensional processing methods can be applied to increase communication system performance without sacrificing real-time requirements.Electrical and Computer Engineerin
Quasi-Synchronous Random Access for Massive MIMO-Based LEO Satellite Constellations
Low earth orbit (LEO) satellite constellation-enabled communication networks
are expected to be an important part of many Internet of Things (IoT)
deployments due to their unique advantage of providing seamless global
coverage. In this paper, we investigate the random access problem in massive
multiple-input multiple-output-based LEO satellite systems, where the
multi-satellite cooperative processing mechanism is considered. Specifically,
at edge satellite nodes, we conceive a training sequence padded multi-carrier
system to overcome the issue of imperfect synchronization, where the training
sequence is utilized to detect the devices' activity and estimate their
channels. Considering the inherent sparsity of terrestrial-satellite links and
the sporadic traffic feature of IoT terminals, we utilize the orthogonal
approximate message passing-multiple measurement vector algorithm to estimate
the delay coefficients and user terminal activity. To further utilize the
structure of the receive array, a two-dimensional estimation of signal
parameters via rotational invariance technique is performed for enhancing
channel estimation. Finally, at the central server node, we propose a majority
voting scheme to enhance activity detection by aggregating backhaul information
from multiple satellites. Moreover, multi-satellite cooperative linear data
detection and multi-satellite cooperative Bayesian dequantization data
detection are proposed to cope with perfect and quantized backhaul,
respectively. Simulation results verify the effectiveness of our proposed
schemes in terms of channel estimation, activity detection, and data detection
for quasi-synchronous random access in satellite systems.Comment: 38 pages, 16 figures. This paper has been accepted by IEEE JSAC SI on
3GPP Technologies: 5G-Advanced and Beyond. Copyright may be transferred
without notice, after which this version may no longer be accessibl
Multi-carrier CDMA using convolutional coding and interference cancellation
SIGLEAvailable from British Library Document Supply Centre-DSC:DXN016251 / BLDSC - British Library Document Supply CentreGBUnited Kingdo
- …