SYSTEM-LEVEL CHARACTERIZATION OF A REAL-TIME 4 4 MIMO-OFDM TRANSCEIVER ON FPGA

Similar documents
2002 IEEE International Solid-State Circuits Conference 2002 IEEE

Performance Analysis of n Wireless LAN Physical Layer

Wireless Communication Systems: Implementation perspective

An FPGA 1Gbps Wireless Baseband MIMO Transceiver

MIMO RFIC Test Architectures

4x4 Time-Domain MIMO encoder with OFDM Scheme in WIMAX Context

UK-China (B)4G Wireless MIMO Testbed: Architecture and Functionality

Next Generation Wireless Communication System

CHAPTER 3 MIMO-OFDM DETECTION

Nutaq OFDM Reference

Performance Evaluation of STBC-OFDM System for Wireless Communication

Field Experiments of 2.5 Gbit/s High-Speed Packet Transmission Using MIMO OFDM Broadband Packet Radio Access

OFDM and FFT. Cairo University Faculty of Engineering Department of Electronics and Electrical Communications Dr. Karim Ossama Abbas Fall 2010

1. Introduction. Noriyuki Maeda, Hiroyuki Kawai, Junichiro Kawamoto and Kenichi Higuchi

Basic idea: divide spectrum into several 528 MHz bands.

A Low Power and Low Latency Inter Carrier Interference Cancellation Architecture in Multi User OFDM System

The Case for Optimum Detection Algorithms in MIMO Wireless Systems. Helmut Bölcskei

The Optimal Employment of CSI in COFDM-Based Receivers

Implementation of High-throughput Access Points for IEEE a/g Wireless Infrastructure LANs

OFDMA PHY for EPoC: a Baseline Proposal. Andrea Garavaglia and Christian Pietsch Qualcomm PAGE 1

Amplitude and Phase Distortions in MIMO and Diversity Systems

Receiver Designs for the Radio Channel

Study of Turbo Coded OFDM over Fading Channel

A REAL TIME 4X4 MIMO-OFDM SDR FOR WIRELESS NETWORKING RESEARCH

Carrier Frequency Offset Estimation Algorithm in the Presence of I/Q Imbalance in OFDM Systems

Channel Estimation by 2D-Enhanced DFT Interpolation Supporting High-speed Movement

Capacity Enhancement in WLAN using

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ICCE.2012.

OFDM system: Discrete model Spectral efficiency Characteristics. OFDM based multiple access schemes. OFDM sensitivity to synchronization errors

ELEC E7210: Communication Theory. Lecture 11: MIMO Systems and Space-time Communications

2015 The MathWorks, Inc. 1

THE DESIGN OF A PLC MODEM AND ITS IMPLEMENTATION USING FPGA CIRCUITS

Maximizing MIMO Effectiveness by Multiplying WLAN Radios x3

With a lot of material from Rich Nicholls, CTL/RCL and Kurt Sundstrom, of unknown whereabouts

Practical issue: Group definition. TSTE17 System Design, CDIO. Quadrature Amplitude Modulation (QAM) Components of a digital communication system

A Complete Real-Time a Baseband Receiver Implemented on an Array of Programmable Processors

DSP IMPLEMENTATION OF HIGH SPEED WLAN USING OFDM

A Research Concept on Bit Rate Detection using Carrier offset through Analysis of MC-CDMA SYSTEM

DESIGN, IMPLEMENTATION AND OPTIMISATION OF 4X4 MIMO-OFDM TRANSMITTER FOR

Project: IEEE P Working Group for Wireless Personal Area Networks N

Rate and Power Adaptation in OFDM with Quantized Feedback

HOW DO MIMO RADIOS WORK? Adaptability of Modern and LTE Technology. By Fanny Mlinarsky 1/12/2014

Hardware Development of Baseband Transceiver and FPGA-Based Testbed in 8 8 and 2 2 MIMO-OFDM Systems

An OFDM Transmitter and Receiver using NI USRP with LabVIEW

ENHANCING BER PERFORMANCE FOR OFDM

Optimized BPSK and QAM Techniques for OFDM Systems

: IEEE C802.16e-04/533r5. IEEE Broadband Wireless Access Working Group <

Bit Error Rate Performance Evaluation of Various Modulation Techniques with Forward Error Correction Coding of WiMAX

Merging Propagation Physics, Theory and Hardware in Wireless. Ada Poon

ETSI Standards and the Measurement of RF Conducted Output Power of Wi-Fi ac Signals

Supplemental Slides: MIMO Testbed Development at the MPRG Lab

TSTE17 System Design, CDIO Lecture 7. Additional information resources. Testing. Check timing of the IP blocks Testing

UNIVERSITY OF MICHIGAN DEPARTMENT OF ELECTRICAL ENGINEERING : SYSTEMS EECS 555 DIGITAL COMMUNICATION THEORY

Using Modern Design Tools To Evaluate Complex Communication Systems: A Case Study on QAM, FSK and OFDM Transceiver Design

1

Technical Aspects of LTE Part I: OFDM

An FPGA Case Study: Narrowband COFDM Video Transceiver for Drones, UAV, and UGV. Produced by EE Times

Research Letter Throughput of Type II HARQ-OFDM/TDM Using MMSE-FDE in a Multipath Channel

CHAPTER 3 ADAPTIVE MODULATION TECHNIQUE WITH CFO CORRECTION FOR OFDM SYSTEMS

Comparison of ML and SC for ICI reduction in OFDM system

Presentation Outline. Advisors: Dr. In Soo Ahn Dr. Thomas L. Stewart. Team Members: Luke Vercimak Karl Weyeneth. Karl. Luke

Space Time Block Coding - Spatial Modulation for Multiple-Input Multiple-Output OFDM with Index Modulation System

Professor Paulraj and Bringing MIMO to Practice

Comparison of MIMO OFDM System with BPSK and QPSK Modulation

- 1 - Rap. UIT-R BS Rep. ITU-R BS.2004 DIGITAL BROADCASTING SYSTEMS INTENDED FOR AM BANDS

SOFTWARE IMPLEMENTATION OF a BLOCKS ON SANDBLASTER DSP Vaidyanathan Ramadurai, Sanjay Jinturkar, Sitij Agarwal, Mayan Moudgill, John Glossner

VLSI Implementation of Area-Efficient and Low Power OFDM Transmitter and Receiver

Implementation of OFDM Modulated Digital Communication Using Software Defined Radio Unit For Radar Applications

Outline / Wireless Networks and Applications Lecture 7: Physical Layer OFDM. Frequency-Selective Radio Channel. How Do We Increase Rates?

Keywords SEFDM, OFDM, FFT, CORDIC, FPGA.

AN FPGA IMPLEMENTATION OF ALAMOUTI S TRANSMIT DIVERSITY TECHNIQUE

UTILIZATION OF AN IEEE 1588 TIMING REFERENCE SOURCE IN THE inet RF TRANSCEIVER

What s Behind 5G Wireless Communications?

Venkatesan.S 1, Hariharan.J 2. Abstract

ni.com The NI PXIe-5644R Vector Signal Transceiver World s First Software-Designed Instrument

Advanced 3G & 4G Wireless Communication Prof. Aditya K. Jaganathan Department of Electrical Engineering Indian Institute of Technology, Kanpur

2.

OFDMA and MIMO Notes

A GENERIC ARCHITECTURE FOR SMART MULTI-STANDARD SOFTWARE DEFINED RADIO SYSTEMS

Real-time FPGA realization of an UWB transceiver physical layer

Ten Things You Should Know About MIMO

Flexible Radio - BWRC Summer Retreat 2003

10 Gbps Outdoor Transmission Experiment for Super High Bit Rate Mobile Communications

Performance Analysis of Cognitive Radio based WRAN over Rayleigh Fading Channel with Alamouti-STBC 2X1, 2X2&2X4 Multiplexing

OPEN SOURCE TRANSPARENCY FOR OFDM EXPERIMENTATION

Transmitting Multiple HD Video Streams over UWB Links

From Antenna to Bits:

Wireless LANs IEEE

One Cell Reuse OFDM/TDMA using. broadband wireless access systems


TSTE17 System Design, CDIO. General project hints. Behavioral Model. General project hints, cont. Lecture 5. Required documents Modulation, cont.

Baseband Receiver Design for IEEE ah

Symbol Timing Detection for OFDM Signals with Time Varying Gain

SourceSync. Exploiting Sender Diversity

ICI Mitigation for Mobile OFDM with Application to DVB-H

SDR OFDM Waveform design for a UGV/UAV communication scenario

EC 551 Telecommunication System Engineering. Mohamed Khedr

OFDM AS AN ACCESS TECHNIQUE FOR NEXT GENERATION NETWORK

Underwater communication implementation with OFDM

Research and Implementation of 2x2 MIMO-OFDM System with BLAST Using USRP-RIO

Transcription:

SYSTEM-LEVEL CHARACTERIZATION OF A REAL-TIME 4 4 MIMO-OFDM TRANSCEIVER ON FPGA Simon Haene, David Perels, and Wolfgang Fichtner Integrated Systems Laboratory, ETH Zurich, Switzerland email: {haene,perels,fw}@iis.ee.ethz.ch web: http://www.iis.ee.ethz.ch/ mimo ABSTRACT The performance of an FPGA-based MIMO-OFDM testbed is investigated through measurements. The setup includes two real-time terminals, supporting up to 4 spatial streams, and a wideband multipath channel emulator. The performance of the system, transmitting at data rates up to 216 Mbit/s over a MHz channel in the 2.4 GHz ISM band, is benchmarked under different channel scenarios. The impact of different algorithm choices at the receiver, including parameter estimation for synchronization and channel estimation, on system-level performance is also evaluated. The FPGA implementation results for the different PHY-layer subsystems provide relevant insight into possible tradeoffs between performance and silicon complexity. 1. INTRODUCTION Multiple-input multiple-output (MIMO) combined with orthogonal frequency division multiplexing (OFDM) modulation is the key technology for next generation wireless local area networks (WLANs) and will enable a significant increase in throughput and range. While the throughput is maximized by transmitting multiple spatial data streams concurrently (a technique called spatial multiplexing), the transmission reliability, and hence range, is increased by adopting space-time diversity schemes or beamforming. In parallel to the standardization of IEEE 82.11n, which is expected to become the first MIMO-OFDM WLAN standard, a number of prototypes have been reported. Given the demanding signal processing required by wideband multiantenna receiver algorithms, most of the real-time testbeds are based on dedicated VLSI circuits prototyped on modern field programmable gate arrays (FPGAs), rather than on digital signal processors. Incidentally, these FPGA circuits also provide information on the main bottlenecks and silicon area expected on application specific integrated circuits, which is the target technology for high-volume consumer products implementing a MIMO-OFDM physical (PHY) layer. The release of commercial products based on draft versions of the upcoming IEEE 82.11n standard has sparked a debate on the real-world performance of the new generation WLANs. Most of the initial measurements highlight interoperability issues and bottlenecks in the medium access control layer. The system-level measurements in this paper, instead, restrict the focus on the PHY layer, including the effects of finite wordlength digital signal processing, data conversion, and the radio frequency (RF) transceiver. Contribution: Reproducible measurements, based on a wideband multipath channel emulator, show the impact of various channel conditions on the achievable data rates. Furthermore, the analysis considers the impact of selected receiver algorithms on both throughput and increase in FPGA resources, hence highlighting possible tradeoffs between receiver performance and silicon area. Outline: The next section briefly describes the general setup and the hardware components of the measured real-time transceiver. In Sec. 3 the PHY layer digital signal processing operations are detailed and FPGA implementation results are reported. Measurement results are presented in Sec. 4, together with an introduction to the adopted performance metric. 2. TESTBED DESCRIPTION The testbed under consideration is based on the OFDM modulation parameters (such as FFT size, symbols duration, and data tone allocation) defined in the IEEE 82.11a singleantenna WLAN standard [1], but supports spatial multiplexing with antenna configurations up to 4 4. When transmitting four spatial streams, the data rate is increased by a factor of four compared to the single-antenna standard, achieving a peak data rate of 216 Mbit/s. As shown in Fig. 1, each terminal consists of a host-pc, a peripheral component interconnect (PCI) based FPGA platform for digital signal processing, and a multi-antenna RF transceiver for the signal conversion into the 2.4 GHz industrial, scientific, and medical (ISM) band. 2.1 PCI-Based FPGA Platform The employed PCI platform hosts three hardware modules: a large FPGA (Xilinx XC2V-6) module for baseband signal processing and two converter modules for digital-toanalog and analog-to-digital signal conversion. On an FPGA (Xilinx XC2V1-4) provided on each of the converter modules, the baseband signals are digitally converted to an intermediate carrier frequency (IF) of MHz. Hence, the total number of converters is reduced by a factor of two compared to traditional baseband systems with separate converters for the I and Q components. Incidentally, the digital IF also avoids any I/Q imbalance, at the cost of higher sampling MIMO-OFDM terminal Host-PC Matlab MEX-func. middleware driver PCI Digital signal processing AGC control digital IF at MHz Multi-antenna 2.4 GHz RF transceiver Figure 1: Hardware components of a single MIMO-OFDM terminal. 7 EURASIP 1146

Interface PHY Channel coding ChipScope OFDM modulation Synchronization Host PCI bus Tx buffer Rx buffer Demux Mux Scrambling Descrambling Convolutional encoding Viterbi decoding Puncturing Depuncturing Interleaving Deinterleaving Demapping Sync. Pilot tracking MIMO processing MMSE MIMO detection H mem H estim. Demux Symbol map. FFT processor Add preambles and cyclic prefix FIFO buffer Freq. offset est. and comp. Frame timing DUC DDC Digital AGC 4xDAC 4xADC Figure 2: Digital signal processing architecture overview. rate and resolution. Both the 14 bit digital to analog converters (DACs) and the 12 bit analog to digital converters (ADCs) run at a sampling rate of 8 MS/s. 2.2 Analog RF Frontend Each terminal is equipped with four single-input singleoutput (SISO) super-heterodyne RF chains with an analog IF of 475 MHz. The chains are built from discrete RF components and support MHz bandwidth channels with center frequencies in the 2.4 GHz ISM band. The received signal power at the ADCs is regulated by digitally controllable analog attenuators. These attenuators extend the dynamic range of the receive chains by 31 db. In conjunction with the 12 bit resolution of the ADCs, the useful dynamic range amounts to approximately 7 db. 2.3 Software Stack The testbed is controlled from a host-pc through multiple layers of software that allow communication with the hardware. The software stack includes a hardware driver for the PCI board, a transceiver-specific API that provides several functions (for the digital baseband configuration and the transmission or reception of OFDM frames), and a Matlab interface (MEX) function to call these API functions. Demo applications, configuration tools, and measurement sequences are programmed as Matlab scripts. 3. DIGITAL SIGNAL PROCESSING ARCHITECTURE The digital signal processing includes the MIMO-OFDM PHY layer and an interface to the host-pc. The architecture is shown in Fig. 2, where the PHY layer is partitioned into channel coding, OFDM modulation, synchronization, and MIMO processing. A standard bus interface (AMBA APB) enables run-time configuration and status monitoring of the individual functional units. Additionally, hardware cores generated by a Xilinx software tool (ChipScope) are inserted for in-circuit signal monitoring. 3.1 OFDM Modulation The OFDM modulation unit includes the mapping of binary data to complex-valued constellation points that determine how amplitude and phase of each OFDM subcarrier are modulated, the computation of the superposition of all modulated subcarriers with an IFFT, and the insertion of cyclic prefixes. The subcarriers are QAM-modulated according to different modulation orders (ranging from BPSK to 64-QAM), depending on the desired data rate and transmission robustness. A 64-point I/FFT processor with a single radix-4 processing element [2] de/modulates the OFDM symbols. The different spatial streams are processed in a time-interleaved fashion on the same hardware unit. Each frame starts with a preamble that is divided into two parts: the first part consists of special OFDM symbols for frame start detection, automatic gain control (AGC) setting, and frequency offset estimation. The second part includes an OFDM training symbol for each transmit antenna and is dedicated to channel estimation. 3.2 MIMO Processing The MIMO processing unit estimates the channel matrices and handles the separation of the spatially multiplexed data streams. The adopted block-type channel training sequence is designed to be frequency orthogonal, so that, given an OFDM subcarrier, a single transmit antenna is sounding this particular tone exclusively. Which of the antennas is active depends on the OFDM symbol index. With four such training symbols, the frequency-domain channel coefficients of all spatial subchannels can be estimated on a tone-by-tone basis. In order to exploit the correlation between frequency-domain channel coefficients, the initial estimate can optionally be refined with an FFT-processor-based algorithm [2]. The resulting channel matrices are stored in a dedicated memory. A linear minimum mean squared error (MMSE) detector spatially equalizes the received data streams on a tone-bytone basis. The same hardware unit [3] computes the MMSE equalizer matrices and applies them to the received vectorsymbols. The complex-valued equalizer matrices are stored in place of the channel matrices, thereby minimizing memory requirements. During the latency incurred by the computation of the MMSE estimators, the incoming data is buffered in a FIFO buffer (shown in the OFDM modulation unit in Fig. 2). The MMSE unit implements direct matrix inversion and has therefore stringent numerical requirements. Hence, a tone-by-tone conversion to block-floating point number representation of the channel matrices guarantees the numeric precision of the matrix-inversion unit being optimally exploited also when the transceiver is operating in frequencyselective wireless channels. The application of the computed estimators requires matrix-vector multiplications and the results are converted back to fixed-point representation. 7 EURASIP 1147

phase error DC SRO RFO tones Figure 3: Phase errors on tones of an equalized OFDM symbol due to sample rate offset (SRO) and residual frequency offset (RFO). 3.3 Frontend and Synchronization During transmission, the digital baseband signals are I/Q modulated, up-sampled, and filtered by digital up-conversion (DUC) to an IF of MHz before the resulting real-valued signals are converted to the analog domain. The same IF is adopted in the receiver, where digital down-conversion (DDC) performs the inverse operation. The AGC operates both in the analog and in the digital domain. The received signal power at the input of the ADCs is controlled over the analog attenuators in the RF chains. Additionally, in case of weak input levels, the signal is amplified digitally to optimally exploit the dynamic range of the digital receiver and to minimize the effect of quantization noise. Signal power estimation is based on preambles that are constructed to be frequency orthogonal, i.e., so that only one transmit antenna is sounding a given tone [4]. Frame-timing synchronization is based on the detection of a sudden power increase in the received signals [5]. The detection reliability is improved, in particular at lower signalto-noise ratios (SNRs), taking into account an additional metric that confirms the distinctive periodicity of the preamble. Frequency offset estimation is based on the autocorrelation algorithm presented in [6] extended for the MIMO case and operates on the first part of the frame preamble. In particular, a lower estimation variance is achieved, compared to the SISO case, by combining the correlations obtained on different antennas. Frequency offset compensation requires a digital oscillator, whose period depends on the estimated frequency offset. The uncompensated signals are combined with the rotating phasor by a single complexvalued and time-shared multiplier. The error incurred during the preamble-based estimation leads to residual frequency offset and causes a rotation of the spatially equalized constellation points. This phase deviation, which increases with the OFDM symbol index but is constant over the subcarriers, can be compensated after the MMSE equalizer by derotating all constellation points in an OFDM symbol according to the phase of the BPSK tones [1], which are known to the receiver. Another impairment to be compensated by means of the tones is sampling rate offset, generated by small differences in the DAC and ADC sampling frequencies at the transmitter and receiver. This causes an additional phase error on the received constellation points that increases linearly with the subcarrier index, as shown in Fig. 3. Two -tracking algorithms are evaluated through measurements. The first algorithm estimates the offset and the slope of the curve in Fig. 3 by observing the phase errors on the tones, computed by means of CORDIC units [7]. A different compensating phasor is computed for each subcarrier and Table 1: Required FPGA resources XC2V-6 FPGA Block Slice %Slice Mult Ram Synchronization 538 17.6 17 5 OFDM modulation 2879 1.1 12 44 MIMO processing 8847 31. 32 1 Channel decoding 982 31.8 4 5 Others 2725 9.5 7 Total 28571 1 65 134 XC2V1-4 FPGAs Block Slice %Slice Mult Ram DUC 98 19.5 DDC 997 19.8 6 AGC 9 18.7 4 Others 2113 42. 39 Total 53 1 3 39 a complex-valued multiplier, that is time shared among the spatial streams, applies the computed phase corrections. A second -tracking algorithm that compensates frequency offset only is examined for comparison: a single phasor is computed for all subcarriers by arithmetically averaging the equalized constellation points, weighted negatively if -1 was transmitted on the corresponding tone. This requires only a small number of additions and hence reduces circuit complexity significantly, compared to the CORDIC-based solution. The derotation is achieved by multiplying all subcarriers with the complex-conjugate version of this phasor. This approach reduces silicon complexity at the cost of tracking precision. 3.4 Channel Coding As forward error correction scheme, bit-interleaved coded modulation (BICM), based on the convolutional code, interleaving, and puncturing schemes defined in IEEE 82.11a [1], is employed. Per-antenna convolutional coding with subsequent cross-antenna interleaving is adopted in order to enable the real-time decoding on Virtex-II FPGAs, following the approach in [8]. The Viterbi decoder accepts 5 bit wide soft-information and has a traceback length of 54. Bit-scrambling removes long sequences of s or 1s in the transmitted payload data. The de/interleavers rely on built-in block RAMs for the bitwise interleaving. Low-complexity soft-information is extracted, based on the MMSE-equalized and -tracked constellation points, handling each spatially multiplexed data stream independently, according to [9]. For performance comparison, hardmetrics generated by slicing the equalized vector symbols can also be computed instead of soft-metrics. 3.5 FPGA Implementation Results The entire digital signal processing (as shown in Fig. 2) is running on a single XC2V-6 FPGA, with the exception of DUC, DDC, and digital AGC, which are mapped onto two separate XC2V1-4 FPGAs on the corresponding data 7 EURASIP 1148

Table 2: Impact on required FPGA resources for different optional receiver algorithms Algorithm Slice %Slice Mult Ram Refined channel est. 71 2.5 - - Soft-info extraction 625 2.2 4 2 Block-floating point 454 1.6 - - Improved track. 764 2.7 1 1 Total 2553 9. 5 3 Table 3: Considered 4 4 data rates R in [Mbit/s] Coding Subcarrier modulation rate BPSK QPSK 16-QAM 64-QAM 1/2 24 48 96 144 2/3 - - - 192 3/4 36 72 144 216 converter modules. The required FPGA resources for both designs are detailed in Tbl. 1, where Mult and Ram refer to the built-in real-valued 18 18-bit multipliers and 18 kbit SRAM blocks, respectively. The resources reported under Others refer to PCI interface, ChipScope cores, intermodule-communication interfaces, and other functional units that are not in strict relation with the PHY layer. The main system clock frequency is 8 MHz, but some parts of the design run at MHz and 1 MHz. The PHY layer needs a total of 64 RAMs, whereof 32 for the FIFO buffer required to bridge the latency incurred by MIMO preprocessing and channel estimation. This buffer was intentionally oversized (by at least a factor of two) to enable special debug modes that allow access to the received data after the FFT or after the tracking. Hence, a significant amount of memory could be saved, in a real-world system, by eliminating these debug modes. Channel estimation refinement, extraction of softinformation, block-floating point conversion of channel matrices, and improved -tracking are optional receiver algorithms. Their impact on system performance is reported in the next section while their contribution to the required FPGA resources in Tbl. 1 is given in Tbl. 2. The slice percentage in this second table refers to the total number of occupied slices on the XC2V-6 FPGA. 4. MEASUREMENTS Measurements have been carried out with two real-time terminals communicating over a wideband multipath channel emulator with RF interfaces. The measurement operations are coordinated over a wired TCP/IP network by the receiver terminal, which controls the channel emulator and triggers the transmitter to send data frames. Even though smaller antenna configurations are supported, the focus is on the 4 4 transmission modes listed in Tbl. 3. 4.1 Performance Metric The average throughput is adopted as performance metric and computed according to the following procedure. A channel realization labeled k is drawn according to the desired 1 1 1 1 8 TGn-A TGn-B TGn-C 2 tap, flat PDP Figure 4: Average throughput for different channel scenarios. statistics and uploaded to the emulator. For all the data rates R[m] in Tbl. 3, multiple OFDM frames carrying 5 bytes payload data each are transmitted over the selected channel realization. By checking the received data, an empirical frame error rate FER[m,k] is computed for each of the transmission modes. Assuming genie-aided adaptive modulation, the best throughput D[k] over the specific channel realization under consideration is given by D[k] = max m R[m]( 1 FER[m,k] ). (1) In order to compute FER[m, k], frames are transmitted per transmission mode and channel realization. Finally, the performance metric is obtained by averaging D[k] over 5 channel realizations. Performance charts are obtained by computing the average throughput for different receive powers. The MIMO- OFDM frames at the channel emulator input, generated by the transmitting terminal, have a constant power level. After applying the channel model, the average power at the RF inputs of the receiver is controlled by adjusting the output gain of the emulator. All measurements were taken with 1 khz frequency offset and 1.6 khz sample-rate offset, which corresponds to a ppm frequency precision. 4.2 Performance for Different Channels The measured average throughput under different blockfading channel scenarios is shown in Fig. 4. The TGn channel models A, B, and C [1] were generated assuming an antenna spacing of 2 wavelengths at the transmitter and at the receiver. For comparison a spatially uncorrelated Rayleighfading channel with flat power delay profile and 2 taps spaced 5 ns apart is also considered. From the chart in Fig. 4 it can be observed that the system suffers from increasing channel lengths, even though the impact of different channel types is not very significant. Even though the highest PHY layer data rate is 216 Mbit/s, the measured average throughputs saturate between 1 Mbit/s and 1 Mbit/s. This can be attributed to channel emulator and multi-antenna RF transceiver, which limit the achievable receiver SNR. Additionally, the wordwidth of the multipliers in the MMSE equalizer must also be increased for SNRs beyond 3 db [3]. 7 EURASIP 1149

1 1 1 1 8 Flat Rayleigh fading TGn-A, antenna spacing 1 TGn-A, antenna spacing 2 TGn-A, antenna spacing 1 TGn-A, antenna spacing.5 Figure 5: Impact of different antenna spacings (in wavelengths) on the average throughput achieved over TGn-A. 1 1 1 1 8 All receiver algorithms on Channel estimation refinement off Soft information extraction off Block-floating point conversion off Improved -tracking algorithm off Figure 6: Impact of different receiver algorithms on the average throughput achieved over a TGn-B channel. The topmost curve refers to the performance when all receiver algorithms are enabled. With each additional algorithm that is disabled, the performance degrades further. 4.3 Impact of Antenna Correlation When reducing the antenna spacing, the correlation between the received signals increases and MIMO equalization with a linear MMSE equalizer becomes less reliable. The impact of this correlation is shown in Fig. 5 for the particular case of a TGn-A channel model, where the antenna spacing is set to 1, 2, 1, and.5 wavelengths. For comparison a flat Rayleigh-fading channel, corresponding to the special case of TGn-A without any spatial correlation, is considered. As expected, the system suffers significantly when the antenna spacing is reduced below one wavelength. 4.4 Impact of Receiver Algorithms The impact of different receiver algorithms on the average throughput achieved over a TGn-B channel is shown in Fig. 6. The topmost curve corresponds to the performance when all receiver algorithms, including the ones listed in Tbl. 2, are enabled. For comparison, the performance was measured after disabling the four optional signal processing algorithms, one after the other. From Tbl. 2 and the curves in Fig. 6, it can be seen that a significant increase in performance is achieved with less than 1% hardware overhead. 5. CONCLUSION A real-time MIMO-OFDM physical layer transmitting at a peak data rate of 216 Mbit/s over MHz bandwidth was prototyped and characterized through measurements. Compared to a SISO system many functional units for synchronization, OFDM modulation, and channel coding units are replicated for each spatial stream. The equalization of spatial streams, instead, is specific to MIMO receivers and requires about 3% of the FPGA slices and 5% of the multipliers with linear MMSE detection. The measured average data rates are clearly beyond those achievable by single-antenna IEEE 82.11a WLAN systems, which transmit over the same bandwidth but are limited to a peak data rate of 54 Mbit/s. The system performance is affected by increasing channel lengths and by increasing antenna correlation. As expected, the MIMO gain is reduced when the antenna spacing is not sufficient. By investigating selected receiver algorithms, including parameter estimation for synchronization and channel estimation, the impact of more sophisticated signal processing has been shown to be significant. With thorough hardwareoriented algorithm design, the implementation of the considered algorithms increases the overall silicon complexity by less than 1%. REFERENCES [1] IEEE 82.11a Standard, iso/iec 882-11:1999/Amd 1:(E). [2] S. Haene, A. Burg, P. Luethi, N. Felber, and W. Fichtner, FFT processor for OFDM channel estimation, in Proc. IEEE ISCAS, 7. [3] A. Burg, S. Haene, D. Perels, P. Luethi, N. Felber, and W. Fichtner, Algorithm and VLSI architecture for linear MMSE detection in MIMO-OFDM systems, in Proc. IEEE ISCAS, 6. [4] D. Perels, A. Burg, S. Haene, N. Felber, and W. Fichtner, An automatic gain controller for MIMO-OFDM WLAN systems, in Proc. IEEE ICCSC 6, vol. 1, 6, pp. 55. [5] D. Perels, C. Studer, and W. Fichtner, Implementation of a low-complexity frame-start detection algorithm for MIMO systems, in Proc. IEEE ISCAS, 7. [6] T. M. Schmidl and D. C. Cox, Robust frequency and timing synchronization for OFDM, IEEE Trans. Commun., vol. 45, no. 12, pp. 1613 1621, Dec. 1997. [7] B. Parhami, Computer Arithmetic, Algorithms and Hardware Design. Oxford University Press,. [8] S. Haene, A. Burg, D. Perels, P. Luethi, N. Felber, and W. Fichtner, FPGA implementation of Viterbi decoders for MIMO-BICM, in Proc. 39th IEEE Asilomar Conf. Signals Syst. Comput., 5. [9] I. B. Collings, M. R. G. Butler, and M. McKay, Low complexity receiver design for MIMO bit-interleaved coded modulation, in IEEE Int. Symp. on Spread Spectrum Techniques and Applications, 4, pp. 12 16. [1] V. Erceg et al., TGn Channel Models, IEEE 82.11 document 3/9r4. 7 EURASIP 115