10 1 1 129
10 1 1 129
10 1 1 129
B.S. (University of California, Berkeley) 1994 M.S. (University of California, Berkeley) 1996
A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Engineering-Electrical Engineering and Computer Sciences in the GRADUATE DIVISION of the UNIVERSITY OF CALIFORNIA, BERKELEY
Committee in charge: Professor Robert W. Brodersen, Chair Professor Paul R. Gray Professor Paul K. Wright
Spring 2001
Abstract A Design Methodology for Highly-Integrated Low-Power Receivers for Wireless Communications by Dennis Gee-Wai Yee Doctor of Philosophy in Engineering-Electrical Engineering and Computer Sciences University of California, Berkeley Professor Robert W. Brodersen, Chair
Due to its potential to offer ubiquitous information access, wireless connectivity is playing an increasingly significant role in communications systems. The success of future wireless systems will depend heavily on their ability to provide high capacity while maintaining low cost, small form factor, and low power consumption in the portable devices. However, many existing commercial transceivers are expensive, consist of a large number of discrete components, and exhibit moderate to high levels of power consumption. One possible explanation for these inefficient solutions lies in the historically unilateral relationship between system designers and hardware designers. An efficient solution requires a design strategy which tightly incorporates implementation issues throughout the process of defining the system specifications. This thesis describes a design methodology which facilitates the evaluation of tradeoffs between implementation issues and overall system performance, focusing primarily on the receiver as an example. First, system-level specifications, such as modulation scheme and signal bandwidth, strongly influence the choice of receiver architecture, which in turn, has ramifications on the achievable power consumption and integration level. When system-level specifications are determined without considering their impact on receiver architecture selection, single-chip solutions may be very difficult to achieve or just simply infeasible. Based on system-level considerations, guidelines are presented for the
selection of receiver architectures, including the heterodyne, direct-conversion, imagereject, and low-IF topologies. Second, the rapid improvements in digital CMOS technology provide an opportunity to use advanced digital signal processing algorithms which in the past were considered too complex to implement in the mobile device. These algorithms promise significant increases in system performance but their performance may ultimately be limited by analog circuit impairments, such as noise and distortion. This thesis describes the detrimental effects of a number of these impairments and presents a system-level simulation framework which facilitates the direct evaluation of these effects on the performance of digital communications algorithms. The simulation framework is implemented in Simulink, which offers compatibility with MATLAB, a simulation tool already widely used for the development and evaluation of communications algorithms. This simulation framework relies on baseband-equivalent models for all of the RF building blocks in order to avoid simulation at the carrier frequency, resulting in faster simulation times. These strategies are then applied to the design of a high-speed wireless downlink for an indoor picocellular system. The system provides an aggregate data rate of 50 Mb/s with a transmission bandwidth of 32.5 MHz and a carrier frequency of 2 GHz. The wide bandwidth of the desired signal facilitates the use of a direct-conversion architecture. A receiver prototype is implemented to meet the specifications determined from the systemlevel simulations. A power-efficient solution is achieved by taking advantage of the relaxed specifications as well as by using low-power circuit implementation techniques. This receiver prototype includes the low-noise amplifier, frequency synthesizer, mixers, baseband amplifiers and filters, and analog-to-digital converters, all implemented on a single chip with a power dissipation of about 100 mW.
Table of Contents
List of Figures List of Tables Acknowledgments Chapter 1 Introduction 1.1 1.2 1.3 Chapter 2 Motivation Research Goals Thesis Organization vii xiii xv 1 1 2 3 5 5 7 7 8 9 10 12 13 16 17 17 19 20 21 22 23 25 25 26 27 30 32 35 37 39 40 45
Receiver Architectures 2.1 2.2 Introduction Heterodyne Architecture 2.2.1 The Image Problem 2.2.2 Implementation Direct-Conversion Architecture 2.3.1 DC Offsets 2.3.2 Flicker Noise Image-Reject Architecture 2.4.1 Practical Considerations Low-IF Architecture 2.5.1 Digital Frequency Translation to Baseband Receiver Architecture Selection 2.6.1 Direct-Conversion Architecture 2.6.2 Image-Reject Architecture 2.6.3 Low-IF Architecture 2.6.4 Receiver Architecture Selection Guidelines
2.3
Chapter 3
Receiver Impairments 3.1 3.2 3.3 3.4 3.5 3.6 3.7 3.8 3.9 3.10 Introduction Quadrature Phase-Shift Keying Modulation Receiver Noise Gain Mismatch Quadrature Phase Mismatch Frequency Offset LO Phase Noise Receiver Distortion Filtering DC Offsets
iii
System-Level Simulation Framework 4.1 4.2 Introduction Receiver Performance Calculations 4.2.1 Noise Calculations 4.2.2 Distortion Calculations 4.2.3 Summary Baseband-Equivalent Models 4.3.1 RF Amplifiers 4.3.2 Local Oscillators 4.3.3 Mixers 4.3.4 Summary Simulation Framework Implementation 4.4.1 Thermal Noise 4.4.2 Flicker Noise 4.4.3 RF Amplifiers 4.4.4 Mixers 4.4.5 Local Oscillators 4.4.6 Baseband Amplifiers and Filters 4.4.7 Analog-to-Digital Converters 4.4.8 Receiver Performance Metrics Summary
4.3
4.4
4.5 Chapter 5
A High-Speed Wireless Downlink 5.1 5.2 Introduction Base-Station Transmitter 5.2.1 Multiple Access Method and Power Control 5.2.2 Pulse Shaping 5.2.3 Pilot Channel/Symbol 5.2.4 Mobility Support: Picocells 5.2.5 Analog Front-End Mobile Receiver 5.3.1 Propagation Models 5.3.2 Receiver Sensitivity 5.3.3 Receiver Processing Gain 5.3.4 Receiver Architecture 5.3.5 Flicker-Noise Suppression 5.3.6 DC-Offset Compensation 5.3.7 Receiver Noise Figure 5.3.8 ADC Performance 5.3.9 Receiver Gain 5.3.10 Receiver Distortion 5.3.11 Receiver AGC Loop
5.3
iv
5.4
5.5 Chapter 6
5.3.12 Multiuser Detection 5.3.13 Summary System Simulation 5.4.1 Base-Station Transmitter 5.4.2 Channel Model 5.4.3 Mobile Receiver 5.4.4 Simulation Outputs Summary
121 127 129 129 130 131 131 133 135 135 136 137 140 141 147 151 167 178 180 182 183 183 185 185 187 187 192 197 197 198 200 200 205 205 205 207 208 209 216 217 217 219
Receiver Prototype 6.1 6.2 Introduction Low-Noise Amplifier 6.2.1 Microwave Filter Design 6.2.2 LNA Performance Metrics 6.2.3 Transistor Noise Model 6.2.4 Matching for Minimum Noise Figure 6.2.5 LNA Topologies 6.2.6 Inductively-Degenerated Differential LNA Frequency Synthesizer 6.3.1 Voltage-Controlled Oscillator 6.3.2 Other Design Considerations Mixer 6.4.1 Passive Mixers 6.4.2 Active Mixers 6.4.3 Mixer Implementation Baseband Amplification and Filtering 6.5.1 Low-Pass Filtering Implementation of Baseband Amplifiers and Filters Analog-to-Digital Converter 6.7.1 Pipeline Architecture 6.7.2 Timing Recovery Considerations 6.7.3 Sigma-Delta Analog-to-Digital Converter Receiver Test Chips
6.3
6.4
6.8 Chapter 7
Simulated Performance and Measurement Results 7.1 Simulated Performance 7.1.1 LNA/Mixer/Baseband Simulations 7.1.2 LNA/Mixer/PLL/Baseband Simulation 7.1.3 LNA/Mixer/PLL/Baseband/ADC Simulation Measurement Results Measurement Issues 7.3.1 Yield and Reliability 7.3.2 Packaging Technology
7.2 7.3
Chapter 8
Conclusion
8.1 8.2
Research Summary Future Work 8.2.1 Bottom-Up Verification 8.2.2 Improved Behavioral Models 8.2.3 Single-Chip Integration
219 220 221 221 222 225 225 227 231 231 231 237 237 238 241 242 245 247 249 251 253 253 255 255 256 259
Appendix B DC-Offset Cancellation B.1 B.2 Introduction Alternative DC-Offset Cancellation Techniques
Appendix C Why 50 ? C.1 C.2 C.3 C.4 Introduction Impedance Matching for Maximum Power Transfer Impedance Matching for Minimum Noise Figure Impedance Matching for Maximum Voltage Transfer C.4.1 Common-Source LNA C.4.2 Inductively-Degenerated Common-Source LNA Antenna Circuit Model Summary
C.5 C.6
Appendix D Inductor Test Structures D.1 D.2 D.3 D.4 Introduction ASITIC Simulation Results Test Chip Layout Measurement Results
References
vi
List of Figures
Figure 2.1: Figure 2.2: Figure 2.3: Figure 2.4: Figure 2.5: Figure 2.6: Figure 2.7: Figure 2.8: Figure 2.9: Figure 2.10: Figure 2.11: Figure 2.12: Figure 2.13: Figure 2.14: Figure 2.15: Figure 2.16: Figure 3.1: Figure 3.2: Figure 3.3: Figure 3.4: Figure 3.5: Figure 3.6: Figure 3.7: Figure 3.8: Figure 3.9: Figure 3.10: Figure 3.11: Figure 3.12: Figure 3.13: Figure 3.14: Figure 3.15: Figure 3.16: Figure 3.17: Figure 3.18: Figure 3.19: Figure 3.20: Figure 3.21: Figure 3.22: RF front-end for an ideal RF input signal. RF input signal with a weak desired signal and strong adjacent interferers. Heterodyne architecture block diagram. The image problem. Typical ceramic filter characteristic. Heterodyne architecture implementation. Direct-conversion architecture block diagram. LO self-mixing. Input offset voltage for a differential CMOS amplifier. Noise power spectral density for MOS transistors. Frequency translation to fIF using a complex sinusoidal LO signal. Complex mixing. (a) Concept. (b) Implementation. Image-reject (Weaver) architecture block diagram. Low-IF architecture block diagram. Digital frequency translation block diagram. LO signals for fs = 4fIF. QPSK constellation diagram. Probability of error for QPSK. Effect of thermal noise on a QPSK constellation. Effect of flicker noise on a QPSK constellation. I and Q gain mismatch in a direct-conversion receiver. Effect of gain mismatch on a QPSK constellation. Probability of error for QPSK with gain mismatch. Quadrature phase mismatch in a direct-conversion receiver. Effect of quadrature phase mismatch on a QPSK constellation. Probability of error for QPSK with quadrature phase mismatch. Frequency translation using I and Q LO signals with frequency offset. Effect of frequency offset on a QPSK constellation. Frequency spectra of LO signals. (a) Ideal. (b) With phase noise. Reciprocal mixing. Effect of reciprocal mixing on a QPSK constellation. Frequency translation using I and Q LO signals with phase noise. Effect of close-in phase noise on a QPSK constellation. Harmonic distortion. Intermodulation distortion. Third-order intermodulation distortion. Effect of intermodulation distortion on a QPSK constellation. Butterworth filter magnitude responses with = 1. 5 6 7 8 8 9 10 11 12 13 14 14 15 18 18 19 26 28 29 30 30 31 32 32 33 35 35 36 37 37 38 38 39 39 40 40 41 42
vii
Figure 3.23: Figure 3.24: Figure 3.25: Figure 3.26: Figure 3.27: Figure 3.28: Figure 3.29: Figure 4.1: Figure 4.2: Figure 4.3: Figure 4.4: Figure 4.5: Figure 4.6: Figure 4.7: Figure 4.8: Figure 4.9: Figure 4.10: Figure 4.11: Figure 4.12: Figure 4.13: Figure 4.14: Figure 4.15: Figure 4.16: Figure 4.17: Figure 4.18: Figure 4.19: Figure 4.20: Figure 4.21: Figure 4.22: Figure 4.23: Figure 4.24: Figure 5.1: Figure 5.2: Figure 5.3: Figure 5.4: Figure 5.5: Figure 5.6: Figure 5.7: Figure 5.8: Figure 5.9: Figure 5.10: Figure 5.11:
Chebyshev filter magnitude responses with = 1. Phase responses for third-order Butterworth and Chebyshev filters. Butterworth filter. (a) Effect on a QPSK constellation. (b) Eye diagram. Chebyshev filter. (a) Effect on a QPSK constellation. (b) Eye diagram. Effect of dc offsets on a QPSK constellation. Probability of error for QPSK with dc offsets. Effect of ADC resolution on an adaptive MUD algorithm [22]. Circuit model of antenna with receiver input. Frequency translation of RF signal directly to baseband. Frequency translation of RF signal to an intermediate frequency. 1-dB compression point. Third-order intermodulation intercept point. Example spectrum of s(t) in (4.29). Magnitude response of H (e j ) in (4.57). Filter initialization code corresponding to (4.59). Simulated flicker noise power spectral density. Partitioning of broadband white noise. Example spectrum of phase noise generated by simple behavioral model. PLL block diagram. Linear model of PLL. Linear model of PLL with noise sources. Simulink implementation of complex phase noise model. Simple ADC behavioral model. First-order converter. Switched-capacitor integrator. Model of ideal integrator. Model of integrator with finite OTA gain. Model of integrator with noise. Switched-capacitor integrator transient response. Model of integrator with a nonlinear transfer function. Structural model of first-order converter. Block diagram of base-station transmitter (digital baseband section). Frequency response of raised-cosine pulse ( 1 / T = 25 MHz , = 0.3 ). Cellular reuse pattern with N = 4 . Block diagram of base-station transmitter (analog front-end). Comparison of FDMA and CDMA systems. Chopper stabilization. Effect of high-pass filtering. (a) 100 kHz. (b) 500 kHz. (a) K = 15 spreading code. (b) After 500-kHz high-pass filter. Signal and noise power spectral densities. Simulated transmit spectrum. DC offset removal. (a) Before ADC. (b) After mixer. viii
Figure 5.12: Figure 5.13: Figure 5.14: Figure 5.15: Figure 5.16: Figure 5.17: Figure 5.18: Figure 5.19: Figure 5.20: Figure 5.21: Figure 5.22: Figure 5.23: Figure 5.24: Figure 5.25: Figure 5.26: Figure 5.27: Figure 5.28: Figure 5.29: Figure 5.30: Figure 5.31: Figure 6.1: Figure 6.2: Figure 6.3: Figure 6.4: Figure 6.5: Figure 6.6: Figure 6.7: Figure 6.8: Figure 6.9: Figure 6.10: Figure 6.11: Figure 6.12: Figure 6.13:
Figure 6.16:
(a) ADC before data recovery. (b) ADC after data recovery. AGC architectures. (a) Feedforward. (b) Feedback. Amplitude estimates with frequency offset . Equivalent combiner for feedback AGC loop based on (5.49). Equivalent combiner based on I 2 + Q 2 . Equivalent combiner for feedback AGC loop based on (5.50). Equivalent combiner for feedback AGC loop based on (5.51). AGC loop based on sign-data LMS algorithm using (5.51). Performance of AGC loop depicted in Fig. 5.19. AGC loop based on the update equation in (5.101). Performance of AGC loop depicted in Fig. 5.21. Single-user detector. Decorrelating detector. Decorrelating detector for a single data channel. MMSE detector. Adaptive MMSE detector for a single data channel. Proposed direct-conversion receiver. Top-level schematic of system downlink simulation. Receiver front-end schematic. Constellation diagrams from system-level simulation. Block diagram of receiver prototype. Antenna, RF filter, and LNA. Conjugate impedance matching for maximum power transfer. Circuit for second-order Butterworth low-pass filter. Transformation of a low-pass response to a band-pass response. MOS small-signal equivalent circuit with noise generators. Distributed gate capacitance and channel resistance at high frequencies. Revised MOS small-signal equivalent circuit with noise generators. (a) Common-source amplifier. (b) Small-signal equivalent circuit. Common-source LNA with tuned input. (a) Common-gate LNA. (b) Small-signal equivalent circuit. (a) Common-source LNA with inductive degeneration. (b) Smallsignal equivalent circuit. NFmin versus ID based on short-channel and long-channel equations for the inductively-degenerated LNA topology. ( = 0.8, = 4, = 2, c = j0.395, Gs = 1/(50 ), = 2 2109 rad/s, L = Leff = 0.18 m (Ldrawn = 0.25 m), eff = 400 cm2/Vs, Esat = 5104 V/cm.) (a) Local shunt feedback LNA. (b) Small-signal equivalent circuit. NFmin versus ID for the LNA topology with local shunt feedback. ( = 0.8, = 4, = 2, c = j0.395, Gs = 1/(50 ), = 2 2109 rad/s, L = Leff = 0.18 m (Ldrawn = 0.25 m), eff = 400 cm2/Vs, Esat = 5104 V/cm, Rl = 350 .) (a) Drain current versus noise figure. (b) Gain versus noise figure. ( = 0.8, = 4, = 2, c = j0.395, Gs = 1/(50 ), = 2 2109 rad/s,
104 110 112 113 115 116 117 119 120 121 122 123 124 125 126 126 127 129 131 133 136 136 137 138 140 142 145 147 148 151 152 155
158 159
162
ix
Figure 6.17:
Figure 6.18: Figure 6.19: Figure 6.20: Figure 6.21: Figure 6.22: Figure 6.23: Figure 6.24: Figure 6.25: Figure 6.26: Figure 6.27: Figure 6.28: Figure 6.29: Figure 6.30: Figure 6.31: Figure 6.32: Figure 6.33: Figure 6.34: Figure 6.35: Figure 6.36: Figure 6.37: Figure 6.38: Figure 6.39: Figure 6.40: Figure 6.41: Figure 6.42: Figure 6.43: Figure 6.44: Figure 6.45: Figure 6.46: Figure 6.47: Figure 6.48: Figure 6.49: Figure 6.50: Figure 6.51:
L = Leff = 0.18 m (Ldrawn = 0.25 m), eff = 400 cm2/Vs, Esat = 5104 V/cm, Rl = 350 .) Designing for minimum noise figure in the common-source LNA with inductive degeneration. (a) Cgs. (b) gm. (c) Lg. (d) Ls. ( = 0.8, = 4, = 2, c = j0.395, Gs = 1/(50 ), = 2 2109 rad/s, L = Leff = 0.18 m (Ldrawn = 0.25 m), eff = 400 cm2/Vs, Esat = 5104 V/cm, Rl = 350 .) Inductively-degenerated differential LNA. (a) Differential LNA with source conductance G s' and tail current
' I D . (b) Equivalent half-circuit. Drain current versus noise figure of the inductively-degenerated differential LNA for several values of source conductance. Layout of LNA transistors M1 M4. Small-signal equivalent circuit of the inductively-degenerated LNA including gate-drain overlap capacitance. Summary of on-chip spiral inductors. Interface between LNA and input pads. LNA layout. Bias circuit for the LNA input. (a) Circuit model for on-chip spiral inductors. (b) Component values at 2 GHz. Simulated LNA gain. Simulated LNA S11. Block diagram of phase-locked loop frequency synthesizer. Example of a differential LC-tuned VCO. Example of a ring-oscillator VCO. Four-stage ring-oscillator VCO. Inverter implementations. (a) Static CMOS. (b) SCL. (c) DCVSL. CMOS double-balanced passive mixer. CMOS double-balanced current-commutating mixer. Mixer topology used in the receiver prototype. Sallen and Key filter block diagram. Quality factor of a complex pole pair. Tow-Thomas biquad. Integrators. (a) RC. (b) MOSFET-C. (c) Transconductance-C. Switched-capacitor integrator. Block diagram of baseband amplification and filtering. Noninverting amplifier schematic. Capacitance of poly/n-well structure versus bias voltage. High-pass filter schematic. Buffer schematic. Sallen and Key filter circuit schematic. Equivalent half-circuit of Sallen and Key filter Layout of baseband amplifiers and filters. Pipeline ADC architecture.
165
167 168 169 169 170 172 173 174 175 176 176 177 177 179 180 180 181 182 184 184 186 188 188 189 189 190 191 192 193 193 193 194 195 196 198
Figure 6.52: Figure 6.53: Figure 6.54: Figure 6.55: Figure 6.56: Figure 7.1: Figure 7.2: Figure 7.3: Figure 7.4: Figure 7.5: Figure 7.6: Figure 7.7: Figure 7.8: Figure 7.9: Figure 7.10: Figure 7.11: Figure 7.12: Figure 7.13: Figure 7.14: Figure 7.15: Figure B.1: Figure B.2: Figure B.3: Figure B.4: Figure B.5: Figure C.1: Figure C.2: Figure C.3: Figure C.4: Figure C.5: Figure C.6: Figure C.7: Figure D.1: Figure D.2: Figure D.3: Figure D.4:
-assisted timing recovery scheme. 2-1-1 cascade architecture. Switched-capacitor integrator. Test chip (SCRRX) micrograph. Test chip (SCRBARF) micrograph. Simulated frequency response at the output of the Sallen and Key filter over process corners. Simulated frequency responses at the outputs of various receiver components (typical process corner). Baseband transmit spectrum from Simulink. Spectrum of receiver output signal from transient envelop simulations. (a) 43-dBm input power. (b) 33-dBm input power. Simulated transient response. (a) Measured receiver S11. (b) Zoomed view. Measured frequency response at the output of each of the I and Q Sallen and Key filters. Measured I and Q gain mismatch. Measured receiver noise performance. Measured receiver distortion performance. Measured receiver power consumption. Measured phase noise performance of the frequency synthesizer. modulator operating at 200 MHz. (a) Measured dynamic range. (b) Measured output spectrum. Test setup for receiver measurement with a modulated RF input signal. Measured constellation diagrams using a modulated RF input signal. TDMA time slots. DC-offset cancellation. (a) Capacitive storage. (b) Feedback DAC. Equivalent combiner for a feedback dc-offset correction loop. Implementation of adaptive dc-offset correction loop. Simulation of dc-offset cancellation loop. Transmission matrix representation of the RF filter. Rs and Rl are fixed but the network connecting them is allowed to vary. Rs and Rl are connected through a transformer. Common-source LNA. Singly-terminated RF filter. Inductively-degenerated common-source LNA. Antenna circuit model. Definition of geometric parameters. (a) Inductors 1 6. (b) Inductor 7. Layout of inductor test structures. Pad structure for inductors. Patterned polysilicon ground shield.
199 201 201 202 203 206 207 208 208 209 210 211 211 212 213 213 214 214 216 216 232 232 233 234 235 240 243 244 245 247 248 250 254 256 256 257
xi
List of Tables
Table 2.1: Table 2.2: Table 4.1: Table 5.1: Table 5.2: Table 6.1: Table 6.2: Table 6.3: Table 6.4: Table 6.5: Table 6.6: Table 7.1: Table 7.2: Table 7.3: Table 7.4: Table 7.5: Table A.1: Table A.2: Table A.3: Table A.4: Table A.5: Table D.1: Table D.2: Table D.3: Table D.4: Table D.5: Table D.6: Comparison of receiver hardware requirements. Receiver architecture summary and guidelines. Effect of noise sampling time on simulation accuracy. Summary of receiver specifications. Receiver specifications for system-level simulation. Summary of LNA topologies ( = 0.8, = 4, = 2, c = j0.395). Summary of LNA noise performance from simulation. Summary of LNA simulation. Comparison of LC-tuned and ring-oscillator VCOs. Poles for a third-order Butterworth low-pass frequency response. Capacitor values and bias currents for ADC. Summary of receiver frequency response. Summary of simulated receiver noise performance (output noise integrated over 100 MHz). Summary of simulated receiver distortion performance. Summary of simulated receiver noise performance including PLL for typical process corner (output noise integrated over 100 MHz). Summary of receiver performance measurements. Baseband-equivalent model for RF amplifiers (I). Baseband-equivalent model for RF amplifiers (II). Baseband-equivalent model for mixers (direct-conversion and lowIF). Baseband-equivalent model for IF mixers (I). Baseband-equivalent model for IF mixers (II). Geometry of inductor test structures. Summary of inductor simulation results from ASITIC. Summary of measured results from the inductor test chip. Measured dc resistance of inductor test structures. Summary of inductor simulation results from ASITIC version 3.19.00. Summary of inductor simulation results from Momentum. 20 23 67 128 132 163 178 179 181 194 201 206 207 207 209 215 226 227 228 229 230 254 255 257 258 258 258
xiii
Acknowledgments
The work described in this thesis could not have been accomplished without the help and support of others. Foremost, I would like to thank my research advisor Professor Robert Brodersen for his guidance during the past seven years. I also want to thank you, along with Professor Jan Rabaey, for providing such a wonderful research environment at the BWRC for all of us. I would like to thank Professor Paul Gray and Professor Paul Wright for reviewing this thesis and for providing helpful comments. I would also like to thank both of you along with Professor Robert Meyer for providing advice and guidance during various stages of this project. Finally, I am very grateful to all of you along with all of the other faculty members at UC Berkeley for providing me with such a high-quality education at both the undergraduate and graduate levels. I am greatly indebted to the members of the Baseband Analog and RF (BARF) group: Sayf Alalusi, Sam Blackman, Chinh Doan, Brian Limketkai, Ian ODonnell, David Sobel, and Johan Vanderhaegen. Chinh, Dave, and Brian made significant contributions to the receiver hardware design, while Chinh and Dave, along with Johan, also made significant contributions to the implementation of the Simulink simulation framework. Thanks Chinh for doing a great job on the PLL and for helping out on all different aspects of the project. Thanks Dave for doing a great job on the ADC and the SCRBARF test board and for expanding my vocabulary with words like absquatulate, callipygian, and nugatory. Thanks Brian for doing a great job on the mixer and for giving me a ride when I really needed one. Thanks Johan for doing a great job on modeling the basestation transmitter in Simulink and for all the delicious Belgian chocolates. Thanks Sayf for designing the SCRRX test board. Thanks Ian for being a great project partner in EE225C, EE241, and EE290Y. And thanks Sam for all the great stock tips. You BARF dudes are the best!
xv
Craig Teuscher has helped me to better understand all aspects of digital communications. There are a lot of excellent communications engineers out there, but you are one of the few who are experts in both communications engineering as well as the art of communicating. I am indeed lucky to have worked with someone who has such an innate ability to clearly explain even the most difficult concepts. I would like to thank Tom Burd, Chris Rudell, Jeff Ou, Sekhar Narayanaswami, Jeff Weldon, Andy Abo, Keith Onodera, Jeff Gilbert, and Ali Niknejad for all their help and advice over the many years of graduate school. Thank you all for providing me with invaluable feedback for papers and practice talks and for just being a bunch of cool dudes. Thanks Tom and Jeff G. for always providing a non-RF perspective which improved the papers and talks tremendously. Also, thanks Jeff G. for making our whirlwind trip through Italy such a memorable experience. Thanks Chris and Keith for helping me with EE140 when I was still an undergrad. I would also like to thank Chris and Jeff O. for all the edifying discussions on various aspects of RF design. Thanks Ali for ASITIC and for all the helpful discussions about inductors. Thanks Sek for being a great project partner in EE231 and for all the lighthearted discussions about Cal basketball and football. Go Bears! I am also grateful to the original members of the RF group: Lapoe Lynn, Ian ODonnell, Jim Peroulas, Sam Sheng, and Kevin Stone. In particular, I am greatly indebted to Sam Sheng both for being the best TA I have ever had and for mentoring me during my first couple of years in graduate school. I have also had the pleasure of working with many other past and present graduate students including but not limited to Ada Poon, Andy Klein, Henry Jen, Chris Taylor, George Chien, Marlene Wan, Varghese George, Ning Zhang, Arthur Abnous, Li Lin, Martin Tsai, Rhett Davis, Josie Ammer, Heather Bowers, Roy Sutton, Trevor Pering, David Lidsky, Srenik Mehta, Shankar Narayanaswamy, and Arnie Feldman. Thanks Ada, Andy, and Ning for all the helpful discussions on various topics in communications. Also, thanks Ada for always being so nice and cheery, and always remember to respect your grandpa!
xvi
The students at the BWRC are very fortunate to have the opportunity to interact with many industrial and academic visitors. I have had the pleasure of working with Ken Kundert, Bob Frye, Andrei Vladimirescu, Marco Sabitini, Isabelle Telliez, and Bhusan Gupta. Thanks Ken for all the help with SpectreRF. And thanks Marco, Isabelle, and Bhusan for all the help with getting our chips taped out. The students at the BWRC are also very fortunate to have support from such friendly and efficient administrative and technical staff members, including Tom Boot, Elise Mills, Kahren Kim, Gary Kelson, Kevin Zimmerman, Brian Richards, Fred Burghardt, and Sue Mellers. I want to thank all of you as well as Peggye Brown in Cory Hall for help in purchasing parts for test boards, arranging itineraries for travel to conferences and workshops, and for just keeping the BWRC running so smoothly. I am grateful to Professor Rinaldo Castello at the University of Pavia and Professor Andrea Baschirotto at the University of Lecce along with all of your students for being such gracious hosts during my trip to Italy. Danilo Gerna, who was my partner for the EECS foosball tournament when he was a visiting scholar at Berkeley, was also a very gracious host during my stay in Pavia. I would also like to thank everyone who has helped to make the stress of graduate school much more tolerable. Working out at the RSF has helped to relieve the stress associated with the trials and tribulations of graduate school. My past and present workout buddies include: Jeff Hanz Gilbert, Sayf Alalusi, Tom Burd, Ian ODonnell, Chinh Doan, Chris Taylor, Arnie Feldman, and Dave Lidsky. I have also enjoyed mountain biking and more recently, road biking with Brian Limketkai, Sayf Alalusi, Ian ODonnell, Tom Burd, Chris Rudell, Jeff Weldon, and Danilo Gerna. Together we have enjoyed many of the awesome bike routes in the Berkeley area. I also want to thank Gabe Moy for answering all of my questions about bikes. Another pastime which I picked up during graduate school is golf. Those who have witnessed by dreadful swing include Keith Onodera, Andy Abo, Tom Burd, George
xvii
Chien, Chinh Doan, Henry Jen, Andy Klein, and Jeff Weldon. It can be difficult for a bunch of poor graduate students to support an expensive pastime like golf; so I am very grateful for Tildens College Night, when a round of golf is only $10 for Cal students. Also, thanks Jeff for helping me with my swing and for being so patient. Finally, I would like to thank my family and friends for their support and encouragement. I have many fond memories of my high school and college years with friends like Burton Lee, Milton Yee, Calvin Low, Ivan Choi, Norman Chan, Wayne Yeung, Nick Mitchell, Garvin Leon, Albert Yee, John Balk, Moses Kim, Ben Shieh, Paul Chou, and Jim Young. I am also very fortunate to have great friends like Albert Lee, Richard Ng, and Bryant Woo. Thanks for always being there for me, through the good times and the bad. I would also like to thank my Aunt and Uncle and all of my cousins along with their wonderful families: Randy and Kelly; Phillis, Randy, Kelsey, Mitchell, and Kristen; Andy, Ginger, Ryan, and Nicole; and Debbie, Richard, and Adam. Thanks Andy for all the help and advice during my first couple of years at Cal. Yvette, thank you for your encouragement, your patience, and your understanding. And above all, thank you for making the past eleven years so rich and so fulfilling. Davis, I am truly fortunate to have a big brother who always watches over me and so selflessly shares everything with me. Thank you for giving me the benefit of all of your experiences and for teaching me the meaning of friendship, loyalty, courage, and perseverance. Mom and Dad, both of you have sacrificed so much for Davis and me. Your unwavering support has been a continual source of strength for both of us. Thank you for instilling in us the importance of a good education and for encouraging the both of us to pursue our dreams. The accomplishments which I have achieved belong as much to the both of you as they do to me. I dedicate this work to the both of you with all my love.
xviii
Chapter 1
Introduction
1.1
Motivation
Due to its potential to offer ubiquitous information access, wireless connectivity is continuing to play an increasingly significant role in communications systems. The proliferation of wireless technologies is already evident in the success of modern paging and cellular telephony applications. Although wireless connectivity is inherent to the functionality of these devices, wireless connectivity is still absent from many portable devices such as laptops and personal digital assistants (PDAs). Future communications systems will offer new wireless services for devices such as laptops and PDAs as well as expand on the existing wireless capabilities of devices such as cellular telephones and pagers. These applications include Internet access, video teleconferencing, high-fidelity audio, and other high-speed services. Wireless connectivity is not limited to only portable devices, but can also be used for applications which currently rely on tethered connections, including local area networks (LANs) and local loop applications such as Integrated Services Digital Network (ISDN) and Digital Subscriber Line (DSL), both of which rely on copper twisted pair, as well as
cable applications, which rely on a combination of fiber optic and coaxial cables. Bluetooth is one example of a wireless standard which is targeted at applications which currently rely on wires [1]. Bluetooth aims to use wireless connectivity to replace cables such as the those connecting peripheral devices to a computer. Emerging wireless solutions for LAN applications include the IEEE 802.11a and 802.11b standards at 5 GHz and 2.4 GHz, respectively, in the United States [2] as well as the ETSI HIPERLAN standards in Europe [3]. Several wireless solutions have also been proposed for local loop applications including the Local Multipoint Distribution Service (LMDS) operating at 28 GHz and the Multichannel Multipoint Distribution Service (MMDS) operating at 2.5 GHz [4]. For systems designed to provide wireless connectivity to mobile devices, the success of these systems will depend heavily on their ability to provide high capacity while maintaining low cost, small form factor, and low power consumption in the portable units. However, many existing commercial transceivers are expensive, consist of a large number of discrete components, and exhibit moderate to high levels of power consumption. One possible explanation for these inefficient solutions lies in the historically unilateral relationship between system designers and hardware designers: first system designers develop standards concentrating mainly on communications issues, and then hardware designers must implement a solution to meet these standards. An efficient solution requires cooperation between both system and hardware designers as well as a design strategy which tightly incorporates implementation issues throughout the process of defining the system specifications.
1.2
Research Goals
The goal of this research is to establish a design framework to evaluate tradeoffs between implementation issues and system performance. This framework will focus on one of the key units in an indoor wireless system: the receiver in the mobile device. In order to achieve a single-chip solution with low power consumption, three design strategies are proposed. First, at the system level, implementation issues must be considered even during the earliest stages of system definition. Clearly this approach is not possible for
systems which have already been defined. However, selecting system features which allow for relaxed hardware requirements is critical for achieving single-chip, low-power receiver implementations in future wireless systems. In addition, the allocation of unlicensed spectra in the Industrial, Scientific, and Medical (ISM) bands at 900 MHz and 2.4 GHz and the Unlicensed National Information Infrastructure (U-NII) band at 5 GHz provides opportunities for development of custom wireless systems. Second, efficient implementations require careful evaluation of the effects of analog receiver impairments on the performance of digital communications algorithms. The rapid improvements in digital CMOS technology facilitate the integration of increasingly more functionality onto a single chip. In particular, advanced signal processing algorithms are very amenable to low-power digital design techniques and promise increased capacity along with higher data rates [5]. However, the performance of these algorithms may ultimately be limited by analog circuit impairments, such as noise, distortion, and mismatch. By accounting for analog impairments during the earliest stages of algorithm exploration, it may be possible to relax some of the analog hardware requirements without necessarily sacrificing overall system performance. Third, low-power circuit implementation techniques are required to minimize the power consumption of the analog circuits. Despite efforts to simplify the analog hardware, the analog section of the receiver can still dominate the overall receiver power consumption.
1.3
Thesis Organization
Chapter 2 provides an overview of various receiver architectures. The choice of receiver architecture affects both the power consumption and the level of integration. The chapter describes the key features of the heterodyne, direct-conversion, image-reject, and low-IF architectures and presents some design guidelines for receiver architecture selection. Chapter 3 describes the effects of receiver front-end impairments, such as noise, distortion, and mismatch, which can potentially degrade the performance of digital communications systems. A QPSK signal constellation is used to demonstrate the detrimental effects of many of these impairments.
Chapter 4 first provides an overview of the conventional approach of using link budget calculations to determine the allowable levels of receiver impairments and then describes a system-level simulation framework that includes models for the analog impairments described in Chapter 3. This simulation framework is implemented in Simulink and facilitates the direct evaluation of the effects of analog impairments on the performance of digital communications algorithms. Chapter 5 describes the design of a high-speed wireless downlink for an indoor picocellular system. System specifications are chosen in order to facilitate the use of a direct-conversion architecture as well as to relax many of the performance requirements of the analog hardware without significantly degrading the overall system performance. Chapter 6 describes the implementation of the receiver prototype including the low-noise amplifier, frequency synthesizer, mixers, baseband amplifiers and filters, and analog-todigital converters, focusing primarily on design choices which result in the most powerefficient implementation. All of these components are integrated onto a single-chip, and a power-efficient solution is achieved by taking advantage of these relaxed requirements along with low-power circuit implementation techniques. Chapter 7 presents the simulated and measured performance results of the receiver prototype, and Chapter 8 concludes with a summary as well as suggestions for future research.
Chapter 2
Receiver Architectures
2.1
Introduction
One of the key components of portable devices used in wireless communications systems is the receiver, which senses an incoming signal and extracts the desired information. Since the Federal Communications Commission (FCC) regulates the frequencies at which signals can be transmitted, the incoming signal is typically centered at a frequency which is much larger than the bandwidth of the desired signal. For the ideal case illustrated in Fig. 2.1, the radio-frequency (RF) front-end of the receiver simply translates the incoming signal from a carrier frequency, fc, down to baseband. Unfortunately, in a real wireless transmission environment, the received signal is almost
ideal RF input spectrum desired BB spectrum
RF front-end fc fc DC
interferers desired
fc
Figure 2.2: RF input signal with a weak desired signal and strong adjacent interferers. always far from ideal. The signal which reaches the receiver can be very weak because of attenuation by objects which obstruct the transmission path between the transmitter and receiver or simply because of the loss due to spatial separation between the transmitter and receiver. In addition, the received signal can include unwanted signals along with the desired one. These unwanted signals, or interferers, can be significantly stronger than the desired signal as illustrated in Fig. 2.2. Due to the limited amount of attenuation achievable by practical filter designs as well as the noise and distortion introduced by circuits used to implement the RF front-end, the design of a highly-integrated, low-power receiver becomes increasingly challenging when the received signal consists of a very weak desired signal in the presence of strong adjacent interferers. Two metrics which are used to evaluate receiver performance are sensitivity and selectivity. A receiver with high sensitivity can correctly process a very weak desired signal whereas a receiver with high selectivity can correctly process a desired signal in the presence of very strong interferers at adjacent frequencies. The required sensitivity and selectivity of a receiver are highly dependent on the specifications of the underlying communications system. In order to meet the sensitivity and selectivity requirements of a particular system while facilitating a highly-integrated, low-power implementation, the architecture used for the receiver must be carefully considered. This chapter provides an overview of various receiver architectures, starting with the heterodyne architecture, which, unfortunately, is not very amenable to high levels of integration, and followed by an overview of a few other receiver architectures which are more conducive to single-chip implementations.
RF Input (fc)
ADC
0 90
LO2 (fIF)
ADC
2.2
Heterodyne Architecture
The heterodyne architecture (also called the superheterodyne architecture) is probably the most commonly used architecture in current commercial receiver implementations. In this architecture the received signal is converted to baseband in multiple frequency translation steps. A block diagram of the heterodyne architecture with two frequency translation steps is illustrated in Fig. 2.3. In this architecture, the signal received at the antenna first passes through an RF filter before being amplified by a low-noise amplifier (LNA). The signal is then filtered by an image-reject (IR) filter before being frequency translated to an intermediate frequency (IF) by the first local oscillator (LO). At the intermediate frequency the signal is further filtered and amplified before being frequency translated to baseband along parallel in-phase (I) and quadrature (Q) signal paths by the second LO. At baseband, the signal is further amplified and filtered before being converted to a digital signal by the analog-to-digital converter (ADC).
2.2.1
The selection of the intermediate frequency in this architecture is directly related to the image problem. In Fig. 2.4, the desired signal centered at the carrier frequency fc is frequency translated to the intermediate frequency fIF by an LO located at the frequency fc fIF. However, the signal centered at the image frequency fc 2fIF is also frequency translated to fIF. Since the image signal can be much stronger than the desired signal, the image signal must be sufficiently attenuated before frequency translation. The choice of fIF depends on the characteristics of practical filter implementations. For a typical ceramic filter [6] (Fig. 2.5), the amount of attenuation increases at frequencies
LO image desired
fc2fIF
fc
fIF
Figure 2.4: The image problem. farther away from the center frequency f0. Consequently, in order to achieve a large amount of image signal attenuation, it is preferable to select a high intermediate frequency so that the image signal is far away from the center frequency of the filter. However, a high intermediate frequency also increases the design challenges in the IF filtering and amplification circuits. Consequently, the choice of fIF must be based on the following tradeoffs: a high intermediate frequency results in maximum image signal attenuation from the IR filter, while a low intermediate frequency results in relaxed IF filtering and amplification requirements.
2.2.2
Implementation
20 40 60 80 f0500
Attenuation (dB)
Si BJT or Si CMOS RF Input (fc) GaAs or Si BJT SAW LNA RF Filter ceramic IR Filter ceramic LO1 (fcfIF) GaAs or Si BJT with off-chip resonator ADC Q IF Filter IF Amp 0 90 LO2 (fIF) Si BJT with off-chip resonator ADC I
Figure 2.6: Heterodyne architecture implementation. The heterodyne architecture is commonly used in current commercial receiver implementations because of its excellent sensitivity and selectivity performance. This excellent performance is achieved by using the best technologies to implement the various components. For example, the RF and IF filters are typically implemented using ceramic filter technology while the IR filter is typically implemented using surface acoustic wave (SAW) technology. The remaining components are implemented using an assortment of gallium arsenide, silicon bipolar, and silicon CMOS technologies. As illustrated in Fig. 2.6, a typical implementation consists of a large number of components, implemented in multiple technologies. Since small form factor and low power consumption are two critical design goals in the design of portable units, the heterodyne architecture is inadequate and other receiver architectures which are more amenable to highly-integrated, low-power implementations must be considered for future wireless communications systems. These architectures include: 1. the direct-conversion architecture; 2. the image-reject architecture; and 3. the low-IF architecture.
2.3
Direct-Conversion Architecture
Rather than frequency translating the received signal to an intermediate frequency, the direct-conversion receiver architecture downconverts the received signal directly to
RF Input (fc)
ADC
LNA RF Filter
0 90
LO (fc)
ADC
Figure 2.7: Direct-conversion architecture block diagram. baseband, and consequently, image rejection is no longer necessary. A block diagram of the direct-conversion architecture is illustrated in Fig. 2.7. The RF signal appearing at the antenna is filtered and amplified before being downconverted to baseband along parallel I and Q signal paths. The frequency translation is performed using two mixers and an LO fixed at the carrier frequency and operating in quadrature. The I and Q baseband signals are then amplified and low-pass filtered prior to analog-to-digital conversion. Because the RF signal is converted directly to baseband, this architecture eliminates all intermediatefrequency components and their associated design challenges, including the image-reject problem. Moreover, all of the remaining analog components can be integrated onto a single chip using a single technology such as silicon CMOS, with the exception of the antenna and the RF filter [7][9]. However, two practical considerations have limited the use of the direct-conversion architecture: dc offsets and flicker noise.
2.3.1
DC Offsets
Implementations based on the direct-conversion architecture are particularly sensitive to dc offsets since the desired signal is downconverted directly to baseband. DC offsets are problematic for two reasons. First, dc offsets can saturate the baseband circuits, such as amplifiers and filters. Second, even if the baseband circuits do not saturate, dc offsets, if uncorrected, degrade the bit-error rate (BER) performance of the system. There are three primary sources of dc offsets: LO self-mixing, even-order distortion, and baseband circuit mismatch. As illustrated in Fig. 2.8, the LO signal can couple to the RF signal path, for example, through the input of the LNA or through the RF port of the
10
RF Input (fc)
LO (fc)
DC tone
LNA RF Filter
Figure 2.8: LO self-mixing. mixer. The LO signal then mixes with itself, creating a dc offset. LO self-mixing can be represented as the product of two sinusoids at the same frequency fc,
S1 cos(2 f c t ) S 2 cos(2 f c t ) = S1 S 2 {1 + cos[2 (2 f c )t ]} 2
(2.1)
which results in a dc component as well as a sinusoidal component at 2fc. A second source of dc offsets is even-order harmonic distortion. The transfer function of the analog front-end can be represented by the following equation:
s o = a1 s i + a 2 s i2 + a 3 s i3 + K
(2.2)
where the first term represents the gain and the remaining terms represent the nonlinear behavior. In the case of second-order harmonic distortion, for an input sinusoid si = S1 cos(2 f 1t ), the resulting output signal is so = a 2 S12 {1 + cos[2 (2 f 1 )t ]} 2 (2.3)
which again consists of a dc component and a sinusoidal component at 2f1. Although this example considers only second-order distortion, in fact, all even-order distortion terms result in a dc offset component. In addition, as seen in (2.3), the amount of dc offset resulting from even-order distortion depends on the input signal amplitude S1, which varies over time. Consequently, the cancellation of dc offsets can be challenging because of its time-varying nature. In addition to dc offsets, even-order distortion can also result in other low-frequency components which can be detrimental to the performance of direct-conversion receivers.
11
RL VOD = 0
RL
VOS IS
Figure 2.9: Input offset voltage for a differential CMOS amplifier. In the case of second-order intermodulation distortion, for an input signal si = S1 cos(2 f 1t ) + S 2 cos(2 f 2 t ), the resulting output signal is s o = a 2 S1 S 2 {cos[2 ( f 1 f 2 )t ] + cos[2 ( f 1 + f 2 )t ]} (2.4)
which consists of sinusoidal components at frequencies f1 f2 and f1 + f2. If the frequency separation between the two input sinusoids is small, then second-order intermodulation distortion results in a low-frequency component, f1 f2, which can potentially corrupt the desired baseband signal. Since even-order intermodulation distortion depends on the input signal amplitudes, S1 and S2, the amplitude of the resulting low-frequency components are also time-varying. Finally, a third source of dc offsets is baseband circuit mismatch. If differential circuits are used to implement the baseband amplifiers and filters, device mismatches give rise to dc offsets. For the differential CMOS amplifier illustrated in Fig. 2.9, the input offset voltage is [10]
VOS = Vt + (VGS Vt ) RL 2 R L (W / L) (W / L) .
(2.5)
The dc offset in this case is related to the mismatch in the transistor threshold voltage, the mismatch in the transistor geometry, and the mismatch in the load resistors. Moreover, dc offsets arising from circuit mismatches are also dependent on temperature variations.
2.3.2
Flicker Noise
12
Flicker noise can also degrade the performance of direct-conversion receivers. The power spectral density of the input-referred voltage noise for a MOS transistor consists of a flicker noise component which is inversely proportional to frequency and a thermal noise component:
(2.6)
The noise power spectral density is plotted as a function of frequency in Fig. 2.10. For high frequencies, the thermal noise component dominates, but for low frequencies, the flicker noise component is stronger. In fact, since the power spectral density of flicker noise is inversely proportional to frequency, the flicker noise component can be quite large near dc. Since the desired signal is frequency translated directly to baseband, flicker noise can be particularly problematic for direct-conversion architectures.
2.4
Image-Reject Architecture
In the heterodyne architecture, the image problem arises from the use of a real sinusoidal LO signal to frequency translate the input signal to an intermediate frequency. More specifically, the Fourier transform of a real sinusoidal LO signal, cos(2 f LO1t ), consists of a negative frequency component at fLO1 and a positive frequency component at +fLO1:
1 cos(2 f LO1t ) [ ( f f LO1 ) + ( f + f LO1 )] . 2
(2.7)
During the frequency translation process, the negative frequency component of the LO signal downconverts the positive frequency component of the desired signal to fIF, while the positive frequency component of the LO signal downconverts the negative frequency
vi
13
LO
image desired
fc-2fIF
fc
fIF
Iout
in out
in
ej2fLO1t
0 90
LO (fLO1)
Qout
(a)
(b)
Figure 2.12: Complex mixing. (a) Concept. (b) Implementation. component of the image signal also to fIF (Fig. 2.4). The preceding description of the mechanism behind the image problem suggests a potential solution: use a complex sinusoidal LO signal to downconvert the input signal to the intermediate frequency. The Fourier transform of a complex sinusoidal LO signal, e j 2 f LO1t , consists of only a negative frequency component at fLO1:
e j 2 f LO1t ( f + f LO1 ) .
(2.8)
When the input signal is multiplied by the LO signal, only the positive frequency components of the input signal are translated to the intermediate frequency as illustrated in Fig. 2.11. Thus, the image problem is avoided. The complex mixing function described above can be implemented using the imagereject mixer illustrated in Fig. 2.12. The image-reject mixer consists of a pair of real mixers driven by an LO operating in quadrature. This complex mixing function serves as
14
ADC
Figure 2.13: Image-reject (Weaver) architecture block diagram. the basis for the image-reject receiver architecture, also called the Weaver architecture [11], illustrated in Fig. 2.13. Suppose that the signal appearing at the antenna is
s (t ) = s desired (t ) + simage (t )
(2.9)
where sdesired(t) is the desired signal and simage(t) is the undesired image signal. The desired signal can be expressed as s desired (t ) = I (t ) cos(2 f c t ) + Q(t ) sin(2 f c t ) (2.10)
where I(t) and Q(t) are the desired baseband I and Q signals, and the image signal can be expressed as
simage (t ) = I image (t ) cos[2 ( f c 2 f IF )t ] + Qimage (t ) sin[ 2 ( f c 2 f IF )t ] .
(2.11)
The received signal s(t) is filtered and amplified before being downconverted to the intermediate frequency along two signal paths using two mixers driven by I and Q LO signals at fc fIF. At the intermediate frequency, each of the two signals is low-pass filtered in order to remove the mixing components at 2fc fIF. At points A and B, the signals are, respectively,
1 s A (t ) = {[ I (t ) + I image (t )] cos(2 f IF t ) + [Q(t ) Qimage (t )] sin(2 f IF t )} 2 1 s B (t ) = {[Q(t ) + Qimage (t )] cos(2 f IF t ) + [ I (t ) + I image (t )] sin(2 f IF t )} . 2
(2.12) (2.13)
15
Each of the signals, sA(t) and sB(t), are then downconverted to baseband along two signal paths using two mixers driven by I and Q LO signals at fIF. The signals at points C, D, E, and F are, respectively,
1 sC (t ) = [ I (t ) + I image (t )] 4 1 s D (t ) = [Q(t ) Qimage (t )] 4 1 s E (t ) = [Q(t ) + Qimage (t )] 4 1 s F (t ) = [ I (t ) + I image (t )] . 4
Subtracting (2.17) from (2.14) gives the desired baseband signal I(t) while eliminating the unwanted image signal Iimage(t). Similarly, the sum of (2.15) and (2.16) gives the desired baseband signal Q(t) while eliminating the unwanted image signal Qimage(t).
2.4.1
Practical Considerations
Unfortunately, in practice, the amount of image rejection achievable by implementations based on this architecture is limited by the gain mismatch between the different signal paths of the receiver as well as by the quadrature phase mismatch between the I and Q signals in the two local oscillators. The image rejection is given by [11] 1 + (1 + A) 2 + 2(1 + A) cos( LO1 + LO 2 ) R(dB) = 10 log 2 1 + (1 + A) 2(1 + A) cos( LO1 LO 2 ) (2.18)
where A is the gain error, and LO1 and LO2 are the phase errors in the first and second local oscillators, respectively. For A = 3%, LO1 = 2, and LO2 = 0, the Weaver architecture achieves an image rejection of 32.8 dB. Integrated circuit implementations typically achieve 25 40 dB of image rejection. If additional image rejection is required, a high intermediate frequency can be used so that the image signal is far away from the center frequency of the RF filter, which then provides additional attenuation of the image signal.
16
Finally, the Weaver architecture is also susceptible to dc offsets and flicker noise. Selfmixing due to the second LO can occur during downconversion of the received signal from the intermediate frequency to baseband. And similar to the direct-conversion architecture, even-order distortion and baseband circuit mismatch can also result in dc offsets in the Weaver architecture. In addition, since the desired signal is frequency translated to baseband prior to analog-to-digital conversion, flicker noise from the baseband analog amplifiers and filters can potentially corrupt the desired signal.
2.5
Low-IF Architecture
One way to avoid the problems associated with dc offsets and flicker noise is to perform analog-to-digital conversion at the intermediate frequency. By using digital circuit techniques to downconvert the desired signal from the intermediate frequency to baseband as well as to perform the subsequent amplification and filtering, impairments associated with analog implementation techniques can be avoided. However, due to conversion speed limitations in analog-to-digital converters, this approach is limited to low intermediate frequencies, and consequently, a third receiver architecture which is based on this technique is called the low-IF architecture [12] (Fig. 2.14). Since the received signal is downconverted to an intermediate frequency, the low-IF architecture must also contend with the image problem. Consequently, the same imagereject mixing technique used in the Weaver architecture must also be used in the low-IF architecture. As with the Weaver architecture, the amount of image rejection achievable by integrated circuit implementations based on the low-IF architecture is limited by gain and phase mismatches. Moreover, the RF filter can not be used to provide additional image rejection since the use of a low intermediate frequency places the image signal close to the desired signal and within the passband of the RF filter. Consequently the lowIF architecture is limited to applications which have relaxed image-rejection requirements at small frequency offsets from the desired signal.
2.5.1
17
RF Input (fc)
ADC
LNA RF Filter
0 90
LO (fcfIF)
ADC
Figure 2.14: Low-IF architecture block diagram. In the low-IF architecture, additional digital circuitry is required to perform the downconversion from the intermediate frequency to baseband. In the example illustrated in Fig. 2.15, the in-phase and quadrature LO signals are generated digitally, for example, using a read-only memory (ROM) look-up table, and digital frequency translation is performed using four multipliers. The multiplier outputs are then combined using two adders to provide cancellation of the image signal. Finally, one particular relationship between the intermediate frequency and the ADC sampling frequency results in tremendous simplifications in the hardware required for digital frequency translation. Suppose the ADC sampling frequency, fs, is exactly four
Multiplier Adder from I ADC Multiplier I
Multiplier from Q ADC Adder Q Multiplier Sin Generator (fIF) Cos Generator (fIF) -1
18
+1 cos[ n] 2
+1
2 1 +1
n 1
+1
sin[
n] 2
3 1
Figure 2.16: LO signals for fs = 4fIF. times the intermediate frequency, fIF: f s = 4 f IF . Then the in-phase and quadrature LO signals are, respectively,
cos(2 f IF t ) = cos(2 sin(2 f IF t ) = sin(2
(2.19)
fs t ) cos[ n] 4 2 fs t ) sin[ n] . 4 2
(2.20) (2.21)
As illustrated in Fig. 2.16, a cosine wave of frequency fIF sampled at 4fIF yields the sequence { +1, 0, 1, 0, } while a sine wave of frequency fIF sampled at 4fIF yields the sequence { 0, +1, 0, 1, }. Consequently, frequency translation to baseband can be accomplished by simply deinterleaving each of the ADC outputs into two data streams and then toggling the sign of every other data sample. By choosing fs = 4fIF, the digital hardware becomes trivial, eliminating the need for multipliers as well as circuits to generate the digital in-phase and quadrature LO signals.
2.6
The direct-conversion architecture, the image-reject architecture, and the low-IF architecture are all potential candidates for highly-integrated receiver implementations. The hardware requirements for the three receiver architectures are summarized in Table 2.1. The number of LNAs, mixers, LOs, and ADCs are listed for each architecture.
19
The operating frequency for each LO as well as the minimum sampling rate, fs, for each ADC are also indicated in the table.
2.6.1
Direct-Conversion Architecture
A major advantage of the direct-conversion architecture is its simplicity. Although, implementations based on the direct-conversion architecture require a minimal number of RF circuit components, they must also contend with dc offsets and flicker noise. The direct-conversion architecture has been used for paging applications which use frequency-shift keying (FSK) as the modulation scheme [13]. In this case, a high-pass filter can be used to eliminate dc offsets since FSK signals have little frequency content near dc. In [13], high-pass filters, which are implemented using 330-pF on-chip capacitors, provide dc blocking up to 150 Hz. Although high-pass filtering is a simple and effective way of eliminating dc offsets, this technique may not be feasible if prohibitively large capacitors or resistors are required. In particular, for signals which occupy a narrow bandwidth, high-pass filters with very low corner frequencies are required, which in turn require very large on-chip capacitors and resistors. In [13], even though the 330 pF on-chip capacitors are implemented using high-density dielectric capacitors (1 nF/mm2), they still occupy over half of the 18-mm2 chip area. Moreover, although techniques such as autozeroing and chopper stabilization (Section 5.3.5) are effective in suppressing DC offsets and flicker noise, these techniques also introduce additional complexity in implementing the baseband circuits. Consequently, the directconversion architecture is best suited for applications which use a wideband signaling direct-conversion LNAs mixers LOs ADCs other 1 2 1 @ fc 2 @ fs > 2fsig dc offsets flicker noise image-reject 1 6 1 @ fc fIF 1 @ fIF 2 @ fs > 2fsig dc offsets flicker noise low-IF 1 2 1 @ fc fIF 2 @ fs > 2( fsig + fIF) digital frequency translation
20
scheme. In this case, a very low cutoff frequency is not required and dc offsets can be removed without requiring prohibitively large capacitors or resistors.
2.6.2
Image-Reject Architecture
A major drawback of the image-reject Weaver architecture is that it requires a large number of RF circuit blocks, including six mixers and two local oscillators. In addition, implementations based on the Weaver architecture must also deal with dc offsets and flicker noise. Nevertheless, this architecture still has its advantages. One major advantage of the Weaver architecture is that it facilitates integration of the frequency synthesizer [11]. In a frequency-division multiple access (FDMA) system, the receiver must be able to perform channel selection by tuning the LO to different frequencies. For example, in the heterodyne architecture, channel selection is performed by the first LO, which frequency translates the desired signal to the fixed center frequency of the IF filter (Fig. 2.3). For a narrowband system, this LO must be able to tune to closely-spaced frequency channels, and consequently, implementations based on a phase-locked loop (PLL) require a low reference frequency. However, in order to ensure loop stability, the PLL bandwidth is limited to one tenth of the reference frequency. Such a narrow bandwidth provides very little attenuation of the phase noise in the voltage-controlled oscillator (VCO). Consequently, in a heterodyne architecture, the VCO is usually implemented using discrete-component high-quality inductors and varactor diodes in order to achieve very low phase noise. In a completely integrated approach, however, a very low phase noise VCO is difficult to implement due to the lack of high-quality on-chip passive components, especially when using a standard digital CMOS process. In [11], a receiver for the Digital European Cordless Telephone (DECT) system is implemented based on the image-reject Weaver architecture. Since this architecture does not rely on fixed-frequency IF filters for channel selection, the first LO is fixed at 1.7 GHz and the second LO, with a tuning range of 181 197 MHz, is used to downconvert the desired signal to baseband. Since the first LO is fixed at 1.7 GHz, it can be implemented using a PLL with a wide loop bandwidth, which significantly reduces the phase noise contributed by an integrated VCO. In
21
addition, since the second LO operates at a much lower frequency, it can be implemented with good phase noise performance even with low-quality on-chip passive components. Consequently, for communications systems with very stringent phase noise requirements, the image-reject Weaver architecture with a fixed-frequency first LO is highly amenable to single-chip receiver implementations.
2.6.3
Low-IF Architecture
In the low-IF architecture, if the ADC sampling frequency is four times the intermediate frequency, then the additional digital circuitry needed to downconvert the desired signal to baseband becomes trivial. In this case, the low-IF architecture may appear to be a better alternative to the direct-conversion architecture since it also requires a minimal number of RF circuit components but avoids the problems associated with dc offsets and flicker noise. However, implementations based on the low-IF architecture may not be feasible under certain conditions. First, if the bandwidth of the desired signal is very large, then a very high ADC sampling rate is required. In order to avoid aliasing, the sampling frequency of the ADCs must be at least twice the highest frequency component of the input signal. For the low-IF architecture, the minimum ADC sampling frequency is
f s > 2( f IF + f sig )
(2.22)
where fIF is the intermediate frequency and fsig is the single-sided bandwidth of the desired baseband signal. Since fIF must be greater than fsig in order to avoid problems associated with dc offsets and flicker noise, a wideband signaling scheme would require a prohibitively fast ADC sampling frequency. Consequently, the low-IF architecture is best suited for applications which use a narrowband signaling scheme. The low-IF architecture also requires relaxed image-reject requirements at small frequency offsets from the desired signal. This requirement, however, is not prohibitively restrictive since many communications systems provide for relaxed interferer levels in nearby frequency channels. For example, in [14], a GSM (Global System Mobile) receiver is implemented based on the low-IF architecture. GSM is a European digital cellular system which uses a narrowband signaling scheme with a single-sided baseband
22
bandwidth of 100 kHz. The receiver described in [14] uses an intermediate frequency of 200 kHz and requires only 32-dB image rejection. In contrast, the DECT receiver described in [11], which is based on the Weaver architecture, uses an intermediate frequency in the range 181 197 MHz and requires more than 70-dB image rejection.
2.6.4
As seen from the examples presented above, system-level specifications, such as modulation scheme, signal bandwidth, and interference rejection requirements, strongly influence the choice of receiver architecture. In some cases, these system-level specifications are determined without considering implementation issues, and
advantages heterodyne
excellent sensitivity and selectivity performance minimal number of RF components
disadvantages
large number of discrete components dc offsets and flicker noise
design guidelines
use this architecture when all else fails for wideband signaling schemes, dc offsets can be removed with a highpass filter using onchip capacitors and resistors fixed-frequency first LO facilitates the use of a wideband PLL for VCO phase noise suppression tunable low-frequency second LO for channel selection narrowband signaling schemes relax ADC sampling requirement requires relaxed image-rejection requirements at small frequency offsets from the desired signal if fs = 4fIF, the digital downconversion circuitry becomes trivial
direct-conversion
image-reject
large number of RF components dc offsets and flicker noise image-rejection is limited by gain and phase mismatches ADC sampling frequency must be at least fIF + fsig image-rejection is limited by gain and phase mismatches
low-IF
minimal number of RF components avoids problems associated with dc offsets and flicker noise
23
consequently, single-chip implementations are very difficult to achieve or just simply infeasible. With the allocation of unlicensed spectra in the 900-MHz, 2.4-GHz, and 5-GHz frequency bands, many new wireless communications systems will be designed for custom applications. These custom applications present an opportunity to design systems which are more amenable to highly-integrated low-power CMOS receiver implementations. The advantages and disadvantages of the heterodyne architecture, direct-conversion architecture, the image-reject architecture, and the low-IF architecture are summarized in Table 2.2. In addition, a few general design guidelines are also provided.
24
Chapter 3
Receiver Impairments
3.1
Introduction
Wireless communications systems are continuing to take advantage of the exponential improvements in mainstream CMOS technology by integrating increasingly more functionality onto a single chip. Advanced communications algorithms, which in the past were considered too complex to implement due to the stringent power consumption and form-factor restrictions of mobile devices, are now being considered for future wireless systems. These algorithms are very amenable to low-power digital design techniques and promise orders of magnitude improvement in spectral efficiency, resulting in increased capacity and higher data rates. However, one of the most critical components of any wireless system which may ultimately limit the performance of these communications algorithms is the receiver front-end. Analog front-end impairments, such as noise, distortion and mismatch, may affect algorithm performance by degrading the integrity of the desired signal. This chapter describes the various impairments associated with analog front-ends as well as their consequences on a quadrature phase-shift keying (QPSK) signal constellation [15].
25
3.2
In digital communications systems, the binary data must be mapped to a set of corresponding signal waveforms for transmission. In a phase-shift keying (PSK) signaling scheme, the data is modulated on the phase of the carrier and the corresponding signal waveforms are represented as [16]
2 s m (t ) = g (t ) cos 2 f c t + (m 1) , m = 1, 2 ,K , M, 0 t T M 2 2 = g (t ) cos (m 1) cos(2 f c t ) g (t ) sin (m 1) sin(2 f c t ) M M
(3.1)
where g(t) is a signal pulse which shapes the spectrum of the transmitted signal, fc is the carrier frequency, M is the number of symbols, and T is the symbol period. As seen in (3.1) the signal waveforms can also be represented as the linear combination of two quadrature carriers, cos(2 f c t ) and sin(2 f c t ) . The amplitude cos[2 (m 1) / M ] is typically referred to as the in-phase (I) data while the amplitude sin[2 (m 1) / M ] is typically referred to as the quadrature (Q) data. The mapping of the data bits to the phase of the carrier can be represented in a constellation diagram. The constellation diagram for a four-phase PSK ( M = 4 ) signaling scheme with an initial phase value of /4 is illustrated in Fig. 3.1. Four-phase PSK or quadrature phase-shift keying (QPSK) is a very popular modulation scheme used in wireless communications.
10
00
11
01
26
When a QPSK signal is corrupted by additive white Gaussian noise (AWGN), the probability of error is [16]
d2 1 Pb = 1 1 Q 2N 0 2 d2 = Q 2N 0 d2 Q 2N 0
2
1 2 d 2 Q 2 2N 0
(3.2)
where d is the distance between adjacent symbols, N0 is the noise power spectral density, and Q(x) is defined as
Q(x) = 1 2
t e x
2
/2
dt, x 0 (3.3)
x 1 = erfc . 2 2 In the case of ideal QPSK, the distance between adjacent signals is just d = 2 Eb where Eb is the energy per bit. Consequently, (3.2) becomes
2 Eb Pb = Q N 0
(3.4)
(3.5)
where Eb/N0 is the SNR. The bit-error probability is plotted as a function of SNR in Fig. 3.2. From the figure, an SNR of 8.4 dB is required to achieve a BER of 104.
3.3
Receiver Noise
Two types of electrical noise which are particularly important in CMOS implementations are thermal noise and flicker noise. Thermal noise arises from the random thermal motion
27
10
10
10
10
Pb
10
4
10
10
10
10
12
Eb/N0 (dB)
v 2 = 4kTRf
(3.6)
where k is Boltzmanns constant, T is temperature and f is bandwidth in hertz. Since the voltage spectral density is constant over frequency, thermal noise is white. In addition, its amplitude has a Gaussian probability distribution. At the receiver, thermal noise is present in the antenna as well as in the other passive and active devices which are used to implement the analog front-end. The thermal noise in the antenna is associated with the antennas radiation resistance and originates from the black-body radiation in the transmission environment, while the thermal noise in active devices is associated with the resistive channel of the underlying CMOS transistors. Transistor thermal noise can be represented by a noise generator between the device drain and source terminals with mean-square current
2 id = 4kT g do f
(3.7)
where gdo is the drain-source conductance when the drain-source voltage, VDS, is 0 V and
28
1.5
0.5
0.5
1.5 1.5
Figure 3.3: Effect of thermal noise on a QPSK constellation. blocks such as mixers as well as baseband components such as amplifiers and filters. The effect of thermal noise on a QPSK constellation is illustrated in Fig. 3.3 and the probability of error is given by (3.5). Another type of noise which may adversely affect the performance of digital communications systems is flicker noise. Flicker noise occurs in CMOS transistors and can be modeled by a noise generator between the device drain and source terminals with mean-square current
2 id = K a ID f f
(3.8)
where K is a constant for a particular device, ID is the drain bias current and a is constant for a particular technology. The flicker noise current spectral density is inversely proportional to frequency and its amplitude is often not Gaussian [10]. Flicker noise is especially problematic in receivers which frequency translate the desired signal to dc prior to analog-to-digital conversion, since the flicker noise spectral density can be quite large at low frequencies. The problem is exacerbated in CMOS implementations since flicker noise performance is significantly worse for CMOS transistors than for devices in other technologies such as silicon bipolar. The effect of flicker noise on a QPSK constellation is illustrated in Fig. 3.4. Although the effect on the constellation is similar to
29
1.5
0.5
0.5
1.5 1.5
Figure 3.4: Effect of flicker noise on a QPSK constellation. that of thermal noise, unfortunately, the probability of error in this case is difficult to determine since flicker noise is not AWGN.
3.4
Gain Mismatch
Another analog impairment which degrades performance in digital communications systems is gain mismatch along the different receiver signal paths. For example, in the direct-conversion architecture illustrated in Fig. 2.7, the received signal is downconverted to baseband along parallel I and Q signal paths. The gain along these two signal paths should be identical, but in practical implementations, may be different due to circuit mismatches. Gain mismatch in a direct-conversion receiver can be modeled by the block diagram illustrated in Fig. 3.5, where A is the average gain and is the difference in gain
cos(2fct) A+/2 I I(t)cos(2fct)+Q(t)sin(2fct) Q A/2 sin(2fct)
30
along the I and Q signal paths. Thus, with gain mismatch, the output of the I signal path is given by
I (t ) A 1 + 2 2A
(3.9)
(3.10)
The effect of gain mismatch on a QPSK constellation is illustrated in Fig. 3.6. When a QPSK signal with gain mismatch is corrupted by AWGN, the probability of error can be approximated by averaging the error probabilities for two binary antipodal signals separated by distances d 0 (1 + 2 A) and d 0 (1 2 A) :
Pb 1 d 02 Q 1 + 2 2 A 2 N 0 1 2 Eb Q 1 + 2 2 A N 0 d 02 + Q 1 2 A 2 N 0 2 Eb + Q 1 2 A N 0 .
(3.11)
The error probabilities as a function of SNR for various amounts of gain mismatch are plotted in Fig. 3.7. As seen in Fig. 3.7, the BER degradation is minimal for a gain
1
0.5
d0(1+/2A)
d0(1/2A)
0.5
31
10 0 10 1 10 2 10 3
Pb
10 4 10 5 10 6 10 7 0 2 4 6 8 10 12
ideal 5% gain mismatch 10% gain mismatch 50% gain mismatch
Eb/N0 (dB)
Figure 3.7: Probability of error for QPSK with gain mismatch. mismatch of 5% or less, which is indeed achievable in highly-integrated receiver implementations. Consequently, the direct-conversion architecture is relatively
insensitive to gain mismatch. The image-reject and low-IF architectures, however, are much more sensitive to gain mismatch. In these two architectures, gain mismatch limits the amount of image rejection, as already discussed in Section 2.4.1.
3.5
Quadrature phase mismatch in the LO signals also degrades the performance of digital communications systems. For example, in the direct-conversion architecture, frequency
cos(2fct+/2)
I I(t)cos(2fct)+Q(t)sin(2fct) Q
sin(2fct/2)
32
translation is performed using two mixers and an LO fixed at the carrier frequency and operating in quadrature. In practical implementations, the phases of the LO signals may deviate from quadrature due to circuit mismatches. Quadrature phase mismatch in a direct-conversion receiver can be modeled by the block diagram illustrated in Fig. 3.8, where is the deviation from quadrature in the two LO signals. Thus, with quadrature phase mismatch, the output of the I signal path is given by 1 I (t ) cos 2 Q(t ) sin 2 2 while the output of the Q signal path is given by 1 Q(t ) cos 2 I (t ) sin 2 . 2 (3.13) (3.12)
The effect of quadrature phase mismatch on a QPSK constellation is illustrated in Fig. 3.9. When a QPSK signal with quadrature phase mismatch is corrupted by AWGN, the probability of error can be approximated by averaging the error probabilities for the symbols in each of the four quadrants of the constellation diagram. Each of the symbols in the first and third quadrants are located at a distance (d 0 2) [cos( 2) sin( 2)] from
1
0.5
d0/2[cos(/2)sin(/2)]
0.5
d0/2[cos(/2)+sin(/2)] 0 0.5 1
33
the decision boundaries and the error probability for these two symbols is approximately
1 d 02 d 02 Q cos sin + Q cos sin 2 2 2 2N0 2 2 2N 0 d 02 Q cos sin . 2 2 2N 0
Pb ( I , III )
(3.14)
Similarly, each of the symbols in the second and fourth quadrants are located at a distance (d 0 2)[cos( 2) + sin( 2)] from the decision boundaries and the error probability for these two symbols is approximately
Pb ( II , IV )
d 02 + Q cos + sin 2 2 2N 0
(3.15)
(3.16)
2 Eb 1 Q cos sin 2 2 2 N0
The error probabilities as a function of SNR for various amounts of gain mismatch are plotted in Fig. 3.10. As seen in Fig. 3.10, the BER degradation is minimal for a phase mismatch of 5 or less, which is indeed achievable in highly-integrated receiver implementations. Consequently, the direct-conversion architecture is also relatively insensitive to quadrature phase mismatch. As in the case of gain mismatch, the image-reject and low-IF architectures, are also much more sensitive to quadrature phase mismatch. In these two architectures, phase mismatch limits the amount of image rejection, as already discussed in Section 2.4.1.
34
10 0 10 1 10 2 10 3
Pb
10 4 10 5 10 6 10 7 0 2 4 6 8 10 12
ideal 3 phase mismatch 5 phase mismatch 10 phase mismatch
Eb/N0 (dB)
Figure 3.10: Probability of error for QPSK with quadrature phase mismatch.
3.6
Frequency Offset
The local oscillators in the transmitter and receiver are based on accurate frequency references. However, due to their physical separation, different frequency references are used in the transmitter and receiver. The frequency stability of these references are limited to 50 100 ppm for low-cost crystal references [17] or 1 50 ppm for highstability references [6], where the frequency stability is defined as
frequency stability [ppm] f [Hz] f c [MHz]
(3.17)
where fc is the nominal frequency and f is the frequency offset. The use of different
cos[2(fc+f)t]
I I(t)cos(2fct)+Q(t)sin(2fct) Q
sin[2(fc+f)t]
Figure 3.11: Frequency translation using I and Q LO signals with frequency offset.
35
frequency references in the transmitter and receiver introduces a frequency offset between the local oscillators, which can be modeled by the block diagram in Fig. 3.11. The output of the I signal path is
1 [ I (t ) cos(2ft ) Q(t ) sin(2ft )] 2
(3.18)
(3.19)
Consequently, frequency offset results in a spinning constellation, as illustrated in Fig. 3.12. If the frequency offset is sufficiently small relative to the signal bandwidth, a differentially-encoded signaling scheme such as differential phase-shift keying (DPSK) can be used [18]. In this case, the data is encoded in the transitions between symbols so that only the phase difference between successive symbols is needed for signal demodulation. Unfortunately, differentially-encoded modulation schemes require a larger SNR for the same BER performance as modulation schemes which use absolute phase encoding. For example, at large SNR, four-phase DPSK requires about 2.3 dB additional SNR for the same BER performance as four-phase PSK [16]. If coherent demodulation is preferred or if the frequency offset is large relative to the
1.5
0.5
0.5
36
fLO
fLO
(a)
(b)
Figure 3.13: Frequency spectra of LO signals. (a) Ideal. (b) With phase noise. signal bandwidth, then frequency offset compensation is required. Several techniques for frequency estimation and compensation are discussed in [19].
3.7
LO Phase Noise
Both thermal noise and flicker noise contribute to the nonideal behavior of the LO called phase noise. The frequency spectra of LO signals with and without phase noise are illustrated in Fig. 3.13. At the receiver, the desired signal can be quite weak and may be accompanied by very strong interfering signals at adjacent frequencies. If the interfering signals are not sufficiently attenuated, they can potentially corrupt the desired signal through reciprocal mixing, as illustrated in Fig. 3.14. Fig. 3.15 illustrates the constellation diagram for a QPSK signal when the desired signal is accompanied by a single-tone interferer at an adjacent frequency and is downconverted by an LO signal with phase noise. Even if interfering signals are not present, the close-in phase noise of the LO degrades the receiver performance by corrupting the information contained in the phase of the carrier.
interferer
desired
fc
fc+f
fIF
fIF+f
fLO
37
1.5
0.5
0.5
1.5 1.5
Figure 3.15: Effect of reciprocal mixing on a QPSK constellation. Frequency translation using in-phase and quadrature LO signals with phase noise (t ) can be modeled by the block diagram in Fig. 3.16. The output of the I signal path is
1 [ I (t ) cos (t ) Q(t ) sin (t )] 2
(3.20)
(3.21)
Consequently, close-in phase noise results in a time-varying phase-offset in the received symbols. The effect of close-in phase on a QPSK constellation is illustrated in Fig. 3.17, where the amount of constellation rotation is equal to the magnitude of (t ) .
cos[2fct+(t)]
I I(t)cos(2fct)+Q(t)sin(2fct) Q
sin[2fct+(t)]
Figure 3.16: Frequency translation using I and Q LO signals with phase noise.
38
1.5
0.5
0.5
1.5 1.5
3.8
Receiver Distortion
Another source of performance degradation in a receiver is distortion. In a nonlinear system, the output can contain frequency components which are not present in the input signal. A nonlinear system can be described by the following transfer function:
s o = a1 s i + a 2 s i2 + a 3 s i3 + K
(3.22)
where si and so are the input and output signals, respectively. Harmonic distortion occurs when a single sinusoid is applied to a nonlinear system. In this case, the output signal consists of frequency components which are integer multiples of the input frequency, f1, as illustrated in Fig. 3.18. Intermodulation distortion occurs when two sinusoids of different frequencies, f1 and f2, are applied to a nonlinear system. In the case of second-order intermodulation distortion,
...
f1 f 0 f1 2f1 3f1 f
39
...
f1 f2 f f2f1 f1 2f1f2 f2 2f2f1 f
As an example of how distortion can degrade the performance in the receiver, consider a received signal which consists of a weak desired signal accompanied by two large fixedamplitude sinusoidal interferers at adjacent frequencies, f1 and f2, as illustrated in Fig. 3.20. The third-order intermodulation product at 2 f 1 f 2 from the two interferers corrupts the desired signal and results in the constellation diagram illustrated in Fig. 3.21. In addition, as already mentioned in Section 2.3.1, even-order distortion is particularly troublesome in direct-conversion receivers since it gives rise to signal-dependent dc offsets and low-frequency components, all of which can potentially corrupt the desired signal.
3.9
Filtering
The received signal usually contains the desired signal as well as undesired signals at
interferers
desired
f1
f2
2f1f2
f1
f2
2f2f1
40
1.5
0.5
0.5
1.5 1.5
adjacent frequencies. In the receiver, filters are used to pass the desired signal while rejecting the unwanted frequency components. Several types of filters are commonly used to approximate an ideal low-pass filter response, including the Butterworth and Chebyshev filters [20]. The magnitude response of a Butterworth filter is given by
H ( j ) = 1 1+ 2 p
2N
(3.23)
where N is the filter order, p is the passband edge, and determines the magnitude variation in the passband. At = p , the magnitude is H ( j ) =
1 1+ 2
(3.24)
The magnitude responses for several Butterworth filters with = 1 are illustrated in Fig. 3.22. A Butterworth filter provides a maximally flat response and the degree of flatness in the passband increases as the filter order increases. Another commonly used filter is the Chebyshev filter, which exhibits an equiripple response in the passband. The magnitude response of a Chebyshev filter is given by
41
1 0.9 0.8 0.7 0.6 |H(j)| 0.5 0.4 0.3 0.2 0.1 0 0 0.2 0.4 0.6 0.8 1 /p 1.2 1.4 1.6 1.8 2
N=1 N=2 N=4 N=6
(3.25)
42
where N is the filter order, p is the passband edge, and determines the magnitude variation in the passband. At = p , the magnitude is
H ( j ) =
1 1+ 2
(3.26)
The magnitude responses for several Chebyshev filters with = 1 are illustrated in Fig. 3.23. For the same order and the same passband variation, a Chebyshev filter provides greater stopband attenuation than a Butterworth filter. Alternatively, a lowerorder Chebyshev filter can achieve the same stopband attenuation as a higher-order Butterworth filter. However, the Chebyshev filter achieves its superior stopband attenuation at the price of increased nonlinearity in the phase response. The phase responses for third-order Butterworth and Chebyshev filters are illustrated Fig. 3.24. The effect on receiver performance must also be considered when selecting a particular filter response. The effect of a fourth-order Butterworth filter, with = 1, on a QPSK signal along with the corresponding eye diagram are illustrated in Fig. 3.25. This Butterworth filter degrades performance slightly when using a QPSK signaling scheme. As seen from the constellation diagram, this Butterworth response affects the I and Q signal amplitudes but leaves their phase relationship unchanged. The eye diagram offers a
0 0.5 1 1.5 2 H(j) 2.5 3 3.5 4 4.5 0 0.2 0.4 0.6 0.8 1 /p 1.2 1.4 1.6 1.8 2
Butterworth Chebyshev
Figure 3.24: Phase responses for third-order Butterworth and Chebyshev filters.
43
1.5
1.5
1
1
0.5
0.5
0.5
0.5
(a)
(b)
Figure 3.25: Butterworth filter. (a) Effect on a QPSK constellation. (b) Eye diagram.
1.5
1.5
1
1
0.5
0.5
0.5
0.5
(a)
(b)
Figure 3.26: Chebyshev filter. (a) Effect on a QPSK constellation. (b) Eye diagram. different perspective, revealing that this Butterworth response introduces a small amount of intersymbol interference (ISI), which is consistent with the amplitude deviation seen in the constellation diagram. The effect of a fourth-order Chebyshev filter, with a maximum passband ripple of 1 dB, on a QPSK signal and the corresponding eye diagram are illustrated in Fig. 3.26. The
44
constellation diagram reveals that the nonlinear phase response of the Chebyshev filter alters the phase relationship between the I and Q signals. Consequently, when selecting a filter response, in addition to achieving sufficient attenuation of undesired signals, the filter must also maintain the integrity of the desired signal. In addition to the Butterworth and Chebyshev filters, other potential candidates for filtering in the receiver include the Bessel, inverse Chebyshev, and elliptic filters [20].
3.10
DC Offsets
Both the direct-conversion and image-reject architectures are sensitive to dc offsets since in both architectures, the desired signal is downconverted to baseband prior to analog-todigital conversion. DC offsets are problematic for two reasons. First, dc offsets can saturate the baseband circuits, such as amplifiers and filters, and consequently, the receiver ceases to function properly. Second, even if the baseband circuits do not saturate, dc offsets, if uncorrected, degrade the performance of the system. Fig. 3.27 illustrates the effect of dc offsets on a QPSK constellation, where dI and dQ are the dc offsets in the I and Q channels, respectively. When a QPSK signal with dc offsets is corrupted by AWGN, the probability of error can be approximated by averaging the error probabilities for two binary antipodal signals,
1.5
1 dQ 0.5 dI
0.5
45
each with a dc offset. For the I component, the dc offset is dI, and the two symbols are located at distances (d 0 / 2)(1 2d I / d 0 ) and (d 0 / 2)(1 + 2d I / d 0 ) from the decision boundary. Consequently, the probability of error for the I component is approximately
Pb ( I ) 1 2d I Q 1 2 d0 d 02 2N 0 2d I + Q 1 + d0 d 02 2N 0 .
(3.27)
Similarly, for the Q component, the dc offset is dQ, and the two symbols are located at distances (d 0 2)(1 2d Q / d 0 ) and (d 0 2)(1 + 2d Q / d 0 ) from the decision boundary. The probability of error for the Q component is approximately
1 2d Q Q 1 2 d0 d 02 2N 0 2d Q + Q 1 + d0 d 02 2N 0 .
Pb (Q )
(3.28)
(3.29)
The error probabilities as a function of SNR for various amounts of dc offset are plotted in Fig. 3.28. For dc offsets greater than 1%, the BER degradation can be significant.
3.11
ADC Quantization
In many receivers, an ADC converts the received signal to a digital signal for further processing. In order to avoid destructive aliasing, the ADC must sample the input signal
46
10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7
ideal dI/d0 = dQ/d0 = 0.01 dI/d0 = dQ/d0 = 0.05 dI/d0 = dQ/d0 = 0.10
10
12
Figure 3.28: Probability of error for QPSK with dc offsets. at the Nyquist sampling rate, f s 2 f sig , where fsig is the highest frequency contained in the input signal. The ADC quantizes each sample to one of 2R amplitude levels, where R is the number of bits used to represent each sample. The finite resolution of the quantizer results in an error, qn, in the digital output signal,
q n = x n xn
(3.30)
where xn is the analog input signal and x n is the digital output signal. For a uniform quantizer, the error is statistically characterized by the uniform probability density function [21]
p(q) = 1 1 1 , q 2 2
(3.31)
(3.32)
where VFS is the full-scale level of the ADC, and the mean square value of the quantization error is
2 1 2 1 VFS E (q ) = = 2 R = 20 log VFS 6 R 10.8 dB . 12 12 2 2
(3.33)
47
50 40 14 bit 10 bit 8 bit 6 bit 4 bit 0 1000 2000 3000 number of symbols 4000
SIR (dB)
30 20 10 0
10
Figure 3.29: Effect of ADC resolution on an adaptive MUD algorithm [22]. Consequently the quantization noise decreases by 6 dB for each additional bit used in the quantizer. The amount of quantization noise introduced by the ADC affects the performance of the algorithms implemented in the subsequent digital circuitry. For example, Fig. 3.29 illustrates the effect of ADC resolution on an adaptive multiuser detection (MUD) algorithm [22]. Clearly, a larger number of bits improves the signal-tointerference ratio (SIR).
3.12
Summary
Receiver front-end impairments, such as noise, distortion, and mismatch, can potentially degrade the performance of digital communications systems. This chapter described the effects of many of these impairments on a QPSK signal constellation. By including models for all of the analog impairments described in this chapter, the simulation framework described in the next chapter facilitates the direct evaluation of the effects of these impairments on the performance of digital communications algorithms.
48
Chapter 4
System-Level Simulation Framework
4.1
Introduction
All of the analog front-end impairments described in Chapter 3 can potentially degrade the performance of digital communications systems. In a conventional approach, link budget calculations based on a few system-level specifications are performed to determine the allowable levels of receiver impairments, such as noise and distortion [23]. This approach can be advantageous since it abstracts the analog hardware design from the detailed system-level specifications. However, the abstraction offered by this approach can also be a drawback since it does not offer the ability to more closely evaluate the effects of the analog front-end impairments on the performance of digital communications algorithms. This chapter describes a system-level simulation framework which allows for complete end-to-end simulations of communications systems. This framework includes models for the nonideal behavior of analog front-end components and facilitates the exploration of tradeoffs between analog impairments and overall system performance.
4.2
In a conventional approach, the allowable levels of receiver impairments are determined from a few system-level specifications, such as required BER, reference sensitivity, and worst-case out-of-band blocker levels. This method of determining the performance requirements of the analog front-end components is based strictly on numerical calculations. The following sections provide a brief overview of receiver performance calculations for receiver noise and distortion requirements.
4.2.1
Noise Calculations
In RF circuit design, the receiver noise performance is typically characterized in terms of noise factor or noise figure. The noise factor, F, of an RF circuit component is defined as
SNRin SNRout
(4.1)
where SNRin and SNRout are the signal-to-noise ratios at the input and output, respectively. The noise figure is simply the noise factor expressed in decibels, 10log(F). Although it is the preferred metric for noise performance in RF circuit design, noise figure is typically not a system-level specification. However, the noise figure can be calculated based on the reference sensitivity, the bandwidth of the desired signal, and the required BER. One of the primary system-level specifications is the reference sensitivity, which is defined as the minimum signal level which the receiver must be able to correctly detect. The input SNR, SNRin in (4.1), is simply the reference sensitivity divided by the noise power at the receiver input. The noise at the input of the receiver originates from the black-body radiation in the transmission environment and is modeled by a radiation resistance, Rs, associated with the antenna. Consequently, the noise power which appears at the receiver input is determined by the voltage divider between the receiver input resistance and the antenna source resistance (Fig. 4.1):
Pnoise =
(4.2)
50
antenna Rs + Vs Vin
receiver
Rin
Most receivers are designed for maximum power transfer from the antenna to the input of the receiver, in which case, Rin = Rs and (4.2) becomes Pnoise = kTf . (4.3)
In RF circuit design, power levels are commonly referred to in decibels referenced to 1 mW, or 0 dBm. At 300 K, the noise power in dBm is Pnoise [dBm] = 10 log(1.38 10 23 J/K 300 K 10 3 mW/W) + 10 log f = 173.8 + 10 log f (4.4)
where f is the signal bandwidth in hertz. Consequently, the input SNR in decibels is just
SNRin [dB] = Psig [dBm] Pnoise [dBm] = Psig [dBm] 10 log(f [Hz]) + 173.8 where Psig is the reference sensitivity. The other parameter required to calculate the receiver noise figure is the output SNR, SNRout in (4.1). Although the output SNR is typically not explicitly specified at the system-level, it can be determined from the maximum tolerable BER. A reliable communications link is guaranteed when the specified BER is achieved. The BER for a particular communications system is related to the SNR of the received signal, and the exact relationship depends on a number of factors including the modulation scheme as well as the specific algorithm used for signal detection. For example, for a QPSK (4.5)
51
modulation scheme, the BER as a function of SNR is given in (3.5) and illustrated in Fig. 3.2. Consequently, the required noise figure can be expressed as NF [dB] = SNRin [dB] SNRout [dB] = Psig [dBm] 10 log(f [Hz]) SNRout [dB] + 173.8 (4.6)
where Psig is the reference sensitivity, f is the bandwidth of the desired signal, and SNRout is the SNR corresponding to the maximum tolerable BER. Once the required noise performance of the receiver is determined, the noise budget must be partitioned between the various receiver building blocks. For cascaded receiver stages, the total noise factor is given by the Friis equation [24]: Ftot = F1 + F 1 F2 1 + K + NN G1 Gi
i =1
(4.7)
where Fi and Gi are the noise factor and power gain, respectively, of the ith receiver stage. Finally, before concluding this section, it is worthwhile to clarify the distinction between single-sideband (SSB) noise figure and double-sideband (DSB) noise figure [25]. Fig. 4.2 depicts an RF input signal which is corrupted by AWGN. The power and bandwidth of the RF signal are Psig and W, respectively, while the PSD of the AWGN is N / 2 , and consequently, the input SNR is SNRin = Psig N W .
LO = cos(2fct) Psig/2 Psig/2
(4.8)
PSD = N/2
fc W
Psig/8
Psig/4
Psig/8
52
PSD = N/2
fc W
Psig/8
Psig/8 fIF
Psig/8
PSD = N/4
Psig/8 2fcfIF
Figure 4.3: Frequency translation of RF signal to an intermediate frequency. If the RF signal is frequency translated directly to baseband by an LO signal cos(2 f c t ), as illustrated in Fig. 4.2, the output SNR of the resulting baseband signal is SNRout = Psig N W . (4.9)
In this case, the noise figure determined by (4.8) and (4.9) is referred to as the DSB noise-figure. However, if the RF signal is frequency translated to an intermediate frequency by an LO signal cos[2 ( f c f IF )t ] , as illustrated in Fig. 4.3, the output SNR of the resulting IF signal is SNRout = Psig 2N W . (4.10)
In this case, the noise figure determined by (4.8) and (4.10) is referred to as the SSB noise figure and is 3 dB higher than the DSB noise figure: NFSSB = NFDSB + 3 dB . (4.11)
4.2.2
Distortion Calculations
In RF circuit design, the distortion performance of the receiver is typically characterized by several measured parameters such as 1-dB compression point and intermodulation intercept point. These parameters are commonly modeled using approaches based on either power series or Volterra series [26]. The latter approach provides a very accurate characterization of nonlinearities in circuits with memory, i.e., circuits with capacitors
53
and inductors. However, calculations using Volterra series are rather complex, even when using computer simulation techniques. On the other hand, calculations using power series are much more tractable at the expense of only providing an accurate description of distortion in circuits that are memoryless. A system with second-order and third-order nonlinearities can be described by the following power series:
s o = a1 s i + a 2 s i2 + a 3 si3
(4.12)
where si and so are the input and output signals, respectively. For an input signal, si = A cos(2 ft ) , the output signal is so = a2 A2 a A2 3a A3 + a1 A + 3 cos(2 ft ) + 2 cos[2 (2 f )t ] 2 4 2 + a3 A cos[2 (3 f )t ]. 4
3
(4.13)
As seen in (4.13), third-order distortion alters the gain of the fundamental component,
cos(2 ft ). For large input signal amplitudes A,
3a3 A 2 a1 . 4
(4.14)
Since a3 is typically negative, the gain of the fundamental component decreases, or compresses, for large input signal amplitudes. The 1-dB compression point is defined as the input signal level which causes the gain of the fundamental to drop by 1 dB, as illustrated in Fig. 4.4. The 1-dB compression point of the receiver should be larger than the strongest anticipated input signal. The 1-dB compression point can be determined from (4.13):
2 3a3 Acomp
|= 20 log | a1 | 1 dB .
(4.15)
Acomp = 0.145
a1 . a3
(4.16)
54
signal consists of two sinusoids, A1 cos(2 f 1t ) and A2 cos(2 f 2 t ), then the third-order intermodulation products at 2 f 1 f 2 and 2 f 2 f1 are 3a A A 2 3a3 A12 A2 cos[2 (2 f1 f 2 )t ] + 3 1 2 cos[2 (2 f 2 f 1 )t ] . 4 4 (4.17)
In RF circuit design, the third-order intermodulation performance is characterized by the third-order intermodulation intercept point (IP3), which is the point where the amplitudes of the intermodulation products and the fundamental components are equal when two equal amplitude sinusoids are applied at the input. The input amplitude at which the intercept point occurs can be determined from (4.17):
3 3 | a3 | AIP 3 =| a1 | AIP 3 . 4
(4.18)
55
AIP 3 =
4 a1 . 3 a3
(4.19)
Equation (4.19) can also be expressed in terms of the 1-dB compression point in (4.16):
AIP 3 =
(4.20)
In practice, the higher-order distortion terms become significant at the intercept point, and consequently, the IP3 measurement must be performed with sufficiently small input amplitudes so that the contribution from the higher-order distortion terms is negligible. The power levels of the third-order intermodulation products and the fundamental components are plotted for a few input power levels, and the intercept point is determined by the intersection of the two lines extrapolated from these points, as illustrated in Fig. 4.5. The receivers required third-order intermodulation distortion requirement can be determined from the system-level specifications for the anticipated levels of out-of-band interferers, the reference sensitivity, and the desired BER. As described in Section 4.2.1, the maximum BER specification corresponds to a minimum SNR which depends on various factors, including the type of modulation scheme and the specific algorithm used for signal detection. Consequently, at the output of the receiver, the third-order
fundamental IP3
third-order intermodulation
56
intermodulation products due to the out-of-band signals which fall within the bandwidth of the desired signal must be small enough to still maintain the minimum SNR:
G Psig PIM 3
SNR
(4.21)
where G = a12 is the power gain of the receiver, Psig is the power of the desired signal at receiver input, or the reference sensitivity, and PIM3 is the power of the intermodulation product at the receiver output. In this case, the intermodulation product is assumed to be uncorrelated with the desired signal and equivalent to AWGN. From (4.17), the power of the intermodulation product is
PIM 3 = 9 2 3 a3 Pint 16
(4.22)
where Pint is the power of each of the out-of-band interferers. Substituting (4.22) into (4.21) gives 16 a12 Psig SNR . 2 3 9 a3 Pint (4.23)
But from (4.19), the input power level at the third-order intermodulation intercept point is
PIIP 3 =
4 a1 3 a3
(4.24)
so (4.23) becomes
2 Psig PIIP 3 3 Pint
SNR .
(4.25)
Consequently, the input power level corresponding to the required third-order intermodulation intercept point is
PIIP 3 Pint SNR Pint Psig
(4.26)
or
57
1 PIIP 3 [dBm] Pint [dBm] ( SNR [dB] + Pint [dBm] Psig [dBm]) . 2
(4.27)
Once the required IP3 performance of the receiver is determined, the distortion budget must be partitioned between the various receiver building blocks. For cascaded receiver stages, if the distortion contribution from the various stages are all uncorrelated, then the input power level corresponding to the third-order intermodulation intercept point can be determined from the following expression
N 1
(4.28)
where PIIP3(i) is the input power at the intercept point of the ith stage and Gi is the power
4.2.3
Summary
In a conventional approach, a few system-level specifications, such as maximum BER and reference sensitivity, are converted to other metrics more commonly used in RF circuit design, such as noise figure and intermodulation intercept point, using equations such as (4.6) and (4.27). These requirements are budgeted between the various blocks which make up the receiver using additional equations such as (4.7) and (4.28), and spreadsheets are commonly used in order to facilitate such calculations. Although this approach is relatively straightforward and simple, it lacks the ability to model the detailed interactions between the analog front-end impairments and the digital back-end algorithms. Because of its simplicity, the equation-based method still serves as a good starting point. However, a simulation framework which is capable of end-to-end simulations can provide a much more accurate assessment of the effects of receiver impairments on system-level performance. For such an approach, the simulation environment must be simple and capable of rapid simulation speeds. Consequently, behavioral models, rather than circuit models, should be used for the receiver components. The following sections provide detailed descriptions of a system-level
58
simulation framework which can be used to explore the effects of analog front-end impairments as well as to evaluate overall receiver performance.
4.3
Baseband-Equivalent Models
Even with the use of behavioral models, a modeling framework which simulates the analog front-end at the carrier frequency is unacceptable in terms of speed. For such a simulation, the maximum step size must be based on the carrier frequency, whereas the total number of steps must be based on the symbol rate. Since the carrier frequency is typically much higher than the symbol rate in wireless communications systems, such a simulation would be prohibitively slow. In order to decrease the simulation time, the simulation framework relies on basebandequivalent models for the analog RF building blocks, such as amplifiers, mixers, and oscillators. The method is similar to envelop simulation techniques used in some RF circuit-level simulators [27]. The baseband-equivalent models for the various RF building blocks are based on the following expression which can be used to represent any real signal along the RF signal path:
s (t ) = s DC (t ) + [ s In (t ) cos(n c t ) + sQn (t ) sin(n c t )]
n =1 N
(4.29)
where the bandwidths of s DC (t ) , s In (t ) , and sQn (t ) are assumed to be much less than c. For example,
s (t ) = s I 1 (t ) cos( c t ) + sQ1 (t ) sin( c t )
(4.30)
represents an ideal RF signal where s I 1 (t ) and sQ1 (t ) are the baseband I and Q signals, respectively, and c is the carrier frequency. An example spectrum of s(t) in (4.29) is illustrated in Fig. 4.6. Baseband-equivalent models are derived by letting the input and output signals of the various RF building blocks take the same form as (4.29). Then the dependence on c is eliminated by keeping track of only the time-varying coefficients, s DC (t ) , s In (t ) , and sQn (t ) . The baseband-equivalent models for RF amplifiers, mixers, and oscillators are derived in the following sections.
59
S()
...
2c c 0
SDC()
...
c 2c
4.3.1
RF Amplifiers
The transfer function of any RF gain block such as the LNA can be represented by the following relationship:
xo (t ) = a n xin (t )
n =0 N
(4.31)
where xi(t) and xo(t) are the input and output signals, respectively. The coefficients a0 and a1 represent dc offset and gain, respectively, while the remaining coefficients an represent the nonlinear behavior of the RF gain block. Then let xi(t) and xo(t) have the same form as (4.29):
xi (t ) = xiDC (t ) + [ xiIn (t ) cos(n c t ) + xiQn (t ) sin(n c t )]
n =1 N N
(4.32) (4.33)
and solve for the output coefficients xoDC(t), xoIn(t), and xoQn(t) in terms of the input coefficients xiDC(t), xiIn(t), and xiQn(t). The results for N = 3 are given in Appendix A. By pre-computing the relationship between these time-varying coefficients instead of keeping track of the actual signals, xo(t) and xi(t), the dependence on the carrier frequency is removed.
4.3.2
Local Oscillators
The baseband-equivalent model for oscillators is also derived in a similar fashion by letting the oscillator output signal have the same form as (4.29):
60
(4.34)
By specifying appropriate functions for the time-varying coefficients yLODC(t), yLOIn(t), and yLOQn(t), (4.34) accounts for local oscillator impairments such as quadrature phase mismatch, frequency offset, and phase noise. For example, for an oscillator with frequency offset and phase offset , the output signal can be expressed as
y LO (t ) = cos[( c + )t + ] = cos( t + ) cos( c t ) sin( t + ) sin( c t ) = y LOI 1 (t ) cos( c t ) + y LOQ1 (t ) sin( c t )
(4.35)
4.3.3
Mixers
A similar method is used to derive the baseband-equivalent model for mixers in directconversion receivers. The transfer function of a mixer can be represented by y o (t ) = y i (t ) y LO (t ) (4.36)
where yi(t), yLO(t), and yo(t) are the input, oscillator, and output signals, respectively. Then let yi(t), yLO(t), and yo(t) have the same form as (4.29):
yi (t ) = y iDC (t ) + [ y iIn (t ) cos(n c t ) + y iQn (t ) sin(n c t )]
n =1 N N
(4.37)
(4.38)
(4.39)
and solve for the output coefficients yoDC(t), yoIn(t), and yoQn(t) in terms of the input coefficients yiDC(t), yiIn(t), yiQn(t), yLODC(t), yLOIn(t), and yLOQn(t). Additional reductions in simulation complexity can be achieved by keeping track of only the baseband component yoDC(t). This approximation is valid for large carrier frequencies since the frequency
61
components at c and greater are significantly attenuated by baseband filtering. The results for N = 3 are given in Appendix A. The derivation of this baseband-equivalent model is based on (4.29), in which all of the harmonic components are assumed to have bandwidths much less than c. In the case of direct-conversion receivers, application of this model is straightforward since the LO and the desired signal are at the same frequency, c, and all the mixing products fall on multiples of c. However, in the case of heterodyne receivers, this baseband-equivalent model must be used with caution since the harmonic components of the output may overlap. For example, consider an input signal centered at the carrier frequency c yi (t ) = y iI 1 (t ) cos( c t ) and an LO signal at c IF y LO = cos[( c IF )t ] = y LOI 1 (t ) cos( c t ) + y LOQ1 (t ) sin( c t ) where y LOI 1 (t ) = cos( IF t )
y LOQ1 (t ) = sin( IF t ) .
(4.40)
(4.41)
(4.42) (4.43)
(4.44)
where
y oDC (t ) = y oI 2 (t ) = y oQ 2 (t ) = 1 y iI 1 (t ) cos( IF t ) 2 1 y iI 1 (t ) cos( IF t ) 2 1 y iI 1 (t ) sin( IF t ) . 2
62
In low-IF receivers, the intermediate frequency, IF, is typically much smaller than the carrier frequency c. In this case, the harmonic components do not overlap, and consequently, the same baseband-equivalent model used for mixers in direct-conversion receivers can be used for mixers in low-IF receivers. However, if IF > c / 2 , then the spectral content of yoDC(t) in (4.45) actually extends beyond c / 2 , which violates the bandwidth constraints for the coefficients in (4.29). Likewise, both yoI2(t) in (4.46) and yoQ2(t) in (4.47) have positive frequency components which extend beyond 2 c c / 2 . Consequently, a different baseband-equivalent model is used for intermediate-frequency mixers in heterodyne architectures, such as the image-reject architecture. In this case, yi(t) in (4.36) is still represented by (4.37), but yLO(t) in (4.36) is instead represented by the following expression y LO (t ) = y LODC (t ) + { y LOIn (t ) cos[n( c IF )t ] + y LOQn (t ) sin[n( c IF )t ]} = y LODC (t ) + {[ y LOIn (t ) cos(n IF t ) y LOQn (t ) sin(n IF t )] cos(n c t ) + (4.48)
n =1 n =1 N N
[ y LOIn (t ) sin(n IF t ) + y LOQn (t ) cos(n IF t )] cos(n c t )}. Equation (4.48) differs from (4.38) in that the dependence on the intermediate frequency,
[ y
n =1 N N N N
oIRFn
(4.49)
oIPnm
{ y
n =1 m =1
{ y
n =1 m =1
oINnm
The baseband-equivalent model is derived by solving for the output coefficients of yo(t) in terms of the input coefficients of yi(t) and yLO(t). Although this model seems quite complex, reductions in simulation complexity can be achieved by keeping track of only the relevant output coefficients. In heterodyne receivers, the components far from IF are 63
significantly attenuated by filtering, and consequently, it is only necessary to keep track of the components near IF. The results for N = 3 are given in Appendix A.
4.3.4
Summary
By letting the input and output signals of the various RF building blocks take the same form as (4.29) for direct-conversion and low-IF receivers or (4.49) for heterodyne receivers and pre-computing the relationship between the time-varying coefficients, the dependence on the carrier frequency is eliminated. By specifying appropriate functions for the time-varying coefficients, these baseband-equivalent models account for many circuit impairments in the RF components, including distortion, phase noise, quadrature phase mismatch, frequency offset, and dc offsets. Moreover, the maximum step size of the simulation is now determined by the maximum frequency component in the timevarying coefficients rather than the carrier frequency. The additional simulation complexity of this technique depends on the number of harmonics, N, which must be chosen in order to accurately model the effects of analog circuit impairments such as distortion. For typical wireless communications systems, the number of harmonics should be chosen to be at least three.
4.4
A simulation framework which allows for detailed, yet rapid, simulations of the analog front-end is implemented in Simulink, a graphical simulation environment built on top of MATLAB. Implementing the simulation framework in Simulink offers compatibility of the analog front-end simulations with MATLAB, which is already widely used for development and evaluation of communications algorithms. Consequently, using this simulation framework allows for complete end-to-end simulations of communications systems, including all analog, mixed-signal, and digital components. Such a framework facilitates the exploration of analog and digital design tradeoffs, leading to solutions which can potentially achieve lower power consumption and higher levels of integration. Since behavioral models offer fast simulation speed, the simulation environment described here includes behavioral models for all of the analog front-end components.
64
The behavioral models for all high-frequency components, such as RF amplifiers, mixers, and oscillators, are based on the baseband-equivalent models described in Section 4.3. In addition, structural models are also available for some of the more complex receiver components, such as phase-locked loops and sigma-delta analog-to-digital converters. Although structural models require longer simulation times, they offer increased accuracy when used in system-level simulations. Finally, the simulation framework described here also supports the conventional design approach in which the performance requirements of the analog front-end components are determined by the numerical calculations described in Section 4.2. However, instead of using spreadsheets to facilitate such calculations, equations such as (4.7) and (4.28) are directly incorporated into the models of the various receiver building blocks. Consequently, the simulation framework can provide overall receiver performance metrics, such as noise figure and input IP3, based on the specifications of the individual cascaded components. The following sections provide additional detail about various aspects of this system-level simulation framework.
4.4.1
Thermal Noise
Thermal noise introduced by the various components in the analog front-end is modeled by the Band-Limited White Noise block in Simulink. A few precautions should be exercised when specifying the simulation parameters for this block. First, the noise power parameter is used to specify the desired double-sided power spectral density (PSD) of the white noise. The double-sided PSD is half of the single-sided PSD, which is more commonly used in circuit analysis. For example, the single-sided PSD of the voltage noise introduced by a resistor is
v2 = 4kTR . f
(4.50)
When using the Band-Limited White Noise block to model the noise introduced by a resistor, the noise power parameter should be set to the double-sided PSD, 2kTR. Next, the sample time parameter is used to specify the correlation time of the noise. The inverse of this parameter is just the noise bandwidth. Ideally, white noise should have infinite bandwidth or a correlation time of zero. In order to approximate the wideband 65
nature of the noise, the noise bandwidth should be much greater than the largest frequency component in the system. In other words, the sample time parameter should be chosen to be much smaller than the shortest time constant in the system. The following condition on the sample time, tc, is recommended for good results [28]:
tc 1 2 . 100 f max
(4.51)
Clearly, this constraint can increase the simulation time significantly. In fact, the speed of the simulation framework described here is limited by the ability to accurately model the behavior of broadband white noise. One way to reduce the simulation time is simply to relax the noise sampling time constraint. Simulations were performed to evaluate the effect of noise sampling time on accuracy by feeding broadband white noise with a single-sided PSD of 1 V2/Hz through a third-order Butterworth low-pass filter, the cutoff frequency of which determines the smallest time constant of the system. The magnitude response of a third-order Butterworth filter is
| H ( f ) |2 = 1 1 + ( f / fc )6
(4.52)
1+ ( f / f
df
2 = fc . 3 c)
6
(4.53)
These simulation results are summarized in Table 4.1. As expected, the simulated output noise variance is very accurate when the noise bandwidth is much larger than the filter bandwidth. However, the results are still fairly accurate for a noise bandwidth which is only five to ten times larger than the filter bandwidth. Thus, if simulation speed is critical, then the constraint on the noise sampling time in (4.51) can be relaxed, but should be no more than about one-fifth the shortest time constant in the system. Finally, the seed parameter in the Band-Limited White Noise block sets the starting seed for the random number generator. Different seeds should be used for different noise sources in order to keep the various noise sources uncorrelated. Setting the seed
66
noise bandwidth 1 GHz 1 GHz 1 GHz 1 GHz 1 GHz 1 GHz 100 MHz 100 MHz 100 MHz 100 MHz 100 MHz 100 MHz
filter bandwidth 1 MHz 10 MHz 20 MHz 30 MHz 40 MHz 50 MHz 1 MHz 10 MHz 20 MHz 30 MHz 40 MHz 50 MHz
expected output noise variance 63.2 dB 73.2 dB 76.2 dB 78.0 dB 79.2 dB 80.2 dB 63.2 dB 73.2 dB 76.2 dB 78.0 dB 79.2 dB 80.2 dB
simulated output noise variance 63.2 dB 73.2 dB 76.2 dB 78.0 dB 79.2 dB 80.2 dB 63.2 dB 73.0 dB 75.9 dB 77.3 dB 78.2 dB 78.7 dB
Table 4.1: Effect of noise sampling time on simulation accuracy. parameter to a random number guarantees that all the noise sources in the same simulation are independent.
4.4.2
Flicker Noise
The power spectral density of the input-referred voltage noise for a MOS transistor consists of a flicker noise component and a thermal noise component and is given by (2.6) and repeated here Kf vi2 2 1 = + 4kT . f WLCox f 3 gm (4.54)
Flicker noise is particularly important in the direct-conversion and Weaver architectures since in both cases, the desired signal is frequency translated to baseband prior to analogto-digital conversion. In addition, it can also be a dominant contributor to phase noise in the LO. In order to examine the effects of flicker noise on overall system performance, a flicker noise model is included in the simulation framework. Flicker noise is generated by passing white noise through a filter with magnitude response [29] | H ( f ) |= 1 f . (4.55)
67
magnitude
1 (1ej)1/2 1 1/2
0 0
0.5
1.5
2.5
Figure 4.7: Magnitude response of H (e j ) in (4.57). Such a filter can be approximated by the discrete-time transfer function [30]
H ( z) = 1 . (1 z 1 )1 / 2
(4.56)
(4.57)
and the magnitude of H (e j ) is plotted in Fig. 4.7 and matches well with the magnitude response of 1 / 1 / 2 . The denominator in (4.56) can be expanded in a power series, giving
H ( z) = 1 1 1 1 1 1 z 1 1 z 2 K 2 2 2 2!
(4.58)
The coefficients of the power series in the denominator of (4.58) are given by the following iterative equations:
a1 = 1 5 a a k = k k 1 . 2 k 1
(4.59)
68
Figure 4.8: Filter initialization code corresponding to (4.59). In Simulink, this filter is implemented using the Direct-Form II Transpose Filter block and the denominator coefficients are determined by the initialization code illustrated in Fig. 4.8. The amount of flicker noise is determined by specifying the power spectral density, PSDa, of the noise at the flicker noise corner frequency, fa, which is the frequency at which the flicker noise and thermal noise asymptotes intersect. The BandLimited White Noise block is then used to generate white noise with power spectral density equal to 2 f a PSDa . The output of the Band-Limited White Noise block is then passed through the Direct-Form II Transpose Filter block, and the resulting power spectral density is
PSD = 2 f a PSDa f 1 = PSDa a . 2 f f
(4.60)
The accuracy of the filter can be increased by increasing the number of terms in the power series expansion of the denominator in (4.58). Of course, this increased accuracy comes at the cost of increased simulation time. The power spectral density corresponding to a filter with 1000 taps is illustrated in Fig. 4.9. As seen in Fig. 4.9, the simulated power
10 15
magnitude (dB)
20 25 30 35 40 45 0.1 1 10
frequency (MHz)
69
4.4.3
RF Amplifiers
The baseband-equivalent model for RF amplifiers described in Section 4.3.1 is implemented in Simulink using S-functions. S-functions may be written in either the MATLAB or C programming languages. For this simulation framework, all the S-functions are written in C in order to maintain compatibility with the Real-Time Workshop. Compiling Simulink designs using the Real-Time Workshop offers simulation speed improvements of up to ten times. Unfortunately, the Real-Time Workshop cannot be used to compile Simulink designs which contain S-functions written in the MATLAB programming language. The transfer function of an RF gain block is described by (4.31), and in this simulation framework, the baseband-equivalent model is implemented for N = 3 . The behavior of each RF gain block is specified by the coefficients of the equation
xo (t ) = a 0 + a1 xi (t ) + a 2 xi2 (t ) + a3 xi3 (t )
(4.61)
as well as by the blocks noise performance. Since N = 3 , the input and output of an RF gain block each consists of seven signals representing the time-varying coefficients of the equation
s (t ) = s DC (t ) + [ s In (t ) cos(2 nf c t ) + s Qn (t ) sin(2 nf c t )] .
n =1 3
(4.62)
In order to model broadband white noise, noise sources with the appropriate power spectral densities must be added to each of the seven input signals. In Fig. 4.10, the total double-sided noise PSD is partitioned into multiple non-overlapping components. The
PSD N0 nn(f)
3fc
2fc
fc
fc
2fc
3fc
70
component from f c / 2 < f < f c / 2 has a double-sided PSD equal to N0. The components centered at fc and fc can be represented by the following expression: n(t ) = x(t ) cos(2 f c t ) y (t ) sin(2 f c t ) (4.63)
where n(t) is a wide-sense stationary stochastic process with zero mean and PSD nn ( f ) , and x(t) and y(t) are the I and Q components of n(t), respectively, both of which are band-limited from f c / 2 < f < f c / 2 . Since n(t) is zero mean, x(t) and y(t) are also zero mean. In addition, since n(t) is stationary, the autocorrelation and cross-correlation functions of x(t) and y(t) satisfy the following properties:
xx ( ) = yy ( ) xy ( ) = yx ( ) .
(4.64) (4.65)
Moreover, since the PSD for band-pass white noise is symmetric about f = 0 ,
nn ( ) = xx ( ) cos(2 f c )
and the PSD of n(t) is
1 nn ( f ) = [ xx ( f f c ) + xx ( f + f c )] . 2
(4.66)
(4.67)
From Fig. 4.10, nn ( f ) = N 0 for 3 f c / 2 < f < f c / 2 and f c / 2 < f < 3 f c / 2 , and therefore, the power spectral densities of x(t) and y(t) are
xx ( f ) = yy ( f ) = 2 N 0 .
(4.68)
Consequently, if the noise component centered at dc is modeled by a white noise source with a double-sided PSD equal to N0, then the I and Q noise components centered at fc, 2fc, and 3fc are each modeled by an independent white noise source with a double-sided PSD equal to 2N0.
4.4.4
Mixers
71
The baseband-equivalent model for direct-conversion and low-IF mixers described in Section 4.3.3 is implemented for N = 3 using S-functions written in C. Similar to the model for RF amplifiers, the gain and distortion performance of each mixer is specified by the coefficients in (4.61). In fact, the same baseband-equivalent model used for RF amplifiers is used to model the gain and distortion performance of each mixer. In this implementation, the gain and distortion introduced by the mixer is assumed to occur before the frequency translation process. Finally, similar to the RF amplifier blocks, the broadband white noise introduced by the mixer is modeled by adding noise sources with the appropriate power spectral densities to each of the seven input signals. If the noise component centered at dc is modeled by a white noise source with a double-sided PSD equal to N0, then the I and Q noise components centered at fc, 2fc, and 3fc are each modeled by an independent white noise source with a double-sided PSD equal to 2N0.
4.4.5
Local Oscillators
Many receivers use in-phase and quadrature oscillator signals for frequency translation:
(4.69)
(4.70)
where AI and AQ are the amplitudes of the in-phase and quadrature oscillator signals, respectively, fc is the oscillation frequency, f is the frequency offset, 0 is the quadrature phase mismatch and n(t) is the phase noise. Equations (4.69) and (4.70) can be expressed in the form of (4.29):
(4.71)
72
(4.72)
This baseband-equivalent model is implemented using built-in Simulink blocks [31]. The phase noise performance of the oscillator can be represented by either a simple behavioral model or a more complex structural model [31]. In the simple behavioral model, at small frequency offsets from the center frequency, the power spectral density of the phase noise is assumed to be constant, while at large frequency offsets, it is assumed to fall off at f
2
first-order low-pass filter. An example spectrum is illustrated in Fig. 4.11. In this case, the phase noise is specified to be 70 dBc/Hz at low frequencies and 100 dBc/Hz at a 100-kHz frequency offset. This behavioral model provides a good first-order approximation of the phase noise performance, while being simple enough to provide fast simulation times. A more complex phase noise model is also available is this simulation framework. In many receivers, the local oscillator signal is generated from a very accurate reference frequency using a phase-locked loop (PLL) as illustrated in Fig. 4.12. The phase detector (PD) compares the phases the two input signals, and its output passes through a loop filter (LF) before being applied to the voltage-controlled oscillator (VCO). A divider reduces
60 70
magnitude (dB)
10
100
frequency (kHz)
Figure 4.11: Example spectrum of phase noise generated by simple behavioral model.
73
fi
PD
LF
VCO
fo
KPD
FLF(s)
1 N
the frequency of the oscillator signal by a factor N and this signal is fed back to the PD to complete the loop. When the loop is locked, f o = Nf i . Under this condition, the PLL can be modeled as a linear system (Fig. 4.13). From this linear model, the noise contribution from each of the PLL components can be determined. Fig. 4.14 illustrates the linear model along with all relevant noise sources. The noise source n1 models the noise contributions from the PD, reference oscillator, and divider, while n2 models the noise from the LF and n3 models the noise from the VCO. The total noise at the PLL output is
o = o1 + o 2 + o 3
where
FLF ( s ) s o1 = K PD K VCO FLF ( s ) n1 1+ s N K PD K VCO
n1 i(s) KPD FLF(s) n2 KVCO s n3 o(s)
(4.73)
(4.74)
1 N
74
1 Complex Phase
o2 =
K K F (s) 1 + PD VCO LF s N
n2
(4.75)
o3 =
1 K K F ( s) 1 + PD VCO LF N s
n3 .
(4.76)
This phase noise model is implemented for a PLL with a second-order loop filter using built-in Simulink blocks and is illustrated in Fig. 4.15 [31]. In this model, the phase noise performance is specified by PLL parameters such as PD gain, VCO gain, and divider ratio N, as well as by circuit parameters such as the resistor and capacitor values used to implement the LF. Because of its accuracy, this model is very useful as an aid in exploring the tradeoffs between various PLL parameters. Finally, although this model provides a very accurate representation of the PLL phase noise, its complexity results in longer simulation times. Consequently, this phase noise model should be avoided in system-level simulations unless very accurate results are required.
4.4.6
Simulink provides a comprehensive library of blocks which can be used to model baseband amplifiers and filters. In particular, the Transfer Fcn block can be used to
75
implement any linear transfer function by specifying the coefficients of the numerator and denominator polynomials: a1 s n 1 + a 2 s n 2 + a3 s n3 + K + a n . H ( s) = b1 s m1 + b2 s m 2 + b3 s m3 + K + bm (4.77)
Alternative, the Zero-Pole block can be used to implement any linear transfer function by specifying the poles and zeros: H ( s) = K ( s z1 )( s z 2 )( s z 3 ) K ( s z n ) . ( s p1 )( s p 2 )( s p3 ) K ( s p n ) (4.78)
Finally, Simulink also provides an Analog Filter Design block in the Filter Designs library of the DSP Blockset which can be used to model Butterworth, Chebyshev Type I, Chebyshev Type II, and Elliptic filter responses. Filters can be specified as low-pass, high-pass, band-pass, or band-stop. For a Bessel response, the simulation framework provides a custom Simulink block based on the besself MATLAB function. The numerator and denominator coefficients of an Nth-order Bessel filter with cutoff frequency are determined by the following MATLAB command:
[num, den] = besself ( N , ) .
(4.79)
Once the filter coefficients are determined, the Bessel filter response is implemented using the Transfer Fnc block.
4.4.7
Analog-to-Digital Converters
A simple ADC behavioral model implemented using Simulink blocks is illustrated in Fig. 4.16. In this model, the analog input signal is sampled at the ADC sampling rate by the Zero-Order Hold block and the sampled value is quantized to one of 2R amplitude levels by the Quantizer block, where R is the ADC resolution. In addition, the Saturation
1 Out1
76
block models the clipping which can occur when the input signal amplitude exceeds the full-scale voltage of the ADC. This simple behavioral model results in rapid simulation times and is adequate for the preliminary evaluation of performance degradation resulting from ADC impairments. A more accurate evaluation of the effects of ADC impairments requires a more complex structural model. Since there are many types of ADCs, developing a comprehensive library of ADC structural models is impractical. However, despite requiring different structural models, the modeling issues for the different types of ADCs are similar since they are based on similar building blocks, such as switched-capacitor circuits and comparators. The structural model of a first-order sigma-delta () converter is described below [32], [33] as an example and similar techniques can be used to develop structural models for other types of ADCs. The block diagram of a first-order converter is illustrated in Fig. 4.17. The integrator is implemented as a single-ended switched-capacitor circuit illustrated in Fig. 4.18, and when the gain of the operational transconductance amplifier (OTA) is large, the transfer function is given by
DAC
OTA
Vout
77
1 In1
Cs/Ci Gain
1 Out1
(4.80)
In this ideal case, the integrator can be modeled in Simulink using the Gain and Discrete Filter blocks as illustrated in Fig. 4.19. In reality, the gain the of the OTA is limited. In this case, the OTA transfer function is given by C Vout z 1 = H ( z) = S C I 1 (1 ) z 1 Vin where (4.81)
1 C + CI 1+ S AC I 1 C 1+ S AC I
(4.82)
1 =
(4.83)
and A is OTA gain. Consequently, finite OTA gain reduces the forward gain of the integrator by in addition to shifting the pole of H(z) inside the unit circle. An integrator with finite OTA gain is modeled in Simulink as illustrated in Fig. 4.20. In a practical implementation, integrator noise also degrades the performance of the converter. These noise sources include thermal noise from the sampling switches as well
1 In1
(Cs/Ci)*beta
1 Out1
Gain
78
1 In1
Cs/Ci Gain
1 Out1
Input-referred noise
as thermal noise and flicker noise from the OTA. An integrator with equivalent inputreferred thermal and flicker noise sources is modeled in Simulink as illustrated in Fig. 4.21. The performance of the converter is also affected by the nonlinear transfer function of the integrator. In this case, the transient response of the switched-capacitor integrator is illustrated in Fig. 4.22. The nonlinear transfer function can be expressed as [34]
SRN Kvi + ( K + 1)vi (1 e n ), ( K + 1)vi n ( K +1) vi n n 1 1 SRN SRN SRN < ( K + 1)vi SRN 1 (4.84) vo (vi ) = vi sgn(vi ) e , n n n 1 SRN 1 < ( K + 1)vi Kvi + sgn(vi ) SRN , n where K accounts for the charge feed-through, n is the number of time constants occurring during the settling period, and SRN is the normalized slew rate. The nonlinear transfer function of the integrator is modeled in Simulink using the Look-Up Table block (Fig. 4.23).
79
1 In1
Cs/Ci Gain
1 Out1
Input-referred noise Cs/Ci*beta Zero-Order Hold FS=1.00 Ntau=6.0 SRN=0.7 K=0.2 SC Transient Response z-1 1-(1-eps)z -1 Integrator Relay
1 In1
1 Out1
Gain
Figure 4.24: Structural model of first-order converter. Finally, the performance of the converter is also affect by comparator offset or hysteresis, which is modeled by the Relay block in Simulink. The complete structural model for the first-order sigma delta converter is illustrated in Fig. 4.24.
4.4.8
Finally, the simulation framework provides overall receiver performance metrics for the analog front-end, including total gain, noise figure, input IP2, and input IP3, based on the specifications of the individual cascaded components. The noise figure and input IP3 are calculated based on (4.7) and (4.28), respectively, while the input IP2 is calculated from
N 1
(4.85)
where PIIP2(i) is the input power at the second-order intermodulation intercept point of the
4.5
Summary
The simulation framework presented in this chapter facilitates the exploration of tradeoffs between analog front-end impairments and system-level performance. This framework is implemented in Simulink, a graphical simulation environment built on top of MATLAB. 80
The use of baseband-equivalent models for all RF building blocks in addition to compiling Simulink designs using the Real-Time Workshop result in fast end-to-end simulations of entire receiver systems.
81
Chapter 5
A High-Speed Wireless Downlink
5.1
Introduction
Next-generation wireless systems will have to support new applications which require higher and higher data rates. These applications include wireless internet as well as other multimedia applications including full-motion video and real-time high-fidelity audio. This chapter describes a wideband code-division multiple access (WCDMA) system which is designed to operate in an indoor picocellular environment [22], [35]. Within each cell, a single base station supports aggregate data rate of 50 Mb/s. The system bandwidth is 32.5 MHz and operates at a carrier frequency of 2 GHz.
5.2
Base-Station Transmitter
A block diagram of the digital baseband section of the base-station transmitter is illustrated in Fig. 5.1. Two different systems are proposed. In the first case, each cell supports up to 15 binary data channels, each with a data rate of up to 3.33 Mb/s [22]. In the second case, each cell supports up to 31 binary data channels, each with a data rate of up to 1.61 Mb/s [36]. In both cases, however, the aggregate data rate is 50 Mb/s and the total transmission bandwidth is 32.5 MHz. Each of the K binary data channels is mapped 83
K = 15 K = 31
25 MHz a1
d1
...
gK[n]
aK
I Q
dK
QPSK modulator
spreading
Figure 5.1: Block diagram of base-station transmitter (digital baseband section). to a four-point QPSK constellation, as described in Section 3.2. Although higher-order constellations result in increased spectral efficiency, they also require higher SNRs for the same BER. The proposed system achieves a spectral efficiency in excess of 1.5 b/s/Hz. The output from each of the QPSK modulators consists of I and Q data streams each operating at 1.67 MHz for K = 15 or at 0.81 MHz for K = 31 .
5.2.1
Code-division multiple access (CDMA) is used as the multiple-access scheme for the proposed system. In CDMA, each of the K data channels is assigned a distinct signature sequence, or a spreading code, and all data channels are transmitted simultaneously using the entire frequency band. Since CDMA is a direct-sequence spread-spectrum signaling technique, this multiple-access strategy is robust against frequency-selective narrowband fading which results from multipath propagation [16]. In addition, the processing gain of spread-spectrum signals provides some immunity against narrowband interference. Each of the K data channels is assigned a discrete-time signature waveform gk[n] with unit energy. This spreading code is superimposed on both the I and Q binary antipodal
84
data streams at the output of the QPSK modulator. For K = 15 , the spreading codes are based on a 4-bit maximal-length shift-register (MLSR) sequence, e.g.,
{1,1,1,+1,1,1,+1,+1,1,+1,1,+1,+1,+1,+1} .
(5.1)
In this case, 15 distinct codes are formed by using different phase offsets of the MLSR sequence in (5.1). For K = 31 , the spreading codes are based on a 5-bit MLSR sequence, e.g.,
{+1,+1,+1,+1,+1,1,1,+1,+1,1,+1,1,1,+1,1, 1,1,1,+1,1,+1,1,+1,+1,+1,1,+1,+1,1,1,1}.
(5.2)
Similarly, the 31 distinct codes are formed by using different phase offsets of the MSLR sequence in (5.2). For both K = 15 and K = 31 , after spreading, the data rate of each of the I and Q data streams for a particular channel is 25 MHz. Optional power control levels which are adjustable from 0 10 dB are provided for each data channel after spreading. Power control can be used to compensate for signal attenuation due to shadowing as well as to provide variable quality of service (QOS) for different data channels. All of the I data streams from the K data channels are then combined to form a single I data channel, while all of the Q data streams are combined to form a single Q data channel. These I and Q signals can be expressed as, respectively,
s I [ n] = sQ [ n] = 1 2 1 2
a b
k =1 K k
Ik
g k [ n] g k [ n]
(5.3)
a b
k =1
k Qk
(5.4)
where the subscript k denotes the kth data channel, ak sets the power level, gk[n] is the spreading code, and bIk and bQk are the binary antipodal data streams at the output of the QPSK modulator.
5.2.2
Pulse Shaping
85
1.2
0.8
X(f)/T 0.6
0.4
0.2
0 20
15
10
10
15
20
frequency (MHz)
Figure 5.2: Frequency response of raised-cosine pulse ( 1 / T = 25 MHz , = 0.3 ). After all of the I data streams and all of the Q data streams are combined to form a single I channel and a single Q channel, each of these channels is shaped by a raised-cosine filter with an excess bandwidth of 30%. This pulse-shaping filter limits the double-sided bandwidth of each of the I and Q signals to 32.5 MHz. In addition, the raised-cosine filter satisfies the Nyquist condition for zero intersymbol interference (ISI) [16]. A raisedcosine spectrum is described in the time domain as
x(t ) = sin( t / T ) cos( t / T ) t / T 1 4 2t 2 / T 2
(5.5)
where is the rolloff factor or excess bandwidth. The frequency characteristic of a raised-cosine pulse is given by
T , T 1 T X ( f ) = 1 + cos f 2T 2 0,
1 2T 1+ 1 . f , 2T 2T 1+ f > 2T 0 f
(5.6)
X ( f ) is plotted for 1 / T = 25 MHz and = 0.3 in Fig. 5.2. In the proposed system, the
input I and Q data streams are oversampled by a factor of four [18] so the data rate of the outputs from the I and Q raised-cosine filters is 100 MHz [36].
86
5.2.3
Pilot Channel/Symbol
In order to facilitate receiver functions such as timing recovery, automatic gain control, and channel estimation, the base station transmits either a pilot tone or pilot symbols. In the former case the base station reserves one of the K data channels for transmitting a pilot tone, which is simply one of the K distinct signature waveforms. Although an entire channel is dedicated to transmitting the pilot tone, it is shared by all of the active users in the cell. Since the pilot tone is continuously available to the mobile receivers, this approach is advantageous when the transmission channel is changing relatively quickly. This technique was proposed originally for the K = 15 case [22]. In an alternative approach, pilot symbols are periodically inserted along with the data in each of the K data channels. In this approach, the mobile receivers perform functions such as timing recovery and channel estimation during the transmission of the pilot symbols. The information acquired during the training period is then used to recover the actual data which is transmitted during subsequent time slots. The frequency of transmission of the pilot symbols depends on how rapidly the channel changes. Clearly, this approach is advantageous when the transmission channel is changing relatively slowly since fewer pilot symbols are required. This technique was proposed for the
K = 31 case [36].
5.2.4
The achievable integration level and power consumption of a receiver are directly related to the system sensitivity and selectivity requirements, and consequently, choosing system features which allow for relaxed requirements is critical for achieving a single-chip, lowpower receiver implementation. Using a picocellular system architecture provides a large coverage area, and the sensitivity requirement is relaxed by restricting transmission distances within each cell to less than 5 m with an aggregate transmit power of 0 dBm at each base station. For hexagonal cells, the cellular reuse factor is restricted to [37]
87
1 2 1 4 1 3 2 4 1
where i and j are integers. In the case of K = 31 , there are six unique 5-bit MLSR sequences [16]. Consequently, a cellular reuse pattern with N = 4 (Fig. 5.3) can be used, with each of the four cells transmitting at the same carrier frequency but using a different MSLR code. In the case of K = 15 , there are only two unique 4-bit MLSR sequences, so unfortunately, a similar cellular reuse strategy is not possible. In this case, if a cellular architecture based on frequency reuse is employed, then the reuse factor should be small, e.g., K = 3 , in order to minimize the total system bandwidth. In both cases, cell site planning should take into account the physical characteristics of the coverage area in order to minimize the amount of interference between cells, resulting in relaxed selectivity requirements. In particular, the proposed system should take advantage of indoor partitions such as floors, ceilings, and walls in order to increase the isolation between adjacent cells. Moreover, the base stations are restricted to a total output power of 0 dBm or 1 mW. This relatively low output power is sufficient to cover a small cell with a 5-m radius while minimizing the amount of interference introduced in neighboring cells.
5.2.5
Analog Front-End
A block diagram of the analog front-end in the base-station transmitter is illustrated in Fig. 5.4 [36]. Two 8-bit 100-MHz digital-to-analog converters (DACs) convert the I and Q digital data streams from the output of the raised-cosine filters to analog signals [38].
88
DAC
LO
0 90
PA RF Filter
DAC
The DAC creates unwanted images of the desired signal at multiples of the DAC sampling frequency. The transfer function introduced by the zero-order hold operation performed by the DAC provides some attenuation of the unwanted images:
H ( j ) =
T sin 2
(5.8)
where T is the sampling period of the DAC. A subsequent second-order low-pass Butterworth filter provides additional attenuation of out-of-band spectral energy. The Butterworth response provides a good compromise between maximally flat gain and linear phase response. Translation of the baseband I and Q signals to the 2-GHz carrier frequency is based on a direct-conversion architecture [39]. The frequency translation is performed using two mixers and an LO fixed at the carrier frequency and operating in quadrature. At the outputs of the two mixers, the I and Q signals are combined before being amplified by a power amplifier (PA). Next, a band-pass filter centered at the carrier frequency further attenuates out-of-band spectral energy before the signal is transmitted through the antenna. One potential drawback of a direct-conversion transmitter architecture is LO pulling [39]. LO pulling occurs when the output signal from the PA corrupts the LO signal, either by shifting the LO frequency or by deteriorating its spectral purity. LO pulling usually occurs when the output signal from the PA is both close in frequency and comparable in size to the LO signal. Since small form factor is not as critical in the base-station
89
transmitter, physically separating the LO from the rest of the transmitter is one way of avoiding LO pulling. For example, implementing the LO on a separate chip increases the isolation between the PA and the LO circuitry and helps to prevent the PA output signal from corrupting the LO signal. For a single-chip solution, implementing the LO signal as the product of two lower frequency sinusoids is a second way of avoiding LO pulling. In this case, the PA output signal does not affect the two oscillators used to generate the LO signal which operate at frequencies much lower than the carrier frequency. Finally, the base stations are restricted to a total output power of 1 mW. Not only does this specification minimize the amount of interference introduced in neighboring cells, but it also significantly relaxes the design requirements of the PA. A linear PA can be used and low power consumption can still be achieved despite the low efficiency of linear amplifiers due to the low output power requirement of 1 mW.
5.3
Mobile Receiver
The design requirements for the mobile receiver are much more stringent than those for the base-station transmitter. The portable nature of the mobile unit imposes strict requirements on the form factor and power consumption of the receiver. The following sections describe the performance requirements and the architecture of the receiver beginning with a brief discussion of wireless propagation models.
5.3.1
Propagation Models
(5.9)
where GT is the directivity gain of the transmitting antenna, PT is the transmit power, and d is the distance between the transmitter and the receiver. If the receiving antenna has an aperture A, the received power is
PR = A S R .
However, the gain G and aperture A of all antennas are related by 90
(5.10)
A=
2 G. 4
(5.11)
(5.12)
where GR is the gain of the receiving antenna. Equation (5.12) can be rewritten as
(5.13)
where PR and PT are the receive and transmit powers in dBm, respectively, and L is the propagation loss in dB: L [dB] = 32.44 + 20 log f [GHz] + 20 log d [m] G R [dB] GT [dB] . (5.14)
For example, for transmitting and receiving antenna gains of 0 dB, the propagation loss of a 2-GHz signal over 5 m is about 52 dB in free space. In a real wireless transmission environment, the propagation loss seldom behaves as indicated by (5.12). For indoor wireless propagation, several loss models have been proposed. One model which is commonly used is similar to (5.12) but assumes that the received power is inversely proportional to d n [41]:
1 PR = G R GT PT n. 4 d
2
(5.15)
In this case the power index n is determined empirically and is typically greater than two, accounting for losses due to walls, ceilings, floors and other objects. A second model which is commonly used assumes the same path-loss model described by (5.12) but includes an additional loss factor [42], [43]:
PR = G R GT PT . 4d
2
(5.16)
In this case, the loss factor , also determined empirically, accounts for the signal attenuation due to shadowing by various objects in an indoor transmission environment.
91
A third model, which is the one used in this work, is a combination of the first two models and has demonstrated a better fit to experimental measurements [44], [45]:
G G P 1 PR = R T T n. 4 d
2
(5.17)
(5.18)
(5.19)
For the proposed system, it is assumed that G R = GT = 1 , n = 3 , and that the worst-case shadowing loss, , is 10 dB.
5.3.2
Receiver Sensitivity
The weakest signal expected to appear at the receiver input determines the sensitivity requirement of the receiver. For the case of K = 15 data channels, two different scenarios can potentially result in very weak received signals. In the first scenario, all portable units are located at the edge of the cell and experience the worst case shadowing loss of 10 dB. The total path loss is given by (5.19):
4 (2 10 9 ) 2 3 L = 10 log 5 + 10 = 69.43 dB . 8 3 10 (5.20)
In this case, the 1-mW transmit power from the base station is equally shared between the 15 data channels, so the transmit power for an individual data channel is
PT = 1 1 mW = 0.067 mW = 11.76 dBm . 15
(5.21)
Consequently, for this first scenario, the receiver sensitivity must be better than 81.2 dBm.
92
In the second scenario, all portable units are located at the edge of the cell and all but one experience the worst case shadowing loss of 10 dB. In order to compensate for the shadowing loss experienced by the 14 portable units, the base-station transmitter increases the power of each of the 14 data channels by 10 dB. Since the total transmit power is limited to 1 mW, the power allocated to the single data channel which does not experience any shadowing decreases correspondingly. For this data channel, the transmit power is ten times less than the power of each of the other 14 data channels. Consequently, the transmit power for the single data channel which does not experience any shadowing is determined by the following equation:
PT + 14 10 PT = 1 mW PT = 1 mW or 21.5 dBm. 141
(5.22)
For this data channel, the total path loss at a distance 5 m away from the base station is 59.43 dB, so the receiver sensitivity must be better than 80.9 dBm. For both scenarios described above, the receiver sensitivity requirements are virtually identical. The receiver sensitivity requirement for the case of K = 31 data channels can be determined in a similar way. When all the portable units are located at the edge of the cell and experience the worst case shadowing loss of 10 dB, the total path loss is still 69.43 dB. However, the 1-mW transmit power from the base station is now equally shared between the 31 data channels, so the transmit power for an individual data channel is
PT = 1 1 mW = 0.032 mW = 14.91 dBm . 31
(5.23)
Consequently, the receiver sensitivity must be better than 84.3 dBm. For K = 31 , when all portable units are located at the edge of the cell and all but one experience the worst case shadowing loss of 10 dB, the transmit power for the single data channel which does not experience any shadowing is determined by the following equation:
93
(5.24)
For this data channel, the total path loss at a distance 5 m away from the base station is still 59.43 dB, so the receiver sensitivity must be better than 84.2 dBm. Again, for both scenarios described above, the receiver sensitivity requirements are virtually identical.
5.3.3
Since spread-spectrum signals provide processing gain, which enhances the SNR of the received signal after data recovery, CDMA systems may appear to have an advantage over frequency-division multiple access (FDMA) or time-division multiple access (TDMA) systems. Although the SNR of the received signal is indeed enhanced by the processing gain in CDMA systems, these systems in fact do not have increased noise tolerance when compared to FDMA or TDMA systems. This section resolves this common misconception and explains the implications of processing gain on receiver noise performance. Consider the CDMA and FDMA systems depicted in Fig. 5.5. Both systems support up to K data channels and are limited to a total transmission bandwidth of fT and a total transmit power of PT. In the FDMA system, both the total transmission bandwidth and
FDMA: kth data channel
BW = fT/K
SNR = PT/(NfT)
CDMA: kth data channel P = PT/K BW = fT PSD = N BW = fT SNR = PT/(KNfT) sk SNR = PT/(NfT)
94
the total transmit power are equally divided between the K data channels, so the bandwidth and transmit power of each data channel are f T / K and PT / K , respectively. After transmission, assume that the signal is neither amplified nor attenuated but is corrupted by AWGN with a single-sided power spectral density of N. At the receiver, an ideal band-pass filter with bandwidth f T / K selects the kth data channel. The received signal power is PT / K while the received noise power is Nf T / K , and consequently, the SNR for the FDMA system is
SNR =
PT P K = T . K Nf T Nf T
(5.25)
In the CDMA system, the total transmit power is equally divided between the K data channels but since the distinct signature sequences are mutually orthogonal, each data channel can transmit over the entire system bandwidth. In this case the bandwidth and transmit power of each data channel are fT and PT / K , respectively. As in the FDMA system, assume that the transmitted CDMA signal is neither amplified nor attenuated but is corrupted by AWGN with a single-sided power spectral density of N. At the receiver, an ideal band-pass filter limits the bandwidth of the received signal to fT. At this point, the signal power of the kth data channel is PT / K while the received noise power is NfT, and consequently, the SNR of the kth data channel is
SNR =
PT P 1 = T . K Nf T KNf T
(5.26)
The kth data channel is selected by multiplying the received signal by the kth signature sequence. After data recovery, the signal power of the kth data channel is still PT / K but the signal bandwidth is reduced to f T / K . However, the noise power spectral density is still N since the noise and the kth signature sequence are uncorrelated. Consequently, in the CDMA system, the SNR of the kth data channel after data recovery is SNR = PT P K = T . K Nf T Nf T (5.27)
95
After data recovery, the SNR of the kth data channel increases by the processing gain K. However, as expected, the SNR in the CDMA system is exactly the same as the SNR in the FDMA system, and indeed, the CDMA system does not have an increased noise tolerance. Although the SNR in the CDMA system does increase by the processing gain after data recovery, this merely compensates for the lower SNR at the receiver input resulting from the larger noise bandwidth. Although the processing gain does not appear to provide the CDMA system any advantage over the FDMA system in the above example, spread-spectrum techniques do provide some advantages. In particular, CDMA signals, which are spread across the entire system bandwidth, are particularly robust against frequency-selective narrowband fading due to multipath propagation.
5.3.4
Receiver Architecture
Due to its simplicity and potential for high integration, the direct-conversion architecture (Fig. 2.7) is the most promising candidate for implementing the receiver for the proposed system [46], [47]. As discussed in Sections 2.3.1 and 2.3.2, receiver implementations based on the direct-conversion architecture must contend with dc offsets and flicker noise. Although the low-IF architecture has the advantage of avoiding problems associated with dc offsets and flicker noise, the wide bandwidth of the desired signal precludes the use of this architecture, since digitizing the IF signal would require a prohibitively fast ADC sampling frequency. However, as already discussed in Section 2.6.1, the wide bandwidth of the desired signal in the proposed system makes the direct-conversion architecture a very attractive approach since on-chip high-pass filtering can be used as a very simple and effective way of eliminating dc offsets and lowfrequency flicker noise. The following two sections will review the different techniques for mitigating the problems associated with dc offsets and flicker noise.
5.3.5
Flicker-Noise Suppression
In the direct-conversion architecture, the flicker noise introduced by the baseband circuits as well as by the mixer in some cases can corrupt the potentially weak desired signal.
96
Since the power spectral density of the input-referred flicker noise of an MOS transistor is inversely proportional to the device dimensions as given by (2.6), flicker noise can be minimized by using large transistor sizes. When circuit speed requirements limit the amount that devices sizes can be increased as well as when the flicker noise performance is not well controlled, which is the case in many CMOS processes, circuit techniques such as autozeroing and chopper stabilization can be used to suppress the flicker noise [48]. The autozero technique is typically implemented using a two-phase clock. During the first phase, the circuit with flicker noise is disconnected from the signal path and its flicker noise is sampled and stored. During the second phase, the circuit is reconnected to the signal path and the stored value of flicker noise is subtracted from the desired signal. In this technique, the previous value of the noise rather than its current value is subtracted from the desired signal, and consequently, this technique is effective only when the noise varies slowly relative to the frequency at which it is sampled. Thus, the autozero technique essentially high-pass filters this noise and is effective at eliminating lowfrequency flicker noise but not broadband thermal noise. Since this technique requires that the circuit with flicker noise be disconnected from the signal path, it may not be compatible with continuous-time applications but is well suited for sampled-data applications based on switched-capacitor circuit implementations. Chopper stabilization is a second technique which can be used to suppress flicker noise. Fig. 5.6 illustrates the chopper stabilization technique applied to an amplifier with flicker noise. In this approach, the input signal is multiplied by a periodic waveform m1(t) with frequency fchop, which translates the input signal to a higher frequency where flicker noise is negligible. The resulting signal is then amplified at this higher frequency before being translated back down to baseband. This final frequency translation is performed by
m1(t) m2(t)
Vin Amplifier
Vout
97
multiplying the amplifier output signal by a second periodic waveform m2(t) also with frequency fchop. Although both autozeroing and chopper stabilization are effective in suppressing flicker noise, both techniques also introduce additional complexity in implementing the baseband circuits. Consequently, the proposed implementation relies solely on large transistor sizes to minimize the flicker noise contribution of the baseband circuits. In addition, on-chip high-pass filtering is used to eliminate the low-frequency flicker noise introduced by the downconversion mixers.
5.3.6
DC-Offset Compensation
One very simple and effective way of eliminating dc offsets is through capacitive coupling or high-pass filtering. Since practical single-chip implementations prohibit the use of very large capacitors and resistors, this method is feasible only for systems with large signal bandwidths. However, even if the signal bandwidth is large, this method still results in some BER degradation, since capacitive coupling or high-pass filtering removes low-frequency signal energy along with dc offsets. Fig 5.7 illustrates the effects of high-pass filtering on an otherwise ideal signal constellation for the proposed system with K = 15 . For simplicity, the transmitted signal consists of a single data channel while
1
0.5
0.5
0.5
0.5
pre-correlation post-correlation
pre-correlation post-correlation
1 1
0.5
0.5
0.5
0.5
(a)
(b)
Figure 5.7: Effect of high-pass filtering. (a) 100 kHz. (b) 500 kHz.
98
1.5
1.5
0.5
0.5
0.5
0.5
1.5
0.1
0.2
0.3
0.4
0.5
0.6
time (s)
time (s)
(a)
(b)
Figure 5.8: (a) K = 15 spreading code. (b) After 500-kHz high-pass filter. data recovery is performed using a single-user correlator (Section 5.3.12). As seen from the signal constellations before data recovery, high-pass filtering results in ISI. In particular, the 500-kHz high-pass filter results in significantly more ISI than the 100-kHz filter. In both cases, however, after data recovery, the signal constellations are close to ideal. In order to gain a better understanding of the effect of high-pass filtering on the performance of the proposed system, consider the K = 15 spreading code in (5.1). For this sequence, the discrete-time autocorrelation function is
[m] = s[n]s[n + m] =
n =1
15
15, m = 0 1, m 0
(5.28)
and the average power, Pave, is [0] = 15 . Fig. 5.8 illustrates the detrimental effects of filtering this sequence with a 500-kHz high-pass filter. However, correlating this signal with the original spreading sequence results in Pave 14 , a degradation in signal power of only about 0.27 dB. Consequently, after correlation, the effect of the ISI introduced by the high-pass filter is reduced, as evident in the signal constellations illustrated in Fig. 5.7. In order to approximate the SNR degradation resulting from high-pass filtering, first consider a received signal with a single-sided bandwidth of fsig as depicted in Fig. 5.9.
99
The signal has a constant PSD of Psig / 2 f sig over this frequency range, so the total power is Psig. This signal is corrupted by AWGN with PSD N / 2 , and the resulting SNR is given by
SNR = Psig Nf sig
(5.29)
After data recover, the signal power and the noise PSD remain unchanged, but the signal bandwidth decreases by the spreading factor K. Thus the resulting SNR is
SNR = KPsig Nf sig
(5.30)
Next suppose that a first-order high-pass filter is used to remove the dc offsets from the received signal. The transfer function of such a filter is given by
H( f ) = 1 f 1 j HPF f
(5.31)
where fHPF is the corner frequency of the high-pass filter. The high-pass filter removes the low-frequency content of both the desired signal as well as the noise, and consequently, the SNR before and after high-pass filtering are identical. After data recovery, the signal power remains unchanged and is given by
f sig
PHPF =
f sig
H( f )
Psig 2 f sig
(5.32)
while the signal bandwidth decreases by the spreading factor K. In addition, the correlation process whitens the filtered noise, and after data recovery, the noise PSD is
N / 2 . The resulting SNR is
100
20
PSD
40
60
80 50
25
0 frequency (MHz)
25
50
(5.33)
L = 1
(5.34)
For f sig = 25 MHz and f HPF = 500 kHz , the SNR degradation is about 0.14 dB. From simulations, the SNR degradation in this case is approximately 0.1 dB. The calculated value is slightly pessimistic since the received signal actually does not have a constant PSD. The K = 15 sequence in (5.1) has very little spectral content near dc with an average value of about 0.067. The simulated PSD is illustrated in Fig. 5.10. Consequently, by using spreading codes with little or no spectral content near dc, the SNR degradation after high-pass filtering can be minimized. Moreover, for wideband signals, the resulting degradation in spectral efficiency due to such coding is insignificant. As discussed in Section 2.3.1, dc offsets originate from multiple sources, including LO self-mixing, even-order distortion, and systematic offsets in the baseband circuits. If dc offsets are removed immediately prior to analog-to-digital conversion, the dc offsets from earlier stages can still saturate the subsequent baseband stages prior to dc-offset removal
101
...
baseband stages
ADC
LO (fc)
(a)
RF Input (fc) DC offset: LNA RF Filter
...
baseband stages
ADC
LO (fc)
(b) Figure 5.11: DC offset removal. (a) Before ADC. (b) After mixer. (Fig. 5.11a). Alternatively, if dc offsets are removed immediately after frequency translation to baseband, the dc offsets caused by subsequent baseband stages can still be problematic (Fig. 5.11b). In the latter approach, the dc offsets introduced by the subsequent baseband stages can be minimized by using the same techniques used to reduce flicker noise, such as using large transistor dimensions, autozeroing, and chopper stabilization (Section 5.3.5). Alternatively, additional coupling capacitors or high-pass filters can also be used to remove the dc offsets in these subsequent stages. In the proposed implementation, a high-pass filter is located immediately after the mixer along each of the I and Q signal paths in order to remove dc offsets. This filter is effective in removing dc offsets caused by LO self-mixing but does not address the dc offset problem in subsequent baseband stages. The proposed implementation relies on large transistor dimensions as well as layout techniques which improve transistor matching in order to minimize the dc offsets introduced by these baseband stages. Finally, although capacitive coupling or high-pass filtering using on-chip capacitors and resistors are simple and effective ways of eliminating dc offsets for wideband systems, this technique is not feasible for narrowband systems based on the direct-conversion
102
architecture. For these systems, the narrow bandwidth of the desired signal requires very large capacitance and resistance values for removal of the dc offsets. One alternative is to use off-chip components for these passive structures. However, this approach is inconsistent with the goal of a highly-integrated implementation. An overview of some the techniques used for dc-offset cancellation in direct-conversion narrowband receivers is provided in Appendix B.
5.3.7
The required noise figure is determined from the receiver sensitivity as given by (4.6), which is repeated here for convenience:
NF [dB] = Psig [dBm] 10 log(f [Hz]) SNRout [dB] + 173.8 .
(5.35)
In order to guarantee an average BER of 104, an SNR of approximately 15 dB is required [22]. This specification assumes a multipath transmission channel with QPSK modulation. If data recovery is performed in the digital section of the receiver, the required SNR at the output of the analog section is relaxed by an amount equal to the processing gain. The processing gains for K = 15 and K = 31 are 11.76 dB and 14.91 dB, respectively. For a system bandwidth of 32.5 MHz, the noise figure must be better than 14.2 dB in both cases. The noise figure requirements for both cases are virtually identical since the larger processing gain in the K = 31 case offsets its more stringent sensitivity requirement.
5.3.8
ADC Performance
The proposed direct-conversion receiver requires two ADCs, one for each of the I and Q baseband channels. The partitioning of receiver functions between the analog and digital sections directly impacts the performance requirements of the ADCs [49]. Two possible configurations are illustrated in Fig. 5.12. The I and Q channels are identical, so for simplicity, only one channel is shown. In the architecture depicted in Fig. 5.12a, data recovery is performed in the digital section after analog-to-digital conversion. In this case, the sampling rate of the ADC must be at least 25 MHz in order to avoid destructive aliasing. In the architecture depicted in Fig. 5.12b, data recovery is performed in the 103
RF Input (fc)
LNA RF Filter
ADC
data recovery
LO (fc)
(a)
RF Input (fc)
LNA RF Filter
data recovery
ADC
LO (fc)
(b) Figure 5.12: (a) ADC before data recovery. (b) ADC after data recovery. analog section prior to analog-to-digital conversion. In this case, the Nyquist sampling rate requirement of the ADC is reduced to the symbol rate of 1.67 MHz for K = 15 or 0.81 MHz for K = 31 . Determining the resolution requirements of the ADCs is not as straightforward since these requirements depend heavily on the specific algorithms used for data recovery. For example, for CDMA systems which rely on single-user techniques (Section 5.3.12) for data recovery, the architecture depicted in Fig. 5.12a typically requires approximately four bits of resolution in the ADC [18], [50], while the architecture depicted in Fig. 5.12b requires only one bit of resolution in the ADC [49]. For this example, based on the ADC requirements alone, the architecture depicted in Fig. 5.12b appears to be the obvious choice, since the ADC has both a lower sampling rate requirement as well as a lower resolution requirement. However, a fair comparison of the two architectures must also take into account the implementation of the data recovery algorithm. For CDMA systems which rely on singleuser techniques for data recovery, the power consumption of an analog implementation of the data recovery algorithm is about the same as that of a digital implementation [49], and consequently, the architecture depicted in Fig. 5.12b still appears to be the obvious choice. Nevertheless, the architecture depicted in Fig. 5.12a may actually be more
104
attractive for several reasons. First, a digital implementation of the data recovery algorithm provides increased design flexibility. Since digital implementations can take advantage of circuit synthesis techniques, much faster design times are possible. Second, the continued scaling of CMOS technology results in significant improvements in the speed, size, and power consumption of digital circuits. In contrast, the scaling of CMOS technology has actually hindered the design of analog circuits, mainly because of decreasing supply voltages [51]. Finally, for receivers which rely on more advanced techniques for data recovery, a digital implementation may be the only feasible alternative. In order to increase system performance, receivers are beginning to incorporate more advanced algorithms for timing synchronization [19] and data detection, such as multiuser techniques [52]. While the complexity of these algorithms along with the decreasing supply voltages of CMOS processes result in very challenging analog implementations, these algorithms are actually very well suited to low-power digital implementation techniques [5]. As a result, the benefits of a digital implementation of the data recovery block may actually outweigh the disadvantages of implementing an ADC with higher speed and resolution requirements. Consequently, the proposed direct-conversion receiver is based on the configuration depicted in Fig. 5.12a. The remainder of this section focuses on the performance requirements of the I and Q ADCs in the proposed architecture. In order to avoid destructive aliasing, the sampling rate of each of the I and Q ADCs must be at least 25 MHz. Although the single-sided bandwidth of the desired signal is actually 16.25 MHz, a minimum sampling rate of 25 MHz rather than 32.5 MHz is actually sufficient to avoid aliasing, despite the forebodings of the Nyquist Sampling Theorem, which states that a signal with single-sided bandwidth fB is uniquely represented by samples taken at the Nyquist frequency, f s 2 f B [16]. In the proposed system, the bandwidth expansion results from the raised-cosine pulse-shaping filter with 30% excess bandwidth, which satisfies the Nyquist criterion for zero ISI [16]. The equivalent lowpass transmitted signal can be expressed as
y (t ) = I n x(t nT )
n =0
(5.36)
105
where {In} represents the discrete symbol sequence and x(t) is the raised-cosine pulse given by (5.5). The single-sided bandwidth of y(t) is (1 + ) / 2T . If y(t) is sampled at f s = 1 / T then the resulting sequence is
y (kT ) = I n x[(k n)T ], k = 0, 1, 2, K
n =0
= In
n =0
(5.37)
= Ik
since
I
n =0 nk
(5.38)
Consequently, when the transmitted symbol sequence is shaped by a raised-cosine pulse resulting in a single-sided bandwidth of (1 + ) / 2T , a minimum sampling rate of 1 / T rather than (1 + ) / 2T is required in order to recover the desired symbol sequence. For the proposed system, the minimum ADC sampling rate requirement is 25 MHz. However, the sampling rate may actually be greater than 25 MHz since oversampling the received signal facilitates digital timing recovery. Next, the resolution requirement of each of the I and Q ADCs is determined [22]. The mean square value of the quantization error introduced by the ADC is given by (3.32) and is repeated here:
2 1 VFS = 2R . 12 2 2 e
(5.39)
In the proposed system, each of the received baseband I and Q signals can be represented as
r[ n ] = 1
a b g 2
k =1 k k
[ n]
(5.40)
106
where a k / 2 , bk, and gk[n] are the amplitude, bit sequence, and spreading code,
respectively, of the kth data channel. The signal r[n] may be approximated as a Gaussian random variable with zero mean and variance
r2 =
1 K 2 ak . 2 k =1
(5.41)
If the signal amplitude has a Gaussian distribution, then only 0.064% of the samples have an amplitude greater than 4r [21]. Thus, by setting
VFS = 4 r 2
(5.42)
(5.39) becomes
K 64 K 2 ak 8 a k2 1 2 k =1 e2 = = k =12 R . 12 3 2 22R
(5.43)
a k2 3 2R = 2 K 16 ak2
k =1
(5.44)
2 ak
a
k =1
dB.
2 k
In order to guarantee an average BER of 104, an SNR of approximately 15 dB is required [22]. Assuming that the receiver noise is dominated by thermal noise, then the SNR due to only quantization noise must be much better than 15 dB. Since data recovery is performed in the digital section of the receiver, the required SNR at the output of the ADC is relaxed by an amount equal to the processing gain. For an SNR of 25 dB due to quantization noise alone, the ADC resolution requirement is 6 bits for both K = 15 and
K = 31 when all data channels equally share the total transmit power. If power control is
employed at the base station, then the ADC resolution requirement is more stringent due
107
to the increased dynamic range of the received signal. If the transmit power of the kth data channel is ten times less than the power of each of the other K 1 data channels, then the SNR of the kth data channel after data recovery is SNR = 7.27 + 6.02 R + 10 log 1 + 10 log K . 1 + 10( K 1) (5.45)
In this case, for an SNR of 25 dB, the ADC resolution requirement is 7 bits for both
K = 15 and K = 31 .
5.3.9
Receiver Gain
The receiver gain requirements are determined by the minimum and maximum signal levels expected to appear at the receiver input. Since the proposed system is designed to operate over short distances, out-of-band interference is assumed to negligible. The minimum gain requirement is determined by the largest in-band signal appearing at the receiver input. Assuming that the minimum separation between the transmitter and the receiver is 1 m and that no shadowing losses occur, the minimum path loss is given by
4 (2 10 9 ) 2 3 L = 10 log 1 = 38.46 dB . 8 3 10
(5.46)
For a transmit power of 0 dBm, the maximum received power is 38.46 dBm. Assuming a 1-V swing for the baseband circuits, the minimum receiver gain is Gmin = 13.01 dBm (38.46 dBm) = 51.47 dB . (5.47)
Similarly, the maximum gain requirement is determined by the smallest in-band signal appearing at the receiver input. Assuming that the maximum separation between the transmitter and the receiver is 5 m and that the maximum shadowing loss is 10 dB, the maximum path loss is 69.43 dB as given by (5.20). For a transmit power of 0 dBm, the minimum received power is 69.43 dBm. Again, assuming a 1-V swing for the baseband circuits, the maximum receiver gain is Gmax = 13.01 dBm (69.43 dBm) = 82.44 dB . (5.48)
108
Consequently, the receiver must have a dynamic range of at least 31 dB. One approach is to increase the ADC resolution in order to accommodate this dynamic range. However, doing so exacerbates the already stringent design requirements of the ADC. Alternatively, an automatic gain control (AGC) loop can be used to adjust the gain of the receiver depending on the received signal power.
109
xin(t)
xout(t)
ADC
xout(k)
xin(t)
xout(t)
ADC
xout(k)
gain control
gain control
(a)
(b)
Figure 5.13: AGC architectures. (a) Feedforward. (b) Feedback. [53]. For the proposed system, the AGC loop is partitioned between analog and digital circuits. The gain control algorithm is implemented in the digital section and then the proper gain setting is fed back to the analog VGA. Two types of AGC loops are illustrated in Fig. 5.13. The feedforward AGC generally converges faster than the feedback AGC. Moreover, feedforward loops generally do not have stability problems. However, the feedforward architecture is not very amenable to a mixed-signal implementation. The multiplier or VGA must precede the ADC in order to set the proper signal amplitude at the ADC input. Consequently, in order for the gaincontrol algorithm to be performed digitally in the feedforward AGC, an additional ADC is required at the input of the gain-control block. In the feedback AGC, only a single ADC is required. Consequently, the AGC loop for the proposed system is based on the feedback architecture. By designing the VGA to have discrete gain settings, the digital signal can control the VGA directly without the need for a DAC. Since the AGC loop for the proposed system must have a dynamic range of 31 dB, the gain can be set by activating various combinations of five amplifiers with gains of 1 dB, 2 dB, 4 dB, 8 dB, and 16 dB. In order to determine the correct gain, an estimate of the received amplitude is required. The amplitude estimate should be robust even in the presence of receiver impairments such as circuit noise, frequency offset, and distortion. For the proposed system, the estimate is based on both the I and Q data, which in the ideal case, form a four-point constellation centered and symmetric about the origin. Three methods of amplitude estimation are [54]: 1.
I 2 + Q2
(5.49)
110
2. | I | + | Q | 3. max(| I |, | Q |) + 1 min(| I |, | Q |) . 2
(5.50) (5.51)
If the receiver is noiseless and the only impairment is the frequency offset between the transmitter and receiver oscillators, then the constellation will rotate along a circle as illustrated in Fig. 3.12. The I and Q data can be represented as, respectively, I = I o cos( t ) Q = Qo sin( t ) (5.52) (5.53)
where is the frequency offset and | I o | = | Qo | for an ideal constellation. Assuming that | I o | = | Qo | = 1 , (5.49) becomes
I 2 + Q 2 = 1.
(5.54)
(5.55) (5.56)
These three estimates are plotted in Fig. 5.14. The amplitude estimate given by (5.54) is constant over time and the average value is equal to one. A low-pass filter can be used to reduce the variation in the estimates given by (5.55) and (5.56). If the low-pass filter bandwidth is sufficiently small, then the output of the low-pass filter is simply the average value of the input. The average values of (5.55) and (5.56) are, respectively,
8 1.27 2 2+ 2
(5.57)
1.09 .
(5.58)
For practical low-pass filter implementations, some variation still exists, and consequently, (5.49) provides the best amplitude estimate when the receiver is noiseless and the only impairment is frequency offset.
111
1.5
Figure 5.14: Amplitude estimates with frequency offset . When the noise power is significantly larger than the signal power, then the performance of all three estimates is comparable. In this case I and Q are given by, respectively, I = I o cos( t ) + N I N I
Q = Qo sin( t) + N Q N Q .
(5.59) (5.60)
Assuming that the noise components are both independent Gaussian random variables
2 with zero mean and variance, N , the expected value of the estimate given by (5.49) is
E[ I 2 + Q 2 ] = E[V ]
(5.61)
where V = I 2 + Q 2 is a Rayleigh random variable [55] with mean and variance given by, respectively, E[V ] =
(5.62) (5.63)
2 VAR[V ] = 2 N . 2
The expected value of the estimate given by (5.50) is E[| I | + | Q |] = 2 | x |
1 2 112
x2
2 2
dx = 2
(5.64)
xI(k)
( )2 y(k) ( )2
d(k) e(k)
g(k)
xQ(k)
Figure 5.15: Equivalent combiner for feedback AGC loop based on (5.49). while the expected value of the estimate given by (5.51) is 1 3 2 E max(| I |, | Q |) + min(| I |, | Q |) = N. 2 2 (5.65)
The gain-control algorithm can be implemented using an adaptive least mean squares (LMS) algorithm [56]. The equivalent combiner for a feedback AGC loop based on (5.49) for amplitude estimation is illustrated in Fig. 5.15. The error signal e(k) is given by
2 e(k ) = d (k ) g (k ) x I2 (k ) + xQ (k ) .
(5.66)
Adaptation using the stochastic gradient descent method results in the following update equation:
g (k + 1) = g (k ) 1 2 {e(k )} e ( k ) = g ( k ) e ( k ) g (k ) g (k ) 2
(5.67)
where is the step size. Taking the partial derivative of e(k) with respect to g(k), (5.67) becomes
2 g (k + 1) = g (k ) + e(k ) x I2 (k ) + xQ (k ) .
(5.68)
(5.69)
(5.70)
113
(5.71)
(5.72)
If the algorithm converges, then the parameter error vector update at time k + 1 must be less than that at time k. Consequently, the summed squared parameter error increment must always be negative. For a positive step size , the following relationship must be satisfied if the algorithm converges: 0< < 2 . 2 x ( k ) + xQ ( k )
2 I
(5.73)
division operation may be eliminated by using the sign-data algorithm [57] instead of the stochastic gradient descent method described above. Adaptation using the sign-data algorithm results in the following update equation:
2 g (k + 1) = g (k ) + e(k ) sgn[ x I2 (k ) + xQ (k ) ] .
(5.74)
Since
2 x I2 (k ) + xQ (k ) > 0
(5.75)
(5.74) becomes
g (k + 1) = g (k ) + e(k ) .
(5.76)
114
g (k + 1) = g (k ) +
x (k ) + x (k )
2 I 2 Q
2 e(k ) x I2 (k ) + xQ (k ) .
(5.77)
Comparing (5.77) with the update equation for the stochastic gradient descent algorithm in (5.68), the sign-data algorithm is stable if 0< < 2
2 x I2 (k ) + xQ (k )
(5.78)
The computational complexity can be further reduced by eliminating the square-root operation. In this case, the square of the signal amplitude, instead of the signal amplitude itself, is estimated: I 2 + Q2 . The equivalent combiner is illustrated in Fig. 5.16. The error signal e(k) is given by
2 e(k ) = d (k ) y (k ) = d (k ) g 2 (k )[ x I2 (k ) + xQ (k )] .
(5.79)
(5.80)
Unfortunately, the error signal is not linear in g(k), and consequently, the techniques used to analyze the stochastic gradient descent and sign-data algorithms cannot be applied in this case. An update equation which results in a simple implementation is given by
g (k + 1) = g (k ) + e(k ) .
(5.81)
The performance and stability of this algorithm is beyond the scope of this discussion and the interested reader is referred to [58] for more details. From simulations, the performance of this algorithm is very similar to that of the sign-data algorithm using
xI(k)
( )2 y(k) ( )2
d(k) e(k)
g(k)
xQ(k)
115
xI(k)
|| y(k) ||
d(k) e(k)
g(k)
xQ(k)
Figure 5.17: Equivalent combiner for feedback AGC loop based on (5.50). (5.49) for amplitude estimation. The equivalent combiner for a feedback AGC loop using (5.50) for amplitude estimation is illustrated in Fig. 5.17. The error signal e(k) is given by
e(k ) = d (k ) [| g (k ) x I (k ) | + | g (k ) xQ (k ) |] .
(5.82)
(5.83)
Adaptation using the stochastic gradient descent method results in the following update equation:
g (k + 1) = g (k ) + e(k )[| x I (k ) | + | xQ (k ) |] .
(5.84)
In this case, the computational complexity can also be reduced by using the sign-data algorithm rather than the stochastic gradient descent method. Adaptation using the signdata algorithm results in the following update equation:
g (k + 1) = g (k ) + e(k ) sgn[| x I (k ) | + | xQ (k ) |] .
(5.86)
Since
x I (k ) + xQ (k ) > 0
(5.87)
116
xI(k)
||
max
g(k)
xQ(k)
||
min
1/2
Figure 5.18: Equivalent combiner for feedback AGC loop based on (5.51). (5.86) becomes
g (k + 1) = g (k ) + e(k ) .
(5.88)
Finally, the equivalent combiner for a feedback AGC loop using (5.51) for amplitude estimation is illustrated in Fig. 5.18. The error signal e(k) is given by
e(k ) = d (k ) {max[| g (k ) x I (k ) |, | g (k ) xQ (k ) |] + 1 min[| g (k ) x I (k ) |, | g (k ) xQ (k ) |]} . 2
(5.90)
(5.91)
Adaptation using the stochastic gradient descent method results in the following update equation:
1 g (k + 1) = g (k ) + e(k )max[| x I (k ) |, | xQ (k ) |] + min[| x I (k ) |, | xQ (k ) |] . (5.92) 2
(5.93)
117
Again, the computational complexity can be reduced by using the sign-data algorithm rather than the stochastic gradient descent method. Adaptation using the sign-data algorithm results in the following update equation:
1 g (k + 1) = g (k ) + e(k ) sgn max[| x I (k ) |, | xQ (k ) |] + min[| x I (k ) |, | xQ (k ) |] . (5.94) 2
Since
max[| x I (k ) |, | xQ (k ) |] + 1 min[| x I (k ) |, | xQ (k ) |] > 0 2
(5.95)
(5.94) becomes
g (k + 1) = g (k ) + e(k ) .
(5.96)
(5.97)
The sign-data LMS algorithm based on (5.79) for amplitude estimation offers excellent performance under both high SNR and low SNR conditions. However, implementation of this algorithm requires two squaring circuits as illustrated in Fig. 5.16. In contrast, implementation of the sign-data LMS algorithm based on (5.50) is much simpler, requiring two absolute value circuits instead of two squaring circuits. Moreover, if a signmagnitude number representation is used, the absolute value operation is trivial. However, this algorithm performs poorly under high SNR conditions since amplitude estimates using (5.50) result in large variations as illustrated in Fig. 5.14. The sign-data LMS algorithm based on (5.51) for amplitude estimation provides a good compromise between good performance and ease of implementation. Implementation of this algorithm requires just two absolute value circuits and two comparators, while multiplication by 1/2 can be accomplished by a simple shift operation. Moreover, this algorithm offers good performance under high SNR conditions. The variations in the amplitude estimates using (5.51) as illustrated in Fig. 5.14 can be minimized by passing the estimates through a low-pass filter.
118
xI(t)
zI(t)
ADC
zI(k)
d(k)
max
||
g(k)
z-1
e(k)
y(k)
LPF
1/2
min
||
xQ(t)
zQ(t)
ADC
zQ(k)
Figure 5.19: AGC loop based on sign-data LMS algorithm using (5.51). An AGC loop based on the sign-data LMS algorithm using (5.51) for amplitude estimation is illustrated in Fig. 5.19. The AGC loop converges without the need for timing recovery since the amplitude estimation algorithm accounts for frequency offsets between the transmitter and receiver oscillators. The performance of this AGC loop is evaluated using Simulink. The ADCs are modeled by the simple behavioral model depicted in Fig. 4.16. Each of the ADCs samples the input signal at 25 MHz and quantizes it to 8 bits. Amplitude estimation based on (5.51) is performed after the I and Q ADCs and the digital update signal g(k) controls both the I and Q VGAs. For simplicity, each of the VGAs is modeled as a multiplier. Simulations reveal a potential problem with the AGC loop depicted in Fig. 5.19. The time constant of the sign-data LMS algorithm is inversely proportional to the step size and the amplitude estimate [57]
1 1 max[| x I (k ) |, | xQ (k ) |] + min[| x I (k ) |, | xQ (k ) |] 2
(5.98)
For a fixed step size, the time constant depends on the amplitude of the input signal, and consequently, a step size which results in fast convergence for weak input signals may be too large for strong input signals, resulting in convergence noise, or perhaps even worse, the loop may become unstable. Conversely, a step size which results in fast convergence for strong input signals may be too small for weak input signals, resulting in very slow
119
30
gain (dB)
15
10
0 0 0.05 0.1 0.15 0.2 0.25 time (ms) 0.3 0.35 0.4
Figure 5.20: Performance of AGC loop depicted in Fig. 5.19. convergence. As illustrated in Fig. 5.20, a step size of = 0.005 results in fast convergence for large input signals but very slow convergence for small input signals. This simulation takes into account all receiver impairments including a receiver noise figure of approximately 13 dB and a frequency error of 50 ppm. One way of speeding up the convergence time for all signal amplitudes is to use the normalized LMS algorithm [57]. The update equation for this algorithm is
g (k + 1) = g (k ) +
e(k )
1 max[| x I (k ) |, | xQ (k ) |] + min[| x I (k ) |, | xQ (k ) |] 2
(5.99)
(5.100)
Consequently, for a fixed step size, this algorithm results in the same convergence time for all input signal amplitudes. Unfortunately, implementation of the update equation in (5.99) requires division by the amplitude estimate.
120
xI(t)
zI(t)
ADC
zI(k)
d(k)
max
||
g(k)
z-1
e(k)
y(k)
LPF
1/2
min
||
xQ(t)
zQ(t)
ADC
zQ(k)
Figure 5.21: AGC loop based on the update equation in (5.101). Fig. 5.21 illustrates the AGC loop for the proposed system. This AGC loop is based on an adaptive algorithm which converges rapidly for all signal amplitudes. The update equation for this algorithm is
g (k + 1) = g (k ) + g (k )e(k ) .
(5.101)
(5.102)
g (k )max[| x I (k ) |, | xQ (k ) |] + min[| x I (k ) |, | xQ (k ) |]
1 2
1 . (5.103) d (k )
The time constant for this algorithm is only weakly dependent on the input signal amplitude. Although the implementation of this algorithm requires an additional multiplier, the benefit in performance is significant. As illustrated in Fig. 5.22, for the same step size of = 0.005 , the AGC loop converges much more rapidly for weak input signals without affecting the convergence performance for strong input signals.
121
30
gain (dB)
15
10
0 0 0.05 0.1 0.15 0.2 0.25 time (ms) 0.3 0.35 0.4
In the proposed system, data recovery is performed using multiuser detection [22], [52]. This section provides a brief overview of detection algorithms for CDMA systems before describing the adaptive multiuser detection algorithm used in the proposed system. The interested reader is referred to [22] for a more detailed discussion. In the proposed system, each of the baseband I and Q signals at the transmitter can be represented as
s[n] = 1 2
a b g
k =1 k k
[ n]
(5.104)
where a k / 2 , bk, and gk[n] are the amplitude, bit sequence, and spreading code,
respectively, of the kth data channel. Assuming that the signal is corrupted by AWGN with PSD N / 2 during transmission and that perfect timing synchronization and gain control are maintained at the receiver, then each of the baseband I and Q signals at the ADC outputs can be represented as
r[ m ] = 1 2
a b g
j =1 j j
[m] + n[m]
(5.105)
122
r[m]
gk[m]
yk bk
Figure 5.23: Single-user detector. The conventional single-user detector used for data recovery in CDMA systems is illustrated in Fig. 5.23. This approach is commonly used for data recovery in the mobile receivers of many CDMA systems, including the IS-95 standard for digital cellular telephony, because of its simplicity and ease of implementation. In this approach, the kth data channel is recovered by multiplying the received signal r[m] with the kth signature sequence and accumulating the result over K samples: y k = r[ m ] g k [ m ]
m =1 K
= =
1 2 1 2
a k bk g k [m]g k [m] +
m =1
a b g 2
m =1 j =1 jk j j
g
m =1 j =1
1, [ m] g k [ m ] = 0,
j=k jk
(5.107)
(5.108)
In this case, system performance is noise-limited and the single-user detector is the optimum detector. However, when the signature sequences are not orthogonal, as in the case of the proposed system, the second term in (5.106) can be significant. For the MLSR signature sequences used in the proposed system,
g
m =1 j =1
1, [ m] g k [ m ] = 1 / K ,
j=k jk
(5.109)
123
a j b j + n[m]g k [m] .
j =1 j k m =1
(5.110)
The interference from the other data channels can be significant, especially if any of the amplitudes aj are sufficiently larger than the amplitude of the desired data channel. In this case, system performance is limited by multiple-access interference (MAI) and the single-user detector is no longer the optimum detector. The decorrelating detector is the first of two linear multiuser detectors described in this section for performing data recovery in CDMA systems (Fig. 5.24). The detector output is
b = sgn( z ) = sgn( R 1 y ) . The vector y represents the output from the K correlators and is given by
y = RAb + n
(5.111)
(5.112)
where R is the cross-correlation matrix of the signature sequences with the entries of R given by
R (i, j ) =
1 K
g [ m] g
m =1 i
[ m]
(5.113)
g1[m]
y1
z1 b1
...
r[m]
R1
gK[m]
yK
zK bK
124
...
a1 1 0 A= 2 0 0
0 0 0 a2 0 0 0 O 0 0 0 aK
(5.114)
b is a vector of the binary antipodal data streams, and n is a noise vector. The detector
output is
b = sgn( R 1 y ) = sgn( R 1RAb + R 1n) = sgn( Ab + R 1n) . Consequently, the decorrelating detector eliminates MAI, and when the system performance is interference-limited, the decorrelating detector is the optimum detector. However, since R 1 (k , k ) 1 , the decorrelating detector eliminates MAI at the expense of noise enhancement, and when system performance is noise-limited, the decorrelating detector is no longer the optimum detector. The decorrelating detector depicted in Fig. 5.24 recovers all K data channels. However, in many cases, only a single data channel needs to be recovered at the mobile receiver. Fig. 5.25 illustrates an implementation of the decorrelating detector which recovers only the kth data channel. This detector is very similar to the single-user detector illustrated in Fig. 5.23. However, in this case, the received signal r[m] is multiplied by a modified signature sequence hk[m]:
hk [m] = R 1 (k , i ) g i [m] .
i =1 K
(5.115)
(5.116)
r[m]
yk bk
hk[m] = R1(k,i)gi[m]
i=1
125
g1[m] r[m]
y1
z1 b1
(R + N I)1 2
...
gK[m]
yK
zK bK
Finally, a second linear multiuser detector for performing data recovery in CDMA systems is the minimum mean-square error (MMSE) detector depicted in Fig. 5.26. In
this approach, the algorithm attempts to minimize the mean-square error between b and z. The detector output is
b = sgn( z )
1 N = sgn R + I y 2
...
(5.117)
where R is the cross-correlation matrix of the signature sequences, the vector y represents the output from the K correlators, N / 2 is the PSD of the AWGN, and I is the identity matrix. The performance of the MMSE detector approaches that of the decorrelating detector when N 0 , while its performance approaches that of the single-user detector when N . Consequently, the MMSE detector provides a good compromise between MAI suppression and noise enhancement when the system is neither interference-limited
dk r[m] ck[m] adaptive algorithm
yk
ek
126
nor noise-limited. Data recovery in the proposed system is performed by an adaptive MMSE detector which recovers only a single data channel as illustrated in Fig. 5.27. The received signal r[m] is multiplied by an adaptive sequence ck[m]. Adaptation of ck[m] in the MMSE detector is achieved through the LMS algorithm. Additional details about the adaptive MMSE detector used in the proposed system is described in [22].
5.3.13 Summary
A block diagram of the proposed receiver is illustrated in Fig. 5.28. The receiver is based on a direct-conversion architecture and dc offsets are eliminated by high-pass filtering the I and Q signals immediately after translating the RF signal down to baseband. Due to the wide bandwidth of the desired signal, the corner frequency of each of the high-pass filters can be as high as 500 kHz, and consequently, these filters can be implemented using onchip passive structures exclusively. DC offsets and flicker noise in the subsequent baseband stages are minimized by using large transistor dimensions. The carrier frequency and system bandwidth are 2 GHz and 32.5 MHz, respectively, and the required noise figure must be better than 14.2 dB in order to guarantee an average BER of 104, which corresponds to an SNR of approximately 15 dB after data recovery. The minimum
RF Input (fc) VGA ADC adaptive MMSE detector VGA ADC Q I
LNA RF Filter
VGA
0 90
LO (fc)
AGC algorithm
d(k)
max
||
g(k)
z-1
e(k)
y(k)
LPF
1/2
min
||
127
and maximum gain requirements are about 51 dB and 82 dB, respectively, and an AGC loop with a dynamic range of 31 dB is used to adjust the amplitude of the received signal. The AGC loop is based on a feedback architecture and is partitioned between analog and digital circuits. The gain control algorithm is implemented in the digital section and then the proper gain setting is fed back to the analog VGA. By designing the VGA to have discrete gain settings, the digital signal can control the VGA directly without the need for a DAC. The sampling rate and resolution of each of the I and Q ADCs must be at least 25 MHz and 7 bits, respectively, while data recovery is performed using multiuser techniques. An adaptive MMSE detector provides a good compromise between MAI suppression and noise enhancement. The receiver specifications are summarized in Table 5.1. The specifications summarized in Table. 5.1 serve as a starting point for designing the analog front-end of the receiver. However, before circuit design can begin, additional information is required. For example, the noise and gain requirements must be partitioned
K = 15 K = 31 direct conversion: high-pass filtering for dc offsets; large transistor sizes for dc offsets and flicker noise 2 GHz 32.5 MHz 81.2 dBm 84.3 dBm 11.76 dB 14.91 dB 14.2 dB P1dB > 38.46 dBm minimum: 51.47 maximum: 82.44 Nyquist rate: 25 MHz resolution: 7 bits corner frequency: < 500 kHz dynamic range: 31 dB amplitude estimate: max(| I |, | Q |) + 1 min(| I |, | Q |) 2 update equation: g (k + 1) = g (k ) + g (k )e(k ) digital implementation of AGC algorithm; analog VGA with discrete gain settings adaptive MMSE detector
receiver architecture carrier frequency system bandwidth sensitivity processing gain noise figure distortion gain ADC high-pass filter AGC loop
data recovery
128
between the various receiver blocks, such as the LNA and the mixer. In addition, all of the analog front-end impairments described in Chapter 3 can potentially degrade the performance of the MMSE multiuser detection algorithm. The next section determines the effects of these analog impairments using the system-level simulation framework described in Chapter 4.
5.4
System Simulation
The adaptive MMSE multiuser detection algorithm described in Section 5.3.12 provides a good compromise between MAI suppression and noise enhancement. However, the performance of this algorithm may be compromised by the analog impairments introduced by the receiver front-end. In addition to noise, these impairments also include receiver distortion, gain mismatch, quadrature phase mismatch, and LO phase noise. The system-level simulation framework described in Chapter 4 is used to explore the tradeoffs between these analog impairments and overall system performance. The system downlink is simulated in Simulink and a top-level schematic is illustrated in Fig. 5.29.
5.4.1
Base-Station Transmitter
The base-station transmitter consists of a digital section and an analog section. Since this research focuses primarily on the design and implementation of the receiver, the simulation does not include any transmitter impairments. The digital section of the transmitter implements the QPSK modulation, signal spreading, power control, and pulse shaping described in Sections 5.2.1 and 5.2.2. For this particular simulation, the basestation transmitter supports up to 15 channels, one of which is a pilot channel as
channel model analog
I I I Iin
receiver digital
Iout
receiver output
performance metrics
82 13.53 -10.99 -17.74
BB TX
Q Q
RF TX RF RF Transmitter
Channel
RF
RCVR
DSP
Qout
postcor.mat To File I Channel: Total Gain (dB) NF (dB) IIP2 (dBm) IIP3 (dBm)
BB Transmitter
Channel
129
described in Section 5.2.3. As described in Section 5.2.5, digital-to-analog conversion and translation of the baseband I and Q signals to the 2-GHz carrier frequency are implemented in the analog section of the transmitter. The signal appearing at the output of an ideal transmitter is
s (t ) = s I 1 (t ) cos( c t ) + sQ1 (t ) sin( c t )
(5.118)
where sI1(t) and sQ1(t) are the baseband I and Q signals and c is the carrier frequency. In order to decrease the simulation time, the simulation framework relies on basebandequivalent behavioral models for the receiver RF components. The inputs to these baseband-equivalent models are the time-varying coefficients of the equation
s (t ) = s DC (t ) + [ s In (t ) cos(n c t ) + sQn (t ) sin(n c t )] .
n =1 3
(5.119)
Consequently, the outputs of the transmitter block in the simulation are simply sI1(t) and sQ1(t), and frequency translation to the carrier frequency is unnecessary. Finally, the analog section of the transmitter also restricts the total transmit power to 1 mW or 0 dBm.
5.4.2
Channel Model
For this simulation, the channel block only models the attenuation due to free-space propagation and shadowing as described in Section 5.3.1. However, a more complex model which includes other effects such as multipath propagation can be easily incorporated into the channel block. For example, if the transmitted signal is given by (5.118), then multipath propagation results in the following signal appearing at the receiver: r (t ) = n (t )s (t n )
n
(5.120)
where n(t) and n are the attenuation factor and propagation delay, respectively, for the signal received on the nth path. A baseband-equivalent model for multipath propagation
130
2 VGA In
ON/OFF ON/OFF Sig Dout @ Fn
1 I
LPF I LPF4
AnaIn
Dout @ M*Fn
VGA Out X
Out
LPF I LPF1
HPF I HPF
LPF I LPF2
In
Clk ON/OFF
I VGA2
converter
int4
RF VGA Out 1 RF In RF VGA Low Noise (BB Equivalent) Amplifier (BB Equivalent) LNA
In
I Gain1
I VGA1
Clk
X
Sig
Out
LPF Q LPF1
HPF Q HPF
LPF Q LPF2
In
Dout @ Fn
2 Q
LPF Q LPF3
In
VGA Out
ON/OFF
LPF Q LPF4
AnaIn
Dout @ M*Fn
Q VGA1
Q VGA2
converter
int4
Figure 5.30: Receiver front-end schematic. which can be incorporated into the proposed simulation framework is derived by expressing (5.120) in the form of (4.29):
r (t ) = n (t )[cos( c n ) s I 1 (t n ) sin( c n ) sQ1 (t n )] cos( c t ) +
n
(5.121)
5.4.3
Mobile Receiver
The mobile receiver consists of an analog section and a digital section. The analog section models the front-end of the direct-conversion receiver and is illustrated in Fig. 5.30. Simulation of the high-frequency components, such as the RF amplifiers, the PLL, and the I and Q mixers, relies on the baseband-equivalent behavioral models described in Section 4.3. The simulation also models all of the amplification and filtering in the baseband portion of the analog front-end and includes a first-order high-pass filter for eliminating dc offsets. In addition, the analog section includes a structural model for the two I and Q ADCs, each of which is a 7-bit 25-MS/s converter operating at 200 MHz. The digital section of the receiver performs data recovery and includes a structural model of the adaptive multiuser detection (MUD) algorithm described in Section 5.3.12. For this simulation, the multiuser detector has a diversity order of two, providing increased robustness against fading due to multipath propagation.
5.4.4
Simulation Outputs
131
Overall system performance is determined by evaluating the I and Q outputs from the digital section of the receiver. With a small target BER of 104, determining the BER directly would require simulating a large number of data bits, resulting in very long simulation times. A much better approach is to infer the average BER from an estimate of the output SNR, which requires the simulation of much fewer data bits. Finally, the simulation also provides conventional receiver performance metrics for the analog frontend, including total gain, noise figure, input IP2, and input IP3. For this simulation, the transmitter output signal consists of ten equal-power data channels, including the pilot channel. The receiver specifications for this simulation are summarized in Table 5.2. The overall cascaded double-sideband (DSB) noise figure of the receiver is 13.5 dB. Since the noise performance of the receiver is most critical when the received signal is very weak, the transmitted signal experiences the worst-case channel attenuation of 69.43 dB and the receiver gain is set to the maximum level of 82 dB. This simulation accounts for other receiver impairments, including a gain mismatch of 4% between the I and Q signal paths, a quadrature phase mismatch of 2.5, as well as PLL phase noise. PLL phase noise is modeled by the simple behavioral model described in Section 4.4.5 and is specified to be 80 dBc/Hz at a 100-kHz offset. The overall cascaded input IP2 and input IP3 are 11.0 dBm and 17.7 dBm, respectively, while the 1-dB compression point of the receiver is estimated from the specification for center frequency noise figure (DSB) gain I/Q gain mismatch PLL phase noise I/Q phase mismatch IIP2 IIP3 P1dB HPF corner frequency ADC data recovery 2 GHz 13.5 dB 82 dB 4% 80 dBc/Hz @ 100 kHz 2.5 11.0 dBm 17.7 dBm 27.3 dBm 500 kHz 7-bit, 25-MS/s adaptive MUD with second-order diversity
132
1.5
0.5
0.5
Figure 5.31: Constellation diagrams from system-level simulation. input IP3 as given by (4.20). The simulation includes a pair of first-order, 500-kHz highpass filters for dc-offset removal as well as a pair of 7-bit, 25-MS/s modulators for analog-to-digital conversion. Fig. 5.31 illustrates the constellation diagrams for the I and Q signals at the output of the ADCs and at the output of the multiuser detector. The SNR of the I and Q data from the output of the multiuser detector is approximately 15 dB, which corresponds to an average BER of 104 for this system. Thus, the target BER is achieved despite the relaxed receiver performance specifications as indicated by the constellation diagram for the I and Q data from the output of the ADCs. Indeed, most of the receiver specifications listed in Table 5.2 can be easily achieved in a highly-integrated CMOS implementation. Although a couple of the specifications, such as the maximum gain and the ADC requirements, are not as easily achievable, by applying low-power design techniques for these receiver circuits, the proposed system is still quite amenable to a low-power singlechip solution.
5.5
Summary
133
The exponential improvements in mainstream CMOS technology are clearly facilitating the implementation of advanced digital communications algorithms. However, the potential performance improvements may not be realized if these algorithms are very sensitive to impairments introduced by the analog front-end of the receiver. A highperformance WCDMA system which is relatively insensitive to analog front-end impairments was presented in this chapter. The system is designed to be used in an indoor picocellular environment, and each base station supports as many as 15 data channels, each with a data rate of up to 3.33 Mb/s. The design of this system relied heavily on the system-level simulation environment described in Chapter 4. This simulation framework allows the designer to rapidly and efficiently evaluate the affects of analog front-end impairments on overall system performance. The proposed system relies on an adaptive MUD algorithm for data recovery and the analog front-end of the receiver is based on a very simple direct-conversion architecture. Most of the receiver specifications are not very stringent and can be easily achieved in a low-power, highly-integrated CMOS implementation.
134
Chapter 6
Receiver Prototype
6.1
Introduction
By adhering to a design strategy which tightly incorporates implementation issues at the system level, many of the analog hardware requirements are relaxed while still achieving excellent overall system performance. A power-efficient solution is achieved by taking advantage of these relaxed requirements along with low-power circuit implementation techniques. The direct-conversion receiver is integrated onto a single chip and implements all analog receiver functions except for variable gain amplification (Fig. 6.1). All circuits on this chip use a 2.5-V supply, and a fully-differential signal path is used to mitigate the coupling between different receiver components. The LNA is capacitively coupled to the RF ports of the I and Q mixers, while the frequency synthesizer connects directly to the LO ports. Along each baseband signal path, a high-pass filter is used to eliminate dc offsets, while large transistor sizes are used to minimize the flicker noise contribution of the baseband circuits. In addition, the baseband signal paths provide moderate amplification as well as low-pass filtering before digitization of the I and Q signals. This direct-conversion receiver was fabricated in a 0.25-m, single-poly, 6-metal CMOS process. The rest of this chapter describes the design and implementation of each
135
I OUT
Q OUT
Figure 6.1: Block diagram of receiver prototype. circuit block, focusing primarily on design choices which result in the most powerefficient implementation.
6.2
Low-Noise Amplifier
The LNA is one of the first components along the received signal path and its design must be considered in conjunction with the components which precede it, including the antenna and RF filter (Fig. 6.2). The antenna receives electromagnetic waves from the wireless transmission environment, and although the antenna usually has a tuned frequency response, the signal at its output consists of the desired signal as well as potentially strong out-of-band interferers. The RF filter immediately following the antenna helps to attenuate these out-of-band signals, while the subsequent LNA amplifies the received signal. In addition, the noise contribution of the LNA must be sufficiently low so as not to corrupt the potentially weak desired signal. Traditional implementations require that the impedances at each of the component interfaces be 50 and the usual explanation for this requirement is the desire for
Antenna
LNA RF Filter
136
Zs
Vs
Zl = Zs*
Figure 6.3: Conjugate impedance matching for maximum power transfer. maximum power transfer. More precisely, maximum power transfer requires conjugate impedance matching between the source and load as illustrated in Fig. 6.3, and the 50- requirement is a legacy from microwave designs using coaxial cables, where the 50- interface resistance is a compromise between the 30- resistance for maximum power handling and the 77- resistance for minimum loss [25], [59]. Integrated-circuit implementations have already abandoned this antiquated requirement, and more recently, the 50- requirement at the interface between external and on-chip components, e.g., between the external RF filter and the on-chip LNA, has also come under intense scrutiny. In order to clarify the need for a well-defined LNA input impedance, a review of microwave filter design follows.
6.2.1
The insertion loss method is a very common approach in microwave filter design, where the filter response is characterized by its insertion loss, or power loss ratio [59]:
PL = 1 1 ( )
2
(6.1)
where () is the reflection coefficient as a function of frequency . Since |()|2 is an even function of , it can be expressed as a polynomial in 2 : M ( 2 ) ( ) = . M ( 2 ) + N ( 2 )
2
(6.2)
137
Rs
Vs
Rl
Zin
Zout
(6.3)
The power loss ratio may be specified for various filter responses. For example, the power loss ratio for a Butterworth low-pass filter response is given by
PL , Butterworth = 1+ c
2
2N
(6.4)
where determines the magnitude variation in the passband, c is the passband edge, and N is the filter order. Consider the design of a second-order Butterworth low-pass filter based on a single LC section illustrated in Fig. 6.4. In this case, the filter is doubly terminated with a load resistor Rl at the output and a source resistor Rs at the input. If the 3-dB frequency is c, then the desired power loss ratio is
PL , Butterworth = 1+ c .
4
(6.5)
The input and output impedances of the filter are, respectively, Z in = j L + Z out = and the reflection coefficient is Rl 1 + j Rl C (6.6)
R s + j L 1 2 LC + j Rs C
(6.7)
138
Z in Rs . Z in + Rs
(6.8)
1 R R PL = + l + s 2 4R 4 Rl s
L2 Rl LC 1 R L2 C 2 4 2 + 2 + l . 4 R R 2 R + 4 Rs Rl C 4 Rs s l s
(6.9)
By equating (6.5) and (6.9), the following component values are required in order to achieve the desired second-order Butterworth low-pass frequency response:
Rl = Rs
L= C=
(6.10) (6.11)
2 Rs
2 Rs c
(6.12)
In this case, the load resistance must equal the source resistance in order to achieve the desired filter response, i.e., the filter will not function properly unless this condition is satisfied. A similar analysis can be applied to other filter responses. For Bessel and oddorder Chebyshev responses, the load and source resistances must also be equal, while for an even-order Chebyshev response, the load and source resistances are related but unequal [59]. In Fig. 6.2, the antenna and LNA present source and load impedances, respectively, to the RF filter. In order to design the RF filter for a particular frequency response, the source and load impedances of the antenna and LNA, respectively, must be known a priori. Since RF filters are usually designed independently from the antenna and the LNA, a standard impedance must be chosen. Commercially-available filters are typically designed assuming 50- source and load impedances, and consequently, deviating from 50 results in poor and unpredictable RF filter performance. Since the receiver prototype is intended to be used with a commercially-available RF filter, the on-chip LNA must be designed to have a 50- input impedance. Future designs will rely on both a custom RF filter and a custom antenna so that the 50- input 139
low-pass element
L
C 2 1
Figure 6.5: Transformation of a low-pass response to a band-pass response. impedance requirement of the LNA can be removed. The implications of removing this constraint are explored in Appendix C. Finally, the RF filter should actually have a band-pass response. A low-pass response can be transformed to a band-pass response by applying the transformations illustrated in Fig. 6.5.
6.2.2
The main function of the LNA is to amplify the potentially weak desired signal without corrupting it through mechanisms such as noise or distortion. Since the linearity performance of receivers is usually limited by components following the LNA, such as the mixer, the distortion performance of the LNA is usually not very stringent. However, the noise contribution of the LNA must be sufficiently low so as not to corrupt the potentially weak desired signal at its input. The noise performance of an LNA may be characterized by a couple of different metrics: noise factor and noise measure. Noise factor or noise figure is the most common metric and is a measure of how much the LNA degrades the SNR of the received signal. Noise factor is defined as
SNRin SNRout
(6.13)
140
where SNRin and SNRout are the SNRs at the input and output, respectively, of the LNA. A related metric, noise figure, is simply the noise factor expressed in decibels, 10log(F). One potential drawback of this metric is that it does not account for amplification. For example, an ideal wire is obviously not a very good LNA since it does not provide any amplification, although it does have an excellent noise figure of zero. Consequently, specifying the noise factor or noise figure of an LNA is meaningless without also specifying its gain. A metric less commonly used to characterize the noise performance of an LNA is noise measure. Noise measure accounts for both the noise and gain of the LNA and is defined as [60]
M F 1 1 1/ G
(6.14)
where F and G are the noise factor and power gain, respectively, of the LNA. In the case of an ideal wire, the noise measure is infinite and is consistent with our notion that a wire in not a very good LNA. The power gains of practical LNA topologies are sufficiently large, e.g., G > 10 , so that noise measure and noise factor become equivalent metrics. Consequently, only noise factor is evaluated for the LNA topologies described later in Section 6.2.5.
6.2.3
Regardless of which metric is preferable, the key design goal is to minimize the noise figure while maximizing the gain of the LNA. Since the noise performance of the LNA is so critical, an accurate transistor noise model is essential. In particular, the measured thermal noise in short-channel MOS devices is greater than the amount predicted by long-channel theory [61][63]. This section begins with a review of the long-channel MOS noise model followed by a discussion of some recently proposed noise models for short-channel MOS devices.
Long-Channel MOS Noise Model. Since the channel material of an MOS device is
resistive, the drain current exhibits thermal noise. For an MOS transistor operating in
141
D ro
Figure 6.6: MOS small-signal equivalent circuit with noise generators. strong inversion, the small-signal equivalent circuit with noise generators is illustrated in Fig. 6.6. According to long-channel theory, the power spectral density of the drain current noise may be expressed as [10]
2 id 2 = 4kT g m S ID ( f ) = 3 f
(6.15)
where gm is the device transconductance. Equation (6.15) assumes that the device is operating at frequencies well above the flicker noise corner frequency so that flicker noise may be neglected. This model significantly underestimates the actual noise present in short-channel MOS devices. Recently, more accurate noise models have been proposed for short-channel MOS devices. Two proposed mechanisms resulting in the observed excess thermal noise include high-field effects [65] and induced gate current noise [64], [65].
MOS Noise Model including High-Field Effects. The first modification to the traditional
MOS noise model is an increased drain current noise resulting from high-field effects in short-channel devices. In this case, the power spectral density of the drain current noise is expressed instead as
S I D = 4kT g do
(6.16)
where is a bias-dependent parameter used to account for the increased drain current noise and gdo is the zero-bias drain conductance of the device. For long-channel devices in strong inversion, is equal to 2/3 and gdo is equal to the device transconductance, gm,
g do = g m = C ox W (VGS Vt ) L
(6.17)
142
so (6.16) reduces to (6.15). For short-channel devices, may be as high as two to three and may be attributed to hot electron effects [65]. Under high electric fields, the temperature of electrons in the channel can rise above that of the lattice, resulting in an increase in the drain current noise. In this case, the power spectral density of the drain current noise is given by [65]
S ID 4kT d T = 2 e g 2 (V )d V L ID 0 T
V
(6.18)
where g (V ) is the channel conductance at a given point along the channel, V is the corresponding voltage, and Te and T are the electron and lattice temperatures, respectively. Equating (6.16) and (6.18) results in the following expression for :
1 = g do L2 I D
Vd
T
0
Te
g 2 (V )d V .
(6.19)
(6.20)
I D = g (V ) E ( y ) = g (V )
dV . dy
(6.21)
Equating (6.20) and (6.21) results in the following expression for g(V):
ID . E sat
(6.22)
Next, in order to evaluate the integral in (6.19), an expression for Te/T is also required. Unfortunately, the exact dependence of electron temperature on electric field strength is unknown. For this calculation, it is assumed that
143
Te E ( y ) = 1 + E sat T
ID = 1 + . g (V ) E sat
(6.23)
2 eff
2 C oxW 2 (VGS Vt V ) 2 dV
C W 2VD
g do L I D
(6.24)
I D = I Dsat
(VGS Vt ) 2 1 = eff C oxWE sat 2 (VGS Vt ) + E sat L (VGS Vt ) E sat L . (VGS Vt ) + E sat L
(6.25)
VD = VDsat =
(6.26)
g do
(VGS
(6.27)
VD = 0
W (VGS Vt ). L
Substituting (6.25), (6.26), and (6.27) into (6.24) results in the following expression for
: =
1 2 ( E sat L) 2 + 2(VGS Vt ) E sat L + 2(VGS Vt ) 2 . 2 Vt ) + E sat L] 3 (6.28)
[(VGS
For long-channel devices, EsatL is much larger than VGS Vt, and (6.28) reduces to
= 2/3. However, in the limit that EsatL is much smaller than VGS Vt, (6.28) reduces to = 2.
144
Figure 6.7: Distributed gate capacitance and channel resistance at high frequencies.
Induced Gate Current Noise. At high frequencies, induced gate noise becomes
significant, which arises from the distributed nature of the device as illustrated in Fig. 6.7. In this case, the gate admittance consists of an additional conductive component [65]:
Yg = j C gs + g g
(6.29)
(6.30)
gg =
2 2 C gs
5 g do
(6.31)
Since gg is a physical resistance, it has an associated noise current with power spectral density given by
S IG =
2 ig
= 4kT g g
(6.32)
where is 4/3 for long-channel devices. For short-channel devices where high-field effects may be significant, the power spectral density of the gate current noise is given by [65]
S IG =
2 4kT 2CoxW 2 3 ID VD
T
0
Te
g 2 (V )(Va V ) 2 dV
(6.33)
where
(VGS Vt ) VGS
2 VDsat V Dsat 2 6 V Vt Dsat 2
Va = VDsat +
V 2 E sat L
2 Dsat
V 1 + Dsat . E sat L
(6.34)
145
T
0
Te
g 2 (V )(Va V ) 2 dV .
(6.35)
[(VGS
(6.36)
For long-channel devices, EsatL is much larger than VGS Vt, and (6.36) reduces to
= 4/3, as expected. However, in the limit that EsatL is much smaller than VGS Vt, (6.36)
reduces to = 15/2, which is more than five times larger than the long-channel limit. Consequently, for short-channel devices operating at very high frequencies, induced gate current noise can be quite detrimental to low noise performance. Finally, since the induced gate current noise originates from the distributed nature of the gate capacitance and the channel resistance, it is partially correlated with the drain current noise with a correlation coefficient given by
c=
* i g id
i i i i
* g g
(6.37)
* d d
where
* id i d = 4kT g do f 2 2 C gs
(6.38)
i i = 4kT
* g g
5 g do
(6.39) (6.40)
* i g id = 4kT j C gs f .
(6.41)
146
where and are given by (6.19) and (6.35), respectively. Equating (6.40) with
VD
(6.42)
T
0
Te
g 2 (V )(Va V )dV .
(6.43)
[(VGS
(6.44)
For long-channel devices, EsatL is much larger than VGS Vt, and (6.44) reduces to
(6.45)
where the equality holds for long-channel devices. For an MOS transistor operating in strong inversion, the revised small-signal equivalent circuit with noise generators is illustrated in Fig. 6.8. When using this model for noise calculations, the correlation between the gate current noise and the drain current noise must be taken into account.
6.2.4
Minimizing the noise figure is one of the key design goals when designing an LNA. For a
Cgd G
2 ig
D ro
Figure 6.8: Revised MOS small-signal equivalent circuit with noise generators.
147
+ i
2 s
Ys
2 g
gg
Cgs
vgs
gmvgs
2 id
Ys = Gs + jBs
(a)
(b)
The current noise components appearing at the drain due to is, ig, and id are given by, respectively, ios =
iog =
g m is g g + G s + j ( C gs + Bs )
g mig g g + G s + j ( C gs + Bs )
(6.47)
(6.48) (6.49)
F = 1+
(6.50)
where for two zero-mean random variables, A and B, the sum of the two random variables has variance VAR[ A + B] = VAR[ A] + VAR[ B] + E [ AB * ] + E [ A* B] . (6.51)
148
This formulation is necessary because of the correlation between the gate current noise and the drain current noise. Consequently, the noise factor is F = 1+ where
2 2 iod = id 2 2 g m ig 2 2 * * iog + iod + iog iod + iog iod 2 ios
(6.52)
(6.53) (6.54)
2 og
( g g + G s ) 2 + ( C gs + Bs ) 2
2 g m i s2 i = ( g g + G s ) 2 + ( C gs + Bs ) 2 2 os * g m i g id
(6.55)
i i
* og od
g g + G s + j ( C gs + Bs )
* g m i g id
(6.56)
i i From (6.37),
* og od
g g + G s j ( C gs + Bs )
(6.57)
* * * * * * i g id = i g id = c i g i g i d i d = j | c | i g i g id id .
( )
(6.58)
5 g m Gs
g m Gs
[( g g + G s ) 2 + ( C gs + Bs ) 2 ] + 2|c|
C gs
5 g m Gs
(6.59)
( C gs + Bs )
where is defined as
(6.60)
149
2 2 G s = G sopt = g g + 2 2 C gs
(1 | c | 2 ) 5
(6.61)
Bs = Bsopt = C gs 1 + | c | 5
(6.62)
Fmin where
= 1+ 2 5 T
(1 | c |) + 5
2
2 + 5 T
(6.63)
T =
For << T , (6.63) becomes Fmin = 1 + 2
gm . C gs
(6.64)
(1 | c | 2 )
= 1 + 2.32 T
where the latter equality assumes that = 2, = 4, and |c| = 0.395.
(6.65)
Minimum noise figure for the common-source transistor amplifier is achieved with an optimum source conductance and susceptance given by (6.61) and (6.62), respectively, and consequently, for minimum noise figure, the RF filter which precedes this amplifier should have an output admittance equal to the required optimum noise admittance. At the same time, maximum power transfer from the RF filter to the amplifier requires a conjugate match between the output admittance of the RF filter and the input admittance of the amplifier, which is given by
Yin = g g + j C gs .
(6.66)
* A simultaneous noise and power match requires that Ys = Yin , and comparing (6.61) and
(6.62) with (6.66) reveals that such a match is impossible for this amplifier topology. For
150
a power match alone, the required source admittance is Ys = g g j C gs and the corresponding noise factor is
4 F = 1+ + 5 T 5 or 6.99 dB
2
(6.67)
where the latter inequality assumes that << T and = 4. Under this condition of maximum power transfer, the noise performance of the common-source transistor amplifier is rather poor.
6.2.5
LNA Topologies
Since the receiver prototype is intended to be used with a commercially-available RF filter, the on-chip LNA must be designed to have a 50- input impedance. In this section, several potential LNA topologies are analyzed, including the common-source, commongate, common-source with inductive degeneration, and local shunt feedback topologies.
Common-Source LNA. In order to achieve a 50- input resistance, a slight modification
is made to the common-source LNA already analyzed in Section 6.2.4. A shunt inductor is added to the input of the LNA in order to tune out the gate capacitance of the transistor (Fig. 6.10), where
Bs = 1 Ls
(6.68)
Rl Gs jBs
151
2 =
1 . Ls C gs
(6.69)
Under this condition, the input conductance of the LNA is Yin = g g and a 50- input impedance is achieved by setting Yin = g g = 1 /(50 ) . Also, the output conductance of the RF filter is Gs = 1 /(50 ) and the corresponding noise factor is given by (6.67). Alternatively, a broadband input match to 50 may be achieved by eliminating the shunt inductor at the LNA input and selecting
g g >> C gs
>>
5 T
(6.70)
1+ +
20
67.5 or 18.3 dB
where the latter inequality assumes that = 0.8, = 4, and = 2. Consequently, a broadband input match is achieved at the expense of very poor noise performance. Finally, the voltage gain of this LNA is Av = g m Rl where Rl is the load resistance at the output of the LNA. (6.72)
(6.71)
Gs
jBs
Rl
2 is
Ys
2 ig
gg
Cgs
vsg
gmvsg
2 id
(a)
(b)
152
Common-Gate LNA. A second topology which may be used to achieve a 50- input
resistance is the common-gate LNA illustrated in Fig. 6.11a. From the small-signal equivalent circuit with noise generators illustrated in Fig. 6.11b, the noise factor is
F = 1+
2 2 C gs
5 g m Gs
g m Gs
[( g g + G s ) 2 + ( C gs + Bs ) 2 ] + 2|c|
C gs
5 g m Gs
(6.73)
( C gs + Bs ) 2
which is identical to the noise factor of the common-source LNA in (6.59). Consequently, the minimum noise factor for the common-gate LNA is also given by (6.65) and the corresponding source conductance and susceptance are given by (6.61) and (6.62), respectively. In order to achieve a 50- LNA input resistance, a shunt inductor is added at the input of the LNA in order to tune out the gate capacitance of the transistor (Fig. 6.11a). Under this condition, the input conductance of the LNA is Yin = g g + g m and a 50- input resistance is achieved by setting Yin = g g + g m = 1 /(50 ) . Also, the output conductance of the RF filter is Gs = 1 /(50 ) and the corresponding noise factor is given by
F = 1+
+ 5
1+
= 3.5 or 5.44 dB
1+ 5 T
4 + 5
(6.74)
where the latter inequality assumes that << T , = 2, and = 0.8. Under this condition, the noise performance of the common-gate transistor amplifier is rather poor. Alternatively, a broadband input match to 50 may be achieved by eliminating the shunt inductor at the LNA input and selecting
C gs << g m << T .
153
(6.75)
Under this condition, gg is also much less than gm, and consequently, the input admittance reduces to Yin = gm and the noise factor is
4 F = 1+ + 5
1+
= 3.5 or 5.44 dB
(6.76)
where the latter inequality assumes that << T , = 2, and = 0.8. Finally, the voltage gain of the common-gate LNA is Av = g m Rl (6.77)
where Rl is the load resistance at the output of the LNA. When the LNA input resistance is matched to 50 , the voltage gain is Av = g m Rl = 0.02 Rl .
Common-Source LNA with Inductive Degeneration. A third topology which may be
used to achieve a 50- input resistance is the common-source LNA with inductive degeneration illustrated in Fig. 6.12a [11], [64], while the corresponding small-signal equivalent circuit illustrated in Fig. 6.12b. In order to simplify the calculations, gg is ignored. This approximation is valid when g g << C gs
2 2 C gs
5 g do
<< C gs
5 T
(6.78)
<<
where is defined in (6.60). Indeed this condition is easily met in most designs where the devices are designed to operate at frequencies much less than T. The input impedance of this LNA is Z in = 1 2 C gs ( L g + Ls ) g m Ls . j C gs C gs (6.79)
154
Lg
Lg Rl
i
2 s
+ Ys i
2 g
gg Cgs
vgs
gmvgs
2 id
Ls Ys = Gs + jBs
Ls
(a)
(b)
Figure 6.12: (a) Common-source LNA with inductive degeneration. (b) Small-signal equivalent circuit. A 50- input resistance is achieved by equating the real part of Zin to 50 and then selecting the values of inductors Lg and Ls to tune out the imaginary part of Zin: g m Ls = 50 C gs L g + Ls = 1 . C gs
2
(6.80)
(6.81)
5 g m Gs
{[1 Bs ( L g + Ls )]2 + 2 G s2 ( Lg + Ls ) 2 } +
g m Gs
2|c|
{G s2 [1 2 C gs ( L g + Ls )] 2 + [ C gs + Bs [1 2 C gs ( L g + Ls )]] 2 }+
(6.82)
C gs
5 g m Gs
( Lg + Ls )G s2 [1 2 C gs ( L g + Ls )]}.
Substituting (6.81) into (6.82) results in the following expression for noise factor:
2 2 2 2 2 C gs G s2 Bs C gs + 2 2 + 1 F = 1+ + 5 g m G s C gs C gs g m G s
(6.83)
2|c|
C 1 B s . 5 g m G s C gs
2 2 gs
155
C gs 5 (1 | c | 2 )
(6.84)
| c | 5 Bs = Bsopt = C gs 1 +
(6.85)
(1 | c |) 2
(6.86)
which is identical to the minimum noise factor given in (6.65) for the common-source and common-gate topologies. As in those two cases, a simultaneous noise and power match is impossible for the common-source LNA with inductive degeneration. When the source admittance is purely real, Bs = 0 and the corresponding noise factor is F = 1+
2 2 C gs 2 2 2 2 G 2 C gs C gs +2|c| . 1+ 2 s 2 + 5 g m Gs C gs g m G s
5 g m Gs
(6.87)
In this case, the noise factor is minimum when G s = G sopt = C gs 1 + and the corresponding noise factor is Fmin = 1 + 2 5
2 | c | 5
(6.88)
+ +2|c| 5 5 5 T T
(6.89)
= 1 + 3.26
where the latter equality assumes that = 0.8, = 4, = 2, and c = j0.395. The above approach identifies the optimum source conductance for a MOS device with a fixed geometry at a particular bias point. However, for integrated-circuit
implementations, the device geometry is actually a design variable, while the source
156
conductance is fixed. Thus, a more appropriate design goal is to determine the optimum device geometry for a particular bias current and a fixed source conductance, e.g., 50 . For short-channel devices, the drain current may be expressed as a function of transconductance and gate capacitance:
ID = y x x x x y 2
(6.90)
(6.91) (6.92)
y = 2 g m E sat L .
Solving for gm in (6.90) and substituting the result into (6.87) results in an expression for noise factor which depends only on Cgs, Gs, and ID. The optimum value of Cgs can be determined by differentiating the noise factor with respect to Cgs, and the corresponding device width is given by W= 3C gs 2 LC ox . (6.93)
Unfortunately, for short-channel devices, the relationship between ID, gm, and Cgs is rather complicated, and the resulting equations are too complex to provide any insight into the design process. Although the long-channel equations are invalid, the results derived from these equations can still provide some rough design guidelines. For long-channel devices, the transconductance is given by g m = 2 I D eff C ox W L .
3I D eff C gs L
(6.94)
Substituting (6.94) into (6.87) results in an expression for noise factor that is minimum when
157
C gs = C gsopt = Gs
5 3 + +2|c| 5 5
(6.95)
Fmin
Gs 4 = 1+ L 3 eff I D
4 1
4 . + +2|c| 5 5 3 5
(6.96)
Equation (6.96) is plotted as a function of ID in Fig. 6.13 along with the result derived from the short-channel equations. The noise performance predicted by (6.96) agrees well with the short-channel result and is only slightly optimistic at high bias currents. Once Cgs and gm are determined, the remaining design equations are: Ls = Lg = C gs g m Gs
2
(6.97)
1 Ls . C gs
(6.98)
NFmin (dB)
0 105
104
103
102
101
ID (A)
Figure 6.13: NFmin versus ID based on short-channel and long-channel equations for the inductively-degenerated LNA topology. ( = 0.8, = 4, = 2, c = j0.395, Gs = 1/(50 ), = 2 2109 rad/s, L = Leff = 0.18 m (Ldrawn = 0.25 m), eff = 400 cm2/Vs, Esat = 5104 V/cm.)
158
Rf
Rf
Rl
+
2 is
Ys
2 ig
gg
Cgs
vgs
gmvgs
2 id
Gs
jBs
(a)
(b)
Figure 6.14: (a) Local shunt feedback LNA. (b) Small-signal equivalent circuit.
(6.99)
input resistance is based on a single transistor with local shunt feedback as illustrated in Fig. 6.14a [67]. The input impedance of this LNA is
Z in = R f + Rl 1 + g m Rl + ( R f + Rl )( g g + j C gs )
(6.100)
A narrowband input match may be achieved by adding a shunt inductor at the input of the LNA in order to tune out the gate capacitance of the transistor. In this case, the input impedance is
Z in = R f + Rl 1 + g m Rl + g g ( R f + Rl )
(6.101)
Alternatively, a broadband input match may be achieved by eliminating the shunt inductor at the LNA input and selecting
C gs ( R f + Rl ) << g m Rl << T
Rl . R f + Rl (6.102)
159
Under this condition, g g ( R f + Rl ) is also much less than g m Rl , and consequently, the input impedance reduces to Z in = R f + Rl 1 + g m Rl . (6.103)
In the case of a broadband input match, both gg and Cgs may be ignored, and the noise factor is
F = 1+
2 2 C gs
5 g m Gs
g m [(1 + R f G s ) 2 + ( R f Bs ) 2 ] G s (1 g m R f ) 2
2
+ (6.104) .
R f [( g m + G s ) 2 + Bs2 ] G s (1 g m R f )
2|c|
C gs R f Bs
5 G s (1 g m R f )
(6.105)
Bs = Bsopt = | c |
C gs 5
(6.106)
(1 | c | 2 )
(6.107)
which is identical to the minimum noise factor for the other LNA topologies. As in those cases, a simultaneous noise and power match is also impossible for this LNA. When the source admittance is purely real, the noise factor becomes
F = 1+
2 2 C gs
5 g m Gs
g m (1 + R f G s ) 2 R f ( g m + G s ) 2 . + + G s (1 g m R f ) 2 G s (1 g m R f ) 2
(6.108)
160
G s = G sopt =
2 2 5 g m ( + g m R f ) + 2 2 C gs (1 g m R f ) 2
5 g m R f ( + g m R f )
(6.109)
(1 g m R f ) 2
2 2 2 g m R f ( + g m R f )[5 g m ( + g m R f ) + 2 2 C gs (1 g m R f ) 2 ] . 5 g m (1 g m R f ) 2
(6.110)
By assuming g m R f >> 1 , (6.108) (6.110) may be further simplified. In this case, the noise factor becomes F = 1+
2 2 C gs
5 g m Gs
Gs 1 + . g m R f Gs
(6.111)
The minimum noise factor is achieved when G s = G sopt and the corresponding noise factor is Fmin = 1 + 2
2 2 2 g m C gs = + Rf 5
(6.112)
5 T = 1 + 2.53 T
(6.113)
where the latter equality assumes that = 4 and = 2. The above approach identifies the optimum source conductance for a MOS device with a fixed geometry at a particular bias point. However, for integrated-circuit
implementations, the device geometry is actually a design variable, while the source conductance is fixed. Thus, a more appropriate design goal is to determine the optimum device geometry for a particular bias current and a fixed source conductance, e.g., 50 . In this case, using the long-channel expression for gm from (6.94) still results in very
161
12
10
NFmin (dB)
0 105
104
103
102
101
ID (A)
Figure 6.15: NFmin versus ID for the LNA topology with local shunt feedback. ( = 0.8, = 4, = 2, c = j0.395, Gs = 1/(50 ), = 2 2109 rad/s, L = Leff = 0.18 m (Ldrawn = 0.25 m), eff = 400 cm2/Vs, Esat = 5104 V/cm, Rl = 350 .) complicated expressions for Cgsopt and the corresponding Fmin, which are not reported here. However, the minimum noise figure based on the short-channel equations is plotted in Fig. 6.15. This result is derived by substituting expressions for gm and Rf into (6.108). An expression for gm is derived from (6.90), while an expression for Rf is derived by setting the source conductance equal to the LNA input conductance in (6.103): Gs = 1 + g m Rl . R f + Rl (6.114)
Finally, in the case of a broadband input match, the voltage gain of this LNA is Av = (1 g m R f ) Rl R f + Rl (6.115)
where Rl is the load resistance at the output of the LNA. For g m R f >> 1 , the voltage gain becomes
Av = g m ( R f || Rl ) .
(6.116)
Summary of LNA Performance. The performance of the four LNA topologies are
summarized in Table 6.1. When the device geometry and bias current are fixed, all four
162
common-source
=1 + 2
common-gate
=1 + 2
(1 | c |2 )
Fmin
(1 | c |2 )
(1 | c |2 )
(1 | c |2 )
= 1 + 2.32
= 1 + 2.32
= 1 + 2.32
= 1 + 2.32
2 2 (1 | c |2 ) 2C gs 5
2 2 (1 | c |2 ) 2C gs 5
C gs 5 (1 | c | 2 )
C gs 1 +
2 2 2 2 g m (1 | c | ) C gs + Rf 5
C gs 1 + | c | 1 gg 1+
C gs 1 + | c | 1 g g + gm
1 +
5 or 6.99 dB
3.5 or 5.44 dB
1 or 0 dB
Rl
=j
Av
g m Rl
g m Rl
=j
Table 6.1: Summary of LNA topologies ( = 0.8, = 4, = 2, c = j0.395). LNA topologies have the same minimum noise factor, although the optimum source admittance varies. In all four cases, an impedance match which simultaneously provides minimum noise factor and maximum power transfer is impossible. When the source impedance is purely real and fixed at 50 , a power match results in minimum achievable noise figures of 6.99 dB and 5.44 dB for the common-source and common-gate topologies, respectively. On the other hand, the minimum noise figures of the common-source LNA with inductive degeneration and the LNA with local shunt feedback both have an asymptotic limit of 0 dB. Consequently, for applications with very stringent noise requirements, the common-source topology with inductive degeneration and the topology with local shunt feedback are the best candidates for low-noise amplification. However, for applications with relaxed noise figure requirements, all four topologies are viable options. In this case, the selection should be based on other criteria, such as feasibility of integration or power consumption, which will be discussed next.
163
| c | 5
C gs | c | R f + Rl 1 + g m Rl
1 or 0 dB
g m Ls C gs
Ls
G s g m Rl C gs
g m ( R f || Rl )
Integration and Power Consumption Issues. The topologies which rely on inductors for
narrowband input matching are less feasible for integration. These topologies include the common-source LNA and the common-source LNA with inductive degeneration. Although the common-source topology may also be designed for a broadband input match, the noise factor in this case is in excess of 18.3 dB, which is intolerable even for applications with very relaxed noise requirements. For topologies with a narrowband input match, an inductor is added to the input of the LNA in order to tune out the gate capacitance of the transistor. Although an ideal inductor is noiseless, practical realizations introduce noise due to the series resistance of the inductor. Since this inductor appears at the input of the LNA, excessive noise from the inductor series resistance may not be tolerable. Many standard digital CMOS technologies rely on lowresistivity silicon substrates, and the quality factors of on-chip spiral inductors implemented on these substrates tend to be rather poor, resulting in increased noise. Consequently, off-chip inductors must be used for these topologies. Nevertheless, small form factor is still achievable for implementations based on these topologies by implementing the off-chip inductors as bond wires or by incorporating them into the package. For applications with relaxed noise figure requirements, a second criteria for topology selection is power consumption. For the common source topology with a narrowband input match to 50 , the noise figure is given by (6.67) and gg = Gs = 1/(50 ). The corresponding expressions for gm and Cgs are, respectively,
gm = C gs = 4 G s (F 1 ) 4 . 5( F 1 )
(6.117)
5G s
(6.118)
In this case, the drain current is given by (6.90), where gm and Cgs are given by (6.117) and (6.118), respectively. The drain current is plotted as a function of noise figure in Fig. 6.16a assuming = 0.8, = 4, = 2, c = j0.395, Gs = 1/(50 ), = 2 2109 rad/s, L = Leff = 0.18 m (Ldrawn = 0.25 m), eff = 400 cm2/Vs, and Esat = 5104 V/cm. Also, the
164
101
102
common-source
103
ID (A)
104
common-gate
106
10
NF (dB)
(a)
60
50 common-source 40
gain (dB)
10
0 0 1 2 3 4 5 6 7 8 9 10
NF (dB)
(b) Figure 6.16: (a) Drain current versus noise figure. (b) Gain versus noise figure. ( = 0.8, = 4, = 2, c = j0.395, Gs = 1/(50 ), = 2 2109 rad/s, L = Leff = 0.18 m (Ldrawn = 0.25 m), eff = 400 cm2/Vs, Esat = 5104 V/cm, Rl = 350 .) gain is plotted versus noise figure in Fig. 6.16b with the additional assumption that Rl = 350 . For the common-gate topology with a broadband input match to 50 , the noise figure is given by (6.76) and gm = Gs = 1/(50 ). The corresponding gate capacitance is C gs = Gs 5 F 1 . 165 (6.119)
The drain current and gain are plotted as a functions of noise figure in Fig. 6.16. When the inductively-degenerated common-source topology is matched to 50 , the expression for the minimum noise factor and the corresponding expressions for gm and Cgs are rather complicated as already discussed in Section 6.2.5. Although the results are not explicitly reported here, the drain current and gain are plotted versus noise figure in Fig. 6.16. Finally, when the topology with local shunt feedback is matched to 50 , the expression for the minimum noise factor and the corresponding expressions for gm and Cgs are also rather complicated. Although the results are not explicitly reported here, the drain current and gain are plotted versus noise figure in Fig. 6.16. The inductively-degenerated common-source LNA provides the best overall
performance. For the same noise figure performance, the common-source LNA with inductive degeneration requires the least amount of current, and consequently, is the lowest power solution. Also, its gain performance is comparable to that of the LNA with local shunt feedback. For relaxed noise figure requirements, the common-source topology provides the highest gain but also consumes significantly more power than the other LNA topologies for the same noise performance. For the receiver prototype described here, low power consumption is the most important design consideration. The common-source LNA with inductive degeneration is a good candidate since it has the lowest power consumption while providing adequate gain. The drain current and gain are plotted as a functions of noise figure in Fig. 6.16, while Cgs, gm, Ls, and Lg are plotted versus noise figure in Fig. 6.17. In summary, the following guidelines facilitate the design of the inductively-degenerated common-source LNA for a particular noise figure requirement: 1. For a given noise figure requirement, ID is plotted in Fig. 6.16. 2. For this noise figure requirement, Cgs is plotted in Fig. 6.17a, and the corresponding device width is given by W = 3C gs /(2 LC ox ) . 3. The corresponding device transconductance is determined from (6.90).
166
4. The values of Ls and Lg are then determined by (6.97) and (6.98), respectively.
6.2.6
The LNA for this prototype is implemented using an inductively-degenerated differential amplifier topology illustrated in Fig. 6.18 [11]. Since the LNA is integrated onto the same chip along with other receiver components, noise introduced by other circuits can couple to the LNA through the supply or the substrate. The common-source LNA with inductive degeneration described in Section 6.2.5 is implemented in a differential configuration in order to improve its common-mode rejection, resulting in increased robustness against
500
0.30
475
0.25
450
0.20
Cgs (fF)
gm (
425
0.15
400
0.10
375
0.05
350 0 1 2 3 4 5 6 7 8 9 10
0 0 1 2 3 4 5 6 7 8 9 10
NFmin (dB)
NFmin (dB)
(a)
17 16 15 14 13
(b)
10 9 8 7 6
Lg (nH)
12 11 10 9 8 7 0 1 2 3 4 5 6 7 8 9 10
Ls (nH)
5 4 3 2 1 0
10
NFmin (dB)
NFmin (dB)
(c)
(d)
Figure 6.17: Designing for minimum noise figure in the common-source LNA with inductive degeneration. (a) Cgs. (b) gm. (c) Lg. (d) Ls. ( = 0.8, = 4, = 2, c = j0.395, Gs = 1/(50 ), = 2 2109 rad/s, L = Leff = 0.18 m (Ldrawn = 0.25 m), eff = 400 cm2/Vs, Esat = 5104 V/cm, Rl = 350 .)
167
L6 = 6 nH Vout+ M4 = 200/0.25
Vin L2
L4 = 0.8 nH 4.5 mA
Figure 6.18: Inductively-degenerated differential LNA. noise coupling. Although the design guidelines for the differential topology are similar to those for the single-ended case, there are a few minor differences. First the input impedance of the differential LNA is given by
g m Ls 1 2 C gs ( L g + Ls ) Z in = 2 j C gs C gs
(6.120)
where g m = g m1 = g m 2 , C gs = C gs1 = C gs 2 , Ls = L3 = L4 , and L g = L1 = L2 . In this case the input impedance is twice that of the single-ended topology. For this receiver prototype, an external balun is used to convert the single-ended RF input signal to a differential signal for the LNA. The unbalanced impedance of commercially-available baluns is 50 in order to provide a match to the preceding RF filter, while the balanced impedance can be either 50 , 100 , or 200 [6]. An inductively-degenerated differential LNA is depicted in Fig. 6.19 along with the equivalent half-circuit. Comparing the half-circuit in Fig. 6.19b with the single-ended circuit in Fig. 6.12a, the
' two circuits are equivalent for G s' = G s / 2 and I D = 2 I D . The drain current of the
differential LNA is plotted in Fig. 6.20 versus noise figure for G s' = 1 /(50 ) , 1/(100 ), and 1/(200 ). For the same noise figure performance, the differential LNA matched to the lowest source conductance consumes the least amount of current. Conversely, for the 168
M1 ' Gs L1 L3 L2 L4 ' ID
M2
L1
' ID /2
' 2Gs L3
(a)
(b)
' Figure 6.19: (a) Differential LNA with source conductance G s' and tail current I D . (b) Equivalent half-circuit.
101
102
' ID (A)
103
104
105
106 0 1 2 3 4 5 6 7 8 9 10
NFmin (dB)
Figure 6.20: Drain current versus noise figure of the inductively-degenerated differential LNA for several values of source conductance. same current consumption, the differential LNA matched to the lowest source conductance has the best noise figure [68]. Unfortunately, the impact of the source conductance on the noise figure was not fully appreciated at the time, and the LNA used in this receiver prototype was designed to match a balanced resistance of 50 instead of 200 . In the receiver prototype, the dimensions of each of the transistors M1 and M2 is 500 m 0.25 m. The LNA is biased at 4.5 mA and is powered by a separate 2.5-V supply,
169
which helps to isolate the LNA from the potentially noisy supplies of the other receiver circuits. In the layout, each of the input transistors is partitioned into ten blocks of ten fingers of 5 m 0.25 m devices. Fingering the devices helps to reduce the resistance associated with the polysilicon gate [64], [69]. Keeping this resistance small is critical since the noise associated with this resistance appears directly at the input of the LNA. In addition, substrate contacts are placed generously around and between the ten blocks in order to reduce the substrate resistance near the LNA. The noise voltage associated with this resistance modulates the bulk of the transistor and introduces an additional component to the drain current noise,
2 2 id ,sub = 4kTRsub g mb f
(6.121)
which can degrade the noise performance of the LNA [25], [70], [71]. The layout of the input transistors M1 and M2 is illustrated in Fig. 6.21. The noise factor of the differential LNA in Fig. 6.18 is also affected by the cascode transistors M3 and M4. While these cascode transistors slightly degrade the noise performance of the LNA, they provide increased isolation, reducing the amount of LO leakage from the mixer to the receiver input. In direct-conversion receivers, the frequency
M3
M4
M1
M2
170
of the leakage signal is the same as that of the desired signal, and consequently, is radiated from the antenna without any attenuation from the RF filter. This leakage signal is problematic since it can potentially interfere with other systems operating in the same frequency band. In addition to reducing the amount of LO leakage to the receiver input, the cascode transistors also reduce the influence of the gate-drain overlap capacitances of M1 and M2 on the LNA input impedance [68]. Including the gate-drain overlap capacitance, the input impedance of the equivalent half-circuit may be determined by the small-signal equivalent circuit illustrated in Fig. 6.22: Z in = g m Ls + Rl C gd 2 g m L g Ls C gd 2 Rl ( Lg + Ls )C gd C gs C gs + C gd (1 + g m Rl 2 Ls C gs + j g m Ls + j Rl C gs ) + (6.122) .
1 2 ( Lg + Ls )C gs 2 L g C gd 2 g m Rl ( Lg + Ls )C gd + 4 L g Ls C gd C gs j [C gs + C gd (1 + g m Rl 2 Ls C gs + j g m Ls + j Rl C gs )] Assuming that
2 =
(6.122) becomes
1 ( L g + Ls )C gs
(6.123)
Z in =
Ls C gs + C gd 1 + g m Rl + j ( g m Ls + Rl C gs ) L g + Ls C gd L g L g Ls + g m Rl + C gs L g + Ls ( L g + Ls ) 2 . Ls j C gs + C gd 1 + g m Rl + j ( g m Ls + Rl C gs ) L g + Ls
L g C gd g m Ls 1 ( L g + Ls )C gs
(6.124)
Since Cgd is typically much less than Cgs, (6.124) reduces to gmLs/Cgs as long as Rl is small. For the differential LNA in Fig. 6.18, the cascode transistors M3 and M4 present a small load resistance (Rl = 1/gm3 = 1/gm4) to each of the input devices M1 and M2, thus reducing the effect of the gate-drain overlap capacitances of M1 and M2 on the input impedance. Consequently, for increased insensitivity to the effects of the gate-drain 171
Lg + Cgs vgs
Cgd
gmvgs
Rl
Ls
Figure 6.22: Small-signal equivalent circuit of the inductively-degenerated LNA including gate-drain overlap capacitance. overlap capacitances, the transconductance of each of the cascode transistors should be large. However, a large transconductance also increases the noise contribution of each of the cascode devices. The widths of the cascode transistors should be chosen as a compromise between these two opposing factors. In the receiver prototype, the dimensions of each of the transistors M3 and M4 is 100 m 0.25 m. Inductors L3 L6 are realized as on-chip spiral inductors. A test chip (RFTRIPLED) was fabricated in order to evaluate the performance of various inductor test structures and the results are reported in Appendix D. The spiral inductors implemented in the receiver prototype use the top three layers of metal all shorted together using a large number of vias in order to reduce the series resistance, thus improving the inductor quality factor. Separate test structures were fabricated in order to characterize the performance of the inductors actually used in the receiver prototype. The geometries of inductors L3 L6 are summarized in Fig. 6.23. At 2 GHz, the measured quality factors of inductors L3 and L5 are 3.9 and 3.6, respectively. These low quality factors are typical for on-chip spiral inductors implemented on low-resistivity silicon substrates. The low quality factors of these on-chip spiral inductors are not detrimental to the low noise performance of the amplifier. Indeed, the series resistance of each of the inductors L3 and L4 must be kept small since the noise associated with each of these resistances contributes directly to the LNA noise figure. However, despite the low quality factor, the series resistance of each of the source inductors is actually quite small due to the low inductance values. The series resistance may be estimated by the following expression for quality factor:
172
N turns
L
R
(6.125)
From (6.125), the series resistance of each of the inductors L3 and L4 is only 2.6 at 2 GHz. The noise associated with the series resistance of each of the load inductors L5 and L6 also contributes to the LNA noise figure. In this case, the mean-square value of the input-referred voltage noise is approximately
v
2 i , Rl
L = 4kTRl s L l
(6.126)
where Ls is the source inductance and Ll and Rl are the inductance and series resistance, respectively, of the load inductor. Since Ls is much less that Ll, the noise contribution due to the series resistance of each of the load inductors is minimal with an equivalent noise resistance of only 0.37 at 2 GHz. Each of the LNA input bond pads consists of the top three layers of metal, all shorted together, while a fourth lower layer of metal acts as a shield [72]. However, rather than connecting the shield to ground, the shield instead is connected to the source terminal of the input transistor so that the pad capacitance appears in parallel with the gate-source capacitance of the input transistor as illustrated in Fig. 6.24 [73]. This technique significantly reduces the effect of the pad capacitances on the input matching. Including the pad capacitances, the input impedance of the amplifier can be described by
173
M1 Cp Cs L3 L4
M2
(6.127)
where CT is the parallel combination of the transistor gate-source capacitance and the pad capacitance Cp, while Cs is the capacitance between the pad shield and the substrate. In this case, input matching is achieved by equating the real part of Zin to 50 and then selecting the values of inductors L1 and L2 to tune out the imaginary part of Zin at the carrier frequency: L1 = L2 = L3 1 . CT 1 2 C s L3
2
(6.128)
On-chip spiral inductors should be avoided when implementing inductors L1 and L2. These inductors appear directly at the LNA input and any series resistance can potentially degrade the noise performance of the LNA. In particular, for low noise performance, the gate inductance must be relatively large, and implementation as an on-chip spiral inductor results in a relatively large series resistance. Instead, the inductors L1 and L2 are realized using the input bond wires, which have quality factors in excess of 20 and provide an inductance of about 1 nH/mm. The LNA input bond pads are recessed about 300 m from the edge of the chip in order to accommodate longer input bond wires, and consequently, higher inductance values. The layout of the LNA is illustrated in Fig. 6.25.
174
300 m
Figure 6.25: LNA layout. Inductors L5 and L6 provide a tuned response at the output of the LNA. At resonance, the load impedance is
1+ j Zl =
Ll
(6.129)
Rl 1 + jQ = j C l j C l
Q Cl
where Cl is the load capacitance, and Ll and Rl are the inductance and series resistance, respectively, of the load inductor. Larger voltage gains are achieved by using larger load impedances. However, the load impedance is limited by two opposing factors. First, for the same resonance frequency, larger load impedances are achieved by using smaller capacitance and larger inductance values. However, on-chip spiral inductors with larger inductance values tend to have lower quality factors. In the receiver prototype, the LNA load impedance is about 290 . The output of the LNA is connected to the input of the subsequent mixer through a pair of 2.3-pF coupling capacitors. Large coupling capacitors are used in order to minimize the signal attenuation resulting from the capacitive voltage divider formed by the coupling capacitors and the input capacitances of the mixer. The coupling capacitors are implemented as sandwich structures using the top four layers of metal [74]. In order to
175
Mb2 = 20/0.5
Mb3 = 20/0.5
100 A
Figure 6.26: Bias circuit for the LNA input. reduce the effects of voltage division, the parasitic bottom plate capacitances are placed at the LNA output, where all the parasitic capacitances are tuned out by load inductors L5 and L6. Finally, the bias circuit for the LNA input is illustrated in Fig. 6.26 [18]. The bias voltage at the LNA input is determined by the gate voltage of transistor Mb1. Each of the R-C-R networks acts as a choke for high-frequency signals without degrading the noise performance of the LNA.
Simulation Results. The LNA was simulated in SpectreRF with device models based on
the Philips MOS Model 9, which includes the effects of induced gate current noise [75]. Due to the tuned nature of the LNA, accurate simulation results require inclusion of all parasitic capacitances. A netlist for the LNA including the input bond pads was extracted
L C1 Rs C2
R1
R2
L Rs C1 C2 R1 R2
L5 , L6 6 nH 20 180 fF 180 fF 10 10
(a)
Figure 6.27: (a) Circuit model for on-chip spiral inductors. (b) Component values at 2 GHz.
176
30
25
gain (dB)
20
15
frequency (GHz)
10
S11 (dB)
15
20
25
frequency (GHz)
Figure 6.29: Simulated LNA S11. from the layout for use in simulations. In addition, the circuit illustrated in Fig. 6.27 was used to model the on-chip spiral inductors. The simulated gain of the LNA is illustrated in Fig. 6.28. The peak gain is 26 dB and occurs at approximately 2.1 GHz, while the gain at 2 GHz is about 25.8 dB. The simulated S11 of the LNA is illustrated in Fig. 6.29. S11 is the reflection coefficient seen looking into the input of the LNA [59]:
177
output noise (V2/Hz) 6.23 108 4.17 108 1.69 108 6.41 109 3.60 109 1.32 107
Table 6.2: Summary of LNA noise performance from simulation. S11 = V1 Z in Z o = V1+ Z in + Z o
(6.130)
where V1+ and V1 are the voltage amplitudes of the incident and reflected waves, respectively, and Zin and Zo are the source and LNA input impedances, respectively. Consequently, S11 is a measure of how well the LNA input impedance is matched to the source impedance, where S11 = 0 for a perfect match. This simulation was performed with a pair of ideal 4.5-nH gate inductors to complete the input tuning. The S11 at 2 GHz is about 22 dB. The simulated noise figure of the LNA is about 3.26 dB and the dominate noise contributors are summarized in Table 6.2. The simulated noise figure due to M1 and M2 alone is about 2.23 dB and agrees well with the 2.18-dB noise figure calculated from (6.87) using the simulated values of gm and Cgs and assuming that = 0.8, = 4, = 2, c = j0.395, Gs = 1/(50 ), and = 2 2109 rad/s. The simulated values of gm and Cgs are 0.03 1 and 453 fF, respectively, where Cgs consists of the parallel combination of the 165-fF pad capacitance and the 288-fF gate-source capacitance of transistor M1. Although the simulated noise figure agrees well with the performance predicted by (6.87), the noise figure of the LNA can actually be improved slightly. As illustrated in Fig. 6.20 for
G s' = 1 /(50 ) , the minimum achievable noise figure due to M1 and M2 alone is less than
2 dB for a bias current of 4.5 mA. The simulated performance of the LNA is summarized in Table 6.3.
6.3
Frequency Synthesizer
178
4.5 mA 2.5 V 1/(50 ) 500 m 0.25 m 0.03 1 288 fF 100 m 0.25 m 0.017 1 4.5 nH @ 2 GHz 0.8 nH 4 @ 2 GHz 6 nH 3.77 @ 2 GHz 165 fF 165 fF 827 mV 25.8 dB @ 2 GHz 21.9 dB @ 2 GHz 3.26 dB
Table 6.3: Summary of LNA simulation. An excellent and detailed description of the design and implementation of the frequency synthesizer is provided in [31]. This section offers a brief overview of the design along with a discussion of some of the design choices made for low power consumption. The 2-GHz I and Q LO signals are generated using a fully-differential, wide-bandwidth PLL illustrated in Fig. 6.30. The 2-GHz output signal from the VCO is divided by ten to produce the 200-MHz sampling clock for the ADC. This 200-MHz signal is then divided by four and locked to an external 50-MHz reference signal. The PLL has a nominal loop
50 MHz PFD CP/LF VCO 2 GHz
200 MHz
179
bandwidth of 3 MHz.
6.3.1
Voltage-Controlled Oscillator
The phase noise performance of an oscillator at an offset frequency from the center frequency 0 can be described by [76]
2 FkT L( ) = 10 log Psig 2 1 / f 3 0 1 + 2Q 1 +
(6.131)
where F is an excess noise factor, Psig is the output power of the oscillator, Q is the quality factor of the tank, and is the corner frequency between the 1 / f
2
and 1 / f 3
regions. From (6.131), the oscillator phase noise can be improved by increasing Psig as well as by increasing Q. Unfortunately, the lack of high-quality on-chip passive components exacerbates the difficulty of integrating low-power VCOs. Two commonly used VCO topologies are the LC-tuned oscillator and the ring oscillator.
Vctrl Vo Vo+
180
Table 6.4: Comparison of LC-tuned and ring-oscillator VCOs. An example of a differential LC-tuned VCO is illustrated in Fig. 6.31, while an example of a ring-oscillator VCO is illustrated in Fig. 6.32. The advantages and disadvantages of each of these oscillator topologies are summarized in Table. 6.4. The phase noise performance of LC-tuned oscillators is typically much better than that of ring oscillators due to the higher tank quality factor. Although the quality factor of on-chip spiral inductors is limited to about four for processes which rely on low-resisitivity silicon substrates (Appendix D), the equivalent quality factor of ring oscillators is even less with values ranging from 1 to 1.5 [77]. Nevertheless, for applications with relaxed phase noise requirements, the ring-oscillator VCO has two major advantages. First, the area of the ring-oscillator VCO is much less than that of the LC-tuned VCO, which requires large on-chip spiral inductors for implementing the tank. Second, the ring-oscillator VCO inherently provides the I and Q outputs required for quadrature demodulation. Although the power consumption of the LC-tuned oscillator alone is probably less than that of a ring oscillator for the same phase noise performance, the additional power consumption required for quadrature generation can be significant. Due to the relaxed phase-noise requirement of the application described here, the VCO in this receiver prototype is implemented as a four-stage ring oscillator illustrated in Fig. 6.33. Additional circuits for quadrature generation are not required, resulting in substantial savings in power consumption. The I and Q outputs are connected directly to the mixer LO ports, while a third output is connected to the frequency divider. Dummy
to dummy load I to divider Q
181
divider circuits are connected to the remaining VCO output in order to provide load matching for improved quadrature accuracy. Since all users transmit simultaneously in the same frequency band, the frequency synthesizer needs to generate only a single carrier frequency and can be implemented using a wide-bandwidth PLL. The wide loop bandwidth of the PLL suppresses the close-in phase noise of the ring-oscillator VCO [78], thus improving the overall phase noise performance of the frequency synthesizer.
6.3.2
A deadzone-free phase-frequency detector (PFD) is used to increase the pull-in range of the PLL. The PFD outputs rail-to-rail signals to drive the current steering charge pump, reducing the leakage currents that cause spurious tones to appear at the synthesizer output. A passive, second-order loop filter is chosen in order to minimize noise as well as power consumption. In addition, because of the wide loop bandwidth, the passive components used to implement the loop filter do not require significant die area. Minimizing the amount of signal coupling between the LO and the rest of the receiver is critical when designing frequency synthesizers for highly-integrated implementations. Unwanted signal coupling can occur through numerous mechanisms, including substrate current injection, capacitive coupling to long interconnects, and power supply bounce. In this prototype, the amount of coupling is reduced by implementing the digital portions of the PLL using either source-coupled logic (SCL) or differential cascode voltage switch logic (DCVSL) [79]. Fig. 6.34 illustrates inverter implementations based on these two
Vout
Vin
Vout
Vin
Vout
Vin
(a)
(b)
(c)
Figure 6.34: Inverter implementations. (a) Static CMOS. (b) SCL. (c) DCVSL.
182
logic styles along with static CMOS. Substrate current injection caused by charging and discharging capacitors to the substrate can be canceled to first-order by using fullydifferential or pseudo-differential circuit topologies, such as SCL or DCVSL, respectively. These two logic styles are also more robust against common-mode noise than single-ended implementations, such as static CMOS. Furthermore, the amount of supply bounce during transitions can be significantly reduced by using SCL, which drains constant current from the supply. Although SCL circuits consume static power, this logic style is used to realize the VCO and the high-frequency stages of the divider in order to minimize the amount of highfrequency current that can potentially couple into sensitive circuit components such as the LNA. Implementation of the last stage of the divider and the PFD is based on DCVSL, which eliminates static power consumption, but unlike static CMOS, also lessens the substrate current noise. In addition, a separate 2.5-V supply is used for these two blocks in order to further increase the isolation between the digital and analog circuit components. In terms of power consumption, the use of different logic styles is feasible since the overhead required to convert between SCL and DCVSL logic levels is minimal.
6.4
Mixer
An excellent and thorough description of the design and implementation of the I and Q mixers is provided in [74]. This section offers a brief overview of the design along with a discussion of some of the design choices made for low power consumption. The main function of the mixer is to frequency translate the desired RF signal to baseband without corrupting it through mechanisms such as noise or distortion. Two different types of mixers are passive mixers and active mixers. An example of a CMOS passive mixer is illustrated in Fig. 6.35 [68], while an example of a CMOS active mixer is illustrated in Fig. 6.36. The following criteria must be considered when selecting a mixer topology for a particular application: linearity, noise, conversion gain, power consumption, and port isolation.
6.4.1
Passive Mixers
183
LO RF LO IF
LO
LO
IF
LO
RF
Figure 6.36: CMOS double-balanced current-commutating mixer. Passive mixers do not provide any conversion gain, and in fact, actually result in conversion loss. In direct-conversion receivers, the desired signal can still be relatively weak at the input to the mixer, and this conversion loss imposes more stringent noise requirements in the subsequent baseband circuits. Although passive mixers introduce thermal noise, they do not introduce any flicker noise due to the absence of current, and thus passive mixers are potentially attractive for use in direct-conversion receivers. Low power consumption and excellent linearity performance [80] are two additional advantages of passive mixers. However, both of these advantages are negated if an additional amplifier is needed to compensate for the conversion loss of the passive mixer.
184
Finally, the port-to-port isolation of passive mixers is poor. In particular, the LO signal can couple to the RF port through the gate capacitances of the switches. Poor LO-to-RF isolation is unacceptable in direct-conversion receivers, which are susceptible to problems such as LO radiation from the antenna as well as DC offsets resulting from LO self-mixing.
6.4.2
Active Mixers
A popular active mixer is the double-balanced current-commutating mixer (Fig. 6.36) based on the Gilbert cell multiplier [81]. Unlike passive mixers, active mixers actually provide conversion gain, which relaxes the noise and gain requirements in the subsequent baseband circuits. However, active mixers contribute both thermal noise and flicker noise, which may be problematic in direct-conversion receivers. As already mentioned in Section 6.4.1, the power consumption and linearity of passive mixers alone are generally superior to that of active mixers. However, a fair comparison must also include the power consumption and linearity of any additional amplifiers to compensate for the lack of conversion gain in passive mixers. In this case the, the choice between an active mixer and a passive mixer is not as clear-cut. Finally, the port-to-port isolation of current-commutating mixers is generally better than that of passive mixers. In particular, the path between the LO and RF ports is separated by two transistors rather than just one.
6.4.3
Mixer Implementation
For this receiver prototype, the RF signal is frequency translated to baseband along parallel I and Q signal paths using a pair of double-balanced current-commutating mixers. The I and Q mixers are both based on the same topology illustrated in Fig. 6.37. An active mixer is selected because of its ability to provide conversion gain, and a double-balanced configuration is chosen in order to increase the mixers immunity to common-mode variations, including substrate noise introduced by the digital sections of the receiver. In addition, this topology provides excellent isolation between the LO and
185
Vload M9 Vout
M10 = 15/2
M5 VLO
M6
M7
M8 = 12.2/0.25
M4 = 7.5/0.35
420 A
Figure 6.37: Mixer topology used in the receiver prototype. RF ports of the mixer, which is further improved by adding cascode transistors M3 and M4. The gates of the switching transistors M5 M8 are connected directly to the I and Q outputs of the VCO. By dc coupling the LO signals to the input of the mixer and by locating these transistors immediately adjacent to the VCO in the layout, the capacitive loading on the VCO is minimized. Consequently, the need for clock buffers is avoided, resulting in substantial power savings. The sizes of transistors M5 M8 are chosen as a compromise between flicker noise performance of the mixer and power consumption in the VCO. Since the VCO outputs are directly connected to these transistors, small device dimensions are desirable in order to reduce the capacitive loading on the VCO. On the other hand, large device dimensions are desirable for improved flicker noise performance. Although the receiver prototype is based on a direct-conversion architecture, the wide bandwidth of the desired signal reduces the impact of flicker noise on the performance of this system. Consequently, the mixers flicker noise requirement is relaxed in favor of reduced power consumption in the frequency synthesizer.
186
Finally, the load devices M9 and M10 are biased in the linear region and their resistances can be changed by adjusting their gate bias voltages to provide variable gain capability [11].
6.5
Each of the I and Q baseband signals must be amplified and filtered before subsequent analog-to-digital conversion. Although the VGAs are not implemented in this receiver prototype, the baseband section still must provide a moderate amount of fixed gain since the gain from the LNA and mixer alone is not sufficient to meet the minimum gain requirement of this system. The baseband section of this direct-conversion receiver also provides high-pass filtering for dc-offset removal. Moreover, low-pass filtering provides rejection of out-of-band interferers as well as anti-alias filtering for the subsequent ADCs.
6.5.1
Low-Pass Filtering
For the receiver prototype, a Sallen and Key section is used for low-pass filtering. Alternative approaches include switched-capacitor techniques as well as other continuous-time techniques such as MOSFET-C or transconductance-C filters. In general, a continuous-time approach based on Sallen and Key sections is more appropriate for applications with relaxed filtering requirements and results in very simple, low-power implementations. On the other hand, switched-capacitor techniques and continuous-time techniques based on MOSFET-C or transconductance-C filters are more appropriate for applications with increased selectivity requirements. A brief discussion of each of these analog filtering techniques follows.
Sallen and Key Filter. A block diagram of a Sallen and Key section based on an
amplifier with gain K is illustrated in Fig. 6.38 [82], [21]. The voltage transfer function is given by Vout a K = 2 0 Vin s + a1 s + a0 (6.132)
187
C1
R1 Vin
R2 K Vout
C2
Figure 6.38: Sallen and Key filter block diagram. where a0 and a1 are given by, respectively,
a0 = a1 = 1 R1 R2 C1C 2
(6.133)
1 1 1 + + (1 K ) . R1C1 R2 C1 R2 C 2
(6.134)
Thus, the Sallen and Key section can be used to realize a second-order response using only a single active gain element. In fact, for K = 1, the active gain element can be implemented simply as a voltage follower. Although the Sallen and Key section lends itself to very simple circuit implementations, the response of this filter is particularly sensitive to component variations for poles with high quality factors [83], where the quality factor of a complex pole pair is a measure of the distance of the poles from the imaginary axis in the s plane as illustrated in Fig. 6.39. Thus, this approach may be problematic when used to implement higher-order filter responses which require complex
j s plane
0 2Q
188
R3 R1 C1
C2 r2
R4 Vin
R2
r1 Vout
Figure 6.40: Tow-Thomas biquad. pole pairs with high quality factors.
MOSFET-C and Transconductance-C Filters. Lower sensitivities to component
variations may be achieved for higher-order filters by implementing a second-order transfer function using additional active gain elements, such as the Tow-Thomas biquad illustrated in Fig. 6.40 [20], [83], [84]. The voltage transfer function is given by
r2 1 r1 R2 R4 C1C 2 . = r2 1 1 2 s + s+ R1C1 r1 R2 R3C1C 2
Vout Vin
(6.135)
The key building block for this biquad is the active RC integrator illustrated Fig. 6.41a which has the following transfer function: Vout 1 = . Vin sRC (6.136)
Vbias C
Vin Gm C Vout
R Vin Vout
Vin
Vout
(a)
(b)
(c)
189
CI
CS Vin 2 1 Vout
Figure 6.42: Switched-capacitor integrator. MOSFET-C integrator (Fig. 6.41b) [85] and the transconductance-C integrator (Fig. 6.41c) [86], [87]. In the former case, the resistance R is implemented using an MOS transistor operating in the linear region, and the resistance can be adjusted through the transistor gate bias voltage, Vbias, resulting in the ability to compensate for process variations. Alternatively, the RC integrator can be implemented using a transconductance element, resulting in the following integrator transfer function. Vout Gm = . Vin sC (6.137)
In this case, the effective resistance can be adjusted through the bias current of the transconductance element.
Switched-Capacitor Filter. The RC integrator can also be implemented using switched-
capacitor techniques as illustrated in Fig. 6.42 [88], [89]. The MOS switches are driven by a nonoverlapping clock with frequency f and phases 1 and 2. In this case, the time constant of the integrator is
CI f CS
(6.138)
and is dependent only on the clock frequency and the ratio of the integrating and sampling capacitors. Consequently, filters based on this technique are very robust against process variations and do not require additional tuning.
190
Comparison of Filter Implementation Techniques. Sallen and Key sections are very
amenable to simple, low-power circuit implementations. However, the response of highorder filters based on this technique is particularly sensitive to component variations, and thus, the use of Sallen and Key sections is most appropriate for applications with relaxed filtering requirements. For higher-order filter responses, lower sensitivity to component variations may be achieved by using switched-capacitor techniques as well as continuous-time techniques based on MOSFET-C and transconductance-C integrators. The main tradeoff for this reduced sensitivity is increased power consumption. For MOSFET-C and
transconductance-C filters, tuning is still required in order to achieve very accurate filter responses. On the other hand, switched-capacitor filters are very robust against process variations, and consequently, very accurate filter responses can be achieved without the need for tuning. Finally, continuous-time techniques based on MOSFET-C and transconductance-C integrators are more appropriate for applications with higher bandwidth requirements, while switched-capacitor techniques are more appropriate for applications with lower bandwidth requirements.
amplifier
HPF
buffer
Vin+
1 pF 2 k
40 pF 5 k 45 k 5 k
Vout+
375 fF
4 k 3 pF 40 pF 5 k 45 k 5 k
2 k
Vin
1 pF
Vout
375 fF
191
50/0.35
50/0.35
Vo 2 k 200/0.35 Vi
200 A
200 A
6.6
The I and Q baseband sections are each based on the block diagram illustrated in Fig. 6.43. All of the baseband circuits are implemented using large transistor sizes in order to reduce the amount of flicker noise. Immediately after frequency translation, shunt 1-pF capacitors in combination with the mixer output impedance provide first-order low-pass filtering of each of the baseband I and Q signals. A noninverting amplifier then provides moderate gain in order to reduce the impact of noise contributed by subsequent stages. The circuit schematic of this amplifier is illustrated in Fig. 6.44. The input of this amplifier is dc coupled to the mixer output, which sets the common-mode bias voltage at a nominal value of 1.9 V. Each of the baseband signals then passes through a first-order high-pass filter, which removes dc offsets and flicker noise from previous receiver stages. System-level simulations reveal that the SNR degradation is less than 0.5 dB for a high-pass corner frequency of up to 500 kHz. However, a much lower corner frequency is implemented in order to further reduce the SNR degradation as well as to account for process variations. Each filter is realized using on-chip passive structures, which include a pair of 40-pF capacitors and a pair of 45-k resistors, placing the high-pass corner frequency at about 90 kHz. The on-chip resistors and capacitors are implemented using unsalicided n+-poly and poly/n-well structures, respectively. The poly/n-well structure operating in
192
VFB
VGB
45 k 15 k
45 k
10 k
50/0.35
50/0.35
200/0.35 1 pF
200/0.35 1 pF
Vi+
200/0.35
Vo+
Vo
200/0.35
Vi
100 A
100 A
100 A
100 A
Figure 6.47: Buffer schematic. accumulation as illustrated in Fig. 6.45 [90] offers a large capacitive density of about 6 fF/m2 when biased above the flat-band voltage, VFB, of about 90 mV. The circuit schematic of the high-pass filter is illustrated in Fig. 6.46. The resistor string sets the common-mode bias voltage at the filter output. Large resistors are used in order to minimize the power consumed by the bias string. The high-pass filter is followed by the unity-gain buffer illustrated in Fig. 6.47.
193
Table 6.5: Poles for a third-order Butterworth low-pass frequency response. Next, each of the baseband signals passes through a second-order Sallen and Key lowpass filter. This low-pass filter provides attenuation of out-of-band interferers as well as anti-alias filtering for the subsequent ADC. Although implementations based on continuous-time filtering techniques are susceptible to variations in component values due to process variations, such variations are not as critical for this particular application due to the relaxed selectivity requirements. Moreover, since the baseband filtering is followed by a ADC operating at 200 MHz, the anti-alias filtering requirements are also much less stringent. The main requirement of the low-pass filter is that the corner frequency should be no less than the 16.25-MHz single-sided bandwidth of the desired signal. The poles of the Sallen and Key filter in combination with the pole at the mixer output provide an overall third-order Butterworth low-pass frequency response. The maximally flat gain and very linear phase response of the Butterworth filter result in very little signal distortion. When normalized to the corner frequency, 0 , the poles of a third-order Butterworth low-pass response are given by the roots of the equation [20]
200 A 3 pF
Vi+
5 k
5 k
Vi
375 fF
194
gm(VgVout) C1 Vout
R1 Vin
R2 Vg
C2
(6.139)
and the pole locations are summarized in Table 6.5. The circuit schematic of the Sallen and Key filter is illustrated in Fig. 6.48. The gain element of the Sallen and Key filter is implemented using a PMOS voltage follower [91]. In this process the source and bulk nodes of PMOS devices can be connected together in order to eliminate the body effect. In this case, the small-signal model of the equivalent half-circuit is illustrated in Fig. 6.49 and the gain of the voltage follower is
Vout R2 C1C 2 s 2 . K= = 1+ Vg C1 s + g m
(6.140)
Substituting (6.140) into (6.132) results in the following expression for the voltage transfer function of this filter:
Vout Vin
(6.141)
The passive component values are determined by setting ( R1 + R2 )C1C 2 1 + R1 R2 C1C 2 = 2 gm 0 (6.142)
195
C1 1 + ( R1 + R2 )C 2 = . 0 gm
(6.143)
For R1 = R2 = 5 k, 0 = 2 (16.25 MHz) , and 1 / g m = 372 , the corresponding values for C1 and C2 are 4 pF and 830 fF, respectively. However, in order to account for process variations as well as parasitic capacitances, the nominal values of C1 and C2 were chosen to be 3 pF and 375 fF, respectively. For these selected component values, simulations over process corners indicate that the 3-dB corner frequency varies between 17 MHz and 22 MHz. The resistors and capacitors are implemented using unsalicided n+-poly and poly/n-well structures, respectively. The transfer function in (6.141) also contains zeros which can affect the desired Butterworth frequency response. The zero locations are given by
1 2 R2 C 2
C 1 j 4 g m R2 2 1 . C1
s=
(6.144)
amplifier
HPF
buffer
196
For R1 = R2 = 5 k, 0 = 2 (16.25 MHz) , 1 / g m = 372 , C1 = 4 pF, and C2 = 830 fF, the zero locations are s = 1.2 108 j3.8 108. The magnitude of these zeros is approximately 63 MHz and is sufficiently greater than the corner frequency of 16.25 MHz so as not to adversely affect the desired Butterworth frequency response. Nevertheless, these zeros can still be problematic since they decrease the out-of-band attenuation of the filter. However, the inevitable presence of higher-order poles mitigates the severity of this problem. Finally, the layout of the baseband amplifiers and filters is illustrated in Fig. 6.50. The symmetric layout helps to improve the matching between the I and Q sections.
6.7
Analog-to-Digital Converter
An excellent and detailed description of the design and implementation of the I and Q ADCs is provided in [32]. This section offers a brief overview of the design along with a discussion of some of the design choices made for low power consumption. The ADC requirements are determined in Section 5.3.8 and summarized in Table. 5.1. For this system, each of the ADCs must have a Nyquist rate of at least 25 MHz and a resolution of at least 7 bits.
6.7.1
Pipeline Architecture
For these specifications, one possible approach of implementing the ADC is to use a pipeline architecture illustrated in Fig. 6.51 [92]. In this architecture, each of the N stages samples the signal from the previous stage and quantizes it to B bits. The quantized signal is then subtracted from the input signal and the result is amplified by an interstage gain block before being sampled by the next stage. This architecture is particularly amenable to low-power implementations. First, for an N B -bit ADC, this architecture requires
N 2 B comparators compared to 2 N B comparators for a flash architecture. Second and
more importantly, the circuit requirements of subsequent stages are relaxed, and capacitive scaling techniques can be used, resulting in significant power savings. Finally,
197
Vin
stage 1 B bits
stage 2 B bits
...
stage N B bits
Vin
2B
Vout
Figure 6.51: Pipeline ADC architecture. one potential disadvantage of the pipeline ADC architecture, as with any pipeline architecture, is latency. Several low-power ADCs have been implemented based on the pipeline architecture with capacitive scaling, including a 10-bit, 20-MS/s ADC consuming 35 mW [93] and a 10-bit, 40-MS/s ADC consuming 28 mW [94],[95]. The former was implemented in a 1.2-m process and the latter was implemented in a 0.6-m process. Although the pipeline architecture is a promising approach for implementing a lowpower ADC with the minimum requirements specified for this system, other factors must be considered in order to minimize the overall power consumption of the entire receiver. In particular, one of the most critical receiver functions is timing recovery, and a more efficient receiver implementation may be achieved when the ADC is designed in conjunction with the timing recovery algorithm.
6.7.2
In order to achieve an even more efficient receiver implementation, receiver functions must be carefully partitioned between the analog and digital hardware. Despite the relatively high Nyquist rate requirement of the ADC, a modulator is used for analogto-digital conversion in this receiver. This ADC is designed in conjunction with an
198
Digital Filters
Timing Recovery
M parallel streams
Multiplexer
z-1
Figure 6.52: -assisted timing recovery scheme. all-digital timing recovery algorithm, and when both the analog and digital hardware are considered, this approach results in a very efficient implementation. The use of different frequency references in the base-station transmitter and the mobile receiver introduces a frequency offset between the two LOs, and consequently, frequency estimation and compensation must be performed at the receiver before the data can be recovered. In order to provide adequate granularity for digital timing recovery, receivers typically oversample the input signal by at least two times the Nyquist rate [19]. Since oversampling is an inherent property of modulators, a ADC is an attractive approach for analog-to-digital conversion [96]. A block diagram of the proposed timing recovery algorithm for this receiver is illustrated in Fig. 6.52. The output of the modulator before decimation is a low-resolution, oversampled version of the data signal. Timing recovery can be performed by properly adjusting the phase of the digital decimation filter following the modulator. By including a variable-length delay line before the decimation filter, the timing recovery block can control the effective phase of the ADC sampling instant to within the granularity provided by the oversampling ratio (OSR). In order to achieve the same timing granularity with a pipeline ADC, a converter with a sampling rate of OSR 25 MHz would be required. Thus, when timing recovery issues are also considered, a pipeline ADC is no longer the definitive choice for low power consumption.
...
to data recovery
z-1
199
6.7.3
In the receiver prototype, the baseband I and Q signals are digitized using a pair of 7-bit, 25-MS/s ADCs operating at 200 MHz. Since the high Nyquist rate of the baseband signals restricts the converter to a low OSR of 8, the required dynamic range is achieved by using a 2-1-1 cascade architecture with single-bit quantization in each stage (Fig. 6.53). The integrators are implemented as fully-differential switched-capacitor circuits using folded-cascode operational amplifiers with NMOS input devices to maximize speed as illustrated in Fig. 6.54. The device sizes and bias points of each amplifier are optimized for minimum power consumption. Power consumption in the ADCs is further reduced by using capacitive scaling techniques [97]. However, the presence of parasitic capacitances limits the achievable power savings resulting from this approach. The capacitor values and bias current of each of the four integrators are summarized in Table 6.6.
6.8
A pair of test chips were fabricated to characterize the performance of the individual circuit components as well as the performance of the entire receiver. A micrograph of the first test chip (SCRRX) is illustrated in Fig. 6.55. This test chip includes the analog receiver consisting of the LNA, the mixer, the PLL, and the baseband circuits for amplification and filtering. Separate test circuits for the LNA, the mixer, and the PLL as well as a test circuit which consists of just the LNA and the mixer are also included. Test structures were also fabricated in order to facilitate the characterization of inductors used in the implementation of the LNA. A micrograph of the second test chip (SCRBARF) is illustrated in Fig. 6.56. The complete direct-conversion receiver including the ADCs is fabricated on this test chip. The area of this chip is 5.0 mm 5.2 mm including bond pads, but the circuits alone require only about 5 mm2.
200
1/3 -1/3
3/5 -2/5
DAC
DAC
DAC
CI
OTA +
Vout
Vin+ Ibias
Vin
Vout
Vout+
CI
Figure 6.54: Switched-capacitor integrator. Integrator 1 2 3 4 CS (fF) 90 90 87.5 75 CI (fF) 270 150 105 150 Ibias (A) 270 270 240 138
201
PLL
receiver
202
PLL
203
Chapter 7
Simulated Performance and Measurement Results
7.1
Simulated Performance
Extensive simulations were performed on the individual building blocks and many of these results are reported elsewhere. Simulation results for the LNA are reported in [98], the mixer in [74], the PLL in [31], and the ADC in [34]. In this section, the simulation results for the entire receiver are reported.
7.1.1
LNA/Mixer/Baseband Simulations
Most of the simulations were performed without the PLL and the ADCs since long simulation times are required when these blocks are included. Instead, only the LNA, the mixers, and the baseband amplifiers and filters are included. For these simulations, the LO signal is modeled as a 2-GHz noiseless sinusoid with an amplitude of 0.4 V. All simulations were performed using netlists extracted from the layout in order to include the effects of parasitic capacitances. In addition, all simulations were simulated across process corners: typical, fast, and slow. The gain at the output of the Sallen and Key filter is plotted versus frequency in Fig. 7.1, and a few characteristics of the frequency response are summarized in Table 7.1. The first-order high-pass filter and the third-order 205
50
40
30
gain (dB)
20
10 0.001
0.01
0.1
10
100
frequency (MHz)
Figure 7.1: Simulated frequency response at the output of the Sallen and Key filter over process corners. typical 43.8 dB 82.9 kHz 19.7 MHz fast 45.0 dB 104.4 kHz 22.5 MHz slow 42.2 dB 67.0 kHz 17.1 MHz
Table 7.1: Summary of receiver frequency response. Butterworth low-pass filter responses are evident in the overall frequency response of the receiver. For the typical process corner, the simulated frequency responses at the outputs of various receiver building blocks are illustrated in Fig. 7.2. At 1 MHz, the gains at the outputs of the mixer, the amplifier, the high-pass filter, and the Sallen and Key low-pass filter are 41.0 dB, 44.8 dB, 43.9 dB, and 43.8 dB, respectively. The simulated noise performance of the receiver is summarized in Table 7.2. For the typical, fast, and slow process corners, the noise figures are 5.52 dB, 4.81 dB, and 6.26 dB, respectively. The distortion performance of the receiver is summarized in Table 7.3. The 1-dB compression points are 34 dBm, 39 dBm, and 33 dBm for the typical, fast, and slow process corners.
206
50
40
30
gain (dB)
20
10
mixer output amplifier output HPF output Sallen and Key output
10 0.001
0.01
0.1
10
100
frequency (MHz)
Figure 7.2: Simulated frequency responses at the outputs of various receiver components (typical process corner). typical 448 V 845 V 5.52 dB fast 546 V 950 V 4.81 dB slow 352 V 722 V 6.26 dB
Table 7.2: Summary of simulated receiver noise performance (output noise integrated over 100 MHz). typical 34 dBm fast 39 dBm slow 33 dBm
Finally, a transient envelop simulation was performed using SpectreRF. The I and Q baseband transmit signals are first generated in Simulink (Fig. 7.3). Next, these signals are frequency translated in quadrature to RF using a behavioral model in SpectreRF. The spectra of the receiver output signals for input power levels of 43 dBm and 33 dBm are illustrated in Fig. 7.4. For an input power level of 33 dBm, the output signal is distorted due to compression.
7.1.2
LNA/Mixer/PLL/Baseband Simulation
207
10 0
10
magnitude (dB)
20 30 40 50 60 70 50 25
25
50
frequency (MHz)
magnitude (dB)
50 60 70 80 90
magnitude (dB)
20
25 30 35 40 45 50 55
100 50
25
50
60 50
25
25
50
frequency (MHz)
frequency (MHz)
(a)
(b)
Figure 7.4: Spectrum of receiver output signal from transient envelop simulations. (a) 43-dBm input power. (b) 33-dBm input power. When including the PLL along with the LNA, the mixers, and the baseband amplifiers and filters, the simulation times are much longer. Consequently, only a single simulation was performed in order to characterize the noise performance for the typical process corner. The simulated noise performance is summarized in Table 7.4. When including the PLL, the noise figure increases slightly to 5.7 dB compared to 5.52 dB without the PLL.
7.1.3
LNA/Mixer/PLL/Baseband/ADC Simulation
A transient simulation was performed on the entire receiver, including the LNA, the mixers, the PLL, the baseband amplifiers and filters, and the ADCs. A 2.01-GHz sinusoidal RF signal is applied to the receiver input and the 10-MHz output signals from
208
Table 7.4: Summary of simulated receiver noise performance including PLL for typical process corner (output noise integrated over 100 MHz).
200
150
100
50
I/Q (mV)
0 50
100 150 200 0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
time (s)
Figure 7.5: Simulated transient response. each of the I and Q Sallen and Key filters is illustrated in Fig. 7.5. In addition, the digital output signals from each of the I and Q modulators were processed in MATLAB in order to verify the functionality of the entire receiver.
7.2
Measurement Results
The two receiver test chips, SCRRX and SCRBARF, are directly attached to the test boards using chip-on-board (COB) packaging technology. This technique offers reduced package parasitics since the chip pads are bonded directly to landing zones on the test board and the package is essentially eliminated. However, this benefit comes at the expense of decreased testing flexibility since the chips are not easily interchangeable.
209
Although initial testing verified the basic functionality of the various test structures on the SCRRX chip, two main factors prohibited extensive characterization of this chip. First, antenna protection diodes were not included on this chip, and the lack of these diodes resulted in inconsistent performance. These diodes are required at the gate nodes of transistors with long interconnects in order to prevent charge accumulation during the plasma processing steps [99]. Failure to do so results in unpredictable transistor threshold voltages, and unfortunately, the ramifications of not including these diodes were not fully appreciated at the time. Second, several design errors in the test board exacerbated the difficulty of fully characterizing the various test structures on the SCRRX test chip. All of these errors were corrected in the SCRBARF prototype chip and the corresponding test board, from which all of the results reported in this section were measured. In order to ease testing, rather than iterating over different bond-wire lengths, a combination of the input bond wires and a pair of 1-nH chip inductors were used to complete the LNA input tuning. The receiver input provides an excellent match to 50 with a measured S11 better than 30 dB (Fig. 7.6). The measured frequency response at the output of each of the I and Q Sallen and Key filters is illustrated in Fig. 7.7. The high-pass and low-pass corner frequencies are about 100 kHz and 17 MHz, respectively. The receiver gain is 41 dB with less than 0.5-dB gain
0 5 10 15
10 15 20 25
S11 (dB)
S11 (dB)
20 25 30 35 40 45 50 1.5 1.6 1.7 1.8 1.9 2.0 2.1 2.2 2.3 2.4 2.5
30 35 40 45 50 1.96
1.97
1.98
1.99
2.00
2.01
2.02
2.03
2.04
frequency (GHz)
frequency (GHz)
(a)
(b)
210
50
40
30
20
gain (dB)
10
0 10 20 30 0.01 I Q
0.1
10
100
frequency (MHz)
Figure 7.7: Measured frequency response at the output of each of the I and Q Sallen and Key filters.
2.0
1.5
1.0
0.5
0.1
10
100
frequency (MHz)
Figure 7.8: Measured I and Q gain mismatch. mismatch between the I and Q paths over the 3-dB bandwidth (Fig. 7.8). The noise performance of the receiver is measured at the output of the Sallen and Key filter. However, since the receiver output noise is below the noise floor of the spectrum analyzer, additional low-noise baseband amplifiers are used to amplify the receiver output noise. The noise performance of the receiver is then determined by first applying a noise source in the cold state at the receiver input and measuring the output noise on the spectrum analyzer [100]. In this case, the noise source produces a noise power of kTB. 211
12
11
10
4 0 5 10 15 20 25
frequency (MHz)
Figure 7.9: Measured receiver noise performance. Next, the output noise is measured when a noise source in the hot state is applied at the receiver input. In this case, the noise source produces a noise power of ENR kTB , where ENR is the excess noise power ratio. The noise factor is given by
F= ENR 1 OPR 1
(7.1)
where OPR is the ratio between the measured output noise powers for the two cases. The noise figure of the receiver is plotted versus frequency in Fig. 7.9. The noise figure is less than 9 dB over the 3-dB bandwidth and includes approximately 1 dB of insertion loss from the external balun. The measured distortion performance of the receiver is illustrated in Fig. 7.10. The input 1-dB compression point of the receiver is 31.1 dBm. The out-of-band IIP2 is measured by applying two sinusoids at the receiver input with offsets of 27 MHz and 37 MHz from the carrier frequency. In this case, the second-order intermodulation product appears at 10 MHz and the measured IIP2 is 6.7 dBm. The out-of-band IIP3 is measured by applying two sinusoids at the receiver input with offsets of 35 MHz and 60 MHz from the carrier frequency. In this case, the third-order intermodulation product also appears at 10 MHz and the measured IIP3 is 18.3 dBm.
212
30 20
I/Q mixers 3 mW 3%
Figure 7.11: Measured receiver power consumption. The measured LO-to-RF leakage is 81 dBm and the receivers total power consumption is 106 mW. A breakdown of the power consumption is illustrated in Fig. 7.11. The frequency synthesizer and the ADC were also characterized separately [31], [34]. The phase noise performance of the frequency synthesizer is illustrated in Fig. 7.12. The phase noise is 85 dBc/Hz at a 2.5-MHz offset. The ADC has a dynamic range of 42 dB and a peak SNDR of 40 dB when operating at a frequency of 200 MHz (Fig. 7.13a). The output spectrum when a 33-dBFS, 3.125-MHz sinusoid is applied to the input of the modulator is illustrated in Fig. 7.13b. The noise shaping of the modulator is evident in the measured output spectrum.
213
80
85
90
95
100
105
110 10k
100k
1M
10M
30 25
20
SNDR (dB)
20 15 10 5 0 5
power (dBFS)
40
60
80
DR = 42 dB
100
10 50
45
40
35
30
25
20
15
10
frequency (MHz)
(a)
(b)
Figure 7.13: modulator operating at 200 MHz. (a) Measured dynamic range. (b) Measured output spectrum. All of the receiver performance measurements are summarized in Table 7.5 along with the simulated results. Except for the phase noise of the PLL, all of the other measured receiver specifications either match or exceed the specifications used for the initial system-level simulation listed in Table 5.2. The system downlink was resimulated in Simulink using the measured performance specifications, and the SNR of the I and Q data from the output of the multiuser detector is still approximately 15 dB.
214
carrier frequency noise figure (DSB) S11 voltage gain I/Q gain mismatch HPF corner frequency LPF corner frequency PLL phase noise I/Q phase mismatch IIP2 IIP3 1-dB compression dynamic range SNDR LO-to-RF leakage power dissipation
simulated performance 2 GHz 4.81 6.26 dB < 20 dB 42.2 45 dB not simulated 67.0 104.4 kHz 17.1 22.5 MHz 92 85 dBc/Hz @ 2.5 MHz not simulated not simulated not simulated 39 33 dBm 46.4 dB @ 200 MHz 45 dB @ 200 MHz not simulated not simulated
measured performance 2 GHz < 9 dB < 30 dB 41 dB < 2% 100 kHz 17 MHz 85 dBc/Hz @ 2.5 MHz < 2.5 6.7 dBm 18.3 dBm 31.1 dBm 42 dB @ 200 MHz 40 dB @ 200 MHz 81 dBm 106 mW
Finally, the receiver was tested using a modulated RF input signal. The test setup is illustrated in Fig. 7.14. The Simulink simulation framework is used to help verify the functionality of the receiver test chip. The same Simulink blocks that were used to evaluate the overall performance of the system are also used to generate the digital I and Q input signals for the base-station transmitter as well as to process the digital I and Q output signals from the receiver. The TX DSP block in Simulink performs QPSK modulation, signal spreading, and pulse shaping, and the 100-MHz output streams from this block, which consist of the combined data channels for ten users, are converted to baseband analog I and Q signals by the Arbitrary Waveform Generator. Next, the Vector Signal Generator translates the baseband I and Q signals to the 2-GHz carrier frequency, and this modulated RF signal is then applied to the receiver input. The digital I and Q output signals from the receiver are captured by the Logic Analysis System, and then the RX DSP block in Simulink performs timing and data recovery on the acquired data. The constellation diagrams for the I and Q signals before and after data recovery using the adaptive MUD algorithm are illustrated in Fig. 7.15. These measured constellation
215
I TX DSP Q
10 users, equal power
I RX DSP Q
timing & data recover
I Q
2-GHz RF
I receiver Q
Figure 7.14: Test setup for receiver measurement with a modulated RF input signal.
1
0.5
0.5
1 1
0.5
0.5
Figure 7.15: Measured constellation diagrams using a modulated RF input signal. diagrams demonstrate that the system achieves good overall performance when a modulated RF signal is applied to the input of the receiver test chip.
7.3
Measurement Issues
216
7.3.1
In the receiver prototype described in this thesis, electrostatic discharge (ESD) protection structures were not included in order to avoid additional parasitic capacitances which would be detrimental to the operation of the high-speed RF circuits. Unfortunately, the reliability of the prototype chips is significantly reduced without ESD protection structures [101], [102]. The yield of the receiver prototype chips was also affected by the lack of sufficient dummy structures. Dummy structures are required on the active, polysilicon, and metal layers for processes which rely on chemical mechanical polishing (CMP) for planarization. These dummy structures were not included around the receiver circuits since the effect of these structures on noise coupling was not well understood. Unfortunately, the yield of the prototype chips is significantly reduced without the dummy structures.
7.3.2
Packaging Technology
The receiver prototype chips were attached to the test boards using COB packaging technology. Although this technique offers reduced package parasitics, this benefit comes at the expense of decreased testing flexibility since the chips are not easily interchangeable. The inability to easily interchange prototype chips from the test boards was particularly problematic due to the low yield resulting from the lack of sufficient dummy structures. Alternatively, the ball grid array (BGA) package provides low parasitics without the disadvantage of decreased testing flexibility [103]. Moreover, this packaging technology can be used to include passive structures with high quality factors as an alternative to on-chip structures with much lower quality factors.
217
Chapter 8
Conclusion
8.1
Research Summary
The success of future wireless systems will depend heavily on their ability to provide high capacity while maintaining low cost, small form factor, and low power consumption in the portable devices. By tightly incorporating implementation issues throughout the process of defining the system specifications, an efficient solution can be achieved without necessarily sacrificing overall performance. This thesis described a design methodology which facilitates the evaluation of tradeoffs between implementation issues and overall system performance, focusing primarily on the receiver as an example. First, system-level specifications, such as modulation scheme and signal bandwidth, strongly influence the choice of receiver architecture, which in turn, has ramifications on the achievable power consumption and integration level. When system-level specifications are determined without considering their impact on receiver architecture selection, single-chip solutions may be very difficult to achieve or just simply infeasible. Some selection guidelines were presented in Chapter 2 for the heterodyne, direct-conversion, image-reject, and low-IF receiver architectures.
219
Second, the rapid improvements in digital CMOS technology provide an opportunity to use advanced digital signal processing algorithms which in the past were considered too complex to implement in the mobile device. These algorithms promise significant increases in system performance but their performance may ultimately be limited by analog circuit impairments, such as noise and distortion. This thesis described the detrimental effects of a number of these impairments and presented a system-level simulation framework which facilitates the direct evaluation of these effects on the performance of digital communications algorithms. The simulation framework is implemented in Simulink, which offers compatibility with MATLAB, a simulation tool already widely used for the development and evaluation of communications algorithms. This simulation framework relies on baseband-equivalent models for all of the RF building blocks in order to avoid simulation at the carrier frequency, resulting in faster simulation times. These strategies were then applied to the design of a high-speed wireless downlink for an indoor picocellular system. The system provides an aggregate data rate of 50 Mb/s with a transmission bandwidth of 32.5 MHz and a carrier frequency of 2 GHz. The wide bandwidth of the desired signal facilitates the use of a direct-conversion architecture. In this case, on-chip high-pass filtering can be used to remove dc offsets, and system-level simulations confirmed that the SNR degradation is less than 0.5 dB for a high-pass corner frequency of up to 500 kHz. Also, complete end-to-end simulations of the system downlink were performed in Simulink and revealed that the digital multiuser detection algorithm used for data recovery is relatively insensitive to analog hardware impairments. Finally, a receiver prototype was implemented to meet the specifications determined from the system-level simulations. A power-efficient solution was achieved by taking advantage of the relaxed specifications as well as by using low-power circuit implementation techniques. This receiver prototype includes the low-noise amplifier, frequency synthesizer, mixers, baseband amplifiers and filters, and analog-to-digital converters, all implemented on a single chip with a power dissipation of about 100 mW.
8.2
Future Work
220
8.2.1
Bottom-Up Verification
The research presented in this thesis demonstrates that by tightly incorporating implementation issues throughout the process of system definition, a very efficient solution can be achieved without necessarily sacrificing overall performance. A top-down design approach based on Simulink was used to determine the tolerable levels of analog circuit impairments. However, once the receiver circuits are designed, it is necessary to verify that the system actually achieves the desired performance using these circuits. For the receiver prototype described in this thesis, verification was performed by resimulating the entire system downlink in Simulink using both simulated results from SpectreRF and measured results from the receiver prototype. Verification was also performed using the transient envelop simulation capability of SpectreRF. Unfortunately, both of these verification approaches have their shortcomings. Although the former approach is quite straightforward, as with any approach based on behavioral models, the accuracy of the simulation results depends on the accuracy of the models. Unfortunately, the Simulink simulation framework described in this thesis relies on behavioral models which may not accurately represent the behavior of the actual circuits over all input power levels [104]. Finally, although transient envelop simulations in SpectreRF provide very accurate results, these simulations are also very slow since they rely on transistorlevel models rather than on behavioral models.
8.2.2
In the Simulink simulation framework described in this thesis, the behavioral models rely on a third-order power series in order to model circuit distortion (Appendix A). For some communications systems it may be necessary to include the effects of higher-order nonlinearities. Implementing the behavioral models so that the order of the power series is an input parameter rather than being fixed allows for increased flexibility. Moreover, an approach based on power series provides an accurate description of distortion in circuits that are memoryless but may not be adequate for modeling distortion in circuits at high frequencies, where the effects of parasitic capacitances become
221
significant. Although an approach based on Volterra series provides much more accurate results in this case, calculations based on this approach are rather complex, even when using computer simulation techniques. Further investigation is needed in order to determine accurate yet efficient methods of modeling distortion in high-frequency circuits.
8.2.3
Single-Chip Integration
The receiver prototype presented in this thesis achieves a very high level of integration with a low number of off-chip components. All RF and analog baseband components are integrated onto a single chip, while the external components include an antenna, RF filter, crystal reference, a pair of chip inductors, and a pair of baluns. The off-chip inductors were used for convenience in order to avoid iterating over different bond wire lengths for the LNA input matching, and consequently, are not a serious impediment to eventual integration. For this receiver prototype, a fully-differential on-chip signal path was used in order to mitigate the coupling between different receiver components. The use of a differential signal path necessitates the use of off-chip baluns to convert the single-ended off-chip signals to differential signals. One possible way of eliminating these baluns is to also use a fully-differential off-chip signal path, which requires further work in the areas of differential antenna design as well as differential RF filter design. Furthermore, higher levels of integration may be achieved by taking advantage of packaging technologies such as the BGA package already mentioned in Section 7.3.2. This packaging technology can be used to include passive structures with high quality factors so that components such as the RF filter can be integrated seamlessly into the package. Finally, in addition to using a fully-differential on-chip signal path, other techniques were used to mitigate the coupling between the different receiver components, including using separate supplies for the analog and digital circuits as well as implementing the digital sections of the PLL using logic styles, such as SCL and DCVSL, which result is less
222
substrate current injection. Despite these efforts, the consequences of integrating all of the digital baseband circuits, such as those required to implement the data and timing recovery algorithms, is unclear. In particular, further investigation is needed in order to determine the feasibility of integrating all of the analog and digital signal processing circuits onto a single chip.
223
Appendix A
Baseband-Equivalent Models
A.1
RF Amplifiers
The transfer function of any RF gain block can be represented by the following relationship:
xo (t ) = a n xin (t )
n =0 N
(A.1)
where xi(t) and xo(t) are the input and output signals, respectively. Let xi(t) and xo(t) have the same form as (4.29):
xi (t ) = xiDC (t ) + [ xiIn (t ) cos(n c t ) + xiQn (t ) sin(n c t )]
n =1 N
(A.2)
(A.3)
The output coefficients xoDC(t), xoIn(t), and xoQn(t) in terms of the input coefficients xiDC(t), xiIn(t), and xiQn(t) for N = 3 are given in Tables A.1 and A.2.
225
xoI 1 (t )
xoI 2 (t )
226
xoI 3 (t )
xoDC (t )
1 3 3 a2 x2 a3 xiDC x2 a1 xiDC a2 x2 iDC a3 xiDC iI1 iI1 2 2 1 3 3 a3 x2 xiI2 a2 x2 a3 xiDC x2 a3 xiI1 xiI2 xiI3 iI1 iI2 iI2 2 2 2 3 1 3 3 a2 x2 a3 xiDC x2 a2 x2 a3 xiDC x2 a3 xiI2 x2 iI3 iI3 iQ1 iQ1 iQ1 2 2 2 4 3 1 3 a3 xiI1 xiQ1 xiQ2 a3 xiI3 xiQ1 xiQ2 a2 x2 a3 xiDC x2 iQ2 iQ2 2 2 2 3 1 3 a3 xiI2 xiQ1 xiQ3 a3 xiI1 xiQ2 xiQ3 a2 x2 a3 xiDC x2 iQ3 iQ3 2 2 2 3 a3 x3 a1 xiI1 2 a2 xiDC xiI1 3 a3 x2 xiI1 iDC iI1 a2 xiI1 xiI2 4 3 3 a3 xiI1 x2 a3 x2 xiI3 a2 xiI2 xiI3 3 a3 xiDC xiI1 xiI2 iI2 iI1 2 4 3 3 3 a3 x2 xiI3 a3 xiI1 x2 a3 xiI1 x2 3 a3 xiDC xiI2 xiI3 iI2 iI3 iQ1 4 2 4 3 3 a3 xiI3 x2 a3 xiI1 x2 iQ1 a2 xiQ1 xiQ2 3 a3 xiDC xiQ1 xiQ2 iQ2 4 2 3 3 a3 xiI3 x2 a3 xiI1 xiQ1 xiQ3 a2 xiQ2 xiQ3 3 a3 xiDC xiQ2 xiQ3 iQ2 4 2 3 3 a3 xiI2 xiQ2 xiQ3 a3 xiI1 x2 iQ3 2 2 1 3 2 a2 x2 a3 xiDC x2 iI1 iI1 a1 xiI2 2 a2 xiDC xiI2 3 a3 xiDC xiI2 2 2 3 3 a3 x2 xiI2 a3 x3 iI1 iI2 a2 xiI1 xiI3 3 a3 xiDC xiI1 xiI3 2 4 3 3 1 3 a3 xiI1 xiI2 xiI3 a3 xiI2 x2 a2 x2 a3 xiDC x2 iI3 iQ1 iQ1 2 2 2 2 3 3 3 a3 xiI2 x2 a3 xiI3 xiQ1 xiQ2 a3 xiI2 x2 iQ1 iQ2 a2 xiQ1 xiQ3 2 2 4 3 3 a3 xiI2 xiQ1 xiQ3 a3 xiI1 xiQ2 xiQ3 3 a3 xiDC xiQ1 xiQ3 2 2 3 a3 xiI2 x2 iQ3 2 1 3 a3 x3 a3 xiI1 x2 iI1 a2 xiI1 xiI2 3 a3 xiDC xiI1 xiI2 iI2 a1 xiI3 4 4 3 3 2 a2 xiDC xiI3 3 a3 x2 xiI3 a3 x2 xiI3 a3 x2 xiI3 iDC iI1 iI2 2 2 3 3 3 a3 x3 a3 xiI1 x2 a3 xiI3 x2 iI3 iQ1 iQ1 a2 xiQ1 xiQ2 4 4 2 3 3 3 a3 xiI2 xiQ1 xiQ2 a3 xiI1 x2 a3 xiI3 x2 3 a3 xiDC xiQ1 xiQ2 iQ2 iQ2 2 4 2 3 a3 xiI3 x2 iQ3 4
a0 3 4 1 2 3 2 3 2
xoQ1 (t )
xoQ 2 (t )
A.2
Mixers
where yi(t), yLO(t), and yo(t) are the input, oscillator, and output signals, respectively. For a mixers in direct-conversion and low-IF receivers, let yi(t), yLO(t), and yo(t) have the same form as (4.29):
yi (t ) = y iDC (t ) + [ y iIn (t ) cos(n c t ) + y iQn (t ) sin( n c t )]
n =1 N
(A.5)
227
xoQ 3 (t )
a2 xiI1 xiQ1 3 a3 xiDC xiI1 xiQ1 a2 xiI3 xiQ1 3 a3 xiDC xiI3 xiQ1 3 a3 xiI2 xiI3 xiQ1 a1 xiQ2 2 a2 xiDC xiQ2 3 a3 x2 xiQ2 iDC 2 3 3 3 3 a3 x2 xiQ2 a3 x2 xiQ2 a3 xiI1 xiI3 xiQ2 a3 x2 xiQ2 iI1 iI2 iI3 2 4 2 2 3 3 a3 x2 xiQ2 a3 x3 iQ1 iQ2 a2 xiI1 xiQ3 3 a3 xiDC xiI1 xiQ3 2 4 3 3 3 a3 xiI1 xiI2 xiQ3 a3 xiQ1 xiQ2 xiQ3 a3 xiQ2 x2 iQ3 2 2 2 3 3 a3 x2 xiQ1 a2 xiI2 xiQ1 3 a3 xiDC xiI2 xiQ1 a3 x2 xiQ1 iI1 iI2 4 4 1 3 a3 x3 a3 xiI1 xiI2 xiQ2 iQ1 a2 xiI1 xiQ2 3 a3 xiDC xiI1 xiQ2 4 2 3 3 2 a3 xiQ1 x2 a3 x2 xiQ3 iQ2 a1 xiQ3 2 a2 xiDC xiQ3 3 a3 xiDC xiQ3 iI1 4 2 3 3 3 3 a3 x2 xiQ3 a3 x2 xiQ3 a3 x2 xiQ3 a3 x2 xiQ3 iI2 iI3 iQ1 iQ2 2 4 2 2 3 a3 x3 iQ3 4
+ +
3 a1 xiQ1 2 a2 xiDC xiQ1 3 a3 x2 xiQ1 a3 x2 xiQ1 a2 xiI2 xiQ1 iDC iI1 4 3 3 3 3 a3 xiDC xiI2 xiQ1 a3 x2 xiQ1 a3 xiI1 xiI3 xiQ1 a3 x2 xiQ1 iI2 iI3 2 2 2 3 a3 x3 iQ1 a2 xiI1 xiQ2 3 a3 xiDC xiI1 xiQ2 a2 xiI3 xiQ2 4 3 3 3 3 a3 xiDC xiI3 xiQ2 a3 xiI2 xiI3 xiQ2 a3 xiQ1 x2 a3 x2 xiQ3 iQ2 iI1 2 2 4 3 3 a2 xiI2 xiQ3 3 a3 xiDC xiI2 xiQ3 a3 x2 xiQ3 a3 x2 xiQ3 iI2 iQ1 4 4 3 3 a3 x2 xiQ3 a3 xiQ1 x2 iQ2 iQ3 4 2
+ +
+ +
(A.6)
(A.7)
The output coefficients yoDC(t), yoIn(t), and yoQn(t) in terms of the input coefficients yiDC(t), yiIn(t), yiQn(t), yLODC(t), yLOIn(t), and yLOQn(t) for N = 3 are given in Table A.3.
yiI1 yLOI1 yiI2 yLOI2 2 2 yiI3 yLOI3 yiQ1 yLOQ1 yiQ2 yLOQ2 yiQ3 yLOQ3 2 2 2 2 yiI2 yLOI1 yiI1 yLOI2 yiI3 yLOI2 yiI1 yLODC yiDC yLOI1 2 2 2 yiI2 yLOI3 yiQ2 yLOQ1 yiQ1 yLOQ2 yiQ3 yLOQ2 yiQ2 yLOQ3 2 2 2 2 2 yiI1 yLOI1 yiI3 yLOI1 yiI2 yLODC yiDC yLOI2 2 2 yiI1 yLOI3 yiQ1 yLOQ1 yiQ3 yLOQ1 yiQ1 yLOQ3 2 2 2 2 yiI2 yLOI1 yiI1 yLOI2 yiQ2 yLOQ1 yiQ1 yLOQ2 yiI3 yLODC yiDC yLOI3 2 2 2 2 yiQ2 yLOI1 yiQ1 yLOI2 yiQ3 yLOI2 yiQ2 yLOI3 yiQ1 yLODC 2 2 2 2 yiI2 yLOQ1 yiI1 yLOQ2 yiI3 yLOQ2 yiI2 yLOQ3 yiDC yLOQ1 2 2 2 2 yiQ1 yLOI1 yiQ3 yLOI1 yiQ1 yLOI3 yiQ2 yLODC 2 2 2 yiI1 yLOQ1 yiI3 yLOQ1 yiI1 yLOQ3 yiDC yLOQ2 2 2 2 yiQ2 yLOI1 yiQ1 yLOI2 yiI2 yLOQ1 yiI1 yLOQ2 yiQ3 yLODC yiDC yLOQ3 2 2 2 2 yiDC yLODC
y oDC (t )
y oI 1 (t )
y oI 2 (t )
y oQ1 (t )
y oQ 2 (t )
Table A.3: Baseband-equivalent model for mixers (direct-conversion and low-IF). For mixers in heterodyne receivers, let yi(t) be given by (A.5) while yLO(t) and yo(t) are given by, respectively,
228
y oQ 3 (t )
y oI 3 (t )
+ +
+ +
+ +
+ +
+ + +
y LO (t ) = y LODC (t ) + { y LOIn (t ) cos[n( c IF )t ] + y LOQn (t ) sin[n( c IF )t ]} = y LODC (t ) + {[ y LOIn (t ) cos(n IF t ) y LOQn (t ) sin(n IF t )] cos(n c t ) +
n =1 n =1 N
(A.8)
[ y
n =1 N N N N
oIRFn
(A.9)
oIPnm
{ y
n =1 m =1
{ y
n =1 m =1
oINnm
The output coefficients of yo(t) in terms of the input coefficients of yi(t) and yLO(t) for
N = 3 are given in Tables A.4 and A.5.
y oDC (t ) y oIIF 1 (t ) y oIIF 2 (t ) y oIIF 3 (t ) y oIRF 1 (t ) y oIRF 2 (t ) y oIRF 3 (t ) y oIP11 (t ) y oIP12 (t ) y oIP13 (t )
yiDC yLODC
y oQP13 (t )
229
y oQP12 (t )
y oQP11 (t )
y oQIF 3 (t )
y oQIF 2 (t )
+ + +
y oQIF 1 (t )
y oIN 11 (t ) y oIN 12 (t ) y oIN 13 (t ) y oIP 21 (t ) y oIP 22 (t ) y oIP 23 (t ) y oIN 21 (t ) y oIN 22 (t ) y oIN 23 (t ) y oIP 31 (t ) y oIP 32 (t ) y oIP 33 (t ) y oIN 31 (t ) y oIN 32 (t ) y oIN 33 (t )
yiDC yLOI1
y oQN 11 (t )
yiQ1 yLOQ2 2 yiQ2 yLOQ3 2 yiQ3 yLOQ1 2
yiDC yLOQ1
0 0
y oQP 22 (t ) y oQP 23 (t )
yiQ1 yLOQ1 2
0 0
yiDC yLOI2
y oQN 22 (t )
yiQ1 yLOQ3 2
yiDC yLOQ2
0 0 0
0 0 0
yiDC yLOI3
y oQN 33 (t )
yiDC yLOQ3
230
y oQN 32 (t )
y oQN 31 (t )
yiI1 yLOI3 2
y oQN 23 (t )
1 yiQ1 yLOI3 2
yiI1 yLOI1 2
y oQN 21 (t )
yiQ1 yLOI1 2
y oQP 21 (t )
yiI1 yLOQ1 2
y oQN 13 (t )
+ + +
y oQN 12 (t )
1 yiI1 yLOQ2 yiQ1 yLOI2 2 2 1 yiI2 yLOQ3 yiQ2 yLOI3 2 2 yiQ3 yLOI1 yiI3 yLOQ1 2 2
yiI1 yLOQ3 2
Appendix B
DC-Offset Cancellation
B.1
Introduction
For systems with narrowband signals, dc offsets in a direct-conversion receiver can also be removed by using capacitive coupling or high-pass filtering. However, this technique requires very large capacitance and resistance values in order to remove as little lowfrequency signal energy as possible. Consequently, an implementation using on-chip passive devices is not feasible unless very high-density structures are available in the process as in [13]. As an alternative, off-chip passive structures can be used to eliminate dc offsets. However, this approach is inconsistent with the goal of a highly-integrated implementation. The following sections provide an overview of alternative techniques used to eliminate dc offsets when a direct-conversion receiver architecture is used in narrowband systems.
B.2
In TDMA systems, data is received only during certain time intervals as illustrated in Fig. B.1. In this example, the TDMA system supports four data channels and each data
231
time
...
S1
S2
S3 DC-offset cancellation
S4
S1
...
Figure B.1: TDMA time slots. channel communicates only during one of the four time slots. Consequently, the receiver for the first data channel processes data only during time slots S1 and remains idle during time slots S2, S3, and S4. In this case, dc-offset cancellation can be performed during these idle time intervals [39], [105] in a manner very similar to the autozeroing technique described in Section 5.3.5. During the idle periods, the dc offset is stored, and then during the active time slot, the offset is subtracted from the received signal. Fig. B.2 illustrates two different approaches based on this technique. In one approach, the offset is stored on a capacitor [39], while in the second approach, the offset is stored digitally and then subtracted from the received signal using a DAC [11]. The drawback of this technique is that it relies on the offset not changing between the idle and active time slots. In fact, dc offsets do vary over time as described in Section 2.3.1. In this case the dc offsets must be tracked even during the active time slots. A method which tracks the dc
RF Input (fc)
LNA RF Filter
...
baseband stages
ADC
LO (fc)
(a)
RF Input (fc)
LNA RF Filter
...
baseband stages DAC
LO (fc)
(b) Figure B.2: DC-offset cancellation. (a) Capacitive storage. (b) Feedback DAC.
232
offsets while data is being received can be used for any system, not just TDMA systems. For systems which use signal constellations centered and symmetric about the origin, such as QPSK, the dc content of the received signal is ideally zero. For such systems, an averaging circuit, such as a low-pass filter, can be used to estimate the dc content of the received signal. The estimated dc offset can then be subtracted from the received signal. This dc-offset cancellation scheme can be implemented using an adaptive LMS algorithm [56]. Consider the equivalent combiner for a feedback dc-offset correction loop illustrated in Fig. B.3, where x(k) is the received signal, g(k) is update signal, y(k) is the estimated dc-offset, d(k) is the desired value of the dc-offset, and e(k) is the error signal. The error signal e(k) is
(B.1)
Adaptation using the stochastic gradient descent method [56] results in the following update equation:
g (k + 1) = g (k ) 1 2 {e(k )} e ( k ) = g ( k ) e ( k ) g (k ) 2 g (k )
(B.2)
where is the step size. Taking the partial derivative of e(k) with respect to g(k), (B.2) becomes
g (k + 1) = g (k ) + e(k ) .
(B.3)
The stability criterion for this algorithm is determined by first setting d (k ) = LPF {x(k )} . Then the prediction error is
d(k) x(k) LPF g(k) y(k) e(k)
(B.4)
233
(B.5)
(B.6)
(B.7)
If the algorithm converges, then the parameter error vector update at time k + 1 must be less than the parameter error vector update at time k. Consequently, the summed squared parameter error increment must be negative. For a positive step size , the following relationship must be satisfied if the algorithm converges:
0 < < 2.
(B.8)
Fig. B.4 illustrates one possible implementation of a dc-offset correction loop based on this adaptive LMS algorithm. A digital low-pass filter is used to estimate the dc content of the received signal and is followed by a digital implementation of the stochastic gradient descent LMS algorithm. The dc offset is then subtracted at the output of the mixer using a DAC. This approach is similar to the one illustrated in Fig. B.2b but can be used for any system which uses a signal constellation centered and symmetric about the origin and not just TDMA systems. The effectiveness of this dc-offset cancellation loop is verified using Simulink. The simulation assumes a worst-case dc offset of 100 mV after the mixer due to LO selfRF Input (fc)
LNA RF Filter
...
baseband stages
ADC
234
0.15
0.1
0.05
51-dB gain 82-dB gain 0 0 2000 4000 6000 time ( 40 ns) 8000 10000
Figure B.5: Simulation of dc-offset cancellation loop. mixing and a worst-case dc offset of 100 mV due to systematic offsets in the baseband circuits. The digital low-pass filter has a relative bandwidth of 0.1 and an 8-bit DAC is used to subtract the dc offset at the output of the mixer. The update signal from the dcoffset correction circuit is illustrated in Fig. B.5 for two different cases: 1. weak input signal and maximum gain (82 dB); and 2. strong input signal and minimum gain (51 dB). The ripple in the update signal is due to the finite precision of the DAC. The step size used for the stochastic gradient descent LMS algorithm is 1/512, which was chosen as a compromise between convergence time and the amount of signal ripple after convergence. Finally, in addition to the digital low-pass filter, the LMS algorithm described above also incorporates an implicit low-pass filtering operation. If the digital low-pass filter is omitted from the dc-offset correction loop, the update equation becomes
g (k ) = g (k 1) + [ x(k 1) g (k 1)] .
(B.9)
z 1 X ( z) . 1 (1 ) z 1
(B.10)
235
(B.11)
In order for this filter to be stable, the pole must be located inside the unit circle, or
0 < < 2 , which is the same result derived in (B.8). Since the value of determines the
cutoff frequency of the low-pass filter, a small value of is desirable. However, a small value of also results in a very long convergence time. Consequently, including the additional digital low-pass filter in the dc-offset correction loop allows the step size to be set independently from the low-pass filter corner frequency. However, for applications where a longer convergence time is tolerable, the digital low-pass filter may actually be omitted.
236
Appendix C
Why 50 ?
C.1
Introduction
Transmission line theory calls for conjugate impedance matching for maximize power transfer from the source to the load. In order to facilitate the independent design of different components, most microwave designs are based on a standard interface impedance of 50 . Integrated-circuit implementations have already abandoned this antiquated requirement. For these implementations, the connections between on-chip components are typically much less than the signal wavelength so transmission line effects can be neglected. For example, for a 2-GHz signal, the wavelength is
c 3 10 8 m/s = = 0.15 m f 2 10 9 Hz
(C.1)
while the lengths of on-chip connections are typically no more than 1 mm. Consequently, integrated-circuit implementations do not need to adhere to the 50- requirement. Recently, the 50- requirement at the interface between external and on-chip components has also come under intense scrutiny. This appendix addresses two questions which are at the heart of the controversy over the 50- requirement. First, if maximum
237
power transfer is indeed the relevant design metric, then is 50 the optimum interface impedance? And second, is maximum power transfer even the correct design goal?
C.2
Conjugate impedance matching results in maximum power transfer from the source to the load. Under this condition, the reflection coefficient is not necessarily zero. The reflection coefficient is defined as [59] = Z s Zl Z s + Zl (C.2)
where Zs and Zl are the source and load impedances, respectively. Maximum power transfer requires Z l = Z s* , while = 0 requires Z l = Z s . For a complex source impedance, conjugate impedance matching does not eliminate reflections on the transmission line connecting the source and the load. However, if the source impedance is purely real, then the condition for maximum power transfer delivered to the load is identical to the condition for no reflections, which partially explains the choice of a standard 50- interface impedance. The choice of 50 is also based on the use of coaxial cables, where the 50- interface resistance is a compromise between the 30- resistance for maximum power capacity and the 77- resistance for minimum attenuation [25], [59]. The characteristic impedance of a coaxial line is
Z0 =
b ln 2 a
(C.3)
c =
Rs 1 1 + 2 ln(b / a ) a b
(C.4)
where Rs is the surface resistivity of the conductors, is the intrinsic impedance of the dielectric material, and a and b are the radii of the inner and outer conductors, respectively. The attenuation is minimum when x ln x = 1 + x , where x = b / a , and the
238
corresponding characteristic impedance is 77 for = 0 = 377 in free-space. The power capacity of a coaxial line is given by Pmax
a 2 E d2 b = ln 0 a
(C.5)
where Ed is the electric field strength at breakdown. The power capacity is maximum when ln(b / a ) = 1 / 2 and the corresponding characteristic impedance is 30 . Despite the integration of increasingly more components onto a single chip, many highlyintegrated transceivers still rely on an external antenna and an external RF filter. In this case, a coaxial cable usually connects the antenna to the filter, while a short board trace usually connects RF filter to the transceiver chip. The use of a coaxial cable between the antenna and the RF filter motivates the use of a 50- interface impedance. In particular, one of the most important goals of the transmitter is to efficiently deliver as much signal power as possible to the transmission medium. Conventional designs usually also rely on a 50- interface impedance between the output of the RF filter and the input of the transceiver. Commercially-available RF filters are typically designed assuming doublyterminated source and load impedances of 50 , and consequently, deviating from 50 results in poor and unpredictable RF filter performance. For a transceiver which relies on a custom RF filter, a 50- interface impedance between the antenna and the RF filter is still a prudent choice when a coaxial cable connects the two components. However, the interface impedance between the RF filter and the transceiver chip no longer needs to be 50 since a short board trace instead of a coaxial cable connects the two components. Nevertheless, the 50- requirement between the antenna and the RF filter does impose some restrictions on the interface impedance between the RF filter and the transceiver chip. In Fig. C.1, the RF filter is represented by a transmission, or ABCD, matrix:
V1 A B V2 I = C D I . 2 1
(C.6)
239
Antenna I1
+
A B CD
I2
+
Tranceiver Chip Z4 Z2
Z1 Z3
V1
V2
Figure C.1: Transmission matrix representation of the RF filter. The input and output impedances of the RF filter are, respectively,
Z3 = Z4 =
V1 AV2 + BI 2 AZ 2 + B = = I 1 CV2 + DI 2 CZ 2 + D
(C.7)
V2 DV1 BI 1 DZ 1 + B . = = I2 CV1 + AI 1 CZ 1 + A
(C.8)
Next, assuming that the RF filter is reciprocal, i.e., no active devices, ferrites, or plasmas, results in the following constraint:
AD BC = 1
(C.9)
while assuming that the RF filter is lossless yields the following additional constraints:
In order to achieve maximum power transfer between the antenna and the RF filter, a conjugate match is required:
A ( R + jX ) + jBI AR R j ( BI + AR X ) Z1 = Z = R = DR C I X = jC I R jC I ( R + jX ) + DR
* 3 *
(C.14)
where Z 2 = R + jX . Substituting (C.14) into (C.8) results in the following expression for
Z4 :
240
AR R j ( BI + AR X ) + jBI DR C I X jC I R * = R jX = Z 2 . Z4 = A R j ( BI + AR X ) jC I R + AR DR C I X jC I R DR
(C.15)
Hence, when a conjugate impedance match is required between the antenna and the RF filter, a conjugate match automatically results between the RF filter and the transceiver chip. In particular, for a 50- match between the antenna and the RF filter, the interface impedance between the RF filter and the transceiver chip must also be purely real. However, this interface impedance does not have to be 50 . For example, with fixed current consumption, a larger interface impedance results in better noise figure performance in the inductively-degenerated LNA as illustrated in Fig. 6.20. However, even for a custom RF filter implementation, the interface impedances cannot be chosen independently. For Butterworth, Bessel, and odd-order Chebyshev responses, the load and source resistances of the filter must be equal, while for an even-order Chebyshev response, the load and source resistances are related but not necessarily equal [59]. In the latter case, the relationship between the load and source resistances depends on the amount of passband ripple. An impedance transformation network may be placed between the RF filter and the transceiver to increase design flexibility at the expense of increased complexity.
C.3
One of the most important goals of the transmitter is to efficiently deliver as much signal power as possible to the transmission medium. Consequently, impedance matching for maximum power transfer is a worthy design goal. On the other hand, one of the most important goals of the receiver is to amplify a potentially weak desired signal without corrupting it with noise. In this case, impedance matching for maximum power transfer is not as critical as matching for minimum noise figure. However, when the transmitter and receiver share the same RF filter and antenna, the interface impedance between the RF filter and the LNA is usually designed for maximum power transfer out of convenience rather than necessity.
241
Impedance matching for minimum noise figure is critical for very low noise applications, such as receivers for radio astronomy. Several LNA topologies are analyzed in Sections 6.2.4 and 6.2.5 and the optimum source admittance resulting in minimum noise figure for each of these topologies is summarized in Table 6.1. Minimum noise figure is achieved by designing the output admittance of the RF filter that precedes the LNA to be equal to the required optimum source admittance. If the input admittance of the LNA is set equal to the conjugate of the optimum admittance for minimum noise figure, then a simultaneous match for minimum noise figure and maximum power transfer may be achieved. For all of the LNA topologies described in Sections 6.2.4 and 6.2.5, an impedance match which results in both minimum noise figure and maximum power transfer is impossible.
C.4
With the increasing use of CMOS technology for RF applications, the traditional microwave approach of conjugate impedance matching for maximum power transfer has become quite controversial, especially for the receiver. One of the main goals of the RF section of the receiver is to amplify the voltage of the received signal for processing by the subsequent baseband section. In this case, designing the front-end components of the receiver, including the antenna, RF filter, and LNA, for maximum voltage transfer seems to be a more appropriate design goal. In order to better understand the implications of impedance matching for either criterion, consider the following two design scenarios: 1. the source resistance is fixed but the designer has the freedom to select the load resistance; 2. both the source and load resistances are fixed but the designer has the freedom to select the network which connects the two resistances (Fig. C.2). For the first scenario, selecting the load resistance Rl to be equal to the source resistance
Rs results in maximum power transfer, Po = Vs2 /(4 Rl ) , where Vs is the source voltage. In
this case, the corresponding output voltage is Vo = Vs / 2 . In contrast, selecting Rl 242
Rs Vo Vs Rl
Figure C.2: Rs and Rl are fixed but the network connecting them is allowed to vary.
results in maximum voltage transfer, Vo = Vs , while the corresponding output power is zero. In the latter case, the power gain is zero but the voltage gain is a factor of two higher. In this scenario, the two design criteria have significantly different ramifications on the optimum value of Rl. For the second scenario, there are numerous alternatives for connecting Rs and Rl. One approach is to simply connect the two resistances directly. In this case, the output voltage and output power are, respectively,
Vo = Po =
Rl Vs Rs + Rl
(C.16)
Rl Vs2 . 2 ( Rs + Rl )
(C.17)
If Rs = 50 and Rl = 5 k, then Vo = 0.99Vs and Po = 1.96 10 4 Vs2 . A second approach is to connect Rs and Rl together, and then, assuming that Rl > Rs , connect a third resistor Rp in parallel with Rl such that the equivalent resistance is
R p || Rl = Rs . In this case, the output voltage and output power are Vo = Vs / 2 and
Po = Vs2 /[4( R p || Rl )] . If Rs = 50 and Rl = 5 k, then Vo = 0.5Vs and Po = 5 10 3 Vs2 . By connecting Rp in parallel with Rl, the equivalent resistance is matched to Rs, resulting in a higher output power but a lower output voltage than the corresponding values in the first approach. A third approach is to connect the two resistances through a transformer as illustrated in Fig. C.3 [18]. In this case, the output voltage and output power are, respectively,
243
Rs
1:n
Vo Rl
Vs
Ri
Ro
Vo = Po =
nRl Vs n Rs + Rl
2
(C.18)
n2 Vs2 (n 2 Rs + Rl ) 2
(C.19)
where n is the transformer turns ratio. The maximum output voltage and maximum output power are, respectively, Vomax =
1 Rl Vs 2 Rs (C.20)
Pomax =
Vs2 4 Rs
(C.21)
nopt =
Rl . Rs
(C.22)
In this case, if Rs = 50 and Rl = 5 k, then Vomax = 5Vs and Pomax = 5 10 3 Vs2 , both of which are significantly larger than the corresponding values in the first approach. Although the output power is identical to that in the second approach, the output voltage is significantly higher. In this approach, the two design criteria result in identical values for the optimum transformer turns ratio, and thus, designing for maximum power transfer is equivalent to designing for maximum voltage transfer. In addition, for n = nopt, the resistance Ri is equal to Rs, while the resistance Ro is equal to Rl. In other words, when
244
power transfer and voltage transfer are both maximum, Ri and Ro are matched to Rs and
Rl, respectively.
The examples for the second scenario have profound implications for the design of the LNA in highly-integrated receiver implementations. For these implementations, Rs is the fixed driving-point resistance of the external antenna, while Rl is the fixed load resistance at the output of the integrated LNA, which is given by (6.129) for a tuned load. In this case, the designer has the freedom to choose the LNA topology which connects Rs and Rl. Indeed, selecting the LNA topology for maximum voltage transfer rather than maximum power transfer is commensurate with the overall goal of sufficiently amplifying the voltage of the received signal for processing by the subsequent baseband section. Based on this design criterion, the common-source and inductively-degenerated common-source topologies are analyzed below.
C.4.1
Common-Source LNA
When the source and load resistances are connected through a transistor in the commonsource configuration as illustrated in Fig. C.4, the voltage gain is given by
Vo g m Rl = . Vs 1 + j Rs C gs
From (6.59), assuming that g g << C gs , the noise factor is
2 2 G s C gs . + + +2|c| F = 1+ g m 5 5 g m Gs
(C.23)
(C.24)
Vo Rs Rl
Vs
245
For simplicity, the long-channel expression for gm in (6.94) is substituted into (C.24), and the noise factor is minimum when
C gs = C gsopt =
Gs
3 + +2|c| 5 5
(C.25)
Fmin = 1 + = 1+
Gs 4 L 3 eff I D
0.0410 ID
3 4
1 + +2|c| 5 3 5
1 4
(C.26)
= 2 2109 rad/s,
L = Leff = 0.18 m
and
Vo = 1.48 I D Rl . Vs
(C.28)
As discussed in Section 6.2.1, designing the RF filter for a particular frequency response requires knowledge of the source and load impedances of the antenna and LNA, respectively. A standard interface impedance, e.g., 50 , allows RF filters to be designed independently from the antenna and the LNA based on doubly-terminated filter design techniques [20]. For the common-source LNA, the load to the RF filter is the gate-source capacitance of the MOS device. The RF filter is still preceded by an antenna, so the driving-point resistance of the antenna serves as the source impedance to the RF filter. The equivalent circuit for a second-order low-pass network is illustrated in Fig. C.5, and
246
Rs
L Vo
Vs
Cgs
Zin
Zout
Figure C.5: Singly-terminated RF filter. the design of this filter is based on the same approach used for singly-terminated filters [20]. The voltage transfer function is
Vo 1 = 2 Vs 1 L(C + C gs ) + j Rs (C + C gs )
(C.29)
and the following component values are required in order to achieve a second-order Butterworth low-pass frequency response:
L=
C= 2
Rs
2 c
C gs .
(C.30)
Rs c
(C.31)
Thus, the desired filter response may still be achieved in the case of a common-source LNA designed for maximum voltage transfer. However, a full reflection occurs at the interface between the antenna and the RF filter. This reflected signal is reradiated from the antenna and can potentially interfere with other receivers nearby. For low transmit power levels, e.g., 1 mW, the received signal is quite weak and the reradiated signal is even weaker, and thus, the impact of a full reflection occurring at the interface between the antenna and the RF filter is minimal.
C.4.2
When the source and load resistances are connected through an inductively-degenerated common-source LNA as illustrated in Fig. C.6, the voltage gain is given by
247
Vo Rs Lg Rl Vs
Ls
g m Rl j c ( g m Ls + Rs C gs )
(C.32)
assuming that
2 = c2 =
1 . C gs ( Lg + Ls )
(C.33)
g m Rl . j Rs C gs
(C.34)
Rs =
and the voltage gain becomes
Vo Vs
g m Ls C gs
(C.35)
=
g m Ls = Rs C gs
g m Rl . j 2 Rs C gs
(C.36)
Thus, designing for a power match at the LNA input degrades the voltage gain by a factor of two. Relying on the long-channel expression for gm in (6.94), the minimum noise figure is given by (6.96) and repeated here for convenience:
248
Fmin = 1 + = 1+
Gs 4 L eff I D 3
0.0148 ID .
1 + +2|c| 5 5 3 5
3 4
1 4
(C.37)
(C.38)
Thus, both the voltage gain and the noise performance of the inductively-degenerated common-source topology are superior. In addition, even if this LNA is designed for a power match at the input, the voltage gain is still superior to that of the common-source topology. In the case of a power match, the LNA input resistance is equal to the driving-point resistance of the antenna. In this case, design of the preceding RF filter for a particular response is based on the approach used for doubly-terminated filters already described in Section 6.2.1. However, when Ls = 0, the input impedance of the LNA is zero at = c, and the preceding RF filter is neither singly terminated nor doubly terminated. Thus, for applications which require an RF filter for increased selectivity, this latter approach may not be feasible. Nevertheless, even when the inductively-degenerated common-source LNA is designed for a power match, its voltage gain is superior to that of the commonsource topology designed for maximum voltage transfer.
C.5
A common circuit model for the components which precede the LNA, such as the antenna, is illustrated in Fig. C.7. In this case, the source resistance Rs is usually 50 and represents the driving-point impedance of the antenna, while Vs represents the voltage of the received signal from the antenna. One potential pitfall of this model is the implied independence between the voltage Vs and the resistance Rs, which may lead to the incorrect conclusion that the input SNR can be arbitrarily improved by decreasing Rs for
249
Rs Vo
Vs
Figure C.7: Antenna circuit model. a fixed transmit power level. The most important design goal for receivers is maximizing the output SNR: SNRo [dB] = SNRi [dB] NF [dB] . (C.39)
In order to maximize the output SNR, the receiver noise figure should be minimized while the input SNR should be maximized. The open-circuit voltage of the antenna is Voc = Vs and the noise voltage due to Rs is Vn = 4kTRs f . Therefore, the input SNR is SNRi =
2 Voc Vs2 = . Vn2 4kTRs f
(C.40)
(C.41)
(C.42)
If Vs and Rs are independent, then the input SNR may be increased by decreasing Rs. In reality, this is not the case and Vs actually depends on Rs. The vertical whip antenna is an antenna commonly used in mobile devices for cellular communications systems. For a vertical whip antenna over a group plane, the radiation resistance and series capacitance are given by [106], respectively,
h Rr = 40
2 2
(C.43) (C.44)
C a [pF] =
250
where h is the antenna height, is the wavelength, and a is the antenna diameter. The resistance seen at the driving-point is typically larger than the radiation resistance due to additional losses such as that resulting from the physical resistance of the antenna. The open-circuit voltage of the antenna is given by Voc = Vs = Eh (C.45)
where E is the electric field strength and h is the antenna height, and the input SNR is
2 2 Voc Voc ( E ) 2 = . SNRi = 4kTRs f 4kTRr f 40 2
(C.46)
Thus, the input SNR depends only on the electric field strength and the signal wavelength and cannot be arbitrarily increased by decreasing the driving-point resistance as suggested by (C.42). Consequently, if the transmit power level is fixed, then the only way to increase the output SNR is by decreasing the receiver noise figure.
C.6
Summary
Two questions were posed in the introduction to this appendix: 1. If maximum power transfer is indeed the relevant design metric, then is 50 the optimum interface impedance? 2. Is maximum power transfer even the correct design goal? As discussed in Section C.2, the 50- requirement is a legacy from designs based on coaxial cables, for which 50 is a good compromise between maximum power capacity and minimum attenuation. The use of a coaxial cable to connect the antenna to the RF filter in many transceivers still motivates the use of 50- interface impedance. In particular, maximum power transfer is one of the most important design goals of the transmitter, which much efficiently deliver as much signal power as possible to the transmission medium. In addition, commercially-available RF filters are typically designed assuming doubly-terminated source and load impedances of 50 , and consequently, deviating from 50 results in poor and unpredictable RF filter performance.
251
For transceivers which rely on stripline or microstrip transmission lines rather than coaxial cables to connect the RF front-end components, maximum power transfer is still an appropriate design goal, particularly for the transmitter. Although the interface impedance no longer needs to be 50 , deviating from a 50- interface impedance dictates the use of a custom RF filter. For broadcast applications which require a receiver rather than a transceiver, the design of the receiver is no longer constrained by the maximum power transfer requirement of the transmitter. For these applications, the receiver may be designed for other metrics such as minimum noise figure or maximum voltage transfer. Again, for both cases, the interface impedance no longer needs to be 50 . However, deviating from a 50- interface impedance again dictates the use of a custom RF filter.
252
Appendix D
Inductor Test Structures
D.1
Introduction
As described in Section 6.2.6, the inductively-degenerated differential LNA used in this receiver prototype relies on on-chip spiral inductors. A test chip (RFTRIPLED) was fabricated in order to evaluate the performance of various inductor structures. The process consists of six metal layers and a single polysilicon layer. The top two metal layers have a sheet resistance of 35 m/ while the first metal layer has a sheet
resistance of 250 m/ . The sheet resistance of the remaining metal layers is 55 m/ . This process uses a low-resistivity 10-m-cm silicon substrate with a 10--cm epitaxial layer. A total of seven different inductor structures were designed in ASITIC, a tool which provides rapid analysis, design, and optimization of inductors [107], [108]. The geometries of the seven inductors are summarized in Table D.1 and the parameters D, W, S, and N are defined in Fig. D.1. The first test structure is a planar spiral inductor implemented using only the top layer of metal, while the second test structure includes a patterned polysilicon ground shield [109]. This ground shield prevents the inductor electric field from penetrating the silicon substrate, which degrades the quality factor of the inductor, while patterning the shield prevents current flow which reduces the overall
253
inductor 1 2 3 4 5 6 7
description m6 m6 with polysilicon shield m5/m6 shunt m4/m5/m6 shunt m4/m5/m6 shunt with polysilicon shield m5/m6 series m2/m6 solenoid
S (m) 2 2 2 2 2 2 2
D
W
W
S N turns
N turns
(a)
(b)
Figure D.1: Definition of geometric parameters. (a) Inductors 1 6. (b) Inductor 7. inductance. One disadvantage of using a ground shield is that the capacitance between the inductor and ground increases, resulting in a lower self-resonance frequency. Another source of degradation in the inductor quality factor is the resistance of the metal used to implement the inductor. The third and fourth test structures are implemented using the top two and top three layers of metal, respectively. These metal layers are all shorted together with numerous vias in order to reduce the series resistance of the inductor. As in the case of using a ground shield, using multiple metal layers also increases the quality factor of the inductor at the expense of lowering the self-resonance frequency. The fifth test structure combines both techniques, implementing the inductor using the top three layers of metal all shorted together as well as including a patterned polysilicon ground shield.
254
Finally, the sixth test structure is implemented using the top two layers of metal connected together in series, while the seventh inductor is a solenoid structure implemented using the second and sixth layers of metal as illustrated in Fig. D.1b.
D.2
ASITIC was used to simulate four of the inductor test structures. The equivalent circuit used to model the inductors is illustrated in Fig. 6.27. The simulated component values at 2 GHz as well as the self-resonance frequency of each of the inductors are summarized in Table D.2.
description m6 m5/m6 m4/m5/m6 m5/m6 series L (nH) Rs () C1 (fF) R1 () C2 (fF) R2 () fs (GHz) 6.21 11.2 125 7.27 94.3 18.2 5.71 5.97 7.57 158 8.61 122 14.5 5.18 5.79 6.77 200 8.49 160 14.3 4.68 7.37 21.7 86.4 6.17 188 10.7 6.29 Q 6.0, 6.2 8.2, 8.5 8.4, 8.8 3.8, 3.2
D.3
The layout of the seven inductor test structures on the RFTRIPLED test chip is illustrated in Fig. D.2. All inductors are connected to the pad structure illustrated in Fig. D.3. The dimensions of the pad structure are designed to be compatible with GS-SG probes with a 125-m pitch. The signal pads consist of the top three metal layers, all shorted together, while a fourth lower layer of metal (m2) acts as a ground shield [72]. The ground pad is implemented using the top five layers of metal, all shorted together. A ground ring consisting of the bottom three metal layers is placed around each of the inductor test structures at a distance of 50 m, and all of these ground rings are connected to the substrate through a large number of substrate contacts. A patterned polysilicon ground shield is positioned below two of the inductor test structures. The shields are connected to ground along the ground rings which surround the inductors. The layout of the polysilicon shield is illustrated in Fig. D.4.
255
80 m
24 m
130 m
D.4
Measurement Results
The S parameters for each of the inductor test structures were measured using an HP8719C network analyzer. The component values at 2 GHz for the equivalent circuit illustrated in Fig. 6.27 were extracted from the measured S parameters and these results are summarized in Table D.3 along with the self-resonance frequency of each of the
256
Figure D.4: Patterned polysilicon ground shield. inductors. The dc resistances of some of the inductor test structures were also measured and are summarized in Table D.4. The measured quality factors are significantly worse than the values predicted by ASITIC. Because this process uses a low-resistivity substrate, eddy currents play a significant role in limiting the achievable quality factor in this process. The version of ASITIC used for the initial simulations does not account for the effect of eddy currents flowing in the substrate. Although the most recent version of ASITIC does include eddy current effects, this version was not available at the time of initial simulations. The inductor test structures were simulated again in the latest version of ASITIC and these results are summarized in Table D.5. The quality factors predicted by the most recent
description m6 m6 with polysilicon shield m5/m6 m4/m5/m6 m4/m5/m6 with polysilicon shield m5/m6 series m2/m6 solenoid L (nH) Rs () C1 (fF) R1 () C2 (fF) R2 () fs (GHz) Q 5.7 22.5 170 35 150 5.7 2.5, 2.8 18 5.6 5.45 5.25 5.25 9 2 19.5 19.8 15.4 15.9 82 37 160 190 220 246 270 580 8 50 19 4 16 21 160 190 235 253 130 320 8 40 18 11 50 13 5.5 5.2 4.8 4.6 2.2 5.4 3, 3.1 2.6, 3 3.3, 3.6 3.1, 3.3 0.5, 1 0.25, 0.45
Table D.3: Summary of measured results from the inductor test chip.
257
description m6 m6 with polysilicon shield m5/m6 shunt m4/m5/m6 shunt m4/m5/m6 shunt with polysilicon shield
Table D.5: Summary of inductor simulation results from ASITIC version 3.19.00.
description m6 m5/m6 L (nH) Rs () C1 (fF) R1 () C2 (fF) R2 () fs (GHz) Q 6.2 18 143 5 136 18 3.6, 3.6 5.9 14.2 178 8 168 12.5 4.1, 4.2
Table D.6: Summary of inductor simulation results from Momentum. version of ASITIC are much closer to the measured values. A couple of the inductors were also simulated in Momentum, an electromagnetic simulation tool which is part of the Advanced Design System software from Agilent. The simulation results at 2 GHz are summarized in Table D.6. From the measured results, the inductor implemented using the top three metal layers, all shorted together, offers the best quality factor. Consequently, this configuration is used to implement the on-chip spiral inductors for the inductively-degenerated differential LNA in the receiver prototype.
258
References
[1] [2] [3] [4] [5] Bluetooth, http://www.bluetooth.com. S. V. Saliga, An introduction to IEEE 802.11 wireless LANs, 2000 IEEE Radio Frequency Integrated Circuits Symposium, pp. 1114, June 2000. European Telecommunications Standards Institute, http://www.etsi.org. H. D. Graves, A detailed analysis of MMDS and LMDS, 1997 IEEE MTT-S Symposium on Technologies for Wireless Applications, pp. 710, February 1997. N. Zhang, A. Poon, D. Tse, R. Brodersen, and S. Verdu, Trade-offs of performance and single chip implementation of indoor wireless multi-access receivers, IEEE Wireless Communications and Networking Conference, pp. 226 230, September 1999. Murata Electronics North America, Inc., http://www.murata.com. A. Abidi, Direct-conversion radio transceivers for digital communications, IEEE Journal of Solid-State Circuits, vol. 30, pp. 13991410, December 1995. C. Hull, J. Tham, and R. Chu, A direct-conversion receiver for 900 MHz (ISM band) spread-spectrum digital cordless telephone, IEEE Journal of Solid-State Circuits, vol. 31, pp. 19551963, December 1996. T. Cho, E. Dukatz, M. Mack, D. Macnally, M. Marringa, S. Mehta, C. Nilson, L. Plouvier, and S. Rabii, A single-chip CMOS direct-conversion transceiver for 900 MHz spread-spectrum digital cordless phones, IEEE International SolidState Circuits Conference, pp. 228229, February 1999. P. R. Gray and R. G. Meyer, Analysis and Design of Analog Integrated Circuits. New York: John Wiley, 1993. J. C. Rudell, J.-J. Ou, T. B. Cho, G. Chien, F. Brianti, J. A. Weldon, and P. R. Gray, A 1.9-GHz wide-band IF double conversion CMOS receiver for cordless telephone applications, IEEE Journal of Solid-State Circuits, vol. 32, pp. 2071 88, December 1997. J. Crols and M. S. J. Steyaert, A single-chip 900 MHz CMOS receiver front-end with a high performance low-IF topology, IEEE Journal of Solid-State Circuits, vol. 30, pp. 14831492, December 1995.
[9]
[10] [11]
[12]
259
[13]
J. F. Wilson, R. Youell, T. H. Richards, G. Luff, and R. Pilaski, A single-chip VHF and UHF receiver for radio paging, IEEE Journal of Solid-State Circuits, vol. 26, pp. 19441950, December 1991. J. Crols and M. Steyaert, CMOS Wireless Transceiver Design. Boston: Kluwer Academic Publishers, 1997. B. Law and M. Groh, Identifying RF-related impairments in full-service digital networks, Microwave Journal, vol. 39, pp. 88, 90, 92, 94, March 1996. J. G. Proakis, Digital Communications. New York: McGraw-Hill, 1995. Digi-Key Corporation, http://www.digikey.com. S. Sheng and R. Brodersen, Low-Power CMOS Wireless Communications: A Wideband CDMA System Design. Boston: Kluwer Academic Publishers, 1998. H. Meyr, M. Moeneclaey, and S. A. Fechtel, Digital Communication Receivers: Synchronization, Channel Estimation, and Signal Processing. New York: John Wiley,1998. M. E. Van Valkenburg, Analog Filter Design. New York: Oxford University Press, 1982. A. V. Oppenheim and R. W. Schafer, Discrete-Time Signal Processing. New Jersey: Prentice-Hall, 1989. C. M. Teuscher, Low Power Receiver Design for Portable RF Applications: Design and Implementation of an Adaptive Multiuser Detector for an Indoor, Wideband CDMA Application, Ph.D. Dissertation, University of California, Berkeley, 1998. J. C. Rudell, J. A. Weldon, J.-J. Ou, L. Lin, and P. R. Gray, An Integrated GSM/DECT Receiver: Design Specifications, University of California, Berkeley, Electronics Research Laboratory, Memorandum No. UCB/ERL M97/82, 1997. H. T. Friis, Noise Figure of Radio Receivers, Proceedings of the IRE, vol. 32, pp. 419422, July 1944. T. H. Lee, The Design of CMOS Radio-Frequency Integrated Circuits. New York: Cambridge University Press, 1998. R. G. Meyer, Class Notes, EECS 242: Advanced Integrated Circuits for Communications, University of California, Berkeley, Spring 1995. K. S. Kundert, Introduction to RF simulation and its application, IEEE Journal of Solid-State Circuits, vol. 34, pp. 12981319, September 1999.
[23]
260
The MathWorks, Inc., Band-Limited White Noise, Using Simulink Version 3, pp. 8-188-19, January 1999. The MathWorks, Inc., How do I generate pink noise in MATLAB or Simulink?, http://www.mathworks.com/support/solutions/data/8333.shtml. N. J. Kasdin, Discrete simulation of colored noise and stochastic processes and 1/f power law noise generation, Proceedings of the IEEE, vol. 83, pp. 802827, May 1995. C. H. Doan, Design and Implementation of a Highly-Integrated Low-Power CMOS Frequency Synthesizer for an Indoor Wireless Wideband-CDMA DirectConversion Receiver, M.S. Project Report, University of California, Berkeley, 2000. D. A. Sobel, University of California, Berkeley, personal communication. S. Brigati, F. Francesconi, P. Malcovati, D. Tonietto, A. Baschirotto, and F. Maloberti, Modeling sigma-delta modulator non-idealities in SIMULINK Proceedings of the 1999 IEEE International Symposium on Circuits and Systems, pp. 384387, May 1999. D. A. Sobel, A 25 Ms/s Nyquist-Rate Sigma-Delta Modulator for a Wideband CDMA Receiver, M.S. Project Report, University of California, Berkeley, Electronics Research Laboratory, Memorandum No. UCB/ERL M00/32, May 2000. C. Teuscher, D. Yee, N. Zhang, and R. Brodersen, Design of a wideband spread spectrum radio using adaptive multiuser detection, Proceedings of the 1998 IEEE International Symposium on Circuits and Systems, pp. 593599, May 1998. J. P. Vanderhaegen, Design and Implementation of a Wideband CDMA Basestation Transmitter, M.S. Project Report, University of California, Berkeley, 1999. I. Katzela and M. Naghshineh, Channel assignment schemes for cellular mobile telecommunication systems: a comprehensive survey, IEEE Personal Communications, vol. 3, pp. 1031, June 1996. S. Blackman, A Low Power, 8-bit, 200 MHz Digital-to-Analog Converter, M.S. Project Report, University of California, Berkeley, Electronics Research Laboratory, Memorandum No. UCB/ERL M99/61, 1999. B. Razavi, RF Microelectronics. New Jersey: Prentice-Hall, 1998. R. C. V. Macario, Cellular Radio: Principles and Design. New York: McGrawHill, New York, 1997.
[31]
[32] [33]
[34]
[35]
[36]
[37]
[38]
[39] [40]
261
[41] [42]
S. E. Alexander, Radio propagation within buildings at 900 MHz, Electronics Letters, vol. 18, pp. 913914, October 1982. J.-F. Lafortune and M. Lecours, Measurement and modeling of propagation losses in a building at 900 MHz, IEEE Transactions on Vehicular Technology, vol. 39, pp. 101108, May 1990. G. K. Chan and F. A. Razaqpur, Spectrum requirements of an indoor pico-cell radio system, IEEE Transactions on Vehicular Technology, vol. 44, pp. 2430, February 1995. A. J. Motley and J. M. P. Keenan, Personal communication radio coverage in buildings at 900 MHz and 1700 MHz, Electronics Letters, vol. 24, pp. 763764, June 1988. A. M. D. Turkmani and A. F. de Toledo, Modelling of radio transmissions into and within multistorey buildings at 900, 1800 and 2300 MHz, IEE Proceedings I, vol. 140, pp. 462470, December 1993. B. Razavi, A 2.4-GHz CMOS receiver for IEEE 802.11 wireless LANs, IEEE Journal of Solid-State Circuits, vol. 34, pp. 13821385, October 1999. A. Prssinen, J. Jussila, J. Ryynnen, L. Sumanen, and K.Halonen, A 2-GHz wide-band direct conversion receiver for WCDMA applications, IEEE Journal of Solid-State Circuits, vol. 34, pp. 18931903, December 1999. C. C. Enz and G. C. Temes, Circuit techniques for reducing the effects of opamp imperfections: autozeroing, correlated double sampling, and chopper stabilization, Proceedings of the IEEE, vol. 84, pp. 15841614, November 1996. K. K. Onodera, Low Power Techniques for High-Speed Wireless Systems, Ph.D. Dissertation, University of California, Berkeley, 2000. QUALCOMM Incorporated, http://www.qualcomm.com. K. Bult, Analog broadband communication circuits in pure digital deep submicron CMOS, IEEE International Solid-State Circuits Conference, pp. 7677, February 1999. S. Verdu, Multiuser Detection. Cambridge: Cambridge University Press, 1998. R. G. Meyer and W. D. Mack, Monolithic AGC loop for a 160 Mb/s transimpedance amplifier, IEEE Journal of Solid-State Circuits, vol. 31, pp. 13311335, September 1996.
[43]
[44]
[45]
[46] [47]
[48]
[52] [53]
262
[54]
K. M. Cho, Optimum gain control for A/D conversion using digitizing I/Q data in quadrature sampling, IEEE Transactions on Aerospace and Electronic Systems, vol.27, pp.178181, January 1991. A. Leon-Garcia, Probability and Random Processes for Electrical Engineering. Massachusetts: Addison-Wesley, 1994. S. S. Haykin, Adaptive Filter Theory. New Jersey: Prentice-Hall, 1996. P. S. R. Diniz, Adaptive Filtering: Algorithms and Practical Implementation. Boston: Kluwer Academic Publishers, 1997. G. Tavares and M. S. Piedade, High performance algorithms for digital signal processing AGC, IEEE International Symposium on Circuits and Systems, pp. 15291532, May 1990. D. M. Pozar, Microwave Engineering. New York: John Wiley, 1998. G. D. Vendelin, A. M. Pavio, and U. L. Rohde, Microwave Circuit Design Using Linear and Nonlinear Techniques. New York: John Wiley, 1990. A. A. Abidi, High-frequency noise measurements on FETs with small dimensions, IEEE Transactions on Electron Devices, vol. ED-33, pp. 1801 1805, November 1986. R. P. Jindal, Hot-electron effects on channel thermal noise in fine-line NMOS field-effect t ransistors, IEEE Transactions on Electron Devices, vol. ED-33, pp. 13951397, September 1986. B. Wang, J. R. Hellums, and C. G. Sodini, MOSFET thermal noise modeling for analog integrated circuits, IEEE Journal of Solid-State Circuits, vol. 29, pp. 833 835, July 1994. D. K. Shaeffer and T. H. Lee, A 1.5-V, 1.5-GHz CMOS low noise amplifier, IEEE Journal of Solid-State Circuits, vol. 32, pp. 745759, May 1997. A. van der Ziel, Noise in Solid State Devices and Circuits. New York: John Wiley, 1986. S. Wolf, Silicon Processing for the VLSI Era, Volume 3: The Submicron MOSFET. Sunset Beach, California: Lattice Press, 1995. S. Sheng, L. Lynn, J. Peroulas, K. Stone, I. O'Donnell, and R. Brodersen, A lowpower CMOS chipset for spread spectrum communications, IEEE International Solid-State Circuits Conference, pp. 346347, February 1996.
[62]
[63]
263
[68]
A. R. Shahani, D. K. Shaeffer, and T. H. Lee, A 12-mW wide dynamic range CMOS front-end for a portable GPS receiver, IEEE Journal of Solid-State Circuits, vol. 32, pp. 20612070, December 1997. B. Razavi, R.-H. Yan, and K. F. Lee, Impact of distributed gate resistance on the performance of MOS devices, IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications, vol. 41, pp. 750754, November 1994. Q. Huang, P. Orsatti, and F. Piazza, Broadband, 0.25m CMOS LNAs with sub2dB NF for GSM applications, Proceedings of the IEEE 1998 Custom Integrated Circuits Conference, pp. 6770, May 1998. Q. Huang, P. Orsatti, and F. Piazza, GSM transceiver front-end circuits in 0.25m CMOS, IEEE Journal of Solid-State Circuits, vol. 34, pp. 292303, March 1999. A. Rofougaran, J. Y.-C. Chang, M. Rofougaran, and A. A. Abidi, A 1 GHz CMOS RF front-end IC for a direct-conversion wireless receiver, IEEE Journal of Solid-State Circuits, vol. 31, pp. 880889, July 1996. J.-J. Ou, University of California, Berkeley, personal communication, 1999. B. Limketkai, The Design and Implementation of a Downconversion Mixer for a Wideband CDMA Receiver, M.S. Project Report, University of California, Berkeley, Electronics Research Laboratory, Memorandum No. UCB/ERL M99/66, December 1999. Philips Semiconductors, http://www.semiconductors.philips.com. D. B. Leeson, A simple model of feedback oscillator noise spectrum, Proceedings of the IEEE, vol. 54, pp. 329330, February 1966. B. Razavi, A study of phase noise in CMOS oscillators, IEEE Journal of SolidState Circuits, vol. 31, pp. 331343, March 1996. L. Lin, L. Tee, and P. R. Gray, A 1.4GHz differential low-noise CMOS frequency synthesizer using a wideband PLL architecture, IEEE International Solid-State Circuits Conference, pp. 204205, February 2000. J. M. Rabaey, Digital Integrated Circuits: A Design Perspective. New Jersey: Prentice-Hall, 1996. J. Crols and S. J. Steyaert, A 1.5 GHz highly linear CMOS downconversion mixer, IEEE Journal of Solid-State Circuits, vol. 30, pp. 736742, July 1995. B. Gilbert, A precise four-quadrant multiplier with subnanosecond response, IEEE Journal of Solid-State Circuits, vol. SC-3, pp. 365373, December 1968.
[69]
[70]
[71]
[72]
[73] [74]
264
R. P. Sallen and E. L. Key, A practical method of designing RC active filters, IRE Transactions on Circuit Theory, vol. CT-2, pp. 7485, March 1955. M. S. Ghausi and K. R. Laker, Modern Filter Design: Active RC and Switched Capacitor. New Jersey: Prentice-Hall, 1981. A. S. Sedra and K. C. Smith, Microelectronic Circuits. Philadelphia: Saunders College Publishing, 1991. Y. Tsividis, M. Banu, and J. Khoury, Continuous-time MOSFET-C filters in VLSI, IEEE Journal of Solid-State Circuits, vol. SC-21, pp. 1529, February 1986. H. Khorramabadi and P. R. Gray, High-frequency CMOS continuous-time filters, IEEE Journal of Solid-State Circuits, vol. SC-19, pp. 939948, December 1984. C. A. Laber and P. R. Gray, A 20-MHz sixth-order BiCMOS parasiticinsensitive continuous-time filter and second-order equalizer optimized for diskdrive read channels, IEEE Journal of Solid-State Circuits, vol. 28, pp. 462470, April 1993. R. Gregorian, K. W. Martin, and G. C. Temes, Switched-capacitor circuit design, Proceedings of the IEEE, vol. 71, pp. 941966, August 1983. R. Castello and P. R. Gray, Performance limitations in switched-capacitor filters, IEEE Transactions on Circuits and Systems, vol. CAS-32, pp. 865876, September 1985. R. S. Muller and T. I. Kamins, Device Electronics for Integrated Circuits. New York: John Wiley, 1986. G. T. Uehara, Circuit Techniques and Considerations for Implementation of High Speed CMOS Analog-to-Digital Interfaces for DSP-Based PRML Magnetic Disk Read Channels, Ph.D. Dissertation, University of California, Berkeley, Electronics Research Laboratory, Memorandum No. UCB/ERL M93/96, August 1993. T. B. Cho, Low-Power Low-Voltage Analog-to-Digital Conversion Techniques using Pipelined Architectures, Ph.D. Dissertation, University of California, Berkeley, Electronics Research Laboratory, Memorandum No. UCB/ERL M95/23, 1995. T. B. Cho and P. R. Gray, A 10 b, 20 Msample/s, 35 mW pipeline A/D converter, , IEEE Journal of Solid-State Circuits, vol. 30, pp. 166172, March 1995.
[86]
[87]
[88] [89]
[90] [91]
[92]
[93]
265
[94]
T. B. Cho, G. Chien, F. Brianti, and P. R. Gray, A power-optimized CMOS baseband channel filter and ADC for cordless applications, 1996 Symposium on VSLI Circuits, pp. 6465, June 1996 G. Chien, High Speed, Low Power, Low Voltage Pipelined Analog-to-Digital Converter, M.S. Project Report, University of California, Berkeley, Electronics Research Laboratory, Memorandum No. UCB/ERL M96/27, 1996. H.-K. Yang and M. Snelgrove, Symbol timing recovery using oversampling techniques, IEEE International Conference on Communications, pp. 12961300, June 1996. A. R. Feldman, High-Speed, Low-Power Sigma-Delta Modulators for RF Baseband Channel Applications, Ph.D. Dissertation, University of California, Berkeley, Electronics Research Laboratory, Memorandum No. UCB/ERL M97/62, 1997. D. G. Yee, A Design Methodology for Highly-Integrated Low-Power Receivers for Wireless Communications, Ph.D. Dissertation, University of California, Berkeley, 2001. M. C. Chang, C. Y. Wu, G. L. Lin, I. C. Liao, Y. H. Lee, and C. N. Chen, Degradation of MOS transistor characteristics by gate charging damage during plasma processing, 1993 International Symposium on VLSI Technology, Systems, and Applications, pp. 320324, May 1993.
[95]
[96]
[97]
[98]
[99]
[100] D. K. Shaeffer and T. H. Lee, The Design and Implementation of Low-Power CMOS Radio Receivers. Boston: Kluwer Academic Publishers, 1999. [101] A. Z. Wang, A new design for complete on-chip ESD protection, Proceedings of the IEEE 2000 Custom Integrated Circuits Conference, pp. 8790, May 2000. [102] M.-D. Ker, T.-Y. Chen, C.-Y. Wu, and H.-H. Chang, ESD protection design on analog pin with very low input capacitance for RF or current-mode applications, Twelfth Annual IEEE International ASIC/SOC Conference, pp. 352356, September 1999. [103] Y. L. Low, Y. Degani, K. V. Guinn, T. D. Dudderrar, J. A. Gregus, and R. C. Frye, RF flip-module BGA package, IEEE Transactions on Advanced Packaging, vol. 22, pp. 111115, May 1999. [104] L. Moult and J. E. Chen, The K-model: RF IC modeling for communication systems simulation, IEE Colloquium Analog Signal Processing, pp. 11/111/8, October 1998. [105] H. Yoshida, H. Tsurumi, and Y. Suzuki, DC offset canceller in a direct conversion receiver for QPSK signal reception, Proceedings of Ninth
266
and
Mobile
Radio
[106] U. L. Rohde, J. C. Whitaker, and T. T. N. Bucher, Communications Receivers: Principles and Design. New York: McGraw-Hill, 1997. [107] A. M. Niknejad and R. G. Meyer, Analysis, design, and optimization of spiral inductors and transformers for Si RF ICs, IEEE Journal of Solid-State Circuits, vol. 33, pp. 14701481, October 1998. [108] ASITIC, http://formosa.eecs.berkeley.edu/~niknejad/asitic.html. [109] C. P. Yue and S. S. Wong, On-chip spiral inductors with patterned ground shields for Si-based RF ICs, IEEE Journal of Solid-State Circuits, vol. 33, pp. 743752, May 1998.
267