0% found this document useful (0 votes)
57 views33 pages

Lecture10 CDR

Uploaded by

Ahmed Shafeek
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
57 views33 pages

Lecture10 CDR

Uploaded by

Ahmed Shafeek
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 33

Clock and Data Recovery:

Architectures
Lecture 10
Reference-less PLL-based CDRs

Phase
locking

Freq.
locking

• The freq. detector (FD) provides an output proportional to the frequency error
between Din & VCO output → no need for external clock frequency
• At start-up, the FD drives the VCO frequency close to Din then PD takes over to
lock on Din phase:
• Aided Acquisition allows increasing loop locking range without sacrificing the
VCO ripple (or increasing fBB)
• The PD & FD may interfere during transferring over to the PD mode, also FD
might be confused with long CIDs→ control voltage ripple or failure/false lock
[Ref]: “Architectures for Multi-Gigabit Wire-Linked Clock and Data Recovery,” IEEE Circ. & Sys. Magazine, 2008
Cairo University Selected topics in wireline transceiver 2
circuits
Reference-less PLL-based CDRs (Cont’d)

• Each loop can drive it own CP & loop filter


• Frequency tracking is typically necessary at power up or to track slow
temperature drift → speed is not important
• Loop BW of frequency tracking loop should be << BW of phase tracking loop
• Large BW of phase tracking loop improves CDR jitter tolerance
• Disadvantage: large area → Could combine using analog & digital loop filters
Cairo University Selected topics in wireline transceiver 3
circuits
Frequency Detectors (Quadri-correlator)
d  A1 A2 
X out (t ) =  cos (1 −  )
2 
t
dt  2 
=−
A1 A2
(1 − 2 )sin (1 − 2 )t
2

cos(1 − 2 )t
A1 A2
X A (t ) =
2
sin (1 − 2 )t
A1 A2
X B (t ) =
2
  1 cos 2(1 − 2 )t 
2
  A1 A2  
X out (t ) = LPF −   1 ( −  2 )   − 

  2   2 2 

• X1(t) has to have a spectral line at fb→ Edge detector
• Close to lock, the component at 2(1-2) may cause a large ripple at
the output
Cairo University Selected topics in wireline transceiver 4
circuits
Random Data Balanced Quadri-correlator
A

B
2
AA 
X A (t ) = 1 2 cos(1 − 2 )t
AA
X out (t ) = − 1 2  (1 − 2 ) sin 2 (1 − 2 )t
2  2 

X B (t ) = 1 2 sin (1 − 2 )t
AA  A1 A2 
2

−  (1 − 2 ) cos (1 − 2 )t


2
2  2 

• The balanced quadri-correlator suppresses the


component at 2(1-2) improving the lock behavior
Cairo University Selected topics in wireline transceiver 5
circuits
Creating a Spectral line at fb (Edge Detection )

Differentiation is a linear operation:


will not create a spectral line

• Random data Din doesn't include a spectral line at fb


• Differentiating Din still produces 0 when correlated with cos(bt+) →
No Edge detection
• If the –ve impulses are converted to +ve impulses the correlation will
yield a non-zero value
• Differentiation & Rectification produces a spectral line at fb →Edge
detection
Cairo University Selected topics in wireline transceiver 6
circuits
Digital Quadri-correlator
FCLK>Fb

XA=1
FCLK-Fb

Double edge
FCLK<Fb
sampling FF

XA=0

• If FCLK>Fb, at t=t1 XA samples a 1 & XB samples 0


• Next sampling occurs at t=t1+TCLK+
• Eventually, XB will sample a 1 resulting in Vout=1
• FD exhibits a bang-bang characteristics
Cairo University Selected topics in wireline transceiver 7
circuits
PLL-based CDRs with External Reference Clock
• Advantages:
✓ Availability of separate VCO2 control →
faster acquisition time for phase
locking
• Disadvantages:
X Sensitive to mismatches between
VCO1 & VCO2 (f1f2)
X If the Tx has finite frequency offset the
Shared among multi-lanes
2 VCOs will be running at different
frequencies → frequency pulling
• Freq. tracking loop locks VCO2 output to Fref*M
• VCO2 control voltage is applied to VCO1 (VCO2 is a replica of VCO1)
• This centers VCO1 around the input data rate & allows the phase
tracking loop to lock the phase of VCO1 to Din
• The gain of the phase tracking loop must be (fine step) << the gain of
the frequency tracking loop (coarse step)
Cairo University Selected topics in wireline transceiver 8
circuits
PLL-based CDRs with External Reference Clock
• Advantages:
✓ Eliminates the extra CP, LF, &
VCO→ no mismatch issues
• Disadvantages:
X Transition from freq. to phase
tracking loops might cause
disruption to VCO control
causing freq. shift → failure to
lock or ripple on VCO control

• A lock detector (LD) can be used to sequentially switch


between the freq. & phase tracking loops
• When the LD senses that the difference between Fref and
Fvco/M is small it switches to the phase tracking loop
Cairo University Selected topics in wireline transceiver 9
circuits
Delay Lock Loop (DLL)-based CDR
• Advantages:
✓ VCDL directly alters the clk
phase → no freq. to phase
integration (1 less pole):
more stable
✓ Faster locking
✓ No jitter accumulation
Shared among • Disadvantages:
multi-lanes X Limited capture range
• Freq. tracking loop provides a high speed clk instead of a voltage
• Phase tracking loop uses a voltage-controlled delay line (VCDL) to
lock to the phase of Din
• Fref*M must be the same as the input data rate, suitable for source-
synchronous applications (chip-to-chip)
Cairo University Selected topics in wireline transceiver 10
circuits
Phase Interpolator (PI)-based CDR
• Advantages:
✓ Increases system stability
✓ Faster locking
✓ No jitter accumulation
✓ No jitter peaking (if loop latency
is small)
✓ Can operate over limited range
of frequency offset
Shared among • Disadvantages:
multi-lanes X Need to deliver quadrature clks
across the chip
• Similar to the DLL, but uses a digital loop filter DLF, current DAC, PI
• Output of the I-DAC is a current  to DLF output → controls how
much gain is assigned to the I-clk & Q-clk to control the recovered
clk phase
Cairo University Selected topics in wireline transceiver 11
circuits
Ideal Phase Interpolation
CKQ
Starting with 2 quadrature clocks, with amplitude A:
𝐶𝐾𝐼 = 𝐴𝑐𝑜𝑠 𝜔𝑡
CK
𝐶𝐾𝑄 = 𝐴𝑐𝑜𝑠 𝜔𝑡 − 90 = 𝐴𝑠𝑖𝑛 𝜔𝑡

CKI
For any desired angle , with fixed amplitude A:
𝐶𝐾 = 𝐴𝑠𝑖𝑛 𝜔𝑡 − 𝜑

𝐶𝐾 = 𝐴𝑠𝑖𝑛 𝜔𝑡 𝑐𝑜𝑠 𝜑 - 𝐴𝑐𝑜𝑠 𝜔𝑡 𝑠𝑖𝑛 𝜑

𝐶𝐾 = 𝐴𝐼 𝐶𝐾𝑄 +𝐴𝑄 𝐶𝐾𝐼 (phase interpolation)

𝐴𝐼 = 𝑐𝑜𝑠 𝜑 & 𝐴𝑄 = 𝑠𝑖𝑛 𝜑 → 𝐴2𝐼 + 𝐴2𝑄 =1 for constant amplitude (circle)


𝐴𝑄
𝜑 = 𝑡𝑎𝑛−1
𝐴𝐼
In practice, most designs use a simpler implementation
which results in 𝐴𝐼 + 𝐴𝑄 =1 → diamond shape

Cairo University Selected topics in wireline transceiver 12


circuits
Jitter of Digital PI-based CDRs
• In CDRs with digital loop filters dithering jitter is proportional to the
feedback loop delay and phase resolution of the PI
• Dithering jitter is the main source of the deterministic jitter in PI CDRs
• A decimation filter that counts early or late decisions of the phase
detector is employed to reduce the dithering jitter

m is the loop latency in the control loop (in UI), KPI is the phase step (or
resolution) of the PI, and KDLL is the decimation filtering factor
• Slew rate SR (maximum rate at which the loop updates the PI phase)
defines its tracking capability

where fupdate is the clock frequency of the digital CDR control blocks
• SR should be > the phase changing rate caused by input sinusoidal jitter Amod

where fjitter is the frequency of the jitter modulated on the incoming data
G. Wu et al., "A 1–16 Gb/s All-Digital Clock and Data Recovery With a Wideband High-Linearity Phase Interpolator," in IEEE
Transactions on Very Large Scale Integration (VLSI) Systems, vol. 24, no. 7, pp. 2511-2520, July 2016.
Cairo University Selected topics in wireline transceiver 13
circuits
JSSC 2006: A 10-Gb/s 5-Tap DFE/4-Tap FFE
Transceiver in 90-nm CMOS Technology

Proper choice of T-coil parameters


provides BW~2.2/RLCL

• Phase interpolation (PI) by phase rotators controlled by a digital CDR loop


generates the I & Q and clocks used to sample the centers and edges of the data bits
• Half-rate CDR logic converts the data and edge samples into early and late signals
which are digitally filtered to generate INC/DEC signals that control the PI
• Digital PI must be precise not to degrade the timing of the recovered clock
J. F. Bulzacchelli et al., "A 10-Gb/s 5-Tap DFE/4-Tap FFE Transceiver in 90-nm CMOS Technology," in IEEE Journal of Solid-State
Circuits, vol. 41, no. 12, pp. 2885-2900, Dec. 2006.
Cairo University Selected topics in wireline transceiver 14
circuits
JSSC 2006: A 10-Gb/s 5-Tap DFE/4-Tap FFE
Transceiver in 90-nm CMOS Technology
PI
• The phase rotator is driven
by two differential
quadrature clock phases
• The circuit selects the
quadrant then interpolates to
generate 16 phases per
quadrant for a total of 64 on
a 360o (2 UI) circle I-DAC

• The interpolator uses a current-steering DAC as tail currents


• Rotator has a diamond-shaped, due to constant total interpolator tail current → non
uniform angles close to quadrant edges
• The current-steering DAC has 2 fixed cells of half-size → 0.5:15.5 to 15.5:0.5
• Fixed cells improve rotator settling time
• 15 steering DAC cells are not uniform; the largest cells are switched near the quadrant
boundaries

Cairo University Selected topics in wireline transceiver 15


circuits
A 10-Gb/s 5-Tap DFE/4-Tap FFE Transceiver in
90-nm CMOS Technology: JSSC 2006

• Slew-rate-control buffers are


required to ensure the signals going
to the PI are sinusoidal for proper
interpolation
• PI step scales with data rate since it is 360/no. of states:
• At 2GHz step =2.7ps
• At 6 GHz step=8ps
• Amplitude variation is periodic, CDR samplers not sensitive to amplitude variation
Cairo University Selected topics in wireline transceiver 16
circuits
Phase Interpolator (PI)-based CDR

Shared among
multi-lanes

• I-DAC and PI replaced with a phase selector → All digital


implementation (more robust to PVT variations)
• Discrete phase steps
• Independent phase & frequency tracking loops → simplifies loop
BW/stability Requirements
Cairo University Selected topics in wireline transceiver 17
circuits
All Digital PLL-based CDRs
• In deep submicron technologies Analog CDRs suffer
from :
– Insufficient voltage headroom
– Large area occupied by integral loop capacitor
• Design sensitive to leakage currents
• All digital CDRs:
– Easy porting between process nodes
– Offers easy method to program loop behavior Kprop & Kint
→ settling, bandwidth, stability, jitter tolerance…

Cairo University Selected topics in wireline transceiver 18


circuits
ISSCC 2007: All-Digital CDR with a 10-bit Monotonic DCO

• The CDR consists of:


o Bang-band phase detector
o 1:8 de-serialiers for data and edge
samples
o Digitally controlled oscillator (DCO)
• High speed UP/Down signals directly control
the DCO frequency to form the proportional
path
• The integral path is formed by the custom logic:
– Generates low speed UP/Down signals
– 17 bit Loop integrator synthesized using a digital IIR filter
– A digital 1st order delta-sigma modulator converts the 17 bit frequency code to a dithered 10 bit
DCO code
– To guarantee the monotonicity of the DCO the 10 bits control code is converter to a segmented
thero-mometer code (32 + 31) bits.
• Frequency detection is also done by the custom logic block
• DSM (delta-sigma modulator) improves the resolution by dithering (17→10 bits)
D. -H. Oh, D. -S. Kim, S. Kim, D. -K. Jeong and W. Kim, "A 2.8Gb/s All-Digital CDR with a 10b Monotonic DCO," 2007 IEEE
International Solid-State Circuits Conference. Digest of Technical Papers, 2007, pp. 222-598
Cairo University Selected topics in wireline transceiver 19
circuits
ISSCC 2007: All-Digital CDR with a 10-bit Monotonic DCO

• DCO is a ring oscillator with


the supply connected through
a digitally controlled resistor
• Resistor is made up of 1024
PMOS transistor switches that
control the DCO frequency
• A segmented thermometer (32 rows & 31 columns) control is used for switching to
prevent glitches during the code transitions
– Full thermometer would require 1024 control line
• The DCO tuning steps (fstep=fn+1/fn) should be constant to maintain the same loop
stability, bandwidth across different data rates
• Equally spaced tuning steps mean that the frequency increases exponentially →
not easy to achieve
• PMOS transistors are inserted between the rows to produces an overall
resistance change close to an exponential, frequency can be tuned exponentially
Cairo University Selected topics in wireline transceiver 20
circuits
ISSCC 2007: All-Digital CDR with a 10-bit Monotonic DCO

• A cell in an even row is turned on when the corresponding column code is 1


and a cell in an odd row is turned on when the corresponding column code is 0
• A cell in the first column is turned on when the corresponding row code is 1
• DCO has 16 tuning cells for the 2-b forward path that receive the proportional
path UP/DN signal
Cairo University Selected topics in wireline transceiver 21
circuits
JSSC 2015: 26.5 Gb/s Optical Receiver With All-
Digital Clock and Data Recovery
• Optical front-end consists of TIA, a single-
to-differential converter (S2D), an limiting
amplifier (LA) with DC offset cancellation
• Output buffers are necessary to drive the
large capacitive load of the samplers
• CDR employs a half rate architecture with
CML buffers with
2 data and 2 edge samplers (4-phase clock) inductive peaking
for phase detection
• This is followed by a 2 to 64 bit de-multiplexers that generate the 64-bit parallel
data and edge information Data[63:0] & Edge[63:0]
• The proportional high speed pulses are feed directly to the LC-QDCO (quadrature
digitally controlled oscillator)
• The integral path is implemented using a low speed (Fb/64) 64-bit parallel digital
phase detector & loop filter
• Frequency detection is also done in the digital
S. -H. Chu et al., "A 22 to 26.5 Gb/s Optical Receiver With All-Digital Clock and Data Recovery in a 65 nm CMOS Process," in IEEE
Journal of Solid-State Circuits, vol. 50, no. 11, pp. 2603-2612, Nov. 2015.
Cairo University Selected topics in wireline transceiver 22
circuits
JSSC 2015: 26.5 Gb/s Optical Receiver With All-
Digital Clock and Data Recovery
MIM capacitors are used
to tune the oscillator
center frequency

• The DLF XOR incoming demultiplexed data and edge samples to generate the up/down signals
• The integral path control is generated by a 24 bit integrator (which replaces the large analog
loop filter) → Integral path gain is determined digitally by the integral gain 
• Upper 10 bit of the 24 bit integrated error are used for the frequency control word FCW[9:0]
• 2 coupled symmetrical LC oscillators are used to generate the quadrature half-rate clocks with
proportional/integral varactor banks to tune the DCO frequency
• To eliminate glitches 8 MSBs of the 10 bit FCW as 4 bit row and 4 bit column codes and these
are converted to two 15 bit thermometer codes, which represent a 30 bit coarse-tuning code
• The two LSBs of the FCW are for fine-tuning the LC-QDCO code
• The proportional varactor bank consists of binary weighted PMOS capacitors (selectively
activated for the proportional gain control)
Cairo University Selected topics in wireline transceiver 23
circuits
JSSC 2011: All-Digital CDR With 1.0-ps Period
Resolution DCO and Adaptive Proportional Gain
NTDCO
Idc:
NTDCO

𝑓𝑟𝑒𝑓
𝑒𝑟𝑟 = 𝑁 −1 +𝑎 … (−2 < 𝑎 > 2)
𝑓𝐷𝐶𝑂

• Frequency acquisition can be performed using a FSM with a digital frequency comparator
• The number of rising reference clock edges are counted during “N” DCO output clocks and
subtracting this value from N
• The FSM runs using fref, and the enable/disable signal is generated via fDCO
• The result is equal to the product of the frequency error and N
• The integral word is changed by an amount proportional to the measured frequency error
• Once the measured frequency error has declined below a predetermined value, the integral
word is then controlled by the DLF
• of the main phase-locking loop
H. Song, D. -S. Kim, D. -H. Oh, S. Kim and D. -K. Jeong, "A 1.0–4.0-Gb/s All-Digital CDR With 1.0-ps Period Resolution DCO and
Adaptive Proportional Gain Control," in IEEE Journal of Solid-State Circuits, vol. 46, no. 2, pp. 424-434, Feb. 2011.
Cairo University Selected topics in wireline transceiver 24
circuits
CDR SIMULINK DESIGN
EXAMPLE:
FULL RATE ALEXANDER PD BASED CDR
WITH ANALOG CP/LOOP FILTER

Cairo University Selected topics in wireline transceiver 25


circuits
Cairo University Selected topics in wireline transceiver 26
circuits
Cairo University Selected topics in wireline transceiver 27
circuits
Cairo University Selected topics in wireline transceiver 28
circuits
Cairo University Selected topics in wireline transceiver 29
circuits
Cairo University Selected topics in wireline transceiver 30
circuits
Cairo University Selected topics in wireline transceiver 31
circuits
Cairo University Selected topics in wireline transceiver 32
circuits
Cairo University Selected topics in wireline transceiver 33
circuits

You might also like