0% found this document useful (0 votes)
124 views191 pages

Timing Recovery Techniques For Digital Recording Systems: WWW - Tue.nl/taverne

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
124 views191 pages

Timing Recovery Techniques For Digital Recording Systems: WWW - Tue.nl/taverne

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 191

Timing recovery techniques for digital recording systems

Citation for published version (APA):


Wang, J. J. (2002). Timing recovery techniques for digital recording systems. [Phd Thesis 1 (Research TU/e /
Graduation TU/e), National University of Singapore]. Technische Universiteit Eindhoven.
https://doi.org/10.6100/IR559505

DOI:
10.6100/IR559505

Document status and date:


Published: 01/01/2002

Document Version:
Publisher’s PDF, also known as Version of Record (includes final page, issue and volume numbers)

Please check the document version of this publication:


• A submitted manuscript is the version of the article upon submission and before peer-review. There can be
important differences between the submitted version and the official published version of record. People
interested in the research are advised to contact the author for the final version of the publication, or visit the
DOI to the publisher's website.
• The final author version and the galley proof are versions of the publication after peer review.
• The final published version features the final layout of the paper including the volume, issue and page
numbers.
Link to publication

General rights
Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners
and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.

• Users may download and print one copy of any publication from the public portal for the purpose of private study or research.
• You may not further distribute the material or use it for any profit-making activity or commercial gain
• You may freely distribute the URL identifying the publication in the public portal.

If the publication is distributed under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license above, please
follow below link for the End User Agreement:
www.tue.nl/taverne

Take down policy


If you believe that this document breaches copyright please contact us at:
[email protected]
providing details and we will investigate your claim.

Download date: 27. mars. 2023


I

Timing Recovery Techniques for Digital


Recording Systems
II

Timing Recovery Techniques for Digital


Recording Systems

JIANJIANG WANG
(B. Eng. & M. Eng., Tsinghua University)

A THESIS SUBMITTED
FOR THE DEGREE OF DOCTOR OF PHILOSOPHY OF
THE ELECTRICAL AND COMPUTR ENGINEERING DEPARTMENT
OF THE NATIONAL UNIVERSITY OF SINGAPORE

2002
III

Timing Recovery Techniques for Digital


Recording Systems

PROEFSCHRIFT

ter verkrijging van de graad van doctor aan de


Technische Universiteit Eindhoven, op gezag van de
Rector Magnificus, prof.dr. R.A. van Santen, voor een
commissie aangewezen door het College voor
Promoties in het openbaar te verdedigen
op donderdag 5 december 2002 om 14.00 uur

door

JIANJIANG WANG

geboren te Korla, China


IV

Dit proefschrift is goedgekeurd door de promotoren:

prof.dr.ir. J.W.M. Bergmans


en
prof. D.S.H. Chan

Copromotor:

prof. T.C. Chong

©Copyright 2002 Jianjiang Wang


All rights reserved. No part of this publication may be reproduced, stored in a retrieval
system, or transmitted, in any form or by any means, electronic, mechanical, photocopying,
recording or otherwise, without the prior written permission from the copyright owner.

CIP-DATA LIBRARY TECHNISCHE UNIVERSITEIT EINDHOVEN

Wang, Jianjiang

Timing recovery techniques for digital recording systems / by Jianjiang Wang. - Eindhoven :
Technische Universiteit Eindhoven, 2002.
Proefschrift. – ISBN 90-386-1950-2
NUR 959
Trefw.: tijdmeting / signaalverwerking / dataopslag / magnetische registratie / datarecorders.
Subject headings: synchronization / signal processing / data recording / magnetic storage.

Printed by: University printing office, Eindhoven, The Netherlands


V

promotiecommissie:

prof. dr. ir. J.W.M. Bergmans

prof. dr. ir. W.M.G. van Bokhoven

prof. dr. P.P.J. van den Bosch

prof. dr. ir. A.C. Brombacher

prof. dr. D.S.H. Chan

prof. dr. T.C. Chong

dr. W.M.J. Coene

dr. G. Mathew

dr. ir. F.M.J. Willems


VI

to my parents
and my family
VII

ACKNOWLEDGEMENTS

First, I want to acknowledge my dissertation advisors Prof. Jan W. M. Bergmans, Prof.


Chong Tow Chong and Dr. George Mathew. They provided sustained guidance and
indispensable advice by setting aside large amounts of their time for discussions and review.
Completion of my Ph.D. would not have been possible without their support.
I want to thank Prof. Jan W. M. Bergmans from Electrical Engineering Department, the
Eindhoven University of Technology, The Netherlands. I owe a lot to him for the tremendous
source of knowledge and inspiration that he has been to me. He has freely shared his time and
insights with me and provided excellent guidance and continual support throughout the course
of this work. I have benefited much from his constructive criticism, invaluable advice and
many enlightening discussions with him. His enthusiastic support and high quality guidance
during my Ph.D. study will never be forgotten. His systematic and rigorous approach to
research has been a constant source of challenge to achieve greater heights.
I want to thank Prof. Chong Tow Chong from Electrical and Computer Engineering
Department, the National University of Singapore, for introducing me into this wonderful and
challenging area of data storage technology. I feel extremely fortunate to have had him as my
advisor. I really cannot thank him enough for his constant support and encouragement during
my work and study.
I want to thank Dr. George Mathew from the Data Storage Institute (DSI), Singapore, for
his counseling and expertise. It is difficult to imagine reaching this point without his help. He
dedicated large amounts of his time and expertise in reviewing my work, offering advice and
giving direction to my efforts. His positive attitude led me constantly to look for solutions to
problems that appeared to have none. He also provided me with a tremendous amount of
support, especially in the area of numerical analysis and computer simulation resources.

Second, I want to thank Prof. Daniel S.H. Chan from the National University of
Singapore, Prof. W.M.G. van Bokhoven, Prof. P.P.J. van den Bosch, Prof. Aarnout C.
Brombacher and Associate Prof. F.M.J. Willems from the Eindhoven University of
Technology, The Netherlands, and Dr. Wim M.J. Coene, principal scientist from the Digital
Signal Processing group of the Philips Research, Eindhoven, The Netherlands, for serving on
my committee and providing valuable comments.

I am also grateful to Dr. Y.X. Lee, IBM, San Jose, CA, who was one of the dissertation
advisors during the initial period of my research. He introduced me to the subject of timing
recovery, and helped me in the formulation and execution of the initial part of my research
work.

I am thankful to DSI for providing all the necessary support and an excellent research
environment for my thesis work. There are several persons in the Coding and Signal
Processing group of DSI who deserve to be acknowledged here. They are Dr. Y.X. Lee, Dr.
Victor Y. Krachkovsky, Dr. George Mathew, Mr. W.C. Ye, Dr. K.C. Indukumar, Dr. S.
Gopalaswamy, Dr. Q.W. Jia, Mr. L. Bi, Mr. Q. Li, Mr. B. Liu and Ms. M.Y. Lin. They have
VIII

been a source of constant moral and technical support to me during my time at DSI. While we
each had a different research focus, we successfully worked as a team in solving problems
related to signal processing techniques for data storage, enhancing the value of each other’s
work. I would also like to thank Mr. Lim Beng Wah for his dependable and cheerful technical
assistance.

I owe a debt of thanks to numerous people who helped me make the thesis work a reality.
These people include Dr. Roger W. Wood and Mr. John Hong from the IBM, San Jose, CA.
USA, Mr. Hiroshi Mutoh and Mr. Hiroaki Ueno from the Fujitsu, Japan, Mr. M. Umemoto
from the Hitachi, Japan, and Dr. Haralampos Pozidis, formerly from the Philips Research, The
Netherlands. I am also thankful to the management of Philips Research Labs for providing the
necessary support for my research work in Eindhoven, The Netherlands. Even though I cannot
list here all of the people who helped me accomplish this work, nevertheless I am indebted to
all of them.

Finally, I must thank my wife and my son, for their unfailing encouragement, love,
support, patience and sacrifice. I also owe a great deal of gratitude to my parents in China;
they have been behind me all the time, and their primary concern is always my well being. I
would like to dedicate this dissertation to them for encouraging me to pursue higher education.

Wang, Jianjiang

Fremont, California, USA


December 2002
IX

abstract  This thesis is devoted to the development of timing recovery techniques for
digital recording systems. The thesis begins with a detailed review and discussion of timing
recovery structures, requirements, performance measures, timing error detector (TED)
algorithms, and timing acquisition issues. The main contributions include five parts. The first
part examines the timing sensitivity of read channel detectors, and develops a new analytical
approach for evaluating the performance under static and random timing errors. The second
part examines the TED efficiencies, develops an improved TED for jitter minimization, and
studies optimality issues for timing acquisition. The third part investigates false lock and hang
up problems, and develops two novel acquisition techniques. The fourth part presents the
timing recovery loop design and implementation for an experimental read channel detector. The
fifth part develops a new asynchronous equalizer adaptation structure with fully digital
interpolative timing recovery (ITR) for digital optical recording systems.
X

SUMMARY
A critical part of data storage systems is the read-write channel, which consists of the
electronic circuits needed for writing the user-supplied data into the storage medium and for
reliably recovering the written data. A properly designed read-write channel has the potential
to enhance the recording density and data rate capabilities of a given head-medium
combination. Equalizer, data-detector and timing recovery circuitry are among the most
important blocks of a read-write channel. Usually, the tasks of equalization and detection
receive considerable attention from researchers. As recording channels become more efficient
in terms of modulation code, bandwidth and storage density, the task of timing recovery
becomes more difficult and, at the same time, increasingly critical for reliable data recovery.
The problem of timing recovery is concerned with the determination of the optimum sampling
instants for the readback signal. The research work that has been undertaken in the thesis deals
with the timing recovery issues in read-write channels.

This thesis is devoted to the study and development of timing recovery techniques for
digital recording systems. It investigates the structure and performance of timing recovery
schemes, and develops new timing recovery schemes for magnetic and optical recording
applications. The thesis contains eight chapters. Chapter 1 gives a brief introduction to digital
recording technology and a review of read-write channel techniques. It concludes with the
motivation, contributions and organization of the thesis. Chapter 2 gives a detailed review of
timing recovery, highlighting the different possible structures, requirements, and performance
measures. It also discusses the generic timing error detector (TED) algorithms and the main
issues that arise during timing acquisition. Analyses and simulation results support the
discussions, wherever possible. Chapters 3 to 7 describe the main new research contributions
made during the course of this Ph.D. work. Chapter 8 concludes the thesis with some remarks
on directions for further work.

Chapter 3 investigates the timing sensitivity of partial response (PR) Viterbi detectors and
decision feedback equalization (DFE) detectors, and develops an analytical approach to
evaluate the performance in the presence of static and random timing errors. Chapter 4
examines the efficiencies of existing and proposed TED algorithms for partial response and
DFE recording systems. It also develops a marginal detection based TED for minimizing jitter
in the multi-level DFE (MDFE) detector and analyzes its phase noise performance. Further, it
examines the optimality of the preamble and TED used for timing acquisition in MDFE.
Chapter 5 investigates the problem of timing acquisition in DFE detectors. It develops two
novel fast acquisition techniques that do not suffer from hang up and false lock problems, even
in the presence of large initial errors in timing, gain and DC offset. Chapter 6 describes the
design and implementation of a practical timing recovery system, which is designed and
prototyped in ECL (emitter-coupled logic) discrete-components, for a 100 Mb/s experimental
MDFE read channel (channel data rate is 150 Mb/s). Performance evaluations of the system
based on bench and spinstand tests are also presented.
XI

The focus of the development of the algorithms and systems presented in Chapters 3 to 6
is on magnetic recording, even though these are also applicable with minor modifications to
optical recording. In contrast, the focus of Chapter 7 is on optical recording. In Chapter 7, a
new and attractive architecture for fully digital zero-forcing based equalizer adaptation and
interpolative timing recovery (ITR) is developed. The development of algorithms and systems
in Chapters 3 to 7 is supplemented with computer simulation results. These simulation results
are used for demonstrating the effectiveness of the proposed algorithms and for corroborating
the analytical developments.
XII

CONTENTS
Acknowledgements.....................................................................................................................VII
Abstract ........................................................................................................................................IX
Summary.......................................................................................................................................X
Contents......................................................................................................................................XII
Glossary ..................................................................................................................................... XV
Abbreviations.......................................................................................................................................................... XV
Often Used Symbols............................................................................................................................................. XVII
Notational Conventions ....................................................................................................................................... XVIII
List of Tables............................................................................................................................. XIX
List of Figures ............................................................................................................................ XX

CHAPTER 1
INTRODUCTION ........................................................................................................ 1
1.1 Introduction to Digital Recording Technology ..............................................................................1
1.2 Overview of Read-Write Channel Techniques for Digital Magnetic Recording Systems .............2
1.3 Motivation for the Present Study ...................................................................................................8
1.4 Contributions and Organization of the Thesis..............................................................................10
References:.........................................................................................................................................12

CHAPTER 2
REVIEW OF TIMING RECOVERY ........................................................................... 17
2.1 Introduction to Timing Recovery.................................................................................................17
2.2 Structures, Requirements and Performance Measures .................................................................20
2.3 Timing Recovery Schemes and TED Algorithms........................................................................27
2.4 Timing Acquisition ......................................................................................................................35
2.5 Summary ......................................................................................................................................46
References:.........................................................................................................................................47

CHAPTER 3
TIMING SENSITIVITY ANALYSIS FOR MAGNETIC RECORDING........................ 51
3.1 Introduction..................................................................................................................................51
3.2 Channel Model with MDFE and PR4-VD Detectors...................................................................52
3.3 Timing Sensitivity of MDFE Detector.........................................................................................54
3.4 Timing Sensitivity of PR4-VD.....................................................................................................59
3.5 Conclusions..................................................................................................................................62
Appendix 3.A : PDF of ISI with Correlated Data ..............................................................................63
References:.........................................................................................................................................64

CHAPTER 4
ANALYSIS OF TIMING ERROR DETECTORS ....................................................... 67
4.1 Introduction..................................................................................................................................67
4.2 TED Analysis for PR Recording Systems....................................................................................68
4.3 TED Analysis for DFE Recording Systems .................................................................................75
4.4 Marginal Detection-Based TED...................................................................................................78
4.5 Optimality of MDFE Acquisition Performance ...........................................................................81
4.6 Conclusions..................................................................................................................................85
References:.........................................................................................................................................87
XIII

CHAPTER 5
FAST TIMING ACQUISITION FOR DECISION FEEDBACK EQUALIZATION
RECEIVERS IN MAGNETIC RECORDING ............................................................. 89
5.1 Introduction..................................................................................................................................89
5.2 Fast Acquisition for DFE with Modified Equalization ................................................................90
5.3 Modified Threshold-based Timing Acquisition Scheme for DFE ...............................................94
5.4 Summary ....................................................................................................................................102
References:.......................................................................................................................................103

CHAPTER 6
IMPLEMENTATION OF MULTI-LEVEL DECISION FEEDBACK EQUALIZATION
TIMING RECOVERY SYSTEM .............................................................................. 105
6.1 Introduction to MDFE Read Channel Prototype........................................................................106
6.2 Implementation of MDFE Timing Recovery System ................................................................110
6.3 Evaluation of MDFE Prototype .................................................................................................119
6.4 Considerations of Hardware Implementation.............................................................................123
6.5 Conclusions................................................................................................................................124
References:.......................................................................................................................................125

CHAPTER 7
ASYNCHRONOUS EQUALIZER ADAPTATION AND INTERPOLATIVE TIMING
RECOVERY FOR A DIGITAL OPTICAL RECORDING SYSTEM......................... 127
7.1 Introduction..........................................................................................................................127
7.2 Overview of Interpolation Techniques.................................................................................128
7.3 Advantages of Using ITR for DVR......................................................................................129
7.4 Novel Asynchronous Adaptation Structure..........................................................................130
7.5 DVR System Model .............................................................................................................136
7.6 Realization of SRC and ITR for DVR .................................................................................138
7.7 Simulations of Asynchronous Equalizer Adaptation and ITR Loops for DVR ...................143
7.8 Summary and Further Discussions.......................................................................................152
References:.......................................................................................................................................154

CHAPTER 8
SUMMARY AND CONCLUSIONS ......................................................................... 157
8.1 Further Work..............................................................................................................................158

Samenvatting ............................................................................................................................ 161


Publications by the author......................................................................................................... 163
Curriculum Vitae ....................................................................................................................... 165
XIV
XV

GLOSSARY

Abbreviations

A/D : Analog-to-Digital converter.


AGC : Automatic Gain Control.
AWG : Arbitrary Waveform Generator.
AWGN : Additive White Gaussian Noise.
BECM : Band-Edge Component Maximization.
BER : Bit Error Rate.
CAD : Computer Aided Design.
CD : Compact Disc.
D/A : Digital-to-Analog Converter.
DA : Data-Aided.
DC : Direct Current.
DD : Decision-Directed.
DDFE: Dual Decision Feedback Equalization.
DFE : Decision Feedback Equalization.
DFT : Discrete-time Fourier Transform.
DPO : Digital Phosphor Oscilloscope.
DVR: Digital Video Recording
DSP : Digital Signal Processor.
DVD : Digital Versatile Disc.
ECC : Error Control Coding.
ECL : Emitter-Coupled Logic.
EER : Error Event Rate.
EPR4 : Extended class IV Partial Response.
E2PR4 : Extended order-2 class IV Partial Response.
FBF : Feedback Filter.
FDTS/DF : Fixed Delay Tree Search with Decision Feedback.
FEQ : Forward Equalizer.
FIR : Finite Impulse Response.
HDD : Hard Disk Drive.
IC : Integrated Circuit.
IIR : Infinite Impulse Response.
ISI : Intersymbol Interference.
ITP : Interpolation.
ITR : Interpolative Timing Recovery.
XVI

LF : Loop Filter.
LPF : Low-Pass Filter.
LMS : Least-Mean-Square.
MAP : Maximum A Posteriori.
MDFE : Multi-level Decision Feedback Equalization.
MLSD : Maximum Likelihood Sequence Detector.
ML : Maximum-Likelihood.
MMSE : Minimum Mean-Square Error.
MR: Magneto-Resistive.
MSE : Mean-Square Error.
MSB : Most Significant Bit.
MT : Modified Threshold.
NCO : Numerically Controlled Oscillator.
NDA : Non-Data-Aided.
NDD : Non-Decision-Directed.
NRZ : Non-Return to Zero.
NRZI : Non-Return to Zero Inverse.
PAM : Pulse-Amplitude modulation.
PCB : Printed Circuit Board.
PDF : Probability Density Function.
PLL : Phase-Locked Loop.
PR : Partial Response.
PR4 : class IV Partial Response.
PR4-VD : class-IV Partial-Response with Viterbi Detection.
PRML : Partial Response with Maximum Likelihood.
PSD : Power Spectral Density.
RC : Resistor-Capacitor.
RF : Radio Frequency.
RLL : Run-Length-Limited.
RMS : Root-Mean-Square.
SNR : Signal-to-Noise Ratio.
SRC : Sample Rate Conversion.
STI : Spatio-Temporal Interpolation.
TA : Thermal Asperity.
TED : Timing Error Detector.
VCO : Voltage Controlled Oscillator.
VD : Viterbi Detector.
VGA : Variable Gain Amplifier.
ZCD : Zero-Crossing Detection.
ZF : Zero-Forcing.
Often Used Symbols

ak : RLL coded data bit sequence in alphabet ak ∈ {−1, 1} .


A(e j 2πΩ ) : PSD of data ak .
bk : RLL encoder output in alphabet bk ∈ {0, 1} .
Bl : PLL equivalent noise bandwidth.
(d, k) : two constraint-parameters of RLL codes.
d k : desired (reference) sequence.
Dch : channel bit density.
Du : user bit density.
ek : error sequence.
f : frequency variable in units Hz.
h(t ) : channel bit response.
h (t ) : isolated transition response.
k : discrete-time index in units T.
K d : TED gain, i.e., slope of ρ (τ ) at the origin τ = 0 .
n(t ) : channel additive noise.
N o / 2 : PSD of two-sided ‘white’ noise.
pw50 : pulse width (in seconds) at 50% base-to-peak amplitude of h (t ) .
r (t ) : readback (replay) signal at the channel output.
R : RLL modulation encoder rate, 0 < R ≤ 1 .
SNRloop : PLL loop signal-to-noise ratio.
SNRm : signal-to-noise ratio in the matched filter bound sense.
Su (e j 2πΩ ) : PSD of noise component uk at the TED output.
t : continuous-time index in seconds.
tk : kth sampling instant in seconds.
T : duration of one bit in seconds.
uk : noise component at the TED output.
yk : discrete-time detector input.
yk′ : sampled derivative of signal y (t ) with respect to t at instants tk .
γ : TED efficiency.
Γ : time-constant of the first-order PLL in units T .
ρ (τ ) : timing function, i.e. TED characteristics.
ς : damping factor of the continuous-time second-order PLL.
ς d : damping factor of the discrete-time second-order PLL.
θ k : input-referred phase noise of the PLL.
XVIII

σ θ2 : variance of θ k .
τ : timing phase error in units T.
φ : unknown channel delay in units T .
χ k : TED output.
ψ : sampling phase in units T .
ω n : natural frequency of the continuous-time second-order PLL.
ω nd : natural frequency of the discrete-time second-order PLL.
Ω : normalized frequency variable in units 1/T, i.e. Ω = fT .

Notational Conventions

D : one-bit interval delay operator.


E[ x] : expected value of x .
Im[ x] : imaginary part of a complex number x .
P : matrices or vectors are boldfaced.
P -1 : matrix inverse.
P T : matrix or vector transpose.
( )
−1 ∞
Q( x) : Q-function, i.e., Q ( x) = 2π ∫ exp ( − u 2 2 )du .
x

rx (l ) : autocorrelation function of x for lag l .


Re[ x] : real part of a complex number x .
sk : sample element of sequence {sk } at instant k .
{sk } : discrete-time sequence.
| x | : absolute value (magnitude) of x .
 x  : biggest integer not exceeding x .
x∗ : conjugate of a complex number x .
⊗ : convolution operator.
⊕ : modulo-2 addition operator.
δ (t ) : Dirac’s delta function.
δ k : discrete-time delta function.
XIX

LIST OF TABLES

Table 4-1: PR TARGETS FOR MAGNETIC RECORDING SYSTEMS. ...............................................69


Table 4-2: MEAN AND VARIANCE OF THE TED OUTPUT FOR PR MAGNETIC
RECORDING SYSTEMS. ................................................................................................................71
Table 4-3: RMS VALUES OF THE TIMING LOOP JITTER AND SUPPRESSION GAIN κ
(USER DENSITY 3.0).......................................................................................................................80
Table 4-4: TED EFFICIENCY LOSS OF THE MDFE TED RELATIVE TO THE ML TED. ................85
Table 5-1: INITIAL CONDITIONS OF MONTE-CARLO SIMULATIONS FOR MDFE TIMING
ACQUISITION..................................................................................................................................93
XX

LIST OF FIGURES

Fig.1-1: Block schematic of a magnetic recording system. The blocks enclosed by the dashed boxes
constitute the read-write channel of the recording system. ..................................................................3
Fig.1-2: Read channel with PR equalizer and Viterbi detector. ...................................................................7
Fig.1-3: Read channel with DFE detector....................................................................................................7
Fig.1-4: Example of timing recovery system. ..............................................................................................8
Fig. 2-1: Schematic of a read-write channel with timing recovery. ...........................................................18
Fig. 2-2: Linearized discrete-time phase-domain TED model. ..................................................................19
Fig. 2-3: Structures of discrete-time loop filters: (a) proportional type, (b) proportional-plus-
integral type. ......................................................................................................................................19
Fig. 2-4: Discrete-time phase-domain model of VCO. ..............................................................................20
Fig. 2-5: Discrete-time phase-domain model of first-order PLL ignoring VCO phase noise and
frequency error...................................................................................................................................20
Fig. 2-6: Receivers incorporating (a) deductive and (b) inductive timing recovery schemes. ...................21
Fig. 2-7: Eye patterns for a raised-cosine channel with (a) 25% and (b) 100% excess bandwidth............23
Fig. 2-8: Eye patterns of (a) PR4 and (b) E2PR4 partial response channels that have no excess
bandwidth...........................................................................................................................................24
Fig. 2-9: Recovered clock with phase jitter. ..............................................................................................24
Fig. 2-10: Data-aided ML TED scheme based on the matched filter.........................................................28
Fig. 2-11: Power spectral densities of 2/3 (1, 7) and 16/17 (0, 6/6) RLL coded data. ...............................29
Fig. 2-12: ML TED timing functions for the Lorentzian channel at different user densities Du with
(a) uncorrelated data, (b) 4T-training data, and (c) 6T-training data..................................................30
Fig. 2-13: ML TED timing functions for the RLL coded Lorentzian channel at different user
densities Du with (a) 2/3 (1,7) coded data and (b) 16/17 (0, 6/6) coded data. ....................................30
Fig. 2-14: ML TED efficiency for the Lorentzian recording channel with various data patterns
versus the channel density..................................................................................................................31
Fig. 2-15: TED structure of MMSE timing recovery scheme....................................................................32
Fig. 2-16: Simplified TED structure of MMSE timing recovery scheme. .................................................33
Fig. 2-17: TED structure of a ZF timing recovery scheme. .......................................................................34
Fig. 2-18: Demonstration of hang up problem when the unstable nulls in the timing function arise
from discontinuities. ..........................................................................................................................36
Fig. 2-19: Demonstration of hang up problem when the timing function has undesired nulls with
wrong slopes. .....................................................................................................................................37
Fig. 2-20: Demonstration of how hang up can be avoided by incorporating hysteresis in the timing
function. .............................................................................................................................................38
Fig. 2-21: Demonstration of false lock phenomenon arising from undesired zero-crossings of the
timing function...................................................................................................................................38
Fig. 2-22: Non-decision-directed TED for timing acquisition in recording systems. ................................40
Fig. 2-23: Symbol-rate samples (marked by ‘o’ at the ideal sampling phase) of replay signal of the
Lorentzian recording channel corresponding to (a) 6T pattern and (b) 4T pattern preamble
sequences. ..........................................................................................................................................41
Fig. 2-24: Timing functions corresponding to the NDD TEDs for using (a) 6T pattern and (b) 4T
pattern preamble sequences. ..............................................................................................................41
Fig. 2-25: Illustration of (a) a timing function and (b) its corresponding phase convergence with
maximum and minimum time constants. ...........................................................................................45
XXI

Fig. 2-26: Second-order continuous-time PLL transient responses with (a) timing phase error due
to a unit phase step and (b) timing phase error due to a unit frequency step. The damping
factor ς is used as a parameter...........................................................................................................46
Fig. 3-1: Magnetic recording system model incorporating (a) MDFE and (b) PR4-VD detector..............52
Fig. 3-2: Equalized bit response qk and the noiseless detector inputs yk for different timing phase
errors τ for (a) MDFE and (b) PR4-VD detectors at user density 2.5...............................................54
Fig. 3-3: PDF of inner level residual ISI in the presence of timing phase errors (obtained by
computation). .....................................................................................................................................57
Fig. 3-4: PDF of inner-level residual ISI in the presence of timing phase errors (obtained by
simulation). ........................................................................................................................................57
Fig. 3-5: Error event rate of MDFE in the presence of timing phase error τ (solid curves: EER
obtained by calculation). ....................................................................................................................58
Fig. 3-6: Error event rate of MDFE in the presence of random phase jitter (solid curves: EER
obtained by calculation). ....................................................................................................................59
Fig. 3-7: PDF of residual ISI z k (τ ) obtained using analytical and simulation approaches. ....................61
Fig. 3-8: Error event rate of PR4-VD in the presence of timing phase error (solid curves:
obtained using calculation based on Eq. (3.23)). ...............................................................................61
Fig. 3-9: PDF of inner-level ISI in the presence of timing phase errors (obtained using
calculation).........................................................................................................................................63
Fig. 4-1: TED efficiency for Lorentzian recording channel and rate 16/17 (0,6/6) code...........................73
Fig. 4-2: TED efficiency for Lorentzian recording channel and rate 2/3 (1,7) code..................................73
Fig. 4-3: Illustration of the existing transition-based MDFE TED algorithm. ...........................................75
Fig. 4-4: TED efficiency for the MDFE transition-based TED. ................................................................77
Fig. 4-5: TED output noise variance suppression factor κ at user density 3.0..........................................80
Fig. 4-6: Distribution histogram of timing phase error for the TEDs with (right) and without
(left) the marginal detection scheme. .................................................................................................81
Fig. 4-7: Replay waveforms at the output of Lorentzian channel. .............................................................82
Fig. 4-8: Timing SNR for the 4T, 6T and 8T preamble patterns, on a rate 2/3 (1,7) coded
Lorentzian channel.............................................................................................................................82
Fig. 4-9: Timing SNR for the 4T, 6T and 8T patterns in a (0,k) coded Lorentzian channel.......................83
Fig. 5-1: The averaged TED output when the normalized sampling phase offset is changed
dynamically from -1 to 1 and from 1 to –1. .......................................................................................90
Fig. 5-2: Phase convergence curves at Du = 3.0 and SNR = 27 dB with the MDFE TED (10
simulation runs). ................................................................................................................................91
Fig. 5-3: Modified forward equalizer based MDFE timing acquisition scheme. .......................................91
Fig. 5-4: Timing function of MDFE using the 4-step acquisition scheme. ................................................92
Fig. 5-5: Phase convergence curves at Du = 3.0 and SNR = 27 dB (with the 4-step acquisition
scheme). .............................................................................................................................................92
Fig. 5-6: Convergence of VCO frequency (data rate in the channel is 150 Mb/s). ....................................93
Fig. 5-7: Convergence of VGA gain (ideal value of VGA gain is 1.0)......................................................94
Fig. 5-8: Convergence of DC offset (ideal value of DC offset is 0). .........................................................94
Fig. 5-9: Timing functions based on the variable threshold ck=λâk-3 for MDFE acquisition
scheme using the 6T (‘ + + + − − − ’) preamble (Horizontal axis: normalized phase offset,
τ; Vertical axis: averaged TED output)..............................................................................................96
Fig. 5-10: Closed-loop phase convergence in the presence of a fixed DC offset 0.5 with
modified threshold ck=âk-3 in a noiseless channel..............................................................................96
Fig. 5-11: Modified threshold timing-acquisition scheme for DFE detector. ............................................97
Fig. 5-12: Structure of modified threshold filter. .......................................................................................97
Fig. 5-13: Illustration of the role of MT sequence ck to shift the decision sequence from one
phase of the 6T pattern to another. .....................................................................................................98
Fig. 5-14: Results indicating which of the first 25 tagged patterns are presented in the feedback
register at the time of the 25th bit instant from the start of acquisition...............................................99
Fig. 5-15: Timing functions based on the MT scheme ck=λ1(âk-3 -âk-3) +λ2∑i=1→6âk-i . (Horizontal
axis: normalized phase offset, τ; Vertical axis: averaged TED output). ..........................................100
XXII

Fig. 5-16: Closed-loop phase convergence in the presence of a fixed DC offset 0.5 with the MT
scheme ck=λ1(âk-3 -âk-3) +λ2∑i=1→6âk-i for a noiseless channel..........................................................100
Fig. 5-17: Convergence of parameters with closed timing, gain and DC loops (horizontal axis:
time in channel bits).........................................................................................................................101
Fig. 5-18: Phase convergence curves for 5000 acquisition trials at user density 3.0 and 23 dB
SNR..................................................................................................................................................101
Fig. 6-1: Structure of MDFE prototype....................................................................................................106
Fig. 6-2: Implementation of the forward path from the preamplifier output to the critical loop
input, consisting of a 4-pole/3-zero forward equalizer.....................................................................107
Fig. 6-3: Implementation of the feedback equalizer using current steering switches. .............................107
Fig. 6-4: Implementation of the critical-loop...........................................................................................108
Fig. 6-5: MDFE critical loop timing diagram. .........................................................................................109
Fig. 6-6: Eye pattern measured at the analog summing bus showing the four levels at ideal
sampling instants (the block marks the desired sampling instant). ..................................................109
Fig. 6-7: Mappings of input to 5-bit output of the 8-bit AD9002 A/D converter with error pattern
assignment. ......................................................................................................................................110
Fig. 6-8: Data format of one sector of magnetic disk for MDFE timing acquisition, sync byte
detection and clock tracking. ...........................................................................................................111
Fig. 6-9: Block schematic of the overall timing loop which uses the A/D output for generating the
error signal for TED.........................................................................................................................112
Fig. 6-10: Digital error generation for TED implementation...................................................................113
Fig. 6-11: Circuit for implementing the charge-pump and loop filter......................................................114
Fig. 6-12: (a) Measured static transfer function from the digital TED input to the output of RC
loop filter; (b) Table that shows the relation between the input patterns and timing errors.............115
Fig. 6-13: Measured transfer function of the charge-pump circuit for waveform input...........................115
Fig. 6-14: Comparison of measured and simulated transfer functions from TED input to the loop
filter output under high-gain conditions...........................................................................................116
Fig. 6-15: Lock-to-Preamble circuit for preloading the 6T pattern. .........................................................117
Fig. 6-16: Timing diagram of the lock-to-preamble circuit. ....................................................................117
Fig. 6-17: Sync-byte detection circuit......................................................................................................118
Fig. 6-18: Cross-correlation of the data in the shift-registers and the word ‘01100001’. ........................118
Fig. 6-19: Measured waveforms from the sync-byte detection circuit.....................................................119
Fig. 6-20: Bench test setup for measuring the BER performance of the critical loop board (the
portion in the dashed block) of the MDFE prototype. .....................................................................120
Fig. 6-21: Measured and theoretical BER performance of the critical loop board...................................120
Fig. 6-22: Histogram of MDFE burst errors measured using BitAlyzer400. ...........................................121
Fig. 6-23: Measurement of the BER performance of the MDFE board at user density of 2.5, with
and without active timing recovery loop..........................................................................................122
Fig. 6-24: Test setup for measuring random jitter in the recovered clock of the MDFE prototype. ........123
Fig. 6-25: MDFE prototype circuits and timing board.............................................................................124
Fig. 7-1: Synchronously sampled versus asynchronously sampled read channels...................................129
Fig. 7-2: System model for studying synchronous equalizer adaptation algorithms................................131
Fig. 7-3: Existing topologies for asynchronous equalizer adaptation with LMS algorithm.....................134
Fig. 7-4: Novel structure for asynchronous equalizer adaptation with the ZF algorithm.........................135
Fig. 7-5: DVR system model. ..................................................................................................................136
Fig. 7-6: DVR channel bit response h(t) and its magnitude response for Ωc=0.20, 0.25 and 0.33. .........137
Fig. 7-7: Target response gk and its magnitude response. ........................................................................138
Fig. 7-8: Eye patterns at the channel output and the equalizer output in the absence of noise for
Ωc=0.33. ...........................................................................................................................................138
Fig. 7-9: Block diagram of SRC. (a): principle of resampling after reconstruction; (b) digital
realization of SRC............................................................................................................................138
Fig. 7-10: Impulse response and frequency response of the six-point Lagrange interpolator..................139
Fig. 7-11: Asynchronously sampled DVR system with ITR loop. ..........................................................140
Fig. 7-12: Timing functions for the DVR with Ωc=0.33 TED, no noise, and d=1 data. ..........................143
Fig. 7-13: Block diagram with the equalizer adaptation and ITR loops. .................................................144
XXIII

Fig. 7-14: Block diagram of equalizer adaptation calculation in the adaptation loop based on the
latch-based asynchronous ZF adaptation structure. .........................................................................144
Fig. 7-15: Block diagram of ITR calculation for the NCO adjustment in the ITR loop. .........................144
Fig. 7-16: Simulation results of synchronous adaptation at Ts=T for a 9-tap equalizer. (a)
coefficient convergence for the noiseless channel; (b) for the 9 dB SNR channel; (c) steady-
state equalizer coefficients; (d) corresponding equalizer magnitude response; (e) time-
averaged squared error convergence; (f) detector inputs. ................................................................145
Fig. 7-17: Simulation results of undersampled asynchronous adaptation (Ts=(4/3)T): (a) time-
averaged error power convergence, (b) steady-state equalizer magnitude response........................146
Fig. 7-18: Simulation results of oversampled asynchronous adaptation (Ts=(3/4)T): (a) time-
averaged error power convergence, (b) steady-state equalizer magnitude response........................146
Fig. 7-19: Equalizer adaptation performance of the latch-based and STI-based structure for
various sampling rates with (a) no noise; (b) 9 dB SNR..................................................................148
Fig. 7-20: TED output for various sampling rates for the noiseless DVR channel with a fixed
equalizer...........................................................................................................................................149
Fig. 7-21: NCO output for various sampling rates for the noiseless DVR channel with a fixed
equalizer...........................................................................................................................................149
Fig. 7-22: Frequency offset for various sampling rates for the noiseless DVR channel with a fixed
equalizer...........................................................................................................................................150
Fig. 7-23: Simulation results of the latch-based equalizer adaptation and ITR loops for the
noiseless DVR channel for Ts=T, Ts=(10/9)T, Ts=(4/3)T, Ts=(4/5)T and Ts=(1/2)T. .......................151
Fig. 7-24: Error power convergence of equalizer output and steady-state equalizer magnitude
response with latch-based equalizer adaptation and ITR loops at 9 dB SNR for Ts=T,
Ts=(10/9)T, Ts=(4/3)T, Ts=(4/5)T and Ts=(1/2)T. .............................................................................151
XXIV
CHAPTER 1
INTRODUCTION
In this chapter, we first give a brief introduction to digital recording systems such as
magnetic and optical systems. Following this, we present an overview of the key techniques
used for data recovery in magnetic recording systems1. Our emphasis in this overview is on the
role of coding and signal processing techniques in digital recording systems. Thereafter, we
present the motivation for the work reported in this thesis. The chapter concludes with a brief
description of the contributions in each chapter of the thesis.

1.1 Introduction to Digital Recording Technology

The advent of the information era and the fast growth of information technology have
resulted in an enormous demand for the storage of digital data, along with demands for
processing and transmission in huge volumes and/or at high speeds. The recording density and
transfer rate of storage devices have had to increase at dramatically fast rates to accommodate
this growing demand. For instance, the areal density and transfer rate of magnetic hard disk
drives were about 20 Mb/in2 and 24 Mb/s, respectively, in 1986 [1]. By the beginning of 2001,
these had become 50 Gb/in2 and 700 Mb/s, respectively [2] [3]. Although this explosive growth
has been mainly due to the technological improvements made in the design of heads and disk-
media, sophisticated coding and signal processing techniques, as well as accurate servo control
algorithms, have also played a significant role [4] [5]. The potential of coding and signal
processing techniques to significantly enhance the storage capacity has been proved repeatedly
in the past. For a given head-medium combination, the use of better coding and signal
processing techniques allows more bits to be packed into a given area in the medium, resulting
in an increase in storage density from the improved reliability. A classic example of this is the
increase in density that resulted from substituting partial response detection for peak detection
in hard disk drives in the early 1990s [6] [4] [5]. As the developments in integrated circuits (IC)
technology allow ever more complex functions to be implemented at affordable costs, signal
processing techniques are going to play an increasingly important role in data storage systems.

1
The review in Chapters 1 and 2 is aimed at magnetic recording systems. This is because the
developments in Chapters 3 to 6, which constitute the bulk of the thesis, assume magnetic recording
systems. Chapter 7, which deals with optical systems, will also include the required review on optical
recording.
CHAPTER 1: Introduction
2

A digital recording system very much resembles a synchronous digital baseband


communication system. Whereas communication systems transport information from one
location to another, recording systems do it from one time to another. The common goal of both
systems is to eventually retrieve the transmitted or stored information as accurately as possible
[7]. The information is in the form of binary data. In baseband systems, no carrier modulation is
used for matching the channel and data characteristics. Instead, these systems typically use
modulation codes to adapt the data to the channel characteristics [8]. The received signal is
processed using equalization, detection and timing recovery techniques for recovering the
original digital data. Because of the operational and system level similarities between digital
storage systems and digital baseband communication systems, the techniques developed for
transmission and reception in communications have been widely exploited in storage systems.
The principles of digital magnetic recording and digital optical recording form the basis of
most of the data storage systems that are in use currently. Magnetic recording systems, in
particular hard disk drives (HDD), prevail for high volume and high-speed storage applications.
On the other hand, optical recording systems, such as compact disc (CD), digital versatile disc
(DVD), and their advanced generations, prevail for removable storage applications. Because of
the continuous demand from customers for storage systems capable of high densities and/or
high data rates, the research efforts in the last decade on improving the storage technology have
been very aggressive. As a result, for example, areal densities of 100 Gb/in2 and data rates of
more than 1 Gb/s will be realized very soon in magnetic recording [9] [10]. In fact, researchers
are being forced to make revolutionary changes in technology to sustain the advancements in
storage capabilities, since the existing technologies are fast approaching their physical limits
[11].
A critical part of data storage systems is the read-write channel, which consists of the
circuits and techniques needed for writing the user-supplied data into the storage medium and
for reliable recovery of the written data. A properly designed read-write channel has the
potential to enhance the recording density and data rate capabilities. The research work that has
been undertaken in this thesis deals with timing recovery issues in read-write channels. To
motivate the work, we first present a system-level overview of the read-write channel and the
various key techniques developed for implementing each of its subsystems. For the sake of
convenience, we present this review in the context of digital magnetic recording systems.

1.2 Overview of Read-Write Channel Techniques for


Digital Magnetic Recording Systems

Fig.1-1 shows the block schematic of a magnetic recording system, highlighting the read-
write channel part of the whole system. As shown, the read-write channel involves a write
channel and read channel. These are similar in functionality to the transmitter and receiver,
respectively, in a communication system. The write channel accepts the input binary data and
converts it into a form suitable for writing into the storage medium. The read channel recovers
the original data by processing the output of the read head in accordance with certain
algorithms. Throughout the thesis, the phrase ‘recording channel’ refers to the cascade of write
circuits, write head, storage medium, and the read head.
CHAPTER 1: Introduction
3

write channel
data to be
stored ECC modulation write
encoder encoder circuits
disk

write/read
heads

recovered
data ECC modulation data equalizer front-end
decoder decoder detector circuits

read channel

Fig.1-1: Block schematic of a magnetic recording system. The blocks enclosed by the dashed boxes
constitute the read-write channel of the recording system.

We now briefly describe the functions of each block shown in Fig.1-1. The ECC (error
control coding) encoder uses special coding schemes to introduce error detection and correction
capability into the input binary data. The ECC decoder uses this capability for detection and
correction of errors during data recovery [12] [13]. The modulation encoder, on the other hand,
is used for matching the data to the recording channel characteristics, and to help in the
operation of the various control-loops (e.g. timing/gain recovery) in the read-channel [14] [15]
[16]. The write circuits convert the binary output data of the modulation encoder to a write-
current waveform. Each current pulse is properly shaped and positioned (through pulse shaping
and write precompensation) to counteract the nonlinear distortions in the recording process.
These distortions arise from the bandwidth limitations of the write path and the
demagnetization fields in the medium [17] [8] [18]. The write-current waveform causes the
write-head to produce magnetic flux which magnetizes the storage medium in one of the two
directions, thereby recording the data.
The electrical signal generated by the read-head, in response to the magnetization pattern in
the medium, is processed by the frond-end circuits which condition the replay signal (e.g.,
amplify, limit noise bandwidth, regulate dynamic range, etc) prior to equalization [19]. The
equalizer shapes the signal according to certain pre-chosen criteria [20] [21] [22] [23] [8] so
that the data detector is able to recover the binary data from the equalized signal with as few
errors as possible [24] [6] [25] [26]. The modulation and ECC decoders operate on the output
bits of the data detector to give the estimate of the original data that was input to the storage
system. Not shown explicitly in Fig.1-1 are the control loops required for doing timing
recovery [27] [28], gain control [19] [29], DC offset cancellation and adaptive equalization [30]
[26].
In the rest of this section, we further elaborate on selected parts of the read-write channel,
namely, modulation codes, equalization, and detection, since these will be used extensively in
this thesis. The problem of timing recovery, which is the main subject of the thesis, is briefly
introduced in Section 1.3. A detailed review on timing recovery will be given in Chapter 2.

1.2.1 Modulation codes


As mentioned above, modulation codes are used for matching the characteristics of the data
to those of the recording channel [8] [31]. Run-length-limited (RLL) codes are the most
popularly used modulation codes in digital magnetic and optical recording systems [32].
CHAPTER 1: Introduction
4

In this thesis, the output sequence {bk } , bk ∈ {0,1} , of the RLL (modulation) encoder is
assumed to be in the NRZI (non-return to zero inverse) representation [33]. That is, bk = 1
implies a transition in the medium magnetization at the k th bit instant, whereas bk = 0 implies
no transition. To generate the write current from the transition sequence {bk } , a precoder is
used to convert the NRZI bits {bk } into NRZ (non-return to zero) representation {bk′ } , where
bk′ = 1 and bk′ = 0 imply the two directions of magnetization (i.e., write current polarities)
instead of the presence and absence of transitions [33] [15]. This precoding is done as
bk′ = bk′ −1 ⊕ bk (1.1)

where ‘ ⊕ ’ indicates modulo-2 addition. The write current polarity is given by


ak = 2bk′ − 1 (1.2)

so that ak ∈ {−1, 1} .

RLL codes are also known as (d, k) codes. The parameters d and k specify the constraints
on the minimum and maximum runs of consecutive zeros between two ones in the coded
sequence bk. The d-constraint, when d > 0 , helps to increase the minimum spacing between
transitions in the medium. This, in turn, helps to reduce the linear as well as nonlinear
interactions (called intersymbol interference) among the data bits recorded in the medium [34].
The k-constraint limits the maximum transition spacing and ensures that the control loops (e.g.
timing, gain and equalization) are updated frequently enough to maintain the loops in good
condition. The k-constraint also helps to reduce the path memory requirement as well as to
avoid certain catastrophic error events in Viterbi-algorithm-based data detectors [24] [25]. The
benefits provided by the d and k constraints carry a price tag in the form of redundancy added
to the coded data stream. This redundancy is characterized by a parameter called code-rate that
is defined as R=p/q, 0 < R < 1 , specifying that groups of p data bits at the encoder input are
coded into groups of q bits at its output. Clearly, the code-rate decreases with increase in d or
decrease in k. An important disadvantage of coding is that it decreases the signal-to-noise ratio
(SNR) in the readback signal. The lower the code-rate is, the greater will be the reduction in
SNR [35] [36]. Hence, it is important to design the data detector to minimize any further
reduction of SNR.
In practical recording systems, the d-constraint is restricted to 0, 1 or 2, and the k-constraint
ranges between 2 and 10. The most popular RLL codes are the rate 1/2 (1, 3) code used in
floppy disk drives [15], rate 8/17 (2, 10) code used in CD [32], rate 8/16 (2, 10) code used in
DVD [37], rate 1/2 (2, 7) and 2/3 (1, 7) codes used in earlier hard disk drives [15] [32], and
several d=0 codes such as rate 8/9 (0,4/4) [25], 16/17 (0, 6/6) [38], and 8/9 (0,11) codes [39]
used in hard disk drives. The rate 16/17 (0, 6/6) and 8/9 (0, 4/4) codes belong to the class of
RLL codes whose constraints are specified as (d, G/I), where d and G have the same meaning
as d and k discussed above. The I-parameter describes an additional constraint on the maximum
run-length of zeros in the odd and even interleaved sequences [16]. More recently, high-rate
codes combined with parity bits in conjunction with parity-based post-processing schemes have
been widely used for improving error performance and densities [40] [41] [42]. These codes
have the advantage of minimizing performance degradation due to rate loss and error
propagation at the modulation decoder. For example, in the disk-drive industry, to enhance
density and performance, the rates of d=0 modulation codes have steadily been increasing over
the years, from initially, rate 8/9 and 16/17 codes to currently 19/20 [43], 32/33, 64/65 [42], and
CHAPTER 1: Introduction
5

96/100 [42]. These codes keep the d-constraint to zero and allow the k-constraint to vary
between 4 and 8, thus providing the benefits of (d, k) codes while largely reducing the code
redundancy.

1.2.2 Equalization and detection techniques


A mathematical model for the readback signal at the read head output can be given as [44]
[6]

r (t ) = ∑ a h(t − kT ) + n(t )
k =−∞
k (1.3)

where {ak } , ak ∈ {−1, 1} , is the sequence of RLL coded bits in NRZ format, h(t ) is the
response of the combination of write-head, medium and read-head to the NRZ input bit ‘+1’,
and n(t ) is the noise due to read-head and electronics. The noise n(t ) is modeled as white
Gaussian with power spectral density N o / 2 Watts/Hz. Here, ‘t’ denotes time and ‘T’ denotes
the duration of one bit ak. The dispersion of each bit ak , which is caused by the bit response
h(t ) , normally results in linear intersymbol interference (ISI) in the readback signal since the
duration of h(t ) is much larger than T. The problem of ISI worsens with increase in recording
density. Recording density may be characterized by the user bit density Du , defined as the ratio
pw / T , where pw is the pulse-width at 50% of the peak amplitude of h (t ) , the isolated
50 u 50

transition response, and Tu is the duration of one user bit2. Clearly, Tu = T / R , where R is the
code-rate of the RLL encoder. The bit response and transition response are related by [6]

( )
h(t ) = h (t ) − h (t − T ) 2 . (1.4)

The readback signal can also be expressed in terms of h (t ) as [6]



r (t ) = ∑ a′ h(t − kT ) + n(t )
k =−∞
k (1.5)

where ak′ = (ak − ak −1 ) / 2 and ak′ ∈ {−1, 0, 1} . For instance, in longitudinal magnetic
recording, a commonly used model for the transition response h (t ) is the Lorentzian pulse
given by [33] [44] [45]

2
In this thesis, we do not include ECC encoder and decoder. Hence, the raw data appears directly at the
input of the RLL encoder. Following existing practices, we call the data bits at the input and output of the
RLL encoder user bits and channel bits, respectively [6] [33]. Further, the transition response h (t ) is the
response of the recording channel when the NRZ data pattern {ak } is of the form
{" , −1, −1, −1, +1, +1, +1,"} .
CHAPTER 1: Introduction
6

Vop
h (t ) = 2
(1.6)
 2t 
1+  
 pw50 
where Vop is the base-to-peak amplitude of h (t ) . The readback signal will be a series of such
pulses corresponding to the transitions ak′ in the magnetization pattern. The peak detector,
which was the first data detector for digital magnetic recording systems, detects the data bits by
identifying the locations of the peaks of these pulses [46]. When recording density increases,
pulses increasingly overlap, and it becomes increasingly difficult to reliably detect the pulse
positions. This problem has been circumvented to a certain extent by encoding the user data
using an RLL code with d=1 or d=2 constraint and by applying ‘pulse slimming’ to the
readback signal [47] [46] [15]. Pulse slimming is a form of equalization whereby the interaction
between adjacent pulses is minimized by filtering the readback signal for trimming the pulses
to be narrow.
At high recording densities, the peak detector breaks down due to the presence of severe
ISI in the readback signal. This necessitates the use of more sophisticated equalization and
detection techniques to ensure reliable data recovery. The purpose of equalization is to shape
the characteristics (e.g. spectrum) of signal and noise according to certain specifications. A
straightforward approach would be to design the equalizer transfer function to be the reciprocal
of the channel transfer function, so that the ISI is completely eliminated. This, however, is not a
practical approach since the resulting equalizer would result in extremely large noise
enhancement at frequencies close to zero and 1/(2T). This is because the channel has almost no
transfer at these frequencies [48]. Yet another straightforward approach would be to use an
optimum maximum likelihood sequence detector (MLSD) [49], implemented using the cost
efficient Viterbi algorithm [24], on the 1/T-sampled readback signal. This, however, is not
practical either, since the complexity of Viterbi detector (VD) increases exponentially with the
number of ISI components. As a result, two of the most commonly considered approaches for
detection are based on the principles of partial response (PR) equalization [21] and decision
feedback equalization (DFE) [22].
The basic idea of partial-response equalization is to use a linear filter, called equalizer, to
shape the long channel response h(t ) , which causes severe ISI, into a known partial response
p(t ) . That response is chosen such that i) the ISI components due to p(t ) are limited to a
specified small number, and ii) the spectra of h(t ) and p (t ) are as similar as possible. Such a
choice ensures that the complexity of Viterbi detector is practically affordable and the resulting
noise enhancement is minimum. A widely used family of PR polynomials is of the form [50]
P( D ) = (1 − D )(1 + D ) n , n = 0,1, 2," , (1.7)
where D denotes the ‘one bit delay’ operator and P(D) relates to a sampled version
pk = p(t ) |t =kT of p(t) according to P ( D) = ∑
pk D k . Some examples are PR4 (class IV
k
2
partial response), EPR4 and E PR4 (extended PR4) for n=1, 2, and 3, respectively [25] [51]
[52] [53]. Fig.1-2 shows the schematic of a read channel with PR equalization and Viterbi
detector.
CHAPTER 1: Introduction
7

readback
signal
r(t) LPF PR viterbi detected
(low-pass filter) 1/T equalizer detector bits
cut-off frequency
≈ 1/(2T)

Fig.1-2: Read channel with PR equalizer and Viterbi detector.

At high densities, PR detection results in significant improvement over peak detection [6].
However, the mismatch between the recording channel and the target responses causes the
noise at the PR equalizer output to be correlated. In the presence of correlated noise, the Viterbi
detector becomes a sub-optimal detector. To improve the performance of PR-based detectors,
several modifications have been proposed. An important contribution has been the ‘noise-
predictive PR scheme’ [54]. This scheme uses a noise predictor to effectively whiten the noise
at the equalizer output. Another key proposal has been the ‘modified-target PR’ scheme [55]. In
this scheme, instead of choosing a standard PR target, the PR target shape is optimized for the
given head-media combination to result in better detection performance. Yet another approach,
which is currently being pursued intensively, is the combination of distance-enhancing codes
and/or parity codes with PR equalization to improve the overall detection performance [56]
[42].
The decision feedback based approaches use a two-step procedure for ISI removal [22].
Fig.1-3 shows the schematic of a read channel with DFE detector.

readback
signal
r(t) LPF forward
slicer detected
(low-pass filter) 1/T equalizer bits
cut-off frequency
≈ 1/(2T)
feedback
equalizer

Fig.1-3: Read channel with DFE detector.

The DFE detector consists of a forward equalizer, a feedback equalizer, and a slicer. The
forward equalizer suppresses pre-cursive ISI (i.e. ISI from bits yet to be detected, or ak + l , l ≥ 1 )
and minimizes noise. The feedback equalizer removes post-cursive ISI (i.e. ISI from already
detected bits, or ak −l , l ≥ 1 ). Thus, the joint action of the forward and feedback equalizers
results in complete absence of the ISI at slicer input. Then, the detected bit (also called,
decision) is equal to the sign of the slicer input. One of the major advantages of DFE, because
of its 2-step ISI removal structure, is that it does not suffer from noise enhancement [23] [48].
On the other hand, a weakness of DFE is the phenomenon of error propagation, which arises
because of the use of past decisions to cancel the post-cursive ISI [23]. In other words, errors in
past decisions tend to cause further decision errors. Over all, the DFE offers a good
compromise between complexity and performance [26] [23] [33] [48].
There have been several modifications to the basic DFE to improve detection performance.
One key proposal has been to use multiple DFE detectors connected in parallel in a single
structure for making more accurate decisions on those samples which are not suitable for
making direct hard decisions [57] [58] [34]. This structure also helps in minimizing error
propagation. Examples of such detectors are parallel DFE [57], Dual DFE [34] [59], and multi-
level DFE family [58] [60]. Another key proposal has been to use a combination of decision
CHAPTER 1: Introduction
8

feedback and a fixed-depth tree-search based detector, called fixed delay tree search with
decision feedback (FDTS/DF) [6] [61] [62]. There have also been proposals where the principle
of decision feedback is used to reduce the complexity of the Viterbi detector [63] [64].
We can conclude from the above paragraphs that equalization is an effective technique to
provide the signal shaping suitable for detection while mitigating the effects of ISI and noise.
Adaptation techniques have also been widely used to compensate in real time for variations in
the recording system parameters. These techniques may involve the adaptation of equalizer
coefficients, timing phase, gain, and DC [25] [26]. Our focus in this thesis is on timing
recovery, which deals with the acquisition and tracking of timing phase in the read channel.
Timing recovery is regarded as one of the most important and also difficult tasks at the receiver
end, especially for high-density and high data-rate recording [8]. In the next two sections, we
summarize the topics of timing recovery, which are undertaken for investigation in this thesis,
and the contributions that resulted from this research work.

1.3 Motivation for the Present Study


For the present study, we choose ‘timing recovery’ as the broad area, and identify several
topics for in-depth investigation. We leave a broad review and detailed discussions on timing
recovery issues in Chapter 2. In this section, we elaborate on the motivations that underlie the
selection of these topics.
The data detector in the read channel operates on samples of the filtered readback signal for
detecting the recorded data bits. The problem of timing recovery is concerned with the
determination of the time instants at which these samples should be taken. Clearly, timing
recovery is very important since errors in the choice of sampling instants will directly translate
to poor detection performance. This is because the minimization or cancellation of ISI is
guaranteed only at the correct sampling instants. Further, with the steady increase in recording
densities and data rates, the resulting decreased bandwidth and deteriorated SNR reflect a
decreased amount of timing information, and at the same time, requirements on the accuracy of
timing recovery tend to become increasingly severe. As a result, timing recovery becomes
increasingly critical for reliable data recovery and, at the same time, more difficult to
accomplish. Hence, studying and developing reliable timing recovery techniques become
necessary and important.
An example of the timing recovery system in a read channel is depicted in Fig.1-4.

receiver
LF
χk

VCO TED
channel
n(t)

ak r(t) y(t) yk âk
h(t) φT prefilter detector
tk=(k+ψ)T

Fig.1-4: Example of a timing recovery system.


CHAPTER 1: Introduction
9

The input data sequence ak of data rate 1/T is applied to the channel with bit response h(t),
additive noise n(t), and an unknown delay φ (normalized in units T ). The receiver operates on
the received signal r(t) to produce decisions âk with respect to ak based on the recovered clock
signal that indicates the sampling instants tk = (k + ψ )T . Here, ψ is the normalized sampling
phase of the recovered clock signal that must closely approach φ in order for the detector to
function properly. To achieve this, a timing recovery subsystem has to be developed in order to
demarcate the instants tk that make the magnitude of the sampling phase error τ = ψ − φ
(normalized in units T) as small as possible. Obviously, the desired sampling instants are
tk = (k + φ )T . Then, the actual sampling instants tk can be interpreted as tk = tk + τ T that
exhibit a sampling phase error τ . For the sake of convenience, throughout this thesis, we set
φ = 0 , i.e. we denote by kT the desired sampling instants in the presence of the unknown
channel delay, which is φ in the present case. Therefore, the actual sampling instants tk are
expressed as tk = (k + τ )T , thus indicating the presence of the sampling phase error τ .

As illustrated in Fig.1-4, the heart of the timing recovery subsystem is a phase-locked loop
(PLL) that tracks the clock phase and frequency from the incoming signal [8] [65]. The PLL
consists of a timing-error detector (TED), a loop filter (LF), and a voltage-controlled oscillator
(VCO). The TED serves to generate a timing error output χ k , which is an indication of the
sampling phase error τ . The LF and the VCO serve to filter and average the timing error. As a
result, the frequency and phase of the VCO are adjusted by the filtered timing error. The output
of the VCO is the sampling clock signal that controls a sampling device, which is often an
analog-to-digital (A/D) converter. Clearly, the timing recovery system depends largely on the
TED. The loop properties also depend on the LF and VCO.
Typically, the timing recovery process takes place as follows. Initially, the read clock is
running freely at a frequency close to the specified channel bit rate 1/T. However, the timing
phase of the clock bears no relation to the timing of the incoming signal (filtered readback
signal). By ‘timing phase’ we mean the phase shift of the read channel clock with respect to the
ideal sampling instants at the frequency 1/T. The actual timing of the incoming signal depends
on the timing of the clock used in the write channel as well as the delays caused by the physical
or electrical systems from the write-head to the sampling point in the read channel. The system
needs to be brought into synchronism in both phase and frequency. Usually, a known training
sequence, which is called preamble, is recorded prior to the actual data sequence. To facilitate
acquisition, specially developed TED algorithms make use of this preamble to compensate for
initial errors in phase and frequency [66]. This is called the ‘acquisition mode’ of timing
recovery. Once the acquisition has been accomplished, small corrections are necessary for
tracking the slow variations in the actual timing. This is called the ‘tracking mode’.
The thesis work focuses on timing recovery techniques that ensure reliable acquisition and
tracking for recording channels. With a limited period of preamble, the objective of timing
acquisition is to select the sampling phase ψ to match the channel delay φ as rapidly and
accurately as possible. With the specific recording density and data rate, the objective of
tracking is to track the clock signal with a minimum phase error variance while maintaining the
averaged sampling phase error τ to zero. This is essential for the read channel to provide a
reliable detection performance.
There are several issues that should be addressed while developing a timing acquisition
system. The first and foremost requirement is that the acquisition should be fast and accurate.
The preamble pattern and the TED have significant roles in achieving this goal. The preamble
CHAPTER 1: Introduction
10

should be chosen to maximize the amount of timing information available at the TED input in
the face of noise. Further, a shorter length of the preamble is desirable to allow more storage
area for the user data. For a given preamble pattern, the TED should be chosen to result in a
performance that is as close to optimum as possible, with a practically affordable complexity
[67].
False lock and hang up are two serious problems that can arise during timing acquisition.
False lock means that the timing loop locks to a wrong phase and/or frequency. Hang up means
that the timing loop dwells at a wrong phase for a prolonged interval and therefore takes a very
long time to reach the correct phase [68]. Thus, false lock and hang up are disastrous for timing
acquisition in magnetic recording systems where fast and reliable timing acquisition is required
with a short preamble sequence. The preamble pattern and the acquisition mechanism must be
chosen appropriately in view of this requirement. In addition, a concern that deserves
significant attention during the development of acquisition techniques for timing recovery is the
feasibility of their implementation in a practical system. In other words, the techniques should
be simple as well as reliable. Hence, it is of interest to explore the practical implementation of
the fast acquisition schemes under consideration.
An important concern during the tracking mode of timing recovery is the steady-state jitter,
which tends to increase with increase in recording density and data rate. Steady-state jitter is
the timing jitter that is present in the phase when the timing loop is operating in the tracking
mode. Even though the timing jitter can be made small enough by careful choice of the timing
loop parameters, timing fluctuations can arise due to various reasons, such as variations in
spindle speed or flying height, fluctuations in the read head position, occurrence of magneto-
resistive (MR) thermal asperity, etc.. Because timing jitter causes a performance degradation, it
is useful to calibrate or evaluate the sensitivity of detectors to timing jitter. It is also of interest
to develop techniques that can help to minimize the timing jitter.
Recently, fully digital adaptive equalization and interpolative timing recovery techniques
for asynchronously sampled recording channels have appeared in the literature and they show
attractive advantages in terms of complexity, cost and stability [69] [70]. Development of these
techniques, from the point of view of real-time implementation, is a topic that is of much
interest in read-write channel research.
All of the above mentioned aspects have motivated the research work reported in this
thesis. In the next section, we summarize the contributions in each of the chapters.

1.4 Contributions and Organization of the Thesis


As recording systems become more efficient in terms of modulation code, bandwidth and
storage density, the task of timing recovery becomes more difficult and, at the same time,
increasingly critical to reliable data recovery. This thesis is devoted to the investigation and
development of timing recovery techniques for digital recording channels. It investigates the
structure and performance of timing recovery schemes, and develops new timing recovery
schemes for magnetic and optical recording applications.
The thesis contains eight chapters. Chapter 1 gives a brief introduction to digital recording
technology, and a quick review of read-write channel techniques. It concludes with the
motivation, contributions and organization of the thesis. Chapter 2 gives a detailed review of
timing recovery, highlighting the different possible structures, requirements, and performance
measures. It also discusses generic timing-error detector (TED) algorithms, and the main issues
CHAPTER 1: Introduction
11

that arise during timing acquisition. Analyses and simulation results are provided to support the
discussions, wherever possible. Chapters 3 to 7 describe the main new research contributions
made during the course of this Ph.D. work. Chapter 8 concludes the thesis with some remarks
on directions for further work.
Chapter 3 investigates the timing sensitivity of partial response and decision feedback
detectors, and develops an analytical approach to evaluate the performance in the presence of
static and random timing phase errors. Chapter 4 examines the efficiencies of existing and
proposed TED algorithms for partial response and DFE recording channels. It also develops a
marginal detection based TED for minimizing jitter in multi-level DFE (MDFE) detector and
analyzes its phase noise performance. Further, it examines the optimality of the preamble and
TED used for the timing acquisition in MDFE. Chapter 5 investigates the problem of timing
acquisition in DFE detectors. It develops two novel fast acquisition techniques that do not
suffer from hang up and false lock problems, even in the presence of large initial errors in
timing, gain and DC offset. Chapter 6 describes the implementation of a practical timing
recovery system, which is designed and prototyped in emitter-coupled logic (ECL) discrete-
components, for a 100 Mb/s experimental MDFE read channel (channel data rate is 150 Mb/s).
Performance evaluation of the system based on bench tests and spinstand tests are also
presented.
The focus of the development of the algorithms and systems in Chapters 3 to 6 is on
magnetic recording, even though these are also applicable with minor modifications to optical
recording. In contrast, our focus in Chapter 7 is on optical recording. In Chapter 7, an
asynchronously sampled optical DVR (digital video recording) channel is studied. It develops a
new, attractive and practicable architecture for fully digital zero-forcing (ZF) based equalizer
adaptation and interpolative timing recovery (ITR).
The development of new algorithms and systems presented in Chapters 3 to 7 is
supplemented with computer simulation results. These simulation results are used for
demonstrating the effectiveness of the proposed algorithms and for corroborating the analytical
developments.

1.4.1 About the publications by the author

Based on the research work carried out during the course of the Ph.D. work, the author
published thirteen papers, coauthoring with his supervisors and other project partners (see the
list of the publications by the author). The author’s contributions to timing recovery, as
reported in these publications, consist of four parts. The first part develops effective analytical
approaches with simulations to corroborate the analytical results in [1∼5] and [8]. The second
part proposes improved or simple TED algorithms in [2], [4], [10] and [12]. The third part
develops novel DFE acquisition structures in [3], [5], [10] and [12]. The fourth part performs
the design, implementation and verification of a timing recovery system in [6∼7], [9], [11] and
[13]. The contents of these publications are reorganized and reflected in the following chapters
of this thesis.
CHAPTER 1: Introduction
12

References:

[1] M. H. Kryder, “Guest editorial for the ‘Special issue on magnetic information storage
technology’,” Proc. IEEE, vol. 74, no. 11, pp. 1475-1476, Nov. 1986.
[2] K. Stoev et al., “Demonstration and characterization of greater than 50 Gbits/in2 recording
systems,” in Digests 8th Joint MMM-INTERMAG Conf., San Antonio, Texas, Jan. 2001, p.
485.
[3] S. Altekar et al., “A 700Mb/s BiCMOS read channel integrated circuit,” in Digests IEEE
Intl. Solid-State Circuits Conf. (ISSCC), Febr. 2001, pp. 184-185.
[4] J. Moon, “The role of SP in data storage systems,” IEEE Signal Processing Magazine, pp.
54-72, July 1998.
[5] H. Thapar, S. S. Lee, C. Conroy, R. Contreras, A. Yeung, J. G. Chern, T. Pan, and S. M.
Shih, “Hard disk drive channels: Technology and trends,” in Proc. IEEE Intl. Conf. Custom
Integrated Circuits, May 1999, pp. 309-316.
[6] J. Moon and L.R. Carley, “Performance comparison of detection methods in magnetic
recording,” IEEE Trans. Magn., vol. 26, no. 6, pp. 3155-3172, Nov. 1990.
[7] C. E. Shannon, “A mathematical theory of communication,” Bell System Technical Journal,
vol. 27, pp. 379-423, July 1948.
[8] J. W. M. Bergmans, Digital baseband transmission and recording. Boston: Kluwer
Academic Publishers, 1996.
[9] S. K. Khizroev, M. H. Kryder, Y. Ikeda, K. Rubin, P. Aornett, M. Best, and D. A.
Thompson, “Recording heads with track widths suitable for 100 Gbits/in2 density,” IEEE
Trans. Magn., vol. 35, no. 5, pp. 2544-2546, Sept. 1999.
[10] F. H. Liu, S. Shi, J. Wang, Y. Chen, K. Stoev, L. Leal, R. Saha, H. C. Tong, S. Dey, and
M. Nojaba, “Magnetic recording at a data rate of one gigabit per second,” IEEE Trans.
Magn., vol. 37, no. 2, pp. 613-618, March 2001.
[11] M. H. Kryder, “Magnetic recording beyond the superparamagnetic limit,” in Digests IEEE
Intl. Conf. Magnetics (INTERMAG), Toronto, Canada, April 2000, p. HA-01.
[12] S. Lin and D. J. Costello, Error control coding: Fundamentals and Applications. Prentice-
Hall, 1983.
[13] D. J. Costello, Jr., J. Hagenauer, H. Imai, and S. B. Wicker, “Applications of error-control
coding,” IEEE Trans. Inform. Theory, vol. 44, no. 6, pp. 2531-2560, Oct. 1998.
[14] P. H. Siegel, “Recording codes for digital magnetic storage,” IEEE Trans. Magn., vol. 21,
no. 5, pp. 1344-1349, Sept. 1985.
[15] P. H. Siegel and J. K. Wolf, “Modulation and coding for information storage,” IEEE
Commun. Magazine, vol. 29, no. 12, pp. 68-86, Dec. 1991.
[16] K. A. S. Immink, P. H. Siegel, and J. K. Wolf, “Codes for digital recorders,” IEEE Trans.
Inform. Theory, vol. 44, no. 6, pp. 2260-2299, Oct. 1998.
[17] D. Palmer, P. Ziperovich, R. W. Wood, and T. D. Howell, “Identification of nonlinear
write effects using pseudorandom sequences,” IEEE Trans. Magn., vol. 23, no. 5, pp.
2377-2379, Sept. 1986.
[18] A. Taratorin, J. Fitzpatrick, S. X. Wang, and B. Wilson, “Non-linear interactions in a
series of transitions,” IEEE Trans. Magn., vol. 33, no. 1, pp. 956-961, Jan. 1997.
[19] B. Bloodworth, P. Siniscalchi, G. De Veirman, A. Jezdic, R. Pierson, and R.
Sundararaman, “A 450Mb/s analog front-end for PRML read channels,” in Proc. IEEE
Intl. Conf. Solid-State Circuits (ISSCC), Febr. 1999, pp. 34-35.
CHAPTER 1: Introduction
13

[20] H. Kobayashi and D. T. Tang, “Application of partial-response channel coding to magnetic


recording systems,” IBM J. Res. Develop., vol. 14, pp. 368-375, July 1970.
[21] P. Kabal and S. Pasupathy, “Partial-response signaling,” IEEE Trans. Commun., vol. 23,
no. 9, pp. 921-934, Sept. 1975.
[22] C. A. Belfiore and J. H. Park, “Decision feedback equalization,” Proc. IEEE, vol. 67, no.
8, pp. 1143-1156, Aug. 1979.
[23] E. A. Lee and D. G. Messerschmitt, Digital communications. Boston: Kluwer Academic
Publisher, 1988.
[24] G. D. Forney, “The Viterbi algorithm,” Proc. IEEE, vol. 61, no. 3, pp. 268-278, March
1973.
[25] R. D. Cideciyan, F. Dolivo, R. Hermann, W. Hirt, and W. Schott, “A PRML system for
digital magnetic recording,” IEEE J. Selected Areas Commun., vol. 10. no. 1, pp. 38-56,
Jan. 1992.
[26] K. D. Fisher, J. M. Cioffi, W. L. Abbott, P. S. Bednarz, and C. M. Melas, “An adaptive
RAM-DFE for storage channels,” IEEE Trans. Commun., vol. 39 no. 11, pp. 1559-1568,
Nov. 1991.
[27] H. Mueller and M. Müller, “Timing recovery in digital synchronous receivers,” IEEE
Trans. Commun., vol. 24, no. 5, pp. 516-531, May 1976.
[28] H. Shafiee, “Timing recovery for sampling detectors in digital magnetic recording,” in
Proc. IEEE Intl. Conf. Commun. (ICC), Dallas, Texas, June 1997, pp. 577-581.
[29] A. Gomez and A. A. Abidi, “A 50-Mhz CMOS variable gain amplifier for magnetic data
storage systems,” IEEE J. Solid-State Circuits, vol. 27, no. 6, pp. 935-939, June 1992.
[30] S. U. H. Qureshi, “Adaptive equalization,” Proc. IEEE, vol. 73, no. 9, pp. 1349-1387,
Sept. 1985.
[31] H. Kobayashi, “A survey of coding schemes for transmission and recording of digital
data,” IEEE Trans. Commun. Technology, vol. 19, no. 6, pp. 1087-1100, Dec. 1971.
[32] K. A. S. Immink, “Run-length limited sequences,” Proc. IEEE, vol. 78, no. 11, pp. 1745-
1759, Nov. 1990.
[33] S. X. Wang and A. M. Taratorin, Magnetic information storage technology. Academic
Press, 1999.
[34] J. W. M. Bergmans et al., “Dual-DFE read/write channel IC for hard-disk drives,” IEEE
Trans. Magn., vol. 34, no. 1, pp. 172-177, Jan. 1998.
[35] K. A. S. Immink, “Coding techniques for the noisy magnetic recording channel: A state-
of-the-art report,” IEEE Trans. Commun., vol. 37, no. 6, pp. 413-419, May 1989.
[36] J. W. M. Bergmans, “On the SNR merits of run-length-limited codes in feedback-
equalized recording systems,” IEEE Trans. Commun., vol. 41, no. 8, pp. 1133-1136, Aug.
1993.
[37] K. A. S. Immink, “EFMPlus: The coding format of the multimedia compact disc,” IEEE
Trans. Consumer Electronics, vol. 41, no. 3, pp. 491-497, Aug. 1995.
[38] A. J. van Wijngaarden and K. A. S. Immink, “ Combinatorial construction of high rate
run-length-limited codes,” in Proc. Intl. Conf. Global Telecommun. (GLOBECOM), Nov.
1996, pp. 343-347.
[39] W. G. Bliss, “An 8/9 rate time-varying trellis code for high density magnetic recording,”
IEEE Trans. Magn., vol. 33, no. 5, pp. 2746-2748, Sept. 1997.
[40] H. Runsheng and J. R. Cruz, “High-rate coding system for magnetic recording,” IEEE
Trans. Magn., vol. 35, no. 6, pp. 4522-4527, Nov. 1999.
[41] T. Conway, “A new target response with parity coding for high density magnetic recording
channels,” IEEE Trans. Magn., vol. 34, no. 4, pp. 2382-2486, July 1998.
CHAPTER 1: Introduction
14

[42] R. D. Cideciyan, J. D. Coker, E. Eleftheriou, and R. L. Galbraith, “Noise predictive


maximum likelihood detection combined with parity-based post-processing,” IEEE
Trans. Magn., vol. 37, no. 2, pp. 714-720, March 2001.
[43] H. Saito, Y. Okamoto, and H. Osawa, “The rate 19/20 trellis code matched to enhanced
extended class-4 partial response channel,” IEEE Trans. Magn., vol. 37, no. 2, pp. 768-
772, March 2001.
[44] H. N. Bertram, Theory of magnetic recording. Cambridge Unitversity Press, 1994.
[45] H. N. Bertram, “Fundamentals of the magnetic recording process,” Proc. IEEE, vol. 74,
no. 11, pp. 1494-1512, Nov. 1986.
[46] R. W. Wood, “Magnetic recording systems,” Proc. IEEE, vol. 74, no. 11, pp. 1557-1569,
Nov. 1986.
[47] L. Barbosa, “Minimum noise pulse slimmer,” IEEE Trans. Magn., vol. 17, no. 6, pp. 3340-
3342, Nov. 1981.
[48] J. W. M. Bergmans, “Decision feedback equalization for digital magnetic recording,”
IEEE Trans. Magn., vol. 24, no. 1, pp. 683-688, Jan. 1988.
[49] G. D. Forney, “Maximum-likelihood sequence estimation of digital sequences in the
presence of intersymbol interference,” IEEE Trans. Inform. Theory, vol. 18, pp. 363-378,
May 1972.
[50] H. Thapar and A. Patel, “A class of partial response systems for increasing storage density
in magnetic recording,” IEEE Trans. Magn., vol. 23, no. 5, pp. 3666-3668, Sept. 1987.
[51] T. Suguwara, M. Yamagishi, H. Mutoh, K. Shimoda, and Y. Mizoshita, “Viterbi detector
including PRML and EPRML,” IEEE Trans. Magn., vol. 29, no. 6, pp. 4021-4023, Nov.
1993.
[52] M. Demicheli et al., “A 450Mb/s EPR4 PRML read/write channel,” in Proc. IEEE Intl.
Conf. Custom Integrated Circuits, May 1999, pp. 317-320.
[53] T. Pan et al., “A trellis-coded E2PRML digital read/write channel IC,” in Proc. IEEE Intl.
Conf. Solid-State Circuits (ISSCC), Febr. 1999, pp. 36-37.
[54] J. D. Coker, E. Eleftheriou, R. Galbraith, and W. Hirt, “Noise-predictive maximum
likelihood (NPRML) detection,” IEEE Trans. Magn., vol. 34, no. 1, pp. 110-117, Jan.
1998.
[55] H. Sawaguchi, M. Kondou, N. Kobayashi, and S. Mita, “Concatenated error correction
coding for high-order PRML channels,” in Proc. IEEE Intl. Conf. Global Telecommun.
(GLOBECOM), Sydney, Australia, Nov. 1998, pp. 2694-2699.
[56] E. Soljanin and A. J. van Wijngaarden, “Applications of distance enhancing codes,” IEEE
Trans. Magn., vol. 37, no. 2, pp. 762-767, March 2001.
[57] D. Yellin, A. Vardy, and O. Amrani, “Joint equalization and coding for intersymbol
interference channels,” IEEE Trans. Inform. Theory, vol. 43, pp. 409-425, March 1997.
[58] V. Y. Krachkovsky, R. W. Wood, Y. X. Lee, H. Mutoh, and M. Umemoto, “MxDFE
technology – Meeting tomorrow’s challenges in read-write channels,” Data Tech, 4th
edition, pp. 31-34, 2000.
[59] J. W. M. Bergmans, J. O. Voorman, and H. W. Wong-Lam, “Dual decision feedback
equalizer,” IEEE Trans. Commun., vol. 45, pp. 514-518, May 1997.
[60] K. C. Indukumar, Y. X. Lee, and G. Mathew, “Performance comparison of modified
multi-level decision feedback equalization detectors,” IEEE Trans. Magn., vol. 3, pp.
594-604, Jan. 1999.
[61] D. C. Wei, D. Q. Sun, and A. A. Abidi, “A 300 MHz mixed-signal FDTS/DFE disk read
channel in 0.6 µm CMOS,” in Proc. IEEE Intl. Conf. Solid-State Circuits (ISSCC), Febr.
2001, pp. 186-187.
CHAPTER 1: Introduction
15

[62] J. Moon and L.R. Carley, “Efficient sequence detection for intersymbol interference
channels with run-length constraints,” IEEE Trans. Commun., vol. 42, no. 9, pp. 2654-
2660, Sept. 1994.
[63] A. Duel-Hallen and C. Heegard, “Delayed decision-feedback sequence estimation,” IEEE
Trans. Commun., vol. 37, no. 5, pp. 428-436, May 1989.
[64] J.W.M. Bergmans, S.A. Rajput, and F.A.M. Van de Laar, “On the use of decision
feedback for simplifying the Viterbi detector,” Philips Journal Res., vol. 42, no. 4, pp.
399-428, 1987.
[65] R. E. Best, Phase-locked loops. New York: McGraw-Hill, 1984.
[66] F. Dolivo, W. Schott, and G. Ungerböck, “Fast timing recovery for partial-response
signaling systems,” in Proc. IEEE Intl. Conf. Commun. (ICC), Boston, USA, June 1989,
pp. 573-577.
[67] J. W. M. Bergmans, “Efficiency of data-aided timing recovery techniques,” IEEE Trans.
Inform. Theory, vol. 41, no. 5, pp. 1397-1408, Sept. 1995.
[68] F. M. Gardner, “Hang up in phase-lock loops,” IEEE Trans. Commun., vol. 25, pp. 1210-
1214, Oct. 1977.
[69] M. Spurbeck and R. T. Behrens, “Interpolated timing recovery for hard disk drive read
channels,” in Proc. IEEE Intl. Conf. Commun. (ICC), Montreal, Canada, June 1997, pp.
1618-1624.
[70] Z. N. Wu, J. M. Cioffi, and K. D. Fisher, “A MMSE interpolated timing recovery scheme
for the magnetic recording channel,” in Proc. IEEE Intl. Conf. Commun. (ICC), Montreal,
Canada, June 1997, pp. 1625-1629.
CHAPTER 2: Review of Timing Recovery
17

CHAPTER 2
REVIEW OF TIMING RECOVERY
In digital communication systems, synchronization is one of the most crucial tasks to be
performed at the receiver for ensuring reliable data recovery. In general, the problem of
synchronization involves carrier synchronization, timing recovery and word/block
synchronization. The objective of carrier synchronization is to generate a reference carrier
whose phase matches that of the received signal in carrier modulated systems. Since digital
recording systems are equivalent to digital baseband communication systems, the
synchronization tasks in these systems are limited to timing recovery and block
synchronization. The objective of timing recovery, which is also called clock recovery or bit
synchronization, is to recover a clock at the channel rate that is phase-locked to the readback
signal. The objective of block synchronization is to look for the start of a message in the
detected data or to divide the detected data sequence into smaller blocks. Our focus in this
thesis is on timing recovery techniques for digital recording systems. The goal is to ensure that
the samples of the readback signal taken at the set of sampling instants provided by the timing
recovery technique should result in reliable data recovery.
In this chapter, we present a reasonably broad review on timing recovery. This review
relates to different possible structures, requirements, and performance measures. It also
discusses generic timing error detector algorithms and the main issues that arise during timing
acquisition. Wherever possible, analyses and simulation results are presented for supporting the
discussion.

2.1 Introduction to Timing Recovery


Timing recovery, being a critical task at the receiver for reliable data recovery, has been a
subject under investigation for many decades (see [1] [2] and the references therein). Several
books are available (e.g. [3], [4], [5], and [6]) that deal with timing recovery issues in detail.
The key problem in timing recovery is the determination of time instants at which the received
(or, readback) signal should be sampled for doing reliable data recovery. A straightforward
solution to this problem is to send the clock signal to the receiver as side information. Since this
approach requires additional power and/or bandwidth, another approach, called self-timing, has
been widely adopted for implementing timing recovery. The timing-recovery schemes based on
self-timing recover the clock from the received signal itself [7] [8], thus they do not have the
additional requirements mentioned above. For this reason, the self-timing approach has been
utilized extensively in data transmission systems. The earliest large-scale application was in
regenerative repeaters [9] [10]. A key step in the development of a self-timing scheme is the
determination of an objective function that can be constructed using the received signal samples
CHAPTER 2: Review of Timing Recovery
18

such that the information on the correct timing can be obtained from the salient features of this
function such as minimizers, maximizers, or roots, without any ambiguity [11] [7] [1]. In this
chapter, we restrict the review to self-timing based timing recovery schemes, since read
channels in recording systems are based on this approach.
Fig. 2-1 shows the schematic of a read-write channel (i.e., transmitter and receiver) for
recording systems emphasizing the timing recovery part.

LF VCO

n(t) ak (k+τk)T
TED

ak r(t) y(t) âk
h(t) prefilter detector

Fig. 2-1: Schematic of a read-write channel with timing recovery.

As explained already in Section 1.2.2, the recording channel is modeled using the bit
response h(t) with input data bits ak∈{-1,1} at rate 1/T bits/s and additive noise n(t). The
prefilter and detector form part of the read channel. The prefilter conditions the readback signal
r(t) by doing noise limiting, equalization, or any such filtering functions. The detector operates
on the prefilter output y(t) to produce the decisions âk on the data bits ak. The detector, as
shown here, uses a phase-locked loop (PLL) [12], which consists of a timing error detector
(TED), loop filter (LF), and voltage-controlled oscillator (VCO), for generating the timing
instants at which the prefilter output should be sampled. It is assumed that the readback signal
r(t) may suffer from timing fluctuations arising from several undesired phenomena that occur in
the recording channel (e.g. variations in spindle speed and flying height, clock drift, etc.). In
addition to these, while sampling the signal y(t), the detector also needs to take into account the
delay caused by the physical or electrical elements in the recording system. The resulting
sampling instants are denoted by (k+τk)T where kT specifies the kth ideal sampling instant and
τk (normalized in units T ) is the estimate of the timing discrepancy with respect to this
reference. Clearly, τk, which we call ‘timing phase error’, accounts for unknown timing
fluctuations and delays in the recording channel.
It can be said that the TED is the most important block in the PLL, since the task of the
TED is to extract timing information from the incoming signal. The TED shown Fig. 2-1 is of
the data-aided type since it makes use of the data bits ak. There are also non-data aided TEDs
where the data ak is not used in the TED. The filtered TED output by the loop filter (LF) is used
to control the frequency and phase of the VCO. The PLL effectively performs an averaging
operation on the TED output, thereby suppressing the dynamic variations caused by noise and
other effects in the system. The loop filter has a significant role in deciding the various
properties (e.g. bandwidth, noise suppression gain, etc) of the overall PLL. Therefore, what is
of utmost importance to the PLL performance is the average output of the TED. The TED
output, in time or phase domain, can be modeled as [3]
χk = ρ(τ k ) + uk (2.1)

τ k = φk − ψ k (2.2)
CHAPTER 2: Review of Timing Recovery
19

where φk and ψ k (all normalized in units T) are the ideal timing phase and its estimate from
the PLL, respectively, and uk is the noise component of the TED output that is converted into
phase noise by the PLL. For obvious reasons, τ k is the timing phase error. The function ρ (τ ) ,
which is deterministic in nature, describes the average output of the TED over the ensemble of
possible data sequences, and is called the timing function. Since the TED output is meant to be
proportional to the timing phase error, it is clear that the function ρ (τ ) should have a zero
crossing only at τ = 0 and the sign of ρ (τ ) for τ > 0 should be opposite to that for τ < 0 .
Strictly speaking, the zero crossings in ρ (τ ) may also occur at integer values of τ . Sampling
with τ k = l for integers l ≠ 0 implies a shift of l samples in the original sequence of samples
obtained for τ k = 0 .

For small timing phase errors, we may linearize ρ (τ ) as

ρ (τ )  K dτ (2.3)

where K d is the slope of the timing function ρ (τ ) at origin τ = 0 , and is called the ‘TED
gain’. To conclude our discussion on the TED, we may say that a properly designed TED
should have zeros crossing only at τ = 0 and a large TED gain K d . The resulting linearized
discrete-time TED model is shown in Fig. 2-2.

ψk Kd uk

φk τk χk

Fig. 2-2: Linearized discrete-time phase-domain TED model.

Fig. 2-3 shows two structures that are used for the loop filter. Fig. 2-3(a) corresponds to a
simple proportional filter type with coefficient Kp and Fig. 2-3(b) corresponds to proportional-
plus-integral filter type with coefficients Kp and Kf. The PLLs that use these loop filters are
called ‘first-order PLL’ and ‘second-order PLL’, respectively. Because of these loop filter
configurations, the first-order PLL is able to handle only phase errors whereas the second-order
PLL can handle both phase and frequency errors.
Kp
Kp χk ηk
χk ηk Kf
Z-1

(a) (b)

Fig. 2-3: Structures of discrete-time loop filters: (a) proportional type, (b) proportional-plus-
integral type.
CHAPTER 2: Review of Timing Recovery
20

Fig. 2-4 shows a discrete-time phase-domain model of the VCO [3]. The gain factor Ko
determines the sensitivity of the VCO. The error between the VCO frequency and the
frequency of the signal at PLL input (i.e. bit rate 1/T) is modeled by the offset parameter λ. The
integrator performs the frequency-to-phase conversion within the VCO. Finally, υ k represents
the VCO phase noise.
Ko λ υk

ηk ψk
Z-1

Fig. 2-4: Discrete-time phase-domain model of VCO.

Let us look at an example of a first-order PLL . Combining Fig. 2-2, Fig. 2-3 and Fig. 2-4,
we get the discrete-time model of a first-order PLL as shown in Fig. 2-5. Here, the VCO phase
noise and frequency errors are ignored. The quantity θk is the input-referred phase noise and is
given by (using Fig. 2-2)
θ k = uk / K d . (2.4)

The overall loop gain Kt is given by K t = K d K p K o .

θk Kt
φk τk ψk
Z-1

Fig. 2-5: Discrete-time phase-domain model of first-order PLL ignoring VCO phase noise and
frequency error.

For a given channel, the most important part of work in the design of timing recovery is the
design of the TED. In the past fifty years, a vast array of techniques has been reported for
implementing self-timing based timing recovery in various applications. An excellent review as
well as summary of the key contributions can be found in [3, Chapter 10]. Based on how the
timing information is derived from the received signal (i.e. how the TED is designed), timing
recovery techniques can be classified as: maximum-likelihood (ML) schemes, minimum mean-
square error (MMSE) schemes, zero-forcing (ZF) schemes, threshold-crossing schemes, early-
late schemes, and nonlinear spectral line schemes, or a combination of these.

2.2 Structures, Requirements and Performance Measures


In this section, we discuss the different ways in which the timing recovery schemes can be
configured, the various requirements that should be taken into account to ensure proper
operation of the timing loop, and the various measures that can be used to assess the
performance of timing recovery schemes.
CHAPTER 2: Review of Timing Recovery
21

2.2.1 Timing recovery structures


In Fig. 2-1 we did not explicitly show the position of the sampler. We did this to keep the
discussion in Section 2.1 as general as possible. Depending upon whether the timing
information is extracted before or after this sampler, the timing recovery systems can be
classified as deductive or inductive [3]. Fig. 2-6 depicts receivers incorporating these two
configurations of timing recovery schemes.

LF VCO

TED VCO LF TED

r(t) y(t) âk r(t) y(t) yk âk


pre-filter delay detector pre-filter detector

(a) (b)

Fig. 2-6: Receivers incorporating (a) deductive and (b) inductive timing recovery schemes.

As shown, the sampler is assumed to be located between the prefilter and the detector. In
the deductive scheme, the timing information is extracted before the sampler, whereas in the
inductive scheme this is done after the sampler. Clearly, the deductive structure favors analog
implementation whereas the inductive structure favors digital implementation. Further, the
inductive structure can operate in a data-aided (DA) mode, in contrast to the deductive structure
that operates in a non-data-aided (NDA) mode. However, practical implementation of data-
aided inductive schemes often makes use of the decisions âk from the detector, rather than the
actual data ak, thereby resulting in the so-called ‘decision-directed’ (DD) mode of operation
[18].
The decision-directed inductive schemes tend to perform poorly during acquisition because
of decision errors. Clearly, the effect of decision errors is nonexistent in the deductive structure.
Thus, the deductive structure has the potential for fast and reliable timing acquisition. To
mitigate the effects of decision errors during acquisition, the inductive schemes usually use a
fixed training sequence, called preamble, which precedes the actual user data. The preamble is
usually a known periodic pattern. By using the knowledge of this preamble, acquisition speeds
in inductive schemes can be significantly improved.
Deductive and inductive timing recovery schemes may yield different performances in the
tracking mode, especially for low signal-to-noise ratio (SNR) channels. The key advantage of
the deductive scheme is that it is independent of the detection performance. However, the
deductive scheme has a big disadvantage in that it requires the received signal to have
reasonable excess bandwidth. This makes this scheme not practical for high-density recording
applications, where the readback signal has minimal or nil excess bandwidth. For this reason,
the inductive scheme attracts more interest and attention than the deductive scheme for timing
recovery in recording systems.
Currently, the trend in the industry is towards fully digital implementation of read channels.
Therefore, it is of utmost priority to ensure that the overall circuit complexity is within limits so
as to permit operation at high speeds and low power. Since cost and complexity of digital
circuits increase with data rate, timing recovery operating at as low as the symbol rate, also
CHAPTER 2: Review of Timing Recovery
22

called baud-rate, is most desirable. Symbol-rate timing recovery techniques were first proposed
in the classical paper of Mueller and Müller [1]. This paper is widely acknowledged as the basis
for most of the current algorithms for symbol-rate timing recovery [13]-[21]. In these schemes,
the timing loop is driven by samples at symbol-rate, thereby favoring digital implementation.
This is unlike the non-symbol-rate schemes, which usually have unsampled signals at the TED
input and may need additional samplers within the timing loop to generate the sampling phase
updates.

2.2.2 Timing recovery requirements


Since self-timing techniques extract timing information from the received signal, additional
requirements should be imposed on the signal to ensure that the timing loop performs
satisfactorily. For example, if the data sequence {ak} contains long runs of +1 or -1, then the
received waveform contains very minimal timing information. Consequently, the timing loop
cannot be updated frequently enough. As a result, the PLL may loose lock. Therefore, it is
required that the transmitted data have significant high frequency content and not be DC-like
for prolonged periods of time. For this reason, it is customary to encode the user data using
proper modulation codes to ensure that there is sufficient timing information embedded in the
received signal. Modulation codes can be used to limit the maximum run-lengths of data bits
[22]-[25]. Examples of such codes are DC-free and run-length-limited (RLL) codes. Such
codes are widely applied in digital recording systems. As we discussed in Section 1.2.1, the
price paid for the use of these codes is the reduced SNR in the received signal. An alternative
approach to the use of modulation codes is to randomize the data by means of a scrambler
[22][26][27], which lowers the probability of a long run length. This approach does not add
redundancy into the data sequence and has found widespread application in data modems.
For fast and reliable timing acquisition, a preamble is usually used to precede the actual
user data. The preamble often contains a known periodic pattern that can be used to acquire
clock rapidly before the user data begins. Therefore, it is desirable that the preamble contains as
many transitions as possible so as to convey sufficient timing information to enable fast
acquisition. We postpone further discussion on the issue of optimality of the preamble pattern
to Chapter 4, in which the impact of recording data on timing recovery is investigated in detail.
In addition to the items mentioned above, there are also several other issues and
requirements that must be taken into account for developing a reliable and efficient timing
recovery algorithm. Possibly the most important of these is that the timing function ρ(τ) must
not have multiple zero crossings or a zero crossing at τ ≠ 0 . Such undesired zero-crossings
could make the timing loop converge to a wrong phase, thereby resulting in significantly
inferior detection performance. In case ρ(τ)=0 for τ ≠ 0 , special precautions should be taken to
avoid wrong phases. Additional requirements are fast initial acquisition of the correct phase,
faithful tracking of the timing fluctuations in the steady state, good noise suppression,
minimum phase jitter etc. From the point of view of practical implementation, the algorithm
should be simple enough, while not compromising on reliability.

2.2.3 Timing recovery performance measures


In this section, we discuss various measures that can be used to evaluate the effectiveness
of timing recovery schemes. Our discussion includes measures that can be computed based on
theoretical analyses, as well as numerical measures that can be obtained by doing simulations.
Therefore, in cases where it is difficult to evaluate the analytical measures, one can resort to the
numerical measures for performance evaluation.
CHAPTER 2: Review of Timing Recovery
23

A. Timing sensitivity
For any system, the performance is dependent on the available timing phase margins. The
timing phase margin specifies the maximum error in the timing phase that the receiver can
tolerate before the performance becomes unacceptable. Clearly, the timing phase margin
depends on the characteristics of the channel and the timing loop. The timing sensitivity is a
measure of how fast the performance degrades with timing phase error. The sensitivity to
timing phase can be gauged by examining the eye pattern of the signal at the input of the
decision block within the receiver [52]. The decision block is, for example, the slicer in the
DFE detector [28] and the Viterbi algorithm in the partial response detector [16]. The eye
pattern, sometimes also called eye diagram, is obtained by overlaying segments of the signal in
a phase-aligned manner. That is, if the ith segment is given by y(t) for ti ≤ t < ti+ts and
i=1,2,3,…, then the ith and jth segments are related by tj=ti+nT where n is an integer and T is the
bit interval. Here, ts ≥ T is the duration of each segment. The overlaid plots resemble a human
eye. The shape and size of the ‘eye’ indicate the margins of the system against various
disturbances, such as timing phase error, intersymbol interference (ISI) and noise.
Fig. 2-7 shows the eye pattern for a noiseless channel that has raised-cosine characteristics with
25% and 100% excess bandwidth [52].
viable timing
viable timing
phases
phases
optimal optimal
timing phase timing phase
binary data, 25% excess bandwidth binary data, 100% excess bandwidth
2 1.5

1.5
1
1
0.5
Amplitude

0.5
Amplitude

0 0

-0.5
-0.5
-1
-1
-1.5

-2 -1.5
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1
interval of signal rate:T interval of signal rate:T
Timing phase error: τ Timing phase error: τ
(a) (b)

Fig. 2-7: Eye patterns for a raised-cosine channel with (a) 25% and (b) 100% excess bandwidth.

The optimal timing phase corresponds to τ=0 (τ is normalized in units T) where the eye
opening is the greatest. The eye opening decreases as the timing phase error moves away from
τ=0 in either direction. This is because the ISI, which is zero at τ=0, increases as τ deviates
from the optimum value. Clearly, as τ or ISI increases, the noise margin decreases. The width
of the interval around the optimal phase over which the eye is not closed is defined as the eye
width. In the absence of noise, the data bits can be correctly detected as long as the timing
phase is within the open eye region. Fig. 2-7 shows that compared to the case of 100% excess
bandwidth, the case of 25% excess bandwidth is more sensitive to timing phase error because
the eye closes more rapidly in the latter. For example, the eye width is T for 100% excess
bandwidth and 0.638T for 25% excess bandwidth. With zero excess bandwidth, the eye width
approaches zero. Thus, the eye width is a straightforward measure of timing sensitivity.
CHAPTER 2: Review of Timing Recovery
24

It should not be inferred, however, that all signals without excess bandwidth have zero eye
width. For example, consider the partial responses PR4 (P(D)=1-D2) and E2PR4 (P(D)=1+2D-
2D3-D4) which are widely used as equalization targets in magnetic recording systems. These
responses have no excess bandwidth and yet result in eye-patterns with open eyes, as illustrated
in Fig. 2-8. In particular, the eye width is 0.449T for PR4 and 0.345T for E2PR4.
3 8

6
2
4
1

Amplitude
2
Amplitude

0 0

-2
-1
-4
-2
-6

-3 -8
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1
interval of signal rate:T interval of signal rate:T
Timing phase error: τ Timing phase error: τ

(a) (b)

Fig. 2-8: Eye patterns of (a) PR4 and (b) E2PR4 partial response channels that have no excess
bandwidth.

Timing sensitivity can be examined using different approaches. Several researchers have
reported theoretical and simulation studies on timing sensitivity of detectors [29] [30] [31] [32]
[33]. In Chapter 3, we propose a novel analytical approach to evaluate timing sensitivity of
decision feedback and partial response detectors in magnetic recording systems [34].
B. Tracking performance
Practical timing recovery circuits in the receiver cannot perfectly duplicate the clock of
transmitter. Although the average frequency and phase of the derived clock may be correct, it
will inevitably exhibit phase jitter because of noise. Fig. 2-9 shows a typical situation of phase
jitter in the recovered clock. Statistical properties of the timing phase error (or, phase jitter) are
of much interest in the tracking mode since they determine the accuracy of the recovered timing
signal. In practice, a convenient performance measure is the root-mean-square (RMS) value of
the timing phase error.
T T

Fig. 2-9: Recovered clock with phase jitter.

Timing jitter causes mainly two degradations. First, it results in increased ISI because of
suboptimal sampling instants, thereby reducing the noise immunity of the receiver and
increasing the bit error rate. Second, the data signal emerging from the detector will generally
have similar timing jitter as in the recovered clock. This could cause degradation in the
performance of the systems that follow the receiver. Timing jitter has two principal
CHAPTER 2: Review of Timing Recovery
25

components, namely, pattern-dependent jitter and noise-induced jitter. In the tracking mode
where timing phase errors are small, it is possible to ensure that pattern-dependent jitter is very
small [35]. Therefore, noise-induced jitter is generally the main disturbance to be dealt with in
timing recovery systems. In principle, jitter can be minimized by appropriate design of the
timing recovery circuits, for example through a proper choice for the bandwidth of the timing-
recovery loop. Reduction of jitter via a reduction of the bandwidth of the loop, will adversely
affect the ability of the loop to track the phase fluctuations in the system. Because of this trade-
off, phase jitter remains a serious problem that deserves careful attention in the design of the
receiver.
Another key problem in practical timing recovery circuits is static timing phase error (also
called steady state phase bias), which may arise from unexpected errors at the TED input due to
channel variations and misequalization, or DC offsets within the timing loop. The static timing
phase error must remain very small with respect to the clock period to ensure a satisfactory
detection performance. For this reason, the steady state phase bias is an important parameter
that cannot be ignored when evaluating the tracking performance.
The PLL may lose lock when timing phase errors become too large. This can happen due
to noise as well as changes in the phase and frequency of the PLL input. For the PLL to lose
lock due to fluctuations in phase, the underlying fluctuations should be sudden and large. Two
key parameters that are used to specify the frequency range in which the PLL can track
satisfactorily are the ‘hold range’ and ‘pull-out range’ [36, Chap. 2] [3, Chap. 11]. The hold
range, which is also called hold-in range or static operating range, is the range of frequency
over which the loop will remain locked when the input frequency changes very slowly. The
pull-out range is the dynamic limit for stable operations, i.e. the loop will lose lock if the
frequency step exceeds the pull-out range.
Large noise peaks, which occur when the noise power is significant, may cause loss of lock
or cycle slips. A cycle slip is said to occur when the recovered clock drops or adds one or more
cycles of oscillations relative to the incoming signal. If there are cycle slips, the loop is
considered to have lost lock. This depends strongly on the loop signal-to-noise ratio defined as
1 K d2
SNRloop  = (2.5)
2σ θ2 Bl 2σ u2 Bl

where σ θ2 is the variance of the phase noise θ k at the PLL input (see Fig. 2-5) and Bl is the
equivalent noise bandwidth of the loop, which is given by
0.5
Bl  ∫ | Gψ (e j 2πΩ ) |2 d Ω (2.6)
−0.5

where Gψ (e j 2πΩ ) is the transfer function of the PLL from the TED input to VCO output [3,
Chap. 11], and Ω is a normalized frequency variable defined as Ω = fT . (Note that Ω is a
dimensionless frequency variable that is normalized with respect to the rate 1/ T while f is a
frequency variable that has dimension ‘Hz’. For the sake of convenience, throughout the thesis
we will use Ω instead of f to describe frequency-domain notations.) The averaged elapsed
time between cycle slips grows exponentially with the loop SNR [12]. In practice, the tracking
failure rate is required to be much smaller than the bit or symbol error rate.
CHAPTER 2: Review of Timing Recovery
26

C. Timing error detector (TED) efficiency


The capability of the timing loop to track the timing clock depends heavily on how much
timing information the TED can extract from the incoming signal, while rejecting the noise as
much as possible. To retain the signal component K dτ k while suppressing the noise uk (see
Fig. 2-2), the timing loop (i.e., PLL) must have a bandwidth that is much smaller than the
symbol rate 1/T. Therefore, all the noise will be rejected except for components near DC (i.e.,
zero frequency). If Su (e j 2πΩ ) denotes the power spectral density of uk , we see from the above
argument that Su (1) will largely determine the amount of noise-induced jitter. Thus, the timing
jitter at the TED input is proportional to Su (1) / K d2 . Using this, the efficiency of the TED is
formally defined as [18]
1 K d2
γ = (2.7)
S N R m S u (1)
where SNRm is the signal-to-noise ratio (SNR) in the matched filter bound sense, given as

SNRm =

−∞
h 2 (t )dt
(2.8)
No / 2
where h(t) is the channel bit response and N 0 / 2 is the power spectral density of the channel
noise. In (2.7), the division by SNRm removes the dependence of Su (1) on the noise level in the
channel. The TED efficiency, therefore, is a measure of the relative ease of accurate clock
recovery. Using this measure, it is possible to quantitatively evaluate the timing recovery
performance and compare various timing recovery schemes.

D. Acquisition performance measures


As mentioned already, the most desirable features during the acquisition mode are fast and
accurate acquisition of the timing phase. There are certain key parameters that are usually used
to describe the acquisition properties [3, Chap. 9], [36, Chap. 2]. The ‘pull-in time’ is the time
that the PLL takes in the beginning of the acquisition mode to adjust the VCO frequency to
approach that of the incoming signal. The ‘pull-in range’ is the frequency range over which the
loop can acquire lock. Acquisition of phase follows acquisition of frequency. The ‘lock time’ is
the time taken to acquire the phase, and is usually much smaller than the ‘pull-in time’.
The total acquisition time depends on initial errors in the frequency and phase, the loop
bandwidth, the TED used, and the type of preamble pattern used etc.. A large loop bandwidth
helps to speed up acquisition but increases phase jitter, and vice versa. Further, the preamble
pattern should be chosen optimally, so as to maximize the extracted timing information within a
given time duration.
The probability distribution function of the acquisition time for a particular timing loop
configuration is of much interest. When this distribution is combined with the false lock
probability and acquisition-end characteristics such as phase bias and phase jitter variance
before entering the tracking mode of operation, then comprehensive knowledge concerning the
acquisition performance is obtained. In practical systems, the acquisition failure rate is a critical
parameter, and stringent requirements are imposed on this rate to ensure a reliable timing
acquisition. The phenomena of false lock and hang up, mentioned earlier, are the major
obstacles to keeping the acquisition failure rate several orders below the symbol error rate. It is
CHAPTER 2: Review of Timing Recovery
27

noted that no theory presently exists that can be used to analytically predict the acquisition
failure rate. This is because the acquisition process is highly complicated when aspects such as
TED non-linearity and noise sources are taken into account. However, one can use Monte-
Carlo simulations to study the acquisition performance in specific applications. A detailed
discussion on the various aspects of timing acquisition is given in Section 2.4.

2.3 Timing Recovery Schemes and TED Algorithms


As mentioned in Section 2.1, the past several years of research on timing recovery have
resulted in a wide variety of techniques. This includes the maximum likelihood (ML) based
optimum timing recovery approach and several suboptimal approaches. Even though the ML
approach may be expensive to implement in practice, it serves as a reference as well as
motivation to develop practical schemes that are simple to implement while being near optimal.
Typical examples of suboptimal approaches are the threshold crossing based timing recovery
technique (e.g. [37] [38]), the nonlinear spectral line based technique (e.g. [39] [40]), and the
early-late timing recovery technique. In general, for these techniques to work well, the excess
bandwidth should be significant. Therefore, these techniques are not suitable for digital
recording systems, which have almost no excess bandwidth.
Most of the currently used timing recovery schemes in digital magnetic recording are based
on the minimum mean-square error (MMSE) and zero-forcing (ZF) principles [21] [41] [3,
Chap. 10]. These schemes can work well at sampling rates as low as the baud rate even when
systems have little excess bandwidth, and at the same time, yield near-optimal timing recovery
performance. They use inductive structures with data-aided (DA) or decision-directed (DD)
operation to extract timing information from an error signal. The error signal represents the
difference between the actual detector input and the desired one, and can be generated in
various simplified ways that lead to different implementation structures. The MMSE and ZF
timing recovery schemes suit a broad category of channels, modulation codes and detection
schemes. This section restricts the review and discussion to baud-rate data-aided timing
recovery techniques based on ML, MMSE and ZF schemes [3], and studies their underlying
TED algorithms and the resulting performances for magnetic read channels. In the following
subsections, the readback signal r(t) is modeled as shown in Fig. 2-1, and is assumed to contain
a delay that is unknown to the read channel. We use [3, Chap. 10] as the source for the ML,
MMSE and ZF based TED structures.

2.3.1 Data aided Maximum-Likelihood (ML) timing recovery


Denote the ideal sampling instants by kT and the actual sampling instants by tk = (k +τ )T
where τ is the timing phase error normalized in units T . Because the channel noise n(t) is
white and Gaussian, the ML estimate of the timing phase error τ can be obtained by finding the
minimum over all τ of the cost function

Λ (τ )  ∫ e 2 (t )dt (2.9)
−∞

where
e(t ) = ∑ ak h(t − (k + τ )T ) − r (t ) . (2.10)
k
CHAPTER 2: Review of Timing Recovery
28

Here, ak is the input data such that

r (t ) = ∑ ak h(t − kT ) + n(t ) . (2.11)


k

Minimization of Λ (τ ) with respect to τ results in the TED structure shown in Fig. 2-10 [3].
The algorithm of the DA ML TED reveals that the optimum TED output χ k is the product of
the data sequence ak with the samples of the differentiated output of the matched filter with
impulse response h(-t). As a result, the TED output can be written as

χ k (τ ) = ak ∫ r (u )h′(u − (k + τ )T )du (2.12)
−∞

dh(t )
where h′(t ) = T . The TED transfer characteristic, i.e. timing function, is given by [3,
dt
chapter 10]

2
∫ 2πΩA(e
j 2πΩ
ρ ML (τ ) = ) | H (Ω) |2 sin(2πΩτ )d Ω (2.13)
TN 0 −∞

where A(e j 2 πΩ ) denotes the power spectral density (PSD) of ak, H(Ω) is the Fourier transform
of h(t), and N 0 / 2 is the PSD of noise n(t). Recall that Ω is the normalized frequency variable
defined as Ω = fT .

t k = (k + τ k )T ak
matched filter
r(t) d
χk
h(-t) T
dt

Fig. 2-10: Data-aided ML TED scheme based on the matched filter.

Consider the digital magnetic recording channel with bit response h(t) given by (1.4) and
(1.6). We define the normalized user density Du , as

pw50 pw50 R
Du = = = Dch R (2.14)
Tu T
pw50
where Dch = is called the channel density, R is the code rate and Tu is the bit duration
T
before the modulation encoder. We get the unit step response of Lorentzian channel model as
(eq. (1.6))
h (t ) 0.5Vop 0.5Vop
. (2.15)
s (t ) = = 2
=
2 1 + (2t / pw50 ) 1 + (2 Rt / DuT ) 2
With Vop = 2 , the Fourier transform of s(t) is
π DchT
S (Ω) = e −π Dch |Ω| . (2.16)
2
CHAPTER 2: Review of Timing Recovery
29

Since H (Ω) = S (Ω)(1 − e − j 2πΩ ) (using eq. (1.4)), we get


2 | H (Ω) |2 2T
= (π Dch )2 sin 2 (πΩ)e−2π Dch |Ω| . (2.17)
N0T No
Substituting (2.17) in (2.13), the timing function becomes

4T π 3 Dch2
∫ Ωe
−2π Dch |Ω|
ρ ML (τ ) = A(e j 2πΩ ) sin 2 (πΩ) sin(2πΩτ )d Ω . (2.18)
No −∞

Without loss of generality, we normalize the noise power spectral density N o / 2 to 1. We want
to study the timing function for different cases of input data sequence, viz. uncorrelated data,
rate 2/3 (1,7) coded data, and rate 16/17 (0,6/6) coded data, and periodic training data with
periods 4T and 6T. The 4T and 6T patterns are given by ‘ " + + − − + + − −" ’ and
‘ " + + + − − − + + + − − −" ’, respectively. For uncorrelated data with unit power,
A(e j 2πΩ ) = 1 for all Ω. For the training data with periods of 4T and 6T with unit power,
1 ∞
A(e j 2πΩ) = ∑[δ (Ω+Ω0 + n) +δ (Ω−Ω0 − n)] for Ω0 = 1/ 4 and Ω0 =1/6 , respectively. Note that δ (Ω)
2 n=−∞

is Dirac’s delta function. That is, for any function f (Ω) , ∫−∞
f (Ω)δ (Ω −Ω0 )dΩ = f (Ω0 ) . Here, for
periodic preambles, we assume that the power in the harmonics is negligible compared to that
in the fundamental. This, in conjunction with the attenuation provided by the channel, implies
that we can neglect the harmonic contents at the channel output. For the rate 2/3 (1,7) and rate
16/17 (0,6/6) RLL coded data, the PSD can be evaluated numerically. We show the PSDs of
2/3 (1,7) and 16/17 (0,6/6) coded data in Fig. 2-11. Using these PSDs in (2.18), we can evaluate
the timing function of the DA ML TED for the Lorentzian channel model for various user
densities Du .

4 16/17 (0,6/6)
PSD (dB)

-4 2/3 (1,7)

-8
0 0.1 0. 2 0.3 0. 4 0.5

Normalized frequency: Ω

Fig. 2-11: Power spectral densities of 2/3 (1, 7) and 16/17 (0, 6/6) RLL coded data.

Fig. 2-12 shows the timing functions for the cases of uncorrelated data and training data
with periods 4T and 6T. Similarly, Fig. 2-13 shows the timing functions for the cases of data
coded with 16/17 (0,6/6) and 2/3 (1,7) RLL codes. These figures show that the value of user
density Du and the degree of randomness of the input data are two dominating factors that
influence the TED gain. The TED gain is the slope of the timing function at the origin (i.e.
CHAPTER 2: Review of Timing Recovery
30

slope at the ideal timing phase) and it represents the TED sensitivity to timing phase errors in
the incoming signal. The greater the TED gain, the better will be the ability of the TED to
detect even small timing phase errors, and vice versa. Observe that the TED gain decreases as
data becomes more random-like or as the user density increases. For low recording density or
periodic training data, the TED gain tends to become high. This explains why it is desirable to
use periodic preamble data during timing acquisition. Compared to the situations of low
recording density and the use of periodic training data, timing recovery becomes more difficult
with random data as the user density increases.
1 1.5
Du=2.0 Du=2.0
1
0.5 Du=2.5
Du=2.5
0.5
ρ (τ )

0 0
Du=3.0

ρ (τ )
-0.5 Du=3.0
-0.5
-1
-1 -1.5
-2 -1 0 1 2 -2 -1 0 1 2
Timing phase error: τ Timing phase error: τ
(a) (b)

1.5
Du=2.0
1
Du=2.5
0.5
ρ (τ )

-0.5 Du=3.0
-1

-1.5
-2 -1 0 1 2

Timing phase error: τ


(c)

Fig. 2-12: ML TED timing functions for the Lorentzian channel at different user densities Du with
(a) uncorrelated data, (b) 4T-training data, and (c) 6T-training data.

1 1

0.5 0.5

0 0
ρ (τ)
ρ (τ)

Du=2.0
Du=2.25 Du=2.0
Du=2.5 Du=2.25
Du=2.75 Du=2.5
-0.5 -0.5 Du=2.75
Du=3.0
Du=3.25 Du=3.0
Du=3.5 Du=3.25
Du=3.5
-1 -1
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1
Timing phase error: τ Timing phase error: τ
(a) (b)

Fig. 2-13: ML TED timing functions for the RLL coded Lorentzian channel at different user
densities Du with (a) 2/3 (1,7) coded data and (b) 16/17 (0, 6/6) coded data.

The ML TED performance can be assessed using the TED efficiency defined in (2.7).
Using (2.7), (2.8) and (2.13), the ML TED efficiency can be evaluated as [18]
CHAPTER 2: Review of Timing Recovery
31

γ ML =
∫ −∞
(2πΩ) 2 A(e j 2πΩ ) | H (Ω) |2 d Ω . (2.19)


2
| H (Ω) | d Ω
−∞

Because of the presence of Ω 2 in the numerator, the high frequency components of A( e j 2πΩ )
and H (Ω) have significant contribution to efficiency. Thus, for optimum efficiency, the data
should have its energy concentrated at those frequency regions where Ω 2 | H (Ω) |2 is large.
Consequently, sinusoidal data with a proper frequency will result in better efficiency compared
to random data.
For the Lorentzian channel, substituting (2.17) in (2.19), we get

γ ML = 8π 3Dch (1+ Dch2 )∫ Ω2 A(e j 2πΩ )sin2 (πΩ)e−2π D |Ω|dΩ . ch
(2.20)
−∞

We use (2.20) to evaluate the ML TED efficiency for the Lorentzian magnetic recording
channel with input training patterns 2T (‘ + − ’), 4T (‘ + + − − ’), 6T (‘ + + + − − − ’), and 8T
(‘ + + + + − − − − ’), and random data coded with rate 2/3 (1,7) code and rate 16/17 (0,6/6)
code. The calculation results are shown in Fig. 2-14. Here, the vertical axis is in dB and the
horizontal axis is the channel density Dch . Observe that the pseudo-random data, such as
uncorrelated data and the 16/17 (0, 6/6) and 2/3 (1,7) coded data, generally yields a much lower
γ ML than the periodic patterns for from medium to high densities. This is expected because of
the Ω 2 term in (2.20), as explained above.

1 2T
10
2 4T
1 3 6T
4 8T
5 uncorrelated
6 16/17(0,k)
ML TED efficiency (dB)

2 7 2/3(1,7)
5
3

0 7

5
6
-5
0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6

Channel density: Dch

Fig. 2-14: ML TED efficiency for the Lorentzian recording channel with various data patterns
versus the channel density.
Fig. 2-14 shows that the 2T pattern is unsuited for high densities. This is because the
recording channel response has little content at the Nyquist frequency 1/(2T) and its bandwidth
decreases rapidly with the channel density Dch. The efficiency γ ML declines, in general, with
Dch . But, the γ ML curves for 6T and 8T are non-monotonic. This is because the term
CHAPTER 2: Review of Timing Recovery
32

(2πΩ) 2 | H (Ω) |2 is non-monotonic in Dch for the Lorentzian channel with the 6T or 8T
pattern. Fig. 2-14 also shows that, for medium to high channel densities, the 6T and 8T patterns
outperform the 4T pattern, and the (1,7) coded data outperforms the (0,k) coded data. However,
if we account for the code rate by plotting Fig. 2-14 with user density on the horizontal axis, we
will find that the efficiencies resulting from the (1,7) and (0,k) coded data are comparable.

2.3.2 Minimum Mean-Square Error (MMSE) timing recovery


Applying the principle of the Least-Mean-Square (LMS) adaptation technique [52, Chapter
6] to timing recovery leads to MMSE timing recovery schemes. That is, the timing information
is obtained as the instantaneous gradient, with respect to phase, of an appropriate error signal
[11] [18]. Following [3, Chap. 10], we give a brief review of MMSE TED approaches.
tk=(k+τk)T
yk′
Td/dt wk χk
r(t) y(t) yk ek
f(t) wk

dk

Fig. 2-15: TED structure of MMSE timing recovery scheme.

Fig. 2-15 depicts the MMSE TED structure. The continuous-time prefilter with impulse
response f(t) is such that its output y(t), when sampled at the correct phase, approximates a
certain desired sequence dk. So an error ek is formed as ek = yk − d k where yk is the sampled
output of the prefilter. Since the noise at the prefilter output may not be white, a whitening filter
with taps wk is used to whiten the noise. This is because MMSE and ML are equivalent when
the underlying noise is white Gaussian and the channel has no excess bandwidth. Hence, the
aim is to adjust the phase by minimizing the power of the whitened error signal (e ⊗ w) k
∂ ( e ⊗ w ) 2k
where ‘ ⊗ ’ denotes convolution. Observe that = 2( e ⊗ w ) k ( y ′ ⊗ w ) k where
∂τ
dy(t)
yk′  T t=(k+τk )T
is the sampled derivative of the prefilter output y(t). Thus, the output of
dt
MMSE TED in Fig. 2-15 is given by
χ k = (e ⊗ w)k ( y′ ⊗ w)k . (2.21)
The timing loop acts to force the average of χk towards zero and in doing so minimizes the
power of (e ⊗ w) k . Note that many of the existing MMSE schemes merely minimize the power
of ek yielding a suboptimal version as [21] [41]

χ k = ek yk′ . (2.22)

Note from Fig. 2-15 that the MMSE TED needs two samplers that are required to operate
in exact synchronism. This makes its practical implementation difficult. To circumvent this, it
CHAPTER 2: Review of Timing Recovery
33

has been proposed to generate the samples yk′ from yk using a discrete-time differentiator,
instead of an additional sampler [18]. This leads to the simplified structure shown in Fig. 2-16.
tk=(k+τk)T yk′ χk
ck wk
r(t) y(t) yk ek
f(t) wk
dk

Fig. 2-16: Simplified TED structure of MMSE timing recovery scheme.

If y(t) has zero excess bandwidth, then yk′ can be generated from yk by using an ideal
differentiator with discrete-time impulse response ck given by [2]
 0, k = 0
 (2.23)
ck =  ( −1) k
 k , k ≠ 0.
In practice, the response ck may be limited to the first few terms when excess bandwidth is
insignificant. A common practice has been to use just the three center-taps of ck, viz. c-1=1,
c0=0 and c1=-1. This approximation for a simplified MMSE timing recovery scheme has been
widely used in various applications [2] [42]. The performance of the simplified scheme is
comparable to the original scheme of Fig. 2-15 [18].
Let q (t ) = (h ⊗ f )(t ) be the band-limited equalized channel response up until the prefilter
dq (t )
output. Then, with qk = q(t ) |t =kT , qτk = q(t ) |t =( k +τ )T and pτk = T , we can write
dt t =( k +τ )T

dy (t )
yk′ = T = (a ⊗ pτ ) k (2.24)
dt t =( k +τ )T

yk = y (t ) |t =( k +τ )T = (a ⊗ qτ ) k + vk (2.25)

where vk is the noise part of yk . Further, let g k be the target response such that
d k = (a ⊗ g ) k . Then, substituting for ek = yk − d k and yk′ in (2.21) using these expressions,
we get the timing function for MMSE TED as
0.5
ρ MS (τ ) = ∫ ( j 2πΩ)QA | W |2 (Qe j 2πΩτ − G )∗ e j 2πΩτ d Ω (2.26)
−0.5

where Q, G, and W are the Fourier transforms of qk, gk and wk , respectively, A is the PSD of the
data sequence ak, and X * denotes the conjugate of a complex quantity X . For the sake of
convenience, these frequency domain quantities are written without the argument e j 2πΩ in
(2.26). The above expression (2.26) does not contain a noise term since the cross-correlation of
the noise components in yk and yk′ is odd-symmetric about the origin. The resulting TED
efficiency is
CHAPTER 2: Review of Timing Recovery
34

0.5
1 [ ∫−0.5 (2πΩ) A | Q | | W | d Ω]
2 2 2 2

γ MS = (2.27)
SNRm 0.5 (2πΩ) 2 A | Q |2 | W |4 Vd Ω

−0.5

where V is the PSD of noise vk in yk .

For the simplified MMSE timing recovery, the timing function is


0.5
ρ MS (τ ) = ∫ AQC (Qe j 2πΩτ − G )∗ | W |2 e j 2πΩτ d Ω (2.28)
−0.5

where C is the Fourier transform of the impulse response ck of the differentiator. The
corresponding TED efficiency is given by
0.5
1 [ ∫−0.5 j 2πΩAQCG | W | d Ω] .
∗ 2 2

γMS = 0.5
(2.29)
SNRm

A | Q |2 | C |2 | W |4 Vd Ω
−0.5

2.3.3 Zero-Forcing (ZF) timing recovery


The origin of ZF timing recovery can be traced back to the work of Mueller and Müller [1].
The ZF timing recovery technique provides an alternative and usually cheaper approach
towards achieving near ML performance. It has been applied to a variety of codes, channels and
detection techniques [1] [20] [43] [15] [16] [19]. The principle of the ZF approach is to
generate the control information by cross-correlating a filtered form of the data ak with an error
ek that represents the difference between the actual and desired inputs of the detector. The
control loop acts to eliminate this cross-correlation in order to force intersymbol interference
components to zero.
tk=(k+τk)T aˆk χk
zk
detector p k
r(t) y(t) yk ek
f(t) wk
dk

Fig. 2-17: TED structure of a ZF timing recovery scheme.

The ZF TED structure is depicted in Fig. 2-17 [3, Chap. 10]. It is very similar to the
simplified MMSE scheme in Fig. 2-16. The derivative component yk′ in Fig. 2-16 is replaced
by zk in Fig. 2-17. The signal zk is produced from the decisions âk rather than from the noisy
signal y(t) or the sampler output yk . The resulting ZF TED output is

χ k = (e ⊗ w)k zk = (e ⊗ w)k (aˆ ⊗ p)


 k (2.30)

where p k is the impulse response of the filter used for generating the local reference zk , which
is normally chosen as ( w ⊗ p ) k where pk is the sampled derivative of the equalized channel
response q(t ) . Thus, the additional sampler and differentiator required in the MMSE schemes
CHAPTER 2: Review of Timing Recovery
35

are avoided in the ZF scheme. The timing loop acts to eliminate all cross-correlation between
ek and zk, thereby forcing a linear combination of ISI components towards zero. ML
performance will accrue when wk and pk are properly chosen and aliasing is avoided.
The timing function for the ZF TED given in Fig. 2-17 is [3, Chap. 10]
0.5
ρ ZF (τ ) = ∫ A(Qe j 2πΩτ − G )WP ∗ d Ω (2.31)
−0.5

where P denotes the Fourier transform of the filter response pk in the reference path, i.e.
P = WP where P is the Fourier transform of the response pk . Finally, the ZF TED efficiency
can be obtained as
0.5
1 [∫ A | P |2 d Ω]2
γ ZF = −0.5 . (2.32)
0.5
SNRm A | P |2 | W |2 Vd Ω
∫−0.5

Finally, we summarize the approaches of MMSE and ZF timing recovery schemes. In the
MMSE scheme, timing-error information is produced by correlating the filtered error (e ⊗ w) k
with a noisy reference sequence ( y ′ ⊗ w) k . This approach has various disadvantages. First, it
requires a second sampler to produce yk′ (the simplified MMSE approach removes this
drawback). Secondly, formation of (e ⊗ w)k ( y′ ⊗w)k is comparatively complicated when
carried out digitally. Thirdly, when the data ak exhibits long runs of identical symbols, the
TED output will be fully induced by noise. As a result, the TED may suffer from additional
noise-induced jitter. The advantage of MMSE approach is that it minimizes the mean-square
error (MSE) at the detector input. Compared to the MMSE TED, the ZF TED provides an
alternative and much cheaper approach aiming at ML performance. It has none of the three
disadvantages mentioned above, thus presenting an extraordinarily simple and effective scheme
for timing recovery. The resulting MSE can be poorer than that of an MMSE scheme. This may
be a potential disadvantage of ZF timing recovery.

2.4 Timing Acquisition


The purpose of timing acquisition techniques is to acquire the correct sampling phase
before the detection of user data begins. This task is accomplished during a limited length of
the preamble that precedes the user data. The acquisition behavior depends not only on the
preamble pattern but also on the TED scheme used. Compared to the TED used during the
tracking mode of timing recovery, the TED schemes during the acquisition mode are almost
always fitted with special mechanisms to ensure fast and accurate acquisition of the timing
phase. In this section, we discuss the main problems that may occur during timing acquisition,
solutions to take care of these, and some measures for evaluating the acquisition performance.

2.4.1 Timing acquisition problems: false lock and hang up


In the acquisition mode, the phase of the recovered clock varies over a considerable range
since the initial phase may deviate far from the correct phase. In the presence of large timing
phase errors, the timing function becomes highly nonlinear and the TED may even fail to give a
CHAPTER 2: Review of Timing Recovery
36

proper indication of the timing phase error. Further, with large timing phase errors, the bit
decisions from the receiver are most likely erroneous. In such situations, two serious problems
that the timing recovery loop may face are false lock and hang up. False lock means that the
timing loop gets locked to a wrong phase. Hang up means that the timing loop takes a long
period of time to get out of certain intervals of incorrect phases. Occurrence of false lock results
in complete failure of acquisition, whereas hang up results in retarding the acquisition process
significantly. Therefore, a serious consideration of these two problems is critical and essential
for ensuring a fast and reliable timing acquisition. This section elaborates the problems of false
lock and hang up, emphasizing why they occur and how to avoid them.

A. Hang up
The timing phase generated by the PLL loop may dwell at incorrect phases for prolonged
intervals of time before the loop settles to equilibrium on the correct phase. This phenomenon
is known as hang up and it was first investigated for time division multiple access (TDMA)
modems [44]. Since the PLL always settles eventually and hang up is a low probability event,
the hang up problem has been obscure for most PLL users who do not require fast acquisition.
However, hang up is very troublesome for applications in magnetic recording systems where
fast timing acquisition is absolutely necessary.
The reason for the occurrence of hang up is that the timing function has no response or
unstable nulls at one or more undesired phases. Fig. 2-18(a) illustrates one such timing
function. Observe that the timing function is discontinuous at the timing phase errors ±0.5
(normalized in units T ). Fig. 2-18(b) shows the closed-loop phase convergence curves (at the
PLL output) based on this timing function for 100 trials. In this experiment, the initial timing
phase error is fixed at +0.5 and the noise level is such that the root-mean-square (RMS) jitter
value at the output is 0.05. Observe that the convergence curves show a very sluggish behavior
in moving away from the initial timing phase error 0.5. The hang up phenomenon illustrated
here is caused by the TED that yields outputs with different signs at either sides of the
discontinuous timing phase error 0.5. In the presence of noise, the loop phase is misled to move
in inconsistent directions thereby resulting in hang up. Observe that the worst case delay in Fig.
2-18(b) is nearly 100 bits for the loop to begin moving away from the initial phase. Such a large
delay is intolerable for timing acquisition in magnetic recording.
0.6 1.2

1
Timing phase error: τ

0.8

0.6
ρ (τ)

0
0.4

0.2

-0.6 -0.2
-1 -0.5 0 0.5 1 0 50 100 150 200

Timing phase error: τ Number of bits

(a) Timing function with unstable (b) Closed-loop phase convergence


nulls at τ=±0.5 curves (RMS jitter = 0.05 )

Fig. 2-18: Demonstration of hang up problem when the unstable nulls in the timing function arise
from discontinuities.
CHAPTER 2: Review of Timing Recovery
37

Hang up may also occur when the slope of the timing function at undesired nulls is
opposite to that at the correct phase. One such hang up example is demonstrated in Fig. 2-19.
Observe that the timing function in Fig. 2-19(a) is continuous at the nulls at ±0.5. Further, these
nulls are unstable since for timing phase errors around ±0.5, the TED makes the timing phase
drift in opposite directions. In the presence of noise, if the initial phase happens to occur near
these unstable nulls, the loop phase will linger around these nulls over a prolonged period of
time before it consistently converges to the desired phase. One such example is shown in Fig.
2-19(b). The sluggish convergence of the phase caused by hang up arising from the unstable
nulls at 0.5 is clearly illustrated. These nulls would not cause false lock since false lock requires
the correct slope in the vicinity of the zero crossing, which is not true for the zeros in the case
considered here. In practice, since timing phase errors at ±0.5 are unstable, even small noise at
the TED input can make the phase drift away from these unstable phases and eventually
converge to one of the desired phases. Therefore, hang up is more probable than false lock in
this case.
0.4 1.5
Timing phase error: τ

1
ρ (τ)

0 0.5

-0.4 -0.5
-1 -0.5 0 0.5 1 0 50 100 150 200

Timing phase error: τ Number of bits

(a) Timing function with unstable nulls (b) Closed-loop phase convergence
at ±0.5 curves ( RMS jitter = 0.05 )

Fig. 2-19: Demonstration of hang up problem when the timing function has undesired nulls with
wrong slopes.

Hang up can be avoided by applying a large restoring force in the chosen direction
whenever the timing phase in the loop is near the unstable nulls. This can be accomplished by
adding hysteresis into the timing function in order to prevent the timing loop from ever
dwelling near these undesired phases. Fig. 2-20(a) illustrates such an example, showing a
significant hysteresis range in the timing function over the problematic unstable phase intervals
around ±0.5. The timing function is made up of parallel and partially overlapped linear
segments. At any point in time, one of these segments is active and uniquely determines the
TED output for the corresponding input. The TED output follows the changes of the input until
the end of a segment is reached. Then, the TED output jumps to the nearest segment. At that
point, the TED output is sharply decreased towards zero, implying that the phase is close to the
desired one. Therefore, the timing loop is likely to lock to this desired timing phase. Because of
the hysteresis in the timing function, there are no longer any unstable nulls at which the loop
might hang up for a prolonged period. Using this TED and the initial phase 0.5, we studied the
closed-loop phase convergence curves with the noise power selected to result in 0.05 RMS
phase jitter. The result is illustrated in Fig. 2-20(b). As expected, all the curves converge to the
desired timing phases within 100 bits. No hang up occurs even during the first 50 bits.
CHAPTER 2: Review of Timing Recovery
38

0.8 1.5

Timing phase error: τ


1

0.5
ρ(τ)

-0.8 -0.5
-1 0 1 0 50 100 150 200
Timing phase error: τ Number of bits

(a) Timing function with hysteresis (b) Closed-loop phase convergence


curves ( RMS jitter = 0.05 )

Fig. 2-20: Demonstration of how hang up can be avoided by incorporating hysteresis in the timing
function.

B. False lock
The TED is expected to produce zero output only for the desired input phase, thus forcing
the timing loop to lock on this input phase. However, if the TED yields zero output (i.e. timing
function becomes zero) for any undesired phases as well, then the timing loop may lock on
these incorrect phases, thereby resulting in false lock. Occurrence of false lock may lead to
failure of the timing acquisition process. For this reason, false lock must be avoided at any cost.
This is even more important in DFE read channels since erroneous past decisions can easily
induce false lock under random initial conditions [47].
0.2 1.5
Timing phase error: τ

0.5
ρ (τ)

-0.2 -0.5
-1 -0.5 0 0.5 1 0 50 100 150 200
Timing phase error: τ Number of bits

(a) Timing function with false lock (b) Closed-loop phase convergence curves
phases at ± 0.5 (RMS jitter = 0.05 )

Fig. 2-21: Demonstration of false lock phenomenon arising from undesired zero-crossings of the
timing function.
Consider the timing function shown in Fig. 2-21(a). Apart from the zero-crossings for
integer values of τ, which correspond to the desired phases, there are also spurious zero
crossings around ±0.5. The effect of this timing function is illustrated in Fig. 2-21(b), which
shows 100 runs of phase convergence curves embedded in 5% RMS jitter. The initial timing
phase error is uniformly distributed in the range [0.3, 0.7]. Observe that the curves with initial
phase errors located around +0.5 fail to converge to the desired timing phase. Thus, due to the
CHAPTER 2: Review of Timing Recovery
39

zero crossings of the timing function at the undesired phases ±0.5, the timing acquisition with
initial timing phase errors around ±0.5 is doomed to result in acquisition failure. We may
remark that the phase ambiguity resulting from the convergence of the timing loop to non-zero
integer values of τ (e.g. ±1) is of no big concern for acquisition. This is because this ambiguity
can be easily resolved at the block synchronization stage just before the detection of user data
begins.
An effective way to prevent false lock is to determine the cause of false lock and improve
the TED algorithm to prevent the underlying phenomenon. False lock can also be avoided by
ensuring that the initial phase is near a desired phase. A periodic clock run-in pattern of
preamble is used to this end and the prior knowledge of the preamble pattern is exploited.
Another way is to use a non decision-aided deductive TED architecture for acquisition so that
false lock arising from initial erroneous decisions can be prevented [45, 42].

2.4.2 TED solutions for reliable acquisition


We now briefly discuss two examples of TED schemes that do not suffer from false lock
and hang up. These examples illustrate the typical principles that one may use while working in
the decision-directed mode or non-decision-directed mode of timing acquisition.

A. Decision-directed TED schemes


Consider the decision-directed (DD) timing acquisition scheme presented in [16] for PR-IV
recording systems. The TED output is given by χ k = yk −1 xˆk − yk xˆk −1 , where yk is the input of
the partial response detector and xˆk is the estimate of the ideal value of yk . The estimate xˆk is
usually obtained by passing yk through a comparator with multiple thresholds. The DD TED
schemes exploit the knowledge of the preamble pattern for ensuring the correctness of xˆk .
Using the 4T pattern as the preamble, the ideal detector input is xk = ak − ak −2 = 2ak , which
takes on only two values {+2, -2}. This special structure of xk is exploited to minimize errors
in the estimates xˆk . Instead of using zero as the comparator threshold for making decisions, a
variable threshold mechanism given by

 2 if yk ≥ 2sgn( xˆk − 2 )
xˆk =  (2.33)
 −2 if yk < 2sgn( xˆk − 2 )

is used for obtaining xˆk from yk [16] [46]. Here, sgn( x) is the sign-function. The variable
threshold used here provides hysteresis as well as a large boost in the noise margin while
detecting xk . This makes the TED immune to problems arising from erroneous decisions
during acquisition.
For fast and reliable acquisition in DFE recording systems, two novel solutions to DD TED
schemes are proposed in Chapter 5. One solution uses a special equalizer and several switches
to ensure the correctness of the decisions at the slicer output [47]. Another solution employs a
modified threshold filter to help the detector make reliable decisions and to introduce hysteresis
in the TED transfer function [48]. It is shown using simulation results that the proposed TED
schemes help to effectively prevent false lock and hang up problems.
CHAPTER 2: Review of Timing Recovery
40

B. Non-decision-directed TED schemes


The DD TED schemes inevitably introduce a delay between the signal samples at the
sampler output and the TED output. This delay is not desirable since it can degrade the timing
loop performance and stability [3, Chap. 11]. The non-decision-directed (NDD) schemes, on
the other hand, form the TED output directly based on the equalizer or detector input without
having to know the decisions. Thus, the NDD TED schemes can effectively eliminate a large
part of the latency in the timing loop.
It is shown in [45, 49, 28, 50] that a near-optimum acquisition phase can be determined via
the ‘band-edge component maximization’ (BECM) technique. We briefly present the principle
of BECM here. Assume that the channel response h(t), whose Fourier transform is H (Ω) , has
less than 100% excess bandwidth and that the channel output is sampled at rate 1/T with a
timing phase error τ normalized in units T. Then, the Fourier transform H s (e j 2πΩ ) of the
sampled channel response within the bandwidth 1/T can be written as
H (Ω)e j 2πΩτ + H (Ω −1)e j 2π (Ω−1)τ
Hs (e j 2πΩ ) = for 0 ≤ Ω ≤ 1. (2.34)
T
The BECM technique suggests that the optimum phase for sampling the replay signal should
maximize the magnitude of H s (e j 2πΩ ) at the Nyquist frequency Ωn = 0.5 . Since the Lorentzian
model for the step response in magnetic recording is an even function (see eq. (2.15)), it can be
easily seen that H (−0.5) = H (0.5) . Using this in (2.34), we get

H (0.5)e jπτ + H (−0.5)e − jπτ 2 cos(πτ ) (2.35)


H s (e jπ ) = = H (0.5)
T T
which results in the optimum sampling phase at
τ = k for integer k . (2.36)

VCO LF TED
readback
signal
low-pass r(t) rk
equalizer
filter
(k+ τ )T

Fig. 2-22: Non-decision-directed TED for timing acquisition in recording systems.

By way of illustration we show in Fig. 2-22 the structure of a NDD TED scheme for timing
acquisition in recording systems. The bandwidth of low pass filter is same as that of the channel
H(f). The 4T pattern ‘ + + − − ’ and 6T pattern ‘ + + + − − − ’ are the most commonly used
acquisition preamble patterns for (0, k) (e.g. PRML system [16]) and (1,7) (e.g. MDFE system
[47]) coded magnetic recording channels. The sampler output sequences at the ideal sampling
phase, which result in BECM, are shown in Fig. 2-23 for the Lorentzian channel model in the
absence of noise. Exploiting the nature of the patterns formed by the readback signal samples,
we developed TEDs that are very simple to implement [51]. The corresponding TED outputs
for the 6T and 4T preambles are given by
χ k = (rk + rk −1 )[sgn(rk ) − sgn(rk −1 )] for the 6T-preamble (2.37)
CHAPTER 2: Review of Timing Recovery
41

and
χ k = ( −1) k rk rk −1 for the 4T-preamble. (2.38)

1.5
1
1

0.5
0.5
rk

rk
0 0

-0.5
-0.5
-1
-1
-1.5
50 60 70 80 90 100 50 60 70 80 90 100
Time: k Time: k
(a) (b)

Fig. 2-23: Symbol-rate samples (marked by ‘o’ at the ideal sampling phase) of replay signal of the
Lorentzian recording channel corresponding to (a) 6T pattern and (b) 4T pattern preamble
sequences.

Fig. 2-24 shows the timing functions corresponding to the TEDs given by (2.37) and (2.38)
for 6T and 4T sequences. Observe that the functions are saw-tooth and sinusoidal in nature with
zero-crossings located at only the ideal phases. Clearly, the resulting timing acquisition will not
suffer from the false lock problem. However, there is a potential that hang up may occur.
1 0.3

0.2
0.5
0.1

0
ρ(τ)

ρ(τ)

0
-0.1

-0.5 -0.2

-0.3

-1 -0.4
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1

Timing phase error: τ Timing phase error: τ


(a) (b)

Fig. 2-24: Timing functions corresponding to the NDD TEDs given by (2.37) and (2.38) for using
(a) 6T pattern and (b) 4T pattern preamble sequences.

While the NDD acquisition TED schemes shown above are able to effectively prevent
acquisition problems arising from wrong decisions, it is clear that these schemes can not be
used in the tracking mode since the input data sequence during tracking is random in nature.
Since the detected data during tracking are more reliable, a decision-directed TED can be used
during tracking. For this reason, when the timing loop enters the tracking mode, the TED has to
be switched into a DD based scheme.
CHAPTER 2: Review of Timing Recovery
42

2.4.3 Phase acquisition bound


A quantity that is useful in assessing the quality of timing acquisition is the variance of the
phase at the acquisition end. In this section, we analyze the timing acquisition process for a
given preamble length, and derive an expression for the resulting phase variance.
During the phase acquisition process, the received signal is essentially a sinusoid since the
input data sequence (i.e., the preamble) is periodic in nature. The channel, being band-limited,
is assumed to reject the harmonic components of the periodic input data except for the
fundamental. To study the phase acquisition bound, we assume that the received signal is
sampled at rate 1/T and the phase is estimated using N samples. The resulting samples are given
by
yk = Ac sin(2πΩc k + φ ) + nk , 0 ≤ k ≤ N − 1 (2.39)
where Ωc is the fundamental frequency of the preamble sequence that is normalized with
respect to 1/ T , Ac is the sinusoidal amplitude which is determined by the channel transfer
function at frequency Ωc , φ is the phase to be estimated and is assumed to be uniformly
distributed over the interval [−π , π ) , and noise nk is the channel noise and is assumed to be
white and Gaussian with zero mean and variance σ n2 .
For the observation of N samples y0, y1, …, yN-1, define y = [ y0 y1 " y N −1 ] . An
obvious approach would be to estimate the phase φ ∈ [−π , π ) by maximizing the posteriori
probability density function (PDF) p (φ | y ) . With the help of Bayes’ rule we may write
p(y | φ ) p(φ )
p(φ | y ) = where p (y | φ ) is the conditional PDF of y for a given phase φ ,
p(y )
p( y ) is the PDF of y , and p(φ ) is the prior PDF of phase φ . Since p( y ) does not depend
on φ and φ is uniformly distributed in [ −π , +π ] , maximization of p (φ | y ) boils down to
maximization of p (y | φ ) . This leads to the maximum-likelihood (ML) estimate of the phase
φ.
We note from (2.39) that for a given φ , the variables yk are independent and Gaussian
distributed with mean Ac sin(2πΩ c k + φ ) and variance σ n2 . Hence, p (y | φ ) can be written as
N /2
 1   1 N −1
2
p (y | φ ) =  2 
exp  − 2 ∑y k − Ac sin(2πΩ c k + φ )  . (2.40)
 2πσ n   2σ n k =0 
N −1
The term ∑| y
k =0
k − Ac sin(2πΩ c k + φ ) |2 can be expanded as
N −1 2 2 N −1
NAc A

k =0
2
| yk | +
2
− c
2
∑ cos(4πΩ k + 2φ ) − 2
k =0
c B 2 + C 2 cos(φ − φ0 ) (2.41)
N −1 N −1
C
where B = ∑
k =0
y k sin(2πΩ c k ) , C = ∑
k =0
yk cos(2πΩc k ) , and φ0 = tan −1 ( ) . By choosing
B
N appropriately so as to make the acquisition period NT considerably longer than the
preamble period 1/ f c , the third term of (2.41) amounts to zero. Substituting (2.41) in (2.40),
we get
{
p(y | φ ) = α exp β cos(φ0 − φ ) } (2.42)
CHAPTER 2: Review of Timing Recovery
43

N /2
 1   1  N −1 NAc  
2
Ac B 2 + C 2
2 ∑
where α =  exp  −  | y |2
+   and β = . Since α
2  k
 2πσ n   2σ n  k =0 2   σ n2
and β are positive scalars, the value of φ which maximizes p (y | φ ) in (2.42) is equal to φ . 0
That is, the ML estimate of the phase is given by
 N −1 
 ∑ yk cos(2πΩ c k ) 
φˆML = φ0 = tan −1  kN=−01  , − π ≤ φˆML < π . (2.43)
 
 ∑ yk sin(2πΩc k ) 
 k =0 
This result is identical to the ML estimate of the phase of an unmodulated carrier, as shown in
[52, Chap. 6]. The statistical behavior of (2.43) is also studied in [52, Chap. 6] [53]. The result
shows that the conditional PDF of φˆML , for a given φ , is determined by the number of samples
2
N in y and the signal-to-noise ratio γ  Ac /(2σ n2 ) according to the expression
pφˆ (φˆ | φ ) = e− Nγ [1 + 4π Nγ cos(φˆ − φ )eNγ cos (φ −φ )Q(− 2Nγ cos(φˆ − φ ))] ,
2 ˆ
φˆ ∈ [−π , π ) . (2.44)
ML

Using the fact that pφˆ (φˆ | φ ) is the circularly shifted version of pφˆ (φˆ) restricted by phase φ
ML ML

in the range [−π , π ) , one can readily verify that the PDF of φˆ can be obtained as
e− Nγ
pφˆ (φˆ) = [1 + 4π Nγ cos φˆ ⋅ eNγ cos φ Q(− 2 Nγ cos φˆ)] .
2 ˆ
(2.45)
ML

Accordingly, the variance of φˆ is obtained as
π
σ φ2ˆ = ∫ φˆ 2 pφˆ (φˆ)dφˆ
ML −π ML

− Nγ (2.46)
e π
∫ φˆ 2 [1 + 4π N γ cos φˆe Nγ cos φ Q(− 2 N γ cos φˆ)]dφˆ .
2 ˆ
=
π 0

This equation produces maximum variance σ φˆ = π 2 / 3 when the SNR γ = Ac 2 /(2σ n2 )


2
ML max

σ n2
decreases to zero, and produces minimum variance σ φ2ˆ  2
when γ is sufficiently
ML min
NAc
high. Thus, the result (2.46) provides a phase acquisition bound for the observation period of
NT seconds during acquisition. In other words, this result indicates that for a preamble with a
length of N samples used to estimate an ideal phase, the variance of estimate error cannot be
σ n2 π2
less than 2
and greater than .
NAc 3

2.4.4 Time constant and acquisition speed


The phase adjustment in the timing recovery loop that employs a first-order PLL can be
described as
τ k +1 = τ k − αχ k (2.47)
where α is a small step size parameter, τ k is the normalized timing phase error between the
VCO output and the PLL input signals at the instant k, and χ k is the TED output at the instant
k. Here, we assume that the VCO has a unity gain factor. Recall from Section 2.1 that the
CHAPTER 2: Review of Timing Recovery
44

timing function ρ (τ ) , which represents the averaged TED output, is an odd symmetric
function about the origin τ =0. Further, ρ (τ ) can be linearized with the slope Kd as in (2.3)
over a small area around τ =0, i.e. ρ (τ ) = K dτ . Beyond this linear area in the range [ −0.5 ,
0.5 ], ρ (τ ) can be viewed as a nonlinear function of τ , which is in between the two lines with
slopes s1 and s2 , i.e. | s1τ |≤| ρ (τ ) |≤| s2τ | . The slope of ρ (τ ) is usually maximum at the
origin τ =0 and minimum at τ =±0.5 in the range [ −0.5 , 0.5 ]. Thus, we can determine s1 and

s2 as s1 = ρ′(0.5) and s2 = ρ′(0) = Kd , respectively, with ρ ′ representing . Normally, both s1

and s2 are of non-negative values, i.e. s2 ≥ s1 ≥ 0 .
We use the simplified model of the timing function described above for analyzing the
average convergence behavior of the iteration (2.47). Let τ k denote the average of τ k , i.e.,
τ k = E[τ k ] . Then, the above model suggests that we can bound τ k as
(1 − α s2 ) k | τ 0 |≤| τ k |≤ (1 − α s1 ) k | τ 0 | (2.48)
where τ 0 is the initial timing phase error in the range [ −0.5 , 0.5 ]. Further, we model the
phase convergence in the linear region as τ k = (1 − α K d ) k τ 0 . In convergence studies, it is a
regular practice to specify the convergence speed using a time constant. Accordingly, we let
(1 − α K d ) k τ 0 = e − k / Γτ 0 (2.49)
where Γ denotes an effective time constant value and is given by
1
Γ=− . (2.50)
ln(1 − α K d )
Clearly, Γ is an important parameter for timing acquisition, because it explicitly describes the
acquisition phase convergence rate. For instance, the value of time constant Γ indicates the
time that is required to reduce the timing phase error by 63.21% in magnitude from the initial
value (i.e. e −1 times smaller). Usually, a period of four time constants is sufficient for
acquisition because the timing phase error reduces to 1.83% of its initial value within this
period.
Let us examine the fastest and slowest acquisition situations. The fastest acquisition
process is marked by the minimum time constant
1 1
Γ min = − =− , (2.51)
ln(1 − α s2 ) ln(1 − α K d )
and the slowest acquisition process is marked by the maximum time constant
1
Γ max = − . (2.52)
ln(1 − α s1 )
By way of illustration, we show in Fig. 2-25 a timing function and its corresponding phase
convergence region, which is bounded by the convergence curves corresponding to the
maximum and minimum time constants. Even though Γ max may be significantly larger than
Γ min , the quantity that is more important during acquisition is Γ min . This is because the
linearity of the timing function can be assumed to be true over a significantly large range of
timing phase errors around τ = 0 by virtue of the special techniques that are normally used
during acquisition. This is also true for tracking mode since the timing phase error during
CHAPTER 2: Review of Timing Recovery
45

tracking mode is usually confined to a very small region around τ = 0 , thereby guaranteeing
that ρ (τ ) is linear during tracking.

3 0.5

Timing phase error


0.4

s1τ 0.3 Γmax


ρ(τ)
ρ( τ )

0
0.2

0.5e-1
s2τ 0.1
Γmin
-3 0
-0.5 0 0.5 0 50 100 150 200 250 300
Timing phase error: τ Time (bits)
(a) (b)

Fig. 2-25: Illustration of (a) a timing function and (b) its corresponding phase convergence with
maximum and minimum time constants.

The characterization of phase convergence using minimum and maximum time constants is
sufficient to describe the timing acquisition for first-order timing-recovery loops. However, for
timing acquisition in the presence of a frequency error, timing recovery needs a second-order
PLL [3, Chap.11]. The operation of a second-order PLL can be described as (see Fig. 2-3(b))
τ k +1 = τ k − αχ k − δ k (τ k ), δ k (τ k ) = δ k −1 (τ k ) + βχ k −1 (2.53)
where α and β are step-size parameters, and δ k is the amount of frequency compensation.
The discrete-time PLL may be related to its continuous-time counterpart by replacing the
sample-delay operator z by the approximation z ≈ 1 + j 2π fT , for small | 2π fT | . For a
continuous-time second-order PLL, the natural frequency ω n and the damping factor ς are
two important parameters in the study of convergence. The corresponding parameters ω nd and
ς d for the discrete-time second-order PLL can be related to the step size parameters α and β
according to [3, Chap. 11]
α
ω nd T = β K d K o , ς d = Kd Ko (2.54)
2 β
where K d and K o are the gain factors of the TED and VCO, respectively, of the PLL.
For timing acquisition using a second-order PLL, the acquisition speed can be described
using its responses to the unit phase step and the unit phase ramp (frequency step). By way of
illustration, we show these responses for a continuous-time second-order PLL in Fig. 2-26(a)
and (b), respectively. The damping factor ς is used as a parameter and the timing phase error
decays with increase of the relative time t / T . Observe that large values of ς yield a slower
response to frequency steps but a faster response to phase steps. In practice, values of ς
significantly smaller than 0.707 are not often encountered. For a discrete-time second-order
PLL, the damping factor ς d is typically chosen as 0.707 and the natural frequency ω nd is
chosen for ω nd T < 0.1 , which covers most cases of practical interest. The parameters α and
CHAPTER 2: Review of Timing Recovery
46

β of the loop filter need to be properly chosen, taking into account the TED gain K d in order
for the loop to have the prescribed damping factor ς d and natural frequency ω nd .

1
ς= 1
ς=
0.3 0.3
0.5 0.8 0.5
Timing phase error

Timing phase error


0.707 0.707
0.6
0.5 1.0 1.0
2.0 0.4 2.0
5.0 5.0
0.2
0
0

-0.2

-0.5 -0.4
0 1 2 3 4 5 6 7 8 0 1 2 3 4 5 6 7 8

Relative time: t/T Relative time: t/T

(a) (b)

Fig. 2-26: Second-order continuous-time PLL transient responses with (a) timing phase error due
to a unit phase step and (b) timing phase error due to a unit frequency step. The damping factor ς
is used as a parameter.

2.5 Summary
In this chapter, we have presented a detailed review on timing recovery. This review
relates to timing recovery structures, requirements, and performance measures of timing
recovery. We have also discussed the generic timing error detector algorithms and the main
issues that arise during timing acquisition. Numerical analyses and simulation results are
presented for supporting our discussions. The review and the related discussions presented here
constitute the basics for the study and development of timing recovery techniques for digital
recording systems in the following chapters of this thesis.
CHAPTER 2: Review of Timing Recovery
47

References:

[1] K. H. Mueller and M. Müller, “Timing recovery in digital synchronous data receivers,”
IEEE Trans. Commun., vol. 24, no. 5, pp. 516-531, May 1976.
[2] S. U. H. Qureshi, “Timing recovery for equalized partial-response systems,” IEEE Trans.
Commun., vol. 24, no. 12, pp. 1326-1330, Dec. 1976.
[3] J. W. M. Bergmans, Digital baseband transmission and recording. Boston: Kluwer
Academic Publishers, 1996.
[4] E. A. Lee and D. G. Messerschmitt, Digital communication. Boston: Kluwer, 1988.
[5] H. Meyr and G. Ascheid, Synchronization in digital communications –Volume I. New York:
Wiley, 1990.
[6] W. C. Lindsey and M. K. Simon, Telecommunication systems engineering. Englewood
Cliffs, NJ: Prentice-Hall, 1973.
[7] L. E. Franks, “Carrier and bit synchronization in data communication- a tutorial review,”
IEEE. Trans. Commun., vol. 28, no. 8, pp. 1107-1120, Aug. 1980.
[8] R. D. Gitlin and J. Salz, “Timing recovery in PAM-systems,” Bell Syst. Tech. J., vol. 50, pp.
1645-1669, May-June 1971.
[9] E. D. Sunde, “Self-timing regenerative repeaters,” Bell Syst. Tech. J., vol. 36, pp. 891-937,
July 1957.
[10] W. R. Bennett, “Statistics of regenerative data transmission,” Bell Syst. Tech. J., vol. 37,
pp. 1501-1542, Nov. 1958.
[11] H. Kobayashi, “Simultaneous adaptive estimation and decision algorithm for carrier
modulated data transmission systems,” IEEE Trans. Commun. Tech., vol. 19, pp. 268-280,
June 1971.
[12] F. M. Gardner, Phaselock techniques. 2nd ed. New York: Wiley, 1979.
[13] C. -P. J. Tzeng, D. A. Hodges, and D. G. Messerschmitt, “Timing recovery in digital
subscriber loops using baud-rate sampling,” IEEE J. Select. Areas Commun., vol. 4, pp.
1302-1311, Nov. 1986.
[14] C. A. Ehrenbard and M. F. Tompsett, “A baud-rate line interface for two-wire high-speed
digital subscriber loops,” in Proc. IEEE Intl. Conf. Global Telecommun. (GLOBECOM),
1982, pp. 931-935.
[15] J. Armstrong, “Symbol synchronization using baud-rate sampling and data-sequence-
dependent signal processing,” IEEE Trans. Commun., vol. 39, no. 1, pp. 127-132, Jan.
1991.
[16] R. D. Cideciyan, F. Dolivo, R. Hermann, W. Hirt, and W. Schott, “A PRML system for
digital magnetic recording,” IEEE J. Select. Areas. Commun., vol. 10, no. 1, pp. 38-56, Jan.
1992.
[17] T. Aboulnasr, M. Hage, B. Sayar, and S. Aly, “Characterization of a symbol rate timing
recovery technique for a 2B1Q digital receiver,” IEEE Trans. Commun., vol. 42, no. 2/3/4,
pp. 1409-1415, Febr./March/April 1994.
[18] J. W. M. Bergmans, “Efficiency of data-aided timing recovery techniques,” IEEE Trans.
Inform. Theory, vol. 41, no. 5, pp. 1397-1408, Sept. 1995.
[19] J. W. M. Bergmans and H. W. Lam, “A class of data-aided timing recovery schemes,”
IEEE Trans. Commun., vol. 43, no. 2/3/4, Febr./March/April 1995.
[20] A. Jennings and B. R. Clarke, “Data-sequence selective timing recovery for PAM
systems,” IEEE Trans. Commun., vol. 33, pp. 729-731, July 1985.
CHAPTER 2: Review of Timing Recovery
48

[21] P. M. Aziz and S. Surendran, “Symbol rate timing recovery for higher order partial
response channels,” IEEE J. Select. Areas Commun., vol. 19, no. 4, pp. 635-648, April
2001.
[22] W. R. Bennett and J. R. Davey, Data transmission. New York: McGraw-Hill, 1965.
[23] P. H. Siegel, “Recording codes for digital magnetic storage,” IEEE Trans. Magn., vol. 21,
no. 5, pp. 1344-1349, Sept. 1985.
[24] P. H. Siegel and J. K. Wolf, “Modulation and coding for information storage,” IEEE
Commun. Magazine, vol. 29, no. 12, pp. 68-86, Dec. 1991.
[25] B. H. Marcus, P. H. Siegel, and J. K. Wolf, “Finite-state modulation codes for data
storage,” IEEE J. Select. Areas Commun., vol. 10, no. 1, pp. 5-37, Jan. 1992.
[26] J. E. Savage, “Some simple self-synchronizing digital data scramblers,” Bell Syst. Tech. J.,
vol. 46, pp. 449-487, Febr. 1967.
[27] R. D. Gitlin and J. F. Hayes, “Timing recovery and scramblers in data transmission,” Bell
Syst. Tech. J., vol. 54, pp. 569-593, March 1975.
[28] J. Salz, “On mean-square decision feedback equalization and timing phase,” IEEE Trans.
Commun., vol. 25, no. 12, pp. 1471-1476, Dec. 1977.
[29] P. Kabal and S. Pasupathy, “Partial response signaling,” IEEE Trans. Commun., vol. 23,
pp. 921-934, Sept. 1975.
[30] J. W. M. Bergmans, “Performance consequences of timing errors in digital magnetic
recording,” Philips J. Res., vol. 42, no. 3, pp. 281-307, 1987.
[31] S. Roy and S. A. Raghavan, “Timing sensitivity of (MMSE) linear and DF equalization for
digital magnetic recording channels,” Proc. IEE, Part-I, vol. 140, pp. 169-178, June 1993.
[32] J. Moon, “Timing sensitivity in discrete-time equalization,” IEEE Trans. Magn., vol. 29,
pp. 4027-4029, Nov. 1993.
[33] A. D. Weathers, “Sensitivity of PRML systems to timing offsets,” IEEE Trans. Magn.,
vol. 32, pp. 3971-3973, Sept. 1996.
[34] J. J. Wang and G. Mathew, “Timing sensitivity of decision feedback and partial response
detectors in magnetic recording,” in Proc. IEEE Intl. Conf. Global Telecommun.
(GLOBECOM), San Francisco, USA, Dec. 2000, pp. 1872-1876.
[35] F. M. Gardner, “Self-noise in synchronizers,” IEEE Trans. Commun., vol. 28, no. 8, pp.
1159-1163, Aug. 1980.
[36] R. E. Best, Phase-locked loops. New York: McGraw Hill, 1997.
[37] B. R. Saltzberg, “Timing recovery for synchronous binary data transmission,” Bell Syst.
Tech. J., vol. 46, pp. 593-622, March 1967.
[38] E. Roza, “Analysis of phase-locked timing extraction circuits for pulse code transmission,”
IEEE Trans. Commun., vol. 22, pp. 1236-1249, Sept. 1974.
[39] J. E. Mazo, “Jitter comparison of tones generated by squaring and by fourth-power
circuits,” Bell Syst. Tech. J., vol. 57, pp. 1489-1498, May-June 1978.
[40] M. Oerder and H. Meyr, “Digital filter and square timing recovery,” IEEE Trans.
Commun., vol. 36, no. 5, pp. 605-612, May 1988.
[41] H. Shaffiee, “Timing recovery for sampling detectors in digital magnetic recording,” in
Proc. Intl. Conf. Commun. (ICC), Dallas, Texas, June 1997, pp. 577-581.
[42] W. L. Abbott and J. M. Cioffi, “Timing recovery for adaptive decision feedback
equalization of the magnetic storage channel,” in Proc. IEEE Intl. Conf. Global
Telecommun. (GLOBECOM), San Diego, CA, Dec. 1990, pp. 1794-1799.
[43] F. M. Gardner, “A BPSK/QPSK timing-error detector for sampled receivers,” IEEE Trans.
Commun., vol. 34, pp. 423-429, May 1986.
[44] F. M. Gardner, “Hangup in phase-lock loops,” IEEE Trans. Commun., vol. 25, no. 10, pp.
1210-1214, Oct. 1977.
CHAPTER 2: Review of Timing Recovery
49

[45] D. L. Lyon, “Timing recovery in synchronous equalized data communication,” IEEE


Trans. Commun., vol. 23, pp. 269-274, Febr. 1975.
[46] J.-Y. Lin and C.-H. Wei, “Fast timing recovery scheme for class IV partial response
channels,” Electronics Letters, vol. 31, no. 3, pp. 159-161, Febr. 1995.
[47] Y. X. Lee, L. K. Ong, J. J. Wang, and R. W. Wood, “Timing acquisition for DFE
detection,” IEEE Trans. Magn., vol.33, pp. 2761-2763, Sept. 1997.
[48] J. J. Wang, J. W. M. Bergmans, Y. X. Lee, and G. Mathew, “DFE timing acquisition:
Analysis and a new approach for fast acquisition,” IEEE Trans. Magn., vol. 36, no. 5, pp.
2193-2196, Sept. 2000.
[49] J. E. Mazo, “Optimum timing phase for an infinite equalizer,” Bell Syst. Tech. J., vol. 54,
no. 1, pp. 189-201, Jan. 1975.
[50] B. Farhang-Boronjeny, “Near optimum timing recovery for digitally implemented data
receivers,” IEEE Trans. Commun., vol. 35, no. 9, pp. 1333-1336, Sept. 1990.
[51] J. J. Wang and G. Mathew, “Timing recovery for MxDFE detectors with reduced clock-
rate,” Presented at the 3rd meeting of M3DFE consortium, Kyongju, Korea, May 1999.
[52] J. G. Proakis, Digital communications. McGraw-Hill, 1995.
[53] R. Reggiannini, “A fundamental lower bound to the performance of phase estimators over
Rician-Fading channels,” IEEE Trans. Commun., vol. 45, no. 7, pp. 775-778, July 1997.
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
51

CHAPTER 3
TIMING SENSITIVITY ANALYSIS
FOR MAGNETIC RECORDING

At high densities and/or data rates, the data recovery performance of a read channel
depends strongly on the accuracy of the timing phase delivered by the timing recovery loop.
The timing sensitivity of recording channels indicates how fast the data detection performance
deteriorates when the timing phase deviates from its ideal value. In this chapter, we develop
analytical approaches for investigating the effect of timing error on the performance of MDFE
(multi-level decision feedback equalization) and PR4-VD (partial-response class-IV with
Viterbi detection) detectors in magnetic recording systems.
The chapter is organized as follows. A brief survey of the published work on timing
sensitivity is given in Section 3.1, followed by a summary of the contributions of this chapter.
Section 3.2 presents the magnetic recording system model incorporating MDFE and PR4-VD
detectors. The detection performance of MDFE in the presence of timing phase errors and
random phase jitter is investigated in Section 3.3. The detection performance of PR4-VD in the
presence of timing phase errors is investigated in Section 3.4. The chapter is concluded in
Section 3.5.

3.1 Introduction
Equalizer, data-detector and timing recovery circuits are among the most important blocks
that constitute the read-channel of a magnetic recording system. The equalization and detection
blocks usually receive considerable attention from researchers. However, with the steady
increase in data rates and recording densities, timing recovery has become crucial for reliable
data recovery. Therefore, it is necessary to evaluate the sensitivity of a detector to timing errors,
since timing sensitivity has significant influence on the required timing margins for reliable
acquisition and tracking [1, Chap. 9].
The problem of timing sensitivity was studied by Tufts and Berger [2], Gonsalves and
Tufts [3], Eriksson and Van den Elzen [4], and Yeo and Farhang-Boroujeny [5] in
communication systems, and by Bergmans [6], Bergmans and Janssen [7] in recording systems,
among other researchers. In these papers, a variety of methods was proposed for designing
transmitter-receiver filters and equalizers that are robust to timing jitter. Bergmans [9] reported
a theoretical study on the impact of static timing phase errors on various equalization and
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
52

detection schemes based on an equivalent discrete-time recording system with timing errors.
Investigations on the timing sensitivity of partial response systems were reported by Osawa et
al. [10], and Grami and Pasupathy [11]. Further, an analysis of timing sensitivity for decision
feedback and partial response (PR) equalizers with low-pass and matched-filter front-end filters
was reported by Roy and Raghavan [12]. Moon [13] examined the impact of a timing phase
error on finite-length equalizer performance in partial response and decision feedback
equalization (DFE) schemes. The effect of a static timing phase error on the performance of
Viterbi detector (VD) in PR systems was studied in [14], assuming white noise at the detector
input.
In this chapter, we propose a novel analytical approach for evaluating the timing sensitivity
of decision feedback and Viterbi based partial response detection schemes. For a given channel
and equalizer, this approach computes the probability density function (PDF) of the residual ISI
that arises due to misequalization and timing error. This PDF is in turn used to compute the
error event rate to quantify the detection performance in the presence of a timing error. This
approach provides a fast and simple method to evaluate the timing sensitivity in the presence of
misequalization. Unlike existing approaches, it can cover the performance consequences of
both static phase error and random timing jitter. We consider multi-level DFE (MDFE) and
class-IV partial response with Viterbi detection (PR4-VD) schemes for our study. MDFE, and
its advanced versions, are excellent detectors for (1,7)-coded recording channels, in particular
at high densities [15], [16]. From the family of partial response schemes, we choose the partial
response 1-D2 (D denotes one bit delay operator) for our study since it is one of the most widely
studied responses [17]. Nevertheless, our approach can be extended to more advanced detectors
from the DFE and PR families.

3.2 Channel Model with MDFE and PR4-VD Detectors


Fig. 3-1 shows the magnetic recording system model incorporating MDFE and PR4-VD
detectors.
n (t)
ak fo rw a rd x (t) xk yk
h (t) s lic e r âk
e q u a liz e r
1 /T
fe e d b a c k
e q u a liz e r
(a )

n (t)
ak y (t) yk
h (t) e q u a liz e r v ite rb i d e te c to r aˆ k − k 0
1 /T
(V D c la s s -IV )

(b )

Fig. 3-1: Magnetic recording system model incorporating (a) MDFE and (b) PR4-VD detector.

The input data {ak}, with ak∈ {-1,1} being in the NRZ format, is a sequence of 2/3(1,7)
coded bits for MDFE and 16/17 (0,6/6) coded bits for PR4-VD. The recording channel is
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
53

characterized by a bit response h(t) and additive white Gaussian noise (AWGN) n(t) of two-
sided power spectral density N 0 / 2 . Media noise, which is important at high densities [18]
[19], is not included in the channel model. The equalizer output, designated x(t ) for the MDFE
case and y (t ) for the PR4-VD case, is sampled at rate 1/T, where T is the channel bit duration,
resulting in xk = x((k + τ )T ) and yk = y ((k + τ )T ) for MDFE and PR4-VD, respectively.
Here, τ is the timing phase error (normalized in units T) at the sampler.
In the simulations, the continuous-time channel response h(t) is replaced by a fractionally-
spaced discrete-time filter with impulse response hm given by hm = h(t ) |t = mT / L where L is a
positive integer representing an oversampling factor. We use L=4. Consequently, the equalizer
is replaced by an FIR filter with a T/L-spaced impulse response, denoted by wm. Similarly, the
noise n(t) is replaced by discrete-time white Gaussian noise nm whose variance σ n2 is
determined by the SNR defined as1
 Vop2 L 
SNR [dB] = 10 log10  2  (3.1)
σ R
 n 
where Vop is the base-to-peak amplitude of the isolated transition response (see eq. (1.6)) and
L
the factor represents the bandwidth expansion resulting from the modulation encoding rate
R
R and the oversampling factor L in the recording channel. Note that σ n2 R L is the variance of
the noise in the user bandwidth R / T . Finally, an interpolator will be used at the equalizer
output to generate xk = x((k + τ )T ) and yk = y ((k + τ )T ) for MDFE and PR4-VD,
respectively, for a normalized timing phase error τ. The design of the equalizer is done as
described in [20]. The decision delay of the Viterbi detector (VD) is denoted ko .
We denote by qk the T-spaced discrete-time equalized bit response, from the recording
channel input to the sampler output. The equalizers in MDFE are designed to result in a target
response of the form q−1 D −1 + q0 + q1 D , with q0 > q−1 = q1 > 0 , at the slicer input [20]. Thus,
because of the d=1 code constraint, the ideal slicer input (noiseless, τ=0) has four levels2, ± q 0
and ± ( q−1 + q0 + q1 ) , and the decision at the kth instant is aˆk . This is illustrated in Fig. 3-2(a),
which shows the equalized bit response qk for τ=0 and the slicer input for different timing
phase errors. For the partial response system with PR target 1-D2, Fig. 3-2(b) shows the
equalized bit response qk for τ=0 and the equalizer output for different timing phase errors.

The figures on the right hand side of Fig. 3-2 show the inputs of the slicer and VD as
functions of timing phase error τ at the sampler. They resemble standard eye-patterns. Fig. 3-2
1
We may remark that the SNR defined in (3.1) does not take into account the recording density and code rate. This
definition helps to normalize the noise power spectral density with respect to the reference signal amplitude Vop for a
given user data rate. Starting from these normalized signal and noise levels, the effects of coding and density are
accounted for in the way that bit response hm and the noise variance σ n2 are defined. Thus, the SNR defined in (3.1)
can be used as a common reference to compare the performances with different codes and/or densities.
2
Note that due to the d=1 code constraint, the triplet { ak, ak-1, ak-2 } can assume only 6 out of the 8 possibilities, viz.
±{+1, +1, +1}, ±{+1, +1, -1}, ±{-1, +1, +1}. The patterns ±{-1, +1, -1} are not allowed.
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
54

clearly shows four decision levels for MDFE and three levels for PR4-VD when τ=0. The
smearing of decision levels at the ideal phase τ=0 is due to the residual ISI resulting from
misequalization. As the magnitude of timing phase error is increased from 0 to 0.5, the residual
ISI increases due to mismatch between the sampled equalized bit response and the ideal target
response. Therefore, the noise margins between the decision levels decrease sharply as τ
increases, and they vanish for τ=±0.5. Further, the effect of erroneous decision feedback in
MDFE around τ=±0.5 can also be clearly seen. In the timing sensitivity studies reported here,
the equalizers and the gain are not optimized for each timing phase error; instead they are kept
fixed at the optimum values corresponding to τ=0.
4

1 q0
2

q1
Slicer input
q −1
Equalized bit

0.5
0
response

0 -2

-4
-0.5 -1 -0.5 0 0.5 1
1 11 21

T im e (b its) T im in g p h ase erro r: τ


(a)

1 3
q0
2
0.5
Detector input

q1 1
Equalized bit
response

0 0

-1
-0.5
q2 -2

-1 -3
0 10 20 30 40 -1 -0 .5 0 0 .5 1
T im e (b its) T im in g p h ase erro r: τ

(b )

Fig. 3-2: Equalized bit response qk and the noiseless detector inputs yk for different timing phase
errors τ for (a) MDFE and (b) PR4-VD detectors at user density 2.5.

3.3 Timing Sensitivity of MDFE Detector


In this section, we develop an analytical approach to investigate the timing sensitivity of
the MDFE detector. This is done in two parts. First, we address the case of a static timing phase
error. Second, we address the case of random phase jitter by combining the result for a timing
phase error with the distribution of the jitter. We use error event rate (EER) as the performance
measure for studying the timing sensitivity. We use EER rather than bit error rate (BER) since
numerical evaluation of EER is easier compared to BER. In MDFE, the EER is the probability
of making a decision error when there are no erroneous decisions in the feedback register. Our
approach given below is aimed at deriving an expression for EER with the timing phase error τ
as a variable.
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
55

3.3.1 Error event rate of MDFE with static timing phase error
Since MDFE is a zero-threshold detector and the outer levels ± ( q−1 + q0 + q1 ) are
relatively large, its performance is mainly determined by the value of inner level ± q 0 . This is
true even with non-zero timing phase errors, because the slicer inputs at the outer levels vary
very slowly with τ for moderate excursions of τ (see Fig. 3-2(a)). Thus, the performance
mainly depends on the effective noise acting on the inner levels. Therefore, in our analysis, we
only consider the noise and residual ISI seen by the inner levels.
Let the feedback register be free from erroneous decisions. Then, with timing phase error τ,
the slicer input can be written as (see Fig. 3-1(a))
y k (τ ) = d k (τ ) + ξ k (τ ) (3.2)

where dk(τ) is either the inner level q0 (τ )ak or the outer level ( q−1 (τ ) + q0 (τ ) + q1 (τ ) ) ak , and
ξ k (τ ) = zk (τ ) + vk is the sum of residual ISI and noise. Here, zk(τ) is the residual ISI due to
timing phase error and misequalization, and vk is the Gaussian channel noise. The variables
q−1 (τ ) , q0(τ) and q1(τ) represent the values of the main samples q-1, q0 and q1 with timing
phase error τ. Assuming that decision errors occur only at the inner levels, the probability of
detection error (i.e., EER) for a given timing phase error can be given by
PrM DFE ( err τ ) = Prin ⋅ Pr (ξ k (τ ) > q 0 (τ ) ) (3.3)

where ‘Pr’ denotes probability and Prin  0.608 is the probability of occurrence of the inner level,
which is given by the sum of the probabilities of the patterns ±{-1, +1, +1} and ±{+1, +1, -1}
[22]. To compute the probability Pr (ξ k (τ ) > q0 (τ ) ) , we need to obtain the PDF of the random
variable ξk(τ). Note that ξk(τ) is not Gaussian because of the residual ISI zk(τ). Since the noise
component vk is a zero-mean Gaussian random variable, its PDF can be written as

( ) ( )
−1
pv (v ) = 2πσ v exp − v 2 ( 2σ v2 ) , −∞ < v < ∞ (3.4)

where σv2 is the variance of noise vk. Let pz(z|τ) be the PDF of the residual ISI component zk(τ).
Since the data ak is independent of the noise vk, the PDF of ξk(τ) can be obtained as

pξ (ξ | τ ) = ∫ pz ( z | τ ) pv (ξ − z ) dz. (3.5)
−∞

Substituting (3.5) and (3.4) in (3.3), we get


Prin
∞ ∞
 (v − z ) 2 
PrM DFE ( err | τ ) =
2π σ v ∫ ∫ p z ( z | τ ) exp  −
 2σ v2 
 dzdv
q 0 (τ ) −∞ (3.6)

 q (τ ) − z 
= Prin ⋅ ∫
−∞
p z ( z | τ )Q  0
 σv
 dz

where Q (u ) = ( 2π )−1 ∫ ∞ exp ( − x 2 2 )dx . It remains to determine the PDF, pz(z|τ), of inner-level
u

residual ISI zk(τ).

3.3.2 Distribution of inner-level ISI


In the absence of channel noise (i.e., vk=0), using (3.2), we can write
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
56

zk (τ ) = yk (τ ) − q0 (τ ) ak . (3.7)

This ISI can be modeled as the output of a filter with impulse response ci(τ) and input ak. The
coefficients ci(τ) can be obtained from the equalized bit response coefficients qi(τ) and the
feedback equalizer taps bi according to
 q i (τ ), − M ≤ i < − 1, N b + 1 < i ≤ N
 q (τ ) − q (τ ) + b , i = − 1
 −1 1 1
ci (τ ) =  (3.8)
 0, i = 0, 1
 q i (τ ) − bi , 2 ≤ i ≤ Nb + 1

where integers M>0 and N>0 define the span of the equalized bit response qi(τ), and N b + 1 is
the number of feedback equalizer taps. The tap values bi , i = 1, 2," , N b + 1 , at T-spacing, are
given by
b1 = q1 − q−1 = 0
bi = qi for i = 2," , N b + 1.
Using (3.8), we can express inner-level ISI as
N N
z k (τ ) = ∑
i =− M
ci (τ ) a k − i = ∑
i=− M
f k ,i (τ ) (3.9)

where f k ,i (τ ) = ci (τ ) ak −i . We need the joint PDF of the random variables {f k ,i (τ )} to compute


the PDF of zk(τ). Since these are correlated random variables because of the correlation in the
(1,7) coded bits {ak}, it requires considerable mathematical and computational effort to obtain
the exact PDF of zk(τ). Hence, we make the simplifying assumption that the random variables
{ak} are independent. We will later confirm the adequacy of this assumption. Thus, we get from
(3.9)
pz ( x | τ ) = p f− M ( x | τ ) ⊗ p f− M +1 ( x | τ ) ⊗ " ⊗ p f N ( x | τ ) , −∞ < x <∞ . (3.10)

Here, p z ( x τ ) and p f i ( x τ ) are the PDFs of z k (τ ) and f k , i (τ ) , respectively, and ‘ ⊗ ’


denotes convolution. Since ak takes on only two values +1 or –1 with probability 0.5, and ci(τ)
is deterministic, we obtain the PDF of f k , i (τ ) for a given τ as

0.5 if x ∈{+ci (τ ), −ci (τ )}


p fi ( x | τ ) =  (3.11)
0 otherwise.
Taking the Fourier transform on the both sides of (3.10), we get
N
Pz ( w | τ ) = ∏ P (w | τ )
i =− M
fi (3.12)

where Pz (w| τ ) and Pfi (w | τ ) denote the Fourier-transforms of pz ( x | τ ) and p fi ( x | τ ) ,


respectively. It follows from (3.11) that Pf ( w τ ) = cos( wci (τ )) , −∞< w< ∞. Substituting this in
i

(3.12), we get
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
57

N
Pz ( w | τ ) = ∏ cos( wc (τ )),
i =− M
i − ∞ < w < ∞. (3.13)

Taking the inverse Fourier transform on the both sides of (3.13) and noting that Pz(w|τ) is real
and even-symmetric, we obtain
∞ N
1
pz ( x | τ ) =
π ∫ ∏ cos( w c (τ )) cos( wx)dw,
0 i =− M
i
−∞ < x <∞. (3.14)

Fig. 3-3 shows the PDF of inner-level ISI, computed using (3.14), for various values of τ.
6

4 τ= 0%
pz(z/τ)

τ= 5%
3 τ=10%
τ=15%
2

0
-1 -0.5 0 0.5 1

ISI amplitude: z

Fig. 3-3: PDF of inner level residual ISI in the presence of timing phase errors (obtained by
computation using (3.14)).
We observe that as τ increases, the peak value of the distribution falls off rapidly and the
peak splits into two symmetric peaks about zero. For small τ, the residual ISI is dominated by
misequalization, which is normally very small in amplitude. This explains the large amplitude
single peak observed for small τ. As τ increases, the residual ISI becomes dominated by timing
phase error. Furthermore, the term c-1(τ)=q-1(τ)-q1(τ) becomes more significant for large τ and
hence the distribution peaks at ±(q-1(τ)-q1(τ)). Note that q-1(τ)-q1(τ)=0 for τ=0. This explains
the fall in peak value as well as the observed peak splitting.
8 4

6 τ =0% 3
τ =5%
pz (z/τ )

pz (z/τ )

4 2

2 1

0 0
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1

ISI amplitude: z ISI amplitude: z

4 4

τ =10% 3
τ =15%
3
pz (z/τ )

pz (z/τ )

2 2

1 1

0 0
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1

ISI amplitude: z ISI amplitude: z

Fig. 3-4: PDF of inner-level residual ISI in the presence of timing phase errors (obtained by
simulation).
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
58

We made Monte Carlo simulations to verify the above analytical observations. Fig. 3-4
shows the PDF of inner-level ISI for different τ, obtained by simulations. Observe that the
simulations do match the analytical results in Fig. 3-3 in terms of shape, peak amplitude, and
peak splitting of the distributions.

3.3.3 Evaluation of MDFE detection performance


In this section, we evaluate the error event rate (EER) of MDFE in the presence of timing
phase error and random phase jitter, by making use of the ISI distribution developed above. The
EER in the presence of a timing phase error can be evaluated by substituting the PDF of inner
level ISI given by (3.14) for the one in (3.6) and evaluating the integral. Doing this for the
Lorentzian channel with user density 2.5, we get the performance curves shown in Fig. 3-5. The
EER estimated using direct bit-by-bit simulations is also shown in the figure. Observe that the
analytically obtained values match well with those from simulations, especially for not too
large timing phase error. Note that the SNR loss due to a 5% timing phase error is about
0.4~0.5 dB at 1e-5 EER. Studies conducted for negative timing phase errors revealed that the
SNR loss is less compared to positive phase errors (e.g. 0.1 dB less for τ = −5% compared to
that for +5%). This is due to the asymmetry of the equalized bit response in MDFE, as was also
observed in [9].

τ = 5% ; sim ulated
10
-2 τ = 10% ; sim ulated
τ = 15% ; sim ulated
Error event rate (EER)

-4
10

τ=15%
-6 τ=10%
10
τ=5%
τ=0%
-8
10
22 24 26 28 30

SNR (dB)

Fig. 3-5: Error event rate of MDFE in the presence of timing phase error τ (solid curves: EER
obtained by calculation using (3.6)).

We can easily extend the above approach to evaluate the impact of steady state timing loop
phase jitter on detection performance. Let the PDF of steady state phase jitter be pτ(τ). Then,
the resulting EER can be obtained as

EER = ∫ PrMDFE (err | τ ) pτ (τ ) dτ (3.15)
−∞

where PrMDFE (err τ ) is obtained from (3.6). Assuming Gaussian distributed phase jitter in the
tracking loop, the EER calculated using (3.15) and that obtained by bit-by-bit simulations are
shown in Fig. 3-6. Observe that there is a close agreement between the results obtained from
the computations and simulations. Further, these curves show that RMS phase jitter of 3%, 5%
and 8% results in SNR loss of about 0.1 dB, 0.4 dB and 1.2 dB, respectively, at EER 1e-5,
compared to the ideal timing.
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
59

-2
10
+ 3% rm s jitter ; sim ulated
x 5% rm s jitter ; sim ulated
* 8% rm s jitter ; sim ulated

Error event rate (EER)


-4
10

-6
10

8% rm s jitter
5% rm s jitter
-8
10 3% rm s jitter
Ideal tim ing

22 24 26 28 30
SN R (dB )

Fig. 3-6: Error event rate of MDFE in the presence of random phase jitter (solid curves: EER
obtained by calculation using (3.15)).

The close agreement between the computed and simulated EER values in Fig. 3-5 and Fig.
3-6 justifies our simplifying assumption on the independence of data bits. For the sake of the
completeness, we give in Appendix 3.A the analysis of the ISI distribution by taking the
dependence of (1,7) data into account.

3.4 Timing Sensitivity of PR4-VD


The EER of PR4-VD is the probability that the Viterbi detector (VD) chooses an incorrect
sequence of bits aˆ = {aˆ k } instead of the correct sequence a = {a k } . For this to happen, the
path metric associated with the sequence of bits a in the trellis for the given partial response
target should be greater that that with aˆ [21, Chap. 9]. Thus, with timing phase error τ, the EER
can be given by
PrPR ( err τ ) = ∑ Pr( a ) Pr( aˆ a , τ ) (3.16)
a,aˆ

where

Pr(aˆ a ,τ ) = Pr ( m (a τ ) > m (aˆ τ ) ) . (3.17)

Here, m(a|τ) denotes the path metric for the path a and Pr(a ) denotes the probability of the
data sequence a .
Let gk be the ideal coefficients of the partial response target. For example,
[ 0 g1 , g2 ] = [1, 0, −1] and g k = 0 for k < 0 and k > 2 for PR4-VD. Following the
g ,
notation from the preceding section, let qk (τ ) denote the T-spaced equalized channel bit
response with timing phase error τ. Then, the path metric m(aˆ τ ) is given by [1, Chap. 3]

m(aˆ | τ ) = ∑ [ (a ⊗ q(τ ))k + vk − (aˆ ⊗ g ) k ] .


2
(3.18)
k

With ci(τ)=qi(τ)-gi, the residual ISI zk(τ) at the detector input with τ is given by (3.9).
Substituting for qi(τ) in (3.18), we get
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
60

m(aˆ | τ ) = ∑ [2(e ⊗ g ) k + zk (τ ) + vk ]2 (3.19)


k

where ek  (ak − aˆk ) / 2 is the error sequence between the two paths a and â . Substituting
(3.19) in (3.17), we get
 
Pr(aˆ a,τ ) = Pr ∑( vk + zk (τ )) (e ⊗ g)k < −∑(e ⊗ g)2k  . (3.20)
k k 
At reasonable SNRs, the EER can be approximated by considering the pairs (a, aˆ ) which
result in maximum Pr(aˆ a ,τ ) given in (3.20). In other words, we use the dominant error events
for computing error event rate. In the case of the partial response 1-D2, the single-bit error event
is the dominant event at the density 2.5. Hence, we get (e ⊗ g ) k = g k . Using this, (3.20)
becomes
Pr(aˆ | a ,τ )  Pr[vk + zk (τ ) < −2] (3.21)

where vk = vk − vk−2 and zk (τ ) = zk (τ ) − zk −2 (τ ) . Clearly, vk is Gaussian with zero mean and variance
σ v2 = 2σ v2 − 2r2v where r2v = E[vv
i i−2 ] .

To obtain the PDF of the ISI part z k (τ ) , we proceed as follows. Using bk=ak-ak-2, we can
write zk (τ ) = ∑ fk ,i (τ ) where fk ,i (τ ) = ci (τ )bk −i . Hence, the PDF of fk ,i (τ ) is given by
i

0.25, x ∈ {+2ci (τ ), −2ci (τ )}


 (3.22)
p fi ( x ) = 0.5 x=0
0 otherwise.

As we did before, we make the simplifying assumption that bk’s are independent3. Then,
following similar steps as in (3.12) to (3.14), we get the PDF of the residual ISI zk (τ ) as

1  
pz ( x τ ) = ∫ 
π 0 i
∏ cos 2 ( wci (τ )) cos( wx) dw  , −∞< x <∞ .

(3.23)

This follows from the fact that the Fourier transform of the PDF of fk ,i (τ ) is cos 2 (ci w) . Fig. 3-
7 shows the PDF of residual ISI zk (τ ) computed using (3.23) as well as bit-by-bit simulations.
Observe that the analytical results agree closely to those from simulations.

3
Because bk = ak − ak − 2 , the {bk} constitute a correlated sequence even if {ak} is a sequence of independent
data bits. However, for the sake of simplicity, we assume that bk’s are independent. We also note that (3.22) is based
on the assumption that the pair of data bits {ak, ak-2} assumes {-1, -1}, {-1,+1}, {+1, -1} or {+1, +1} with equal
probability 0.25.
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
61

Obtained using (3.23) Obtained using simulation


6
6
5 τ = 0%
5
τ = 5%
τ = 0% 4
4 τ =10%
pz (z / τ)

pz (z / τ)
τ = 5% τ =15%
3
3 τ =10%
2 τ =15% 2

1 1

0 0
-2 -1 0 1 2 -2 -1 0 1 2
ISI amplitude: z ISI amplitude: z

Fig. 3-7: PDF of residual ISI z k (τ ) obtained using analytical and simulation approaches.

Since vk and zk (τ ) are independent random variables, the PDF of vk + zk (τ ) can be
obtained by convolving their individual PDFs. For the 1-D2 recording system, the values of
Pr(aˆ a,τ ) given by (3.20) are identical for error events of the form {ek}= ±{+1}, ±{+1,0,+1},
±{+1,0,+1,0,+1}, …. Hence, following similar steps as for the derivation of (3.6), the EER for
the PR4-VD with τ can be approximated using the dominant error events (from (3.16), (3.17),
(3.21) and (3.23)) as
∞  2+ x
PrPR (err τ )  2∫ pz ( x | τ )Q  dx. (3.24)
−∞
 σ v 
The scale factor 2 on the right side of (3.24) is the sum of the probabilities of the data patterns
that support the single-bit error events [17].
Fig. 3-8 shows the EER computed using (3.24) as well as bit-by-bit simulations in the
presence of a timing phase error. Observe that the theoretical and simulation results match well.
Further, timing phase errors of 5% and 10% cause about 0.9 dB and 3.0 dB loss in SNR,
respectively, at 1e-5 EER.

10
-1 τ = 5% ; sim ulated
τ = 10% ; sim ulated
-2 τ = 15% ; sim ulated
10
Error event rate (EER)

-3
10
-4
10
-5
10
-6
10 τ = 15%
-7 τ = 10%
10 τ = 5%
-8 τ = 0%
10
22 24 26 28 30

SN R (dB )

Fig. 3-8: Error event rate of PR4-VD in the presence of timing phase error (solid curves: obtained
using calculation based on Eq. (3.23)).
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
62

Comparing Figs. 3-8 and 3-5, we see that the timing sensitivity of PR4-VD is higher than
that of MDFE at the density considered here. Separate studies based on bit error rate
performance rather than error event rate (not shown here) also led to the same conclusion as
above. However, if we specify the timing phase error in terms of user bit interval rather than
channel bit interval (i.e., multiply the timing phase errors considered here by 2/3 for MDFE and
16/17 for PR4-VD), then the timing sensitivities of MDFE and PR4-VD would be comparable.
The performance of PR4-VD for random jitter can also be evaluated in the same way as for the
MDFE detector in the preceding section. We skip this work here for brevity.

3.5 Conclusions
In this chapter, using the error event rate as a performance measure, we investigated the
timing sensitivity of MDFE and PR4-VD detectors. We developed analytical methods for
estimating the probability density function of the residual ISI in the presence of misequalization
and timing phase error. Using this, the error event rates of the detectors were evaluated
numerically. These numerical results have been shown to match well with those obtained from
bit-by-bit simulations. The principle of this approach can be easily extended to evaluate the
timing sensitivity of advanced versions of MDFE and PR4-VD detectors.
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
63

Appendix 3.A : PDF of ISI with Correlated Data


The analysis of the ISI PDF given in Section 3.3.2 assumed that the data bits {ak} are
independent. Clearly, this assumption is not true for (1,7) coded sequences. In this section, we
present a more accurate analysis by removing this independence assumption.
According to (3.9), we can write the MDFE inner-level ISI as
N
z k (τ ) = ∑
i=− M
ci (τ ) a k − i = c T (τ )a k (3.25)

where c(τ ) [c−M (τ ) " cN (τ )]T and ak  [ak + M " ak − N ]T . To obtain the PDF of zk(τ), the
probability of the (1,7) data vector a k needs to be computed. In (3.25), for a given τ, c(τ ) is
deterministic and a k is a stochastic (1,7) data vector at the time instant k. As a result, zk(τ) is a
discrete random variable with the probability equal to that of a k . Therefore, calculation of the
distribution of ISI boils down to computing the probability of all possible vectors a k in the
(1,7) data sequence. To do this, we resort to the method proposed by Howell [22] to compute
the probabilities of RLL 2/3 (1,7) data patterns. Then, the ISI distribution is given by
Pz ( z | τ ) = Pr( zk (τ ) = z | τ ) = ∑ Pr(ai ) (3.26)
i∈S z

where the set S z is such that cT (τ )ai = z for all i ∈ S z . One of the main advantages of this
approach is that we can compute the probabilities of zk (τ ) for any timing phase error τ by
using one-time calculated probabilities of data vectors a k .

Fig. 3-9 shows the PDF of ISI obtained using (3.26) for various values of τ . The length of
data vectors a k used in this computation is 20 bits. The computation results shown in Fig. 3-9
match well with the simulation results shown in Fig. 3-4.
8 4

6 τ=0% 3 τ=5%
Pz ( z/τ )

Pz ( z/τ )

4 2

2 1

0 0
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1
ISI amplitude: z ISI amplitude: z
4 4

3 τ=10% 3
τ=15%
Pz ( z/τ )
Pz ( z/τ )

2 2

1 1

0 0
-1 -0.5 0 0.5 1 -1 -0.5 0 0.5 1
ISI amplitude: z ISI amplitude: z

Fig. 3-9: PDF of inner-level ISI in the presence of timing phase errors (obtained using calculation
based on (3.26)).
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
64

References:

[1] J. W. M. Bergmans, Digital baseband transmission and recording. Boston: Kluwer


Academic Publishers, 1996.
[2] D. W. Tufts and T. Berger, “Optimum pulse amplitude modulation, Part II: Inclusion of
timing jitter,” IEEE Trans. Inform. Theory, vol. 13, no. 2, pp. 209-216, April 1967.
[3] R. A. Gonsalves and D. W. Tufts, “Data transmission through a random noisy channel by
PAM,” IEEE Trans. Commun. Tech., vol. 16, no. 3, pp. 375-379, June 1968.
[4] L-E. Eriksson and H. C. Van Den Elzen, “An equalizer structure with reduced sampling
time reference sensitivity,” IEEE Trans. Commun., vol. 25, pp. 1337-1343, Dec. 1976.
[5] S. H. Yeo and B. Farhang-Boroujeny, “An improved design of transmit digital and receive
analog filters to combat timing jitters,” in Proc. IEEE Intl. Conf. Global Telecommun.
(GLOBECOM), Phoenix, Arizona, Nov. 1997, pp. 1204-1208.
[6] J. W. M. Bergmans, “A method for designing robust linear partial response equalizers,”
Philips J. Res., vol. 42, no. 4, pp. 308-338, 1987.
[7] J. W. M. Bergmans and A. J. E. M. Janssen, “Robust data equalization, fractional tap
spacing and the Zak transform,” Philips J. Res., vol. 42, pp. 351-398, 1987.
[8] Z. Hang and M. Renfors, “A new symbol synchronizer with reduced timing jitter for QAM
systems,” in Proc. IEEE Intl. Conf. Global Telecommun. (GLOBECOM), Singapore, Nov.
1995, pp. 1292-1296.
[9] J. W. M. Bergmans, “Performance consequences of timing errors in digital magnetic
recording,” Philips J. Res., vol. 42, no. 3, pp. 281-307, 1987.
[10] H. Osawa, S. Tazaki, and S. Audo, “Performance analysis of partial response systems for
non return-to-zero recording,” IEEE Trans. Magn., vol. 22, no. 4, pp. 253-258, July 1986.
[11] A. Grami and S. Pasupathy, “Pulse shape, excess bandwidth, and timing error sensitivity in
PRS systems,” IEEE Trans. Commun., vol. 35, pp. 475-480, Aug. 1987.
[12] S. Roy and S. A. Raghavan, “Timing sensitivity of (MMSE) linear and DF equalization for
digital magnetic recording channels,” IEE Proc.-I, vol. 140, no. 3, pp. 169-175, June 1993.
[13] J. Moon, “Timing sensitivity in discrete-time equalization,” IEEE Trans. Magn., vol. 29,
no. 6, pp. 4027-4029, Nov. 1993.
[14] A. D. Weathers, “Sensitivity of PRML systems to timing offsets,” IEEE Trans. Magn.,
vol. 32, no. 5 , pp. 3971-3973, Sept. 1996.
[15] J. Kenney and R. W. Wood, “Multi-level decision feedback equalization: An efficient
realization of FDTS/DF,” IEEE Trans. Magn., vol. 31, no. 2, pp. 1115-1120, March 1995.
[16] K. C. Indukumar, S. Gopalaswamy, B. Liu, and Y. X. Lee, “Performance comparison of a
class of multi-level DFE and PRML detectors in the presence of channel non-linearities,”
IEEE Trans. Magn., vol. 35, no. 5, pp. 2283-2285, Sept. 1999.
[17] R. D. Cideciyan, F. Dolivo, R. Hermann, W. Hirt, and W. Schott, “A PRML system for
digital magnetic recording,” IEEE J. Sel. Areas Commun., vol. 10, no. 1, pp. 38-56, Jan.
1992.
[18] R. W. Wood, “Jitter vs. additive noise in magnetic recording: Effects on detection,” IEEE
Trans. Magn., vol. 23, no. 5, pp. 2683-2685, Sept. 1987.
[19] R. W. Wood, “Detection and capacity limits in media noise,” IEEE Trans. Magn., vol. 34,
no. 4, pp. 1848-1850, July 1998.
[20] G. Mathew, B. Farhang-Boroujeny, and C. Y. Ng, “Design of analog equalizer for partial
response detection in magnetic recording,” IEEE Trans. Magn., vol. 36, no. 4, pp. 2098-
2108, July 2000.
CHAPTER 3: Timing Sensitivity Analysis for Magnetic Recording
65

[21] E. A. Lee and D. G. Messerschmitt, Digital communication. 2nd ed., Kluwer Academic
Publishers, 1994.
[22] T. D. Howell, “Statistical properties of selected recording codes,” IBM J. Res. Dev., vol.
33, no. 1, pp. 60-73, Jan. 1989.
Chapter 4: Analysis of Timing Error Detectors
67

CHAPTER 4
ANALYSIS OF TIMING ERROR
DETECTORS

The timing error detector (TED) is considered to be the most important part of the timing
recovery system. The ability of the timing recovery loop to track the timing base of the input
signal depends on how much timing information the TED can extract from the noise-corrupted
input signal. The effectiveness of the TED can be characterized by the TED efficiency which is
a measure of the amount of extracted timing information. In this chapter, we present analyses
for examining the efficiencies of TEDs used in partial response (PR) and decision feedback
equalized magnetic recording systems. For the read channel with decision feedback
equalization (DFE), we analyze the noise performance at the TED output and provide an
improved TED for jitter minimization. We also study optimality issues for timing acquisition.
The chapter is organized as follows. A brief introduction to TED efficiency and a survey
of published work are given in Section 4.1, followed by a summary of the contributions of this
chapter. In Section 4.2, we examine the TED efficiency for PR systems using a given TED and
different PR targets. The TED efficiency for DFE read channels is examined in Section 4.3
using the multilevel DFE (MDFE) as an example. In Section 4.4, we propose a marginal
detection-based TED for reducing the phase jitter under low signal-to-noise ratio (SNR)
channel conditions, and analyze its output noise variance and the resulting timing jitter
performance. Section 4.5 deals with issues related to the optimality of MDFE timing
acquisition. The chapter is concluded in Section 4.6.

4.1 Introduction
When designing a timing recovery loop, we need to consider a fundamental trade-off
between tracking ability and noise rejection. Good suppression of noise requires the loop
bandwidth to be as small as possible, whereas a wider bandwidth is required for a better
tracking ability. This trade-off can be quantified using the measure ‘efficiency’. The efficiency
of a TED is a measure of the amount of timing information that the TED is able to extract from
the incoming signal per unit of time and SNR [15]. In Section 2.2.3 of Chapter 2, we have
given more details on the efficiency of timing error detectors. To recall, the TED efficiency is
defined as
1 K d2
γ = (4.1)
SNRm Su (1)
Chapter 4: Analysis of Timing Error Detectors
68

where SNRm is the SNR in the matched-filter bound sense, K d is the slope of the timing
function at the origin (also called the TED gain), and Su (1) is the power spectral density
Su (e j 2πΩ ) of the noise uk at the TED output (see Fig. 2.2) at DC ( Ω = 0 ) ( Ω being a
normalized frequency variable in units 1/T). Thus, whereas the efficiency is independent of the
loop filter and voltage-controlled oscillator, it is dependent on the power spectral densities of
the data and noise as well as the transfer function of the channel. This will become clearer in
the sections below.
Among the various types of TED (see Section 2.1), the zero-forcing (ZF) TEDs are
commonly used in practice because of their cost effectiveness and implementation simplicity
and relatively good performance. Bergmans and Lam [1] proposed, with analysis, a class of
data aided ZF based timing recovery techniques. In fact, the classical paper of Mueller and
Müller [2] reported the first ZF based baud-rate timing recovery scheme. In [15], Bergmans
proposed the concept of TED efficiency and presented analyses for evaluating the efficiencies
of maximum-likelihood (ML) and various baud-rate timing recovery schemes as a function of
the data and channel parameters. The TED algorithms given in [3] and [4] are typical examples
of ZF based timing recovery techniques used in PR systems. References [7] and [8] give typical
examples of timing recovery techniques used in DFE systems. The techniques in [7] and [8]
were not developed from a ZF point of view. In fact, the TED in [7] is of the minimum mean-
square error (MMSE) type. Further, the analysis of these algorithms has not been reported,
except by means of simulation results.
In this chapter, we analyze the TED algorithms used in PR and DFE read channels. We
derive the TED efficiencies of PR systems for different PR target polynomials and input data
coding schemes. We also derive the TED efficiency for the MDFE. In addition, we show that
the PR and MDFE TEDs can be reworked into a form that resembles an error-based ZF TED
[14]. We propose and analyze an improved TED for the MDFE detector that can reduce jitter
variance at low SNRs. Further, using the TED efficiency as a measure, we show that the MDFE
TED is near optimal during timing acquisition. We also examine the optimality of the different
preamble patterns for different recording densities and coding schemes.

4.2 TED Analysis for PR Recording Systems


The classical paper of Mueller and Müller [2] suggested that the TED output for baud-rate
timing recovery in digital synchronous receivers could be of the form χk = yk−1aˆk − yk aˆk−1 ,
where yk and aˆk are the equalizer output and the corresponding bit decision, respectively.
This concept was reapplied for timing recovery in PR magnetic recording systems [3] [4]. As
described in [3], the TED output for PR systems is given by
χ k = yk −1 xk − yk xk −1 (4.2)
where yk is the actual equalizer output which depends on a timing phase error, and xk is the
desired value of yk (See Fig. 3-1(b) in Chapter 3). In this section, we evaluate the efficiency of
this TED algorithm for PR magnetic recording systems with selected PR targets and input
coding schemes.
Chapter 4: Analysis of Timing Error Detectors
69

4.2.1 The TED analysis


We define the vector of PR target coefficients as p = [ p0 p1 " pN −1 ]T where N is the
number of target coefficients. Typical examples of PR targets used in magnetic recording are
listed in Table 4-1, where ‘ D ’ denotes one symbol delay [5] [6]. Here, p = [1 0 −1]T for PR-
IV, p = [1 1 −1 −1]T for EPR-IV, p = [1 2 0 − 2 −1]T for E2PR-IV, and p =[1 1 0 −1 −1]T for a
modified E2PR-IV.
Table 4-1: PR TARGETS FOR MAGNETIC RECORDING SYSTEMS.

PR scheme p(t) Target

1.5

PR-IV
0.5

0
P(D) = 1-D2
-0.5

-1

-1.5
-5 0 5 10

1.5

P(D) = 1+D-D2-D3
0.5
EPR-IV 0

-0.5

-1

-1.5
-5 0 5 10

1
2
E PR-IV 0
P(D) = 1+2D-2D3-D4
-1

-2

-3
-5 0 5 10

1.5

0.5
2
M-E PR-IV 0 P(D) = 1+D-D3-D4
-0.5

-1

-1.5
-5 0 5 10

For the analysis in this section for deriving the efficiency of the TED according to (4.2) for
PR systems, we will use Fig. 3-1(b) (see Chapter 3) which shows the magnetic recording
system model incorporating PR equalizer and detector. Let the continuous-time signal at the
equalizer output be given by
y(t ) = ∑ ai q(t − iT ) + v(t ) (4.3)
i

where ai is the input data at the rate 1/T, q (t ) is the bit response of the equalized PR channel,
and v(t ) is the noise at the equalizer output. For the sake of convenience, we assume in this
section that the equalized channel response q (t ) is equal to the ideal PR target response p (t )
given in Table 4-1. Further, the autocorrelation function of noise v(t ) is defined by the impulse
response of the equalizer, since the noise n(t ) at the equalizer input is assumed to be white. The
equalizer output y (t ) is sampled at the sampling instants tk = ( k + τ )T where τ denotes the
timing phase error normalized in units T, i.e., the ideal sampling instants correspond to
tk = kT . The resulting sampled output yk = y (tk ) of the equalizer is given by
Chapter 4: Analysis of Timing Error Detectors
70

yk = ∑ ai p((k + τ )T − iT ) + v((k + τ )T )
i
(4.4)
= ∑ ai pτk −i + vk
i
τ
where pk = p((k +τ )T ) and vk = v((k +τ )T ) . Denote the desired value of yk by xk, which is the
N −1
noise-free value of yk at the ideal sampling instants tk = kT , i.e. xk = ∑ pi ak −i where
i =0

pi = p Then, we can obtain the timing function, which is the expected value of χ k , as
0
i .

ρPR (τ ) = E[ χ k ] = E[ yk −1 xk − yk xk −1 ] .
For uncorrelated data ak, one can readily obtain
ρ PR (τ ) = p T (pτ−1 − p1τ ) (4.5)
τ τ τ τ T
where pi = [ pi pi +1 " pi + N −1 ] . Then, the TED gain Kd can be computed as
d ρ PR (τ )
Kd = = p T (p −1 − p 1 ) (4.6)
dτ τ =0
dp(t)
where p i = [ pi′ pi′+1 " pi′+ N −1 ]T and pi′ = T .
dt t=iT
The TED output χ k consists of contributions from the data ak and the noise vk . To
distinguish them, we calculate the variance of χ k as V(τ ) = E[χk2 ] − ( E[χk ])2 . This results in
V(τ ) = pT [(PIτ + PIIτ − 2PIIIτ ) − (pτ−1 − pτ1 ) ⋅ (pτ−1 − p1τ )T ]p + δυ (4.7)
N −1 N −1
where δ υ = 2[rv (0)∑ ( pi ) 2 − rv (1) ∑ pi pi −1 ] with rv (l ) = E [vk vk +l ] being the autocorrelation
i =0 i =1

function of vk for lag l . The matrices PIτ , PIIτ and PIII


τ
are given by
 p(1,1) p(1, 2) " p(1, N ) 
 p(2,1) p(2, 2) " p(2, N )  ,
PIτ = 
 # # " # 
 
 p( N ,1) p( N , 2) " p( N , N ) 
 p(−1, −1) p(−1,0) " p(−1, N − 2) 
 p(0, −1) p(0, 0) " p(0, N − 2)  ,
PIIτ = 
 # # " # 
 
 p( N − 2, −1) p( N − 2,0) " p( N − 2, N − 2)
and
 p (1,0) p (1,1) " p (1, N − 1) 
 p (2,0) p (2,1) " p (2, N − 1)  ,
PIIIτ = 
 # # " # 
 
 p ( N ,0) p ( N ,1) " p ( N , N − 1)

 2 piτ pτj , i ≠ j  piτ pτj −1 + piτ−1 pτj , i ≠ j


where and 
p(i, j ) =  ( pτ ) 2 , i = j p (i, j ) =  pτ pτ , i = j .

 i i
 ∑
i
i −1 i
Chapter 4: Analysis of Timing Error Detectors
71

Clearly, the expression (4.7) for V(τ ) shows the contributions from the data and noise to the
total TED output variance. The first term on the right side of (4.7) pertains to data-dependent
phase jitter, and vanishes at the ideal phase with τ = 0 if there is no misequalization. If
misequalization exists at τ = 0 then a static timing phase error could accrue as can be seen
from (4.5). The second term, δ υ , is due to noise and hence characterizes noise induced phase
jitter.
Using (4.5) and (4.7), we compute the mean and the variance of χ k for noiseless PR
channels and illustrate the results in Table 4-2.
Table 4-2: MEAN AND VARIANCE OF THE TED OUTPUT FOR PR MAGNETIC
RECORDING SYSTEMS.

T im ing fun ctio n V aria nce of T E D output


(K d is the T E D ga in )
P R schem e
x-axis: Tim in g p hase error (units T) x-axis: Tim ing ph ase error (un its T)
y-axis: M ean of the TE D ou tput y-axis: V ariance of the TE D outp ut

P R -IV :
P (D )= 1-D 2

K d = 5 .3

E P R -IV :
P (D )= 1+ D -D 2 -D 3
K d = 9 .1 7

E 2 P R -IV :
P (D )=
1 + 2D -2 D 3 -D 4
K d = 1 9 .6

M -E 2 P R -IV :
P (D )= 1 + D -D 3 -D 4

K d = 6 .6

In the computation of χ k , we assume that the desired reference sequence xk is without


errors. In practice, this sequence can be produced in various ways depending on the data and
Chapter 4: Analysis of Timing Error Detectors
72

the channel characteristics. For instance, xk takes on values {+2, 0, -2} for PR-IV and can be
constructed by using a simple 2-level threshold-detector1 [3]. But, xk takes on several values for
higher order PR channels (e.g., xk∈{-4, -2, 0, 2, 4} for EPR-IV). Hence, extending the threshold
method for constructing xk for such systems may not be effective since the detector would
require several threshold levels and the noise margin for correct detection has a decreasing
trend. In these situations, xk can be produced using the bit decisions âk according to
N −1
xk = ∑ pi aˆk −i . In the following TED efficiency analysis, we assume that decisions âk are
i =0
correct.
In the above analysis, we assume that the input data ak is uncorrelated. For correlated data
ak with autocorrelation function ra (l )  E [ak ak +l ] , the timing function based on (4.2) can be
shown to be
ρ PR (τ ) = ∑∑ piτ p j (ra ( j − i − 1) − ra ( j − i + 1)) (4.8)
i j

from which one can readily calculate the TED gain Kd as


K d = ∑∑ pi′ p j (ra ( j − i − 1) − ra ( j − i + 1)) . (4.9)
i j

Further, the noise component of the TED output can be obtained as


N −1 N −1
uk = vk −1 ∑ pi ak −i − vk ∑ pi ak −1−i . (4.10)
i =0 i =0

Then, it is not difficult to compute the noise autocorrelation function as


ru (l ) = E[uk uk +l ]
= 2rv (l )∑ rp (m)ra (m + l ) − rv (l + 1)∑ rp (m)ra (m + l − 1) − rv (l − 1)∑ rp (m)ra (m + l + 1)
m m m

(4.11)
N −1− m
where rp ( m ) = ∑
i =0
pi pi + m . Using this, we can calculate the DC portion of the power spectral

density of noise uk at the TED output as



Su (1) = ∑ r (l ) .
l =−∞
u (4.12)

Substituting (4.9) and (4.12) in (4.1), we get the efficiency of the TED of (4.2) as
[ ∑ ∑ pi′ p j ( ra ( j − i − 1) − ra ( j − i + 1))]2
1 i j (4.13)
γ PR = ∞
SNRm
∑ r (l )
l = −∞
u

where

∫ h (t )dt
2

SNRm = −∞
(4.14)
No / 2

1
The preamble pattern used during timing acquisition in PR-IV channels is the 4T pattern
{"+ + − − + + − −"} . With this, xk takes on only two values {-2, +2} and can be obtained using a
simple slicer (with zero-threshold) at the equalizer output.
Chapter 4: Analysis of Timing Error Detectors
73

with h (t ) and N o / 2 denoting the unequalized channel bit response and noise power spectral
density at the channel output, respectively.
To calculate the numerical values of the TED efficiency according to (4.13), we consider
the Lorentzian model of the magnetic recording channel (see eq. (2.15), Chapter 2) with PR
targets PR-IV, EPR-IV and E2PR-IV. Note that the absolute SNR should not matter because
efficiency is independent of SNR. The results for various user densities (see eq. (2.14), Chapter
2) are given in Fig. 4-1 and Fig. 4-2 for the coding schemes of the rate 16/17 (0,6/6) code and
the rate 2/3 (1,7) code, respectively.
6
PR-IV
5 EPR-IV
E2PR-IV
4
TED efficiency (dB)

-1

-2
2 2.5 3 3.5

User density

Fig. 4-1: TED efficiency for Lorentzian recording channel and rate 16/17 (0,6/6) code.

6
PR-IV
5 EPR-IV
E2PR-IV
4
TED efficiency (dB)

-1

-2
2 2.5 3 3.5

User density

Fig. 4-2: TED efficiency for Lorentzian recording channel and rate 2/3 (1,7) code.

Observe that the TED efficiency decreases as the user density increases because the
recording channel bandwidth decreases as user density increases. At high densities, the E2PR-
IV and EPR-IV targets afford better TED efficiency than PR-IV. This is because at high
densities the higher order PR schemes can better match the recording channel characteristics,
Chapter 4: Analysis of Timing Error Detectors
74

thereby extracting timing information more effectively with minimum noise enhancement or
coloration. Also observe that the use of the 2/3 (1,7) code results in better TED efficiency
compared to the 16/17 (0, 6/6) code. This is because the power spectrum of the 2/3 (1,7) code
has its power concentrated in the low frequency region whereas that of the 16/17 (0,6/6) code is
spread almost uniformly over the entire frequency range (see Fig. 2-11, Chapter 2). Therefore,
the 2/3 (1,7) code is better matched to the recording channel characteristics compared to the
16/17 (0,6/6) code. Combining this with the insight we got from the expression for the
efficiency of the ML TED (see eq. (2.19), Chapter 2), we see why the efficiency for the 2/3
(1,7) code is better than for the 16/17 (0,6/6) code, especially at high densities.

4.2.2 Error based ZF TED for PR systems


The analysis we presented above is for the TED according to (4.2), which uses the
equalizer output yk . We now show that this TED scheme can be realized alternatively in the
form of a symbol-rate ZF TED, which uses the error ek = yk − xk at the equalizer output rather
than yk . This results in an error based ZF TED for PR systems. One of the advantages of using
error based ZF TED is the simplicity of implementing the TED. Since both the equalizer output
yk and its reference xk have large magnitude ranges, calculation of the TED output according
to (4.2) yields an even larger range, which is not convenient for TED implementation. The use
of ek in the TED can greatly reduce the TED output range. Furthermore, by taking the sign of
ek we can even avoid the multiplier (a high-speed multiplier) for calculating the TED output.
These advantages make the error based ZF TED attractive in practice.
Note that the TED output according to (4.2) has two components, namely yk −1 xk and
yk xk −1 . Because the timing loop effectively averages the TED output, the relative delay
between yk −1 xk and yk xk −1 is immaterial to the TED performance. Therefore, advancing the
first component by one symbol interval, we can obtain an equivalent TED output
χ k = yk ( xk +1 − xk −1 ) . (4.15)
We rework this output into an equivalent error-based output by noting that ek = yk − xk .
Correspondingly,
χ k = ek ( xk +1 − xk −1 ) + ( xk +1 xk − xk xk −1 ) . (4.16)
Since the timing function is the expected value of χ k and the quantity E[xk+1xk − xk xk−1] is zero
irrespective of the input data, we can omit the corresponding component from (4.16). Thus, we
get an equivalent TED output
χ k = ek ( xk +1 − xk −1 ) . (4.17)
The TED according to (4.17) is in the form of a standard error-based ZF TED. Analysis shows
that the timing function ρ PR (τ ) and the TED gain K d derived for (4.2) in the preceding
analysis are also valid for (4.17). Further, analysis of the TED efficiency for (4.17) yields the
same result as given in (4.2). This means that the error based ZF TED does not degrade the
timing recovery performance in PR systems.
Chapter 4: Analysis of Timing Error Detectors
75

4.3 TED Analysis for DFE Recording Systems


4.3.1 The TED analysis
The TED algorithm analyzed in the previous section for PR systems can in principle be
used for any type of recording system including DFE. One such application is the simplified
TED suggested by Abbott and Cioffi [7] for an adaptive DFE magnetic recording system. For a
particular DFE read channel, however, depending on the specific features of the signals in the
forward path and at the input of the detector, the TED algorithm can be derived in various
forms for implementation simplicity. Kenney and Wood [8] reported a TED algorithm for
MDFE timing recovery, which makes use of the inner-level slicer input samples to form a
transition-based TED. The output of this TED is given by
χ k = ( yk + yk −1 )( aˆk − aˆk −1 ) (4.18)
where yk is the sample value of the slicer input at the time instant k and aˆk is the
corresponding decision (see Fig. 3-1(a), Chapter 3). This TED can be well understood with the
help of Fig. 4-3, which shows the samples of the waveform at the slicer input in the presence of
a timing phase error τ ( τ being normalized in units T ). At the ideal phase (i.e. τ = 0 ), the
positive and negative inner-level samples have equal magnitudes but opposite signs. This
results in zero at the TED output. For a positive (resp. negative) τ, the sample magnitudes at the
positive and negative transitions do not match each other thereby producing a positive (resp.
negative) TED output and subsequently yielding an estimate of timing phase error.
-1 -1 -1 +1 +1 +1 -1 -1 -1 aˆ k
… k-2 k-1 k k+1 k+2
+outer-
level τ >0
yk −1

+inner- +1
level The slicer input
waveform (k + τ )T
0
(k − 1 + τ )T yk
-inner- -1
level
dyk
-outer- dτ
level
Fig. 4-3: Illustration of the existing transition-based MDFE TED algorithm.

Referring to Fig. 3-1(a), we can write the slicer input in the presence of a timing phase
error τ as
Chapter 4: Analysis of Timing Error Detectors
76

N b +1
yk = ∑ a q (( k + τ )T − iT ) + v ( t
i
i k )− ∑ i=2
a k − i q (iT )
N b +1
= ∑ i
a i q τk − i + v k − ∑ i=2
qi ak −i
N2 N b +1
= ∑
i = − N1
q iτ a k − i + v k − ∑
i=2
qi a k −i

(4.19)
where tk = ( k + τ )T , vk = v (tk ) , qτk = q ((k + τ )T ) and qk = qk0 = q (kT ) . Here, we denote
by v(t ) the noise at the forward equalizer output, and q (t ) the equalized channel bit response
from the channel input to the forward equalizer output with its duration given by N1 + N 2 + 1
bits. We assume the past decisions to be correct and the number of taps in the feedback
equalizer to be N b with tap values qi , i = 2,3," , N b + 1 .
It is not difficult to analyze the transition-based TED of (4.18). Using (4.19) and (4.18), the
timing function can be obtained as
N2 N b +1
ρ (τ ) = E[ χ k ] = ∑
i =− N1
qiτ ( ra (i + 1) − ra (i − 1)) + ∑ q (r (i − 1) − r (i + 1))
i =2
i a a
(4.20)

where ra (l ) = E [ak ak +l ] . The TED gain Kd can be computed as


N2
Kd = ∑ q′(r (i + 1) − r (i − 1))
i =− N1
i a a (4.21)

dqiτ
where qi′ = τ =0 . The noise component at the TED output is given by

uk = (vk + vk −1 )(ak − ak −1 ) . (4.22)
The autocorrelation function of the TED output noise is given by
ru (l ) = E[uk uk +l ]
(4.23)
= (2rv (l ) + rv (l − 1) + rv (l + 1))(2ra (l ) − ra (l − 1) − ra (l + 1))
where rv (l ) = E[vi vi + l ] . Then, the PSD of the noise component at DC is given by

Su (1) = ∑ r (l )
l =−∞
u . (4.24)

Substituting (4.24) and (4.21) in (4.1), we obtain the TED efficiency of the transition-based
MDFE TED scheme of (4.18) as
L2
[ ∑ q i′ ( ra (i + 1) − ra ( i − 1)) ]2
1 i = − L1 (4.25)
γ M DFE = ∞
SNRm
∑ r (l )
l = −∞
u

where SNRm is defined as in (4.14). We compute the TED efficiency for the MDFE
incorporating the rate 2/3 (1,7) code at various user densities and 27 dB SNR, and plot the
result in Fig. 4-4. The result reveals that the TED efficiency varies inversely with the user
density. This is because the channel bandwidth, and thereby the timing information at the input
of the TED, decreases with increase in user density.
Chapter 4: Analysis of Timing Error Detectors
77

4
TED efficiency (dB)

-1

-2
2 2.5 3 3.5

User density

Fig. 4-4: TED efficiency for the MDFE transition-based TED.

4.3.2 Error based ZF TED for MDFE


As we did for PR systems in Section 4.2.1, we now derive an error-based ZF TED scheme
for the MDFE [9] for easy implementation. The error signal at the inner level is given by
ek = yk − aˆk , which represents the difference between the actual slicer input yk and its desired
value aˆk . Then, yk = ek + aˆk . As a result, (4.18) becomes
χ k = ek ( aˆk − aˆk −1 ) + ek −1 ( aˆk − aˆk −1 ) . (4.26)
Clearly, the TED output is non-zero only when there are transitions in the data, i.e., aˆk ≠ aˆk −1 .
Since the MDFE detector uses a (1,7) code, the transition condition aˆk ≠ aˆk −1 implies that
aˆk +1 ≠ aˆk −1 and aˆk ≠ aˆk − 2 . Using this, we rework (4.26) into
χ k = ek (aˆk +1 − aˆk −1 ) + ek −1 (aˆk − aˆk − 2 ) .
(4.27)
The two terms on the right side of (4.27) are delayed versions of each other whenever
aˆk ≠ aˆk −1 . Since the loop averages the TED output, both terms essentially convey the same
information, and we might drop one of them. Thus, we could simplify the TED according to
ek (aˆk +1 − aˆk −1 ) if aˆk ≠ aˆk −1 ,
χk =  (4.28)
0 if aˆk = aˆk −1 .
The condition aˆk ≠ aˆk −1 implies that aˆk +1 ≠ aˆk −1 , whence χ k will be nonzero. Conversely, if
aˆk = aˆk −1 , then aˆk +1 does not necessarily equal aˆk −1 . In this event, if aˆk +1 ≠ aˆk −1 , then the
cross-product ek (aˆk +1 − aˆk −1 ) could convey significant timing information, while (4.28) would
nevertheless stipulate the TED output to be zero. To also take this timing information into
account, we remove the conditioning from (4.28) to obtain the new TED output
2ek aˆk +1 if aˆk +1 ≠ aˆk −1 ,
χ k = ek (aˆk +1 − aˆk −1 ) =  (4.29)
0 otherwise .
Chapter 4: Analysis of Timing Error Detectors
78

This expression coincides with that of the error-based Zero-Forcing TED in [14]. Compared to
the MDFE TED according to (4.18), the new TED according to (4.29) takes more timing
information into account, and can be verified to have a higher efficiency.

4.4 Marginal Detection-Based TED


In Section 4.3, we analyzed the efficiency of the error-based ZF TED for the MDFE
detector. In this section, we develop a method for this TED that can reduce the variance of
phase jitter resulting from low-SNR conditions, and analyze the resulting performance.
For the ZF TED according to (4.29), the noise component uk at the TED output is
uk = vk (aˆk +1 − aˆk −1 ) . Its variance is
σ u2 = 2(ra (0) − ra (2)) ⋅ σ v2 . (4.30)
For the 2/3 (1,7) code, the correlations ra (0) and ra (2) are given by (estimated using data-
averaging) ra (0) = 1.0 and ra (2) = −0.2167 . Thus, we get
σ u2  2.43σ v2 . (4.31)
The above equation shows that the variance of the noise component of the TED output is
proportional to the variance of noise vk at the TED input. Therefore, reducing the variance of vk
leads to a decrease in the TED output noise variance. However, vk is superimposed upon a
useful signal component, and it is sometimes difficult to distinguish vk and signal in practice,
especially at poor SNRs. For the MDFE detector that uses a zero-threshold slicer, it is possible
to use an augmented TED, named marginal detection-based TED, for minimizing the variance
of the TED output noise. This TED distinguishes big noise components at the TED input and
subsequently removes their effect on the timing loop.
The proposed marginal detection-based TED has output
( y − aˆ )(aˆ − aˆ ) if | yk |> M ,
χ k =  k k k +1 k −1 (4.32)
 0 i f | y k |≤ M
where the slicer input yk is given in (4.19) and 0 < M < 1 is a marginal threshold for detecting
large noise components. Since aˆk = ±1 , the condition | yk |≤ M is equivalent to
1 − M ≤| vk |≤ 1 + M . (4.33)
This means that whenever the TED input noise vk meets the conditions 1− M ≤ vk ≤1+ M for
aˆk =−1 and −1− M ≤ vk ≤ −1+ M for aˆk = 1, the TED output χ k becomes zero. Both conditions
indicate that yk is around zero and thus unreliable for bit detection. Making χ k zero can
suppress large noise disturbances under these conditions.

4.4.1 Performance analysis


Now, we analyze the TED output noise variance in the TED according to (4.32). Assume
that the total noise vk at the slicer input has a Gaussian distribution with a probability density
 v2 
function (PDF) P(v ) = (1/ 2πσ v2 ) exp  − 2  where σ v2 is the variance of vk . Denote by vk
 2σ v 
Chapter 4: Analysis of Timing Error Detectors
79

the noise that induces disturbances at the TED output. Clearly, vk is the noise portion in vk
that does not meet conditions 1− M ≤ vk ≤1+M and −1− M ≤ vk ≤ −1+M . Thus, the variance of
vk can be obtained as

σ v2 = ∫−∞ v 2 P(v)dv (4.34)
v∉V

where V = {1 − M ≤ vk ≤ 1 + M or − 1 − M ≤ vk ≤ −1 + M } . To calculate σ v2 , we define


∞   1− M   1 + M 
S  ∫−∞ P(v)dv = 1 − 2 Q   −Q  (4.35)
v∈V   σv   σ v 
1 ∞

2
where Q ( x ) = e − u / 2du , and
2π x


SV  ∫- ∞ v 2 P(v) dv
v∈V

1 − M  (1 − M ) 2  1 + M  (1 + M ) 2   1− M   1 + M 
= 2σ v  exp  −  − exp −  +σvQ 2  −σvQ 2 
.
 2π  2σ v 
2
2π  2σ v 
2
 σv   σ v 
(4.36)
Then, the variance σ of vk can be obtained as
2
v

σ v2 = (σ v2 − SV ) / S . (4.37)
As a result, the proposed TED of (4.32) has a noise component at the output given by
uk = vk ⋅ (aˆk+1 −aˆk−1) , which has variance
σ u2  2.43σ v2 . (4.38)
Thus, relative to the TED given in (4.29), the TED of (4.32) has its output noise variance
reduced by a factor
Sσ v2
κ= . (4.39)
σ v2 − SV
One can verify that the factor κ is always greater than 1.0. It is a function of the marginal
threshold M. By way of illustration we calculate κ for different M and SNR, and show the
result in Fig. 4-5. From Fig. 4-5 we obtain the following observations. First, the suppression
factor κ depends strongly on the threshold M . The larger the threshold M is, the larger will
be the suppression factor κ , and vice versa. In practice, however, the choice of M is based on
a tradeoff. If M is too small, the reduction in the output noise variance will also be small. If
M is too large, then undesirable things occur: large amounts of noise as well as the desired
signal will be rejected by the TED at its output, and as a result, the TED output may become
zero for a long period of time, thereby leaving the timing loop without control. In considering
this trade-off, a compromise of M equal to 0.25 is selected. It is found by experiment that this
threshold can make the TED work satisfactorily for our purpose. Second, the marginal
detection-based TED lowers the noise variance at its output significantly at low SNRs but not
apparently at high SNRs. This is so because the probability of large noise for low SNR is
greater than that for high SNR. Whenever noise exceeds the threshold M, the marginal
detection-based TED effectively shields this noise from the timing loop. When SNR is high
enough such that noise is small, the threshold M does not have much influence on the TED
output.
Chapter 4: Analysis of Timing Error Detectors
80

1.5

1 M=0.20
κ M=0.25
(dB) M=0.30
0.5

0
23 24 25 26 27 28 29 30
SNR (dB)
Fig. 4-5: TED output noise variance suppression factor κ at user density 3.0.

We also examine the jitter variance, which is the variance of the noise component of the
timing phase error τ , specified as σ ψ2 and σ ψ2 for the conventional TED according to (4.29)
and the marginal detection-based TED according to (4.32), respectively [14, Chap. 11]. The
numerical results are shown in Table 4-3. Observe that the jitter variance of marginal detection-
based TED is significantly reduced at low SNR. The jitter suppression factor κ decreases with
increase in SNR. Phase jitter suppression at SNR above 27 dB is negligible.
Table 4-3: RMS VALUES OF THE TIMING LOOP JITTER AND SUPPRESSION GAIN κ (USER DENSITY 3.0).

SN R C o n ventio nal M arginal L o o p jitter


TED d etectio n T E D sup p ressio n
dB σ ψ (rm s: % ) σ ψ (rm s: % ) κ (d B )
23 2.5 17 4 2.21 23 1 .12 24
24 2.2 60 6 2.06 98 0 .76 60
25 2.0 33 7 1.92 64 0 .47 07
26 1.8 33 5 1.77 98 0 .25 83
27 1.6 57 3 1.63 37 0 .12 49
28 1.5 02 7 1.49 37 0 .05 23
29 1.3 67 6 1.36 46 0 .01 87
30 1.2 49 8 1.24 90 0 .00 57

To verify the above analysis results, we conducted 1000 simulations at 23 dB SNR and user
density 3.0. We examined the timing phase error distribution at the 800th bit instant, well within
the steady state of tracking mode of operation. The timing phase error distributions for the
conventional TED and the marginal detection-based TED are illustrated in Fig. 4-6. This result
is basically consistent with the numerical computations. A small discrepancy with those
computations may arise from several factors such as the limited number of simulations in
collecting the histogram of timing phase error, and residual ISI effects.
Chapter 4: Analysis of Timing Error Detectors
81

RMS jitter = 2.44% RMS jitter = 2.15%

Number of occurrences
Number of occurrences

150 150

100 100

50 50

0 0
-0.05 0 0.05 0.1 -0.05 0 0.05

Timing phase error Timing phase error

Fig. 4-6: Distribution histogram of timing phase error for the TEDs with (right) and without (left)
the marginal detection scheme.

4.5 Optimality of MDFE Acquisition Performance


Fast and accurate acquisition of clock timing is an essential prerequisite for reliable data
detection in magnetic recording. Therefore, accuracy and speed of acquisition are important
factors when assessing the acquisition performance. Accuracy and speed of acquisition depend
largely on the preamble pattern and TED algorithm used. In this section, we investigate the
optimality of preamble pattern for the acquisition process, and particularly consider the MDFE
magnetic recording system as an example, and study its TED optimality issues.
This section first considers the optimality of the preamble pattern2 and then analyzes the
MDFE TED performance during acquisition.

4.5.1 Optimality of preamble pattern for magnetic recording


The optimality of the preamble pattern can be assessed in terms of the timing SNR of the
replay signal at the recording channel output. The timing SNR is defined as the ratio of the
power of the time derivative of the replay signal to noise variance within the band [ −0.5, 0.5]
that is normalized with respect to 1/ T , at the sampling instants. As mentioned in Chapter 2,
the harmonic contents of the periodic preamble ak are assumed to be negligible at the output of
the channel. Therefore, for a periodic preamble pattern ak of period T0 = MT , the replay
signal is essentially sinusoidal at normalized frequency Ω0 = 1/ M .
Assuming unit power for the fundamental component of ak , the PSD of ak becomes
A( e j 2πΩ ) = 0.5 (δ ( Ω + Ω0 ) + δ ( Ω − Ω0 ) ) in the normalized band [-0.5, 0.5]. Normalizing
the variance of noise to unity within the band [-0.5, 0.5], we can compute the timing SNR of
the replay signal as
SNRt = (2πΩ 0 / T ) 2 | H (Ω 0 ) |2 (4.40)

2
The problem of signal selection for phase-locked loops so as to minimize the tracking error due to
additive noise was addressed by Stiffler in [10].
Chapter 4: Analysis of Timing Error Detectors
82

where H (Ω) is the frequency response of the channel. For the Lorentzian channel model at
user density Du, H (Ω) is
H (Ω) = jπ T ( Du / R )e− jπΩ sin(πΩ)e−π Du |Ω|/ R (4.41)
where R is the code rate of the channel encoder.
We consider the 4T, 6T and 8T preamble patterns, which result in the replay waveforms as
shown in Fig. 4-7. They are sinusoids at normalized frequencies 1/4, 1/6 and 1/8, respectively.
We compute the timing SNR of these signals for various user densities. The amplitudes of the
sinusoids are given by
−π Du
π Du π
AM = sin( )e MR
(4.42)
R M
where M = 4,6 and 8. The resulting timing SNRs are shown in Fig. 4-8 for the rate 2/3 (1,7)
coded Lorentzian channel. The (1,7) code is used for the MDFE detector. Observe that the 4T-
pattern is optimal for very low densities and the 8T-pattern for very high densities. The 6T
pattern is optimum for medium to high recording densities.

r(t) 4T pattern r(t) 6T pattern


(+ + −− ) (+ + + −−−)
A4 A6
T T T T

t t
-A4 -A6

r(t) 8T pattern
( ++++−−−−)
A8
T T
t

-A8

Fig. 4-7: Replay waveforms at the output of Lorentzian channel.

2 4T-pattern (+ + - -)
6T-pattern (+ + + - - -)
8T-pattern (+ + + + - - - -)
Timing SNR (dB)

-1

-2

-3

-4

-5

-6
1.8 2 2.2 2.4 2.6 2.8 3 3.2 3.4

User density: Du
Fig. 4-8: Timing SNR for the 4T, 6T and 8T preamble patterns, on a rate 2/3 (1,7) coded
Lorentzian channel.
Chapter 4: Analysis of Timing Error Detectors
83

Similar conclusions were reached in Section 2.3.1 (see Fig. 2-14, Chapter 2) by
investigating the efficiency of the ML TED for each of the above preamble patterns. We recall
that the TED efficiency is a measure of how much timing information a TED can extract, for a
given amount of channel SNR. Timing SNR, by comparison, is a measure of how much timing
information the incoming signal carries, normalized with respect to noise power. For a periodic
preamble as in the case at hand, timing SNR and efficiency are essentially equivalent.
For comparison purposes, we also evaluate the timing SNR for the Lorentzian channel
using the rate 8/9 (0, 4/4) and rate 16/17 (0, 6/6) codes [11] [12]. The results are shown in Fig.
4-9. Observe that the 4T pattern is an optimum preamble for low to medium user densities. For
medium to high user densities, the 6T pattern tends to be optimum. In conclusion, for recording
densities that are typical in magnetic recording, the 6T pattern is an optimum preamble for both
d=0 and d=1 recording systems.
4 4

3 3

2 2

1 1
Timing SNR (dB)

Timing SNR (dB)

0 0

-1 -1

-2 -2

-3 4T-pattern (+ + - -) -3 4T-pattern (+ + - -)
6T-pattern (+ + + - - -) 6T-pattern (+ + + - - -)
-4 8T-pattern (+ + + + - - - -) -4 8T-pattern (+ + + + - - - -)

-5 -5

-6 -6
1.8 2 2.2 2.4 2.6 2.8 3 3.2 3.4 1.8 2 2.2 2.4 2.6 2.8 3 3.2 3.4
User density
User density

8/9 (0, 4/4) code 16/17 (0, 6/6) code

Fig. 4-9: Timing SNR for the 4T, 6T and 8T patterns in a (0,k) coded Lorentzian channel.

4.5.2 Optimality of TED for MDFE acquisition


Apart from the type of preamble pattern, the timing acquisition performance depends
heavily on the type of TED scheme used. We will show in Chapter 5 that the decision-directed
TEDs discussed in the previous section (see (4.18), (4.29)) can also be used for timing
acquisition if appropriate precautions are taken to ensure correct detection of the preamble bits
[9] [13].
As derived in the preceding section, the error-based ZF TED for MDFE has output (see
(4.29))
χ k = ek ( aˆk +1 − aˆk −1 ) = yk − aˆk ( aˆk +1 − aˆk −1 ) . (4.43) ( )
The TED performance can be assessed by comparing its efficiency with that of the ML TED
[14] [15] since the TED efficiency is a measure of its ability to extract timing information while
rejecting noise. Assuming that the noise at the equalizer input is white and that the channel
frequency response H (Ω) has little energy outside the band [ −0.5, 0.5] , we can deduce the ML
TED efficiency for a preamble with frequency Ω0 as [14]
2
∫ H (Ω )
2
γ ML = (2πΩ 0 ) 2 H (Ω 0 ) dΩ (4.44)
Chapter 4: Analysis of Timing Error Detectors
84

where Ω0 = 1/ 6 for the 6T pattern preamble. The harmonics of the preamble at frequencies
±3Ω0 , ±5Ω 0 , " , are assumed to be eliminated by the channel attenuation.
We proceed as follows to derive the efficiency of the error based ZF TED according to
(4.43) for MDFE during timing acquisition using the 6T pattern preamble.
Denote the MDFE band-limited equalized bit response in the presence of timing phase
error τ (normalized in units T ) by qτk , the feedback equalizer taps by bk , and the Fourier
transforms of the ideal equalized bit response qk0 and the feedback equalizer taps bk by
Q(e j 2πΩ ) and B (e j 2πΩ ) , respectively. Then the Fourier transform of qτk is Q ( e j 2πΩ )e j 2πΩτ ,
assuming that q (t ) has no excess bandwidth, and the slicer input is yk = (a ⊗qτ )k − (a ⊗b)k
where ‘ ⊗’ denotes convolution. For the error based ZF TED given in (4.43), the timing
function is
ρ (τ ) = E[ χ k ]
= E[ yk (ak +1 − ak −1 )]
0.5 0.5
=∫ −2 jQ(e j 2πΩ ) A(e j 2πΩ )e j 2πΩτ sin(2πΩ)d Ω + ∫ 2 jB(e j 2πΩ ) A(e j 2πΩ )sin(2πΩ)d Ω .
−0.5 −0.5
(4.45)
j2πΩ * − j 2πΩ
Because q and bk are real-valued, we have Q(e
0
k ) = Q (e
) and B(e ) = B (e− j2πΩ) j 2πΩ *

where ‘*’ denotes complex conjugation. Using these facts as well as that ρ (τ ) is a real valued
function, we can rework (4.45) into
ρ (τ ) = − j sin(2πΩ 0 ){[Q(e j 2πΩ0 )e j 2πΩ0τ − Q (e − j 2πΩ0 )e − j 2πΩ0τ ] − [ B(e j 2πΩ0 ) − B (e − j 2πΩ0 )]}
= 2sin(2πΩ 0 ){Im[Q(e j 2πΩ0 )e j 2πΩ0τ ] − Im[ B (e j 2πΩ0 )]}
= 2sin(2πΩ 0 ){Re[Q (e j 2πΩ0 )]sin(2πΩ 0τ ) + Im[Q (e j 2πΩ0 )]cos(2πΩ 0τ ) − Im[ B (e j 2πΩ0 )]}
(4.46)
where Re[ x ] and Im[ x] denote the real and imaginary parts of a complex number x ,
d ρ (τ )
respectively. Since K d = , we obtain the TED gain as
dτ τ =0
K d = 4πΩ0 sin(2πΩ0 ) Re[Q(e j 2πΩ0 )] . (4.47)
We now consider the noise component uk = vk −1 (ak − ak − 2 ) of the TED output. Denote the
Fourier transform of the MDFE forward equalizer coefficients ck by C (e j 2πΩ ) and the PSD of
channel noise n(t ) by N 0 / 2 . Then, the PSD of uk at DC can be calculated as
N 0 0.5
2T ∫−0.5
Su (1) = | C (e j 2πΩ ) |2 A(e j 2πΩ )(2 − e − j 4πΩ − e j 4πΩ )d Ω

N 0.5
= 0 ∫ 4 | C (e j 2πΩ ) |2 A(e j 2πΩ ) sin 2 (2πΩ)d Ω
2T −0.5
2 N0
= sin 2 (2πΩ0 ) | C (e j 2πΩ0 ) |2 .
T
(4.48)
Chapter 4: Analysis of Timing Error Detectors
85

1 ∫ | H (Ω) | d Ω
2

Also note that the SNR in the matched-filter bound sense is SNRm = .
T N0 / 2
Therefore, by the definition given in (4.1), we obtain the MDFE TED efficiency of the error
based ZF TED according to (4.43) as
(2πΩ0 )2 T 2 | Re[Q(e j 2πΩ0 )] |2 . (4.49)
γ MDFE = 2
| C (e j 2πΩ0 ) |2 ∫ H (Ω) d Ω
Then, using (4.44), (4.49) and the fact that Q (e j 2πΩ ) = H (Ω )C (e j 2πΩ ) / T , we can gauge the
TED efficiency loss of the MDFE error based ZF TED relative to the ML TED by computing
the ratio
2
γ ML
2
H (Ω0 ) | C (e j 2πΩ ) |2 Q(e j 2πΩ )
0 0

L = = . (4.50)
γ MDFE T 2 | Re[Q(e j 2πΩ )] |2 | Re[Q(e j 2πΩ )] |2
0 0

This is a very simple equation indicating that the efficiency loss is fully determined by the
j 2πΩ0
equalized bit response. Reducing the difference between | Q(e ) | and | Re[Q(e j 2πΩ0 )] |
j 2πΩ j 2πΩ
results in a decrease in efficiency loss. The ideal case is | Q(e 0
) |=| Re[Q(e 0 )] | which
corresponds to a symmetric equalized bit response. Table 4-4 lists this loss over a range of user
densities based on the 6T preamble for the Lorentzian channel. A loss factor of 2 means that the
timing-jitter variance at TED output will be twice as large for the same channel and timing loop
parameters. Observe that the loss of the MDFE TED is small even at very high densities. This
suggests that the MDFE TED using the algorithm (4.43) is near optimum for timing
acquisition.
Table 4-4: TED EFFICIENCY LOSS OF THE MDFE TED RELATIVE TO THE ML TED.
U ser
d e n s it y T E D L o ss T E D L o ss (d B )
2 .5 0 1 .0 3 2 1 0 .1 3 7 2
2 .7 5 1 .0 4 7 0 0 .1 6 7 3

3 .0 0 1 .0 6 2 0 0 .2 6 1 3

3 .2 5 1 .0 8 2 6 0 .3 4 4 9
3 .3 0 1 .0 9 3 0 0 .3 8 6 3
3 .5 0 1 .1 1 7 5 0 .4 8 2 4

4.6 Conclusions
In this chapter, we analyzed the efficiencies of the TEDs used in PR and MDFE magnetic
recording systems. In addition to examining the existing TEDs, we developed error based ZF
TED schemes for these systems. The error based TEDs are not only simple to implement but
also exhibit equivalent or better efficiencies compared to the existing non-error based TEDs. In
addition, the MDFE was considered as an example to study the marginal detection-based TED.
The result shows that the marginal detection-based TED algorithm can reduce jitter variance.
The reduction in jitter variance is significant at low SNRs. Further, using the timing SNR as a
measure, we examined the problem of optimal choice of preamble pattern for timing
Chapter 4: Analysis of Timing Error Detectors
86

acquisition for a given recording density. Finally, using the TED efficiency as a measure, we
showed that the MDFE TED with 6T pattern preamble is near optimum for timing acquisition
over a range of recording densities of interest.
Chapter 4: Analysis of Timing Error Detectors
87

References:

[1] J. W. M. Bergmans and H. W. Lam, “A class of data-aided timing recovery schemes,”


IEEE Trans. Commun., vol. 43, no. 2/3/4, pp. 1819-1827, Febr./March/April 1995.
[2] K. H. Mueller and M. Müller, “Timing recovery in digital synchronous data receivers,”
IEEE Trans. Commun., vol. 14, no. 5, pp. 516-531, May 1976.
[3] R. D. Cideciyan, F. Dolivo, R. Hermann, W. Hirt, and W. Schott, “A PRML system for
digital magnetic recording,” IEEE J. Select. Areas Commun., vol. 10, no. 1, pp. 38-56, Jan.
1992.
[4] J. -Y. Lin and C. -H. Wei, “Fast timing recovery scheme for class IV partial response
channels,” Electronics Letters, vol. 31, no. 3, pp. 159-161, Febr. 1995.
[5] H. K. Thapar and A. M. Patel, “A class of partial response systems for increasing storage
density in magnetic recording,” IEEE Trans. Magn., vol. 23, no. 5, pp. 3666-3668, Sept.
1987.
[6] H. Osawa, M. Okada, K. Wakamiya, and Y. Okamoto, “Performance improvement of
PRML system for (1,7) RLL code,” IEICE Trans. Electron., vol. E79-C, no. 10, pp. 1455-
1461, Oct. 1996.
[7] W. L. Abbott and J. M. Cioffi, “ Timing recovery for adaptive decision feedback
equalization of the magnetic storage channel,” in Proc. IEEE Intl. Conf. Global
Telecommun. (GLOBECOM), San Diego, CA, Dec. 1990, pp. 1794-1799.
[8] J. G. Kenney and R. W. Wood, “ Multi-level decision feedback equalization: An efficient
realization of FDTS/DF,” IEEE Trans. on Magn., vol. 31, no. 2, pp. 1115-1120, March
1995.
[9] Y. X. Lee, L. K. Ong, J. J. Wang, and R. W. Wood, “Timing acquisition for DFE
detection,” IEEE Trans. Magn., vol. 33, no. 5, pp. 2761-2763, Sept. 1997.
[10] J. J. Stiffler, “On the selection of signals for phase-locked loops,” IEEE Trans. Commun.
Tech., vol. 16, no. 2, pp. 239-244, April 1968.
[11] J. Eggenberger and A.M. Patel, “Method and apparatus for implementing optimum PRML
codes,” U.S. Pat. 4707681, issued Nov. 17, 1987.
[12] B. H. Marcus, A. M. Patel, and P. H. Siegel, “Method and apparatus for implementing a
PRML code,” U.S. Pat. 4786890, issued Nov. 22, 1988.
[13] J. J. Wang, J. W. M. Bergmans, Y. X. Lee, and G. Mathew, “DFE timing acquisition:
Analysis and a new approach for fast acquisition,” IEEE Trans. Magn., vol. 36, no. 5, pp.
2193-2196, Sept. 2000.
[14] J. W. M. Bergmans, Digital baseband transmission and recording. chaps. 9, 10, Boston:
Kluwer Academic Publishers, 1996.
[15] J. W. M. Bergmans, “Efficiency of data-aided timing recovery techniques,” IEEE Trans.
Inform. Theory, vol. 41, no. 5, pp. 1397-1408, Sept. 1995.
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
89

CHAPTER 5
FAST TIMING ACQUISITION FOR
DECISION FEEDBACK
EQUALIZATION RECEIVERS IN
MAGNETIC RECORDING

5.1 Introduction
In hard disk drives, data is written in concentric tracks that are subdivided into sectors. The
first 10 to 15 bytes of each sector are put aside for a periodic preamble pattern, which serves,
among other purposes, for timing acquisition [1] [2]. Timing acquisition techniques aim to
acquire the sampling phase so as to enable the bit detection process to commence. This task
should be accomplished reliably in a limited amount of time. In particular, false lock and hang
up [9] are to be avoided. Fast timing acquisition techniques were proposed for partial response
magnetic recording systems in [10] [12] [11]. In developing these techniques, DC offset and
gain were assumed ideal. A non-decision-directed acquisition scheme was reported in [3] [4]
for decision feedback equalization (DFE) receivers. Even though this scheme is not affected by
erroneous feedback decisions, it is rather sensitive to channel noise and can induce considerable
jitter. Moreover, the effects of DC offsets and gain errors were not considered in [3] [4].
Ideally, the same timing error detector (TED) can be used for both acquisition and tracking
modes. Resulting advantages include simplicity and effective suppression of channel noise. In
practice, the timing acquisition process has to deal with DC offsets and gain errors in the
channel, as well as with timing phase errors. In this situation, false lock and hang up problems
may become severe. In DFE receivers, wrong decisions in the feedback equalizer due to
possibly large initial timing, gain and DC errors can easily cause false lock and hang up. It is
necessary to develop techniques for avoiding these problems even in worst-case initial
situations.
This chapter is devoted to the development of fast timing-acquisition techniques for DFE
receivers in magnetic recording. In particular, we consider the multi-level DFE (MDFE)
receiver [5] [6]. Section 5.2 provides a novel timing acquisition scheme, which can prevent the
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
90

false lock problem [7]. Section 5.3 proposes another new scheme in a more general form to
ensure an effective, rapid and reliable acquisition process [8]. In Sections 5.2 and 5.3, a number
of simulations on the MDFE receiver are presented. The conclusions are drawn in Section 5.4.

5.2 Fast Acquisition for DFE with Modified Equalization


The false lock problem arises if the TED has zero output at non-ideal phases. If these
spurious zeros are avoided or eliminated, then false lock can be prevented. In DFE receivers,
erroneous TED outputs can stem from wrong decisions in the feedback loop. To solve this
problem, we propose an acquisition scheme that uses a modified forward equalizer to ensure
that the feedback register is loaded with the correct data pattern.

5.2.1 Inherent false lock in the MDFE TED


As shown in Chapter 4, the 6T pattern ( +++−−−) is an optimal preamble for a d=1
recording channel from medium to high densities. The MDFE receiver is designed for
delivering a good performance on d=1 recording channels. Recall from Chapter 4 that the
transition-based TED used for timing acquisition in MDFE has output χk = ( yk − aˆk )(aˆk +1 − aˆk −1)
where yk is the slicer input sample and aˆ k is the corresponding decision that is produced by
the slicer. To illustrate the features of the transition-based TED, the output is calculated while
timing phase error is gradually ramped up from -T to +T and then down again with the timing
loop open (see Fig. 5-1).
1.5

1 Increasing
Averaged TED output
TIMING ERROR SIGNAL

0.5

-0.5

-1

Decreasing
-1.5

-2
-1 -0.5 0 0.5 1
Timing phase error: τ
TIMING OFFSET

Fig. 5-1: Averaged TED output when the normalized timing phase error is changed dynamically
from -1 to 1 and from 1 to –1.

Since the TED output is non-zero only when there is a transition in the input data bits, it is
averaged through a six-bit sliding window for continuous display. It is clear that there is
hysteresis in the process. The hysteresis is useful in that it may eliminate the ‘hang-up’
problem. However, observe that the timing loop may falsely lock to the phases with errors near
±0.5T. On closer examination, we found that the preamble pattern was incorrectly detected as
‘ + + − − − − ’ or ‘ + + + + − − ’ in the false lock regions. In the general case, where there may
also be large initial DC offsets and gain inaccuracies, the system can be even more susceptible
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
91

to false lock. To illustrate the false lock effect, we do following. Setting the initial timing phase
error to 0.45T and SNR to 27 dB at user density 3.0, we made 10 simulation runs with the
timing loop closed. The phase error convergence during the first 300 channel bits is shown in
Fig. 5-2. As expected, the curves are all falsely locked to wrong phases with errors around 0.5T.
Timing phase error: τ

0. 5

0
0 100 200 300

Time ( channel bits

Fig. 5-2: Phase convergence curves at Du = 3.0 and SNR = 27 dB with the MDFE TED (10
simulation runs).

5.2.2 Modified acquisition scheme


We now propose an acquisition scheme for eliminating the false lock problem associated
with the MDFE TED [7]. The key idea is to prevent erroneous data patterns in the feedback
register, thereby preventing erroneous TED outputs. The scheme, which contains a modified
forward equalizer and several switches, is illustrated in Fig. 5-3. It is worth noting that this
scheme is also applicable for other DFE-type receivers.
to Loop Filter

forward equalizer coefficients detection of timing, gain and timing


DC-offset errors gain
start-up normal DC offset

bit decisions
B

readback feedback register


yk
signal forward
equalizer
C
A

feedback
equalizer

Fig. 5-3: Modified forward equalizer based MDFE timing acquisition scheme.

The acquisition procedure involves the following four steps:


Step 1: In this step, the feedback shift register gets loaded with the correct 6T ‘ + + + −−− ’
pattern that is close in phase to the one that should ideally be detected. This is done with no
feedback equalizer applied. To acquire the correct 6T pattern with no feedback equalizer, the
forward equalizer must be modified (i.e., Switch B is set to ‘Start-up’) to provide the correct
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
92

phase for the preamble waveform. This may be done by cascading the equalizer with a (a+bD)
filter. Suitable coefficients are a = 0.9091 and b = -0.0949 at the user density 2.5. During this
step, Switch A remains open and Switch C is connected to the slicer output.
Step 2: This step reconnects the feedback equalizer and resets the forward equalizer to
normal, after the feedback register contents are detected to be the correct 6T pattern. As soon as
three consecutive bits with the same sign are detected at the slicer output, Switch A is closed,
Switch B is set to ‘normal’, and Switch C is set to the inverter output. As a result, the feedback
register is disconnected from the slicer output and its contents recirculate. This ensures the
register contents to be the desired ‘ + + + −−− ’ pattern.
Step 3: When five user bytes after Step 2 have elapsed, the frequency loop is enabled, i.e.
the timing loop switches from first-order to second-order.
Step 4: Eleven user bytes after Step 2, Switch C is reconnected to the slicer output so that
the recirculation stops and real decisions flow through the shift register. At the same time, the
timing, gain, and DC loops are switched to tracking mode (i.e. loop gains are reduced by a
factor of 4) and the detected bits are checked for the presence of the synchronization word
(sync byte).
1. 5

1
Averaged TED output

0. 5

- 0. 5

-1

- 1. 5
-1 -0 . 5 0 0. 5 1

Timing phase error: τ

Fig. 5-4: Timing function of MDFE using the 4-step acquisition scheme.

0. 5
Timing phase error: τ

100 200 300


Time (channel bits)

Fig. 5-5: Phase convergence curves at Du = 3.0 and SNR = 27 dB (with the 4-step acquisition
scheme).
Using this procedure we repeat the simulation of Fig. 5-1. The result is shown in Fig. 5-4.
Observe that there is no difference between ‘increasing’ and ‘decreasing’ any more, and that
the false-lock spots around ±0.5T have been eliminated. To illustrate this, we made the
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
93

following simulations. We introduced a normalized timing phase error of 0.45 and a normalized
frequency offset of 0.001 during the initial part of the acquisition process at the 10th channel bit.
The initial gain and DC offset are set to 1.0 and 0, respectively. The channel SNR is 27 dB and
the user density is 3.0. Timing, gain and DC loops are closed and work simultaneously. The
simulation result is shown in Fig. 5-5. As expected, false lock no longer occurs.

5.2.3. Further simulations


To thoroughly study the proposed acquisition scheme, Monte-Carlo simulations were
conducted at user density 2.5. Timing, gain and DC loops are closed and work simultaneously.
Initial conditions are listed in Table 5-1.
Table 5-1: INITIAL CONDITIONS OF MONTE-CARLO SIMULATIONS FOR MDFE TIMING
ACQUISITION.

Parameters Initial conditions

1) Frequency error: uniform distribution from -0.1% to 0.1%

2) Phase error: uniform distribution from -0.5 to 0.5 bits

3) Gain error: log uniform distribution from 0.5 to 2

4) DC offset: uniform distribution from -0.5 to 0.5

5) Detector SNR: 14 dB (defined as the ratio of the inner eye


magnitude to the RMS noise at the input
of the slicer).

The termination condition of the simulations is that sync bytes are correctly recognized.
Acquisition failure is said to occur when the sync bytes are not detected at the end of the
acquisition process. No acquisition failures were observed in 20,000 acquisition trials. The
average value of the inner eye-level error is found to be 0.2, its standard deviation is 0.004, and
the maximum value is 0.29. Figs. 5-6, 5-7 and 5-8 show one set of convergence curves for the
VCO frequency, VGA gain, and DC offset, respectively. The convergence is false-lock-free
with good convergence speed.
1.58
VCO frequency / 100 (MHz)

1.56

1.54

1.52

1.5

1.48

1.46

1.44

1.42
0 100 200 300 400 500

Time (channel bits)

Fig. 5-6: Convergence of VCO frequency (data rate in the channel is 150 Mb/s).
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
94

1.1

0.9

VGA gain
0.8

0.7

0.6

0.5

0.4
0 100 200 300 400 500

Time (channel bits)

Fig. 5-7: Convergence of VGA gain (ideal value of VGA gain is 1.0).

0
DC offset

-0.2

-0.4
0 100 200 300 400 500

Time (channel bits)

Fig. 5-8: Convergence of DC offset (ideal value of DC offset is 0).

5.3 Modified Threshold-based Timing Acquisition


Scheme for DFE
Hang up during acquisition may not be easily noticed because it is an event of low
probability and the sampling phase can still converge to the steady state eventually [9].
However, hang up is unacceptable in magnetic recording, where rapid phase convergence is
required within a limited period of time. Hang up and false lock are two of the most serious
problems of timing acquisition. This is especially true for DFE acquisition, where wrong
decisions in the feedback register may propagate and degrade acquisition behavior. In this
section, we propose a new and general scheme for DFE timing acquisition for eliminating false
lock and effectively lowering the probability of hang up, and at the same time, tolerating large
gain and DC errors [8]. The main idea of the scheme is to generate a modified threshold for the
slicer to ensure the correct pattern in the feedback register for avoiding false lock, and to
introduce hysteresis for preventing hang up.
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
95

5.3.1 Existing variable threshold approach for acquisition


The idea of using variable threshold detection was earlier reported for PRML receivers in
which a 4T-pattern preamble is employed for timing acquisition [10] [11] [12]. To achieve
rapid acquisition and prevent hang up, a variable threshold ck = λ ⋅ aˆk − 2 (λ being a scale-
factor)1, is used in [10] to produce hysteresis in the timing function. However, this scheme
cannot be used directly for DFE acquisition because erroneous feedback decisions due to initial
errors in phase, gain and DC can lead to false lock. To illustrate this, we simulated the MDFE
timing function using the 6T (‘ + + + − − − ’) preamble in the absence of noise. The error based
ZF TED has output χ k = ek −1 ( aˆk − aˆk − 2 ) . Decisions aˆk are made using a slicer with variable
threshold ck = λ ⋅ aˆk −3 . We investigated the impact of channel DC offset (DC), VGA gain (g)
and the threshold scale-factor (λ) on the MDFE timing function. Ideal values of these
parameters are DC = 0 , g = 1.0 and λ=1.0. At the start of acquisition process, however, DC
and g may exhibit errors. We estimated the timing function (the averaged TED output) using
the 6T pattern, for all possible initial contents in the feedback register and for normalized
timing phase errors ranging from –2 to 2. The result is a set of overlapped timing functions,
shown in Fig. 5-9. Fig. 5-9(a) depicts the timing function for the ideal condition. Clearly, no
false lock phases exist since the timing function is zero only at integer values of τ . Also, a
considerable hysteresis range is observed with its edges close to the desired phase. Such a
hysteresis range gives the timing loop excellent immunity to hang-up. However, the timing
function changes due to errors in g, DC and λ. For instance, increase and decrease of λ can
result in the timing functions of Figs. 5-9(b) and (c), which exhibit false lock spots. Similar
false lock spots are also seen in Figs. 5-9(d) to (f) in the presence of DC offset and gain errors.
To demonstrate the resulting false lock, we provide a simulation example. Fig. 5-10
illustrates the closed-loop phase convergence in the presence of a fixed DC offset 0.5. Observe
that the loop may lock to the incorrect phases ±0.5 even though the modified threshold ck=âk-3
is used. Closer examination reveals that during false lock the decisions of the MDFE detector
differ from the 6T pattern “ + + + − − − ”. Further studies show that for certain initial values of
DC and timing phase errors, false lock can happen even with a closed DC loop. Thus, we may
conclude that the threshold ck=âk-3 fails to prevent false lock in the presence of significant DC
errors.
We also made similar investigations as above on the effect of a fixed gain error at the slicer
input. We observed several false lock phases when the gain is set to 0.5. Note that studying the
effect of gain at the slicer input is analogous to gain studies in non-feedback systems such as
partial response channels. The simulations show that false lock tends to exist for conditions
g<1.0, g>2.6, λ>1.0, λ<0.6 and |DC|>0.35. Hence, only for a small range of values of g, DC,
and λ will this scheme work reliably.

1
One can check that, for the 4T preamble, the slicer using the variable threshold ck = λ ⋅ aˆk − 2 is most
likely to make 4T pattern decisions. For the 6T preamble, same reasoning applies for the use of the
threshold ck = λ ⋅ aˆk −3 to make 6T pattern decisions.
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
96

1.5 1
1
1
0.5
0.5
0.5
0
0 0

-0.5 -0.5
-0.5

-1
-1 -1
-2 -1 0 1 2 -2 -1 0 1 2 -2 -1 0 1 2

(a) g=1.0; DC=0.0; λ=1.0 (b) g=1.0; DC =0.0; λ=1.5 ( c) g=1.0; DC =0.0; λ=0.5

1 1 1
Averaged TED output

0.5 0.5 0.5

0 0 0

-0.5 -0.5
-0.5

-1 -1
-3 -2 -1 0 1 2 3 -1
-2 -1 0 1 2 -3 -2 -1 0 1 2 3

(d) g=1.0; DC =0.4; λ=1.0 (e) g=0.5; DC =0.0; λ=1.0 (f) g=1; DC =-0.5; λ=1.0

Timing phase error: τ

Fig. 5-9: Timing functions based on the variable threshold ck=λâk-3 for MDFE acquisition scheme
using the 6T (‘ + + + − − − ’) preamble (Horizontal axis: normalized timing phase error, τ; Vertical
axis: averaged TED output).

1 .2
Timing phase error: τ

0 .5

- 0 .2
0 100 200 300

Time (channel bit )


Fig. 5-10: Closed-loop phase convergence in the presence of a fixed DC offset 0.5 with modified
threshold ck=âk-3 in a noiseless channel.

5.3.2. Modified threshold scheme


We now proceed to develop an improved modified threshold scheme. Fig. 5-11 shows the
structure of the proposed modified threshold based acquisition scheme for DFE-type receivers.
During acquisition, the signal from the forward equalizer is a sinusoidal waveform. It is
sampled at instants tk = ( k + τ )T which are controlled by the timing recovery system. The
resulting discrete-time sequence xk normally has rate 1/T and may suffer from a timing phase
error τ (normalized in units T ). Subtraction of the feedback equalizer output yields a sequence
yk , which in the usual case is applied to the slicer for making decisions. In the proposed
acquisition scheme, a special threshold sequence ck is subtracted from yk to produce a
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
97

sequence y k that is passed to the slicer. The sequence ck is the output of an additional FIR
filter, called modified-threshold (MT) FIR filter, which uses the past decisions as its input. Note
that the feedback equalizer and the threshold FIR filter could be merged into a single filter
without affecting the detection performance. However, we do not do this in order to be able to
use the same TED during acquisition and tracking. To this end the TED should operate on the
sequence yk , which must hence be produced explicitly.
feedback
tk = ( k + τ )T equalizer
r(t) xk yk y k aˆk
forward slicer
equalizer τ
ck
threshold
FIR filter
λ1 λ2
VCO LF TED

Fig. 5-11: Modified threshold timing-acquisition scheme for DFE detector.

For a preamble in the form of a 6T pattern ( + + + − − − ), ck is proposed to be


6
ck = λ1 (aˆk −3 − aˆk −5 ) + λ2 ∑ aˆk −i (5.1)
i =1
where typical values of the real-valued scale-factors λ1 and λ2 are 1.0 and 1/6, respectively. The
corresponding filter is depicted in Fig. 5-12. Below, we explain the reasons for constructing the
threshold according to (5.1).
First, for the 6T preamble, the decision pattern should ideally obey aˆk = −aˆk −3 , which is a
basic feature of the correct 6T pattern ( + + + − − − ). Any initial pattern in the feedback register
should be converted into a pattern meeting this rule with the help of the MT sequence ck.
Secondly, the decision pattern should have zero DC, which is a second feature of the correct 6T
pattern. The false lock patterns with DC, such as ‘ + + + + − − ’ and ‘ + + − − − − ’, should be
suppressed by applying the MT sequence. Similarly, the decision pattern should have no
content at the Nyquist frequency 1/(2T). Notice that the 2T pattern “ + − + − ” also obeys
aˆk = −aˆk −3 . We must make sure that the MT FIR filter does not encourage the 2T pattern to
appear in the feedback register. Finally, after loading a correct 6T pattern ( + + + − − − ) into the
feedback register, the slicer with the MT sequence should be able to shift the correct pattern
from one phase to another. This provides the slicer with the ability not only to make the correct
decision pattern but also to acquire the correct phase.
aˆ k
T T T T T T

λ2
λ1
ck
Fig. 5-12: Structure of modified threshold filter.
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
98

The first term of (5.1) promotes the slicer output sequence to obey aˆk = −aˆk −3 . At the
same time, it causes the frequency response of the MT FIR filter to have a null at 1/2T so as to
suppress 2T patterns. The threshold scalar λ1 is chosen to be 1.0 so that the modified threshold
ck has the form “ " − 2 − 2 0 + 2 + 2 0 − 2 − 2 0 + 2 + 2 0 " ” for a valid 6T pattern of any
phase. Using this sequence, the slicer will most probably make a decision aˆk to be +1 (resp. –
1) when ck is –2 (resp. +2), and to be +1 or –1 when ck is zero. This yields a hysteresis effect
and equips the slicer with the ability to shift a 6T pattern from the wrong phase to the correct
phase. Thus, the sequence of (5.1) can help to convert any wrong pattern in the feedback
register into the 6T pattern with the correct phase. The second term of (5.1) promotes the slicer
output to be DC free over any block of 6 bits. It is meant to monitor and suppress the DC
component in the decision sequence. We make the choice of the threshold λ2 as 1/6 because we
prefer the MT FIR filter to be as short as possible when estimating the DC value. A short length
FIR filter is simple and efficient, and thus desirable for cost effective solutions. Note that for
detecting the DC value of the 6T pattern stream, the valid window should cover at least one
period of the stream. This determines that the minimum length of the window is 6T. The scalar
λ2= 1/6 is meant to obtain an unbiased DC level estimate.
By way of illustration, we show in Fig. 5-13 the role of the MT sequence ck in making the
slicer determine the correct 6T pattern. Assume that the sequence aˆk , which is a valid 6T
pattern, represents the slicer output. The threshold sequence ck is calculated by applying (5.1)
to the aˆk sequence. Observe that ck is zero for every third bit instant, thereby letting the slicer
make decisions based on yk . For the next two bits of every 3-bit interval, the threshold
sequence ck is +2 or –2, thereby strongly influencing the decisions of the slicer. This
introduces the possibility for the slicer output sequence to shift from one phase of the 6T
pattern to another. To see this, consider the time instant n (i.e. k = n) at which cn = 0 . Let the
slicer output at this instant be '+ ' instead of '− ' . Using similar arguments, we can see the
slicer decisions at the instants n + 1 and n + 2 to be '+ ' with the corresponding threshold
ck = −1.67 for k = n + 1 and n + 2 . This causes the threshold ck at the instant n + 3 to be
+2.33, thereby resulting in a most likely decision of aˆn′ +3 = −1 . Following this, we see that the
decision sequence aˆk′ from the instant k = n onwards is a one-bit shifted version of the
sequence aˆk . This mechanism helps the slicer to traverse the different possible patterns to
arrive at the correct pattern.

k … n-3 n-2 n-1 n n+1 n+2 n+3 n+4 n+5 n+6 n+7 n+8 n+9 n+10 …
aˆk … + − − − + + + − − − + + + − − − …
ck … 0 +2 +2 0 -2 -2 0 +2 +2 0 -2 -2 0 +2 +2 …
ˆk
a′ … − − − + + + − − − + + + − − − …
ck … 0 -1.67 -1.67 +2.33 +2 0 -2 -2 0 +2 +2 0 …

Fig. 5-13: Illustration of the role of MT sequence ck to shift the decision sequence from one phase of
the 6T pattern to another.
To examine the effect of the MT filter, we consider the MDFE detector with arbitrary
initialization of the feedback register, and monitor the pattern in the feedback register for the 6T
preamble input and a fixed timing phase error. The user density is 2.5, and noise is absent. The
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
99

feedback equalizer has 9 taps. All possible initial feedback register contents (512 patterns) are
considered, with timing phase error ranging from –6T to 6T. Each pattern is identified using a
tag number. The status of the first 25 tagged patterns is shown in Fig. 5-14.
25
Status of patterns the in

20
feedback register

15

10

5 6T-pattern
(+++---)
0
-6 -4 -2 0 2 4 -6
Normalized phase offset: τ

Fig. 5-14: Results indicating which of the first 25 tagged patterns are presented in the feedback
register at the time of the 25th bit instant from the start of acquisition.

The numbers 0 to 5 represent the correct 6T pattern with the six different phases. The
numbers 6 to 25 represent the pattern “ + + + + − − ” with the six different phases, the pattern
“ + + − − − − ” with the six different phases, the patterns “ + − ” and “ − + ”, the pattern
“ + + + + + + ”, and the pattern “ − − − − − − ”. We specifically show these 25 patterns because
they are most likely to appear in the feedback register during acquisition with the 6T preamble.
In the figure, the horizontal axis shows the timing phase error τ (normalized in units T ) at the
sampler, and the vertical axis shows the status of each pattern. Here, the k th trace represents
the case where the feedback register is initialized with the k th pattern. A ‘high-level’ on the
k th trace for a particular timing phase error means that the k th pattern is the content of the
feedback register for that specific timing phase error from the 25th instant onwards, whereas a
‘low-level’ status means that the k th pattern does not appear. Thus, the conclusion from Fig. 5-
14 is that even though any random pattern can be the initial content of the feedback register,
only the 6T patterns ( + + + − − − ) will survive after 25 bits. This indicates that by using the
MT FIR scheme, we need not worry about initializing the feedback register with particular
patterns.
It is worth noting that the approach developed here for a 6T preamble is easily generalized
to other preamble periods.

5.3.3. Effect of parameter variations


To examine the performance of the MT scheme in the presence of gain, DC and threshold
variations, we repeat the simulation of Fig. 5-9 for the scheme at hand. The simulation result is
shown in Fig. 5-15.
Unlike Fig. 5-9, where hysteresis is introduced at the expense of causing false lock, the
timing functions in Fig. 5-15 display no false lock while preserving a considerable hysteresis
range. Simulation studies show that this threshold scheme can tolerate gain factors from 0.5 to
2.0 and DC offsets from –0.5 to 0.5 with no false lock while retaining adequate hysteresis. The
two threshold scale-factors can vary by ±3 dB with respect to their norminal values.
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
100

1 1 4

0.5 0.5
2

0
0
0
-0.5
-0.5
-2
-1
-1
-1.5 -4
-3 -2 -1 0 1 2 3 -3 -2 -1 0 1 2 3 -3 -2 -1 0 1 2 3
(a) all parameters are ideal (b) g=0.5; other parameters are ideal (c) g=2.0; other parameters are ideal

0.5 1 1

0.5 0.5
0

0 0
Averaged TED output

-0.5
-0.5 -0.5

-1 -1 -1

-3 -2 -1 0 1 2 3 -3 -2 -1 0 1 2 3
-3 -2 -1 0 1 2 3

(d) DC=0.5; other parameters are ideal (e) λ1=0.7; other parameters are ideal (f) λ2=1/12; other parameters are ideal

Ideal values of parameters: g=1.0, DC=0.0, λ1=1.0 and λ2=1/6.0

Timing phase error: τ

Fig. 5-15: Timing functions based on the MT scheme ck=λ1(âk-3 -âk-3) +λ2∑i=1→6âk-i . (Horizontal
axis: timing phase error, τ; Vertical axis: averaged TED output).
By way of illustration, we repeated the simulation of Fig. 5-10, and show the result in Fig.
5-16. Observe that there are no false-lock phases. The oscillatory nature observed in the phase
convergence curves of Fig. 5-10 and Fig. 5-16 is due to the presence of the DC offset. The
curves would have been smooth if the DC offset was zero or if the DC loop was closed.

1
Timing phase error : τ

0 .4
0 100 200 300

Time (channel bit )

Fig. 5-16: Closed-loop phase convergence in the presence of a fixed DC offset 0.5 with the MT
scheme ck=λ1(âk-3 -âk-3) +λ2∑i=1→6âk-i for a noiseless channel.
To thoroughly examine the performance of the scheme, we conducted more extensive
simulations using a second-order timing loop and first-order VGA and DC loops. The timing-
loop has natural frequency ω nT = 0.035 and damping factor ς = 0.7 . The VGA and DC
parameters have time constants of 25 channel bits each. Fig. 5-17 illustrates a typical
acquisition simulation result for MDFE at SNR 27 dB and user density 2.5. The normalized
initial timing phase error and frequency offset are set to -0.5 and 0.1%, respectively. The initial
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
101

gain and DC errors are both set to 0.5, where gain error is referred to as the ratio of the total
gain and the ideal gain, and DC error as the DC offset with respect to zero. The horizontal axis
represents time in number of channel bits. The total length of preamble pattern considered for
acquisition is 192 channel bits. At the time instant 180, which approximately marks the
acquisition end, all critical parameters have properly converged.
0.1 1.1 1
Normalized timing

0 0.8
1
phase error: τ

-0.1

VGA Gain

DC offset
0.9 0.6
-0.2
0.8 0.4
-0.3
0.7 0.2
-0.4
-0.5 0.6 0

-0.6 0.5 -0.2


0 200 400 600 800 1000 0 200 400 600 800 1000 0 200 400 600 800 1000

Time (channel bits) Time (channel bits) Time (channel bits)

-3
x 10 3
3
Frequency Offset

Samples at the slicer

2
2
1
1
input

0
0
-1
-1
-2

-2
0 200 400 600 800 1000 -3
0 50 100 150 200 250 300

Time (channel bits) Time (channel bits)

Fig. 5-17: Convergence of parameters with closed timing, gain and DC loops (horizontal axis: time
in channel bits).
To gauge the robustness of the scheme, we made Monte Carlo simulations under the
conditions of high user density 3.0 and low SNR 23 dB. Initially, the timing phase error is 0.5T,
the frequency offset is 0.1%, and the gain and DC are 0.5 and 0.35, respectively. The overlaid
phase convergence curves for 5000 simulations with different noise sequences are shown in
Fig. 5-18. Analysis of the timing phase error distribution at the bit-instant 180 reveals that the
phase bias is close to zero and phase-jitter is less than 1.8% RMS, thereby showing that the
proposed acquisition scheme is very reliable.

Acquisition Tracking
(6T pattern) (random (1,7) code)
1.2
Normalized timing phase error: τ

0.8

0.6

0.4

0.2

-0.2
0 100 200 300 400 500 600 700 800 900 1000
Channel Bits
Time (channel bits)

Fig. 5-18: Phase convergence curves for 5000 acquisition trials at user density 3.0 and 23 dB SNR.
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
102

5.4 Summary
In this chapter, we developed two new and effective solutions to fast timing acquisition for
DFE receivers in magnetic recording systems. One solution uses special equalizer coefficients
and a control procedure with switches. This technique has been shown to be effective for
solving the false lock problem. On the other hand, the second solution, which uses a modified
threshold scheme, prevents false lock and hang up in a more general manner. Simulations have
shown that this scheme can tolerate large initial errors in gain, DC offset and threshold scalars.
CHAPTER 5: Fast Timing Acquisition for Decision Feedback Equalization Receivers
in Magnetic Recording
103

References:

[1] H. N. Bertram, Theory of magnetic recording. New York: Cambridge University Press,
1994.
[2] J. W. M. Bergmans, Digital baseband transmission and recording. Boston: Kluwer
Academic Publishers, 1995.
[3] W. L. Abbott and J. M. Cioffi, “Timing recovery for adaptive decision feedback
equalization for the magnetic storage channel,” in Proc. Intl. Conf. Global Telecommun.
(GLOBECOM), San Diego, CA, Dec. 1990, pp. 1794-1799.
[4] K. D. Fisher, J. M. Cioffi, W. L. Abbott, P. S. Bednarz, and C. M. Melas, “An adaptive
RAM-DFE for storage channels,” IEEE Trans. Commun., vol. 39, no. 11, pp. 1559-1568,
Nov. 1991.
[5] J. G . Kenney, L. R. Carley, and R. W. Wood, “Multi-level decision feedback equalization
for saturation recording, ” IEEE Tran. Magn., vol. 29, no. 3, pp. 2160-2171, July 1993.
[6] Y. X. Lee, G. Mathew, Q. C. Sun, J. J. Wang, H. Mutoh, J. Hong, and R. W. Wood,
“Design, implementation and performance evaluation of an MDFE read channel,” IEEE
Trans. Magn., vol. 34, no. 1, pp. 166-171, Jan. 1998.
[7] Y. X. Lee, L. K. Ong, J. J. Wang, and R. W. Wood, “Timing acquisition for DFE
detection,” IEEE Trans. Magn., vol. 33, no. 5, pp. 2761-2763, Sept. 1997.
[8] J. J. Wang, J. W. M. Bergmans, Y. X. Lee, and G. Mathew, “DFE timing acquisition:
Analysis and a new approach for fast acquisition,” IEEE Trans. Magn., vol. 36, no. 5, pp.
2193-2196, Sept. 2000.
[9] F. M. Gardner, “Hang up in phase-lock loops,” IEEE Trans. Commun., vol. 25, no. 10, pp.
1210-1214, Oct. 1977.
[10] R. D. Cideciyan, F. Dolivo, R. Hermann, W. Hirt, and W. Schott, “A PRML system for
digital magnetic recording,” IEEE J. Select. Areas Commun., vol. 10, no. 1, pp. 38-56,
Jan. 1992.
[11] J. Y. Lin and C. H. Wei, “Fast timing recovery scheme for class IV partial response
channels,” Electronic Letters, vol. 31, no. 3, pp. 159-161, Febr. 1995.
[12] F. Dolivo, W. Schott, and G. Ungerböck, “Fast timing recovery for partial-response
signaling systems,” in Proc. IEEE Intl. Conf. Commun. (ICC), Boston, USA, June 1989,
pp. 573-577.
CHAPTER 6
IMPLEMENTATION OF MULTI-
LEVEL DECISION FEEDBACK
EQUALIZATION TIMING
RECOVERY SYSTEM

Multi-level decision feedback equalization (MDFE) [1] is an elegant simplification of


fixed-delay tree search with decision feedback (FDTS/DF) [2] [3], which is an alternative to the
Viterbi detector for d=1 coded channels and has a similar structure to DFE. A 100 Mb/s
experimental MDFE read channel was designed and prototyped using discrete-components [4]
[7]. In the prototype, the analog forward equalizer consists of two bi-quads that are designed to
maximize the detection signal-to-noise ratio (SNR). The feedback equalizer consists of a 6-tap
FIR filter and an exponential decay circuit to reduce hardware complexity. The digitally
implemented timing/gain/DC loops are updated only when there are transitions at the slicer
input. The MDFE prototype project was targeted at evaluating and testing the MDFE read
channel on both bench and spinstand [4] [5] [7]. The implementation work involved
investigation of several aspects such as robust equalization strategy, reliable acquisition, byte-
synchronization, error propagation, implementation of timing/gain recovery, and error rate
evaluation on bench and spinstand.
In this chapter, we describe the prototype, illustrate the circuit realization of main
functional modules, present the design and realization of timing recovery circuitry, and report
on the evaluation of the prototype on bench and spinstand. The main emphasis of this chapter
will be on the timing recovery system, which is the contribution of the author. The author
developed the MDFE timing recovery circuitry board; he implemented and verified several
critical circuit blocks including the digital TED generating circuits, charge-pump and loop filter
circuits, lock-to-preamble circuits, and sync byte detection circuits. These circuit blocks
constitute the MDFE timing recovery circuitry board, working at 150 MHz/s along with other
parts of the MDFE prototype. Nevertheless, for the sake of completeness, we will also briefly
discuss the modules related to the MDFE forward equalizer and critical loop, which were
implemented by the author's partners in the MDFE project.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
106

6.1 Introduction to MDFE Read Channel Prototype


The decision to make the prototype using discrete components, instead of a full-scale
integrated circuit approach, was primarily motivated by the intention to demonstrate the main
features of MDFE. The MDFE has been claimed to be a simple and low-cost solution for data
detection in digital recording systems. Further, the short time frame of one year set for the
demonstration of the MDFE principle in hardware was another reason to resort to an
implementation using readily available discrete components.
The overall structure of the MDFE prototype is depicted in Fig. 6-1. There are several
functional modules, viz. the forward equalizer, the critical loop with analog to digital converter
(A/D) and feedback equalizer, timing/gain/DC extraction, timing/gain/DC offset distribution
units and loop filters, voltage controlled oscillator (VCO) and crystal oscillator, sequencing
generator, and analog and digital interfaces. The analog readback signal from the pre-amplifier
of a disk drive head is applied to the MDFE prototype, whose output is a detected bit stream
taking values +1 and –1. The user interface module equips the prototype with functions of
loading and programming the equalizers, data sequences and timing/gain recovery units.

crystal oscillator VCO loop


gain/DC & clock distribution filters

from data
VGA & DC offset timing/gain/DC
pre-amp. out
critical loop:
forward equalizer A/D, feedback equalizer

user interface analog & digital


(PC & software) interfaces sequencing
control registers

Fig. 6-1: Structure of the MDFE prototype.

An analytic procedure is developed for the design of a simple continuous-time forward


equalizer (tested with different channel models at several densities) [6] [7]. Essential
implementation constraints are included in the design at the cost of a minor performance
degradation [7]. The first feedback tap is forced to zero to eliminate tight feedback loop. The
length of the feedback equalizer and the magnitude of its taps are constrained in order to reduce
error propagation. The forward equalizer is a 3-zero and 4-pole filter and the feedback equalizer
has 6 taps with an exponential tail.
Fig. 6-2 shows the implementation of the forward path from the preamplifier output to the
summing node of the critical loop. The input signal from the preamplifier is approximately
0.3~0.6 volts peak-to-peak (Vpp) differential with 50 ohms termination. The differential input is
transformed and amplified into a current signal via the wide-band amplifier CLC220. The main
part of the forward path is a 3-zero 4-pole analog forward equalizer and is implemented using a
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
107

pair of bi-quad filters. The first section of the bi-quads is a 1-zero 2-pole filter and has low-pass
characteristics. The second section is a 2-zero 2-pole filter and has band-pass characteristics.
The forward equalizer output is a low-impedance (50-ohm) differential signal, which should be
amplified to 2.0 Vpp for subsequent A/D conversion. To accomplish this, the differential signals
are applied to the low-distortion, high-precision and wide-band amplifier AD9618, which is an
ideal choice for driving and buffering flash A/Ds.

gain: 4.5 to 2

0.3-0.6v 0.9-1.8v 0.9-1.8v 2.0


Vpp Vpp Vpp 1k Vpp
AD9618
TP1 _
CLC220 + x
50 +
pre-amp. 1 zero 2 zeros + 49
x3
AD9618
critical loop
output _ 2 poles 2 poles summing node
50 49 +
_ x
_ 49
noise 1k 2.0
Vpp
forward equalizer gain:4.5 to 2

Fig. 6-2: Implementation of the forward path from the preamplifier output to the critical loop
input, consisting of a 4-pole/3-zero forward equalizer.

50 50
current sum m ing bus
“feedback off”
switch
(used as 2
CA3127 diodes)
array

a k-4 -a k-4 a k-5 -a k-5


56 56 56 56

150 150 150 150

-5.2v -5.2v -5.2v -5.2v


I E : m ax 10 m A
m in 2m A

feedback feedback
coefficient 4 1k coefficient 5 1k

-10v -10v

Fig. 6-3: Implementation of the feedback equalizer using current steering switches.

Fig. 6-3 shows the implementation of the feedback equalizer. The feedback equalizer is
effectively a 10-tap filter, and is implemented by a 6-tap filter in tandem with an exponential
decay circuit so as to reduce the implementation complexity. To ease the tight time constraint
on the feedback loop, the forward equalizer is designed to force the first feedback tap to zero
[7] [8]. The implementation circuit of the feedback equalizer employs a set of high frequency
NPN transistor arrays, CA3127, as shown in Fig. 6-3, where each feedback filter tap is realized
using a current steering switch. A CA3127 consists of five isolated general-purpose silicon
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
108

NPN transistors. These transistors are used in pairs as current steering switches (max. 10 mA.).
The 1st feedback tap is zero, by design. The 2nd tap can be set to be either positive or negative.
The 3rd to 7th taps can only be negative. The magnitudes of these taps are limited to half the
minimum eye level at the slicer input. The remaining feedback taps are lumped into an
exponential tail, which is implemented using a one-pole RC (resistor-capacitor) network [7].
The principle behind the implementation of the feedback equalizer as described above
resembles the working mechanism of a set of dams. Feedback taps resemble the dam sizes,
which are controlled by decision bits for damming up (via transistor array CA3127) the current
on the current summing bus.

50
differential AD9630
input +
+1 50

50 49 CLC220 a k-2 Current steering: 6 x CA3127


0 AD9002 Feedback equalizer
output of the + D8
forward path Ain
_

50 AD9630 V-ref D4
- 6.4 eye 10H141 10H141
+1 ek-1 10H141 10H141
_
49

Fig. 6-4: Implementation of the critical loop.

The critical loop, which consists of the summing node, comparator (A/D) and decision
feedback loop, is implemented as shown in Fig. 6-4. The input, which comes from the output of
the forward path according to Fig. 6-2, is approximately 2.0 Vpp differential with 50-ohm
terminations. The AD9630 is a 750 MHz unity-gain buffer capable of driving 50 mA at 3.5V.
The outputs of the forward and feedback equalizers are summed on a low-impedance (50 ohms)
differential bus for driving the comparator AD9002, which is an 8-bit high-speed A/D
converter. Time constants at the bus are approximately 0.5 ns. At the input of the A/D, the
MDFE eye levels are at approximately +/- 0.5V and +/- 1.0V (measured differentially). These
are called the inner and outer eye levels, respectively. The six most significant bits of the A/D
are used to digitize the input. The A/D and the feedback shift registers MC10H141 get the same
clock, which is controlled by the VCO of the timing recovery board and is running at 150 MHz
channel rate. The timing diagram of the MDFE critical loop is depicted in Fig. 6-5, along with
the measured implementation delays of various signals, from the A/D via shift registers to the
summing bus.
In debugging the MDFE boards, the forward and feedback equalizers are optimized
separately. The forward equalizer is first adjusted to match the desired equalized bit response
(observed at the forward equalizer output). Then the feedback equalizer is fine-tuned to
optimize the system error rate. Fig. 6-6 shows the eye-pattern measured at user density 2.0 and
100 Mb/s user data rate. Observe that there are four distinct levels at the desired sampling
instant shown by the marked block in the figure. System performance is mainly determined by
the inner levels. The bumps and fuzzy traces in between the sampling instants are due to the
settling of the feedback equalizer at the summing bus.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
109

clock period = 6.7 ns

master clock
3.5ns
1ns
A/D
k k+2 clock
3.7ns
+2ns (OPA) 1 ns set-up
A/D output
a(k-2)

3ns delay and set-up 1ns hold


shift register
clock
2ns
shift register
a(k-3)
a(k-2) output
1.5ns
a(k-2)
summing-bus

Fig. 6-5: MDFE critical-loop timing diagram.

Fig. 6-6: Eye pattern measured at the analog summing bus showing the four levels at ideal
sampling instants (the block marks the desired sampling instant).
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
110

5 b its p re c is io n d e c is io n o f d e te c to r
s te p - s iz e = 2 /3 2 = 6 2 .5 m v s ig n o f e rro rs
d e c is io n A D 9002
m a g n itu d e o f e rro rs
b o u n d a rie s o u tp u t le v e ls
-0 .0 3 1 2 5
-0 .0 9 3 7 5 -0 .0 6 2 5 1 1 1 1 1
-0 .1 5 6 2 5 -0 .1 2 5 0 1 1 1 1 0
-0 .2 1 8 7 5 -0 .1 8 7 5 1 1 1 0 1
-0 .2 8 1 2 5 -0 .2 5 0 0 1 1 1 0 0
-0 .3 4 3 7 5 -0 .3 1 2 5 1 1 0 1 1
-0 .4 0 6 2 5 -0 .3 7 5 0 1 1 0 1 0
-0 .4 6 8 7 5 -0 .4 3 7 5 1 1 0 0 1
-0 .5 3 1 2 5 -0 .5 0 1 1 0 0 0 + in n e r-le v e l
-0 .5 9 3 7 5 -0 .5 6 25 1 0 1 1 1
-0 .6 5 6 2 5 -0 .6 2 50 1 0 1 1 0
-0 .7 1 8 7 5 -0 .6 8 75 1 0 1 0 1
-0 .7 8 1 2 5 -0 .7 5 1 0 1 0 0
-0 .8 4 3 7 5 -0 .8 1 25 1 0 0 1 1
-0 .9 0 6 2 5 -0 .8 7 5 1 0 0 1 0
-0 .9 6 8 7 5 -0 .9 3 75 1 0 0 0 1
-1 .0 3 1 2 5 -1 .0 1 0 0 0 0 re fe re n c e -le v e l
-1 .0 9 3 7 5 -1 .0 6 25 0 1 1 1 1
-1 .1 5 6 2 5 -1 .1 2 5 0 1 1 1 0
-1 .2 1 8 7 5 -1 .1 8 75 0 1 1 0 1
-1 .2 8 1 2 5 -1 .2 5 0 1 1 0 0
-1 .3 4 3 7 5 -1 .3 1 25 0 1 0 1 1
-1 .4 0 6 2 5 -1 .3 7 5 0 1 0 1 0
-1 .4 6 8 7 5 -1 .4 3 75 0 1 0 0 1
-1 .5 3 1 2 5 -1 .5 0 0 1 0 0 0 - in n e r-le v e l
-1 .5 9 3 7 5 -1 .5 6 25 0 0 1 1 1
-1 .6 5 6 2 5 -1 .6 2 5 0 0 1 1 0
-1 .7 1 8 7 5 -1 .6 8 75 0 0 1 0 1
-1 .7 8 1 2 5 -1 .7 5 0 0 1 0 0
-1 .8 4 3 7 5 -1 .8 1 25 0 0 0 1 1
-1 .9 0 6 2 5 -1 .8 7 5 0 0 0 1 0
-1 .9 6 8 7 5 -1 .9 3 75 0 0 0 0 1
-2 .0 3 1 2 5 -2 .0 0 0 0 0 0

Fig. 6-7: Mapping of input to 5-bit output of the 8-bit AD9002 A/D converter and error pattern
assignment.

For a 2.0 Vpp A/D input, both bit-error-rate (BER) simulations and experiments showed that
a 5-bit A/D would suffice for our purpose, and no significant improvement can be obtained by
using more bits. For this reason, the analog summing node output is digitized to 5-bit precision
using the 5 most significant bits (MSBs) of the 8-bit AD9002. From these 5 bits, the most
significant bit is used for the bit decision and the remaining 4 bits are used for generating the
error signal for the timing recovery system. To illustrate this, we show in Fig. 6-7 the mapping
of the A/D input to output for user density 2.5. The 2.0 Vpp input is mapped into 32 levels with
a step-size 62.5 mV according to the 33 decision boundaries. Experiments show that the BER
performance is sensitive to the A/D reference bias relative to the middle value of the input.
Therefore, while debugging the circuits of this portion, the actual reference level is fine-tuned
to have the best BER.

6.2 Implementation of MDFE Timing Recovery System


6.2.1 Data format used for MDFE timing recovery
The acquisition and tracking modes of timing recovery are associated with the use of a
specific data format. The recommended data format for MDFE timing recovery is illustrated in
Fig. 6-8. This format serves for timing acquisition, sync byte detection and clock tracking
purposes.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
111

READ ENABLE
STATE 1: decision feedback off; special forward equalizer used
RECOGNISE & LOCK TO PREAMBLE
STATE 2: decision feedback on; normal forward equalizer;
recirculate preamble within feedback register

START FREQ. ACQUISITION


STATE 3: as STATE 2; freq. loop enabled

servo START TRACKING


wedge
STATE 4: feedback register uses real decisions;
step sizes reduced by four;
search for sync. byte

1 5 11 sync

6T pattern preamble, 13 bytes 2 (1,7) coded data bits

Fig. 6-8: Data format of one sector of magnetic disk for MDFE timing acquisition, sync byte
detection and clock tracking.

In hard disk drives, servo wedge signals are embedded into sectors and are used to control
the position of the read head in accordance with the track and sector specifications. The READ
ENABLE signal is activated after the servo wedge signal. The timing acquisition scheme that is
implemented in the MDFE prototype is the first scheme discussed in chapter 5 (see Section 5.2)
which uses special forward equalizer coefficients and switches. As mentioned already, the
preamble used for acquisition is the 6T (‘ + + + − − − ’) pattern and has a length of 13 channel
bytes (13×12=156 channel bits). The acquisition preamble allows the system to acquire the
desired sampling phase and frequency before reading the user data. In the beginning (i.e.,
STATE 1), the decision feedback loop is switched off and special forward equalizer
coefficients are used for acquiring the 6T pattern in the feedback register. When the feedback
register is correctly loaded, as recognized by the lock-to-preamble circuit (in STATE 2), the
feedback loop is switched on, the normal forward equalizer is used, and the preamble pattern is
circulated in the feedback register. After 5 channel bytes have elapsed (in STATE 3), the
frequency loop is enabled. After 11 channel bytes have elapsed, the PLL switches to the
tracking mode (the loop step-sizes are scaled down by a factor of 4, circulation of the feedback
register contents is stopped and actual decisions are applied) in STATE 4. The acquisition
process is about to end. In order to identify the beginning of the user data, the sync byte must
be detected correctly. In magnetic recording, a short preamble is desirable to leave more
storage area for the user data. In MDFE, the total acquisition preamble length is 15 channel
bytes (180 channel bits) inclusive of 2 sync bytes.

6.2.2 Implementation of timing recovery loop

A. The overall timing recovery loop


In MDFE, the timing, gain and DC loops are updated simultaneously. Recall from Chapter
4 (see eq. (4.29)) that the timing-error detector (TED) output is given by χ k = ek aˆk +1 if
aˆk +1 ≠ aˆk −1 , where ek = yk − aˆk and aˆk is the decision corresponding to yk . This indicates
that only samples associated with transitions in the input data are used for timing recovery [9].
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
112

Fig. 6-9 shows the schematic of the overall timing loop and the use of the A/D output for
generation of the error signal. Because of a 1-bit delay introduced by the A/D, the A/D output
at the instant k corresponds to the input yk −1 . As mentioned already, the input yk is transformed
by the A/D AD9002 into a 5-bit digital output. The most significant bit gives the bit decision
aˆk −1 and the remaining 4 bits, i.e., ek3−1 , ek2−1 , e1k −1 and ek0−1 , give the error between yk and aˆk −1 .
Among these four bits, ek3−1 gives the sign of ek −1 and the remaining three bits give the
magnitude of ek −1 .

VCO
A/D clock

CLC220 0 D8 âk-1
sgn(ek-1) charge-pump
summing
AD9002

+
50 e3k-1 & loop filter
bus _ Ain
yk e2k-1

e1k-1 |ek-1| TED processing


V-ref D4 e0k-1 5
- 4 eye
3

Fig. 6-9: Block schematic of the overall timing loop which uses the A/D output for generating the
error signal for TED.

B. Generation of error signals for TED


The circuit for generating the digital error signals required for implementing the TED is
shown in Fig. 6-10. The feedback register input α k −1 may be different from the decision bit
aˆk −1 . For example, in the beginning of acquisition, the register input α k −1 is updated by
circulating the register contents. A large error ek-1 occurs whenever aˆk−1 ≠ αk−1 . In this situation,
the TED circuit outputs sgn (ek −3 ) = aˆk −1 and | ek −3 |= 0 , with which the subsequent charge-
pump and loop filter circuits yield a maximum current output. Note that the input ek-1 and the
output ek −3 are both 4-bit patterns but the input ek −1 represents the error ek −1 = yk − âk −1
whiles the output ek −3 represents the 2-bit delayed version of χ k = 0.5ek −1 ( aˆk − aˆk −2 ) .
Therefore, we can view the output ek −3 as a 4-bit digitized TED output. The TED output,
which is activated by the condition α k −2 ≠ α k −4 , drives the subsequent charge-pump circuit
whose analog output is used for controlling a voltage-controlled oscillator (VCO). When the
slicer output is connected to the feedback register input, as in the case of the later stage of
acquisition mode and the case of tracking mode, the condition α k −2 ≠ α k −4 becomes
âk −2 ≠ aˆk −4 . For maximum speed, emitter-coupled logic (ECL) components are used in the
circuit of Fig. 6-10. The MC10H176 contains 6 master-slave type-D flip-flops and has much
smaller propagation delay with no increase in power-supply current compared to the standard
MECL 10K family. It is triggered by the same clock used for the A/D IC. The MC10H107 is a
triple 2-input exclusive OR/NOR gate. MC10H104 is a quad 2-input AND gate. All these ECL
components typically have 1 ns propagation delay, and are thus suitable for clock rates up to
150 MHz.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
113

shift register αk-2


input data
D Q αk-2 ≠ αk-4
αk-1
transition
10H176 10H107 10H107 gate
from A/D â
k-1
output D Q D Q D Q αk-4
MSB
10H176 10H176 10H176

error
MSB
e e3k-3
k-1
D Q sgn(ek-3)
D Q

10H176 10H176
e2k-3
10H176

e1k-3
10H176

e0k-3 _
|ek-3|
10H107 10H104

Fig. 6-10: Digital error generation for TED implementation.

C. Implementation of the charge-pump and loop filter


The digital error signals are converted into analog quantities via the charge-pump circuit as
shown in Fig. 6-11 with a RC loop filter. The loop filter output is applied to a VCO with a free-
running frequency of 150 MHz. Charge-pump circuits are widely used in PLL applications [10,
11].
There are three similar circuits as in Fig. 6-11 for the timing, gain and DC loops. The
implementation of digital TED as shown in Fig. 6-11 becomes rather simple since it does not
involve high-speed multiplier. The charge-pump circuit acts as a D/A converter. It is realized
by the component OPA2662, which is a versatile driver with high bandwidth (370MHz). The
OPA2662 transforms digital errors into analog currents in positive (for charge action) and
negative (for pump action) values at its output. It has two parallel voltage-controlled current
gates and yields zero output for zero differential input. The loop filter is realized by a RC
circuit and the values of the resistor and capacitor determine the time constant and bandwidth
of the loop. During acquisition, the upper and lower voltage-controlled current gates of the
OPA2662 are both enabled, thus producing a large loop gain. During tracking, a small loop
gain is desired in order to decrease jitter in the timing loop, and hence the upper gate of the
OPA2662 is disabled via the ‘gain switching’ signal. In practice, an extra current compensation
circuit is used (not shown in Fig. 6-11) at both sides of the OPA2662. This compensation is
meant to improve the accuracy of the charge-pump circuit with zero output for zero differential
input.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
114

αk-2 = αk-4

αk-4

50
sgn(ek-3) OPA2662
100
B
C
10H176 200 E
ENABLE output
33
_
gain switching To VCO
|ek-3| 50
B
100 C 9630
E
measurement
point
200
180 R 10k

C 0.1u

Fig. 6-11: Circuit for implementing the charge-pump and loop filter.

The measured static transfer function of the circuit according to Fig. 6-11 for deterministic
input error patterns is shown in Fig. 6-12(a). This can be interpreted as the timing function
since the input patterns of the digital TED represent timing errors and the output of the loop
filter represents an average of the TED output. The table on the right-hand side shows the
mapping from the input pattern to the equivalent timing-error output. The integers in the second
column of the table in Fig. 6-12(b) represent 15 error levels arising from the negative-most to
positive-most digitized timing errors that the TED detects. An arbitrary waveform generator
(AWG) is used to produce a pseudo-random pattern as the input. The corresponding output
current of the loop filter is measured and its result versus the averaged input error is shown in
Fig. 6-13. The averaged input error is obtained by averaging the 4-bit ek input waveforms over
one period of the pseudo-random input pattern. For example, if the period of the pseudo-
random input pattern is 15 bits, and we obtain 15 timing errors as {3, 5, 2, 6, -4, 4, 2, -7, 0, -5,
1, -3, 5, -2, -3} in one period, then the average input error is approximately 2.66. The transfer
functions shown in Figs. 6-12 and 6-13 illustrate that the TED implemented here results in a
timing function that has good linearity with zero output for zero input.
To analytically examine the TED performance, we did a modeling of the circuits that
implement the TED, charge-pump and loop filter. Using this, we evaluated the timing function
for high gain conditions. The result is shown in Fig. 6-14 along with the corresponding
measurement result from Fig. 6-12. Observe that the actual circuit behavior (i.e., the measured
result) agrees well with expectations (i.e., the computation result). The small difference in the
slope may be attributed to the use of extra current compensation at the OPA2662 output, which
is difficult to take into account when modeling the circuit.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
115

TED timing
input pattern error index
40
1 1 1 1 7
high-gain for acquisition 1 1 1 0 6
mode 1 1 0 1 5
1 1 0 0 4
Loop filter output

1 0 1 1 3
current (mA)

1 0 1 0 2
1 0 0 1 1
0
1 0 0 0
low-gain for tracking 0 1 1 1 0
mode 0 1 1 0 -1
0 1 0 1 -2
0 1 0 0 -3
0 0 1 1 -4
0 0 1 0 -5
-40 0 0 0 1 -6
-7 0 7 0 0 0 0 -7
DC input error (0000 to 1111)
sgn(ek) |ek|
(a) (b)

Fig. 6-12: (a) Measured static transfer function from the digital TED input to the output of the
loop filter for DC input; (b) Table that shows the relation between the input patterns and timing
errors.

15
high-gain for acquisition
mode
Loop filter output current
(mA)

low-gain for tracking


mode

-15
-4.5 0 4.5
Averaged input error
(solid lines: measurement for the DC input; dotted circles:
measurement for the pseudo-random input.)

Fig. 6-13: Measured transfer function from the input of the digital TED to the output of the loop
filter for pseudo-random input.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
116

40

theoretical calculation
circuit mesurement
Loop filter output
current (mA)

-40
-7 0 7
DC input error (0000 to 1111)

Fig. 6-14: Comparison of measured and calculated transfer functions from TED input to the loop
filter output under high-gain conditions.

6.2.3 Lock-to-preamble and sync byte detection circuits


As discussed in Chapter 5 (see Section 5.2), to avoid the potential false lock problem
arising from wrong decisions in the feedback register, the 6T preamble pattern must be
preloaded. This task is to be accomplished by the lock-to-preamble circuit, as shown in Fig. 6-
15, in less than 15 to 20 channel bits. The MC10H141 is a four-bit universal shift register
without external gating. Two MC10H141 circuits receive decisions and complementary bits
from the A/D, and pass them to the lower part of the logic gates to discriminate the 6T pattern.
When the lock-to-preamble circuit detects the 6T pattern, it produces a ‘preamble lock’ signal.
At the same time, the ‘feedback on’ signal for connecting the feedback equalizer output to the
summer input and the ‘fc/6’ signal (i.e., the A/D clock (fc=1/T) is divided by 6) for the sync
byte detection circuit are activated. The timing diagram of the lock-to-preamble circuit is
shown in Fig. 6-16.
The ‘ read enable ’ signal starts the process for reading the preamble. It activates the lower
part of the logic gate array to detect the presence of a 6T pattern in the shift register. When it is
confirmed that the 6T pattern is present, then the ‘preamble lock’, ‘feedback on’ and ‘load S2’
signals respond accordingly. At this stage, the feedback filter is connected to the summing node
and the shift register is circulating so that wrong decisions have no effect on the feedback
register. At the same time, a valid fc/6 clock is available and the timing loop is settling to the
desired phase. Near the end of the acquisition process, the ‘ sync enable ’, which is generated by
a sequencing counter, activates the tracking mode, decision bits are used to update the shift
register, and the sync byte detection circuit is engaged to start the procedure for detecting the
sync byte.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
117

feedback on

1
ak-2
0 10H141
D8 Q
D
Ain Q 10H141
- ref V-ref D4 load S2 sync enable
AD9002

fc/6

2X10H105

preamble lock
10H105 10H141
__________
read enable

Fig. 6-15: Lock-to-preamble circuit for preloading the 6T pattern.

start to read
preamble pattern
read enable

start to detect
sync byte
sync enable
6T pattern detected
preamble non-6T
lock
on
feedback on off

shift shift
load S2 parallel load (recirculating)

shift valid fc/6 clock


fc/6

Fig. 6-16: Timing diagram of the lock-to-preamble circuit.

The purpose of the sync byte is to demarcate the start of the user data. A false or missed
detection of the sync byte will necessitate a rereading the sector, and this increases average
access time. Several patents are filed on error-tolerant sync byte recovery methods [12-14], but
most of them have high complexity. The general method for detecting the sync byte is using
correlation techniques. That is, the sync byte is designed such that it has minimum cross-
correlation with preamble and user data while exhibiting a distinct autocorrelation. The sync
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
118

byte used in the MDFE read channel has 2 user bytes. The recommended sync byte in the NRZ
form is ‘6-down 6-up 9-down 3-up’ (‘down’ and ‘up’ refer to ‘-1’ and ‘+1’, respectively). This
byte is chosen to have high autocorrelation and minimum probability of confusion with both
preamble pattern and the (1,7) coded data. Fig. 6-17 shows the sync byte detection circuit. The
sync word is preceded by the 6T preamble and followed by the user data. The 24-bit sync byte
is detected by computing the cross-correlation of the contents of shift registers 10H141 and the
specific 8-bit setting (i.e., 01100001)1 for the connection to the input of 10H116, a line receiver.
These 8 bits (01100001) are the detection result of the 24-bit sync byte in the shift registers
10H141s. When the input data stream passes through the circuit, only the segment that contains
the correct sync byte can make the 10H116 produce the ‘sync’ pulse.
G G 10H104
I I
I I 100 Ω
VL
fc/6
D-type
flip-flop -5.2v

33∼50Ω
10H141
10H141
fc/6
D Q D Q D Q
>c >c >c clk 10H116

sync enable G G D Q
100 Ω sync
>c
data
D Q D Q D Q
10H141
10H141

33∼50Ω
>c >c clock:Fc/6
>c
clk
clock: fc VH

2 bytes 10H104
clock:fc/6 sync word: (24-bit)
NRZ Î … 000 111 000 000000 111111 000000000 111 XXX …
NRZI Î …100 100 100 000000 1 00000 100000000 100 XXX …
I Î…1 0 1 1 0 X …
G Î… 1 0 0 0 1 X …

Fig. 6-17: Sync byte detection circuit.

10

8
Cross correlation

6 threshold for
5.5 sync-byte
detection
4

0
-36 -30 -24 -18 -12 -6 0 6 12

Relative time (T)


(relative to the location of the sync)

Fig. 6-18: Cross-correlation of the data in the shift-registers and the word ‘01100001’.

1
Setting the connection between the output of 10H141 and the input of 10H116 to match this 8-bit
pattern, makes the cross-correlation of the content in shift registers and the connection pattern (i.e.,
01100001) to the input 10H116 to be maximum, which is 8 for the sync byte in the detection circuit.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
119

The resisters at the MC10H116 input are tuned in order to achieve a certain threshold for
triggering the sync byte flag. The threshold is set to be 5½, to make the detection circuit
tolerant to 2 bit errors in the sync byte. The sync byte is said to be detected when the cross-
correlation of data in the shift registers with the 8-bit pattern connecting to the input of
MC10H116 is greater than or equal to 6. Fig. 6-18 illustrates the cross-correlation versus
relative time in a time step of 6T. The cross-correlation has an ideal value of 8 at the end of the
sync byte word, and has value of 3 or less otherwise.
By way of illustration, we show in Fig. 6-19 the measured result obtained from the circuit
according to Fig. 6-17. The circuit output is normally ‘low’ when the sync byte has not been
detected. After the sync byte has been detected, the circuit produces an output ‘high’ and
remains ‘high’ for 6 bits before it goes to ‘low’. This 6-bit width pulse is recognized as the sync
byte flag, which marks the start of the user data zone.

6T pattern preamble 24-bit sync byte (1,7) coded user


for acquisition data for tracking

sync byte flag

bit decisions

recovered 150 MHz


read clock

Fig. 6-19: Measured waveforms from the sync byte detection circuit.

6.3 Evaluation of MDFE Prototype


The critical loop board, which consists of the summing node, the feedback equalizer and
the A/D AD9002, constitutes one of the most important modules of the MDFE prototype. To
evaluate how well the critical loop works, we measured the BER performance with both the
forward equalizer (i.e., the bi-quad board) and timing loop (i.e., the timing recovery board) off.
The test setup is shown in Fig. 6-20. The user density for the measurement is 2.5, and the data
rate is 100 Mbits/s.
Fig. 6-21 illustrates the measured and the calculated values of the BER versus the detection
SNR. The detection SNR is defined as the ratio of the square of inner-eye opening to the power
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
120

of noise and residual ISI at the input to the A/D. We use the detection SNR measure because
there is no forward equalizer here. At BER=10-6, the critical loop operates with about 0.3 dB
degradation from the theory. This loss may be due to hardware non-idealities at high data rates,
for example, the feedback equalizer in hardware may not be ideal and the A/D resolution in
hardware may not be accurately taken into account. That is, since the forward equalizer and
timing are ideal, the loss may arise due to the fact that the detection is computed for the ideal
loop whereas the experimental loop suffers from A/D resolution and inaccurate implementation
of feedback taps.

convolution of measured
MR head waveform with
ideal forward equalizer

TEK clock trigger


AWG2041

AWGN slicer BER analyzer


(noise) AD9002 (BitAlyzer 400)

feedback
equalizer
critical loop
board

Fig. 6-20: Bench test setup for measuring the BER performance of the critical loop board (the
portion in the dashed block) of the MDFE prototype.

Theoretical vs Experimental Error


1e -4.5
1e -5
measured
1e -5.5
1e -6
theory
BER

1e -6.5
1e -7
1e -7.5
1e -8
1e -8.5
1e -9
13 13.5 14 14.5 15 15.5

Detection SNR (dB)

Fig. 6-21: Measured and theoretical BER performance of the critical loop board.

To thoroughly evaluate the MDFE prototype in real time, we tested it on spinstand in the
clean room. The measurement involves the forward equalizer (the bi-quads module), critical
loop, timing recovery and gain control modules. The replay waveform from a spinstand is used
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
121

as the input to the prototype. The measured BER performance corresponds to about 1 dB loss in
SNR with respect to the theoretical performance computed for ideal implementation of the
forward/feedback filters and timing/gain loops. This can be due to many factors, such as non-
idealities of the equalizers and timing/gain recovery circuits, off track distortion of the MR
head on the spinstand, and input waveform jitters. We also measured the error propagation
performance. Error propagation can be characterized in terms of an “error burst distribution”,
i.e. the probability distribution of error bursts against the burst length. We measured the error
propagation characteristics of the MDFE prototype using the BitAlyzer400. The result, which is
shown in Fig. 6-22, shows that error propagation is a serious concern because burst errors
extend to above 30 bits2. This issue is investigated in [15] [16] and some methods for
minimizing error propagation in MDFE are suggested in [17] [18]. The measured performance
shown in Fig. 6-22 is comparable to that obtained from simulations [19].

Fig. 6-22: Histogram of MDFE burst errors measured using BitAlyzer400.

2
The longer the error bursts, the more powerful should be the ECC (error control code) that is required to
correct these error bursts. This translates to poor coding efficiency and complex ECC decoding circuits.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
122

-3
10

BER 10 -4

10 -5 with timing loop on


ideal timing clock

10 -624 25 26 27

User SNR [dB]

Fig. 6-23: Measurement of the BER performance of the MDFE board at user density of 2.5, with
and without active timing recovery loop.
To evaluate the timing recovery performance of the MDFE prototype, we measured the
BER curves by using the ideal clock and the recovered clock, respectively. With ideal clock,
the circuit boards are triggered by the clock from an arbitrary waveform generator (AWG),
which is also used to produce input data to the MDFE board. When using the recovered clock,
the timing recovery system is active and provides the clock for all parts of the MDFE board.
Measurement results for both situations are shown in Fig. 6-23. Comparison reveals that the
timing recovery loop causes about 0.2 dB SNR loss at user density 2.5. Since the timing
recovery circuits are implemented using discrete-components, such a loss is reasonable. We
found that the recovered clock is very susceptible to external factors such as layout of the
circuits, choice of decoupling of the power and signal tracks, termination of ECL outputs, and
testing cables for probing the signals. For this reason, an integrated solution is expected to work
better.
For the recovered timing clock of the MDFE prototype, we also measured the residual
phase jitter using the approach in [20]. The measurement system is depicted in Fig. 6-24. The
6T (‘ + + + − − − ’) pattern is written on the media so that the replay waveform from the
spinstand MR head is essentially a sine wave. This waveform is used to drive the prototype via
the Tektronix AWG2041 and MicroNetics Mx5108 programmable noise generator. The
prototype is triggered by the clock provided by the timing recovery circuit. The Tektronix
TDS784D DPO (Digital Phosphor Oscilloscope) is used to acquire and store the input samples
yn to the slicer. In software they are then reconstructed to an analog waveform y (t ) through
the method of interpolation (ITP). Then, the zero-crossing detection (ZCD) technique is applied
to estimate the sampling phase φk in the measured waveform. In another path, we use the same
data { ak } to drive an MDFE jitter model that characterizes the deterministic jitter components.
The output of the model is a phase estimate φk . The LMS adaptation algorithm based on the
estimated residual jitter ∆ k = φk − φk is used in turn to adjust the coefficients of MDFE jitter
model. This method is described in more detail in [20]. The result of the estimated random jitter
∆ k shows that the timing loop has about 1.8% RMS random jitter at user density 2.5 and SNR
27 dB. This jitter is found to cause about 0.2 dB SNR loss and is reasonable for the circuits
based on discrete-components.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
123

noise detector input


generator

MDFE y(t) yn
AWG prototype
clk DPO
ITP &
Timing
spinstand recovery ZCD
φk random
φk jitter
MDFE jitter
data {ak} ∆k
model

LMS adaptation

Fig. 6-24: Test setup for measuring random jitter in the recovered clock of the MDFE prototype.

6.4 Considerations of Hardware Implementation


As described in Section 6.2, the timing recovery system of the MDFE prototype operates at
a speed of 150 MHz and is realized using ECL (Emitter-Coupled Logic) circuits. ECL is
regarded as one of the fastest forms of digital logic. It offers both the logic speed and the logic
features to meet the demands for the MDFE timing recovery circuitry. To achieve the high
speed of 150 MHz, several important points must be considered.
First, time-delay through interconnect-wiring, which may be ignored in low-speed systems,
becomes highly important [21]. We use short wires and provide parallel functional tracks with
equal effective wire lengths as far as possible during layout. Second, waveform distortions due
to line reflections also become troublesome at high data rates. Improperly terminated lines can
result in reflections that may cause false triggering. The usual solution, as applied in RF (Radio
Frequency) technology [22], is to employ transmission-line practices and properly terminate
each line/wire with its characteristic impedance. For ECL circuits, their low-impedance and
emitter-follower outputs facilitate transmission-line practices without seriously affecting
voltage levels. Third, the possibility of cross talk between adjacent signal lines is
proportionately increased in high-speed circuits. This is the result of very steep leading and
trailing edges of the high-speed signal. These steep edges are rich in harmonics that couple
readily to adjacent circuits. The Motorola ECL series that we used to design the MDFE timing
circuits have specially reduced rise and fall times, this reduces cross talk without compromising
other important parameters. Finally, precautions must be taken on electrical noise generation
and pick-up in the circuits because these are very detrimental at high speeds. In general, these
considerations are speed and frequency dependent.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
124

Fig. 6-25: MDFE prototype circuits and timing board.

For the MDFE timing circuit board, we designed a six-layer PCB (Printed Circuit Board)
containing signal lines, terminated power, power supply and ground. Each ECL line is kept as
short as possible and, at the same time, well decoupled by capacitors and shielded by the
ground layer on the opposite side of the board. Interconnections of signals and the clock with
other functional boards are accomplished via coaxial lines whose lengths and corresponding
propagation delays have been carefully selected. Fig. 6-25 illustrates the finished timing board,
and it works well for the MDFE prototype.

6.5 Conclusions
A 100 Mb/s experimental MDFE read channel was designed and prototyped using discrete-
components. The MDFE timing recovery system was described, its implementation issues were
investigated, and the performance was measured. The timing recovery board works well at high
data rate to support a robust MDFE prototype. The measurement results show that the circuit
performance is close to the expected result. The implementation is simple and reliable, and can
be easily realized by an integrated solution.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
125

References:
[1] J. G . Kenney, L. R. Carley, and R. W. Wood, “Multi-level decision feedback equalization
for saturation recording, ” IEEE Trans. Magn., vol. 29, no. 3, pp. 2160-2171, July 1993.
[2] J. Moon and L. R. Carley, “Performance comparison of detection methods in magnetic
recording,” IEEE Trans. Magn., vol. 26, no. 5, pp. 3155-3172, Nov. 1990.
[3] L. R. Carley and J. G. Kenney, “Comparison of computationally efficient forms of
FDTS/DF against PR4-ML,” IEEE Trans. Magn., vol. 27, no. 6, pp. 4567-4572, Nov.
1991.
[4] J. Hong, Y. X. Lee, H. Mutoh, Q. C. Sun, H. Ueno, J. J. Wang, and R. W. Wood, “An
experimental MDFE detector,” IEEE Trans. Magn., vol. 33, no. 5, pp. 2776-2778, Sept.
1996.
[5] Y. X. Lee, Q. W. Jia, J. J. Wang, Q. Li, L. Bi, H. Ueno, and H. Mutoh, “An experimental
M2DFE detector,” IEEE Trans. Magn., vol. 35, no. 5, pp. 2292-2294, Sept. 1999.
[6] G. Mathew, B. Farhang-Boroujeny, and R. W. Wood, “Design of multilevel decision
feedback equalizers,” IEEE Trans. Magn., vol. 33, no. 6, pp. 4528-4542, Nov. 1996.
[7] Y. X. Lee, G. Mathew, Q. C. Sun, J. J. Wang, H. Mutoh, J. Hong, and R. W. Wood,
“Design, implementation and performance evaluation of an MDFE Read channel,” IEEE
Trans. Magn., vol. 34, no. 1, pp. 166-171, Jan. 1998.
[8] J. Kenney and M. Melas, “Pipelining for speed doubling in MDFE,” in Proc. IEEE Intl.
Conf. Commun. (ICC), USA, 1996, pp. 561-565.
[9] Y. X. Lee, L. K. Ong, J. J. Wang, and R. W. Wood, “Timing acquisition for DFE
detection,” IEEE Trans. Magn., vol. 33, no. 5, pp. 2761-2763, Sept. 1996.
[10] F. M. Gardner, “Charge-pump phase-lock loops,” IEEE Trans. Commun., vol. 28, no. 3,
pp. 1849-1858, Nov. 1982.
[11] M. V. Paemel, “Analysis of a charge-pump PLL: A new model,” IEEE Trans. Commun.,
vol. 42, no. 7, pp. 2490-2498, July 1994.
[12] A. Gupta, “Error-tolerant byte synchronization recovery scheme,” U.S. Patent 5,544,180,
Aug. 1996.
[13] J. Hong and R. W. Wood, “Method and apparatus for determining byte synchronization
within a serial data receiver,” U.S. Patent 5,448,571, Sept. 1995.
[14] T. Setoyama, “Data and synchronization signal and outputting apparatus for recovering
missing data and synchronization signals,” U.S. Patent 5,546,243, Aug. 1996.
[15] V. Y. Krachkovsky, Y. X. Lee, and B. Liu, “Error propagation evaluation for MDFE
detection,” IEEE Trans. Magn., vol. 33, no. 5, pp. 2770-2772, Sept. 1996.
[16] V. Y. Krachkovsky, Y. X. Lee, G. Mathew, B. Liu, M. Y. Lin, R. W. Wood, and M.C.
Tong, “Error propagation evaluation for RLL-constrained DFE read channels,” IEEE
Trans. Magn., vol. 34, no. 1, pp. 147-152, Jan. 1998.
[17] N. Likhanov, Y. X. Lee, and G. Mathew, “Error propagation suppression in MDFE and
M2DFE detectors,” The 4th MDFE Consortium Meeting, Singapore, April 1998.
[18] H. Ueno, K. Shimoda, T. Suguwara, and H. Mutoh, “A method for suppressing error
propagation in (1,7) MDFE detectors,” IEEE Trans. Magn., vol. 35, no. 5, pp. 2289-2291,
Sept. 1999.
[19] G. Mathew, Y. X. Lee, and V. Y. Krachkovsky, “A novel threshold technique for
minimizing error propagation in MDFE read channel,” in Proc. IEEE Intl. Conf. Global
Telecommun. (GLOBECOM), Sydney, Australia, Nov. 1998, pp. 2898-2903.
CHAPTER 6: Implementation of Multi-level Decision Feedback Equalization
Timing Recovery System
126

[20] J. W. M. Bergmans, “Adaptive jitter characterization,” submitted to IEEE Trans.


Commun., 2001.
[21] M. Montrose, “Printed circuit board design techniques,” Seminar presentation at EEM
Advancement centre Pte Ltd, Singapore, June 1996.
[22] J. B. Hagen, Radio-frequency electronics: Circuits and applications. New York:
Cambridge University Press, 1996.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
127

CHAPTER 7
ASYNCHRONOUS EQUALIZER
ADAPTATION AND
INTERPOLATIVE TIMING
RECOVERY FOR A DIGITAL
OPTICAL RECORDING SYSTEM

7.1 Introduction
As technology allows ever more complex functions to be implemented, digital signal
processing techniques play an increasingly important role in storage systems. Good evidence is
the use of sample rate conversion (SRC) and interpolative timing recovery (ITR) for replacing
the conventional voltage-controlled oscillator (VCO) based timing recovery in digital recording
systems [4] [5] [6] [27] [28]. The use of SRC and ITR enables a low-complexity method for
using a free-running clock to sample the replay signal and permits 100% digital equalization
and timing recovery. Fully digital ITR has advantages of lower cost and higher stability. State-
of-the-art digital integrated circuit (IC) technology sustains digital design with greater designer
productivity and much easier, more reliable system design verification as compared to analog
and mixed analog/digital designs. To utilize these advantages, we develop a new asynchronous
equalizer adaptation structure with fully digital ITR. The structure is generic, attractive, and
widely applicable. Its development is illustrated for a digital optical recording system according
to the proposed digital video recording (DVR) standard [1] [2].
The conventional approach for asynchronous equalizer adaptation uses the Least-Mean-
Square (LMS) algorithm. This approach has to involve a mechanism of compensation for
latency of gradient computation and an inverse sample-rate converter for synchronous-to-
asynchronous conversion. This translates to a considerable implementation complexity. To
avoid this, a new structure, called latch-based structure, is proposed to replace the LMS
algorithm by a zero-forcing (ZF) algorithm and the inverse sample-rate converter by a simple
latch. This structure does not have to compensate for latency and has no need to use an inverse
sample-rate converter; thus, it provides an extremely simple solution for implementation. Study
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
128

and simulations indicate that this structure can work as well as an inverse sample-rate converter
based structure.
This chapter is organized as follows. In Section 7.2, we review interpolation techniques and
ITR applications. We discuss the architectures of asynchronously sampled systems and the
advantages of using ITR in DVR applications in Section 7.3. In Section 7.4, we investigate
equalizer adaptation algorithms and develop a new latch-based adaptation structure with ITR.
We apply this new structure for investigating the asynchronously sampled read channel for the
DVR system in Section 7.5. Section 7.6 studies the sample-rate converter and ITR, and
investigates their realizations. In Section 7.7, we simulate the DVR system and evaluate the
asynchronous equalizer adaptation and ITR loops for various sampling rates and conditions.
Summary and further discussions are provided in Section 7.8.

7.2 Overview of Interpolation Techniques


This section reviews interpolation techniques, which are key to ITR systems. In
asynchronously sampled recording systems, the sampling device operates on the replay signal
at a fixed sampling rate. The sampling clock is not necessarily synchronized to the incoming
data. The subsequent digital function must be accomplished in an asynchronous manner with
respect to the data rate, and the interpolative timing recovery has to employ a sample-rate
converter to convert the asynchronous samples to synchronous samples. A timing error signal is
extracted based on these synchronous samples.
The ITR technique has long since been used in optical recording. The first publication on
the application of ITR dates back to 1992 [3] for compact disc (CD) players, well before the
applications of ITR in magnetic recording [4] [5] [6]. The approach of using an interpolation
technique with a free running clock for various receiver applications has recently been
appearing in the literature [22] [7] [8] [9] [10] [11] [12]. At the heart of this approach is the
SRC technique. The SRC is the task of converting the sample rate of a digital signal to another
sample rate while a certain amount of information, usually in a limited frequency band, must be
preserved. Sampling-rate conversion is realized conceptually by first reconstructing the analog
version of the input digital signal, and then re-sampling it at a different sampling rate. In
practice, the task of SRC is usually accomplished by using an interpolation filter.
The design of interpolation filters is normally based on continuous-time functions such as
the sinc function [13] [14] [15] or polynomials [16] [17] [30] [18] [19]. Such designs are
generally not optimal in the sense of minimizing the mean-square error (MSE) at the input of
the detector. There are several articles on the design of optimal interpolation filters for SRC in
the context of multi-rate digital signal processing techniques [20] [21] [22], but they seldom
consider noise. There are several papers investigating the design of optimal interpolation filters
for symbol-rate ITR [23] [26] [27], but most of them do not present cost effective solutions that
can be easily implemented.
Feasibility of implementation of interpolation filters is always an issue, and it is of much
practical interest to look into how an interpolator can be designed and implemented. Since the
interpolation filter applied here serves to realize the SRC task, it works as a temporal
interpolation filter. This means that such filters can have time-varying coefficients, depending
on a time-varying phase variable. A direct approach to design such filters is to quantize these
phase variables and load appropriate filter coefficients from a predetermined table for each
phase. In this approach, the filter coefficients can be designed ahead of time, for example, by
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
129

using the LMS algorithm. An alternative approach is to approximate the dependence of the
coefficients on phase by a polynomial. This leads to the Farrow structure [16]. Another
approach is to treat the asynchronous samples as having timing phase errors and to reconstruct
the synchronous samples using a time-varying amplitude error predictor [24]. This approach is
simple for the PR-IV scheme that has few decision levels. For high order PR schemes that have
more decision levels, this approach is complicated and not efficient.

7.3 Advantages of using ITR for DVR


Applications of ITR for magnetic recording are described in [5] [6] [25] [26] [27] [28] [4].
In these papers, the sampling device invariably operates with an asynchronous free-running
clock at a sampling rate above the data rate. This makes sense because the maximum frequency
of magnetic recording channels is normally higher than the Nyquist frequency, viz. the half of
the data rate. For DVR channels, however, the maximum frequency can be well below the
Nyquist frequency. This implies that the sampling device in the read channel, which is usually
an analog-to-digital (A/D) converter, may possibly operate in an asynchronous manner at an
undersampling rate without causing information loss. The use of undersampling can
significantly reduce the sampling rate of the two most important devices, the A/D converter and
the equalizer, thereby considerably lowering the complexity and cost of the system.

read
decisions
signal analog low- digital FIR symbol
VGA pass filter A/D adaptive equalizer
detector

AGC VCO based equalizer tap error generator


PLL updating

(a) A conventional synchronously sampled read channel

read
signal decisions
analog low- digital FIR SRC symbol
VGA pass filter A/D adaptive equalizer
detector

AGC equalizer tap error generator


free-running updating NCO
clock

(b) An asynchronously sampled read channel

Fig. 7-1: Synchronously sampled versus asynchronously sampled read channels.

By way of illustration and comparison, we show in Fig. 7-1 the block schematics of
conventional synchronously sampled and asynchronously sampled read channels.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
130

Observe that in conventional synchronously sampled read channels (see Fig. 7-1(a)), the
AGC, equalizer adaptation and timing recovery form three nested loops. Here, the A/D
converter and the equalizer form part of the timing-recovery loop, and add to the timing loop
latency. This latency is unavoidable and can influence the timing loop stability. Because of this
concern, the length of the FIR equalizer usually has to be limited in order to mitigate the
equalizer latency that contributes to the timing loop. Furthermore, the A/D converter is required
to operate synchronously with respect to the data rate. When the channel rate becomes high, a
high-speed A/D converter has to be employed, which makes the system expensive. The
advantage of this structure is that all the digital signal-processing blocks in the receiver operate
with the same clock once the clock is acquired and tracked; thus, no sample-rate converter is
needed.
Unlike Fig. 7-1(a), the asynchronously sampled read channel uses interpolation techniques
(see Fig. 7-1(b)). The A/D converter samples the replay signal using a free-running clock.
Neither frequency nor phase of this clock is required to be synchronous with respect to the data
rate. The ITR loop operates on asynchronous samples to recover the synchronous samples. The
A/D converter and digital equalization filter are now outside the loop. Thus, their latency does
not contribute to the loop. Hence, a long equalizer can be used. On top of this advantage, it is
possible to lower the sampling rate of the A/D converter, for example, for the DVR channel
whose cut-off frequency is well below the Nyquist frequency. Therefore, the undersampling
nature at the A/D does not lead to any information loss, and the subsequent ITR and detection
tasks can, in principle, be fulfilled as well as in a synchronous read channel. With a lower
sampling rate A/D converter, the digital equalizer no longer has to operate at the data rate.
Another advantage of this type of read channel is that fully digital timing recovery with low
cost and high reliability can be implemented. In ITR systems, the conventional analog voltage
controlled oscillator (VCO) is replaced by a numerically controlled oscillator (NCO) for the
timing recovery loop, in which the frequency and phase errors are stored in digit registers.
These loop parameters are less easily affected by external disturbing factors, thereby providing
high system reliability.
We also have to mention the drawbacks of this ITR system. The fact that the digital
equalizer operates asynchronously at a different rate relative to the data rate increases the
complexity of equalizer adaptation, and hence favors shifting part of the burden of the equalizer
ahead to an analog filter. Furthermore, the use of SRC requires the design of an interpolation
filter, and adds to complexity of the system. The interpolation filter works as a multi-phase FIR
filter and its coefficients depend on a fractional phase variable that is adjusted by the ITR loop.
Thus, the interpolation filter phase accuracy is rather critical and can influence the entire
system performance. This has been observed in previous work reported in [30] [31] [23]. In
addition, the ITR system can have a potential risk of cross talk between the physical free-
running clock and the recovered synchronous clock. However, due to advances in technology
permitting more complex signal-processing functions to be implemented by state-of-the-art
digital IC technology, ITR can be implemented with high stability and at low cost.

7.4 Novel Asynchronous Adaptation Structure


This section develops a novel asynchronous equalizer adaptation structure, which will be
used for an asynchronously sampled DVR system in the subsequent study. To develop this
structure, we first study equalizer adaptation algorithms and conventional adaptation structures.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
131

7.4.1 Equalizer adaptation algorithms


Fig. 7-2 depicts the system model for studying synchronous adaptation algorithms. The
sequence rk , which is the read signal sampled at rate 1/T, is applied to the equalizer with the T-
spaced impulse response coefficients wk . The equalizer output yk is compared with its desired
value d k to form an error sequence ek = yk − d k where d k is obtained by applying the data
ak to a filter whose impulse response is a target response g k . Adaptation algorithms use ek to
calculate the adaptation gradients ∆ k for updating the coefficients wk for minimization of the
power of ek . In the following pages, we investigate how ∆ k can be calculated. Below, we
briefly study the LMS and zero-forcing (ZF) algorithms of equalizer adaptation.
∆k calculation ek
of gradients

rk yk dk ak
wk gk

Fig. 7-2: System model for studying synchronous equalizer adaptation algorithms.

A. Conventional LMS adaptation algorithm


Assume that the equalizer has the form of a transversal FIR filter with Nw+1 taps wkj (‘j’
denotes the coefficient index and ‘k’ denotes the time instant). It is well known that the
gradients for adaptation of wkj (0≤j≤Nw,), based on the LMS algorithm (see [46] [32] [33]
[34]), are given by
∆kj = ek ⋅ rk − j for 0 ≤ j ≤ N w . (7.1)

To minimize the complexity of the gradient computation, the use of so-called ‘signed-LMS’
algorithm was suggested in [35] in which the gradient is computed as
∆kj = sgn(ek ) ⋅ rk − j for 0 ≤ j ≤ Nw (7.2)

where sgn( x) = { 1
−1
x≥0
x<0 . It was reported in [35] that the SNR loss resulting from this
simplification is less than 0.2 dB, while significantly reducing the complexity of equalizer
adaptation. Note that the reference rk − j is noise corrupted and may also suffer from channel
distortions. To avoid this disadvantage, we resort to another simple algorithm, called the zero-
forcing (ZF) adaptation algorithm.

B. ZF adaptation algorithm
In the ZF adaptation algorithm, the noisy reference rk − j is replaced by d k − j in the
computation of ∆ kj [36] [46]. As a result, we have
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
132

∆kj = ek ⋅ dk − j for 0 ≤ j ≤ Nw . (7.3)

To avoid using a multiplier, we can use a signed-ZF adaptation algorithm as


∆ kj = sgn(ek ) ⋅ d k − j for 0 ≤ j ≤ N w . (7.4)

Note that the target response g k is predetermined and thus known to the receiver. It is usually
far shorter than the system response. Thus, calculating d k is very simple and introduces little
delay, thereby making the computation of ∆ kj extremely simple. Further, due to the absence of
noise in d k , equalizer adaptation behavior based on the ZF algorithm becomes comparatively
smooth and reliable.
The gradients (7.3) update the equalizer coefficients wkj at instant k+1 via the recursive
equations
wkj+1 = wkj − µ ⋅ ek d k − j for 0 ≤ j ≤ Nw (7.5)

where µ is a constant step-size parameter that controls the speed of adaptation.

C. Effects of ZF adaptation loop convergence


It can be analyzed that the time constant of adaptation, say Γ , is given by
−1 1
Γ= (7.6)
ln(1 − µ K ∆ ) µ K∆

∂∆ kj
where K ∆ E[ ] = E[(a ⊗ h) k − j ⋅ (a ⊗ g ) k − j ] , hk is the channel bit response, and ‘ ⊗ ’ stands
∂wkj
for linear convolution. The second step in (7.6) is only valid when µ K ∆ 1 . Correspondingly,
1
Γ . (7.7)
µ ⋅ E[(a ⊗ h) k − j ⋅ (a ⊗ g ) k − j ]
From this expression, we obtain some insights into the influence of the channel gain on Γ .
Observe that Γ is inversely proportional to the magnitude of hk. If a channel gain error occurs
resulting in twice larger magnitude compared to the ideal value, the convergence time constant
will be halved. Consequently, the equalizer will converge two times faster than expected. At the
same time, the magnitude of the equalizer steady-state coefficients will also be halved due to
the two times larger gain. On the contrary, if the channel gain is half of the ideal value, the
value of Γ will be doubled. This will lead to a two times longer period of time for the equalizer
to converge to the steady state and the equalizer coefficients will be two times larger. Hence,
time constants Γ are not predictable when gain errors occur in the channel.
Because the bit error rate (BER) is reliable only after the equalizer has converged, Γ is an
important parameter for the system. In practical recording systems, the replay signal usually has
a large dynamic range and hence an automatic gain control (AGC) mechanism has to be placed
in front of the equalizer. By doing so the power of the replay signal is constrained before it
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
133

enters the A/D converter and equalizer, so that the time constant of the adaptation loop is well
defined.

D. Simplification of equalizer adaptation scheme


For digital recording, the BER performance is primarily determined by data transitions,
because decision errors are most likely to occur in the vicinity of a transition, and away from
transitions even a slicer will almost make no bit errors. Thus, we may focus the equalizer
adaptation effort primarily on transitions. This can be realized according to
 w j − µ ⋅ ek ak − j if ak −1 ≠ ak +1 for 0 ≤ j ≤ N w . (7.8)
wkj+1 =  kj
 wk otherwise

The replacement of ek d k − j by ek ak − j in (7.8) is warranted by the fact that ak and d k have the
same sign for the optical recording system under consideration. Note that ak is a binary
sequence taking on values of +1 or –1. Thus the equalizer adaptation loop no longer requires a
multiplier for calculating the adaptation gradients. This simplification may work well enough
for our purposes.

7.4.2 A novel asynchronous equalizer adaptation structure


In the preceding discussion, the equalizer coefficients wkj are spaced by T seconds. The
adaptation of equalizer coefficients and computation of equalizer output are done in
synchronism with the data clock at rate 1/T. However, in asynchronously sampled read
channels with ITR as shown in Fig. 7-1(b), the equalizer and its adaptation no longer operate
synchronously since the A/D converter is running at a free-running clock rate 1/Ts. In this
situation, the equalizer coefficients wnj ( 0 ≤ j ≤ N w′ where N w′ +1 is the number of
coefficients) are spaced by Ts rather than T seconds. Unless mentioned otherwise, in what
follows we will use the subscript ‘n’ and ‘k’ to denote Ts-spaced and T-spaced samples,
respectively. The equalizer coefficients wnj are adjusted via the adaptation loop every Ts
seconds. As a result, the equalizer adaptation becomes
wnj+1 = wnj − µ ⋅ ∆ nj for 0 ≤ j ≤ N w′ (7.9)

where µ is a constant step-size and ∆ nj is the gradient at the instant nTs for updating the j-th
equalizer coefficient.
Now, an essential question is how to calculate ∆ nj . Unlike the conventional gradient ∆ kj ,
the gradient ∆ nj has to be calculated every Ts rather than T seconds.

Existing asynchronous equalizer adaptation topologies can be found in [4], which pertains
to a free-running sampling frequency above the data rate. Fig. 7-3 depicts these topologies.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
134

SR
∆n en SRC-1

τ ek
∫ µ τ dk gk ak

rn wn xn SRC yk

1/Ts 1/T

(a): Topology I with LMS scheme


SR
∆n
τ

∫ µ en dn
SRC-1
dk
gk
ak

τ
rn wn xn τ yk
SRC

1/Ts 1/T

(b): Topology II with LMS scheme


Fig. 7-3: Existing topologies for asynchronous equalizer adaptation with LMS algorithm.

The structure of Fig. 7-3(a) performs the error calculation in the synchronous (1/T) domain
and converts the error ek to the asynchronous (1/Ts) domain via an inverse sample-rate
converter. Due to the presence of the SRC and inverse SRC, there is a delay τ between xn and
en in the adaptation loop. The compensation for this delay needs to be done in the forward path
before the reference sequence rn is applied to the shift register (SR). Unlike the structure of
Fig. 7-3(a), the structure of Fig. 7-3(b) calculates the error en in the asynchronous (1/Ts) domain
directly, but in this structure both equalizer input and output paths must include the
compensation for the delay τ. In this sense, the structure of Fig. 7-3(a) is simpler than the
structure of Fig. 7-3(b).
Observe that both topologies employ LMS adaptation. The advantage is that they provide
an accurate and straightforward gradient calculation for adaptation. This advantage is obtained
at the cost of an extra inverse sample-rate converter for producing the error en, which makes the
system more complex and expensive. Moreover, the use of a reference rn from the forward path
introduces noise into the multiplication, and at the same time, the delay τ induced by the SRC
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
135

and the inverse SRC operation has to be compensated for in rn in order to obtain valid gradients
∆ nj . This delay is usually fractional and can be difficult to estimate in practice.
To overcome these disadvantages, we propose a novel structure with the ZF algorithm as
shown in Fig. 7-4.

∆k
latch ∫ SR

ek
µ dk gk

rn wn xn SRC yk det ak

1/Ts 1/T

Topology with ZF scheme


Fig. 7-4: Novel structure for asynchronous equalizer adaptation with the ZF algorithm.

This structure is very simple compared with those in Fig. 7-3. First, the reference for
calculating the gradient is obtained locally by using decision bits, and hence contains only
signal information without noise. Secondly, there is no need to compensate for an unknown
delay for calculating the adaptation gradients. This is because the detector and the target
response gk are known, and thereby the relative delay (which normally has an integer value) is
known. Therefore, this delay can be predicted and compensated for easily. Thirdly, we can use
an extremely simple inverse sample-rate converter, namely a bank of latches (or, equivalently,
a bank of zeroth-order interpolators). The reason is that coefficient values, which are obtained
from the adaptation gradients via multiplication by a small step size µ and integration, change
only slowly as a function of both 1/ T and 1/ Ts . Even the simplest conceivable inverse
sample-rate converter, namely a bank of latches, will then work well for conversion of
coefficient values from the synchronous to the asynchronous clock domain.
In principle, coefficient values produced by the bank of integrators pertain to a T-spaced
equalizer. The actual equalizer, however, is Ts-spaced. To resolve this mismatch, T-spaced
coefficient values would need to be converted into Ts-spaced values through some form of
‘spatial’ interpolation. For the sake of simplicity, the structure of Fig. 7-4 does not include
spatial interpolation. The absence of spatial interpolation will command a performance penalty
in situations when Ts deviates much from T. In the sequel, we will gauge this penalty through
comparison with a reference topology that does include spatial interpolation. As this topology
includes both spatial and temporal interpolation (the latter via the bank of latches), we will refer
to it as ‘STI’ (for Spatio-Temporal Interpolation). We will find that the penalty due to the
absence of spatial interpolation is negligible as long as T deviates less than about 40% from Ts.
It is worth noting that this latch-based structure can further save the high-speed multiplier
by applying the signed-ZF algorithm for implementation simplicity.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
136

7.5 DVR System Model


This section describes a DVR system model that will be used for investigating the
asynchronous equalizer adaptation and ITR structures. Fig. 7-5 depicts this system model.

∆n ∆k
SRC-1 SR

tn=nTs ek
n(t) dk ak
gk
ak r(t) rn equalizer
xn yk detector
âk
h(t) LPF SRC
1/T

1/Ts 1/T

Fig. 7-5: DVR system model.


A d=1 data sequence ak of rate 1/T is applied to a DVR channel with bit response h(t) and
additive white Gaussian noise n(t) to obtain the noisy replay signal r(t). A simple model for h(t)
is
2
 sin(π ⋅ Ω c ⋅ t / T ) 
h (t ) = Ω c   . (7.10)
 π ⋅ Ωc ⋅ t / T 
Here, Ωc<0.5 is the channel cut-off frequency normalized with respect to the symbol rate 1/T.
The frequency response of the DVR channel, denoted H (Ω) ( Ω = f ⋅ T being the normalized
frequency variable), is
 |Ω|
 1− , | Ω |< Ωc ,
H (Ω) =  Ωc (7.11)
 0, | Ω |≥ Ωc .

By way of illustration we show in Fig. 7-6 h(t) and H (Ω) for Ωc=0.20, 0.25 and 0.33. The
replay signal r(t) is applied to a low-pass filter (LPF) that is meant to limit the noise bandwidth
prior to the sampling operation. This sampling operation occurs at a crystal-controlled sample
rate 1/Ts, which is not necessarily synchronous to the data rate 1/T. The resulting sequence rn
of rate 1/Ts is applied to a Ts-spaced transversal equalizer that is adapted at rate 1/Ts via an
adaptation loop. The equalizer output xn of rate 1/Ts is applied to the sample-rate converter to
obtain a sequence yk of rate 1/T that is synchronous with respect to the input data ak. The phase
of the sample-rate converter is updated at rate 1/T via the ITR loop. The sequence yk is used by
the bit detector, the TED of the ITR loop and the equalizer adaptation loop.
Since the focus of this chapter is equalizer adaptation and ITR, we assume that the detector
produces correct decisions. We construct the error signal ek as ek = yk − d k where dk is the
desired value of yk and is obtained as d k = (a ⊗ g ) k in terms of the target response gk and data
ak . For the sake of convenience, we use a 5-tap target response gk given by
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
137

[ g −2 , g −1 , g 0 , g1 , g 2 ] = [0.17, 0.5, 0.67, 0.5, 0.17] . (7.12)

0.4 1

Ωc=0.33
0.8
0.3 Ωc=0.25 Ωc=0.33
Ωc=0.20 0.6
Ωc=0.25

| H (Ω) |
Ωc=0.20
h (t)

0.2
0.4

0.1
0.2

0 0
-8 -7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 8 0 0.1 0.2 0.3 0.4 0.5
Time : t Frequency: Ω ( normalized by data rate)

Fig. 7-6: DVR channel bit response h(t) and its magnitude response for Ωc=0.20, 0.25 and 0.33.
By way of illustration, we show gk and its magnitude response in Fig. 7-7. The reason for
using this response is that it resembles a typical high-density DVR system target with a
normalized cut-off frequency Ωc around 0.33.
0.7
2.5
0.6
2
0.5
1.5
|G(ej2πΩ)|

0.4
gk

0.3 1

0.2
0.5
0.1
0
0 0 0.1 0.2 0.3 0.4 0.5
-3 -2 -1 0 1 2 3

k Ω

Fig. 7-7: Target response gk and its magnitude response.


1 3
Equalizer output

2
Channel output

0.5
1

0 0

-1
-0.5
-2

-1 -3
-2 -1 0 1 2 -2 -1 0 1 2
Time: t Time: t
Fig. 7-8: Eye patterns at the channel output and the equalizer output in the absence of noise for
Ωc=0.33.
To better understand the DVR system model, we show in Fig. 7-8 two eye patterns for
Ωc = 0.33. Observe that the eye patterns essentially have 10 levels at the input of detector. This
multi-level nature of the DVR signal is determined by the DVR channel and target responses,
and in return affects the complexity of equalizer adaptation, ITR and detection algorithms.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
138

7.6 Realization of SRC and ITR for DVR


7.6.1 Realization of sample rate conversion for DVR
A. Principle of SRC and its digital realization
Sample rate conversion can be thought of as a procedure of resampling after reconstruction.
Conceptually, an analog signal is first reconstructed from the digital signal by means of D/A
conversion and filtering, and is then resampled at a different rate [37]. Therefore, SRC can be
viewed as a process of resampling as illustrated in Fig. 7-9(a). The digital signal xn of rate 1/Ts
is applied to a linear pulse modulator with symbol response c(t) to reconstruct the continuous-
time signal y(t), and y(t) is then sampled at a different rate, say 1/T, to form a digital signal yk of
sin(π t / Ts )
rate 1/T. Ideally, c(t) is a sinc pulse according to c(t ) = . However, this function
π t / Ts
cannot be realized in practice. One solution is to apply a truncated version of the sinc pulse,
which will cause a certain degradation. A more practical solution is to exploit a realizable
function that can meet the SRC requirements without causing a significant degradation. Basic
requirements include: a) c(t) should be symmetric for avoiding phase distortion; b) the in-band
magnitude response of c(t) should be essentially flat; and c) the response should prevent
aliasing, i.e. out-of-band components should be rejected sufficiently.

tk=kT
SRC
SRC
xn c(t) y(t) yk xn sample- y(mk) cnµk yk
1/Ts selector 1/Ts interpolation filter 1/T
1/Ts 1/T
m kT s µk T s

(a) (b)

Fig. 7-9: Block diagram of SRC. (a): principle of resampling after reconstruction; (b) digital
realization of SRC.
Actual implementation of the SRC is entirely digital, as shown in Fig. 7-9 (b), which can be
viewed as the SRC portion of the ITR for the DVR system shown in Fig. 7-5. The equalizer
output yn of rate 1/Ts is applied to the sample-selector that has an integer index mk for choosing
the interpolator input samples ymk from xn. The interpolation filter with impulse response
µk
coefficient cn operates on its inputs ymk to produce the synchronous samples yk. The variable
µ k ∈ [0,1) is a time-varying phase of the interpolation filter relating to the index mk. The
instants tk for forming the samples xk can be expressed in terms of Ts according to
tk = (mk + µ k )Ts , where the integer part mk and the fractional part µ k are determined by

t  t t 
mk =  k  , µ k = k −  k  (7.13)
 Ts  Ts  Ts 
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
139

where  x  means the maximum integer not exceeding x. Ideally, the spacing of these instants
tk is equal to T seconds , i.e.
tk − tk −1 = (mk + µ k )Ts − (mk −1 + µ k −1 )Ts = T , (7.14)

so that we can determine mk and µ k in a recursive fashion as

T  T T 
µ k = µ k −1 + −  µ k −1 +  , mk = mk −1 +  + µ k −1 − µ k  . (7.15)
Ts  Ts   Ts 
The instants tk are determined by the ITR loop, which involves a TED, a loop filter, and a NCO.
We will investigate the ITR loop in detail later.

B. Interpolation filter design


The DVR channel has a low-pass nature. We need an interpolation filter with a pass-band
that covers all the desired DVR components while rejecting the out-of-band noise as much as
possible. One can refer to [17] [26] [27] for mathematical formulas of interpolation filter design
in the presence of noise. For the sake of simplicity and convenience for study and simulation,
we use an existing six-point Lagrange-interpolator with impulse response [38]
 µ ( µ 2 − 1)( µ 2 − 4 )
 n = − 3,
 120
 − µ ( µ 2 − 1)( µ + 2 )( µ − 3)
 n = − 2,
24
 (7.16)
 µ ( µ + 1)( µ 2 − 4 )( µ − 3)
 n = − 1,
c nµ =  12
 − ( µ 2 − 1)( µ 2 − 4 )( µ − 3)
n = 0,
 12

 µ ( µ − 1)( µ 2 − 4 )( µ − 3)
n = 1,
 24

 − µ ( µ 2 − 1)( µ − 2 )( µ − 3)
n = 2.
 120

Impulse response Magnitude response


1.2 1

1
0
0.8
Magnitude (dB)
Amplitude: c(t)

-1
0.6

0.4
-2

0.2
-3
0

-0.2 -4
-3 -2 -1 0 1 2 3 0 0.1 0.2 0.3 0.4 0.5

Time: t Normalized frequency with respect to 1/Ts

Fig. 7-10: Impulse response and frequency response of the six-point Lagrange interpolator.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
140

This interpolator has six coefficients and induces a three-symbol delay at its output. The reason
for using this interpolator is that it is simple and has good features in its time- and frequency-
domain responses. It exhibits a symmetric time-domain response that induces no phase
distortion, and its frequency response has a 3 dB cut-off frequency at 0.4/Ts, which is
sufficiently good for our purposes. These features are illustrated in Fig. 7-10.

7.6.2 Realization of ITR for DVR


Having studied the DVR system model, the equalization filter, and the SRC structure, we
can now describe the asynchronously sampled DVR system of Fig. 7-5 in more detail, as shown
in Fig. 7-11.

d=1
noise un SRC
data
equalizer
cnµk
ak r(t) rn xn âk
wn sample interpolation yk detector
h(t) A/D filter
1/T 1/Ts 1/Ts selector 1/T 1/T
mk µk
1/T
equalizer adaptation
tn =: nTs+φ NCO (phase error error
accumulator) generator
∆µk ek
loop χk TED
ITR Loop filter

Fig. 7-11: Asynchronously sampled DVR system with ITR loop.

In the ITR loop, the TED output χk is computed based on the error samples ek. It is then
filtered by the loop filter to create a gradient ∆µ k for the phase error accumulator. The loop
filter is similar to the one used in the conventional VCO-based timing recovery loop. The phase
error accumulator can be viewed as a numerically controlled oscillator (NCO), similar to the
VCO in an analog timing-recovery loop. Its outputs are used to update the integer index mk
and the fractional phase µ k for the sample-selector and interpolation filter, respectively, of the
SRC.

A. ITR loop recursive update


In practice, each instant tk can suffer from a timing phase error τ k T . One objective of the
ITR loop is to update the interval of instants tk to track T. Thus, the ITR loop update should
strive to accomplish
[tk + τ k T ] − [tk −1 + τ k −1T ] = T . (7.17)

Using the fact that tk = (mk + µ k )Ts , we reformulate (7.17) as


CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
141

T
mk + µ k − mk −1 − µ k −1 = [1 − (τ k − τ k −1 )] . (7.18)
Ts
Upon realizing that the value of µ k is fractional and that mk is integer, we obtain the recursive
equations for updating mk and µ k in the presence of a timing phase error as

 T 
µ k =  µ k −1 + [1 − (τ k − τ k −1 )]  mod 1 (7.19)
 Ts 
and

 T 
mk = mk −1 +  µ k −1 + [1 − (τ k − τ k −1 )] (7.20)
 Ts 
where ( x mod 1) x −  x  ∈ [0,1) .

The difference of normalized timing phase errors, say [τ k − τ k −1 ] , is updated under the
control of the NCO-based ITR loop. This difference can be interpreted as the NCO control
variable, which is effectively a period or frequency control knob. The ITR loop is responsible
to force this difference, on average, to zero. This can be accomplished by using a second-order
PLL loop, given by
λk = λk −1 − β ⋅ χ k −1
(7.21)
[τ k − τ k −1 ] = λk − α ⋅ χ k −1
where α and β are step-size values and χ k is the TED output. The term λk compensates for
frequency offset and the NCO output accounts for accumulation of [τ k − τ k −1 ] . The use of
negative sign in (7.21) is due to the fact that the TED has positive (negative) outputs for
positive (negative) timing phase errors. In practice, α and β can be determined by

2ς d ω nd T (ω nd T ) 2
α= , β= (7.22)
Kd Ko Kd Ko

where K d and K o are the gain factors of the TED and NCO, respectively. Here, ς d and ω nd
are the damping factor and natural frequency of the PLL, respectively. In most practical
situations, ς d 0.707 and ω nd T 0.1 .

B. ZF based TED for DVR


Several algorithms have been reported in the literature for symbol-rate timing recovery [39]
[40] [41] [42] [43] [44] [45] [46]. Among these algorithms, the zero-forcing (ZF) TED is
attractive for its simplicity and near optimal performance.
In this section, we consider sample-based and transition-based ZF TEDs, and compare their
data-aided (DA) and decision-directed (DD) schemes. The DA TEDs use the original data,
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
142

while the DD TEDs use the actual decisions. For the ITR system given in Fig. 7-11, one
example of the symbol-rate ZF TED output can be written as (see Chapter 4)
χ k = ( yk −1 − d k −1 )(d k − d k − 2 ) (7.23)

where yk is the reconstructed synchronous sample value at the interpolation filter output and
d k is the desired value of yk , given by d k = (a ⊗ g ) k where ak is the data and g k is the
target response. Usually, d k is produced based on decisions aˆk rather than the data ak , i.e.,
dˆ = (aˆ ⊗ g ) . This leads to the DD TED scheme given by
k k

χ k = ( yk −1 − dˆk −1 )(dˆk − dˆk − 2 ) . (7.24)

Concentrating primarily on transitions in yk , we may simplify the TED in (7.24) into a


transition-based TED as
χk = ( yk −1 − dk −1 )[sgn(dk ) − sgn(dk −2 )] .
This implies that only the samples yk −1 that meet the condition sgn( d k ) ≠ sgn( d k − 2 ) contribute
to the TED output. The TED output is set to zero under other conditions.
For the DVR system incorporating the rate 2/3 (1,7) code and the 5-tap target response g k
given in (7.12), one can verify that d k and ak are of same sign, i.e. sgn(d k ) = ak for all
instants k. Applying this fact to χk = ( yk −1 − dk −1 )[sgn(dk ) − sgn(dk −2 )] and noting that the
useful component of χ k is the average value of χ k , we can further simplify the transition-
based TED as χk = yk−1(ak −ak−2) , or equivalently as (after allowing the loop to absorb a gain
factor of 2),
 yk −1ak if ak ≠ ak − 2 ,
χk =  (7.25)
0 otherwise .
Substituting aˆk for ak , we obtain the DD TED scheme as

 yk −1aˆk if aˆk ≠ aˆk − 2 ,


χk =  (7.26)
0 otherwise .
This is a very simple TED algorithm, which does not need a multiplier and the conventional
error calculation ek −1 = yk −1 − d k −1 . The TED output is given by yk at transitions with its sign
controlled by aˆk . Thus, this TED is simpler than the conventional error-based TEDs according
to (7.23) and (7.24).
By way of illustration, we evaluate the timing functions for the DVR based on the TEDs
according to (7.23), (7.24), and (7.26), and show the result in Fig. 7-12.
In this simulation, the normalized timing phase error τ ranges from –1 to +1. The six-
point Lagrange interpolator of (7.16) is used for forming yk . Timing phase errors are
introduced by changing the phase of the interpolation filter. For the DD TED scheme, bit
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
143

decisions aˆk are obtained from a threshold detector. Observe that the timing functions cross
zero at the ideal phase ( τ = 0 ), and have good linearity. The timing function in the data aided
case is monotonic due to the consistent use of the correct data ak . The decision-directed
function exhibits multiple zero crossings, which can be distinguished in practice by using
specific acquisition and byte synchronization mechanisms. Also, observe that the linear ranges
of the timing functions of these TEDs are comparable. The identical slopes of the functions are
due to the fact that the timing information resides mainly in the vicinity of transitions.

1.5
DA TED scheme (7.23)
DD TED scheme (7.24)
Simplified DD TED scheme (7.26)
Averaged TED output

-1.5
-1 0 1

Timing phase error: τ ( normalized in units T )

Fig. 7-12: Timing functions for the DVR with Ωc=0.33 TED, no noise, and d=1 data.

7.7 Simulations of Asynchronous Equalizer Adaptation


and ITR Loops for DVR

In this section, we simulate the asynchronous equalizer adaptation and ITR loops for DVR.
We show the simulation block diagram of the DVR system in Fig. 7-13. We specifically study
the latch-based asynchronous adaptation structure, which involves the adaptation calculation
block shown in Fig. 7-14, and the ITR calculation block shown in Fig. 7-15, where ‘D’
represents one bit interval delay. We first study adaptation and timing recovery separately, and
later consider joint adaptation and timing recovery. The DVR channel considered here has a
normalized cut-off frequency Ωc = 0.33 and additive white Gaussian noise (AWGN). The
noise is added behind the A/D converter according to Fig. 7-11. The SNR, which is defined in
the matched filter bound sense, is assumed to be 9 dB, which represents a worst-case situation
with BER around 10−2 ∼ 10−3 . We characterize the oversampling rate by a parameter
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
144

src (T − Ts ) / Ts . For instance, src < 0 denotes undersampling ( Ts < T ), src > 0 denotes
oversampling ( Ts > T ), and src = 0 denotes synchronous sampling ( Ts = T ).

adaptation
calculation
9
w n+1 ek
xn yk dk ak
rn equalizer SRC gk
NCO
output to detector
ITR
calculation

Fig. 7-13: Block diagram showing the equalizer adaptation and ITR loops in the DVR receiver (the
quantity wn+1 represents a vector consisting of 9 equalizer coefficients).

dk

adaptation calculation
SR wk
µ D 9
9 9 9 9
ek ∆k w k+1 w n+1
latch

Fig. 7-14: Schematic showing the equalizer adaptation based on the latch-based asynchronous ZF
adaptation structure.

ak

ITR calculation
1-D2 α
NCO
NCO output
yk χk input
D NCO

β
D
λk

Fig. 7-15: Schematic showing the ITR calculation for the NCO adjustment in the ITR loop.

7.7.1 Asynchronous equalizer adaptation


In the simulations of this subsection, only the equalizer is adapted. The timing recovery
loop is kept open and ideal sampling phase is assumed in the sample-rate converter.
A. Simulation of synchronous sampling operation scheme
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
145

First, we perform simulations for src=0 . The objective is to investigate the synchronous
equalizer adaptation performance, and to check if the interpolation filter is working fine. This
simulation sets a reference for asynchronous adaptation. Illustrative results are shown in Fig. 7-
16.
1.5 1.5

(a) (b)
w j, j=0,1,…,8

w j, j=0,1,…,8
1 1

0.5 0.5

0 0

Latch based structure Latch based structure


-0.5 -0.5
0 1000 2000 3000 4000 5000 6000 0 1000 2000 3000 4000 5000 6000

k k
1.7 10
noiseless DVR channel 1 noiseless DVR channel
1.3
noisy DVR channel at SNR=9dB
2 noisy DVR channel at SNR=9dB
Amplitude, w j

0
|W(Ω)| (dB)

0.9

(c) -10
(d)
0.5

-20 2 1
0.1

-0.3 -30
0 2 4 6 8 10 0 0.1 0.2 0.3 0.4 0.5
j Ω : fT
0.7 4
1 noiseless DVR channel
(f)
yk actual interpolator output
Averaged |ek|2

Interpolator output

3
0.5
2 noisy DVR channel at SNR=9dB dk desired interpolator output
2
(e) 1
0.3
0
1 2
0.1 -1

-2

-0.1 -3
0 1000 2000 3000 4000 5000 6000 5500 5550 5600 5650 5700

k k

Fig. 7-16: Simulation results of synchronous adaptation at Ts=T for a 9-tap equalizer. (a)
coefficient convergence for the noiseless channel; (b) for the 9 dB SNR channel; (c) steady-state
equalizer coefficients; (d) corresponding equalizer magnitude response; (e) time-averaged squared
error convergence; (f) detector inputs.

Fig. 7-16 illustrates the equalizer convergence for both the ideal (noise-free) and the worst-
case (9 dB SNR) channel conditions. The squared errors are averaged over 100 bits for smooth
display. The simulation shows that the 9-tap equalizer converges well to a symmetric impulse
response. The steady-state equalizer has a low-pass nature and even at 9 dB SNR its magnitude
response matches the ideal response very well. The error power converges well enough to
produce the desired detector inputs. The noise-like convergence of the equalizer coefficients at
9 dB SNR arises due to the gradient noise resulting from the use of a relatively large step size.
This simulation confirms that the sample-rate converter and latch-based adaptation structure are
working satisfactorily.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
146

B. Simulation of asynchronous schemes


We now consider the asynchronous situation. For undersampling, we choose src=-0.25 , or
equivalently, Ts = (4 / 3)T , so that aliasing is avoided, albeit marginally. For oversampling, we
choose src=0.25 , i.e. Ts = (3/ 4)T . The objectives of this part of the simulation are threefold:
(a) investigate the suitability of the 6-point Lagrange interpolator given in (7.16); (b) examine
the latch operation; and (c) evaluate the adaptation loop performance.
Since the latch replaces a spatio-temporal interpolation (STI), we evaluate the latch vis-a-
vis a STI. Recall from Section 7.4.2 that when we use a ‘latch’ for the conversion of equalizer
coefficients from the synchronous (1/T) to the asynchronous (1/Ts) clock-domain, only the
temporal interpolation operation is involved in the conversion, whereas ‘STI’ involves both
spatial and temporal interpolation operations. Performance is evaluated by examining the
differences in error power and by measuring the equalizer frequency response. The simulation
results are shown in Fig. 7-17 and Fig. 7-18 for Ts = (4 / 3)T and Ts = (3/ 4)T , respectively.

1 STI/no noise
5
1 STI/no noise
0.6 (a) 2 STI/9dB 0 2 STI/9dB

3 3 latch/no noise
|ek|2

0.5 latch/no noise


|W(Ω)| (dB)

0.4 4 latch/9dB
-5 4 latch/9dB

-10
4
Averaged

0.3
2 4 -15
0.2
0.1 -20
3 2
0 -25 (b)
-0.1
1 3 1
-30
0 1000 2000 3000 4000 5000 6000 0 0.1 0.2 0.3 0.4 0.5 0.6

k Ω : fT

Fig. 7-17: Simulation results of undersampled asynchronous adaptation (Ts=(4/3)T): (a) time-
averaged squared error convergence, (b) steady-state equalizer magnitude response.
0.5 5
(a) 1 STI/no noise 1 STI/no noise
0.4 2 STI/9dB 0 2 STI/9dB
|W(Ω)| (dB)

3
Averaged |ek|2

latch/no noise
3 latch/no noise
latch/9dB
0.3 4 latch/9dB -5
4
-10
0.2
2 4 3 4
0.1
-15 2
-20
0
1 3 -25 (b)
-0.1 1
0 1000 2000 3000 4000 5000 6000 -30
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
k Ω : fT

Fig. 7-18: Simulation results of oversampled asynchronous adaptation (Ts=(3/4)T): (a) time-
averaged squared error convergence, (b) steady-state equalizer magnitude response.

From the above simulations, we obtain the following observations. First, the 6-point
Lagrange interpolator according to (7.16) and Fig. 7-10 is sufficient for our purpose. The error
power converges well to comparable levels as in the synchronous situation, even at 9 dB SNR.
This confirms that the 6-point Lagrange interpolator can well preserve the in-band components
of the DVR channel.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
147

The second observation concerns the adequacy of the latch as replacement of the STI.
Observe that the error powers for the latch- and STI-based structures are almost identical in the
steady state. The most apparent difference between both can be observed in the steady state
equalizer magnitude responses. These responses differ significantly in the out-of-band region
(Ω>0.33) and match well in most of the in-band region (Ω≤0.33). For undersampling
( Ts = (4 / 3)T ), the responses for the latch-based structure produce some in-band deviations
near Ω=0.33. For oversampling ( Ts = (3/ 4)T ), the in-band deviation is insignificant. In both
cases, however, the out-of-band attenuation differs by more than 10 dB. The use of a latch
tends to boost the stop-band response as Ts deviates from T and to move its corner frequency
away from 0.33, which is the cut-off frequency of the target response.
Reasons for the above observations are twofold. One is that the effective span (in seconds)
of the equalizer is different for undersampling and oversampling for the fixed length (9 taps)
equalizer. Another reason is that compared to the STI-based structure, the latch-based structure
may produce quite different taps at the two ends of the equalizer due to the absence of the
spatial interpolator. This could be the reason for the boost in the out-of-band components (high
frequency region) and for the shifted corner frequency.
During the simulation, we observed big fluctuations in the coefficients in the cases of
oversampling relative to the cases of undersampling. This is because of the fact that for a given
SNR, noise power is considerably higher for Ts<T than for Ts>T. The noise power is
proportional to the channel bandwidth, and hence increases with the oversampling factor.
However, the effective span of the equalizer is less for Ts<T compared to Ts>T since the
number of taps is kept fixed.
To investigate the tolerable range of sampling rates and the latch-based adaptation loop
performance, we consider the sampling rates ranging from undersampling to oversampling
schemes, with the ITR loop opened. We evaluate the steady state error power and compare with
that for the STI-based structure. The results are shown in Fig. 7-19.
From Fig. 7-19, we obtain the following observations. First, at 9 dB SNR, noise increases
the error power baseline from –27 dB to –11 dB. At the baseline of –11 dB, we examined the
equalizer response, and found that it matched the ideal response very well. This was verified
through the simulations according to Fig. 7-16 to Fig. 7-18. Second, at 9 dB SNR, the
asynchronous adaptation loop works well for oversampling ratio T/Ts from 0.7 to 1.7, which
corresponds to Ts from Ts=1.43T to Ts=0.59T. Observe that the error power throughout this
range remains essentially constant and that the error power levels obtained for the latch- and
STI-based structures almost coincide. Performance of the latch-based structure deteriorates
rapidly relative to that of the STI-based structure when Ts<0.59T. The latch-based structure is
relatively sensitive to noise and to the selection of Ts when Ts<T. One reason could be the
insufficient span of the equalizer resulting in poor stop-band suppression.
We also observe a deterioration for Ts>1.43T. Unlike the deterioration for oversampling,
this deterioration is not primarily due to noise, but to the undersampling scheme itself. This is
because the latch-based structure causes a considerable in-band signal loss for Ts>1.43T. We
investigated and found that for Ts>1.43T and Ts<0.59T the equalizer performs badly in terms of
the frequency response and the sample-rate converter output samples. In both cases, the
equalizer taps are not well centered and the frequency response has a large in-band distortion
and weak out-of-band attenuation. Moreover, the cut-off frequency deviates far from its desired
position.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
148

Averaged |ek|2 (dB) 0 1 STI based structure


2 latch based structure
-5

-10 2

-15 (a) noiseless channel


-20
1

-25
-27 dB
-30
0.5 1 1.5 2
T
Ts
15
1 STI based structure
latch based structure
2
Averaged |ek|2 (dB)

10

5
2
0
(b) noisy channel at SNR=9 dB
-5

1
-10
-11 dB

-15
0.5 1 1.5 2

T
Ts

Fig. 7-19: Equalizer adaptation performance of the latch-based and STI-based structure for
various sampling rates with (a) no noise; (b) 9 dB SNR.

We explain the above observations as follows: The performance of the latch-based


structure, compared to the STI-based structure, deteriorates rapidly when Ts<0.59T and
Ts>1.43T. This roughly corresponds to Ts, being at least 40% of T smaller and bigger,
respectively, than T. In both situations, the omission of spatial interpolation, as in the latch-
based structure, seems too serious. The consequence is that the equalizer taps are effectively
located at least 40% away from their desired positions. As a result, large inaccuracies of the
equalizer taps inevitably occur, thereby making the latch-based loop work inadequately.
The above results suggest that for a considerable sampling rate range from Ts=1.43T to
Ts=0.59T, the latch-based structure can adequately replace the STI-based structure while
causing only a negligible degradation. This replacement greatly reduces the complexity of the
asynchronous adaptation loop. For DVR, undersampled operation is attractive in practice. The
investigation results suggest that the latch-based structure is promising for DVR systems.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
149

7.7.2 Interpolative timing recovery simulation


In the simulations of this subsection, we restrict the investigation to the interpolative timing
recovery loop for a fixed equalizer. The objective is to examine the performance of a fully
digital NCO-based timing recovery loop with the ZF TED algorithm according to (7.26). We
examine the ITR loop convergence and NCO outputs, and check if the injected initial timing
phase error and frequency offset eventually converge to zero. The initial values of timing phase
error and frequency offset are set to 0.3 and 0.002, respectively, which are normalized with
respect to the synchronous bit interval at the interpolator output. We use a second order PLL as
shown in Fig. 7-15. Unless noted otherwise we use ς d ≅ 0.707 and ω nd T ≅ 0.01 .

We simulate the second-order ITR loop and monitor the behaviors of the TED output, the
NCO output and the frequency offset. The results are shown in Fig. 7-20, Fig. 7-21 and Fig. 7-
22, respectively. The simulations cover the synchronous ( src=0 ) and asynchronous
( src=-0.25 , src=0.25 and src=-0.123 ) conditions. Observe that all ITR loop variables
properly converge to zero. This validates the adequacy of the TED. Even though we conducted
the simulations in the absence of noise, similar simulation results were obtained when the
channel has noise.

0.1
3
0.05
2
0
TED output

4 1
-0.05
-0.1 src=0
1
-0.15 2 src=-0.25
3 src=0.25
-0.2 4 src=-0.123

-0.25
0 1000 2000 3000 4000 5000 6000
Time: k
Fig. 7-20: TED output for various sampling rates for the noiseless DVR channel with a fixed
equalizer.
-3
×10
x 10
-3

4
4 1 src=0
3 2 src=-0.25
3 src=0.25
NCO output

2 3 4 src=-0.123

1
2
0

-1 1
-2
0 1000 2000 3000 4000 5000 6000

Time: k
Fig. 7-21: NCO output for various sampling rates for the noiseless DVR channel with a fixed
equalizer.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
150

-3
x 10
3
1 src=0
2 src=-0.25
4
Frequency offset

2 3 src=0.25
4 src=-0.123

1 3
2
0
1
-1
0 1000 2000 3000 4000 5000 6000
Time: k
Fig. 7-22: Frequency offset for various sampling rates for the noiseless DVR channel with a fixed
equalizer.

7.7.3 Simulations for joint asynchronous equalizer adaptation and ITR loops
Having separately investigated the asynchronous equalizer adaptation and ITR loops, we
can now operate these two loops simultaneously. The objectives of these simulations are to
evaluate the convergence behaviors of adaptation and ITR loops when they operate together,
and to examine their interaction. We consider synchronous and asynchronous situations and
simulate both noiseless and noisy conditions. For the asynchronous situation, we consider both
undersampled and oversampled operations. For undersampling we consider Ts=(4/3)T and
Ts=(10/9)T, which represent heavily and lightly undersampled cases. For oversampling we
consider Ts=(4/5)T and Ts=(1/2)T, which represent lightly and heavily oversampled cases.

A. Results in the absence of noise


In the absence of noise, the error power curves all converge to zero in the steady state in all
the cases considered here. The 9 coefficients of the equalizer converge well and are properly
centered in the steady state. By way of illustration, we show in Fig. 7-23 the simulation results
of the error power convergence, the steady-state equalizer magnitude response versus
normalized frequency f ⋅ T , and the convergence of ITR loop parameters (i.e. the TED output,
NCO output and frequency offset).
Observe that the magnitude response of the equalizer changes as a function of sampling
rate. For the synchronous case (Ts=T), the equalizer has approximately flat in-band response
and the cut-off frequency occurs at around 0.33. Out-of-band attenuation is greater than 10 dB.
For the heavily undersampled case (Ts=(4/3)T), the in-band response hardly changes, but the
out-of-band attenuation is less than 8 dB. The attenuation improves to slightly more than 15 dB
for the lightly undersampled case (Ts=(10/9)T). For the lightly oversampled case (Ts=(4/5)T),
the out-of-band attenuation exceeds 20 dB and the in-band response is perfectly flat. However,
the cut-off frequency increases thus increasing the equalizer bandwidth. This effect is even
more pronounced for the heavily oversampled case (Ts=(1/2)T). The increased bandwidth does
not pose a problem here because noise is absent. But we will observe later that when noise is
present, this increased bandwidth can lead to a performance degradation.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
151

0.15 5
1 1

Equalizer magnitude
Ts=T Ts=T
2 0 2 Ts=(10/9)T
5
Ts=(10/9)T 3

response [dB]
Error power

0.1 3 Ts=(4/3)T
Ts=(4/3)T -5 4 Ts=(4/5)T
4 5
Ts=(4/5)T 1 Ts=(1/2)T
5 -10
0.05 3 Ts=(1/2)T 2
4
2 -15

-20 3
0
1 4 5 -25

-0.05 -30
0 2000 4000 6000 0 0.2 0.4 0.6 0.8 1

Time: k Frequency: fT

-4 -4
x 10 x 10
0.04 10 6
1 1
Ts=T 2
Ts=T
2 2
0.02 Ts=(10/9)T 4 Ts=(10/9)T
1 3 3
Ts=(4/3)T Ts=(4/3)T

frequency offset
TED output

5 4 4 4
NCO output

Ts=(4/5)T Ts=(4/5)T
0 2 5 2 5
4 Ts=(1/2)T Ts=(1/2)T
3 5
1 5 3
-0.02 Ts=T
4 2 0
Ts=(10/9)T 0 3
3
5 Ts=(4/3)T
-0.04 2
4 Ts=(4/5)T 1 -2 1
5 Ts=(1/2)T
-0.06 -5 -4
0 2000 4000 6000 0 2000 4000 6000 0 2000 4000 6000
Time: k Time: k Time: k

Fig. 7-23: Simulation results of the latch-based equalizer adaptation and ITR loops for the
noiseless DVR channel for Ts=T, Ts=(10/9)T, Ts=(4/3)T, Ts=(4/5)T and Ts=(1/2)T.

The ITR loop parameters converge well as desired. For the synchronous, lightly
undersampled, lightly and heavily oversampled situations, the TED output, the NCO output and
the frequency offset converge smoothly. Only for the heavily undersampled case, ITR
convergence curves have small fluctuations. These simulation results suggest adequate
convergence behaviors for the noiseless case.

B. Results in the presence of noise


Now we look into the results in the presence of noise. Fig. 7-24 shows the results for the
equalizer adaptation loop.
0.16 5
Equalizer magnitude [ dB ]

1 Ts=T 1 Ts=T
0.14 2 2
Ts=(10/9)T 0 Ts=(10/9)T
3 Ts=(4/3)T 3 Ts=(4/3)T
0.12 4 Ts=(4/5)T 4 Ts=(4/5)T
5 Ts=(1/2)T -5 4 5 Ts=(1/2)T
Error power

0.1
-10 3
0.08 5 5
1
-15
0.06 3 4
1 2 -20
0.04

0.02 -25
2
0 -30
0 1000 2000 3000 4000 5000 6000 0 0.2 0.4 0.6 0.8 1

Time: k Frequency: fT

Fig. 7-24: Error power convergence of equalizer output and steady-state equalizer magnitude
response with latch-based equalizer adaptation and ITR loops at 9 dB SNR for Ts=T, Ts=(10/9)T,
Ts=(4/3)T, Ts=(4/5)T and Ts=(1/2)T.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
152

For the synchronous case (Ts=T), the equalizer properly converges to the steady state, with
a symmetric impulse response. The error power in the steady state is roughly –17 dB, as
compared to –11 dB without ITR loop. Similar observations are obtained for the heavily and
lightly undersampled cases. The equalizer impulse responses for these cases converge to be
symmetric with good linear phase characteristics. No visible interaction was observed between
the adaptation and ITR loops. This benefit could be due to the low cut-off frequencies of the
DVR channel and the selected target, which are well below the Nyquist frequency, as well as to
the symmetric features of the channel and target responses. Consequently, the equalizer has
linear phase characteristics and a low-pass nature with a high attenuation at the Nyquist
frequency.
We also observed that the equalizer responses are similar to those obtained in the absence
of noise, which roughly have the same in-band flatness and at least 8 dB out-of-band
attenuation. Their band edges are located around or below the DVR channel cut-off frequency
0.33. These characteristics effectively prevent noise aliasing for SRC. The in-band distortion,
however, leads to some signal loss and thus causes errors at the sample-rate converter output.
For this reason, reconstructed samples at the sample-rate converter output can induce decision
errors. In general, for synchronous and undersampled conditions, the adaptive equalizer
behaves properly. Similar observations and conclusions apply to the ITR loop. Also, no
apparent interaction between the adaptation and ITR loops was observed for these situations.
The results for both lightly and heavily oversampled situations are quite different from
those in the absence of noise. The converged equalizer coefficients are no longer symmetrically
centered, indicating a phase distortion. The response has a flat in-band response but the
bandwidth extends far beyond that of the channel. The band edge, in fact, has moved towards
the Nyquist frequency. This is particularly true for the heavily oversampled case, where the
expected low-pass equalizer now becomes almost an all-pass equalizer. As a result, much of the
noise passes through the equalizer, and the sample-rate converter output exhibits severe noise
aliasing. In this situation, the adaptive equalizer is likely to diverge. This can be prevented by
using a smaller adaptation step-size, which was confirmed by simulation. Excessive gradient
noise could be the reason for this, because the equalizer can converge very well to the steady
state in the absence noise at this oversampling rate. In addition, we observe an interaction
between the equalizer adaptation and ITR loops. This is because the inadequately converged
equalizer coefficients result in large errors at the sample-rate converter output, yielding big
TED and NCO output errors, which in turn degrade the SRC performance and produce even
bigger errors at the sample-rate converter output. The resulting errors at the sample-rate
converter output cause gradient computation inaccuracy for the adaptation loop, and result in
erroneous equalizer coefficients. Consequently, these errors increase further and cause the
equalizer adaptation to diverge.

7.8 Summary and Further Discussions


In this chapter, we studied an asynchronously sampled DVR system with asynchronous
equalizer adaptation and fully digital interpolative timing recovery. We modeled the DVR
system, developed adaptation algorithms, performed analysis, and simulated the system under
various conditions. We proposed a new structure for the asynchronously sampled DVR system,
in which a simple latch replaces the spatio-temporal interpolator for equalizer adaptation and a
fully digital ITR replaces the conventional VCO-based timing recovery. The simulations show
that the proposed latch-based equalizer adaptation structure with the ITR loop works well for a
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
153

considerable range of sampling rates. In conclusion, the structure is simple, cost-effective and
thus promising for DVR.
So far, two problems have hindered an efficient application of asynchronous adaptive
equalizers. The first problem is the cost and complexity of real-time, multi-tap equalizer
adaptation. The second is the interaction between the adaptation process and other control loops
such as the AGC and timing recovery loops. To deal with the first problem, we proposed a
zero-forcing adaptation algorithm and investigated a simple and effective latch-based structure
for asynchronous adaptation. This greatly simplifies the equalizer adaptation relative to the one
based on the LMS algorithm. For timing recovery, we exploited the SRC technique and
developed a fully digital interpolative timing recovery, which makes the timing loop operate at
low cost and high stability. For the second problem, interaction, we notice that AGC, equalizer
adaptation and timing recovery constitute three nested control loops. An adjustment in one loop
can interact with the operation of the other loops. Since we focused on asynchronous adaptation
in this chapter, we did not investigate the interaction issue further. Also, in our study we
assumed that the AGC and detector decisions do not suffer from errors.
The interaction issue can be a topic for future research. Given the strong potential for
interactions between the equalizer adaptation, AGC and timing loops, a constrained adaptation
algorithm is needed to reduce or eliminate interaction. This issue is explored, for example, in
[4], [5], [26], [28], [47], [48] and [49].
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
154

References:
[1] T. Narahara, S. Kobayashi, M. Hattori, Y. Shimpuku, G. van den Enden, J. Kahlman, M.
van Dijk, and R. van Woudenberg, “Optical disc system for digital video recording,” Jpn.
J. Appl. Phys., pt. 1, vol. 39, no. 2B, pp. 912-919, 2000.
[2] W. Coene, H. Pozidis, M. van Dijk, J. Kahlman, R. van Woudenberg, and B. Stek,
“Channel coding and signal processing for optical recording system beyond DVD,” IEEE
Trans. Magn., vol. 37, no. 2, pp. 682-688, March 2001.
[3] E. F. Stikvoort and J. A. C. Van Rens, “An all-digital bit detector for compact disc players,”
IEEE J. Selected Areas Commun., vol. 10, pp. 191-200, Jan. 1992.
[4] R. T. Behrens, W. G. Bliss, D. Li, M. S. Spurbeck, G. S. Feyh, and T. O. Dudley, “Gain and
phase constrained adaptive equalizing filter in a sampled amplitude read channel for
magnetic recording,” US Patent 5,999,355, Dec. 1999.
[5] M. S. Spurbeck, R. T. Behrens, S. Feyh, and T. Boulder, “Sampled amplitude read channel
employing interpolated timing recovery,” US patent 5,696,639, Dec. 1997.
[6] G. G. Vishakhadatta, et al, “An EPR4 read/write channel with digital timing recovery,”
IEEE J. Solid-State Circuits, vol. 33, no. 11, pp. 1851-1857, Nov. 1997.
[7] V. Tuukkanen, J. Vesma, and M. Renfors, “Combined interpolation and maximum
likelihood symbol timing recovery in digital receivers,” in Digests IEEE 6th Intl. Conf.
Univ. Pers. Commun., Sept. 1997, pp. 698-702.
[8] D. Verdin and T. C. Tozer, “Interpolator filter structure for asynchronous timing recovery
loops,” Electronic Letters, vol. 29, no. 5, pp. 490-492, March 1993.
[9] D. Verdin and T. C. Tozer, “Hangup in asynchronous timing recovery loops,” Electronics
Letters, vol. 29, no. 22, pp. 1914-1915, Oct. 1993.
[10] K. Bucket and M. Moeneclaey, “Tracking performance analysis of feedback timing
synchronizers operating on interpolated signals,” in Proc. Intl. Conf. Global Telecommun.
(GLOBECOM), 1996, pp. 67-71.
[11] K. Bucket and M. Moeneclaey, “Periodic timing error components in feedback
synchronizers operating on nonsynchronized signal samples,” IEEE Trans. Commun., vol.
46, no. 6, June 1997.
[12] D. Fu and A. N. Willson, “Interpolation in timing recovery using a trigonometric
polynomial and its implementation,” in Proc. Intl. Conf. Global Telecommun.
(GLOBECOM), Nov. 1998, pp. 173-177.
[13] J. Armstrong and D. Strickland, “Symbol synchronization using signal samples and
interpolation,” IEEE Trans. Commun., vol. 42, no. 2, pp. 318-321, Febr. 1993.
[14] T. I. Laakso, V. Väimäki, M. Karjalainen, and U. K. Laine, “Splitting the unit delay,”
IEEE Signal Processing Magazine, vol. 13, no. 1, pp. 30-60, Jan. 1996.
[15] P. J. Kootsookos and R. C. Williamson, “FIR approximation of fractional sample delay
systems,” IEEE Trans. Circuits and Systems –II: Analog and Digital Signal Processing,
vol. 43, no. 3, pp. 269-271, March 1996.
[16] C. W. Farrow, “A continuously variable digital delay element,” in Proc. IEEE Intl. Symp.
Circuits and Systems(ISCAS), June 1988, pp. 2641-2645.
[17] L. Erup, F. M. Gardner, and R. A. Harris, “Interpolation in digital modems – Part II:
Implemementation and performance,” IEEE Trans. Commun., vol. 41, pp. 998-1008, June
1993.
[18] D. Fu and A. N. Willson, “Design of an improved interpolation filter using a trigonometric
polynomial,” in Proc. IEEE Intl. Symp. Circuits and Systems(ISCAS), June 1999, vol. 4,
pp. 363-366.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
155

[19] J. Vesma and T. Saramäki, “Interpolation filters with arbitrary frequency response for all-
digital receivers,” in Proc. IEEE Intl. Symp. Circuits and Systems(ISCAS),Atlanta, USA,
May 1996, pp. 568-571.
[20] R. W. Schafer and L. R. Rabiner, “A digital signal processing approach to interpolation,”
Proc. IEEE, vol. 61, no. 2, pp. 692-702, June 1973.
[21] G. Oetken, T. W. Parks, and H. W. Schüssler, “New results in the design of digital
interpolators,” IEEE Trans. Acoust., Speech, Signal Processing, vol. 23, no. 1, pp. 301-
309, June 1975.
[22] T. Hentschel and G. Fettweis, “Sample rate conversion for software radio,” IEEE
Commun. Magazine, pp. 142-150, Aug. 2000.
[23] D. Kim, M. J. Narasimha, and D. C. Cox, “Design of optimal interpolation filter for
symbol timing recovery,” IEEE Trans. Commun., vol. 45, no. 7, pp. 877-884, July 1997.
[24] T. Oening and J. Moon, “Digital detection with asynchronous sampling using amplitude
error prediction,” IEEE Trans. Magn., vol. 34, no. 4, pp. 1931-1933, July 1997.
[25] C. M. Melas, and P. Sutardja, “An asynchronous fully digital channel for magnetic
recording,” in Proc. IEEE Intl. Conf. Global Telecommun. (GLOBECOM), Piscataway,
NJ, 1994, pp. 1144-1147.
[26] M. Spurbeck and R. T. Behrens, “Interpolated timing recovery for hard disk drive read
channels,” in Proc. IEEE Intl. Conf. Commun. (ICC), 1997, pp. 1618-1624.
[27] Z. N. Wu, J. M. Cioffi, and K. D. Fisher, “A MMSE interpolated timing recovery scheme
for the magnetic recording channel,” in Proc. IEEE Intl. Conf. Commun. (ICC), 1997, pp.
1625-1629.
[28] D. Li, M. Spurbeck, and R. T. Behrens, “A linearly constrained adaptive FIR filter for hard
disk drive read channel,” in Proc. IEEE Intl. Conf. Commun. (ICC), 1997, pp. 1613-1617.
[29] F. M. Gardner, “Interpolation in digital modems-Part I: Fundamentals,” IEEE Trans.
Commun., vol. 41, no. 3, pp. 501-507, March 1993.
[30] K. Bucket and M. Moeneclaey, “Symbol synchronizer performance affected by non-ideal
interpolation in digital modems,” in Proc. IEEE Intl. Conf. Commun. (ICC), 1994, pp.
929-933.
[31] K. Kim, M. J. Marasimha, and D. C. Cox, “Unbiased timing-error estimation in the
presence of nonideal interpolation,” IEEE Trans. Commun., vol. 45, no. 6, pp. 647-650,
June 1997.
[32] E. A. Lee and D. G. Messerschmitt, Digital communications. Boston: Kluwer Academic
Publisher, 1987.
[33] W. L. Abbott, J. M. Cioffi, and H. K. Thapar, "Channel equalization methods for magnetic
storage," in Proc. IEEE Intl. Conf. Commun. (ICC), Boston, MA, June 1989, pp. .
[34] S. U. H. Qureshi., "Adaptive equalization," Proc. IEEE, vol. 73, no. 9, pp. 1349-1387,
Sept. 1985.
[35] W. L. Abbott, P. Valley, H. C. Nguyen and K. E. Johnson, “Disk drive using PRML class
IV sampling data detection with digital adaptive equalization,” US Patent 5,341,249, Aug.
1994.
[36] R. W. Lucky, “Automatic equalization for digital communications,” Bell Syst. Tech. J. vol.
44, pp. 547-588, 1965.
[37] R. E. Crochiere and L. R. Rabiner, Multirate digital signal processing. New York:
Prentice Hall, 1983.
[38] M. Abramowitz and I. A. Stegun, Handbook of mathematical functions with formulas,
graphs, and mathematical tables. pp. 878-879, 9th edition, Dover Publications, Inc. New
York, 1970.
CHAPTER 7: Asynchronous Equalizer Adaptation and Interpolative Timing Recovery
for a Digital Optical Recording System
156

[39] K. H. Mueller and M. Müller, “Timing recovery in digital synchronous data receivers,”
IEEE Trans. Commun., vol. 14, pp.516-531, May 1976.
[40] S. U. H. Qureshi, “Timing recovery for equalized partial-response systems,” IEEE Trans.
Commun., vol. 24, no. 12, pp. 1326-1330, Dec. 1976.
[41] J. Armstrong, “Symbol synchronization using baud-rate sampling and data-sequence-
dependent signal processing,” IEEE Trans. Commun., vol. 39, no. 1, pp. 127-132, Jan.
1991.
[42] R. D. Cideciyan, F. Dolivo, R. Hermann, W. Hirt, and W. Schott, “A PRML system for
digital magnetic recording,” IEEE J. Selected Areas Commun., vol. 10, no. 1, pp. 38-56,
Jan. 1992.
[43] T. Aboulnasr, M. Hage, B. Sayar, and S. Aly, “Characterization of a symbol rate timing
recovery technique for a 2B1Q digital receiver,” IEEE Trans. Commun., vol. 42, no. 5,
pp. 1409-1415, Febr./March/April 1994.
[44] J. W. M. Bergmans, “Efficiency of data-aided timing recording techniques,” IEEE Trans.
Inform. Theory, vol. 41, no. 5, pp. 1397-1408, Sept. 1995.
[45] J. W. M. Bergmans and H.W. Lam, “A class of data-aided timing recovery schemes,”
IEEE Trans. Commun., vol. 43, no. 2/3/4, Febr./March/April 1995.
[46] J. W. M. Bergmans, Digital Baseband Transmission and Recording. Boston: Kluwer
Academic Publishers, 1996.
[47] J. O. Voorman et al., “A one-chip automatic equalizer for echo reduction in teletext,”
IEEE Trans. Consum. Electron., vol. 27, no. 2, pp. 512-529, July 1981.
[48] G. Ungerboeck, “Fractional tap spacing and consequences for timing recovery in data
modems,” IEEE Trans. Commun., vol. 24, no. 3, pp. 856-864, Aug. 1976.
[49] R. D. Gitlin and H. C. Meadors, Jr., “Center-tap tracking algorithms for timing recovery,”
AT&T Technical Journal, vol. 66, pp. 63-78, Nov./Dec. 1987.
CHAPTER 8: Summary and Conclusions
157

CHAPTER 8
SUMMARY AND CONCLUSIONS

The objectives of this work are the study and development of timing recovery techniques
for digital recording systems. Partial-response and decision feedback equalized magnetic
recording systems and asynchronously sampled digital video recording systems are considered.
We study issues related to the tracking and acquisition performance, develop practical timing
recovery techniques, and carry out numerical analysis and evaluation simulations for various
recording densities. This chapter summarizes the thesis work and notable conclusions, and
provides suggestions for future research.
Chapter 2 reviews basics of timing-recovery techniques. We discuss basic structures,
requirements, performance measures, and the existing approaches. We compare inductive and
deductive timing recovery structures, and explore the effects of RLL modulation codes on
timing recovery. We also study measures of timing sensitivity, timing-error detector (TED)
efficiency, and tracking and acquisition performances. We particularly investigate the
maximum-likelihood (ML), minimum mean-square error (MMSE) and zero-forcing (ZF)
timing recovery techniques and analyze their tracking performances for magnetic recording
systems. We study the acquisition problems, investigate the issues of hang up and false lock
during timing acquisition, and explore solutions. In addition, we review practical decision-
directed (DD) and non-decision-directed (NDD) TED schemes for timing acquisition and study
several acquisition issues such as phase acquisition bound and acquisition speed measures.
Chapter 3 studies the timing sensitivity of multi-level decision feedback equalization
(MDFE) and partial-response class-IV with Viterbi detector (PR4-VD) for magnetic recording.
We develop an efficient approach for evaluating the error rate performance based on the
estimated probability density function of residual intersymbol interference (ISI) due to
misequalization and timing phase errors. This approach can cover the performance
consequences of both static phase error and random timing jitter. The numerical results have
been shown to closely match those obtained from bit-by-bit simulations. The principle of this
approach can be easily extended to evaluate the timing sensitivity of advanced versions of
MDFE and PR4-VD detectors.
Chapter 4 performs TED efficiency analysis of timing recovery for partial response (PR)
and decision feedback equalization (DFE) receivers in magnetic recording. We analyze several
TED schemes and propose error-based ZF TED schemes. The error based TEDs are not only
simple to implement but also exhibit equivalent or even better efficiencies compared to the
existing non-error based TEDs. We also analyze the TED noise performance and provide a
CHAPTER 8: Summary and Conclusions
158

marginal-detection based TED algorithm for jitter minimization. Simulation results show that
this algorithm can reduce jitter variance, especially for low SNRs. In addition, we study
optimality issues for timing acquisition. The investigation shows that the MDFE TED with 6T-
pattern preamble is near optimum in terms of efficiency for timing acquisition over a range of
recording densities of interest.
Chapter 5 focuses on hang-up and false-lock acquisition problems, and develops two
effective solutions to these problems for DFE receivers in recording systems. We demonstrate
that these solutions can facilitate a fast and reliable acquisition process. One solution uses
special equalizer coefficients and a control procedure with switches. This technique has been
shown to be effective for solving the false lock problem. On the other hand, the second
solution, which uses a modified threshold scheme, prevents false lock and hang up in a more
general manner. Simulations have shown that this solution can tolerate large initial errors in
gain, DC offset and threshold scalars.
In Chapter 6, we present an implementation of timing recovery for a 100Mb/s experimental
MDFE read channel prototype. We specifically detail the TED realization, charge-pump
control, lock-to-preamble detection, and sync-byte detection circuits. We evaluate the prototype
on both bench and spinstand, and measure the performance of the timing recovery system. The
timing recovery board works well at high data rate to support a robust MDFE prototype. The
measurement results show that the circuit performance is close to the expected result. The
implementation is simple and reliable, and can be easily realized by an integrated solution.
Chapter 7 investigates a new type of receiver for digital video-recording (DVR) channels. It
proposes a new asynchronous equalizer adaptation structure with fully digital interpolative
timing recovery (ITR) for DVR. The investigation shows that this new structure is promising
for DVR in optical recording applications. The structure is attractive for its low-cost high-
stability timing-recovery implementation and low-sample-rate adaptive equalizer. We study
and simulate the DVR system with this new structure for a wide range of sampling rates. The
results show that the proposed structure is robust enough even in the worst-case SNR, and that
performance is satisfactory.

8.1 Further Work


By the end of this dissertation work, we have developed an experimental MDFE read
channel prototype and its extended version, called M2DFE prototype. Both systems use the
timing recovery system that is developed and analyzed in this thesis. The current TED
algorithm is active at transitions. The transition conditions are checked only by the two most
recent decisions in the feedback register. Wrong decisions may mislead the TED and degrade
the timing recovery loop performance. One approach to improving the TED is to make use of
reliable decisions and to check for code-rule violations. This approach seems worth pursuing
for the M2DFE read channel, where more reliable decisions are available within the structure.
Noise performance of timing recovery loops is always a concern, especially for high
density and low SNR recording channels. For this reason, a good recording channel model
should include media noise in addition to electronic additive white Gaussian noise (AWGN).
For simplicity, we do not consider media noise effects. Studying timing recovery loop behavior
in the presence of media noise is a good topic of further work.
CHAPTER 8: Summary and Conclusions
159

Future work might also consider timing recovery for recording channels using turbo
coding, where good BER performance is obtainable at poor SNR. In this situation, the effect of
noise and distortion becomes even more critical, and robust techniques for timing recovery are
of great interest.
Another topic of future work might be the investigation of the interaction between the
equalizer adaptation, timing recovery and automatic gain control (AGC) loops. As the gradients
for controlling these three loops are normally derived from the same error signal that is
obtained from the detector input, parameter variations of one loop will inevitably affect
parameters of the other loops. To make the three loops operate well and converge within an
expected period of time, interaction effects among these loops have to be investigated and
effective approaches have to be developed to mitigate these effects. To reach this aim, new
adaptation and timing recovery schemes may be needed.
CHAPTER 8: Summary and Conclusions
160
161

Samenvatting
Een kritisch deel van systemen voor digitale gegevensopslag is het zogenaamde lees-
schrijfkanaal, dat bestaat uit de elektronische circuits die nodig zijn voor het schrijven van
digitale gebruikersgegevens op het opslagmedium en voor het betrouwbaar teruglezen van de
geschreven gegevens. Een goed ontworpen lees-schrijfkanaal kan zowel de opslagcapaciteit als
de overdrachtssnelheid van het opslagsysteem vergroten. Egalisatie, bit-detectie en
klokterugwinning behoren tot de belangrijkste functies van een lees-schrijfkanaal.

De eerstgenoemde twee functies mogen zich verheugen in een ruime belangstelling van
onderzoekers. Naarmate opslagsystemen efficiënter worden in termen van bijvoorbeeld
modulatie-code, bandbreedte en informatiedichtheid, wordt de taak van het
klokterugwinningssysteem moeilijker en tegelijkertijd kritischer voor betrouwbare bit-detectie.
Het klokterugwinningssysteem dient om optimale beslissingstijdstippen voor de bit-detector te
demarceren. Het onderzoek dat beschreven wordt in dit proefschrift behelst aspecten van
klokterugwinning voor lees-schrijfkanalen.

Dit proefschrift is gewijd aan de studie en ontwikkeling van klokterugwinningstechnieken


voor digitale gegevensopslagsystemen. Het bestudeert de structuur en prestaties van
klokterugwinningstechnieken, en ontwikkelt nieuwe klokterugwinningstechnieken voor
magnetische en optische gegevensopslagsystemen. Het proefschrift omvat acht hoofdstukken.
Hoofdstuk 1 geeft een korte inleiding op het gebied van digitale gegevensopslagtechnieken en
lees-schrijfkanalen. Dit hoofdstuk beschrijft verder de motivatie voor het onderhavige werk,
alsook de belangrijkste bijdragen en opbouw van het proefschrift. Hoofdstuk 2 verschaft een
gedetailleerd overzicht van klokterugwinning, en in het bijzonder van de diverse mogelijke
structuren, eisen, en prestatie-maatstaven. Dit hoofdstuk beschrijft ook generieke algoritmen
voor tijdfoutschatting en de belangrijkste aspecten van tijdbasisacquisitie. Waar mogelijk
worden de discussies ondersteund door analyses en simulatie-resultaten. Hoofdstukken 3 tot en
met 7 beschrijven de belangrijkste nieuwe onderzoeksbijdragen van het proefschrift. Hoofdstuk
8 beëindigt het proefschrift met enkele aanbevelingen voor toekomstig werk.

Hoofdstuk 3 onderzoekt de gevoeligheid voor tijdbasisfouten van twee soorten bit-


detectoren, te weten de ‘partial-response Viterbi detector’ en de beslissingsteruggekoppelde
egalisator. Tevens wordt een analytische benadering ontwikkeld voor de invloed van statische
en stochastische tijdbasisfouten op de prestatie van deze detectoren. Hoofdstuk 4 bestudeert de
efficiëntie van bestaande en nieuw voorgestelde algoritmen voor schatting van tijdbasisfouten
in systemen met ‘partial-response’ detectoren en beslissingsteruggekoppelde egalisatoren.
Tevens ontwikkelt en analyseert dit hoofdstuk een ‘marginaal-detectie’ algoritme voor
schatting van tijdbasisfouten in een meer-niveau beslissingsteruggekoppelde egalisator. Verder
onderzoekt het hoofdstuk de acquisitie-prestaties van bij deze egalisator bruikbare data-
preambules en schattings-algoritmen. Hoofdstuk 5 bestudeert het probleem van tijdbasis-
acquisitie in beslissingsteruggekoppelde egalisatoren. Het hoofdstuk ontwikkelt een tweetal
nieuwe technieken voor snelle acquisitie die vrij zijn van ‘hang-up’ en ‘false-lock’ problemen,
zelfs bij grote initiele fouten van tijdbasis, versterking en DC niveau. Hoofdstuk 6 beschrijft het
ontwerp en de implementatie van een praktisch klokterugwinningssysteem, opgebouwd met
discrete ECL (emitter-coupled logic) componenten, voor een 100 Mb/s experimenteel lees-
162

schrijfkanaal met meer-niveau beslissingsteruggekoppelde egalisator. De prestatie van dit


systeem wordt geëvalueerd voor zowel synthetische weergave-golfvormen als voor een
experimenteel magnetisch gegevensopslag-systeem.

De nadruk bij de ontwikkeling van de algoritmen en systemen in Hoofdstukken 3 t/m 6 ligt


op magnetische gegevensopslag, alhoewel deze algoritmen en systemen met geringe
wijzigingen ook bruikbaar zijn voor optische gegevensopslag. Daarentegen ligt de nadruk in
Hoofdstuk 7 op optische gegevensopslag. In dit hoofdstuk wordt een nieuwe en aantrekkelijke
architectuur ontwikkeld voor volledig digitale ‘zero-forcing’ egalisator-adaptatie in combinatie
met interpolatieve klokterugwinning. De ontwikkeling van de algoritmen en systemen in
Hoofdstukken 3 t/m 7 wordt ondersteund door uitvoerige computersimulaties. De
simulatieresultaten dienen ter illustratie van de effectiviteit van de voorgestelde algoritmen, en
ter bevestiging van de analytische resultaten.
163

Publications by the author

1. J. J. Wang and G. Mathew, “Timing sensitivity of decision feedback and partial


response detectors in magnetic recording,” in Proc. IEEE Intl. Conf. Global
Commun. (GLOBECOM), San Francisco, CA, USA, Dec. 2000, pp. 1872-1876.

2. J. J. Wang, G. Mathew, and V. Krachkovsky, “Bias and jitter minimization


techniques for timing recovery in magnetic recording,” in Digests 8th Joint MMM-
IEEE Intl. Conf. Magnetics (MMM-INTERMAG), San Antonio, Texas, USA, Dec.
2000, p. EQ-10.

3. J. J. Wang, J. W. M. Bergmans, Y. X. Lee, and G. Mathew, “DFE timing


acquisition: Analysis and a new approach for fast acquisition,” IEEE Trans. Magn.
vol. 36, no. 5, pp. 2193-2196, Sept. 2000.

4. G. Mathew, Y. X. Lee, B. Farhang-Boroujeny, H. Mutoh, and J. J. Wang, “A


Novel interpolation approach for reducing clock-rate in multilevel decision
feedback equalization detectors,” IEEE Trans. Magn. vol. 36, no. 5, pp. 3866-3878,
Sept. 2000.

5. J. J. Wang, J. W. M. Bergmans, Y. X. Lee, and G. Mathew, “DFE timing


acquisition: Analysis and a new approach for fast acquisition,” Digests IEEE Intl.
Conf. Magnetics (INTERMAG), Toronto, Apr. 2000, p. GD-10.

6. Y. X. Lee, Q. W. Jia, J. J. Wang, Q. Li, L. Bi, H. Ueno, and H. Mutoh, “An


experimental M2DFE detector,” IEEE Trans. Magn. vol. 35, no. 5, pp. 2292-2294,
Sept. 1999.

7. Y. X. Lee, Q.W. Jia, J. J. Wang, Q. Li, L. Bi, H. Ueno, and H. Mutoh, “An
Experimental M2DFE Detector,” in Digests IEEE Intl. Conf. Magnetics
(INTERMAG), Korea, May 1999, p. FS-05.
164

8. J. J. Wang and T. C. Chong, “Noise analysis of timing recovery loop in MDFE


recording channel,” in Proc. Asian Symp. Inform. Storage Technology (ASIST),
Singapore, Febr. 1999, pp. 97-103.

9. Y. X. Lee, G. Mathew, Q. C. Sun, J. J. Wang, H. Mutoh, J. Hong, and R. W.


Wood, “Design, implementation and performance evaluation of an MDFE Read
channel,” IEEE Trans. Magn., vol. 34, no. 1, pp. 166-171, Jan. 1998.

10. Y. X. Lee, L. K. Ong, J. J. Wang, and R. W. Wood, “Timing acquisition for DFE
detection,” IEEE Trans. Magn. vol. 33, no. 5, pp. 2761-2763, Sept. 1997.

11. J. Hong, Y. X. Lee, H. Mutoh, Q. C. Sun, H. Ueno, J. J. Wang, and R. W. Wood,


“An experimental MDFE detector,” IEEE Trans. Magn., vol. 33, no. 5, pp. 2776-
2778, Sept. 1997.

12. Y. X. Lee, L. K. Ong, J. J. Wang, and R. W. Wood, “acquisition for DFE


Detection,” in Digests IEEE Intl. Conf. Magnetics (INTERMAG), New Orleans,
USA, Apr. 1997, p. BS-17.

13. J. Hong, Y. X. Lee, H. Mutoh, Q. C. Sun, H. Ueno, J. J. Wang, and R. W. Wood,


“An experimental MDFE detector,” in Digests IEEE Intl. Conf. Magnetics
(INTERMAG), New Orleans, USA, Apr. 1997, p. CR-05.
165

Curriculum Vitae

Jianjiang Wang was born in Korla, China, on October 23, 1965. In 1983, he entered the
Tsinghua University, Beijing, the most renowned university in China, to major in
Communications in the Department of Electronic Engineering. He received the B.Sc and M.Sc
degrees in Electrical Engineering from Tsinghua University in 1988 and 1993, respectively.
From 1988 to 1990, he was an electronic engineer with the Industrial Applied Television
Institute, Changzhou, China, and from 1993 to 1995, he was a researcher with the Tsinghua
University, Beijing, in the Advanced Communications Group of the Department of Electronic
Engineering. From 1995 to 2001, he worked in the Coding and Signal Processing Department
of the Data Storage Institute (DSI) in Singapore. In 2001, he joined the Maxtor Corporation,
Milpitas, California, USA, in the Advanced Channels and ASICs Department.

During his association with DSI, he worked on a series of research projects on multilevel
decision feedback equalization (MDFE) technology. These projects were sponsored by two
international consortia (MDFE and M3DFE consortia) whose members included the disk drive
companies IBM San Jose, USA, Fujitsu Japan, Hitachi Japan, Texas Instruments, USA,
Tektronix, USA and DSI, Singapore. As part of his industrial research at DSI and his academic
research towards a Ph.D., he made several contributions to the development of MDFE
technology. In view of the significance of his contributions, in particular of the hardware
prototyping and demonstration of MDFE and M2DFE read channels, he was awarded along
with two other colleagues the Distinguished Researcher Award of the Year 1998 by DSI. This
is the highest annual award in DSI for recognizing research achievements. Following this, he
and his colleagues received the Outstanding Researcher Award by the National University of
Singapore (NUS) in 1999.

Since 1996, he also pursued a Ph.D. degree in the Department of Electrical and Computer
Engineering of NUS, and subsequently was selected to be part of the joint Ph.D. program
between NUS and Eindhoven University of Technology (TU/e), The Netherlands. From
September to December in 2000, he carried out his Ph.D. research in the Signal Processing
Systems group in the Department of Electrical Engineering of TU/e. During this period, he also
was a guest researcher in the Storage Signal Processing Group of Philips Research
Laboratories, Eindhoven, The Netherlands.
166

You might also like