

GBLD10+: a compact low-power 10 Gb/s VCSEL driver

This content has been downloaded from IOPscience. Please scroll down to see the full text. 2016 JINST 11 C01015 (http://iopscience.iop.org/1748-0221/11/01/C01015)

View the table of contents for this issue, or go to the journal homepage for more

Download details:

IP Address: 131.169.4.70 This content was downloaded on 15/01/2016 at 22:20

Please note that terms and conditions apply.

PUBLISHED BY IOP PUBLISHING FOR SISSA MEDIALAB



RECEIVED: October 30, 2015 ACCEPTED: November 25, 2015 PUBLISHED: January 13, 2016

TOPICAL WORKSHOP ON ELECTRONICS FOR PARTICLE PHYSICS 2015, SEPTEMBER 28<sup>TH</sup> – OCTOBER 2<sup>ND</sup>, 2015 LISBON, PORTUGAL

# GBLD10+: a compact low-power 10 Gb/s VCSEL driver

T. Zhang,  $a^{a,1}$  S. Kulis, <sup>c</sup> P. Gui, <sup>a</sup> F. Tavernier<sup>b</sup> and P. Moreira<sup>c</sup>

<sup>a</sup>Southern Methodist University,

Dallas, Texas 75275, U.S.A. <sup>b</sup>KU Leuven ESAT-MICAS

Kasteelpark Arenberg 10, 3001 Leuven, Belgium <sup>c</sup> CERN, 1211 Geneva 23, Switzerland

*E-mail:* taoz@smu.edu

ABSTRACT: We report the design and implementation of the GBLD10+, a low-power 10 Gb/s VCSEL driver for High Energy Physics (HEP) applications. With new circuit techniques, the driver consumes only 31 mW and occupies a small area of 400  $\mu$ m × 1750  $\mu$ m including the IO PADs and sealrings. These characteristics allow for multiple GBLD10+ ICs to be assembled side by side in a compact module, with each one directly wire bonded to one VCSEL diode. This makes the GBLD10+ a suitable candidate for the Versatile Link PLUS (VL<sup>+</sup>) project, offering flexibility in configuring multiple transmitters and receivers.

KEYWORDS: VLSI circuits; Analogue electronic circuits; Optical detector readout concepts



© CERN 2016, published under the terms of the Creative Commons Attribution 3.0 License by IOP Publishing Ltd and Sissa Medialab srl. Any further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation and DOI.

<sup>&</sup>lt;sup>1</sup>Corresponding author.

# Contents

| I | Intr | oductio          | on and the second se | 1 |  |  |  |
|---|------|------------------|-----------------------------------------------------------------------------------------------------------------|---|--|--|--|
| 2 | Desi | esign of GBLD10+ |                                                                                                                 |   |  |  |  |
|   | 2.1  | GBLD             | D10+ architecture                                                                                               | 2 |  |  |  |
|   | 2.2  | Limiti           | 3                                                                                                               |   |  |  |  |
|   | 2.3  | Outpu            | 4                                                                                                               |   |  |  |  |
|   |      | 2.3.1            | Output driver with feedforward compensation                                                                     | 4 |  |  |  |
|   |      | 2.3.2            | Configurable rising/falling-edge pre-emphasis                                                                   | 5 |  |  |  |
|   | 2.4  | GBLD             | D10+ implementation                                                                                             | 6 |  |  |  |
| • |      | •                |                                                                                                                 |   |  |  |  |
| 3 | Test | results          |                                                                                                                 | 6 |  |  |  |

4 Conclusion

#### 1 Introduction

The Versatile Link (VL) project was a joint ATLAS-CMS effort aiming at developing a radiation hard bi-directional optical link for use in the phase-ILHC upgrade program [1]. As the main building block on the transmission side, the GigaBit Laser Driver (GBLD) was developed to operate at a data rate of 5 Gb/s with a power consumption around 300 mW [2]. The Versatile Link PLUS project (VL<sup>+</sup>) targets the phase II upgrades of the ATLAS and CMS experiments [3]. With the high-luminosity operation of LHC in phase II, data transmission requires very high data rates and low power dissipation. VL<sup>+</sup> has to support data rates up to 10 Gb/s in the upstream direction. As an initial step towards this target, we have designed and demonstrated the first 10 Gb/s radiation-hard laser driver (GBLD10) in 130-nm CMOS with a power consumption of 85 mW [4]. Similar to the GBLD, the GBLD10 is packaged in a 4 mm × 4 mm QFN24 package and is used to drive a Transmit Optical Sub-Assembly (TOSA) in an Enhanced Small Form-factor Pluggable (SFP+) module.

Targeting at the phase II upgrade, the GBLD10+ is an enhanced Vertical Cavity Surface Emitting Laser (VCSEL) driver running at 10 Gb/s. Compared to the GBLD and GBLD10 [2, 4], the GBLD10+ features an ultra-low power consumption around of 31 mW. It also has a compact size of  $1750 \,\mu\text{m} \times 400 \,\mu\text{m}$ . The common-cathode output driver topology with on chip VCSEL bias implemented in the GBLD10+ saves assembly costs and simplifies the PCB design by eliminating external components on the PCB. Besides the compact size, the GBLD10+ also has a symmetrical PAD structure for digital and power IOs, which provides the users with the flexibility to choose the channel count in a customized form-factor die-to-die bonding module layout. As an example, figure 1 shows a transceiver module, which can run at 30 Gb/s upstream and 2.5 Gb/s downstream. It can be easily implemented by three GBLD10+ and one GigaBit Trans-Impedance Amplifier (GBTIA+) running at 2.5 Gb/s. The GBLD10+ has 40 PADs in total, 3 of which are high-speed

8



Figure 1. Transceiver module with three GBLD10+ and one GBTIA+.



Figure 2. GBLD10+ block diagram.

PADs (DIN+, DIN- and OUT) with a size of 70  $\mu$ m × 57  $\mu$ m while the rest are standard PADs with a size of 120  $\mu$ m × 75  $\mu$ m. The size of the PADs greatly limits the available active area for the core circuit, which leads to the challenges to reach a speed of 10 Gb/s, as explained later in the paper.

## 2 Design of GBLD10+

## 2.1 GBLD10+ architecture

The block diagram of the GBLD10+ is shown in figure 2. The core circuits include the limiting amplifier and the output driver. Two 7-bit Digital-to-Analog Converters (DACs) are designed to program both the biasing and modulation amplitude, with a resolution of 0.16 mA. The GBLD10+ also features a configurable pre-emphasis function with the emphasis amplitude controlled by a



**Figure 3.** Limiting amplifier topologies: (a) traditional inductor peaking topology; (b) shared-inductor peaking topology.

3-bit DAC. A reference generator generates the process and temperature independent current for the DACs. All the DACs are controlled through the I<sup>2</sup>C interface. A Power On Reset (POR) circuit and the I<sup>2</sup>C circuit are used to set the default DAC values after the chip power up. The I<sup>2</sup>C and POR circuits are triplicated to make the chip immune to Single Event Upsets (SEUs). Different from the GBLD and GBLD10 that are powered from single 2.5 V supplies, the GBLD10+ uses dual supply voltages — 1.2 V and 2.5 V, for power-saving considerations. The output driver has to be powered by 2.5 V because of the large threshold voltage of the VCSEL. All the other blocks, including the limiting amplifier and DAC, are powered by the 1.2 V supply.

## 2.2 Limiting amplifier

A limiting amplifier is designed to provide sufficient input swing (> 800 mV) for the output driver. There are two reasons to introduce the limiting amplifier before the output driver: first one is to increase the power efficiency of the output driver by completely switching in and out its tail current as the VCSEL modulation current; the second reason is to guarantee that the modulation current is independent of the input voltage swing which comes from the serializer and has a range of 200 mV-1.2 V. Therefore, users can program the modulation depth linearly through the DAC without the need to consider the effects of the input swing. To accommodate the minimum swing of 200 mV, a limiting amplifier with a minimum gain of 12 dB is needed to make sure the input swing of the output driver is always above 800 mV. High bandwidth is also desired to avoid limiting the bandwidth of the driver. To meet the gain and bandwidth requirements, as well as the stringent area requirement, a two-stage inductor peaking topology is proposed. The traditional inductor peaking approach shown in figure 3 (a) [5] needs two inductors in each stage, occupying a large die area. To minimize the area, we proposed to use a topology that shares the inductors between the first and second stages, as shown in figure 3 (b) [6]. To further shrink the area, a three-terminal inductor,



Figure 4. Output driver with the feedforward compensation.

which is equivalent to two inductors, has been used during the implementation. Simulations show that a gain of 13.3 dB and bandwidth of 14.8 GHz can be achieved, with a current consumption of 8 mA from 1.2 V supply.

## 2.3 Output driver

#### 2.3.1 Output driver with feedforward compensation

The proposed output driver with feedforward compensation is shown in figure 4. It is suitable for the common-cathode topology with the single-ended output connecting to the anode of the VSCEL. A 1:10 PMOS current mirror consisting of  $M_8$  and  $M_9$  is added to provide the level high current  $I_H$ . The modulation function is done by switching ON or OFF  $M_2$  so that a modulation depth current  $I_{DEP}$  is subtracted from the  $I_H$  or not.

Considering the right branch of the output driver (which consists of  $M_2$ ,  $M_4$ ,  $M_9$  and the VCSEL), the small-signal output current of the circuit can be expressed as follows:

$$I_{\rm OUT}(s) = \frac{g_{\rm m1,2}V_{\rm IN}}{2 \cdot [1 + s \cdot R_{\rm Dio}(C_{\rm D9} + C_{\rm Dio})]},$$
(2.1)

where  $R_{\text{Dio}}$  and  $C_{\text{Dio}}$  are the impedance and parasitic capacitance of the VCSEL diode respectively.  $C_{\text{D9}}$  is the parasitic capacitance of the drain of M<sub>9</sub>. Because of the low mobility of PMOS transistor compared to that of NMOS device, the size and thus the parasitics of M<sub>9</sub> are relatively large even if implemented with a thin oxide device. Together with the VSCEL parasitics, this is the main limitation the driver speed.

Compared to the right branch of the output driver, the left branch (which consists of  $M_1$ ,  $M_3$ ,  $M_5$  and  $M_6$ ) has much less parasitics load. In order to reach a speed of 10 Gb/s, a feedforward path can be built between the two branches by the coupling capacitor Cc [7]. In this design, we

further boost the speed of the left branch by the active inductor load, which consists of  $M_5$  and  $M_6$ , to enhance the feedforward strength at high frequencies. The high-frequency current output loss can be compensated by the compensation current generated by the feedforward path. This compensation current  $I_{COM}$  can be represented by

$$I_{\rm COM}(s) = \frac{g_{m1,2}g_{m9}g_{m6}C_{\rm C}(1 + s\frac{C_{\rm GS5}}{g_{\rm m6}}) \cdot V_{\rm IN}}{2\left[s^2C_{\rm C}C_{\rm P}C_{\rm GS5} + sC_{\rm GS5}g_{\rm m6}(C_{\rm C} + C_{\rm P}) + g_{\rm m6}g_{\rm m5}(C_{\rm C} + C_{\rm P})\right]}.$$
 (2.2)

In (2.2),  $C_P$  is the parasitic capacitance at the gate of M<sub>9</sub>. To get an idea on the bandwidth of  $I_{COM}$ , we can simplify (2.2) by assuming  $C_P \ll C_C$ ,

$$I_{\rm COM}(s) = \frac{g_{\rm m1,2}g_{\rm m9}C_{\rm C}(1+s\frac{C_{\rm GS5}}{g_{\rm m6}})}{2g_{\rm m5}C_{\rm C}(1+s\frac{C_{\rm GS5}}{g_{\rm m5}})} \cdot V_{\rm IN}$$
(2.3)

From (2.3), the bandwidth of  $I_{COM}$  is mainly determined by the size and biasing condition of  $M_5$  which has much less parasitics than the output of the right branch. Moreover, the zero determined by the size and biasing condition of  $M_5$  can further boost the bandwidth. Because the feedforward signal is superimposed to the output node, the overall bandwidth is boosted by the feedforward compensation path.

#### 2.3.2 Configurable rising/falling-edge pre-emphasis

One important difference between a VCSEL and the ideal RC load is its asymmetry on the rise time and fall time in the transient response. In order to reduce this asymmetry, we implemented a programmable rising/falling-edge pre-emphasis in the output driver, where the pre-emphasis can be set on either the rising edge, falling edge or both edges. Similar to the GBLD10, the pre-emphasis is realized by generating a high-frequency zero in the frequency domain [4]. As shown in figure 5, the output driver is source-degenerated by a RC network to generate the required zero. To tune the



Figure 5. Configurable edge pre-emphasis in output driver.

zero location so as to change the pre-emphasis amplitude, an emphasis capacitor array, controlled by a 3-bit DAC, is used in this RC network.

To make the edge pre-emphasis configurable, an emphasis control logic that can sense the input signal edge is implemented. As shown in figure 5, if the pre-emphasis is configured to be falling edge only, switch C is ON while switch A and D are OFF. With this configuration, switch B transits from ON to OFF during the falling edge of  $V_{IN+}$  (so as the output current), which makes the RC degeneration effective and generates the emphasis peak in the output current. On the other hand, the pre-emphasis is disabled at the rising edge of  $V_{IN+}$  because the RC network is shorted by switch B. Similarly, the rising edge only pre-emphasis or both edge pre-emphasis can be achieved by enabling switch D or both C and D respectively. The pre-emphasis function can be easily turned OFF by setting switch A and B ON so the degeneration network is always disabled regardless of the input signal.

### 2.4 GBLD10+ implementation

The GBLD10+ is implemented in a 9-metal 65-nm CMOS technology. The high-speed signals are routed with the thick top metal layers to reduce parasitics. Figure 6 (a) and (b) show the layout and chip photo of the GBLD10+. A three-terminal octagonal inductor with a diameter of 130  $\mu$ m is used in the limiting amplifier. The die size is 1750  $\mu$ m × 400  $\mu$ m including PADs and seal-rings, with an active area of only 0.16 mm<sup>2</sup>.





Figure 6. GBLD10+ implementation: (a) layout; (b) die photo.

## **3** Test results

Electrical tests have been carried out to characterize the GBLD10+ performance. An Agilent J-BERT N4903B was used to generate the differential PRBS input signal which is AC coupled to the chip. An Agilent DSA91204A oscilloscope was used to measure the outputs.



Figure 7. 10 Gb/s electrical eye diagram at 400 mV input swing.



**Figure 8.** 10 Gb/s electrical eye diagram at different input swings: (a) 200 mV input swing; (b) 1.2 V input swing.

The jitter performance at 10 Gb/s has been measured with PRBS  $2^7 - 1$  input data and at a BER of  $10^{-12}$ . Figure 7 shows the electrical eye diagram with a 4 mA modulation depth with the differential input swing of 400 mV. The total jitter in the electrical eye is 21.72 ps with an RMS random jitter component of 0.62 ps and a data-dependent jitter component of 13.42 ps peak-to-peak.

To test the function of the limiting amplifier, electrical eye diagrams at different input swings were measured. As shown in figure 8, the test results show that the GBLD10+ can accommodate a differential input swing from 200 mV to 1.2 V and maintain the stable modulation amplitude and jitter performance.

The supply current of the GBLD10+ has been measured at the default current settings for modulation height and depth. The measurements show 9 mA current consumption for the 1.2 V supply and 8 mA current consumption for the 2.5 V supply, which makes the total power of only 31 mW. A mistake in digital I/O pads prevented the full characterization of the chip. More measurement results on pre-emphasis, radiation tolerance and optical tests are expected after the Focused Ion Beam (FIB).

Table 1 compares the performance of the GBLD10+ with the GBLD10. Operating at the same speed of 10 Gb/s, the GBLD10+ has significant advantages in terms of the power consumption and die size. Moreover, compared to the differential AC coupling approach in the GBLD10 where external components on the PCB are needed, the GBLD10+ uses direct bonding to couple the VCSEL laser diode, greatly simplifying the PCB design and lowering the cost.

| Parameters                | GBLD10                                   | GBLD10+                                  |
|---------------------------|------------------------------------------|------------------------------------------|
| Technology                | 130-nm CMOS                              | 65-nm CMOS                               |
| Speed                     | 10 Gb/s                                  | 10 Gb/s                                  |
| Total Jitter (Electrical) | 14 ps                                    | 22 ps                                    |
| Power Consumption*        | 85 mW                                    | 31 mW                                    |
| Die Size                  | $2 \text{ mm} \times 2 \text{ mm}$       | $1.75 \mathrm{mm} 	imes 0.4 \mathrm{mm}$ |
| Laser Coupling Method     | differential AC with external components | single-ended direct bonding              |

| Table 1. Per | formance c | comparison | power. |
|--------------|------------|------------|--------|
|--------------|------------|------------|--------|

\* Power consumption is tested at default register settings after power-on reset operation.

## 4 Conclusion

A 10 Gb/s VCSEL low power laser driver (GBLD10+) has been designed in a 65-nm CMOS technology. To meet the speed requirement within a compact die size, a shared-inductor peaking and a feedforward approach have been used in the limiting amplifier and output driver respectively. A programmable pre-emphasis function, implemented by RC source degeneration technique, has been proposed to balance the rise and fall times of the optical signal. Measurements show that the GBLD10+ is able to achieve the speed of 10 Gb/s with a power consumption of only 31 mW. The whole chip has a compact die size of  $1.75 \text{ mm} \times 0.4 \text{ mm}$ , allowing users to drive the VCSEL die and choose the transmitter channel count by die-to-die bonding. The direct bonding method of coupling the laser diode used in GBLD10+ eliminates the need for external components on the PCB, which simplifies the board design and lowers the implementation costs.

## Acknowledgments

This work is supported by the DoE/Collider Detector Research Program. We would like to thank Jeffrey Prinzie and Paul Leroux in KU Leuven for their help during the design, integration and dicing. We also want to thank François Vasey, Jan Troska, Christophe Sigaud, Giuseppe Pezzullo, Csaba Soos, and Lauri Olanterä for their help on the test setup.

#### References

[1] F. Vasey et al., The versatile link common project: feasibility report, 2012 JINST 7 C01075.

- [2] G. Mazza et al., *The GBLD: a radiation tolerant laser driver for high energy physics applications*, 2013 *JINST* **8** C01033.
- [3] F. Vasey, Versatile Link PLUS project, https://espace.cern.ch/project-Versatile-Link-Plus.
- [4] T. Zhang, F. Tavernier, P. Moreira and P. Gui, A 10 Gb/s laser driver in 130 nm CMOS technology for high energy physics applications, 2015 JINST 10 C02036.
- [5] B. Razavi, *Design of Integrated Circuits for Optical Communications*, 2<sup>nd</sup> Edition, John Wiley & Sons, Inc., Hoboken, New Jersey, U.S.A. (2012).
- [6] T. C. Huang et al., A 28 Gb/s 1 pJ/b shared-inductor optical receiver with 56% chip-area reduction in 28 nm CMOS, in proceeding of the International Solid-State Circuits Conference (ISSCC), San Francisco, U.S.A., Feb. 9–13, 2014.
- [7] D. Kucharski et al., A 20 Gb/s VCSEL driver with pre-emphasis and regulated output impedance in 0.13 μm CMOS, in proceeding of the International Solid-State Circuits Conference (ISSCC), San Francisco, U.S.A., Feb. 6–10, 2005.