Introduction To Nanoelectronics
Introduction To Nanoelectronics
Introduction To Nanoelectronics
Introduction to Nanoelectronics
Increasing miniaturization of devices, components, and integrated systems requires
developments in the capacity to measure, organize, and manipulate matter at the
nanoscale. This textbook is a comprehensive, interdisciplinary account of the technology
and science that underpin nanoelectronics, covering the underlying physics, nanostructures, nanomaterials, and nanodevices.
Without assuming prior knowledge of quantum physics, this book provides a unifying
framework for the basic ideas needed to understand the recent developments in the field.
Following an introductory description of recent trends in semiconductor and device
nanotechnologies, as well as novel device concepts, materials for nanoelectronics are
treated, covering methods of growth, fabrication and characterization. Treatment then
moves to an analysis of nanostructures, including recently discovered nanoobjects, and
concludes with a discussion of devices that use a simple scaling-down approach to
copy well-known microelectronic devices, and nanodevices based on new principles that
cannot be realized at the macroscale.
Numerous illustrations, homework problems and interactive Java applets help the
student to appreciate the basic principles of nanotechnology, and to apply them to real
problems. Written in a clear yet rigorous and interdisciplinary manner, this textbook is
suitable for advanced undergraduate and graduate students in electrical and electronic
engineering, nanoscience, materials, bioengineering, and chemical engineering.
Further resources for this title, including instructor solutions and Java applets, are
available online at www.cambridge.org/9780521881722.
Vladimir V. Mitin is a Professor and Chair of the Department of Electrical Engineering
at the University of Buffalo, State University of New York. He has co-authored eight
books, and over 400 professional publications, including ten patents.
Viatcheslav A. Kochelap is Professor and Head of the Theoretical Physics Department
at the Institute of Semiconductor Physics National Academy of Sciences, Ukraine. He
has published over 200 journal articles.
Michael A. Stroscio is a Professor in the departments of Electrical and Computer Engineering, and Bioengineering, at the University of Illinois at Chicago. He is a Fellow of
the IEEE, the American Physical Society, and the AAAS.
Introduction to Nanoelectronics
Science, Nanotechnology, Engineering,
and Applications
VLADIMIR V. MITIN
University of Buffalo, State University of New York
VIATCHESLAV A. KOCHELAP
Institute of Semiconductor Physics National Academy of Sciences, Ukraine
MICHAEL A. STROSCIO
University of Illinois at Chicago
Cambridge University Press has no responsibility for the persistence or accuracy of urls
for external or third-party internet websites referred to in this publication, and does not
guarantee that any content on such websites is, or will remain, accurate or appropriate.
Contents
Preface
List of notation
1
Wave mechanics
3.1
3.2
3.3
3.4
3.5
3.6
Introduction
Classical particles
Classical waves
Waveparticle duality
Closing remarks
Problems
Introduction
The Schrodinger wave equation
Wave mechanics of particles: selected examples
Atoms and atomic orbitals
Closing remarks
Problems
page vii
xii
1
11
11
11
13
23
28
29
33
33
33
38
55
62
63
65
65
66
68
73
84
87
95
98
104
107
vi
Contents
Introduction
Time and length scales of the electrons in solids
Statistics of the electrons in solids and nanostructures
The density of states of electrons in nanostructures
Electron transport in nanostructures
Closing remarks
Problems
Introduction
Bulk crystal and heterostructure growth
Nanolithography, etching, and other means for fabrication of
nanostructures and nanodevices
Techniques for characterization of nanostructures
Spontaneous formation and ordering of nanostructures
Clusters and nanocrystals
Methods of nanotube growth
Chemical and biological methods for nanoscale fabrication
Fabrication of nanoelectromechanical systems
Closing remarks
Problems
Introduction
Electrons in quantum wells
Electrons in quantum wires
Electrons in quantum dots
Closing remarks
Problems
Nanostructure devices
8.1
8.2
8.3
8.4
8.5
8.6
8.7
8.8
8.9
Introduction
Resonant-tunneling diodes
Field-effect transistors
Single-electron-transfer devices
Potential-effect transistors
Light-emitting diodes and lasers
Nanoelectromechanical system devices
Quantum-dot cellular automata
Closing remarks
109
109
110
115
120
127
134
136
141
157
161
163
165
165
165
172
180
183
213
215
218
218
218
228
231
237
238
242
242
242
255
269
276
284
306
317
321
323
325
Preface
Welcome to the amazing nanoworld! In this book you will find fundamental principles in nanoscience and basic techniques of measurement, as well as fabrication and
manipulation of matter at the nanoscale. The book discusses how these principles, techniques, and technologies are applied to the newest generation of electronics, known as
nanoelectronics.
The science of atoms and simple molecules, and the science of matter from microstructures to larger scales, are both well established. A remaining, extremely important, sizerelated challenge is at the nanoscale roughly the dimensional scales between 10 and 100
molecular diameters where the fundamental properties of materials are determined and
can be engineered. This field of science nanoscience is a broad and interdisciplinary
field of emerging research and development.
Nanotechnology is concerned with materials, structures, and systems whose components exhibit novel and significantly modified physical, chemical, and biological properties due to their nanoscale sizes. A principal goal of nanotechnology is to control and
exploit these properties in structures and devices at atomic, molecular, and supramolecular levels. To realize this goal, it is essential to learn how to fabricate and use these devices
efficiently. Nanotechnology has enjoyed explosive growth in the past few years. In particular, nanofabrication techniques have advanced tremendously in recent years. Obviously,
revolutionary changes in the ability to measure, organize, and manipulate matter on the
nanoscale are highly beneficial for electronics with its persistent trend of downscaling
devices, components, and integrated systems. In turn, the miniaturization required by
electronics is one of the major driving forces for nanoscience and nanotechnology.
Practical implementations of nanoscience and nanotechnology have great importance,
and they depend critically on training people in these fields. Thus, modern education
needs to address the rapidly evolving facets of nanoscience and nanotechnology. A new
generation of researchers, technologists, and engineers has to be trained in the emerging nanodisciplines. With the purpose of contributing to education in the nanofields,
we present this textbook providing a unifying framework for the basic ideas needed to
understand recent developments underlying nanoscience and nanotechnology, as applied
to nanoelectronics. The book grew out of the authors research and teaching experience
in these subjects. We have found that many of the ideas and achievements in fields underlying nanoscience and nanotechnology can be explained in a relatively simple setting,
if the necessary foundational underpinnings are presented properly. We have designed
this textbook mainly for undergraduate students, who will be trained in diverse fields
viii
Preface
Preface
ix
wavelength. By analyzing particle motion, we show that at small spatial scales a particle
can not be characterized by exact coordinates and momentum and that it behaves rather as
an extended wave-like object. This analysis establishes the essence of the waveparticle
duality which is an underlying principle of nanophysics.
In Chapter 3, we discuss the basic physical concepts and equations related to the
behavior of particles in the nanoworld. We introduce the Schrodinger wave equation for
particles and determine the ways in which to calculate observable physical quantities.
Keeping in mind the diverse variants of nanostructures, by using wave mechanics we
analyze a number of particular examples, which highlight important quantum properties of particles. Many of the examples analyzed can serve as the simplest models of
nanostructures and are exploited in later chapters.
Chapters 4 and 5 are devoted to materials used in nanoelectronics, methods of their
growth, and fabrication and characterization techniques.
In Chapter 4, we present an overview of the basic materials that are exploited in
nanoelectronics. We start with semiconductor materials as the principal candidates for
use in nanoelectronics, because they offer great flexibility in the control of the electronic and optical properties, and functions, of nanoelectronic devices. We show how,
through proper regimes of growth, doping by impurities, and sequent modifications and
processing, one can fabricate nanostructures and nanodevices starting from bulk-like
materials. Then, we introduce other materials that have properties of great potential in
nanoelectronics. Organic semiconductors and carbon nanotubes are discussed.
In Chapter 5, the principal methods of materials growth and nanodevice fabrication are
presented. We start with an analysis of fabrication of nanodevices on the basis of perfect
materials and continue by considering processing techniques. All stages of fabrication
and methods of processing are considered in detail. Then, we discuss special regimes of
material growth, when nanostructures (mainly quantum dots) are formed spontaneously
due to the growth kinetics. These approaches to the production of nanostructures and
nanoelectronic devices actually represent evolutionary improvements in the growth
and processing methods applied previously in microelectronics. Nanoscale objects like
carbon nanotubes and biomolecules require, in general, other techniques for production.
These innovative techniques are also highlighted in this chapter.
We pay special attention to the most important characterization techniques, such
as atomic-force microscopy, scanning tunneling microscopy, and transmission electron
microscopy, among others.
Also in Chapter 5, we review advances in nanotechnology that came from synthetic chemistry and biology. These include chemical and biological methods of surface
nanopatterning for preparing nanostructured materials with predefined and synthetically programmable properties. The basic ideas related to these chemical and biological
approaches are discussed. Finally, we study the methods of fabrication of a new class of
devices commonly known as nanoelectromechanical systems (NEMS).
Chapters 68 include analyses of electron properties of nanostructures, traditional
low-dimensional systems, and recently discovered nano-objects.
In Chapter 6, transport of charge carriers is analyzed. Important aspects of transport
regimes are elucidated by comparing the time and length scales of the carriers with
Preface
Preface
xi
V.A.K. thanks his colleagues Professor S. V. Svechnikov, Professor A. E. Belyaev, Professor F. T. Vasko, Dr. V. I. Pipa and Dr. B. A. Glavin from the Institute of Semiconductor
Physics at Kiev for numerous fruitful discussions of general problems and perspectives of
nanoscience and nanoelectronics. The wide spectrum of research directions and creative
atmosphere prevailing in the Theoretical Physics Department stimulated his involvement
in various nanoscience activities. He acknowledges permanent contacts with graduate
and PhD students, which gave him valuable feedback during the work on the textbook.
V.A.K. is deeply grateful to all his family members for their understanding and permanent
support.
M.A.S. extends his sincere thanks to Deans Prith Banerjee and Larry Kennedy, at the
College of Engineering, University of Illinois at Chicago (UIC) for their active encouragement and their longstanding efforts to promote excellence in research at the UIC.
M.A.S. gratefully acknowledges the generous support and enlightened encouragement
of Richard and Loan Hill. Special thanks go to Dr. Dwight Woolard of the US Army
Research Office, Drs. Daniel Johnstone, Todd Steiner, and Kitt Reinhardt of the AFOSR,
Dr. Rajinder Khosla, Dr. James W. Mink and Usha Varshney of the National Science
Foundation, Dr. Daniel Herr of the Semiconductor Research Corporation, and Dr. John
Carrano of the Defense Advanced Research Projects Agency for their encouragement
and interest. M.A.S. acknowledges the essential roles that several professional colleagues
and friends played in the events leading to his contributions to this book; these people
include Professor Richard L. Magin, Head of the Bioengineering Department at the
University of Illinois at Chicago (UIC), Professors Robert Trew, Gerald J. Iafrate, M. A.
Littlejohn, K. W. Kim, and R. and M. Kolbas, and Dr. Sergiy Komirenko of the North
Carolina State University, Professors G. Belenky and S. Luryi and Dr. M. Kisin of the
State University of New York at Stony Brook, Professors George I. Haddad, Pallab K.
Bhattacharya, and Jasprit Singh, and Dr. J.-P. Sun of the University of Michigan, Professors Karl Hess and J.-P. Leburton at the University of Illinois at Urbana-Champaign,
Professor L. F. Register of the University of Texas at Austin, and Professors H. Craig
Casey and Steven Teitsworth of Duke University. M.A.S. also thanks family members
for their support while this book was being prepared; these include Anthony and Norma
Stroscio, Mitra Dutta, and Elizabeth, Gautam, and Marshall Stroscio.
Notation
Symbols
A amplitude of wave
A average value of A
a lattice constant
a0 length of carboncarbon bond in carbon nanotubes
a1 , a2 , a3 basis vectors
ai basis vectors of lattice
B magnetic field
C capacitance
d translation vector
D diffusion coefficient
dsp spacer thickness
E energy of a particle
EF Fermi level
Eg bandgap
e elementary charge
F electric field
F0 amplitude of electric field
f frequency
f SET frequency of Bloch oscillations
f vector of force
F distribution function
F F Fermi distribution function
G conductance
G0 quantum of conductance
H Hamiltonian operator
H magnetic field
H direction of the nearest-neighbor hexagon rows
H total energy, Hamiltonian function
h Plancks constant
h wave energy density
h1D wave energy density for a one-dimensional medium
-h Plancks constant divided by 2
Notation
J current density
I current
IT tunnel current
I wave intensity
i quantum-mechanical flux of the particles
k wavevector
kB Boltzmanns constant
L inductance
LT thermal diffusion length
le mean free path between two elastic collisions
l orbital quantum number
l angular momentum
l coherence length
LE inelastic scattering length
Lx , L y , Lz dimensions of a sample
M mass of resonator
M magnetic dipole moment
m magnetic quantum number
m effective mass of electron
m0 mass of an electron in vacuum
mHH heavy-hole mass
mLH light-hole mass
mSH split-off hole mass
Ns sheet concentration of donors
Ndepl sheet concentration of ionized acceptors
n principal quantum number
ns sheet concentration of electrons
P( ) Hermite polynomial
Pb Probability of finding electron under the barrier
q wavevector
Q quality factor
Q amount of deposited material
R radial function
r magnitude of radius vector
r coordinate vector
r0 Bohrs radius
R radius of quantum dot
R tube radius of carbon nanotube
R reflection coefficient
S spin intrinsic angular momentum
S cross-section
Sz projection of the spin of electron
s distance between tip and surface
s phase velocity of traveling wave
xiii
xiv
Notation
s spin of electron
t time
ttr transit time
T time period
T vector corresponding to tube axis of carbon nanotube
T ambient temperature
Te electron temperature
Td translation operator
uk (r) Bloch periodic function
u displacement of atoms from their equilibrium positions
V potential energy
V volume
Vb barrier height
V0 volume of primitive cell
vd average(drift) velocity
v velocity
vh velocity of hole
W crystalline potential
UM potential energy
z0 characteristic length
Z atomic number
dimensionality factor
spring constant
(x) Diracs delta-function
Notation
frequency
q frequency of harmonic oscillator
(r, t) non-stationary wavefunction
(r, t) complex conjugate of wavefunction (r, t)
(r) stationary wavefunction
three-dimensional density
1D linear density of string
density of states
conductivity
theta-function
polar angle
E mean free time between two inelastic collisions
d decay time of flexural vibrations
e mean free time between two elastic collisions
(z) wavefunction
electron affinity
Abbreviations
BT bipolar transistor
CMOS complementary MOS, i.e., NMOS and PMOS on the same chip
DPN dip-pen nanolithography
FET field-effect transistor
JBT homojunction BT
JFET junction FET
HBT heterojuction BT
HEMT high-electron-mobility transistor
HFET heterojunction FET
HOMO highest occupied molecular orbit
LUMO lowest unoccupied molecular orbit
MES metalsemiconductor
MESFET metalsemiconductor FET
MODFET modulation-doped FET
MOS metaloxidesemiconductor
MOSFET metaloxidesemiconductor FET
QUIT quantum interference transistor
RTD resonant-tunneling diode
SIMOX separation by implantation of oxygen
SMS semiconductormetalsemiconductor
VMT velocity-modulation transistor
xv
This book provides the foundations and the main ideas emerging from research that
underlies the applied field called nanoelectronics. Nanoelectronics promises to improve,
amplify, and partially substitute for the well-known field of microelectronics. The prefix
micro denotes one millionth and, as applied to electronics, it is used to indicate that
the characteristic sizes of the smallest features of a conventional electronic device have
length scales of approximately a micrometer. The prefix nano denotes one billionth.
Thus, in nanoelectronics the dimensions of the devices should be as many as a thousand
times smaller than those of microelectronics.
Such a revolutionary advance toward miniaturization of electronics is based on the
recently developed ability to measure, manipulate, and organize matter on the nanoscale
1 to 100 nanometers, i.e., 1 to 100 billionths of a meter. At the nanoscale, physics,
chemistry, biology, materials science, and engineering converge toward the same principles and tools, and form new and broad branches of science and technology that can be
called nanoscience and nanotechnology.
Advancing to the nanoscale is not just a step toward miniaturization, but requires the
introduction and consideration of many additional phenomena. At the nanoscale, most
phenomena and processes are dominated by quantum physics and they exhibit unique
behavior. Fundamental scientific advances are expected to be achieved as knowledge in
nanoscience increases. In turn, this will lead to dramatic changes in the ways materials,
devices, and systems are understood and created. Innovative nanoscale properties and
functions will be achieved through the control of matter at the level of its building blocks:
atom-by-atom, molecule-by-molecule, and nanostructure-by-nanostructure. The molecular building blocks of life proteins, nucleic acids, carbohydrates are examples of
materials that possess impressive properties determined by their size, geometrical folding, and patterns at the nanoscale. Nanotechnology includes the integration of manmade
nanostructures into larger material components and systems. Importantly, within these
larger-scale systems, the active elements of the system will remain at the nanoscale.
The driving forces underlying developments at the nanoscale have at least two major
complementary components scientific opportunities and technological motivations.
Scientific opportunities
The progress in physics, chemistry, and biology at the nanoscale represents a natural step
in advancing knowledge and understanding Nature. Scientific perspectives on this route
are conditioned first of all by new quantum phenomena in atomic- and molecular-scale
structures and by the interaction of large numbers of these small objects. Indeed, the
fundamental laws of physics in the nanoworld differ from those that apply to familiar
macroscopic phenomena. Instead of classical physics, that works so well for macroscopic
phenomena, the motion of particles and systems in the nanoworld is determined by the
so-called wave mechanics or quantum mechanics. A basic principle of nanophysics is the
fundamental concept that all matter, including electrons, nuclei, atoms, electromagnetic
fields, etc., behaves as both waves and particles. This waveparticle duality of all matter
is strikingly apparent at the nanoscale. For dealing with a large number of particles or
systems, the statistical laws are important. Statistical physics on the nanoscale is also
fundamentally different from that on the macroscale. In general, phenomena that involve
very large numbers of small interacting particles or systems follow different rules from
those involving only a few of them. Cooperative behavior of many-object systems is
revealed clearly at the nanoscale. Besides the phenomena just discussed, there are other
classes of phenomena that are important for science at the nanoscale.
It is appropriate here to refer to the famous 1959 lecture of the Nobel Prize laureate
Professor Richard Feynman with the title There is plenty of room at the bottom, where
he discussed the problem of manipulation and controlling things on a small scale.
Feynman did not just indicate that there is room at the bottom, in terms of decreasing
the size of things, but also emphasized that there is plenty of room. In his lecture,
Feynman justified the inevitable development of concepts and technologies underlying the nanoworld and presented his vision of exciting new discoveries and scientific
perspectives at the nanoscale.
Technological motivations
Achievements in nanoscience and nanotechnology will have tremendous multidisciplinary impact. The benefits brought by novel nanotechnologies are expected for many
important practical fields of endeavor. These include materials and manufacturing, electronics, computers, telecommunication and information technologies, medicine and
health, the environment and energy storage, chemical and biological technologies, and
agriculture. Having stated the purpose of this text, we consider now more detailed motivations for the development of electronics at the nanoscale.
In general, progress in electronics is stimulated, in part, by the enormous demands for
information and communication technologies as well as by the development of numerous special applications. The continuous demands for steady growth in memory and
computational capabilities and for increasing processing and transmission speeds of
signals appear to be insatiable. These determine the dominant trends of contemporary
microelectronics and optoelectronics. One of the main trends of the progress in electronics was formulated by Intel co-founder Dr. Gordon Moore as the following empirical
observation: the complexity of integrated circuits, with respect to minimum component
cost, doubles every 24 months. This statement formulated forty years ago is known as
Moores law and provides an estimate of the rate of progress in the electronics industry.
Specifically, Moores law predicts that the number of the basic devices transistors on
a microchip doubles every one to two years. This is possible only if progressive scaling
down of all electronic components is realized.
Electronics exploits the electrical properties of solid-state materials. A simple and
intuitive classification of solids makes a distinction between dielectrics and metals, i.e.,
dielectrics are non-conducting materials whereas metals are good conducting materials.
Semiconductors occupy the place in between these two classes: semiconductor materials
are conducting and optically active materials with electrical and optical properties varying over a wide range. Semiconductors are the basic materials for microelectronics and
remain the principal candidates for use in nanoelectronic structures because they exhibit
great flexibility in terms of allowing the control of the electronic and optical properties
and functions of nanoelectronic devices. Accordingly, to a large extent, we will analyze
the trends of electronics in the context of semiconductor technology.
It is instructive to illustrate these trends and achievements through the example of
Si-based electronics. Indeed, contemporary microelectronics is based almost entirely
on silicon technology, because of the unique properties of silicon. This semiconductor
material has high mechanical stability as well as good electrical isolation and thermal
conductivity. Furthermore, the thin and stable high-resistance oxide, SiO2 , is capable
of withstanding high voltages and can be patterned and processed by numerous methods. Silicon technology also enjoys the advantage of a mature growth technology that
makes it possible to grow Si substrates (wafers) of larger areas than for other semiconductor materials. The high level of device integration realizable with Si-based electronics technology may be illustrated by the important integrated circuit element of any
computer, controller, etc. the dynamic random access memory (DRAM). The main
elements of DRAM based on complementary metaloxidesemiconductor technology
(Si-CMOS) are metaloxidesemiconductor field-effect transistors (MOSFETs). For Si
MOSFETs, channels for flow of electric current are created in the Si substrate between
the source and drain contacts, and the currents are controlled by electrodes metal gates
which are isolated electrically by very thin SiO2 layers, which have become thinner than
10 nm.
Figure 1.1 illustrates the evolution of the DRAM size and transistor gate size as
functions of time. Besides transistors and capacitors, the chip contains metallic line
connections: local, intermediate, and global wiring. Figure 1.1 illustrates the steady
scaling down of all characteristic sizes and increasing levels of integration. For example,
the 64-Mbit DRAM chip contains approximately 108 transistors per cm2 , each with
feature sizes of the order of 0.3 m. The transistors in this DRAM as well as those of
the more highly integrated 256-Mbit chip operate as conventional devices and obey the
laws of classical physics. The next generation of devices is entering the nanoscale regime
where quantum mechanics is important; indeed, as we will discuss in this book, quantum
mechanics becomes dominant on the scale of approximately one to ten nanometers for
devices that operate at room temperature. According to Fig. 1.1, todays technology has
already reached the nanoscale and newer device concepts should be implemented before
2010.
One of the factors driving the huge production and wide use of microelectronic systems is the relatively low cost of their fabrication. Moreover, despite their increasing
1998
2001
2004
2007
2010
Memories, DRAM
Bits per chip
Cost per bit (milli-cent)
Cost per chip (US$)
64 M
0.017
11
256 M
0.007
18
1G
0.003
30
4G
0.001
40
16 G
0.0005
80
64 G
0.0002
130
Logic, microprocessors
Transistors per cm2
Cost per transistor (milli-cent)
Power supply (V)
4M
1
3.3
7M
0.5
2.5
13 M
0.2
1.8
25 M
0.1
1.5
50 M
0.05
1.2
90 M
0.02
0.9
Parameters
Minimum feature size (m)
Wafer size (in.)
Electrical defect density per m2
0.35
8
240
0.25
8
160
0.18
12
140
0.13
12
120
0.10
16
100
0.07
16
25
The data are from U. Konig, Physica Scripta, T68, 90, 1996.
Figure 1.1 Technology nodes and minimum feature sizes from application ITRS Roadmap:
MPU, Micro Processing Unit; ASIC, Application-Specific Integrated Circuit. Used with
permission, from W. Klingenstein (2002). Technology Roadmap for Semiconductors.
C InfineonTechnologies AG,
http://broadband02.ici.ro/program/klingenstein 3d.pdf, page 15.
2002.
In the same table, the integration levels of logic circuits and microprocessors are
forecast. We see that, for this case, device integration is also large but will increase
slightly slower than for DRAMs. The cost of the principal elements of logic circuits
transistors is significantly greater, but it also tends to decrease. The forecast for the
necessary power supply presented in the table portrays a slow, but persistent, decrease.
Thus, one can expect favorable trends for the power consumption of microelectronic
systems.
The bottom of the table presents the necessary technological parameters for ultra-high
integration: minimum feature sizes, diameters of wafers, and electrical defect densities.
The large wafer size allows a greater number of devices to be fabricated on a chip. The
density of electrical defects (i.e., crystal imperfections which affect electrical properties)
is characteristic of the quality of the wafers. Table 1.1 forecasts that wafer diameters will
be continuously increased, while the defect density decreases by a factor of six per
decade; currently, they must be limited to several tens per m2 .
After this overview of the dominant driving forces in nanoscale development, we will
mention briefly other general issues important for this field. These include improving
materials, fabrication and measurement techniques on the nanoscale, and novelties in
the operation principles of nanodevices.
1990s2000s
chemistry and even biology have much to offer for emerging nanotechnologies. Some
fundamental concepts coming from these fields can successfully be exploited for the
synthesis of nanomaterials and nanodevices. These include chemical and biological
methods of growth of nanoscale objects such as carbon nanotubes and biomolecules
surface nanopatterning, and preparing nanostructured materials with predefined, synthetically programmable properties from common inorganic building blocks with the
help of DNA interconnect molecules, etc.
devices may operate on the basis of single-electron transfer. Various novel single-electron
devices have been proposed and demonstrated. By reducing the sizes of quantum dots
or less, it is possible to operate with single electrons at temperatures near or
to 100 A
close to room temperature.
The great technological advances brought about in mainstream microelectronics and
nanoelectronics can be used for the fabrication of other classes of nanodevices. One
such approach is based on quantum dots arranged in locally interconnected cellularautomata-like arrays. The fundamental idea of operation of cellular automata is to encode
information using the charge configuration of a set of quantum dots. Importantly, in the
quantum-dot cellular-automata approach, the information is contained in the arrangement of charges of the dots, rather than in the flow of the charges, i.e., electric current.
It can be said that the devices interact by direct Coulomb coupling rather than via the
current through the wires.
Another approach employs both electrical and mechanical properties of nanostructures. The new generation of devices and systems based on this approach is commonly
referred to as nanoelectromechanical systems (NEMSs). Indeed, on the nanoscale a
strong enhancement of coupling between electronic and mechanical degrees of freedom
occurs. This electromechanical concept may be used for the development of a new class
of devices that includes nanomachines, novel sensors, and a variety of other new devices
functioning on the nanoscale. Thus, NEMSs may supplement the traditional electronics
that works solely with electrical signals.
10
may be improved greatly when nanostructures such as quantum wells, quantum wires,
and quantum dots are exploited as active optical elements.
As for the previously studied case of microelectronics, the trends in optoelectronics
involve scaling down the sizes of these devices as well as achieving high levels of
integration in systems such as arrays of light diodes, laser arrays, and integrated systems
with other electronic elements on the same chip. Optoelectronics benefits substantially
through the use of nanotechnology and becomes competitive with its microelectronic
counterpart.
In conclusion, the current and projected trends in electronics lead to the use of nanostructures and to the reliance on novel quantum effects as an avenue for realizing further
progress. These recent and diverse trends in semiconductor and device technologies as
well as in novel device concepts are driving the establishment of a new subdiscipline
of electronics based on nanostructures, i.e., nanoelectronics. This subdiscipline and its
foundations are studied in this book.
More general information on nanoscience, nanotechnology, and nanostructures, and
their potential, may be found in the following reviews:
R. Feynman, Theres plenty of room at the bottom, American Physical Society
Meeting, Pasadena, CA, 29 December 1959; originally published in Caltechs
Engineering and Science Magazine, February 1960; reprinted as R. P. Feynman,
Infinitesimal machinery, Microelectromechanical Systems, 2, 1 (1993); (see, for
example, www.zyvex.com/nanotech/feynman.html).
National Nanotechnology Initiative: The Initiative and Its Implementation Plan,
National Science and Technology Council, Committee on Technology, Washington
DC, 2000 (see, for example, www.nano.gov).
H. Kroemer, Quasielectric fields and band offsets: teaching electrons new tricks,
Rev. Mod. Phys., 73, 783 (2001).
The International Technology Roadmap for Semiconductors (Semiconductor Industry
Association, San Jose, CA, 2002 update).
2.1
Introduction
The evolution of microelectronics toward reduced device sizes has proceeded to a degree
that renders conventional models, approaches, and theories inapplicable. Indeed, for
objects with sizes of 100 nanometers or less it is frequently the case that the length scales
associated with fundamental physical processes are comparable to the geometrical size
of the device; also, fundamental time scales associated with physical processes are of
the order of the time parameters for nanodevice operation. Therefore, on the nanoscale
the theories and models underlying modern nanoelectronics become more complicated,
and rely more and more on basic science.
Generally, in the nanoworld the fundamental laws of physics that govern particles and
material fields differ from those that apply to familiar macroscopic phenomena such as
the motion of a baseball or a train. Instead of classical mechanics, that works so well
for macroscopic phenomena, the motion of particles in the nanoworld is determined
by the so-called wave mechanics or quantum mechanics. An underlying principle of
central importance for nanophysics is the fundamental concept that all matter, including
electrons, nuclei, and electromagnetic fields, behaves as both waves and particles, that
is, waveparticle duality is a basic characteristic of all matter.
At first glance, wave properties and particle properties for the same physical object
are hardly compatible. To understand waveparticle duality, we will briefly review, in
the following two subsections, the basic properties of particles and waves known from
classical physics.
2.2
Classical particles
A particle can be characterized by the momentum vector p and the kinetic energy E that
depends on the momentum. Here and throughout this book we will use the bold fonts for
vectors, i.e., p is the vector and p |p| is its absolute value. The change of momentum
with time is defined by Newtons second law:
dp
= f,
(2.1)
dt
where t is time and f is the vector of an external force. From Eq. (2.1) it follows that, if
the force is absent, then dp/dt = 0, i.e., p = constant. This is the so-called momentum
12
conservation law valid for a mechanical system in the absence of external forces. For
simplicity in classical mechanics, we assume that any particle we consider has a very
small size in comparison with the space where the particle is located. We refer to such a
particle as a point particle.
The coordinate vector, r, of a point particle and the particle velocity, v, are related by
the well-understood relation
dr
= v.
dt
(2.2)
To obtain the relationship among the velocity, v, the momentum, p, and the energy of a
particle, E, one has to calculate the power associated with the force f (the work of the
force f on the particle per unit time). So, we have to multiply the left- and right-hand
sides of Eq. (2.1) by v, resulting in v dp/dt = fv. (Remember that the scalar product of
two vectors a and b is defined as ab = ax bx + a y b y + az bz .) The right-hand side of the
transformed version of Eq. (2.1), fv, is equal to the rate of the energy change dE/dt
(dE/dt = fv), and we obtain the relation
dE
dp
=v .
dt
dt
(2.3)
Using the chain rule of function differentiation (d f (x)/dt) = (d f /dx)(dx/dt), we determine how the velocity of the particle is related to the momentum and energy:
v=
dE
.
dp
(2.4)
Here, the derivative with respect to the vector p also gives the vector v with components
vx =
dE
,
d px
vy =
dE
,
d py
and
vz =
dE
.
d pz
Let us consider an important case of a particle moving in a potential field. The force
is defined as the derivative of a potential V (r) with respect to the particle coordinate:
f = dV /dr. Note that, for the vector operator d/dr {d/dx, d/dy, d/dz}, one often
uses another notation: d/dr , so that dV /dr is the so-called gradient of function
V (r), V (r). On multiplying the left- and right-hand sides of Eq. (2.1) by v, using the
definition of Eq. (2.2) and the chain rule, we find
v
dp dV dr
d
(E + V (r)) = 0.
+
=
dt
dr dt
dt
(2.5)
represents the total energy of the particle, H. The above calculations tell us that the
total energy of a particle in a potential field does not change during its motion. So, we
have demonstrated the law of energy conservation, dH/dt = 0. When H is considered
as a function of two variables p and r, it is called the Hamiltonian function, or simply
the Hamiltonian. Remarkably, the partial derivatives of H give us both fundamental
equations (2.1) and (2.4): dp/dt = f = H/r and dr/dt = v = H/p. As we shall
13
see, the Hamiltonian of classical physics also plays an important role in the formulation
of quantum mechanics.
A point particle moving in free space may be characterized by a mass m and by the
kinetic energy:
E=
p2
.
2m
(2.6)
d2 r
= f.
dt 2
(2.8)
Now, Eq. (2.5) takes the form that we will use often in this book:
H
p2
+ V (r).
2m
(2.9)
One of the important results following from classical mechanics is that, if we know the
particle position r0 and its momentum p0 (or velocity v0 ) at an initial moment t0 , from
Eqs. (2.1)(2.4), we can find the position and the momentum (velocity) of the particle
at any given moment of time t for any given f or V (r).
Equations (2.1)(2.8) are the equations of classical mechanics. All of the variables,
such as r, p, E, and v, are continuous variables. Importantly, |p| can have any value,
including zero, i.e., p = 0 and E = 0 are allowed.
For a particle, say an electron, moving inside of a crystal (a metal, a dielectric, a
semiconductor, etc.), the interaction of this particle with the crystal generally makes the
relationship between E and p the dispersion relation more complicated. In particular,
E may be an anisotropic function, and the velocity and momentum may be noncollinear
vectors. Examples of such energy dependences will be given in the problems for this
chapter.
2.3
Classical waves
We are all familiar with a lot of examples of waves and wave processes. These include
sound waves in air, sea waves, and elastic waves in solids, electromagnetic waves, and
gravitational waves. Generally, in classical physics wave motion arises in extended continuous media with an interaction between the nearest elements of the medium. Such
an interaction gives rise to the transfer of a distortion (an excitation) from one element to another and to a propagation of this distortion through the medium. Despite the
14
n2
n1
a
zn2 = (n 2)a
n
M
un1
zn1 = (n 1)a
un
zn = na
n+1
n+2
un+1
Figure 2.1 A linear chain of identical atoms of mass M: u n are displacements of atoms from their
equilibrium positions and the restoring force acting on the nth atom is f n = (u n u n+1 )
(u n u n1 ). Note: displacements u n are not shown to scale.
differences in the particular nature of waves, wave motion has much in common for
different media. We introduce wave properties by analyzing the following simple model.
We will construct a model of a one-dimensional medium, elements of which are
represented by atoms connected by massless springs. Vibrations in such a linear atomic
chain are governed by the laws of classical mechanics. The chain is supposed to be
infinitely long. Let the equilibrium distance between atoms be a. Thus, the equilibrium
position of the nth atom is z n = na, and the displacement of this atom from its position
is denoted by u n . Figure 2.1 depicts such a linear chain of identical atoms of mass M.
The springs represent interatomic forces, i.e., interaction between nearest elements of
the medium. If the displacements of atoms from their equilibrium positions are not too
large, the restoring forces in the chain obey Hookes law,
f = u,
(2.10)
where u is a change of the spring length, is the spring constant, and f is the force
exerted by the spring. Now, we can apply Eq. (2.10) for the total force, f n , acting on the
nth atom coupled with its two nearest neighbors by two springs as
f n = (u n u n+1 ) (u n u n1 ).
(2.11)
Hence, the Newton equation of motion, Eq. (2.8), for the nth atom is
d2 u n
= (2u n u n1 u n+1 ).
(2.12)
dt 2
This set of linear differential equations, Eq. (2.12), in principle, describes wave-like
processes. However, we will make a further simplification and modify this discrete set
of equations to obtain one equation describing a continuous medium. Such a continuous
medium with elastic forces between its elements is, obviously, a string. To make this
transformation to the continuous case, we shall consider the discrete coordinate to be
continuous, z n z, and replace the finite difference in Eq. (2.12) with a derivative:
M
2u
u
(2u n u n1 u n+1 )
u n u n1
and
a
z
a2
z 2
Then, we obtain the equation describing the displacement, u, of a string:
1D
2u
2u
= 0,
1D
t 2
z 2
(2.13)
(2.14)
15
where 1D = M/a is the linear density of the string, and 1D = a is the so-called elastic
modulus of the string. The infinite set of ordinary differential equations, Eq. (2.12), is
replaced by a single partial differential equation. Since we started our derivation for a
mechanical system, for which the energy (per atom) can be defined, it is instructive to
find a similar characteristic for the continuous medium described by Eq. (2.14). Assume,
for a moment, that the spring under consideration is of finite large length, L. Then, let
us multiply Eq. (2.14) by u/t and integrate it over the length L:
L
2 u u
2 u u
dz 1D 2
1D 2
= 0.
t t
z t
0
By integrating
the second term by parts by using the standard relationship w dv =
wv v dw, we find the following identity for a unit length of the string:
1D u 2 1D u 2
1 L
dz
+
t L 0
2 t
2
z
u u
1D
u u
= 0.
L
z t z=L
z t z=0
To draw further conclusions,
we define an average of a quantity, A, over a piece of the
string, Z , as A = (1/Z ) (Z ) A dz. For a long string, the average should not depend on
the length, Z , of the piece of string. Now, as L , the latter identity will be satisfied
if the value
1D u 2 1D u 2
h 1D =
+
(2.15)
2 t
2
z
is independent of time, i.e., h 1D /t = 0, and the value h 1D is conserved. In fact, the
first term on the right-hand side of Eq. (2.15) is, obviously, the density of kinetic energy
over the length of the string, while the second term is the elastic (potential) energy. Thus,
h 1D has the meaning of the energy density for our one-dimensional continuous medium.
As expected, the energy density is conserved, if external forces are absent.
Our one-dimensional analysis, which assumes that the atoms can move only along
a single direction z, can be generalized to a three-dimensional elastic medium; see
Problem 4 of this chapter. Now, the displacement becomes a three-dimensional vector,
u, and, instead of Eq. (2.14), we obtain
2
2u
u 2u 2u
2
(2.16)
+ 2 + 2 =0
t
x2
y
z
with r = {x, y, z} being the vector coordinate and being the three-dimensional density, i.e., the mass of a unit volume of the medium; is the elastic modulus of the
medium. Since it was introduced as a single elastic modulus, independent of direction,
Eq. (2.16) is valid for an isotropic medium. The energy density of a three-dimensional
elastic medium is
u 2 u 2
h=
+
.
(2.17)
2 t
2 r
16
+
+
= div u.
r
x
y
z
(!V )
A dV ),
Finally, we can rewrite Eq. (2.16) in the standard form of the wave equation:
2
2u 2u
2u
2 u
= 0,
(2.18)
s
+
+
t 2
x2
y2
z 2
where we introduce a new parameter s, the meaning of which will be clarified below. For
our model
(2.19)
s = /.
Although we derived Eq. (2.18) for a particular model of the elastic medium, the equation
can be applied to describe a wide class of physical vector fields as exemplified by
u(x, y, z) the displacement field associated with a wave in an elastic medium. If the
characteristic of a wave field is a scalar value, say w, in Eq. (2.18), we should simply
substitute u w to describe the case of a scalar physical field.
Now we will analyze solutions to Eq. (2.18) for some cases in which the solutions
are particularly simple. In many cases, such solutions are associated with wave-like
processes. We may look for solutions of the form
u(t, r) = A cos(qr t) + B sin(qr t),
(2.20)
where A and B are arbitrary vectors, and q are unknown parameters; is known as
the angular frequency of the wave and q is called the wavevector. By substituting such a
form for u(t, r) into Eq. (2.18), we easily find that Eq. (2.20) is a solution of Eq. (2.18),
if 2 = sq2 . The relationship between and q = |q| is called the dispersion relation:
= s|q|.
(2.21)
Importantly, there is no limitation to the wavevector q: a solution can be found for any
q. This is valid only for infinitely extended media, for which the wavevector can be a
continuous vector.
Because the two terms in Eq. (2.20) behave similarly, we can discuss basic properties
of these solutions based on the example of sinusoidal waves:
u(t, r) = B sin(qr t).
(2.22)
The argument of the sine function is the phase of the wave, = qr t, and B is the
amplitude of the wave. Let the coordinate r be given, then we obtain a function that
oscillates in time with an angular velocity . The frequency defines the rate of change
of the phase with time t (radians per unit time). The period of time associated with a
single oscillation is
T = 2/.
Accordingly, T is known as the period. If the time t is fixed, Eq. (2.22) represents a
function that oscillates as the coordinate changes. These oscillations are characterized
by the wavevector, q (or wavenumber, q). The wavevector defines the rate at which the
17
u
l = 2p /q
B
l
l
4
l l
4
(b) t = T/4
(a) t = 0
Figure 2.2 A propagating wave u = B sin(qz t). At time t = 0 the wave u = B sin(qz t)
at the point z = /4 has displacement u = B sin(/2) = B. At time t = T /4 we have
t = (2/T )(T /4) = /2; the same displacement will occur at the point z = 0 : u =
B sin(qz t) = B sin(/2) = B.
phase changes with the coordinate (radians per unit length). One can introduce the spatial
period to represent the distance for which u undergoes an oscillation of one cycle. It is
called the wavelength,
= 2/q.
(2.23)
The relationship between the time period and the spatial period is defined by Eqs. (2.21)
and (2.23):
=
2
s = T s.
(2.24)
The waveforms of Eqs. (2.20) and (2.22) are often referred to as traveling plane
waves. Indeed, their phase changes in a single direction along the wavevector q and
surfaces of constant phase are planes perpendicular to q. Thus, in the case of a plane
wave in a three-dimensional medium, the wave parameters do not depend on the two
coordinates perpendicular to q. A traveling wave is illustrated in Fig. 2.2, where this
wave is shown at two different moments of time. Now, we can clarify the meaning of
the parameter s. Let the wavevector be directed along the z direction. Then, the wave
phase is = qz t = q[z (/q)t]. So we can conclude that a given magnitude of
the phase moves with the velocity /q = s, i.e., the parameter s in the wave equation
(2.18) is the phase velocity of the traveling wave. In the waveform of Eq. (2.22), the
constant B is the wave amplitude. The average of u2 (t, r) can be expressed in terms of
B as u 2 = B 2 /2, where we use the formulae [sin(qr t)]2 = [cos(qr t)]2 = 12 .
According to Eq. (2.15), the amplitude relates to the wave energy density. Indeed, we
note that
B2 2
u 2 u 2
[cos(qr t)]2 + q 2 [cos(qr t)]2
+
=
2 t
2 r
2
q 2
B 2 2
2
B2 2
B 2 2
+
=
+
=
.
=
2
2
2
2
2
2
2
18
In obtaining this result, we have taken into account that q 2 = 2 /s 2 and s = /.
Thus, we obtain h = 2 B 2 /2. Importantly, the energy density for traveling plane waves
is independent of the coordinate and it is proportional to 2 . If = 0, there is no wave
and there is no energy associated with it. Note that, for the waveforms of Eqs. (2.20) and
(2.22), averaging over a small volume of the medium is equivalent to averaging over the
period of oscillations.
For traveling waves, one often introduces also the wave intensity, which is the density
of the energy flux. This energy flux represents the energy transferred by the wave per unit
time through a unit cross-section perpendicular to q. The intensity is a vector directed
along q with the absolute value
I =sh =
s2 B 2
.
2
(2.25)
Besides the waveform of Eq. (2.20), a traveling wave can be presented in a complex
form,
u = Aei(qrt) ,
(2.26)
where the amplitude A is generally a complex quantity. Indeed, Eq. (2.26) is a solution to
the wave equation (2.18). For some cases, it is convenient to operate with such a complex
waveform. However, one should remember that true physical quantities always have real
values.
Now we consider the important wave phenomenon known as wave interference. Suppose that two sinusoidal waves of the same frequency propagate from different sources
through the medium. The sources of the waves are generally at different locations, so
the waves reach a point of observation r, in general, with different phase shifts 1 (r) and
2 (r):
u1 (t, r) = B1 sin(t + 1 (r)),
(2.27)
The resulting wave field is u(t, r) = u1 (t, r) + u2 (t, r). In experiments, instead of
the wave amplitude, it is the intensity of the wave that is measured in many cases.
From Eq. (2.25) the intensity is proportional to u2 . Straightforward calculation of u2
gives us
1
2
u2 =
(2.28)
B + B22 + 2B1 B2 cos[1 (r) 2 (r)] .
2 1
In deriving Eq. (2.28), we have used the identity
sin x sin y =
1
1
cos(x y) cos(x + y),
2
2
i.e.,
sin(t + 1 (r))sin(t + 2 (r)) =
1
1
cos(1 (r) 2 (r)) cos(2t + 1 (r) + 2 (r)),
2
2
and the fact that the average of cos(2t + 1 (r) + 2 (r)) is zero. Thus, the intensity
of the resulting wave consists of three contributions: the term related to the wave
19
Screen
S0
S1
S2
Interference picture
Light
coming from the first source, the term related to the wave from the second source,
and an additional term. This third contribution describes the effect of wave interference. It depends critically on the wave phase difference. The interference contribution can be positive (constructive interference at /2 < (1 2 ) < /2 when
cos (1 2 ) > 0), or negative (destructive interference at /2 < (1 2 ) < 3 /2
when cos(1 2 ) < 0). Importantly, the interference effect can be observed only
for waves with the same frequency (otherwise the averaging leads to zero interference contribution). The waves of Eq. (2.27) with time-independent phase shifts are
also known as coherent waves. If waves are characterized by a phase shift (1 2 ),
which jitters randomly in time, the waves are incoherent and no interference effect
occurs.
The simplest example illustrating the interference effect is the double-slit optical
experiment. Let two slits be illuminated by a light wave from a single source. These two
slits become two sources of coherent waves. The superposition of these waves generates
an interference pattern of fringes, as shown in Fig. 2.3.
The wave analyzed above travels along the vector q. Using the wavevector q in
Eq. (2.22) at the same frequency, we obtain another wave traveling in the opposite
direction. According to Eq. (2.20), a combination of these waves is also a solution to
Eq. (2.18):
u(t, r) = B+ sin(qr t) + B sin(qr + t),
20
(a) n = 1
(b) n = 2
(c) n = 3
Figure 2.4 Quantization of oscillations in the form of standing waves. The gray area is a
where again B+ and B are arbitrary constant vectors. The two waves can also be
interpreted as incident and reflected waves.
An important case is that of a wave propagating between two reflecting walls
placed at z = 0 and z = L. The waves depend on a single coordinate z: u(t, z) =
B+ sin(qz t) + B sin(qz + t). If the walls are rigid there are no displacements
at the walls, so we should use the boundary conditions for the waves at z = 0 and
z = L: u(z = 0) = u(z = L) = 0. One of the many physical embodiments of this situation are represented by waves in a continuum elastic medium bounded by rigid
boundaries at z = 0 and z = L; another is given by a string attached to two fixed
boundaries at z = 0 and z = L. From the boundary condition at z = 0, we find
B+ B = 0, thus u(t, z) = B+ [sin(qz t) + sin(qz + t)] = 2B+ sin(qz) cos(t).
(Note: sin x + sin y = 2 sin[ 12 (x + y)]cos[ 12 (x y)]). The boundary condition at z = L
can be satisfied if, and only if, sin(q L) = 0. The latter requires the so-called quantization of the wavevector q, q L = n, or
q = qn =
n,
L
n = 1, 2, 3, . . .,
(2.29)
where qn is called the wavenumber. That is, there exist only waves of a special form,
u = B sin(qn z) cos(n t),
(2.30)
at discrete values of the wavevector qn and frequency n = sqn . This important class of
waves is known as standing waves. From Eqs. (2.23) and (2.30), we have the relation
L = (/2)n. For standing waves, a strictly integer number of half-waves may be put
between the reflecting walls. Figure 2.4 illustrates standing waves of various wavelengths
for an arbitrary instant of time. The longest possible wave (n = 1) corresponds to = 2L
when a half-wave fits between the reflecting walls; see Fig. 2.4(a). To calculate the
energy density of the standing waves, we can use Eq. (2.15). Then, the energy density is
h n = B 2 n2 /4. It is independent of coordinate, as found for the traveling plane wave.
Obviously, the energy flux is now zero. It is important to stress again the difference
between traveling waves (with arbitrary and continuous values of q including q 0,
21
hence , in infinite media) and standing waves with quantized (or discrete) values
of qn with the minimum wavenumber q1 = /L.
The waveform given by Eq. (2.20) is actually a simple specific example of a more
general class of wave fields. Indeed, due to the linear character of the wave equation,
a sum of an arbitrary number of partial solutions is also a solution of the equation.
This property of waves satisfying linear wave equations is known as the superposition
principle. According to this principle, we may write the solution to Eq. (2.18) in the
general (complex) form as
u(t, r) =
Aq ei(qrq t) .
(2.31)
q
Here, the summation is taken over the wavevectors q. In addition, q relates to q through
the dispersion relation (2.21). The amplitudes of waves contributing to u depend, in
general, on the wavevectors. The superposition principle is the basis for many important
phenomena, including interference, formation of standing waves, and diffraction.
(2.33)
but only the real part of this formula has physical meaning. The same kind of equation
may be written for the magnetic field H.
22
H
z
Direction of
propagation
+
+
t 2
x2
y2
z 2
Since
> 1, the velocity of light, c/
, in a dielectric medium is less than that in free
space. The frequency, , and wavevector, q, are related through the dispersion relation
c
= |q|.
The energy of the wave can be characterized by the density of the electromagnetic
energy, which is defined as
W =
0
F2 (t) =
0
F02 ,
2
(2.35)
2.4
23
Waveparticle duality
In this section we will study how any physical object may behave simultaneously as
a particle and as a wave. To understand this so-called waveparticle duality, we use a
two-way road from waves to particles and from particles to waves.
(r) x 2
x x
x
where, for simplicity, we have written only the derivatives with respect to the xcoordinate. Within the geometrical-optics approximation, the phase (t, r) is large:
|| 1. Moreover, both contributions to the phase are large too: t, |0 | 1. Since the
phase is large, the leading terms in our calculations, namely the very first term (proportional to (/t)2 = 2 ) and the third term in the brackets (proportional to (0 / x)2 ),
are quadratic with respect to the phase derivatives. Keeping these leading terms and
cancelling out common multipliers, we obtain an equation for 0 :
0 2
(r)
= 2 2 .
(2.38)
r
c
If we find a solution of Eq. (2.38), we can define the so-called wave surfaces on which 0
is constant: 0 (r) = constant. For a given point r, the direction of the wave is determined
by the wavevector q = d0 /dr; the wave direction is perpendicular to the wave surface.
After we have found the wave surfaces and calculated q(r), we may construct the ray
trajectories, as presented in Fig. 2.6. In the simplest case of a homogeneous medium,
24
ray
a
b
c
ray
2
3
ray
Figure 2.6 A sketch of the construction of ray trajectories by using the wavefronts. Rays a, b, and
c are perpendicular to the wavefronts 1, 2, and 3.
where
does not depend on the coordinate, we obtain a trivial result: q = constant, i.e.,
the ray propagates along a straight path.
Thus, if the wavelength is small in comparison with the characteristic scales of the
system being considered, wave motion can be described in terms of ray trajectories. Up
to this point, our discussion has been based totally on classical physics.
Thus far, we have not discussed the wave amplitude (or the wave intensity). The wave
amplitudes enter the equations only as parameters that may be defined if the parameters
characterizing the light sources are known. However, the wave amplitude is directly
related to the kind of interpretation we are using: a classical interpretation or a quantum
interpretation. Indeed, it has been established that the classical-physics approach is valid
only for large intensities of waves. If the intensity of a wave is small, other laws of
quantum physics define the wave field.
Let us consider briefly basic quantum concepts related to wave fields. According to
Max Planck and Albert Einstein, electromagnetic waves interacting with matter can be
emitted and absorbed in discrete energy portions (quanta) photons. The energy of a
photon, E, is proportional to the frequency of the wave:
E = -h,
(2.39)
(2.40)
where q is the wavevector. (Equation (2.40) is formulated for photons in free space.)
Apparently, the direction of the momentum vector p coincides with the direction of
propagation of the wave. For each of the two possible polarizations of the radiation, one
introduces appropriate characteristics of photons: each polarization of light, b, is related
to a certain photon. Thus, a photon can be labeled by two indices, {q, b}.
25
-- )
Number of photons, Nq,b = W V /(h
Photon-flux density, I(r)/(h)
-- ) = Nq,b c
Photon flux, P/(h
Equation (2.39), which represents the energy of a quantum of light, may be readily
generalized for electromagnetic waves in a dielectric medium with dielectric constant
:
-hc
-h =
q.
(2.41)
It is very important that different quanta of an electromagnetic field do not interact with
each other, as is reflected by the linear character of the field in free space. An interaction
between these modes is possible only in special media. Such media are called nonlinear
optical media.
By introducing quanta, i.e., particles of electromagnetic field, we may understand
Eq. (2.38) and Fig. 2.6 in a new way. Let us multiply this equation by the factor -h2 /(2m),
where m is an inessential parameter. Then, we may write
-h2 2
(r)
p2
=0
2m
2mc2
upon taking into account the relations q = d0 /dr and p = -hq. On defining a potential
- 2 2
(r)/(2mc2 ), we come to the equation H p2 /(2m) + V (r) =
energy as V (r) = h
0, which is a direct analogue of the energy conservation law for a particle (compare this
with Eq. (2.9)). That is, Eq. (2.38) describes trajectories of quanta of the wave field.
In quantum theory, instead of the wave intensity one introduces the number of quanta
(or photons), Nq,b . If each of the quanta transfers the energy -h, then the intensity,
defined as the energy flux density, is I = -hcNq,b /V , where V is volume and Nq,b /V is
the density of quanta with wavevector q. Because the quantum picture has to coincide with
the classical picture for a large number of photons, one can match the latter relationship
and Eq. (2.36) for the classical intensity. From this comparison, it is possible to find the
relationship between the classical amplitude of the electric field, F0 , and the number of
quanta:
- Nq,b
2h
F0 =
.
(2.42)
0
V
Our discussion of the relationship between electromagnetic waves and photons provides an example of the waveparticle duality that is ubiquitous in quantum physics. A
comparison of some characteristics of electromagnetic fields in the classical and quantum interpretations is given in Table 2.1. For a wave of small intensity (small number
of photons, Nq,b ) the quantum description is more suitable, whereas for a substantial
intensity (large Nq,b ) a classical wave interpretation may be used.
26
(2.43)
2
.
k
(2.44)
2 -h
.
p
(2.45)
From Eqs. (2.44) and (2.45) we obtain a relation between the wavenumber and the
momentum of a particle: p = -hk, or, in the vector form,
p = -hk.
(2.46)
27
This relationship coincides exactly with Eq. (2.40) for quantization of electromagnetic
waves. This simplest case demonstrates the way in which one may attribute to a particle
both particle-like and wave-like properties.
Within the plane-wave description of a free particle, it follows from Eq. (2.43) that
there is an equal probability of finding a particle in any point of space:
|(r, t)|2 = (r, t) (r, t) = |A|2 = constant,
where (r, t) is the complex conjugate of (r, t).
This result appears to be in contradiction with the classical description of a particle.
Just as for the previously discussed case of electromagnetic fields, this contradiction
can be resolved by introducing the uncertainty principle. This principle has the form of
conditions restricting the range of coordinates and the range of momenta that can be
measured simultaneously for a particle. Uncertainties in the quantities !p and !r have
to satisfy the following inequalities:
!px !x h,
!p y !y h,
!pz !z h.
(2.47)
28
(a)
(1)
(2)
(3)
(4)
(b)
Figure 2.7 Tonomuras experiment. (a) A schematic diagram of the double-slit experiment.
(b) The build-up of interference fringes at various times: pictures on the monitor after (1)
10 electrons, (2) 200 electrons, (3) 6000 electrons, and (4) 140 000 electrons had been detected.
Electrons were emitted at a rate of 10 per second. Used with permission, from A. Tonomura.
(2006). Double-slit Experiment. (http://hqrd.hitachi.co.jp/globaldoubleslit.cfm), figures 1 and 2.
C Hitachi, Ltd. 1994, 2006. All rights reserved.
2.5
Closing remarks
This chapter emphasizes a fundamental property of all existing matter, which may be
called waveparticle duality. Starting with an analysis of an arbitrary wave field (elastic
vibrations in solids, electromagnetic fields, etc.), we found particle-like behavior of this
2.6 Problems
29
wave field at small wave amplitudes and (or) spatial scales larger than the wavelength.
By analyzing particle motion, we have seen that at small spatial scales a particle can not
be characterized by exact coordinates and momentum (velocity) and behaves rather as
an extended wave-like object. Thus, wave properties and particle properties for the same
physical object are compatible. This is the essence of waveparticle duality; indeed, this
is an underlying principle of central importance for nanophysics.
Contemporary microelectronics and optoelectronics exploit electric and light phenomena, which are determined entirely by the properties of electrons in solids. Estimates of
the de Broglie wavelength of electrons in solids give us a value of about 10 nm and larger,
i.e., emerging nanoelectronics inevitably should be based on the wave mechanics.
For those who want to look deeper into waveparticle duality we recommend the
following textbooks:
R. P. Feynman, R. B. Leighton, and M. Sands, Lectures on Physics, vol. 3 (New York,
Addison-Wesley, 1964).
W. E. Lamb Jr. The Interpretation of Quantum Mechanics, edited and annotated by J.
Mehra (Princeton, MA, Rinton Press, 2001).
The problems presented below have two aims: to help the reader to attain a better
understanding of the definitions and principles stated in this chapter, and to illustrate
some unusual behavior of the electrons in solids.
In the following chapters, we will introduce basic definitions and equations of quantum (wave) mechanics and analyze simple instructive examples that illustrate the main
quantitative and qualitative features of nanophysics.
2.6
Problems
1. For many semiconductor materials used in contemporary electronics, the relationship
between the momentum and energy of an electron is given by the implicit formula
E
p2
= E 1+
,
2m
Eg
where m is the so-called effective mass of the electron and 1/E g is the so-called nonparabolicity parameter. The formula has two solutions for unknown E: for electrons
(E > 0) and another for the so-called holes (E < 0).
(a) Find both solutions for E.
(b) Derive the expression for v for electrons only.
(c) Determine the electron velocity in free space and compare it with the expression
derived in (b).
As an example, consider GaAs, for which m = 0.067m 0 and E g = 1.42 eV.
Note: the expression for the kinetic energy of an electron in free space is
E=
p2
.
2m 0
30
py 2
px 2
+
.
2m t
2m l
(2.50)
the general solution to Eq. (2.8) is A sin( /m t + ), with A and being arbitrary
parameters. That is, the particle oscillates around the point z = 0 with the frequency
= /m. Particular values of parameters A and determine the magnitude and the
phase of these oscillations.
4. To illustrate the method of generalization of the one-dimensional model, one can
consider a square (two-dimensional) lattice with identical atoms placed at the corners
of the squares. Each atom interacts with its nearest neighbors, i.e., with four atoms, as
shown in Fig. 2.8. The equilibrium position of each atom is determined now by two
integer numbers, say n and m. The coordinates of the atoms can be written as rn,m =
{xn , ym } = {na, ma}. The atom with a given n and m has neighbors with coordinates
rn1,m , rn+1,m , rn,m1 , and rn,m+1 . If un,m are displacements of the atoms from their
equilibrium positions, their new vector-coordinates are r n,m = rn,m + un,m . Now the
displacements un,m and the forces are vectors. According to Hookes law, the force
acting on the {n, m}th atom from its nearest neighbors is
fn,m = (un,m un1,m ) + (un,m un+1,m ) + (un,m un,m1 ) + (un,m un,m+1 ) .
2.6 Problems
31
y
M
{n 1, m + 1} {n, m + 1}
un,m
(un,m)y
M
{n, m}
{n 1, m}
{n 1, m 1}{n, m 1}
{n + 1, m + 1}
(un,m)x
{n + 1, m}
{n + 1, m 1}
x
Figure 2.8 A square two-dimensional lattice.
Calculate fn,m , keeping only terms that are linear with respect to displacements un,m .
Show that the force fn,m has a form similar to Eq. (2.11), i.e.,
( f n,m )x = (u n,m )x (u n+1,m )x + (u n,m )x (u n1,m )x ,
etc. Write down the Newton equations for the displacements un,m . Using the previously
discussed replacement of the discrete system by the continuous medium, obtain a twodimensional wave equation. Find expressions for the density and the elastic modulus of
the two-dimensional elastic medium considered.
5. The simplest example of the interference effect is given by the superposition of two
sinusoidal waves (u1 = B sin 1 (r, t) and u2 = B sin 2 (r, t)). The resulting wave is a
sinusoidal wave of the form uint = Bint sin 3 (r, t).
(a) Find 3 (r, t).
(b) Find Bint .
(c) For the specific case of 1 (r, t) = 1 (r) + t and 2 (r, t) = 2 (r) + t, find 3 (r, t)
and Bint , and discuss the differences between your answers for this specific case and
answers (a) and (b) obtained before.
6. In addition to the solutions analyzed in Section 2.2, the wave equation (2.14)
has an infinite number of solutions of other forms. Let u(z) be an arbitrary function that satisfies Eq. (2.14), for which first and second derivatives du/dz, and
d2 u/dz 2 can be calculated. Show that functions u(z st) are the solutions to the wave
equation.
32
7.
Wave mechanics
3.1
Introduction
In this chapter we discuss the basic physical concepts and equations related to the behavior
of particles in the nanoworld. We introduce the Schrodinger wave equation for particles
and determine the ways to calculate observable physical quantities. We find that, in wave
mechanics, the motion of a particle confined to a finite volume is always characterized by discrete values of the energy and standing-wave-like wavefunctions, i.e., such a
motion is quantized. While motion in an infinite space (i.e., free motion) is not quantized
and is described by propagating waves, the energy of the particle is characterized by a
continuous range of values.
Keeping in mind the diverse variants of nanostructures, by using wave mechanics we
analyze some particular examples, which highlight important quantum properties of particles. Many of the examples analyzed can serve as the simplest models of nanostructures
and will be exploited in following chapters to understand the fundamentals of processes
in nanoelectronics.
3.2
The Schrodinger
wave equation
From the previous chapter, we conclude that nanosize physical systems are quantummechanical systems, inasmuch as their sizes are comparable to typical de Broglie wavelengths of the particles composing these systems. In dealing with quantum-mechanical
systems, one aims at determining the wavefunction of a single particle or of the whole
system. As we will demonstrate in the subsequent discussion, knowledge of the wavefunction in quantum mechanics is sufficient to describe completely a particle or even
a system of particles. This means that, if we know the wavefunction of a system, we
can, in principle, calculate all macroscopic parameters that define the properties of the
system.
The wavefunction of a particle satisfies the principal equation of quantum mechanics, the time-dependent Schrodinger wave equation,
i -h
= 0,
H
t
(3.1)
34
Wave mechanics
where, -h, the reduced Planck constant, -h = h/(2 ), was introduced earlier, and the
operator H is the Hamiltonian of the system:
-h2 2
+ V (r).
H =
2m
(3.2)
2
2
2
+ 2+ 2
2
x
y
z
(3.3)
being the Laplacian operator. On comparing Eqs. (3.1) and (3.2) for particles with
Eq. (2.18) for wave fields, we may notice that both equations have second derivatives with
respect to the spatial variable, r, and, correspondingly, first and second derivatives with
respect to time, t. Despite the latter difference, the solutions of Eq. (3.1) are expected to
be in wave-like form.
If the potential V (r) is time-independent, one can separate the dependences on the
time and spatial coordinates:
-
(3.4)
where (r) is a complex function of the spatial coordinates only. The time-dependent
wavefunction, (r, t), is often called the non-stationary wavefunction, while (r) is
referred to as a stationary wavefunction. By substituting Eq. (3.4) into Eq. (3.1), one
gets the time-independent Schrodinger equation:
-2 2
h
i j =
1,
0,
if i = j,
if i =
j.
The major task of quantum mechanics is to solve the Schrodinger wave equation,
Eq. (3.1).
As we have mentioned already, the wavefunction of a particle in free space
(V (r) = 0) has a plane-wave form (r, t) = Aei(krt) (Eq. (2.43)). On substituting this
3.2 The Schrodinger
wave equation
y3(z)
35
E
3
E3
Unbound state E3
Unbound state E2
V
y2(z)
E2
0
Potential well V (z)
E1
Bound state E1
Wavefunction y1(z)
Figure 3.1 Three types of solutions of the Schrodinger equation for a one-dimensional well of
arbitrary form.
wavefunction into the Schrodinger equation (3.1), one gets the relationship between the
electron wavevector and its energy,
-h2
p2
kx2 + k 2y + k z2 =
,
(3.7)
E = -h =
2m
2m
which coincides with the classical relationship between the particles momentum, p, and
its energy, E.
In general, the exact value of the energy E characterizes the system only for the socalled stationary-state case, when the potential energy and, therefore, the Hamiltonian
do not depend on time. In addition, for a stationary situation, a determination of energy
requires an infinite time of observation (measurement). If this time, !t, is finite, the
inaccuracy of a measurement of the energy, !E, should satisfy the following inequality:
!E !t h.
(3.8)
This is the uncertainty relation between the energy and the time. Inequalities (2.47) and
(3.8) constitute fundamental uncertainty relations in quantum physics.
Equation (3.5) has the form known as an eigenvalue equation. The energy, E, is its
eigenvalue and the wavefunction, (r), is its eigenfunction. The eigenvalue E may run
over discrete or continuum values, depending on the shape of the potential function,
V (r), and the boundary conditions.
In order to illustrate both possible types of solutions of the Schrodinger equation
and energy states, as well as to clarify the tasks which arise, let us consider the onedimensional problem for a system with potential energy V (r) = V (z) shown in Fig. 3.1.
Here the vertical axis depicts the energy E, and z represents only one space coordinate.
The potential has a negative minimum at z = 0, tends to zero as z , and saturates
to a finite value V at z . This potential is the most general form of a potential well.
At this point, a short qualitative discussion serves to emphasize that the boundary conditions define the type of solution. These solutions will be obtained and discussed in more
36
Wave mechanics
detail in due course. Among the possible solutions of the Schrodinger equation (3.5) with
the chosen potential, there can exist solutions with negative energy E = E i < 0. The
wavefunction corresponding to energy E = E 1 < 0 is represented in Fig. 3.1 by curve 1.
One of the peculiarities of solutions with negative energy is that the spatial region with
classically allowed motion, where the kinetic energy p2 /(2m) = E V (z) > 0, is certainly restricted. In the classically forbidden regions where V (z) > E, the -function
decays as |z| . The states of particles, like those just described, are the so-called
bound states and they are characterized by a discrete energy spectrum. Consider next the
possible solutions for the energy range 0 E V as shown by the line labeled E 2 in
Fig. 3.1. These solutions exist for any values of E; they are finite as z and penetrate slightly into the barrier region V (z) > E as shown by curve 2 in Fig. 3.1. In the limit
z , these solutions may be represented as a sum of two waves traveling in opposite directions: one wave is incident on, and the other is reflected from, the barrier. For
each energy in the range 0 E V there is only one solution satisfying the physical
requirements. For any energy E > V there exist two independent solutions; see curve 3
corresponding to the energy E 3 . One solution may be chosen in the form of the wave
propagating from left to right. At z it has only one component, namely the wave
overcoming the barrier, and as z there is a superposition of incident and reflected
waves. It must be emphasized that the wave reflected from the barrier, when its energy
exceeds the height of the barrier, arises only in quantum physics. The other wavefunction can be chosen in the form of waves propagating from right to left. The latter is an
example of continuum energy spectra with energies E > V when wavefunctions are
finite far away from the potential relief. These considerations demonstrate the importance of boundary conditions for Eq. (3.5): if the decay of wavefunctions (i.e., 0 at
z ) away from the potential well is required, the discrete energies and the bound
states can be found and they are of principal interest; if the boundary conditions correspond to the incident wave, continuous energy spectra will be obtained. To conclude
the discussion of boundary conditions, let us mention that we often use potential energies with discontinuities. In this case, at the point of discontinuity of the potential, we
should require both continuity of the wavefunction and continuity of the derivative of the
wavefunction with respect to coordinates.
Since the Schrodinger equation is linear, it is clear that, if a function is a solution
of the equation, then any function of the form of constant is also a solution of the
same equation. To eliminate this ambiguity, we have to take into account the probabilistic
character of the wavefunction. Indeed, if a physical system is enclosed in a finite volume,
the actual probability of finding a particle in this volume must equal 1, i.e.,
(3.9)
|(r, t)|2 dr = 1.
Equation (3.9) is called the normalization condition. It provides the condition needed to
determine the constant multiplicative factor of the wavefunction for the case of a system
of finite size.
If an infinite volume is under consideration and the integral of Eq. (3.9) does not exist,
there are other normalization conditions instead of Eq. (3.9). Consider, for example, the
3.2 The Schrodinger
wave equation
37
so-called scattering problem, in which electrons come from infinity and are scattered
by a local potential. This case corresponds to energies E2 and E 3 for particle motion in
the potential sketched in Fig. 3.1. For this problem one can assume the incident wave
to be a plane wave with a given amplitude A: (z, t) = Aeikz eiEt/h . Then, the amount
of scattered waves will be proportional to the amplitude A, due to the linearity of the
Schrodinger equation. Very often the last condition is referred to as an initial condition
instead of as a boundary condition because we are dealing with the state before the
scattering and states after the scattering.
In an overview of quantum mechanics it is very useful to introduce the density of
particle flow, i. In classical physics, the density of flow is a vector, which specifies the
direction of particle flow and has a modulus equal to the number of particles crossing a
unit area perpendicular to the area per unit time. In quantum mechanics, this quantity is
given by the following formula
ih
( ).
(3.10)
i=
2m
For example, the density of flow of particles described by the plane wave of Eq. (2.43)
is straightforwardly found to be
-hk
p
i=
|A|2 = |A|2 = v|A|2 .
m
m
(3.13)
38
Wave mechanics
then the expectation value of the physical observable a coincides with the eigenvalue:
a = a. For example, if the wavefunction is the solution of the Schrodinger equation
(3.5), one may calculate the mean energy, E:
= H
E dr = E.
E = H
(3.14)
E
However, if a particle were in a state with no well-defined energy, say a state that is
characterized by the superposition of the solutions Ei ,
Ci Ei ,
(3.15)
=
i
(3.16)
3.3
V=
39
V=
L
2
Wavefunctions, yn(z)
Quantized energies, en
n=4
n=3
n=2
n=1
0
L
2
L
2
L
2
Figure 3.2 Solutions of the Schrodinger equation for a quantum well with infinite barriers.
Eigenenergies, n , are defined as n = n 2 1 , where 1 = -h- 2 2 /(2m L 2 ).
(3.17)
i.e., the wavefunction is a plane wave propagating in the {x, y}-plane with the amplitude
modulated in the z direction. By substituting Eq. (3.17) into Eq. (3.1), we find the onedimensional equation
-2 2
h d
+
V
(z)
(z) = 0,
(3.18)
2m dz 2
where
=E
-h2 k 2
||
2m
(3.19)
Here E, -h2 k||2 /(2m), and can be interpreted as the total energy, the kinetic energy of
free motion along the x- and y-coordinates, and the energy associated with motion
in the potential V (z), respectively. Equations (3.17) and (3.18) demonstrate for the case
being considered that the motion along the z-coordinate is independent of motion in the
two other directions and can be analyzed in terms of one-dimensional motion.
40
Wave mechanics
a one-dimensional well with infinitely high potential barriers, namely V (z) = 0 inside
of the well and V (z) = outside of the well:
0,
for |z| < L/2,
V (z) =
(3.20)
+, for |z| > L/2.
The spatially dependent part of the wavefunction (z) satisfies the time-independent
Schrodinger equation (3.18):
-2 2
h d
(z) = 0.
(3.21)
2m dz 2
In the general case, the solution to Eq. (3.21) is
(z) = AeiK z + BeiK z ,
where we have introduced
K =
2m
-h2 .
(3.22)
(3.23)
Impenetrable walls are viewed as boundaries that exclude the particle. Specifically, for
the case of impenetrable walls, the probability of finding the particle at z L/2 or
z L/2 is zero, i.e.,
(L/2) = (L/2) = 0.
(3.24)
These are, thus, the boundary conditions for the wavefunction of a particle in a quantum
well with impenetrable walls. By applying these boundary conditions to the wavefunction
(3.22), we obtain
AeiK L/2 + BeiK L/2 = 0
for z = L/2,
for z = L/2.
(3.25)
A nontrivial solution of this algebraic system of equations exists if, and only if the
following determinant is equal to zero:
iK L/2 iK L/2
e
e
iK L/2 = 0,
eiK L/2
e
which results in
sin(K L) = 0
or
K L = n.
K n = n, n = 1, 2, 3, . . .;
L
(3.26)
(3.27)
n = 0 is excluded because for K 0 = 0 Eqs. (3.25) and (3.22) give (z) 0. From
Eq. (3.23) and (3.27), we find possible energies of this particle:
-h2 2
-h2 K 2
n
n =
=
n2.
(3.28)
2m
2m L 2
41
By substituting Eq. (3.27) into Eq. (3.25), it is easy to check that at a given n the
relationship between coefficients A and B is B = ei n A = (1)n+1 A. Thus, if n is an
odd integer number, we obtain symmetric wavefunctions from Eq. (3.22):
nz
2
cos
, n = 1, 3, . . .
(3.29)
n (z) =
L
L
If n is an even integer, we obtain anti-symmetric wavefunctions:
nz
2
n (z) =
sin
, n = 2, 4, . . .
(3.30)
L
L
In these functions, the factor 2/L arises as a result of the normalization condition:
|n (z)|2 dz = 1.
(3.31)
The integer n is called the quantum number. Actually, we see that the energies n do
not depend on the sign of the quantum number n. The same is valid for the physically
important quantity |n (z)|2 . Thus, we can use only, say, positive quantum numbers,
n > 0. Let us write down the four lowest states explicitly:
z
-h2 2
2
n = 1,
1 =
,
=
cos
;
1
2m L 2
L
L
2 z
2
2 =
sin
;
n = 2,
2 = 41 ,
L
L
3 z
2
cos
;
n = 3,
3 = 91 ,
3 =
L
L
2
4 z
n = 4,
4 = 161 ,
4 =
sin
.
(3.32)
L
L
These solutions allow one to draw some important conclusions.
First, the energy spectrum of the particle confined by a potential well is discrete. That
is, instead of a continuous change of energy as is characteristic in classical physics
a quantum particle placed in a well may have only certain discrete energy values. In
other words, the energy spectrum becomes quantized, as shown in Fig. 3.2. Often, one
refers to this type of the spectrum as a set of discrete energy levels. Interestingly, for the
case being considered, the interlevel distances, n+1 n , increase with the number of
levels n.
Second, the lowest energy level (usually called the ground state) is not zero; it is finite.
That is, the particle can not have zero energy! Actually, this is a direct consequence of
the uncertainty principle. Indeed, a particle placed in a well is localized in a space region
of size L, i.e., the uncertainty !z is less than or equal to L. According to Eq. (2.47),
such a localization leads to an uncertainty in the momentum !p !pz h/L. Then,
the momentum of the particle can be estimated by p !p, i.e., the momentum is not
zero and there is a non-zero total energy.
Third, the wavefunctions are, in fact, standing waves, and strictly integer numbers of
half-waves may exist between these impenetrable walls. This result is mathematically
Wave mechanics
V = Vb
V = Vb
Quantized energies, en
42
L
2
e3
e2
e1 0
0
L
2
Figure 3.3 Eigenenergies of a particle in a quantum well with finite potential barriers.
similar to the case of the quantization of standing elastic waves between rigid walls
depicted in Fig. 2.4.
- 2,
2m/h
(3.34)
(3.35)
and C and D are arbitrary constants. Outside the well the solution has the form
(zL/2)
, for z L/2,
Ae b
(3.36)
(z) =
for z L/2,
Beb (z+L/2) ,
43
(3.38)
(3.41)
(3.42)
sin(k w L/2) = kw /k0 , for cot(kw L/2) < 0,
Wave mechanics
LR
n=4
1
n=3
slope k01
n=2
n=1
0
kw,1
p
L
2p
3p
4p
kw
Figure 3.4 A graphical solution for Eqs. (3.41) and (3.42). Solutions correspond to the
intersections of the curves. The dashdotted line corresponds to small 0 , resulting in only one
solution of Eq. (3.41). The dotted line corresponds to a critical value of 0 at which a second
level appears in the well as a result of the solution of Eq. (3.42). The solid line corresponds to an
intermediate value of 0 leading to four solutions.
44
15
(a)
10
(b)
100
200
300
L (A)
Figure 3.5 The number of bound states of a square well plotted as a function of the well
kw = k0 = /L, as is shown by the dotted line in Fig. 3.4, with energy slightly below
Vb , (2 Vb ). When k0 increases further, the first and second levels become deeper and
a
third level occurs in the well, and so on. Indeed, new levels occur when the parameter
(2mVb L 2 )/( 2 -h2 ) becomes an integer.
Figure 3.5 depicts the number of bound states as a function of well thickness for two
specific values of Vb and for two particle masses. The parameters chosen are characteristic
for artificial quantum wells fabricated on the basis of AlGaAs/GaAs materials (see the
only a few energy
next chapter). One can see that for thin quantum wells (L < 100 A)
levels can occur.
The first example we discussed is related to an infinitely deep well, when k0 . In
this case, the slope of the linear function in Fig. 3.4 tends to zero, solutions correspond
to kw = n/L, and the energy levels are given by Eq. (3.28) as discussed earlier.
Thus, as for the first example, for a quantum well with walls of finite height, we have
also found discrete energy levels. The lowest energy level (the ground state) is also
45
non-zero. However, the number of discrete levels is finite. Another important phenomenon is the penetration of a particle under a barrier. Equation (3.36) describes
the wavefunction under a barrier. To explain this effect, consider the energy (2.9) for a
classical particle: = p2 /(2m) + V (z). Since p2 is always positive, particle motion is
possible in the spatial region with V (z) 0. For < Vb , this classically allowed
region coincides with the region within the quantum well. The motion of such a classical particle is always restricted to a finite interval of the coordinates and we never find
the classical particle outside of the region where its total energy is less than the potential energy: i.e., a classical particle treats any barrier as an impenetrable barrier. The
quantum-mechanical analysis shows that the wavefunctions are finite at any coordinate
and we can find a particle even in classically forbidden regions, or, as one says, under
the barrier.
The effect of penetration of a particle under a barrier is the so-called tunneling effect. It
is principally a quantum-mechanical phenomenon. The probability of finding the particle
rapidly decreases as we move away from the classically allowed interval.
-h
d 2
- ), when the second term in
Consider first at very large coordinates: 2 2/(h
V (z) =
(3.45)
Wave mechanics
Energy
46
5hw
2
e2 =
n=2
classical
turning point
3hw
e1 = 2
n=1
e0 =
n=0
hw
2
Figure 3.6 Eigenenergies and eigenfunctions of a particle in a parabolic quantum well (harmonic
oscillator).
( ) = e
/2
P( ).
It can be shown that, to satisfy the normalization condition (3.31) for wavefunctions,
the unknown function P( ) should be a polynomial of a special form, which is called a
Hermite polynomial. This is possible only at certain values of the energy:
1
, n = 0, 1, 2, . . .
(3.46)
n = h n +
2
For each value of n, the function P( ) can be calculated by using derivatives:
Pn ( ) = (1)n e
dn 2
e .
d n
47
As in the previous example, the particle penetrates, i.e., tunnels under, a barrier of finite
value. For comparison, we indicate that the classically allowed region of motion for the
particle is restricted by the two crossings of the curve V (z) and the line E = constant,
as illustrated by Fig. 3.6.
Concluding these examples of particles placed into confining potentials, we emphasize
the fundamental significance of Plancks constant, -h: all quantized energies and interlevel
distances are proportional to -h. If we formally suppose that -h 0, the effects of energy
quantization, tunneling, etc., will disappear. That is, this limit corresponds to classical
mechanics.
48
Wave mechanics
e3
e2
e1
0
ky
kx
Figure 3.7 An example of two-dimensional subbands. The subbands are represented by the
49
A
L
B
A
Figure 3.9 A three-layer structure where layers A create potential barriers for electrons from
layer B.
illustrates a heterostructure system forming quantum wells, which are important for
nanoelectronics.
Quite similarly, confinement of a particle in two directions leads to additional energy
quantization and leaves only one degree of freedom for particle-wave propagation. The
energy for this case is
-h2
k2,
E = E n 1 ,n 2 ,kx = n 1 ,n 2 +
(3.50)
2m x
where the quantized energy n 1 ,n 2 depends on two integer quantum numbers, n 1 and n 2 ,
and motion along the propagation direction (assumed to be the x direction) is characterized by the one-dimensional wavevector k x . Thus, the spectrum consists of a set of
one-dimensional subbands as shown in Fig. 3.10 (compare the imaginary slices of the
parabaloids of Fig. 3.7 at fixed values of k y with the parabolas of Fig. 3.10). Such a quantized particle is free in one dimension and the corresponding artificial structure is called
a quantum wire. Currently, there exist several technological methods for the fabrication
50
Wave mechanics
e13
e12
e11
0
kx
Figure 3.10 An example of one-dimensional subbands. The subbands are presented by the
Figure 3.11 A scanning electron microscope image of free-standing InP quantum wires. Reprinted
with permission from Thomas Martensson, Patrick Carlberg, et al., Nanowire arrays defined by
C American Chemical Society.
nanoimprint lithograph, Nano Lett., 4 (4), 699702 (2004).
of quantum wires. For illustration, in Fig. 3.11, an array of free-standing quantum wires
made from InP is presented.
In the following sections we will show that transformation of spectra from threedimensional to low-dimensional energy subbands drastically changes the behavior of
the particles.
51
z
Lz
Lx
Ly
x
Figure 3.12 A sketch of a quantum box, L x L y L z , embedded in a matrix.
52
Wave mechanics
numbers, n 1 , n 2 , and n 3 , that substitute for the three components of the wavevector, k:
k x , k y , and k z .
The discrete spectrum in a quantum box and the lack of free propagation of a particle
in any direction are the main features distinguishing quantum boxes from quantum wells
and quantum wires. As is well known, these features are typical for atomic systems as
well.
The similarity with atoms is seen from another example of a potential the spherical
dot. In this case, the potential can be modelled as
0,
for r R,
V (r) =
(3.53)
+, for r R.
It is instructive to compare this result with Eq. (3.20). We assume that the center of the
dot coincides with the center of coordinates; r is the magnitude of the radius-vector, r,
and R is the radius of the spherical dot.
As we have found above, a quantum number (continuous or discrete) can be attributed
to each of the degrees of freedom of the particle. For a problem with spherical symmetry,
like that given by Eq. (3.53), it is convenient to introduce the following three degrees
of freedom: radial motion (along the vector coordinate) and two rotations, which can
be described by two polar angles, and . As in the case of a parallelepiped-shaped
quantum dot, the three degrees of freedom will bring about three quantum numbers. One,
associated with the motion along the radius, r , substitutes for the wavevector component
kr , and the two other quantum numbers, associated with rotations, l and m, substitute
for the wavevector components k and k .
The electron wavefunction in a spherical quantum dot can be found in the form
n,l,m (r, , ) = Rn (r )Ylm (, ),
(3.54)
where Rn (r ) is the radial function and Ylm (, ) are the so-called spherical functions,
which are angle-dependent. We will consider the simplest case of a wavefunction with
spherical symmetry, which corresponds to Y00 = 1. For spherical symmetry both quantum numbers associated with rotations are equal to zero: l = m = 0; this corresponds to
k = k = 0, i.e., the wavevector, k, is parallel to the radius, r. On rewriting the radial
function as Rn (r ) = n (r )/r we find the Schrodinger equation for n (r ) to be similar to
that in Cartesian coordinates:
-2 2
h
E n (r ) = 0.
2m r 2
Thus, as a result of the spherical symmetry the problem reduces formally to a onedimensional equation. For the case in which there is an infinitely high barrier surrounding
the quantum dot, the wavefunction vanishes for r > R and it follows that Rn (r ) = 0 for
r > R. One can obtain the solutions of this equation as
A sin(kwr )
Rn (r ) =
,
(3.55)
r
ir
it
Vb
iin
L
2
53
L
2
is the same as Eq. (3.26) but with KL replaced by kw R. So, kw is quantized: kw = n/R
(compare this with Eq. (3.27) and its analogy with Eq. (3.28)) One can obtain
E n,0,0 =
-h2 2 n 2
,
2m R 2
n = 1, 2, . . .,
(3.56)
where zero indexes show that the values of l and m are equal to zero. This series of levels
is the same as for the one-dimensional quantum well with infinitely high barriers.
ir
i in
and
T () =
it
.
i in
(3.58)
54
Wave mechanics
In wave mechanics, we have to solve Eq. (3.18) with the potential (3.57). The solutions
are
ikz
z L/2,
e + r eikz ,
z
z
(z) = ae
(3.59)
+ be , L/2 z L/2,
ikz
te ,
z L/2.
To the left of the barrier we introduce the incident wave with a unit magnitude, eikz and
a reflected wave, r eikz . To the right of the barrier there is only a transmitted wave, teikz .
Then, the exponential factors are
2m 0 (Vb )
2m 0
k=
and
=
.
-h
-h
For < Vb , is a real number, whereas for > Vb , is an imaginary number. The
parameters r, a, b, and t are still arbitrary functions of , which we find by matching
these solutions and their derivatives at the walls of the barrier, z = L/2. The fluxes at
z can be calculated in terms of r and t: i in = 1, i r = |r |2 , and i t = |t|2 . Omitting
the procedure for calculation of r and t, we write down the obvious relationship
R + T = 1,
and the two different results for the two energy intervals are as follows:
1
T =
, < Vb ,
2
2 2
k +
2
1+
sinh (2 L)
2k
1
T =
, > Vb .
2
2
k K2
2
1+
sin (2K L)
2k K
(3.60)
(3.61)
16k 2 2 4 L
e
.
(k 2 + 2 )2
(3.62)
Thus, the tunneling effect is determined primarily by the exponential factor. According
to Eq. (3.59) for the wavefunction, the probability of finding the particle under the barrier
is also exponentially small.
For the second case, corresponding to an energy greater than the barrier height, when
classical physics gives strictly T = 1, from the quantum-mechanical equation (3.61), we
1.0
(a)
(b)
0.9
0.8
10
0.7
20
0.6
0.5
0.4
0.3 eV
0.3 eV
1.0
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0.0
0.0
55
0.3
0.2
Barrier height
0.3
0.6
0.9
1.2
1.5
e (eV)
1.8
2.1
2.4
0.1
0.0
0.0
Barrier height
0.3
0.6
0.9
1.2
1.5
e (eV)
1.8
2.1
2.4
Figure 3.14 Dependences of the transmission coefficient, T , on electron incident energy, , for
The height of the barrier
various thicknesses of the barrier, L: (a) L = 10 A and (b) L = 20 A.
Vb = 0.3 eV.
find that T reaches 1 only at sin(2K L) = 0, i.e., at certain resonance energies, when
2K L = n, with n being an integer. Otherwise, T < 1 with minima at 2K L = (n + 12 ) .
All of this means that there exists a reflection of the particle even at large energy.
Figures 3.14(a) and 3.14(b) illustrate the transmission coefficient for the barrier shown in
the inset. Both energy intervals, 0 < < Vb and > Vb , are presented. The transmission
coefficient for < Vb is finite due to tunneling, but it is small; whereas for > Vb the
coefficient is close to 1. Oscillations in T with energy are explained by another quantum
effect overbarrier reflection of the particles.
Concluding this analysis, we emphasize the great significance of quantum effects
in continuous energy spectra, and that these effects determine the basic electrical and
optical properties of nanostructures.
3.4
1 e2
,
4
0 r
(3.63)
where
0 is the permittivity of free space, e is the elementary electrical charge, and r is
the distance between proton and electron. The negative sign in Eq. (3.63) indicates that
the electron and the proton are attracted to each other.
The potential of Eq. (3.63), shown in Fig. 3.15, has some similarities to that of a
quantum dot with impenetrable walls. This similarity allows us to suppose that the energy
of the electron in the hydrogen atom would be quantized. There are two major differences
Wave mechanics
20
15
10
r (A)
10
15
20
n=3
n=2
5
E (eV)
56
10
n=1
15
between these potentials: (i) the profiles of the potentials (compare Eqs. (3.53) and (3.63))
and (ii) the reference points for zero energy. Namely, the zero energy was located in the
quantum dot and the electron should have infinite energy to escape into the potential
walls, whereas in the hydrogen atom the energy of a free electron is taken to be zero at
r = and the energy at r = 0 is equal to . For further analysis of the hydrogenatom model, the free-electron mass, m 0 , is used in this section because the motion of
nuclei can be neglected due to the fact that the mass of the nucleus is almost 2000 times
greater than the free-electron mass.
The ground state of the hydrogen atom may be described by the so-called radially
symmetric function, (r ):
(r ) =
1 r/r0
e
,
r0
(3.64)
where r0 is the characteristic radius of the ground state. It can be estimated as follows.
Let us evaluate the kinetic energy of an electron confined in the sphere of radius r0 by
using the uncertainty conditions of Eq. (2.47): p -h/r0 ; then, the kinetic energy can be
evaluated as
-h2
p2
.
2m 0
2m 0 r02
(3.65)
-h2
p2
1 e2
=
+
.
2m 0
4
0 r0
2m 0 r02
(3.66)
To find the r0 that corresponds to the minimum of the total energy, H, we find the first
derivative of H(r0 ) with respect to r0 and equate the result to zero:
-2
1 e2
2h
dH(r0 )
=
= 0.
dr0
4
0 r0 2
2m 0 r03
(3.67)
57
4
0 -h2
,
m 0 e2
(3.68)
which is called the Bohr radius. Now we can find the energy of the ground state E 1 :
E1 =
1 e2
m 0 e4
=
-2.
4
0 2r0
(4
0 )2 2h
(3.69)
Numerical estimates give the following values for the energy and the radius of the
(1 A
= 1010 m =
ground state of the hydrogen atom: E 1 = 13.5 eV and r0 = 0.529 A
0.1 nm).
An arbitrary atomic state is characterized by the following three quantum numbers,
n, l, and m:
principal number
orbital number
magnetic number
n = 1, 2, 3, . . .;
l = 0, 1, 2, . . ., n 1; or l = s, p, d, . . .;
m = 0, 1, 2, . . ., l .
The series of energy levels for the hydrogen atom is the following: 1s, 2s, 2p, 3s, 3p,
3d, . . . The ground-state wavefunction of Eq. (3.64) corresponds to the case of n = 1
and l = m = 0. This wavefunction has the eigenenergy E 1 of Eq. (3.69). The general
expression for the nth energy level of the hydrogen atom is given by
En =
E1
.
n2
(3.70)
Importantly, the energy spectrum of the hydrogen atom depends on only one discrete
number n and does not depend on the orbital and magnetic numbers l and m. This means
that some states of the atom can have the same energy. This situation is called degeneracy
of the states.
Results from a number of detailed optical experiments have supported this classification of energy spectra of hydrogen atoms and confirmed the energy values with high
accuracy. However, it turns out that nature is more complicated. Additional non-optical
experiments revealed a new property of quantum systems, which has no analogue in
classical physics.
Indeed, in classical physics a particle with angular momentum l can have a magnetic
dipole moment M = l with the coefficient called the gyromagnetic ratio. The latter
parameter is associated with the fundamental parameters of the electrons:
=
e
,
2m 0 c
(3.71)
where e, m 0 , and c are the elementary charge, free-electron mass, and speed of light,
respectively. If an external magnetic field, B, is applied, an additional potential energy
58
Wave mechanics
(3.72)
Let the magnetic field be nonuniform, say it depends on the z-coordinate. Then the force
acting on the particle will be
fz =
B
UM
= Mz
.
z
z
(3.73)
As a result, a beam of atoms with magnetic moments will split into components corresponding to different values of Mz . In special experiments, known as SternGerlach
experiments, with hydrogen atoms in the ground s state, when l = m = 0 and thus
Mz = 0, a splitting of the beam into two symmetrically deflected components has been
detected. It provided evidence of the existence of a dipole magnetic moment, which we
have not taken into account yet. It has been assumed that the electron has an intrinsic
angular momentum which is responsible for the results of the experiment. This intrinsic angular momentum S is called spin. The projection of the spin, Sz , of an electron
in the magnetic field, B, takes on the half-integer values + 12 and 12 . The spin effect
explains also the results of a number of other experiments with electrons. Moreover, it
has been found similarly that some spin characteristic can be attributed to any particle.
For example, photons light quanta have a spin equal to 1.
Thus, the complete description of an energy state of the hydrogen atom should include
additionally the spin number Sz . Now the series of states including the spin degeneracy can be written down as 1s(2), 2s(2), 2p(6), 3s(2), 3p(6), 3d(10), . . . The numbers in
brackets indicate the degeneracies of the states. For example, the nomenclature 2p(6)
describes six states of 2p type (l = 1) with the same energy, but with different spins ( 12 )
and different magnetic numbers (m = 1, 0, +1).
Spin is a very important characteristic. It is responsible for a number of quantum
effects. Moreover, it determines the character of the population of different energy levels
by particles: in a system consisting of identical electrons only two (with different spins)
or fewer electrons can be found in the same state at the same time. Since spin is included
among the quantum numbers, two electrons with opposite spins and all other quantum
numbers being the same should be considered as electrons in different quantum states.
The Pauli exclusion principle can be reformulated as follows: no two electrons can be
in the same state. This is the so-called Pauli exclusion principle. This is one more quantum principle. The Pauli exclusion principle is especially important for many-electron
systems.
The classification of energy levels used for the hydrogen atoms may be applied to
many-electron atoms. In the case of many-electron atoms, we should take the nuclear
charge as Z e, where Z is the atomic number in the Periodic Table of elements (Fig. 3.16),
and take into account the Pauli exclusion principle. The electronic structure of complex
atoms can be understood in terms of the filling up of energy levels. One can introduce
internal energy shells and external energy shells (or valence energy states). The latter
are filled up by the outermost valence electrons and play the major role in determining
the chemical behavior of complex atoms.
59
17
18
13
14
15
16
He
1.008
1.008
4.003
Li
Be
6.941
9.012
Transition Metals
10
Ne
10.81
12.01
14.01
16.00
19.00
20.18
13
14
15
16
17
18
10
11
12
Al
Si
Cl
Ar
24.30
26.98
28.09
30.97
32.06
35.45
39.95
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
11
12
Na
Mg
22.99
19
36
Ca
Sc
Ti
Cr
Mn
Fe
Co
Ni
Cu
Zn
Ga
Ge
As
Se
Br
Kr
39.10
40.08
44.96
47.87
50.94
52.00
54.94
55.85
58.93
58.69
63.55
65.39
69.72
72.61
74.92
78.96
79.90
83.80
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
Rb
Sr
Zr
Nb
Mo
Tc
Ru
Rh
Pd
Ag
Cd
In
Sn
Sb
Te
Xe
85.47
87.62
88.91
91.22
92.91
95.94
(97.91)
101.1
102.9
106.4
107.9
112.4
114.8
118.7
121.8
127.6
126.9
131.3
55
56
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
Cs
Ba
Lu
Hf
Ta
Re
Os
Ir
Pt
Au
Hg
Tl
Pb
Pi
Po
At
Rn
132.9
137.3
180.9
190.2
197.0
200.6
204.4
207.2
209.0
175.0
178.5
183.8
186.2
192.2
195.1
87
88
103
104
105
106
107
108
109
110
Fr
Ra
Lr
Rf
Db
Sg
Bh
Hs
Mt
111
112
(223.0) (226.0) (262.1) (261.1) (262.1) (263.1) (264.1) (265.1) (266.1) (269.1) (272.1) (277.1)
57
Lanthanides
Actinides
58
59
60
61
62
63
64
65
66
67
68
69
70
La
La
Pr
Nd
Pm
Sm
Eu
Gd
Tb
Dy
Ho
Er
Tm
Yb
138.9
140.1
140.9
144.2
(144.9)
150.4
152.0
157.2
158.9
162.5
164.0
167.3
168.9
173.0
89
90
91
92
93
94
95
96
97
98
99
100
101
102
Ac
Th
Pa
Np
Pu
Am
Cm
Bk
Cf
Es
Fm
Md
No
(227.0)
232.0
231.0
238.0
(237.0)
(244.1) (243.1)
(247.1)
(247.1)
(251.1)
(252.1)
(257.1)
(258.1)
(259.1)
While the energy levels of the hydrogen atom exhibit some degree of degeneracy,
i.e., they are independent of the angular momentum described by the orbital number l
and its projection described by the magnetic number m, in many-electron atoms, the
energies depend not only on the principal quantum number n, but also on the orbital
number l. Therefore, the electrons can be subdivided into subshells corresponding to
different orbital numbers l. The spectroscopic notations for these subshells are the same
as above: s, p, d, f, . . . For a given l there are 2l + 1 values of m and two values of
the spin Sz = 12 . Thus, the total number of electrons that can be placed in subshell l is
2(2l + 1). Periods of the Periodic Table of elements are constructed according to filling
of the shells. Hydrogen, H, is the first element of the table. It has one electron occupying
the 1s state, i.e., l = 0. The next atom is helium, He. It has two electrons, which occupy
the lowest s states in the same shell, but have opposite spins; this is referred to as the 1s2
electron configuration. So, the 1s shell is completely filled by two electrons.
The next period of the table begins with lithium, Li. It has three electrons, with the
third electron sitting on the 2s-type level. Thus, we may immediately represent the
electron configuration of Li as 1s2 2s1 . After lithium follows beryllium, Be, with the
configuration 1s2 2s2 . In the next element boron, B, the fifth electron begins to fill up
the next subshell: 1s2 2s2 2p1 . For the p shell, we have l = 1 and m = 1, 0, 1. Thus,
the valence electron in Be can take three values of the magnetic quantum number, m,
and two values of the spin, Sz . In general, we may put six electrons in the p shell, and
so on. For nanoelectronics the elements of group IV and groups III and V that constitute semiconductor crystals are important: silicon (Si, whose atomic number, and
number of electrons, is 14), germanium (Ge, atomic number 32), gallium (Ga, atomic
60
Wave mechanics
(a)
yA
yB
(b)
+
Higher level
yAB
(c)
nucleus nucleus
Bonding orbital yAB
Figure 3.17 Formation of bonding and antibonding atomic orbitals (two atoms).
number 31), arsenic (As, atomic number 33), etc. Their electron configurations are as
follows:
group IV:
Si :
group III:
Ga:
group V:
As:
Ge:
One can see that, both for Si and for Ge, the outer valence electron configurations are
very similar (the same number of electrons in the same p state). Thus, we can expect that
their chemical and physical properties are also similar.
Atoms with completed subshells are stable and chemically inert. Examples are helium,
He, with the configuration 1s2 2s2 ; neon, Ne, with the configuration 1s2 2s2 2p6 ; and argon,
Ar, with the configuration 1s2 2s2 2p6 3d10 4s2 4p6 .
In general, atoms with not completely filled-up subshells, that is, those which have
valence electrons, are expected to be chemically active, i.e., they can form chemical
bonds. Coupling between atoms in molecules and solids is determined by the type of
the electron wavefunctions. Often, these wavefunctions are called atomic orbitals. For
example, the wavefunction of a two-atom molecule should be a combination (a hybrid)
of two orbitals. Such a hybrid can be either a symmetric function, or an antisymmetric
one. Let this hybrid be a symmetric function (constructive interference of the orbitals), as
shown in Fig. 3.17(c). Then, there is a finite probability of finding the electron between
the nuclei. The presence of electrons between two nuclei, A and B, attracts both nuclei and
keeps them together. The corresponding state is called a bonding state of two atoms and
the corresponding wavefunction, AB , is called the bonding orbital. The antisymmetric
hybrid (destructive interference of the orbitals) does not bond atoms and is called an
antibonding state, see Fig. 3.17(b). The simplest example of the bonding effect is given
by the two-atom hydrogen molecule: the bonding state constructed from s states of
the hydrogen atoms corresponds to an energy lower than the energy of two uncoupled
(c)
61
y
x
pz orbital
py orbital
(a)
x
s orbital
(b)
(d)
+
y
y
x
+
px orbital
hydrogen atoms. This results in a stable H2 molecule. The antibonding state corresponds
to an energy greater than the energy of two free hydrogen atoms. In the antibonding state
the atoms repel each other.
Actually, the formation of bonding and antibonding states of two hydrogen atoms is
a particular example of a more general and very important result: when two identical
atoms approach each other closely, their energies change and, instead of a single atomic
(degenerate) energy level, two levels arise, as illustrated in Figs. 3.17(b) and 3.17(c). This
is the so-called energy splitting. Thus, bonding and antibonding orbitals correspond to
these two levels. Three atoms drawn together are characterized by three close levels, etc.
Now, consider in detail atoms of group IV. As presented above, for these atoms the
outer shell has four electrons: two in s states and two in p states, i.e., their configuration
can be thought as a core plus the s2 p2 shell. Thus, atoms of group IV can form four bonds
with other atoms in molecules or solids (one bond for one electron). The wavefunctions
of these outer-shell states have very different spatial configurations. Figure 3.18 illustrates the wavefunctions (orbitals) for an s state and for p states with different angular
momentum projections m. Thus, the s orbital is spherically symmetric without any angular dependence. The 3p orbitals are anisotropic and can be considered as perpendicular
to each other. The hybrid orbitals forming bonds with other atoms have two electrons
with opposite spins. In fact, because of interaction between electrons the s and p orbitals
overlap significantly and realistic bonds are formed as linear combinations of s and p
orbitals, which are called sp3 hybridized orbitals. Importantly, when the three directed p
orbitals are hybridized with the s orbital, the negative parts of the p orbitals are almost
cancelled out, so that mainly positive parts remain, as shown in Fig. 3.19(a). Thus, the
hybridized orbitals present directed bonds in space, as shown in Fig. 3.19(b). These
directed bonds of atoms are responsible for the particular crystal structures of elements
62
Wave mechanics
z
+
y
x
s orbital
sp3 orbital
+
+
y
(a)
(b)
py orbital
Figure 3.19 Formation of hybridized sp3 orbitals (a) and four sp3 hybridized orbitals in Si (b).
in group IV. In the next chapter, we will analyze these crystals using the definitions
discussed above.
3.5
Closing remarks
In this chapter we have presented basic definitions and equations of quantum (wave)
mechanics. We found that two cases of particle motion should be distinguished: (i)
quantized motion always occurs with in a finite volume (for example, in a potential
well); (ii) free motion occurs in an infinite space (practically, in a large volume). We
discussed the differences in particle behavior between these two cases.
We analyzed simple instructive examples that illustrate the main qualitative features of quantum physics. Then, we briefly discussed the electronic properties of
atoms, which we should use for the analysis of nanoelectronic systems. Many of the
basic quantum-mechanical concepts introduced above will be applied in the following chapters to study semiconductor nanostructures and the principles of operation of
nanodevices.
For those who want to look deeper into the basic principles of quantum physics we
recommend the following textbooks:
L. Schiff, Quantum Mechanics (New York, McGraw-Hill, 1968).
D. Saxon, Elementary Quantum Mechanics (San Francisco, CA, Holden-Day, Inc.,
1968).
R. P. Feynman, R. B. Leighton, and M. Sands, Lectures on Physics, vol. 3 (New York,
Addison-Wesley, 1964).
The problems presented below illustrate the main definitions given in this chapter and
allow one to practice with the simplest quantum-mechanical examples.
3.6 Problems
3.6
63
Problems
1. Suppose that a particle is placed into a region of space of volume V . Using the
normalization condition of Eq. (3.9), find the amplitude A of the plane wave (2.43).
2. Assume that the wavefunction is a superposition of two plane waves of the same
energy: = 1 + 2 with 1,2 = A1,2 ei(k1,2 rt) . Calculate the probability density of
finding a particle at the point r, i.e., calculate |(r)|2 . Discuss the interference effect and
its dependence on the wavevectors k1 and k2 .
3. Assume that an electron with mass m 0 = 9.8 1031 kg is placed in a quantum well
with two impenetrable walls and that the distance between the walls is L = 106 cm.
Calculate the three lowest subband bottom energies: 1 , 2 , and 3 . For these stationary
states, find the probability density of finding the electron at the middle of the well at
z = 0.
4. Consider an electron, which is placed into a quantum well with width L = 106 cm
and barrier height Vb = 300 meV. Calculate the lowest energy level and compare your
result with results for the previous problem. Since the probability density for finding the
particle at a point z is |(z)|2 , the probability of finding the particle in the classically
forbidden regions is
P =2
|(z)|2 dz.
L/2
Use the solution for the lowest energy level to calculate P. Find the number of discrete
energy levels in such a quantum well.
5. The hydrogen molecule, H2 , has the frequency of oscillations = 8.2 1014 s1 .
The reduced mass, which determines the relative displacement of hydrogen atoms, is
m = 0.84 1027 kg, which is half of the mass of a hydrogen atom. Using the model
of a harmonic oscillator, estimate the characteristic displacement, z0 , during vibrations
of this molecule.
6. In classical physics, for a system under equilibrium conditions, the equipartition
principle is valid. According to this principle the average energy of any oscillator is
kB T , where kB = 1.38 1023 J K1 is Boltzmanns constant and T is the ambient
temperature. Calculate the energy of zero-point vibrations of an electromagnetic wave
of wavelength 10 m. Calculate the average number of photons, Nq, , that will be excited
with this wavelength if the ambient temperature is equal to 500 K.
7. Consider two quantum objects, a cubic box and a spherical dot surrounded by impenetrable walls. Suppose that the volumes of the classically allowed regions are equal.
Compare the ground-state levels for the box and the dot.
64
Wave mechanics
8. By using approximate formulae for the transmission coefficient, calculate the probability of tunneling of the conduction electron through an AlGaAs layer of thickness
embedded within a GaAs matrix, for a barrier height equal to 0.3 eV, with the
20 A
electron mass in GaAs m = 0.067m 0 , where m 0 = 9.8 1031 kg is the mass of a free
electron.
9. One of the experimental facts which constitutes one of the fundamentals of wave
mechanics is the observation of discrete optical spectra (spectral lines) of emission of
hydrogen atoms. Using the formulae for the energy levels of a hydrogen atom, E n ,
calculate a general expression for the emission frequencies. Find the wavelengths for
several lines of the so-called Lyman series, for which one of the states participating in
emission is the ground state 1: 1 2, 1 3, 1 4, . . .
4.1
Introduction
After the previous introduction to the general properties of particles and waves on the
nanoscale, we shall now overview the basic materials which are exploited in nanoelectronics. As discussed in Chapter 1, electronics and optoelectronics primarily exploit the
electrical and optical properties of solid-state materials. The simplest and most intuitive
classification of solids distinguishes between dielectrics, i.e., non-conducting materials,
and metals, i.e., good conducting materials. Semiconductors occupy a place in between
these two classes: semiconductor materials are conducting and optically active materials
with electrical and optical properties varying over a wide range. Semiconductors are the
principal candidates for use in nanoelectronic structures because they allow great flexibility in the control of the electrical and optical properties and functions of nanoelectronic
devices.
The semiconductors exploited in microelectronics are, in general, crystalline materials.
Through proper regimes of growth, subsequent modifications and processing, doping by
impurities, etc., one can fabricate nanostructures and nanodevices starting from these
bulk-like materials.
Other physical objects that demonstrate promising properties for nanoelectronics were
discovered recently, for example carbon nanotubes. These wire-like and extended objects
are of a few nanometers in cross-section. They can be produced with good control of
their basic properties; in particular, they can be fabricated as either semiconductors or
metals. Various types of processing techniques have been shown to be viable for the
fabrication of electronic nanodevices from carbon nanotubes.
In this chapter, we consider various materials that have applications for nanoelectronics. We start with the classification of dielectrics, semiconductors, and metals. Then,
we define electron energy spectra, which determine the basic properties of the electrons
in crystals. For nanoelectronics, a critical issue is the engineering of electron spectra,
which can be realized in heterostructures. Thus, we analyze basic types of semiconductor
heterostructures. Finally, we briefly describe organic semiconductors and carbon-based
nanomaterials, among them carbon nanotubes and such nano-objects as fullerenes.
66
E1
E1
Energy level E3
E1
E1
(a)
Overlap
Energy level E2
Energy band
E1
2
Energy level E1
1
Energy bands E1, E2, and E3
(b)
Figure 4.1 (a) Formation of an energy band. The ground energy level E 1 of a single atom evolves
into an energy band when many identical atoms are interacting with each other. (b) Energy
bands 1, 2, and 3 that correspond to single-atom energy levels E 1 , E 2 , and E 3 , respectively.
4.2
Semiconductors
In every solid, electrons can be characterized in terms of their energy levels. In crystals,
allowed electron energies typically have an energy band structure that may be understood
as follows. In Section 3.4, we pointed out that, if two atoms, each with the same energy,
come close to each other, the composite two-atom system is characterized by two close
energy levels. Similarly, for a system of N atoms, every energy level of the isolated atom
is split into N closely spaced levels. This assembly of close levels may be considered as
an energy band. Figure 4.1(a) illustrates schematically the formation of such an energy
band from a single atomic level. Since a single atom has a series of energy levels, the
electron energies in a crystal constitute a series of energy bands that may be separated
by energy gaps, or may overlap as illustrated by Fig. 4.1(b). As soon as the energy bands
are formed, the electrons should be thought of as collectivized: they can no longer be
attributed to specific atoms, since the energy bands characterize the whole system of N
atoms.
A crucial point is the filling of the bands by electrons. We shall use the Pauli exclusion
principle, which we considered in the previous chapter. That is, no two electrons can be
in the same state. It is possible, as an example, for two electrons to be in the same energy
state, but these electrons must be in different spin states; thus the electrons are in fact in
different overall states. Under equilibrium conditions and at low ambient temperature,
the lowest energy levels should be populated. As we will see later, the most important
electrons are those in the upper populated bands. Then, principally, we obtain two possible
cases.
First, all bands are completely filled and the filled bands are separated from the upper
(empty) bands by an energy gap. This is the case illustrated by Figs. 4.2(a) and 4.2(b) for
dielectrics (insulators) with bandgaps E g > 5 eV and for semiconductors with bandgaps
E g < 5 eV, respectively. Actually, there is no difference between filling up energy bands
for a dielectric (insulator) and for a semiconductor. The difference is in the energy gap
4.2 Semiconductors
67
(a)
Empty
(b)
Empty
Eg
Eg
Filled
(c)
Partially Filled
Filled
(d)
Empty
Overlap
Filled
Filled
Figure 4.2 Energy bands in the cases of a dielectric, a semiconductor, and a metal. (a) The case
of a dielectric: filled valence band and empty conduction band; E g > 5 eV. (b) The case of a
semiconductor: filled valence and empty conduction band (at low temperature); E g < 5 eV.
(c) Electrons in the partially filled band may gain energy from an electric field and, as a result,
transfer to empty (free) levels of higher energy, thus exhibiting electric conductivity.
Accordingly, a structure with a partially filled band corresponds to a metal. (d) For overlapping
bands the available electrons fill states in both bands.
between the upper filled band and the next empty band: for semiconductors this energy
gap is much smaller than it is for dielectrics, as is illustrated in Figs. 4.2(a) and 4.2(b).
This upper band is empty, at least at low temperature. Later, we will study the electronic
band structure of semiconductor materials in more detail.
Second, the upper bands that contain electrons are not completely filled as in
Figs. 4.2(c) and 4.2(d); this case corresponds to a metal. Indeed, to exhibit conductivity under an applied electric field, an electron should experience acceleration and a
gain in energy. That is, an electron should be able to gain a small amount of energy
and be transferred to close, but higher, energy levels. If all energy levels (a whole band)
are already filled up, the electron can not participate in conduction processes. This is
the case of a dielectric (an insulator). In contrast, if the energy band is not completely
filled and there are empty energy levels available for the electrons, in an electric field
the electron can move to upper levels and gain energy in a process that corresponds to
electrical conductivity. This is the case of a metal.
Semiconductors include elements from the central portion of the Periodic Table of elements columns II to VI, as shown in Table 4.1. At the center of Table 4.1 is silicon, Si,
the backbone material of modern electronics. Silicon plays the central role in electronics
68
Group III
Group IV
Group V
Group VI
Be
Mg
Zn
Cd
Hg
B
Al
Ga
In
Tl
C
Si
Ge
Sn
Pb
N
P
As
Sb
Bi
O
S
Se
Te
Po
just as steel plays a dominant role in metallurgy. Below Si is germanium, Ge. Nowadays,
Ge is rarely used by itself; however, GeSi alloys play an increasingly important role in
todays electronics technology. Besides the elemental materials, contemporary electronics also uses combinations of elements from group III and group V, and combinations
of elements of group II and group VI, as well as some more complicated combinations.
These combinations are called compound semiconductors. By combining each element
from group III with N, P, As, Sb, and Bi from group V, 25 different IIIV compounds can
be formed. The most widely used compound semiconductor is GaAs (gallium arsenide)
and all IIIV semiconductors are used to fabricate so-called heterostructures. A heterostructure is made of two different materials with a heterojunction boundary between
them. The specific choice of heterostructure depends on the application.
Two or more compounds may be used to form alloys. A common example is aluminum
gallium arsenide, Alx Ga1x As, where x is the fraction of group III sites in the crystal
occupied by Al atoms, and 1 x is the fraction of group III sites occupied by Ga atoms.
Hence, now we have, not just 25 discrete compounds, but a continuous range of materials.
As with the IIIV compounds, every element shown in column II may be used together
with every element in column VI to create IIVI compounds, and again, by combining
more than two of these elements, it is possible to create a continuous range of materials.
As a result, it is possible to make compositionally different IVIV, IIIV, and IIVI
compounds.
4.3
(a) Crystalline
(b) Amorphous
69
(c) Polycrystalline
Figure 4.3 Three types of solids: (a) ordered crystalline and (b) amorphous materials are
Ionic crystals
Ionic crystals are made up of positive and negative ions. The ionic bond results primarily
from attractive mutual electrostatic interaction of neighboring ions with opposite charges.
There is also a repulsive interaction with other neighbors of the same charge. Attraction
and repulsion together result in a balancing of forces that leads to the atoms being in
stable equilibrium positions in such an ionic crystal. As for the electronic configuration
in a crystal, it corresponds to a closed (completely filled) outer electronic shell. A good
example of an ionic crystal is NaCl (salt). Neutral sodium, Na, and chlorine, Cl, atoms
have the configurations Na11 (1s2 2s2 2p6 3s1 ) and Cl17 (1s2 2s2 2p6 3s2 3p5 ), respectively.
That is, the Na atom has only one valence electron, while one electron is necessary to
complete the shell in the Cl atom. It turns out that the stable electronic configuration
develops when the Na atom gives one valence electron to the Cl atom. Both of them
become ions, with opposite charges, and the pair has the closed outer-shell configuration
(like inert gases such as helium, He, and neon, Ne). The inner shells are, of course,
completely filled both before and after binding of the two atoms. In general, for all
elements with almost closed shells, there is a tendency to form ionic bonds and ionic
crystals. These crystals are usually dielectrics (insulators).
Covalent crystals
Covalent bonding is typical for atoms with a low level of filling up of the outer shell. An
excellent example is provided by a Si crystal. As we discussed in the previous chapter, the
electron configuration of Si can be represented as core +3s2 3p2 . To complete the outer
3s2 3p2 shell, a silicon atom in a crystal forms four bonds with four neighboring silicon
atoms. The symmetry of the hybrid sp3 orbitals dictates that these neighboring atoms
should be situated in the corners of a tetrahedron as shown in Fig. 4.4. Then, the central
70
Si
Si
Si
Si
Si
Figure 4.4 Four sp3 -hybrid bonding orbitals in a crystal of Si.
Si atom and each of its nearest-neighbor Si atoms share two electrons. This provides socalled covalent (chemical) bonds (symmetric combinations of the sp3 orbitals) in the Si
crystal. The four bonding sp3 orbitals form an energy band that is completely filled by the
valence electrons. This band is called the valence band. The antisymmetric combination
of the sp3 orbitals with destructive interference, as in Fig. 3.17, leads to antibonding
states and to the first empty energy band. The latter band is called the conduction band
of the Si crystal. Not surprisingly, a covalent bond of this type plays a major role for Ge,
which is also from group IV of the Periodic Table of elements.
In fact, both of the types of atomic bonding discussed previously may exist simultaneously in a crystal, for example, in IIIV compounds. Indeed, the electron configurations
in Ga and As are core +4s2 4p1 (Ga) and core +4s2 4p3 (As). When a GaAs crystal is
formed, the As atom gives one valence electron to the Ga atom, which makes them both
ions. The Coulomb interaction of these ions contributes to the ionic bonding in IIIV
compounds. But now each ion has only two 2s electrons and two 2p electrons, which are
not enough to fill the shell completely. Therefore, the rest of the bonding goes through the
formation of the sp3 -hybridized orbitals. This is the covalent contribution to the crystal
bonding. We can conclude that the IIIV compounds are materials with mixed bonding
partly ionic and partly covalent.
Metals
Metals, such as Na, K, and Ca, consist of ions regularly situated in space. Each atom
contributes an electron to an electron sea, in which the ions are embedded. The system
as a whole is neutral and stable. The electrons contribute significantly to the binding
energy of metals.
71
Crystal
Ionic
NaCl
LiF
Diamond, C
Si
Ge
Na
Fe
Al
CH4
Ar
7.9
10.4
7.4
3.7
3.7
1.1
4.1
2.4
0.1
0.8
Covalent
Metallic
Molecular and
inert-gas crystals
There exist other bonding mechanisms in solids, in molecular crystals and inert-gas
crystals. However, these crystals have small binding energies, that is, they are not stable
enough to be used widely in electronics. In Table 4.2, for comparison, we present binding
energies per atom for different types of crystals. We can see that ionic and covalent
crystals typically have binding energies in the range 110 eV, whereas molecular and
inert-gas crystals are weakly coupled systems. Metals have intermediate coupling. The
binding energy of a crystal is an important parameter, since it determines the stability of
the crystal, its aging time, the applicability of various treatment processes, etc.
Crystal lattices
Now, we return to the discussion of crystal periodicity. The periodic arrangement of atoms
(ions) in a crystal forms the lattice. The positions of atoms in the lattice are defined as
the sites. In principle, atoms always perform small-amplitude oscillations around the
sites. However, in many cases we can neglect these small-amplitude oscillations and
think of a crystal as a system of regularly distributed atoms (ions). In such a perfect and
periodic crystal lattice, we can identify a region called a unit cell. Such a unit cell is a
representative of the entire lattice, since the crystal can be built by regular repeats in
space of this element. The smallest unit cell is called the primitive cell of the lattice. The
importance of the unit cell lies in the fact that by studying this representative element one
can analyze a number of properties of the entire crystal. The primitive cell determines
the fundamental characteristics of the crystal, including the basic electronic properties.
One of the most important properties of a perfect crystalline lattice is its translational
symmetry. Translational symmetry is the property of the crystals being carried into
itself under parallel translation in certain directions and for certain distances. For any
three-dimensional lattice it is possible to define three fundamental noncoplanar primitive
translation vectors (basis vectors) a1 , a2 , and a3 , such that the position of any lattice site
can be defined by the vector R = n 1 a1 + n 2 a2 + n 3 a3 , where n 1 , n 2 , and n 3 are arbitrary
integers. If we construct the parallelepiped using the basis vectors, a1 , a2 , and a3 , we
obtain just the primitive cell.
72
a2
a1
a2
B
a1
a2
a1
Figure 4.5 A two-dimensional lattice. Three unit cells are illustrated by A, B, and C. Two basis
a3
a3
a3
a2
a2
a2
a1
a1
(a)
a1
(b)
(c)
Figure 4.6 Three types of cubic lattice: (a) simple, (b) body-centered, and (c) face-centered.
73
a
Figure 4.7 The diamond lattice is of the face-centered type of cubic lattices. The tetrahedral
4.4
C3
C4
z
110
11
0
C3
C3
C3
001
C4
1
00
1
11
00
1
10
010
010
11
1
74
x
001
C4
(a)
(b)
Figure 4.8 (a) Symmetry directions for cubic crystals; (b) rotational symmetry directions in
cubic crystals.
be relatively weakly coupled to atoms. This allows one to think about a crystal as a
system with two relatively independent subsystems: the atom (ion) subsystem and the
electron subsystem. In this section, we will consider the electron subsystem of crystals.
Actually, in a crystal an electron moves in the electrostatic potential created by positively charged ions and all other electrons. This potential is frequently referred to as the
crystalline potential, W (r). We will consider one-particle states for electrons in an ideal
crystal and we will present a classification of these states and find a general form for the
wavefunctions and energies.
where r is an arbitrary point of the crystal and n j are some integers. The one-particle
wavefunction should satisfy the time-independent Schrodinger equation
-2
h
75
Because of the potential periodicity of Eq. (4.1), the wavefunctions (r) may be
classified and presented in a special form. To find this form we introduce the translation
operator, Td , that acts on the coordinate vector r as
Td r = r + d,
d=
3
n jaj.
(4.3)
j=1
On applying this operator to the wavefunction we find that the function Td (r)
(r + d) is also a solution of Eq. (4.2) for the same energy E. Let us assume that
the electron state with energy E is not degenerate. Then, we conclude that the two
wavefunctions (r) and (r + d) can differ only by some multiplier, C d :
(r + d) = Cd (r).
(4.4)
(4.5)
we obtain |Cd |2 = 1. Two different translations d1 and d2 should lead to the same result
as the single translation d = d1 + d2 , i.e., Cd1 Cd2 = Cd1 +d2 . From this result it follows
that Cd may be represented in an exponential form:
3
(4.6)
n jaj ,
Cd = eikd = exp ik
j=1
where k is a constant vector. Thus, from Eq. (4.4) we get the wavefunction in Bloch form,
(r) = eikd (r + d) = eikr u k (r),
(4.7)
(4.8)
where
One can check that the so-called Bloch function u k (r) is a periodic function:
u k (r + d ) = u k (r),
d =
3
n jaj.
j=1
Therefore, the stationary one-particle wavefunction in a crystalline potential has the form
of a plane wave modulated by the Bloch function with the lattice periodicity. The vector
k is called the wavevector of the electron in the crystal. This wavevector is one of the
quantum numbers of electron states in crystals.
By applying the so-called cyclic boundary conditions to the crystal with a number of
periods N j along the direction a j ,
(r + N j a j ) = (r),
N j ,
(4.9)
we find for k
ka j N j = 2n j ,
n j = 1, 2, 3, . . ., N j .
(4.10)
These allowed quasi-continuum values of k form the so-called first Brillouin zone of
the crystal. They are just those energy bands which we discussed above by using simple
76
(4.11)
where enumerates these solutions and, thus, the energy bands. Owing to the crystals periodicity, the Bloch function can be calculated within a single primitive cell.
In Eq. (4.11) we normalize the wavefunction ,k for the crystal volume V = N V0 ;
N = N1 N2 N3 and V0 are the number and volume of the primitive crystal cell,
respectively. From the normalization of the wavefunction ,k (r) one obtains
1
|u ,k |2 dr = 1,
(4.12)
V0 V0
where the integral is calculated over the primitive cell. This formula allows one to estimate
the order of the value of u ,k : |u ,k | 1.
Thus, through this analysis we have established an extremely important property of
the electrons in crystalline solids: despite the interaction of an electron with atoms and
other electrons, in a perfect lattice the electron behaves much like a free electron. The
electron can be characterized by a wavevector k and, thus, it possesses the momentum -hk.
By considering phenomena that have spatial scales much greater than distances between
atoms (ions) in the primitive cell, we may omit the Bloch function u ,k and describe the
electrons by a wavefunction in the form of the plane wave k (r) = A exp(ikr), just as for
a free particle. However, the wavevector changes inside the Brillouin zone in a manner
that is specific for a given crystal and, in general, the energy dispersions E = E (k) can
differ from that of the free electron considerably.
The holes
According to the discussion given in Section 4.2, some of the energy bands are completely
filled, while the others are almost empty. For our purposes, two of the bands are of great
importance: the upper filled band and the lowest empty band. They are called the valence,
E v (k), and the conduction, E c (k), band, respectively.
One of the ways to get an electron into the conduction band is to transfer an electron
from the valence band to the conduction band. Thus, for analysis of the valence band, it
is useful to adopt the concept of a hole as a new quasiparticle; i.e., by a hole we refer
to an electron missing from the valence band. These quasiparticles can be introduced
and described on the basis of simple considerations. If the valence band is full, the total
wavevector of all electrons in the valence band is zero:
ki = 0,
(4.13)
kv =
i
77
where the sum accounts for all occupied valence states. Let us assume that one of the
electrons with wavevector ke is removed from the valence band. The total wavevector of
the valence electrons becomes
ki = ke .
(4.14)
kv =
i
On the other hand, removing this electron is identical to the creation of a hole in
the valence band. One can attribute the wavevector of Eq. (4.14) to this hole: kh = ke .
Then, the energy of the valence electrons decreases by the term E v (ke ), and, thus, one can
also attribute the energy E h (kh ) = E v (ke ) to this hole. If the energy band is symmetric;
i.e., E v (k) = E v (k), we can write for the hole energy
E h (kh ) = E v (ke ) = E v (ke ) = E v (kh ).
(4.15)
Thus, we can characterize the hole by a wavevector, kh , and an energy, E h (kh ), and
consider the hole as a new quasiparticle created when the electron is removed from
the valence band. In the conduction band, the electron energy, E c (k), increases as the
wavevector, k, increases. Conversely, in a valence band, near the maximum energy of
the band the electron energy, E v (k), decreases as k increases. However, according to
Eq. (4.15), the hole energy increases with the hole wavevector, kh . That is, the hole
behaves as a usual particle. Thus, one can introduce the velocity of the hole, vh =
E h (kh )/kh , and then employ Newtons laws, etc. The absence of a negative charge in the
valence band brought about when an electron is removed makes it possible to characterize
a hole by a positive elementary charge; that is, the holes carry positive electrical charge.
It is worth emphasizing that the similarity between the electrons and holes is not
complete: the holes exist as quasiparticles only in a crystal, whereas the electrons exist
also in other physical media, as well as in the vacuum.
mxx
m 1
m 1
xy
xz
1
= m 1
m 1
m 1
(4.16)
yx
yy
yz ,
m ij
1
1
1
m zx
m zy
m zz
where i and j denote x-, y-, and z-coordinates.
Position of extremum in
k-space
Degeneracy
&
(/a)[111], (/a)[111],
(/a)[111],
(/a)[111]
| | < 1
pz
W
L
px
78
py
Figure 4.9 The first Brillouin zone of cubic crystals. Points of high symmetry, &, L, and X points,
are shown.
Thus, due to the crystal symmetry, the problem of finding an electron energy spectrum E (k) is reduced to the following steps: (a) determination of the high-symmetry
k points of the Brillouin zone; (b) calculation of energy positions of extrema; and
(c) analysis of effective masses or other parameters of an expansion of E (k) within
extrema.
The structure and symmetry of the Brillouin zone for cubic crystals of group IV
semiconductors and of IIIV compounds are very similar. Figure 4.9 shows the Brillouin
zone of these semiconductor materials. The symmetry points are shown in Fig. 4.9 and
presented in Table 4.3. Evidently, as a result of crystal symmetry, several points have
the same symmetry; indeed, they are mapped into themselves under proper symmetry
transformations. Such a degeneracy of the symmetry points is indicated in Table 4.3. In
particular, the &, L, X, and ! points are of central importance. They give the positions
of the extrema of the electron energy in IIIV compounds, Ge, and Si.
The bandstructures for GaAs and Si are presented in Fig. 4.10. The energy dispersions
along two symmetric directions of the wavevectors [111] (from & to L) and [100] (from
& to X) are shown. In each case, the energy, E, is taken to be zero at the top of the
highest valence band, which is located at the & point for both of these materials. Since
Si
E (eV)
E (eV)
Eg
GaAs
2
1
79
Eg
Figure 4.10 Bandstructures of Si and GaAs: the bandgap of Si is E g = 1.12 eV; and the bandgap
electrons tend to be near the energy minima, one can think of electrons as being located
inside marked regions of k-space. Frequently, these regions in k-space are referred to as
energy valleys, or simply valleys. Materials with several valleys are called many-valley
semiconductors. For IIIV compounds there is only one valley around the point k = 0
(the & valley); however, in the case of Si there exist six ! valleys in accordance with
the degeneracy of the ! points. It is worth emphasizing that also other symmetry points
can play a considerable role in processes occurring far from equilibrium. The highest
valence band and the lowest conduction band are separated by energy bandgaps, E g . Let
us postpone the analysis of valence bands for a while and focus on conduction bands.
Conduction bands have different structures for the groups of materials considered in
this section. The main difference is that for Si and Ge the lowest minima are located
at ! and L points, respectively, whereas for most IIIV compounds there is only one
lowest minimum, which is at the & point. The difference is not simply quantitative;
indeed, it is qualitative and leads to a series of important consequences in the behavior of
electrons.
As discussed previously, within the &, !, X, and L points the electron dispersion
curves, E (k), can be expanded in series with respect to deviations from the minima,
k :
1 -h2
E (k) = E(k ) +
(ki ki, )(k j k j, ).
(4.17)
2 m
ij
For the & point, we have the simplest case of the so-called isotropic effective mass, m :
1
1
= i j .
(4.18)
m&
m
ij
For !, X, and L points, i.e., for !, X, and L valleys, the reciprocal effective-mass
tensor has only two independent components, corresponding to the longitudinal, m l , and
transverse, m t , masses. For example, in the case of Si, for a coordinate system with the
80
Si
Ge
Type of bandgap
Lowest minima
Degeneracy
E g (eV)
Indirect
! points
6
1.12
Indirect
L points
4
0.664
Electrons
m l /m 0
m t /m 0
0.98
0.19
1.64
0.082
Holes
m hh /m 0
m lh /m 0
!so (eV)
0.50
0.16
0.044
0.44
0.28
0.29
z-axis along the axis where the ! valley under consideration is located and the two other
axes perpendicular to the first, one can obtain
1
mt
0
0
1
= 0
(4.19)
0 ,
m 1
t
m i j
0
0
m 1
l
where m l and m t are called the longitudinal and transverse effective masses, respectively.
For group IV semiconductors, Si and Ge, the energy parameters are presented in Table 4.4.
The degeneracy, given in Table 4.4, indicates the existence of six and four equivalent
energy valleys for Si and Ge, respectively. According to Fig. 4.9, the L points are located
at the edges of the Brillouin zone, i.e., only half of each energy valley lies inside the first
Brillouin zone. This reduces the effective number of the valleys to four. In Table 4.4 the
bandgaps, E g , and the electron longitudinal, m l , and transverse, m t , effective masses are
also presented.
For IIIV compounds, different situations occur for different materials: some of these
compounds are direct-bandgap crystals and others are indirect materials. Thus, for the
conduction bands of these materials the conduction band edge can be found at the & point,
at the L point, or at the X point. In Table 4.5, energy-band parameters for three sets of
IIIV compounds are shown; the direct or indirect nature of the crystals is indicated.
Now we shall consider valence bands. Their structure is more complicated. For group
IV semiconductors and for IIIV compounds, the top of the valence bands at k = 0
has high degeneracy, i.e., several valence bands have the same energy at this point. The
degeneracy occurs because these bands originate from the bonding of three p orbitals of
the atoms composing the crystals. Thus, if one neglects the interaction between the spin
of the electrons and their motion (so-called spinorbit interaction), one obtains three
degenerate valence bands, each of which is also doubly degenerate as a result of electron
spin. In fact, the spinorbit interaction causes a splitting of these six-fold degenerate
states. At k = 0 they are split into (i) a quadruplet of states with degeneracy equal to 4
and (ii) a doublet of states with degeneracy equal to 2. This splitting of valence bands,
81
GaAs
AlAs
Type of gap
Lowest minima
E g (eV)
Direct
& point
0.354
Direct
& point
1.42
Indirect
X points
2.16
Electrons
m /m 0
0.025
0.067
0.124
Holes
m hh /m 0
m lh /m 0
!so (eV)
0.41
0.26
0.38
0.50
0.07
0.34
0.50
0.26
0.28
!so , at k = 0 is shown in Fig. 4.10. One often refers to the lower valence band as the
split-off valence band. At finite k, the spinorbit interaction leads to further splitting of
the upper valence band into two branches: the heavy- and light-hole bands. Parameters
characterizing these bands are the heavy-hole, m hh , and light-hole, m lh , masses. The
effective masses of the light hole, m lh , and heavy hole, m hh , as well as the distance to the
split-off band, !so , for Si, Ge, and some IIIV compounds are presented in Tables 4.4
and 4.5.
It is worth mentioning that, despite the relative complexity of the picture presented
for energy bands, the description of electron properties in terms of the electron and hole
quasiparticles by using several E (k) dependences is incommensurably simpler than
operating with the enormous number of valence electrons in the crystal (1023 cm3 ).
82
emission (absorption). Throughout the whole optical spectral region (from infrared to
ultraviolet light), wavelengths are much greater than electron de Broglie wavelengths,
as estimated in Chapters 2 and 3. The photon wavevectors q = 2/, in turn, are much
smaller than the electron wavevectors (|k1 |, |k2 | |q|). This property reduces the above
equations to
k 1 k2 = k
and
In other words, under absorption and emission of the light electrons transferred between
the valence and conduction bands practically preserve their wavevectors. In other words,
the electron wavevector changes very little. In an energy scheme like that presented
in Fig. 4.10, the processes of lightcrystal interaction can be interpreted as vertical
electron interband transitions. Another conclusion following from this analysis is that
the interband phototransitions are possible only for light with energy quanta exceeding
the bandgap, -h E g . This finding implies that a pure semiconductor crystal is optically
- /E g , with c being the velocity of light in
transparent for light with -h < E g ( > 2 ch
vacuum). For some optoelectronic applications, the spectral range near the onset of
absorption/emission of light is critically important.
On combining this analysis with the previously described bandstructures of different
materials we can see that phototransitions induced by light with energy near the bandgap
are possible in semiconductors for which the conduction and valence bands have a
minimum and a maximum, respectively, at the same & point. For example, in GaAs one
can transfer an electron from the valence band to the conduction band directly without a
change in its momentum. Crystals of this type are called direct-bandgap semiconductors.
In contrast, in order to move an electron from the valence band to the conduction band
in Si and Ge, one needs not only to add an amount of energy greater than the minimum
energy difference between the conduction and valence bands to excite an electron, but
also to change its momentum by a large amount (comparable to the scale of the Brillouin
zone). Such a semiconductor is called an indirect-bandgap semiconductor.
Summarizing, the bandstructure of a semiconductor material determines both electrical and optical properties. Manipulation of electrons using light, i.e., optoelectronic
functions, is easier for a direct-bandgap semiconductor, such as GaAs. In contrast, silicon
and other group IV materials are optically relatively inactive. The situation is changed
in nanoscale Si and SiGe structures, where the momentum conservation law is no longer
rigorously obeyed.
83
E g (eV)
Alx Ga1x As
Alx In1x As
Gax In1x As
Gax In1x Sb
Alx Ga1x Sb
Alx In1x Sb
1.42 + 1.247x
0.360 + 2.012x
0.360 + 1.064x
0.172 + 0.139x
0.726 + 1.129x
0.172 + 1.621x
+ 0.698x 2
+ 0.415x 2
+ 0.368x 2
+ 0.430x 2
Effective mass, m /m 0
& point
X minima
L minima
0.067 + 0.083x
0.32 0.06x
0.11 + 0.03x
Heavy hole
Light hole
0.62 + 0.14x
0.087 + 0.063x
this disorder and produces high-quality crystals. The properties of such materials can be
interpreted in terms of nearly ideal periodic crystals.
Consider an alloy consisting of two components: A, with a fraction x, and B, with
a fraction 1 x. If A and B have similar crystalline lattices, one can expect that the
alloy A x B1x has the same crystalline structure, with the lattice constant ac given by
a combination of lattice constants of materials A, aA , and B, aB . The simplest linear
combination leads to the equation (Vegards law)
ac = aA x + aB (1 x).
(4.20)
Then, the symmetry analysis can be extended to these types of alloys. For SiGe alloys
and IIIV compounds, this leads us to the previously discussed symmetry properties
of the energy bands. Since the bandstructures are similar, one can characterize certain
parameters of the alloy as functions of the fraction x. This approximation is often called
the virtual-crystal approximation. For example, the bandgap of an alloy can be reprealloy
sented as E g = E g (x). Such approximate dependences are given in Table 4.6 for IIIV
alloys. They correspond to the bandgaps, E g , at & points.
As the composition of an alloy varies, the internal structure of energy bands changes
significantly. For example, in the case of Alx Ga1x As alloys, the lowest energy minimum
of the & conduction band of GaAs is replaced by the six X minima of AlAs as the value
of x is increased. Indeed, near the composition x 0.4, the alloy transforms from a
direct- to an indirect-bandgap material. The x-dependences of the effective masses for
various electron energy minima as well as for heavy and light holes for Alx Ga1x As are
presented in Table 4.7.
84
4.5
Semiconductor heterostructures
Further modification and engineering of material properties is possible with the use of
heterostructures. Heterostructures are structures with two or more abrupt interfaces at
the boundaries between the regions of different materials. With modern techniques for
growth of materials, it is possible to grow structures with transition regions between
adjacent materials that have thicknesses of only one or two atomic monolayers.
(4.21)
where A,B are the electron affinities of materials A and B. Similarly, one can calculate
the discontinuity of the valence band for the same heterojunction:
!E v = E v,B E v,A = B A + !E g ,
(4.22)
where !E g = E g,A E g,B is the bandgap discontinuity for the heterojunction, with
E g,A and E g,B being the bandgaps of materials A and B, respectively. Thus, if this simple
85
Eg
Forbidden
energy gap
Electron affinity
Vacuum level
Figure 4.11 This diagram illustrates the electron affinity and vacuum level in a crystal.
86
Ec,B
Ec
Ec
B
Ec,A
B E
g,B
Eg,A
Ev
Ec
Ev
Ev,A
Ev,B
(b)
(a)
Ev
A
(c)
Figure 4.12 Three types of interfaces: type I (a); type II (b); and broken-gap lineup (c). The
eFe
eF
eFe
B
Ax B1x B
A
+eF
(a)
Ax B1x
+eFh
(b)
+eFh
(c)
Figure 4.13 (a) A external electric field, F, simply tilts the conduction and valence bands of a
semiconductor. The forces eF and +eF acting on the electron and the hole are equal in
magnitude and opposite in direction. (b) The same direction of forces eFe and +eFh on the
electron and the hole caused by quasielectric field in the conduction band, Fe , and in the valence
band, Fh . (c) Forces eFe and +eFh of opposite directions for electrons and holes.
and the highest valence-band maximum on the other, with an energy separation between
the two less than the lower of the two bulk bandgaps. This case represents a type II
heterostructure. The combination AlAs/Alx Ga1x As for x > 0.4 and some Si/Six Ge1x
structures are of this kind. Figure 4.12(c) illustrates a broken-gap line-up, in which the
bottom of the conduction band on one side drops below the top of the valence band on
the other. The example of this band line-up is given by InAs/GaSb, with a break in the
forbidden gap at the interface of the order of 150 meV.
Graded semiconductors
Very often, graded semiconductors are used instead of abrupt heterointerfaces. To illustrate the idea, consider first a homogeneous piece of a semiconductor, say, a piece of
uniformly doped silicon, but with an electric field applied. Then, the band diagram looks
like that illustrated in Fig. 4.13(a) and is represented simply as two parallel tilted lines
corresponding to the conduction- and valence-band edges. The separation between the
two lines is the energy bandgap of the semiconductor; the slope of the two band edges
is the elementary charge e multiplied by the electric field F. When an electron or a hole
87
SiC
C
Si
GaP
GaAs
Ge
AlAs
InP
InAs
3.0806
3.5668
5.4309
5.4495
5.6419
5.6461
5.6611
5.8687
6.0584
is placed into this structure, a force eF acts on the electron and +eF acts on the hole;
the two forces are equal in magnitude and opposite in direction.
Slopes of conduction- and valence-band edges arise in the case of a graded transition
from one material to another. Graded transitions from a narrow-bandgap to a widebandgap semiconductor that correspond to the abrupt heterojunctions of Figs. 4.12(a)
and 4.12(b) are shown in Figs. 4.13(b) and 4.13(c). As is obvious from Figs. 4.13(b)
and 4.13(c), in the case of graded heterostructures there is a built-in electric field that
acts on electrons and holes. This field is called quasielectric. The quasielectric field
does not exist in homogeneous crystals; that is why graded heterostructures can be used
for new devices where the existence of a built-in electric field is required. Examples of
materials used in graded nanostructure devices are Six Ge1x and Alx Ga1x As, where
x changes in the direction of growth. Graded structures and the accompanying quasielectric forces introduce a new degree of freedom for the device designer and allow him to
obtain effects that are basically impossible to obtain using only external (or real) electric
fields.
4.6
3.0
ZnSe
2.5 AlP
CdS
AlAs
2.0 GaP
Eg (eV)
88
AlSb
1.5
Si
GaAs
CdTe
InP
1.0
GaSb
Ge
0.5
InSb
InAs
0.0
5.4
HgTe
5.6
5.8
6.0
6.2
6.4
6.6
a0 ()
Figure 4.14 Room-temperature bandgaps, E g , as functions of the lattice constant, a0 , for selected
IIIV and IIVI compounds and selected group IV materials and their alloys.
Valence matching
Since there are still no rigorous rules for how one can realize a given level of quality for
heterojunctions, we consider a few examples, which illustrate the problems.
If lattice matching were the only obstacle, the Ge/GaAs system would be the ideal
heterosystem, because, according to Table 4.8, it would allow one to realize the ideal
combination of group IV semiconductors and IIIV compounds. Indeed, on the basis
solely of lattice-constant matching, the Ge/GaAs system appears to be the most promising
candidate. However, it turns out that there is the problem of chemical compatibility for
this heterostructure. Covalent bonds between Ge on the one hand and Ga or As on the
other are readily formed, but they are what could be called valence-mismatched bonds,
meaning that the number of electrons provided by the atoms is not equal to the canonical
number of exactly two electrons per covalent bond. Hence, the bonds themselves are not
electrically neutral. Consider a hypothetical idealized (001)-oriented interface between
Ge and GaAs, with Ge to the left of a mathematical plane and GaAs to the right, as
shown in Fig. 4.15. In GaAs, an As atom brings along five electrons (resulting in 5/4
electrons per bond) and is surrounded by four Ga atoms, each of which brings along
three electrons (3/4 electron per bond), adding up to the correct number of two (8/4)
electrons per GaAs covalent bond. However, when, at a (001) interface, an As atom
has two Ge atoms as bonding partners, each Ge atom brings along one electron per
bond, which is half an electron more than is required for bonding. Loosely speaking,
the As atom does not know whether it is a constituent of GaAs or a donor in Ge. As
a result, each GeAs bond acts as a donor with a fractional charge, and each GeGa
89
-[110]
[001]
Ge
Ga
As
Ge/GaAs interface. The different atomic species, Ga or As atoms, and Ge atoms, do not bring
along the correct number of electrons to form electrically neutral GaGe or AsGe covalent
bonds of two electrons per bond. Reprinted with permission from W. A. Harrison, E. Kraut
C 1978 by the
et al., Polar heterojunction interfaces Phys Rev. B, 18(8), 4402 (1978).
American Physical Society.
bond acts as an acceptor with the opposite fractional charge. To be electrically neutral,
a Ge/GaAs interface would have to have equal numbers of both charges, averaged not
only over large distances, but also locally. Given chemical bonding preferences, such an
arrangement will not occur naturally during epitaxial growth. If only one kind of bond
were present, as in Fig. 4.15, the interface charge would support a large electric field
of 4 107 V cm1 . Such a huge field would force atomic rearrangements during the
growth, trying to equalize the numbers of GeAs and GeGa bonds. However, these
rearrangements will never go to completion, but will leave behind locally fluctuating
residual charges, with deleterious consequences for the electrical properties of materials
and any device applications.
Interfaces with perfect bond-charge cancellation may readily be drawn on paper, but
in practice there are always going to remain some local deviations from perfect charge
compensation, leading to performance-degrading random potential fluctuations along
the interface. This argument applies to other interfaces combining semiconductors from
different columns of the Periodic Table of elements.
The above discussion pertains to the most widely used (001)-oriented interface. The
interface charge at a valence-mismatched interface actually depends on the crystallographic orientation. It has been shown that an ideal (112) interface between group IV
and IIIV compounds exhibits no interface charge. An important example is the GaPon-Si interface that has a sufficiently low defect density; as a result, it is used in various
devices grown on Si. After these comments, we return to the discussion of the role and
importance of lattice matching.
90
Figure 4.16 Lattice-matched materials (a) and lattice-mismatched materials (b); the resulting
structure is strained (pseudomorphic) if the upper layer B adopts the lattice of substrate A (c).
Lattice-matched materials
For lattice-matched structures, the lattice constants of the constituent materials are nearly
matched; i.e., the lattice constants are within a small fraction of a percent of each other.
There is no problem, in general, in growing high-quality heterostructures with such
lattice-matched pairs of materials. By high-quality we mean that the interface structure is free of lattice imperfections such as interface defects, etc. Such imperfections
result in poor electrical and optical properties and may lead to fast and widespread
degradation of the structure. Figure 4.16(a) illustrates a lattice-matched layer B on a
substrate A. One can expect that the layer can be grown on the substrate if both materials are from the same group and the binding energies and crystal structures are very
similar.
According to the data of Fig. 4.14 and Table 4.8, the AlGaAs/GaAs system is an
example of a lattice-matched material. The system has a very small mismatch in the
lattice constants of only about 0.1% over the entire range of possible Al-to-Ga ratios
in the AlGaAs. As a result, such heterostructures can be grown free of mechanical
strain and significant imperfections. Hence, these structures provide a practical way
of tailoring bandstructures. In addition to these tailored electronic parameters, elastic
and other lattice properties can be different in layers composing such a lattice-matched
heterostructure.
Lattice-mismatched materials
The case of lattice-mismatched structures is characterized by a finite lattice mismatch.
Figure 4.16(b) depicts this case. If one tries to match these lattices, strain in the plane
of growth and a distortion along the growth axis arise. Thus, one obtains a strained
layer with a lattice deformation. The lattice-mismatched structure can be characterized
91
also by the relative mismatch of the lattice constants of the substrate, aA , and the
epilayer, aB :
aA a B
.
(4.23)
=
aA
Consider an elastic deformation of a lattice. It can be characterized by the vector of
relative displacement u. The displacement defines how any lattice point r moves to a
new position, r = r + u, as a result of the deformation. Different regions of the crystal
can be deformed differently. Thus, the displacement depends on coordinates: u = u(r).
In fact, only relative displacements are important. They are determined by the strain
tensor:
u j
1 u i
ui j =
,
(4.24)
+
2 j
i
where the tensor u i j has the following components:
(4.25)
(4.29)
where N is a vector perpendicular to the surface and f is an external force applied to the
surface.
92
These equations are sufficient for calculations of the strain of a layer A grown on
a mismatched substrate B. Let the lattice constants of these two materials be aA and
aB , respectively. In this discussion, both materials are assumed to be cubic crystals and
the direction of growth is along the [001] direction. If the layer A adopts the lattice
periodicity of the substrate B, the in-plane strain of the layer is
aB
u x x = u yy = u || = 1
.
(4.30)
aA
There should be no stress in the direction of growth. Thus, from Eq. (4.29) it follows that
zz = 0. On calculating zz from Eqs. (4.27)) and (4.28), and from the obtained result,
zz = c11 u zz + c12 (u x x + u yy ), we find the strain in the direction perpendicular to the
layer:
u zz =
2c12
u || .
c11
(4.31)
Thus, the strain can be found through the mismatch of the lattice constants.
The strain results in two types of effects: (1) the strain can generate various imperfections and defects; and (2) the strain in the layer leads to a change in the symmetry of the
crystal lattice, for example, from cubic to tetragonal or to rhombohedral, etc. Of course,
the latter effect can modify the energy bandstructure of the layer.
93
Misfit (%)
0
GexSi1x /Si
1 m
Strain + Defects
(Equilibrium)
1000
Str
100
Str
ain
ain
(E
(M
qu
e ta
ilib
st a
riu
ble
m)
10
0
0.2
0.4
0.6
0.8
1.0
Germanium Fraction, x
Figure 4.17 The stabilitystrain diagram for a Gex Si1x layer on Si substrate. After R. People and
J. C. Bean, Erratum: Calculation of critical layer thickness versus lattice mismatch for
Gex Si1x /Si strained-layer heterostructures Appl. Phys. Lett. 47, 322 (1985), Appl. Phys. Lett.
49, 229 (1986). Reprinted with permission from R. People and J. C. Bean, Appl. Phys. Lett., 49
C 1986 American Institute of Physics.
229 (1986).
94
4
3
2
n=1
a
0.5
Ge fraction
Figure 4.18 The elastic energy of strained Gen /Sin superlattices on Gex Si1x substrate with
various numbers of monolayers n as a function of the Ge fraction in the Gex Si1x substrate. The
numbers on the curves indicate n.
95
previous discussion, the elastic energy increases with increasing thicknesses of strained
layers for a given substrate material. Because of this, one employs superlattices with
a few monolayers: 2 (n, m) 5. Figure 4.18 also shows a nontrivial dependence of
the strain energy on the alloy composition of the substrate; the minimal strain energy is
expected for x in the range from 0.4 to 0.6.
Another important characteristic of pseudomorphic Si/Ge structures is the distribution
of the elastic energy over the monolayers of the superlattice. It was shown that the most
homogeneous distribution over layers occurs for the Si/Ge alloy with x 0.5. From
this point of view, Si0.5 Ge0.5 substrates are preferable. However, these results depend
strongly on the orientation of the substrate material. Often, the direction of growth on
the substrate is chosen to be the [001] direction.
Although the technology of the growth and fabrication of Si/Ge structures is still in a
developing stage, from our short analysis one can see that there is some qualitative and
even quantitative knowledge concerning the behavior responsible for the stability and
perfection of these structures.
Lattice-matched heterostructures
Let us return to Fig. 4.14 and discuss lattice-matched heterostructures in more detail.
From this figure, we can determine the lattice constants of various compounds. First of
all, one can see that the GaAs/AlAs system is really unique because the lattice constants
have almost identical values. In order to achieve lattice matching for other cases, it
is possible either to combine a binary compound and a ternary compound, or to use
ternaryternary compounds having appropriate ratios of atomic species within each
layer. For example, in the case of GaInAs/InP structures, lattice matching is achieved
exactly only for Ga0.47 In0.53 As, in which the ratio of Ga to In is 47 to 53 in the GaInAs
layer; for the other ratios, the GaInAs layer is not lattice-matched with the InP. Moreover,
the wide-bandgap Ga0.51 In0.49 P material is compatible with the narrow-bandgap GaAs
material.
In conclusion, the broad range of possibilities for controlling bandgaps and band offsets
for both electrons and holes, as well as electron and hole effective masses, provides
the basis for energy-band engineering. Through such energy-band engineering, it is
possible to design and fabricate high-quality heterostructures with designated optical
and electrical properties. If one can not achieve the desired properties using latticematched compositions, it is possible to employ strained pseudomorphic structures.
4.7
Organic semiconductors
In recent years, organic molecules have been shown to have properties that make them
suited for novel electronic and optoelectronic devices. Such novel devices include organic
light-emitting diodes, electronic circuits in mechanically flexible layers, crystal displays,
novel molecular electronic devices, carbon-nanotube-based devices for data processing,
and bioelectronic devices.
96
-bond
H
H
C
C
H
-bond
Figure 4.19 A schematic diagram of C2 H4 (ethene) illustrating -bonds and -bonds between
carbon atoms.
97
-bonds
Alternating
single and
double bonds
-bonds
Figure 4.20 A polymer with carbon backbone having alternating single and double bonds
between carbon atoms; such -conjugated polymers have extended wavefunctions facilitating
charge transport along the chain. Polyacetylene is a case in point.
Region of -bonds
and charge transport
Alternating
single and
double bonds
Region of -bonds
and charge transport
Figure 4.21 A -conjugated polymer (without -bonds shown) with an acceptor impurity. The
acceptor atom such as bromine or iodine attracts and captures one of the electrons in the
extended -states and results in a region where there is a deficit of electron charge a hole. The
captured electron is relatively immobile compared with the hole, which moves along the
polymer.
levels differ from one hydrocarbon polymer to another, making it possible to produce
conducting polymers with different electrical and optical properties; -conjugated polymers like polyacetylene have overlapping -orbitals that conduct charge. The doping
of a -conjugated polymer with an acceptor impurity is illustrated in Fig. 4.21. The
acceptor removes an electron from the polymer and leaves the polymer with a deficit
of negative charge, i.e., with a positive charge a hole (more detailed discussion about
doping is given in Section 5.3). This positive charge attracts neighboring electrons and
produces a local distortion of charge near the positive charge. This distorted charge
density is known as a polaron. A polaron propagates along the molecule in a manner
that bears some similarity to the propagation of a hole in a semiconductor. Indeed, the
polaron behaves as a positive (screened) particle with an effective mass that is different
from the mass of the electron. It is observed for polymers like polyacetylene that the
mobility of the hole is high and resulting conductivities are close to those of metals.
Thus, doping leads to conductivities in -conjugated polymers that make them useful in
electronic devices.
98
4.8
Carbon nanotubes
Actually, carbon nanotubes have the form of seamlessly rolled single sheets of carbon
atoms. The cylindrical sheets may have diameters of only a few nanometers. They are
very small objects that exhibit many different structures and properties. These nanotubes have no macroscopic analogues, such as the bulk semiconductors that served as
analogues for traditional semiconductor nanostructures. Thus, to study the structure of
the carbon nanotubes, one should use the most advanced microscopy techniques: atomic
force microscopy and scanning tunneling microscopy, which we will consider in the next
chapter.
A carbon nanotube is composed of carbon atoms. A defect-free single-walled nanotube
consists of a single cylinder and is characterized by the tube diameter and a quantity
known as the helicity. To understand the structure of carbon nanotubes, we will start with a
two-dimensional graphite sheet, shown in Fig. 4.22(a). The single sheet of graphite, called
graphene, has the form of a honeycomb-like lattice. Let a1 and a2 be the graphene lattice
vectors and n and m be integers. The diameter and helicity of the nanotube are uniquely
characterized by the vector C = na1 + ma2 (n, m) that connects crystallographically
equivalent sites on a two-dimensional graphene sheet. Here a1,2 are in units of a0 3 with
a0 being the carboncarbon distance. By using the vector C a carbon nanotube can be
constructed by wrapping a graphene sheet in such a way that two equivalent sites of the
hexagonal lattice coincide. The wrapping vector C defines the relative location of these
two sites. In Fig. 4.22(a), the wrapping vector C connects the origin (0, 0) and the point
with coordinates (11, 7). Thus, a nanotube denoted by indices (11, 7) is formed. A tube is
called an armchair tube if n equals m, and a zigzag tube in the case of m = 0. Wrapping
vectors along the dotted lines leads to tubes that are of zigzag or armchair form. All other
wrapping angles lead to chiral tubes whose wrapping angle is specified relative either to
99
tub
ea
xis
T H
(11,0)
zigzag
(0,0)
a1
chir
al
a2
(0,7)
arm
(11,7)
cha
ir
(a)
armchair
(b)
zigzag
Figure 4.22 (a) The relation between the hexagonal carbon lattice and the chirality of carbon
nanotubes. A carbon nanotube can be constructed from a single graphite sheet, called graphene,
by rolling up the sheet along the wrapping vector C. (b) Fragments of armchair and zigzag
carbon nanotubes.
the zigzag direction or to the armchair direction = 30 . Both and the wrapping
angle (chiral angle) are in the range (0, 30 ) as a result of the hexagonal character
of the carbon two-dimensional lattice of the graphene. Dashed lines are perpendicular
to C and run in the direction of the tube axis indicated by the vector T. The solid
vector H is perpendicular to the armchair direction and specifies the direction of nearestneighbor hexagon rows. The angle between T and H is the chiral angle . The unit
cell of a nanotube can be constructed by finding the smallest lattice vector T which
connects equivalent points of the lattice. It can be shown that this vector is given by
T = [(n + 2m)a1 (2n + m)a2 ]/q,
(4.32)
where a1 and a2 are the basis vectors of the graphene lattice; q is defined as follows:
l, if n m is not a multiple of 3l,
q=
3l, if n m is a multiple of 3l,
where l is the greatest common divisor of n and m. The number of hexagons, N , in the
unit cell is
N =
The tube radius, R, is given by
2(n 2 + m 2 + nm)
.
q
(4.33)
|C|
3a0
2
=
n + m 2 + nm .
(4.34)
2
2
The length of the carboncarbon bond can be estimated as a0 0.14 nm. It is easy to see
that, for a general chiral nanotube, the number of atoms contained in the unit cell is very
large. For example, an (8, 7) tube has a radius of about 0.57 nm and contains 676 atoms
in the unit cell. In determining the number of atoms, we have taken into account that
each hexagon contains two carbon atoms: n = 8, m = 7, n m = 1, q = 1, N = 338,
and the number of atoms is 2N = 676. In Fig. 4.22(b), the structures of two particular
examples of the armchair and zigzag types of nanotubes are sketched.
R=
100
T
No. 10
H
No. 1
No. 11
No. 8
No. 7
1 nm
In Figs. 4.23 and 4.24, atomically resolved images of individual single-walled carbon
nanotubes are shown. The images were obtained by using scanning tunneling microscopy.
On the surface of the cylinders (tube images), it is seen clearly that the lattice structure
has a spacing between carbon atoms equal to a0 0.14 nm. Such a lattice image makes
it possible to identify the chirality of the tube. Dashed arrows in Fig. 4.24 represent the
tube axis T and solid arrows indicate the direction of nearest-neighbor hexagon rows
101
H; compare this with Fig. 4.22(a). Tubes denoted by 10, 11, and 1 are chiral, whereas
tubes 7 and 8 have a zigzag and an armchair structure, respectively. Tube 10 has a chiral
angle = 7 and a diameter of 1.3 nm, which corresponds to the (11, 7) type shown in
Fig. 4.22(a). A hexagonal lattice is plotted on top of image 8 to prove that the non-chiral
armchair structure applies for this case.
In carbon nanotubes, an electron can propagate freely only in the direction along
the tube axis. Thus, electron or hole transport in these structures has the properties of
one-dimensional transport and the carriers can be characterized by a one-dimensional
wavevector k. As for any periodic structure, we can think of the electron energies as a
system of energy bands dependent on the wavevector k. The energy bands of a singlewalled nanotube can be constructed from the energy bands of the graphene presented
in Fig. 4.22(a). For the case of a multi-walled carbon nanotube, the energy bands may
be constructed from the energy bands of a layer of graphite having a thickness equal
to the thickness of the wall of the nanotube. Indeed, suppose that we find the energy
bands of the graphene to be E N (K) with K = {K x , K y } being the two-dimensional
wavevector. Instead of K x and K y , we can use projections of K onto the tube axis, T,
and onto the wrapping vector, C, k and K C , respectively; i.e., the energy is E N (K C , k).
However, along the C direction there is a periodicity. Thus, we can use the so-called
cyclic boundary conditions, as we did in Section 4.4 to obtain the Brillouin zones. This
results in quantization of the K projection onto the vector C:
KC =
2 N1
a0 3
(4.35)
(4.36)
That is, each of the initial bands of the graphene N is split into a number of onedimensional subbands N1 . In addition, the character of the splitting depends on the
wrapping type, i.e., on the values of (n, m) for the nanotube. Then, classification as
a metal, semiconductor, or dielectric can be made for these one-dimensional crystals.
For different (n, m) values, distinct properties of the nanotube can be expected. A more
detailed theory predicts that when (n m) is divisible by three the single-walled carbon
nanotubes are of metallic nature; otherwise, they are semiconductors with bandgaps that
depend on the nanotube diameter:
a0
(4.37)
E g 2E CC ,
d
where E CC is the binding energy of two carbon atoms, a0 is the carboncarbon distance,
and d is the diameter of the nanotube. Direct measurements support these considerations.
Specifically, for semiconducting tubes, an inverse linear dependence of the bandgap on
the nanotube diameter is confirmed, and the coefficient of proportionality is in good
agreement with theoretical expectations. It turns out that both types of carriers can exist
in those nanotubes, i.e., both the electron and the hole can be responsible for electric
conductivity of the tubes.
102
Buckyball fullerenes
Now we consider the so-called buckyball fullerenes which represent the fourth major
form of pure carbon. The previously discussed diamond and graphite and the well-studied
carbon nanotubes are the other three.
The buckyball fullerene is a molecule with the chemical formula C60 and is one of
the best-known nanoscale objects in nanoscience. Figure 4.25 depicts a buckyball. It
consists of 60 carbon atoms occupying equivalent sites. Each atom is bonded to three
other atoms. Single and double CC bonds occur. The lengths of the two types of bond
are 0.146 nm and 0.140 nm, respectively. That is, these bonds are practically identical.
In Fig. 4.25 they are represented by light and dark-shaded lines. To form almost an ideal
buckyball, nearest neighbors of carbon atoms are organized in pentagons and hexagons.
Every pentagon in the case structure is surrounded by five hexagons. The truncated
icosahedron structure shown in Fig. 4.25 has 12 pentagonal faces and 20 hexagonal
faces at 60 vertices with a C atom at each vertex.
103
Figure 4.25 A schematic drawing of the C60 buckyball (fullerene). Single and double bonds are
Because of the near-spherical shape of the truncated icosahedron, we can describe the
electronic energy states of the buckyball fullerene by using the classification of quantum
states developed for a spherically symmetric potential (Section 3.3) and the simplest
atoms (Section 3.4). As was found there, the electron energy states can be classified by
angular momentum quantum numbers (orbital numbers l). In the C60 fullerene, the
total number of valence electrons is 240. Of them, 180 electrons are involved in
the relatively deep-lying -bonding energy level, so that the electronic properties of the
fullerene are determined primarily by the remaining 60 electrons involved in -bonds.
A total of 50 electrons may be accommodated in quantum states with orbital numbers from l = 0 to l = 4. With two-fold spin degeneracy, this gives the following level
populations: 2 + 6 + 10 + 14 + 18, as illustrated by Fig. 4.26. The remaining 10
electrons fill the energy level with the orbital number l = 5. Actually, the total number
of these states is 2 (2 l + 1) = 22. Now, we recall that strictly speaking a truncated icosahedron differs from a sphere and has a lower symmetry. On analyzing this
lower symmetry, one finds splitting of l-states into three groups, as shown in Fig. 4.26
(h1u , t1u , and t1g denote different types of symmetry). The lowest five-fold (excluding
spin degeneracy) levels accommodate the remaining 10 electrons and can be designated
as the highest occupied molecular orbitals. Two other groups of l = 5 states are lifted
considerably; together with the l = 6 states they create the lowest unoccupied molecular
orbitals.
Conceptually, the electronic properties of an individual fullerene molecule can be
modified via replacement of a C atom with an atom having a greater or smaller number
of valence electrons. Such a process can be thought of as a kind of doping. An example
is azofullerene with the chemical formula C59 N, where one C atom is replaced by a N
atom.
104
l
5
6
6
5
5
4
3
3
2
1
0
t1g (LUMO + 1)
t1u (LUMO)
h1u
(HOMO)
Figure 4.26 A diagram of the energy states of 60 valence electrons in the fullerene molecule.
C IOP
After P. Moriarty, Nanostructured materials, Rep. Prog. Phys., 64, 355 (2001).
Publishing Limited.
The existence of unoccupied molecular orbitals for C60 explains effects of intermolecular interaction of the buckyballs. For example, two C60 molecules, being neutral systems,
attract each other through polarization of electron clouds. Such an attraction is caused by
van der Waals forces. Owing to these forces, two fullerenes compose the dimer C60 C60
with the coupling estimated to be about 0.27 eV. Similar van der Waals forces provide for interaction and adsorption of the fullerenes onto surfaces of various materials.
Figure 4.27 depicts the fullerene molecules adsorbed onto a Si surface. Since the van der
Waals forces are relatively weak, the fullerene molecules readily diffuse on the surface
and, in large concentrations, they can form hexagonally close-packed islands, as shown
in Fig. 4.28.
Then, the buckyballs themselves form face-centered cubic crystals (at room temperature) with large cohesive energy in this crystal (1.6 eV per molecule). Interestingly, the lattice constant of the C60 crystal has the unusually large value of
1.42 nm.
Summarizing, the fullerenes are natural and very stable nano-objects manifesting a
number of interesting physical and chemical properties, which can be controlled and
used in nanoscience applications.
4.9
Closing remarks
We began this chapter with the definition of crystalline materials. We introduced two
very important components of crystals, i.e., the electron subsystem and the crystalline
lattice. The electronic applications of a material are determined primarily by its electronic
105
Figure 4.27 An image of fullerene molecules adsorbed on a silicon surface (at low
concentration). Reprinted with permission, from P. Moriarty, Nanostructured materials,
C IOP Publishing Limited.
Rep. Prog. Phys., 64, 306 (2001).
Figure 4.28 At large concentrations, the fullerenes form a hexagonal structure on a surface.
Reprinted with permission, from P. Moriarty, Nanostructured materials, Rep. Prog. Phys., 64,
C IOP Publishing Limited.
358 (2001).
properties. As a result of the periodicity of crystals, one can interpret the electron energy
in terms of the energy bands. Depending on the relative positions of the energy bands
and their filling by electrons, a material can be described as a dielectric, semiconductor,
or metal. We found that, despite the interaction of the electrons with the great number of
atoms and ions composing a crystal, one can attribute a quasimomentum (a wavevector)
to an electron and that, frequently, within the actual range of the wavevectors, the electron
106
4.10 Problems
107
4.10
Problems
1. Using the lattice constant of silicon, a = 5.43 108 cm, and the fact that the
number of Si atoms per unit volume, a 3 , is eight, calculate the number of atoms per
1 cm3 and the density of the crystalline silicon (silicons atomic weight is 28.1 g mole1 ).
2. Estimate the volume of the first Brillouin zone in k-space for a simple cubic lattice
with the lattice constant a = 5 108 cm. Assume that the average energy of the electrons is 3kB T /2, where kB and T are Boltzmanns constant (kB = 1.381023 J K1 )
and the ambient temperature, respectively. Estimate the volume occupied by electrons
in k-space at T = 300 K with the effective mass m = 0.1m 0 , with m 0 being the freeelectron mass. Compare these two volumes and discuss whether Eq. (4.17) is valid for
electrons with the parameters specified previously.
3. Three electrons of the same energy are placed into three different energy valleys
of silicon. The valleys are located at ! points of the [100]-, [010]-, and [001]-axes.
Assuming that all three electrons move along the same direction, say [100], and using
Eqs. (4.17) and (4.19), find the ratio of the velocities for these electrons.
4. Consider a valence band consisting of light- and heavy-hole branches. Assume that a
heavy hole with energy E is transferred to a light-hole state with the same energy E. Find
the ratios of the quasimomenta and the velocities of the hole in the initial and final states.
5. For Alx Ga1x As alloy, find the composition having an energy bandgap equal to
2 eV using Table 4.6. For this alloy, determine the effective masses in the & and X
valleys using Table 4.7.
108
6. Assume that for some applications it is necessary to use a film of InGaAs of high
quality, which can be grown on an InP substrate. By using Eq. (4.20) and data presented
in Table 4.8, find the lattice-matched composition of this alloy, its lattice constant, the
energy bandgap, and the wavelength of the light corresponding to this bandgap.
7. Assume that the conduction-band offset for an Alx Ga1x As/GaAs heterojunction is
60% of the difference of the bandgaps of these materials. Find the composition of the
AlGaAs layer necessary for the resulting heterojunction to have an energy barrier for the
electrons equal to 0.3 eV. Calculate the energy barrier for the holes.
8. On the basis of the values of lattice constants given in Table 4.8, explain why it is feasible to grow stable Alx Ga1x As/GaAs and Inx Al1x As/In y Ga1y As heterostructures;
explain why it is difficult to grow stable heterostructures of GaP/SiC and InP/SiC.
9. Find the numbers of hexagons per unit cell in zigzag and armchair nanotubes. Calculate the radii of (7, 8) and (11, 10) nanotubes.
5.1
Introduction
Having reviewed the basic properties of materials exploited in nanoelectronics, we shall
now study the principal methods of high-quality material growth and nanodevice fabrication. Methods for the growth of perfect materials with controllable properties are critically
important for nanostructure fabrication. Indeed, stringent requirements must be met for
the growth of crystals for nanosize devices. These requirements include many factors and,
first of all, ultra-high quality and purity, both controlled within extremely close limits.
The following examples illustrate the term ultra-high quality. For Si crystals used in
nanodevices, concentrations of controlled impurities currently reach concentrations of
less than one part in ten billion (1 in 1010 ). For the case of Ge, this number is in the
range of 1 in 1013 1014 . The quality of a silicon crystal being used for nanoelectronics
can be characterized in terms of the density of defects: they must be limited to several
tens per 1 m2 (!) of the Si wafer according to the Semiconductor Road Map, that was
discussed in Chapter 1. The basic methods of growth of perfect crystalline materials and
multilayered heterostructures we will discuss in Section 5.2.
To fabricate a nanostructure and a nanodevice two approaches can be undertaken.
The first is based on a previously grown perfect material with further processing. This
includes a number of fabrication stages and methods (nanolithography, etching, implantation, selective doping, etc.). In Section 5.3 we review these methods. The second
approach exploits special regimes of material growth, when nanostructures are formed
spontaneously due to the growth kinetics. Such a growth regime can control size,
shape, and other properties of the nanostructures. Details of this approach are given in
Section 5.4.
Progress in the refinement of fabrication techniques for making nanostructures
depends on the great improvements made in characterization methods. In particular,
composition and dopant distribution, lattice strain, and other parameters within nanostructures must be known with atomic-scale precision. Currently, the manipulation of a
single atom (ion) in a solid is possible. In Section 5.5, we shall present the most important characterization techniques, such as atomic-force microscopy, scanning tunneling
microscopy, transmission electron microscopy, and others.
The previously mentioned approaches for the production of nanostructures and nanoelectronic devices actually represent evolutionary improvements in the growth and
110
5.2
Single-crystal growth
The following three steps are necessary to produce high-quality silicon crystals:
(i) production of metallurgical-grade silicon (impurity level 5 1016 cm3 ); (ii)
improvement of the latter material up to electronic-grade silicon (the level of impurities is reduced to 5 1013 cm3 or less); and (iii) conversion to single-crystal Si
ingots.
Metallurgical-grade silicon is typically produced via reaction of silicon dioxide (SiO2 )
with C in the form of coke:
SiO2 + 2C Si + 2CO,
(5.1)
which requires very high temperature (1800 C). Coke is a coal from which most of the
gases have been removed. The silicon obtained at this step is not single-crystalline and
is not pure enough for electronic applications, though it is good for some metallurgical
applications such as the production of stainless steel.
111
Pull
Seed
Wafers
Growing
crystal
Molten Si
Crucible
Figure 5.1 The Czochralski method for the growth of bulk semiconductors.
Further reductions in impurities can be achieved by carrying out the following reaction
of the silicon with dry HCl:
Si + 3HCl HSiCl3 + H2 .
(5.2)
Now we obtain the trichlorosilane HSiCl3 , which is typically in a liquid state with
a boiling point of 32 C. Simultaneously with HSiCl3 , other chlorides of impurities,
such as FeCl3 , are formed. Since their boiling points are different, the simple fractional
distillation technique can be applied: the mixture of HSiCl3 and other impurity chlorides
is heated and then condensed in a series of distillation towers at appropriate temperatures.
By this technique HSiCl3 is separated from impurities. The following reaction with H2
then converts the trichlorosilane into highly pure electronic-grade silicon:
2HSiCl3 + 2H2 2Si + 6HCl.
(5.3)
112
GaAs
Substrate
Molecular beams
Shutters
Sources
Be
Si
Al
Ga
As
Figure 5.2 The MBE method for the growth of GaAs/AlGaAs heterostructures.
Epitaxial growth
Fabrication of a crystal layer upon a wafer of a compatible crystal makes it possible to
obtain very well-controlled growth regimes and to produce high-quality crystals with the
desired crystalline orientation at temperatures typically well below the melting point of
the substrate. During the epitaxial growth, several methods of delivering the necessary
atoms to the growing layer can be used. The most developed methods are molecular-beam
epitaxy (MBE), chemical-vapor deposition (CVD), and liquid-phase epitaxy (LPE). Here
we shall discuss MBE and CVD in more detail.
Molecular-beam epitaxy
The MBE method can be realized in a high vacuum, where molecular or atomic beams
deliver onto a substrate the necessary components for growing the desired crystalline
layer. For example, suppose that we want to grow an AlGaAs layer on GaAs. Then,
the substrate will be GaAs and the atomic beams are fluxes of the elements Al, Ga,
and As, as well as beams of dopants (typically, Si is used for n-doping and Be for
p-doping). Sources of the elements are contained in separately heated chambers. The
evaporated elements form beams, which are separately and closely controlled, collimated,
and directed onto the substrate surface, as illustrated by Figs. 5.2 and 5.3. Typical
flux densities in the beams are of 1014 1016 atoms cm2 s1 . The substrate is held
at relatively low temperature (600 C for GaAs), while densities of the components
in the beams are large. This provides effective growth of the layer. A slow growth rate
(1 monolayer per second), which is often referred to as layer-by-layer growth, results in
113
the growth of a high-quality layer. By controlling shutters for each beam, one can produce
abrupt changes in crystal compositions and doping concentrations on the scale of one
monolayer.
Chemical-vapor deposition
This epitaxial method allows one to realize a low-temperature growth regime and to
use high-purity chemicals for delivering the necessary atoms for growth of a crystalline
layer. The layers can be grown onto a seed crystal or substrate from mixtures of chemical
vapors containing both semiconductor elements and dopants.
114
Gas inlet
Gas baffle
Quartz chamber
Wafers
Susceptor
Radio-frequency
source
Pedestal
Vent
Figure 5.4 The CVD method for the growth of heterostructures.
115
grown with thickness control of the order of one monolayer. Different types of doping
uniform doping, modulation doping, and delta-doping are realized with high accuracy.
Since in the chemical reactor the partial pressures of chemicals are much higher than the
pressure in the molecular beams of the MBE method, the rate of crystal growth realized
in the CVD method is higher than that of MBE. The former may be used in industrial
production, while the latter is rather well suited for research laboratories.
5.3
116
Illuminating radiation
Quartz plate
Ion-oxide layer
Illuminated areas
Negative resist:
exposed rendered
insoluble
Mask
Imaging system
Photoresist
Film to be patterned
Substrate
Develop
Positive resist:
exposed is
removed
Etch pattern
the photoresist is exposed through the pattern on the mask to high-intensity UV light.
One of the exposure methods, the contact method, is shown in Fig. 5.5. In this method,
the photoresist is exposed to UV light while the wafer is in the contact position with the
mask. As a result of the direct contact between resist and mask, very high resolution is
possible in contact printing.
One of the last steps in the photolithography process is development. The results
obtained after exposure and development are shown in Fig. 5.5 both for negative and for
positive resists.
Actually, photolithography processes have been known for decades. To use them for
nanostructures, we have to understand the limitations imposed by the wave nature of
light. Indeed, as we discussed in Chapter 2, simple geometrical ray optics is applied for
dimensions greater than the wavelength of light. The wave effect diffraction of light
restricts the application of photolithography to minimum dimensional scales of about
the wavelength of light. Thus, shorter wavelengths are preferable. Examples of some of
the shorter wavelengths in use are 0.365 m for UV mercury lamps and 0.193 m for
ArF excimer lasers. With these UV sources, it is possible to achieve linewidths of about
0.25 m and 0.15 m, respectively. Further penetration into the deep-UV region appears
to be extremely difficult.
The previously mentioned diffraction limit is much smaller for X-rays, electron beams,
and ion beams. Thus, advances in nanolithography are occurring as a result of the use
of these short-wavelength beams. For example, electrons with an energy of 10 keV have
i.e., less than the lattice constants of any crystal. Now, the
a wavelength of about 0.1 A;
117
Pump
Anode (Chamber)
Wafer
Cathode
RF
power supply
Etching
As soon as a resist pattern is formed, an etching method is generally used to produce the desired structures as shown in Fig. 5.5. There are many etching methods.
A widely used method is wet chemical etching. For example, dilute HF is used to
etch a SiO2 layer covering silicon. The HF reacts with SiO2 and does not affect the
photoresist or silicon. That is, this wet chemical etch is highly selective. However,
the rate of etching is the same for any direction, lateral or vertical, so the etching is
isotropic. Using an isotropic etching technique is acceptable only for relatively large
structures. For nanosize structures, anisotropic etching with faster vertical etching is
preferable.
Anisotropic etching generally exploits a physical process, or some combination of
both physical and chemical methods. The best-known method of anisotropic etching is
reactive-ion etching. Reactive-ion etching is based on the use of plasma reactions. This
method works as follows. An appropriate etching gas, for example a chlorofluorocarbon,
fills the chamber with the wafers. The pressure is typically reduced, so that a radiofrequency (RF) voltage can produce a plasma. The wafer we want to etch is a cathode
of this RF discharge, while the walls of the chamber are grounded and act as an anode.
Figure 5.6 illustrates a principal scheme for the ion-etching method. The electric voltage
heats the light electrons and they ionize gaseous molecules, creating positive ions and
molecular fragments (so-called chemical radicals). Being accelerated in the electric field,
the ions bombard the wafer normal to the surface. This normal incidence of bombarding
ions contributes to the etching and makes the etching highly anisotropic. This process,
unfortunately, is not selective. However, the chemical radicals present in the chamber
118
give rise to chemical etching, which, as we discussed, is selective. Now we see that the
method combines both isotropic and anisotropic components and can give good results
for etching on the nanoscale.
Doping
As we discussed in Section 4.2, a perfect semiconductor is a dielectric at low temperatures: the valence band is completely filled with valence electrons, while the conduction
band is totally empty. When the crystal temperature rises, some electrons can be excited
into the conduction band, which results in the creation of a pair: an electron in the conduction band and a hole in the valence band. Thus, for the electron concentration, n, and
the hole concentration, p, we obtain
n = p = ni (T ).
Here, n i (T ) is the temperature-dependent intrinsic density of electrons and holes. Such
a perfect semiconductor is itself called an intrinsic material. In practice, the intrinsic
concentrations are small and can not be controlled effectively.
Thus, it is important to find a method to create the electron or/and hole concentrations
needed for each particular device application. The most common method of varying
electron and/or hole concentrations in a crystal is the process of adding impurities to the
material; i.e., the doping process. The process is based on the following physical picture.
When an impurity is added to a semiconductor, additional energy levels are contributed by
the impurity to the energy structure of the crystal. Many of these additional energy levels
fall near the bandgap energies. Here, we consider two especially important cases: (1) the
impurity levels are close to the conduction band and (2) they are close to the valence band.
In the former case the impurities can be donors of electrons to the conduction band, while
in the latter case they can be acceptors; i.e., they will accept electrons from the valence
band and thereby generate holes in the valence band. For example, impurities from group
V of the periodic table of elements (P, As, Sb) are donors and impurities from group III
(B, Al, Ga, In) are acceptors for group IV semiconductors. The energy distances of
donor levels from the conduction band and of the acceptor level from the valence band
are much smaller than the energy bandgap. This promotes thermally induced ionization
of these impurities even at low temperatures and the creation of conducting electrons
or holes. In Fig. 5.7, the population of energy states in semiconductors with donors and
acceptors is illustrated for zero and finite temperatures.
We can perform the following simple estimates of energy levels of donors and acceptors. Consider first, for example, an atom from group V embedded in a semiconductor
material of group IV. Such an atom has five valence electrons, of which four can participate in formation of covalent bonds with neighboring group IV atoms. But the fifth
electron is an extra electron. We can consider this electron to be moving around a positive
ion, i.e., the impurity can be thought of as a hydrogen atom embedded in a dielectric
medium. If the radius of the electron state for this atom is large, we can suppose that the
electron has the effective mass m , which is characteristic for the conduction band. By
119
Figure 5.7 Doping of semiconductors: (a) donor doping and (b) acceptor doping.
applying the results of Section 3.4 that were obtained for the hydrogen atom, we find the
ground-state energy of such a hydrogen-like state of the impurity:
E1 =
m e4
,
(4
0
)2 -h2
(5.4)
with
0 and
being the permittivity of free space and the dielectric constant of the material, respectively. For an acceptor atom, we may apply the same approach, considering a
hole moving around a negatively charged ion. Equation (5.4) will be valid if the effective
mass of the electron in the conduction band, m , is replaced by the effective mass of a
hole, m h , in the valence band: m m v . Typical energies of donor and acceptor levels
in silicon, for example, are about 3060 meV from the conduction and valence bands,
and the ionization occurs at temperatures above several tens of degrees Kelvin.
Similarly to the previously considered case of materials of group IV, in IIIV compounds the atoms from group VI (S, Se, Te) are donors when they occupy sites of group
V atoms. Atoms from group II (Be, Zn, Cd) act as acceptors when substituting group
III atoms. A more complicated case occurs when doping is accomplished with atoms
of group IV; for example, with Si or Ge. These impurities are called amphoteric, which
means that they can act either as donors or as acceptors, depending on the sites they
occupy. If such an atom occupies a group III site, it brings an additional electron and
acts as a donor. When occupying a group V site, the atom accepts an electron and acts
as an acceptor. In GaAs, typically, Si impurities occupy Ga sites; thus they are typically
donors. However, if during the growth process there are As vacancies, Si atoms can fill
these vacancies and serve as acceptors.
In conclusion, the doping of semiconductor materials provides a powerful tool
for realizing the desired type of conductivity electron conductivity or hole
120
conductivity and fabricating structures with the desired values of electric resistance.
Doping methods provide a means for the control of the electronic properties over a
wide range of values. These methods are well suited for application in the fabrication of
semiconductor nanostructures.
5.4
121
Pz
Px
Vz
B
Py
VT
Control
s
s
IT
region under a potential barrier. This phenomenon was called tunneling. Thus, electron
transfer between two solids can occur as a tunneling process through (under) the vacuum
barrier. Tunneling experiments have been performed, for example, by using two metal
films separated by vacuum or a solid-state insulator (a sandwich structure). Each of the
metal films can be considered as an electrode and when a voltage bias is applied to
these electrodes a so-called tunneling electric current is produced. This current can give
information on electronic properties, but obviously the information will be averaged
over the area of the metal-film surface. By appropriate shaping of one of the electrodes
spatial resolution of far smaller scales than that of sandwich structures can be achieved.
Since vacuum is conceptually a simple tunnel barrier, such experiments pertain directly
to the properties of the electrodes and their bare surfaces. Clearly, vacuum tunneling
offers fascinating and challenging possibilities to study surface physics and many other
related topics.
The principle of STM, which is based on electron tunneling, is straightforward. It
consists essentially of a scanning metal tip (one electrode of the tunnel junction) over
the surface to be investigated (the second electrode), as depicted in Fig. 5.8. The metal
tip is fixed to a rectangular piezodrive marked as Px , P y and Pz . The piezodrive is made
of a piezoceramic material. The tunnel current, IT , is a sensitive function of the gap
between the tip and the surface, s; i.e., IT VT exp(A 1/2 s), where is the average
barrier height; the numerical value of A is equal to unity if is measured in eV and
Obviously the barrier height, 0 , is equal to the work function for a metal
s in A.
or doped semiconductor. With a typical value of of several eV, IT changes by an
change of the gap, s. The control unit, CU, applies
order of magnitude for every 1 A
a DC voltage, Vz , to the piezodrive, Pz , such that IT remains constant when scanning
the tip with Px and P y over the surface. At constant function , Vz (x, y) yields the
topography of the surface, that is z(x, y), directly, as illustrated at a surface step in
Fig. 5.8. The curvature of the tip causes the abrupt step to appear to be smeared over
a distance . For a constant tunneling current, changes in the work function, , are
122
(a)
2.5 nm
[001]
[110]
Measurement
Simulation
0.65
(b)
(d)
(c)
z || [001] (nm)
lattice constant
along [001] (nm)
height ()
123
0.60
0.55
5
10
0
5
position along [001] (nm)
10
0
y || [110] (nm)
10
10
10
y || [110] (nm)
Figure 5.9 Cross-sectional scanning tunneling microscopy (STM): (a) an image of a stack of
InAs islands in GaAs; (b) comparison between measured and simulated height profiles for a
similar sample; (c) the lattice parameter in the growth direction in an InAs island (the
experimental data were obtained from cross-sectional STM; the solid line is from a simulation
assuming an In content increasing from island base to island apex); and (d) the electronic
wavefunction measured at two different tip biases, compared with simulations for the ground
and the first excited states. Reprinted with permission from J. Stangl, V. Holy et al., Structural
properties of self-organized semiconductor nanostructures, Figs. 25 and 26, Rev. Mod. Phys.
76, 725783 (2004).
124
Figure 5.10 Atomic-force microscopy in the contact mode. The tip has dimensions of about
Atomic-force microscopy
An atomic-force microscope measures the force between the sample surface and a very
fine tip. The force is measured either by recording the bending of a cantilever on which
the tip is mounted the contact mode or by measuring the change in resonance frequency due to the force the tapping mode. For example, in Fig. 5.10, the contact-mode
AFM technique is sketched. With a typical resolution of several nanometers laterally
vertically, AFM is ideally suited to characterize the shapes of nanostrucand several A
tures. For large scan sizes up to 100 m 100 m, the lateral arrangement can also be
obtained. With AFM, any surface can be investigated; hardly any sample preparation is
required. A drawback of AFM is that only structures on a surface can be investigated.
Furthermore, most semiconductor materials oxidize under ambient conditions, so that,
strictly speaking, the AFM images usually show the surface of this oxide. When obtaining quantitative data such as lateral sizes and heights of structures, this has to be kept in
mind, as well as the fact that the image is actually a convolution of the samples surface
morphology with the shape of the microscope tip. Figure 5.10 is a schematic view of a
contact-mode AFM. Essentially, a micrometer-size cantilever has an extremely sharp tip
attached to it, which is sharpened to about 3050 nm at the end. A low-power probe laser
beam is reflected off the top of the cantilever and into a four-quadrant photodetector,
which records the position of the reflected beam. Note that the probe beam need not be
perfectly aligned (as long as some part of the beam is reflected into the detector, and the
surface does not reflect too heavily into the detector), and need not even be smaller than
the detector (since the difference between the quadrant signals allows the determination
of the beam position). The photodetector measures the position of the reflected beam,
which in turn gives information about the position of the cantilever and hence the tip. If
the whole apparatus is raster-scanned across the surface (or the sample is scanned under
the microscope), then an image of the surface relief can be generated.
125
(b)
Figure 5.11 PbSe islands with [001]-type facets: (a) an AFM image of the top surface of a
PbSe/PbEuTe island multilayer; (b) an AFM image of an area of 3 m 3 m of the top
surface of a PbSe/PbEuTe island multilayer. Islands are arranged in a regular array up to the
sixth-nearest neighbor. Reprinted with permission, from J. Stangl, V. Holy et al., Structural
properties of self-organized semiconductor nanostructures, Figs. 25 and 26, Rev. Mod. Phys.
76, 725783 (2004).
Examples of the quantitative analysis of AFM images are shown in Fig. 5.11. There,
the top surface of PbSe/PbEuTe multilayers is shown. Both materials are semiconductors.
From Fig. 5.11(a), one can see that PbSe forms triangular pyramids with [001]-type side
facets. The lateral ordering can also be analyzed by AFM. In Fig. 5.11(b), a hexagonal
in-plane arrangement of the pyramids is evident.
126
Scanning Electron
Microscope
Transmission Electron
Microscope
Source
Condenser
Sample
Scan Coils
Objective
System
Projector Lens
Screen
Sample
Detector
Figure 5.12 Schematic diagrams of a scanning electron microscope (SEM) and a transmission
the electron beam is not projected through the whole sample area. Instead, it is rasterscanned across the surface, and the secondary electrons, or X-rays, emitted from the
surface are recorded. This generates a lower-resolution image, but allows the direct
mapping of surface features, and can even be used for elemental analysis. Both types of
electron microscope are depicted schematically in Fig. 5.12.
The electron microscopy techniques are widely used due to their very high spatial
resolution and sensitivity to composition. TEM can be performed either on thin slices
parallel to the sample surface (plane-view TEM) or on cross-sectional slices. Hence,
buried nanostructures can be examined well by TEM, with some restrictions due to
specimen preparation: in many cases, the lateral island diameter is comparable to the
slice thickness. The image analysis is often not straightforward but requires elaborate
image-analysis techniques and/or model calculations. Compared with other techniques,
usually very small areas are investigated, so that no statistically averaged values can be
obtained.
We present here only one example of atomically resolved transmission electron
microscopy, from which the positions of unit cells, strain, and composition information were derived. The results obtained, after digital analysis of the lattice image, are
given in Fig. 5.13: the strain distribution (i.e., distortion of atoms constituting the island
from their regular positions) for an InGaAs island on a GaAs substrate is depicted.
Remarkably, these techniques make it possible to visualize a detailed map of the strain
for an object of size a few tens of nanometers. From Fig. 5.13, it is seen clearly how
127
Figure 5.13 Strain distributions obtained from the TEM images of InGaAs islands in GaAs by
using the method of digital analysis of lattice images. Reprinted with permission, from J. Stangl,
V. Holy et al., Structural properties of self-organized semiconductor nanostructures, Fig. 27(a),
Rev. Mod. Phys. 76, 725783 (2004).
the strain increases at the apex of the InGaAs island, while around the island the strain
changes its sign.
In conclusion, powerful characterization techniques have been developed to study
nanosize objects. The techniques give three-dimensional images in real space and on
an atomic scale in all three dimensions. The methods are nondestructive. They provide
means to perform structural and chemical analyses of the materials used in nanostructures. Moreover, these techniques make it possible to observe and measure directly the
electron distributions inside the nanostructures; that is, it is possible to observe the
electron probability densities!
5.5
128
(a)
(b)
(c)
Figure 5.14 Three types of periodic structures: (a) periodically faceted surfaces; (b) planar
[001]
[110]
Unreconstructed
Reconstructed
There are three distinctive classes of spontaneously formed nanostructures on a surface, as illustrated in Fig. 5.14. These nanostructures are (1) periodically faceted surfaces
(Fig. 5.14(a)), (2) periodic structures of planar domains (Fig. 5.14(b)), and (3) ordered
arrays of three-dimensional coherently strained islands in lattice-mismatched heteroepitaxial systems (Fig. 5.14(c)). Despite the fact that the geometries of the three classes
are different, there exist common features for all of these nanostructures. The main one
is that the driving force of the periodic ordering is a long-range elastic interaction. The
elastic-strain field is created due to the discontinuity of the intrinsic elastic properties on
domain boundaries and/or by the lattice mismatch between two materials composing the
heteroepitaxial system. The three classes depicted in Figs. 5.14(a)(c) are equilibrium
structures. In particular, they can be observed upon annealing of the crystal, or upon
interruption of the crystal growth. Here we restrict our discussion to the formation of
three-dimensional coherently strained islands.
We begin with a brief discussion of the strain arising at surfaces of solids. Since atoms
in the surface layer of any material are in a different environment from that experienced
by those in the bulk, the surface layer energetically favors a lattice parameter different
from the bulk value in the directions parallel to the surface. Being adjusted to the bulk
lattice, the surface layer is intrinsically stretched or compressed. Therefore, the surface is
characterized by intrinsic surface stress. At the surface, even the symmetry of the crystal
can be changed; this is known as surface reconstruction. Figure 5.15 illustrates such a
reconstruction of the Si surface.
FM
VW
129
SK
Figure 5.16 Three growth modes of heteroepitaxial systems: Frankvan der Merwe (FM),
VolmerWeber (VW), and StranskiKrastanow (SK).
To some extent the intrinsic surface stress of a solid is analogous to the surface tension
of a liquid. However, there is a fundamental difference between the properties of a liquid
surface and those of a solid surface. Indeed, liquids are typically isotropic. The processes
of formation and deformation of a liquid surface are identical and can be described by
a single parameter that characterizes the energy of cutting of bonds on the surface.
This is the so-called surface energy. However, in crystals the process of the formation
of a surface is quite different: the distance between atoms changes and symmetry can
change. This is quite different from the simple cutting of bonds. In addition, crystals are
anisotropic and the energy needed to create a free surface of a given orientation depends
on the orientation.
These features of the surfaces of solids give rise to different epitaxial growth regimes.
Three growth modes are distinguished traditionally. They are the (1) Frankvan der
Merwe (FM), (2) VolmerWeber (VW), and (3) StranskiKrastanow (SK) growth modes.
Respectively, they can be described as (1) layer-by-layer growth, (2) island growth (threedimensional), and (3) layer-by-layer plus island growth. These three modes are illustrated
in Fig. 5.16. The particular growth mode for a given system depends on the surface
energies and on the lattice mismatch between the material of the substrate and that of
the grown layer. The growth regimes discussed in Section 5.2 are of the FM type.
In the following discussion we define the surface energy as the excess energy of a very
thin distorted layer (one or two monolayers) at the free surface of a solid. The interface
energy is the excess energy of a thin layer at the interface between two solids. In latticematched systems, the growth mode is governed only by interface and surface energies.
Let 1 and 2 be the surface energies of the substrate and epitaxial layer (epilayer),
respectively, and let 12 be the interface energy. Then, if the sum of the epilayer surface
energy and the interface energy is lower than the energy of the substrate surface, i.e.,
1 > 2 + 12 , the deposited material wets the substrate and the FM growth mode occurs.
A change in 2 + 12 can drive a transition from the FM to the VW growth mode. These
two modes of coexistence of crystalline materials are quite analogous to those of a liquid.
For a strained epilayer with small interface energy, 12 , the initial growth may occur
layer-by-layer; however, as the layer becomes thicker it may lower its increasing strain
130
UF
DI
CI
Figure 5.17 Elastic-strain relaxation during StranskiKrastanow growth (schematic). Light gray
areas denote the substrate, dark gray areas denote the lattice-mismatched epilayer. The lines
symbolize lattice planes. Top, uniformly strained film (UF); middle, dislocated relaxed islands
(DI); and bottom, coherently strained islands (CI).
energy by forming isolated islands in which strain is relaxed. This results in the SK
growth mode. Thus, the SK mode depends critically on the lattice mismatch. Three
scenarios for strain relaxation are sketched in Fig. 5.17. Figure 5.17(a) corresponds to
uniform pseudomorphic growth without any stress relaxation, which can exist for a very
thin strained layer, as discussed in Section 4.7. Figure 5.17(b) corresponds to a dislocated
relaxed island. Finally, the island in Fig. 5.17(c) is coherently strained. From the point
of view of the use of self-forming nanosize islands, coherently strained structures are of
most importance, because of their high quality.
The scenarios presented in Fig. 5.17 occur as a result of the interplay of several
parameters: (1) the ratio of the surface energy and the energy of the dislocated interface,
(2) the amount of deposited material Q, and (3) the lattice mismatch,
(see Eq. (4.23)
and Fig. 4.16). As an example, a diagram of possible growth regimes is depicted in
Fig. 5.18. In Fig. 5.18, the various growth regimes are identified in terms of the amount
of deposited material, Q, versus lattice mismatch,
.
For a dense system of islands, the elastic interaction between islands via deformation
of the substrate is essential. The system of interacting islands is then a system of elastic
domains where the energy minimum corresponds to a structure with periodic domains.
Thus, there is a possibility of growing an ordered system of nano-islands.
Following this general analysis of formation of three-dimensional islands under pseudomorphic growth of crystalline materials, we consider particular examples of such
self-growing nanosize heterostructures.
The first experimental evidence of the formation of the coherently strained islands
was obtained by transmission electron microscopy of the InAs/GaAs system (GaAs is
the substrate, while InAs is the epilayer). As follows from the data of Table 4.8, this is
the lattice-mismatched system with mismatch parameter
7%. Coherently strained
and stable islands have been found for many systems: Ge/Si, GeSi/Si (Si substrate of
[100] orientation), AlInAs/GaInAs, InAs/InP, CdSe/ZnSe, and others.
R1
1
5
R1
R2
2
131
R3
3
R2
R3
3
2
FM
SK1
SK2
VW
1
0
0.0
0.1
FM
SK
0.2
VW
average lateral size, the height, and the density of the islands are 100140 A,
and 1010 1011 cm2 , respectively. Figure 5.19 depicts a single InAs island on GaAs
substrate studied by scanning tunneling microscopy.
In general, results of growth and post-growth annealing studies show that the islands
grown in the SK mode are usually stable against ripening. Under given growth conditions
they have well-defined sizes and shapes. For example, in SiGe/Si, essentially only four
forms of islands are observed: (1) shallow pre-pyramids, (2) square pyramids, (3) hut
clusters elongated pyramids, and (4) large domes with facets in several directions.
Figure 5.20 depicts a diagram that links possible shapes of islands with their volume
and the mismatch parameters for Ge or GeSi on Si substrate. In the first stage of growth,
shallow pre-pyramids appear that later convert to pyramids and then to domes. Pyramids
and domes are clearly observed during growth at higher temperatures, whereas the much
smaller hut clusters form at lower growth temperatures. The typical lateral sizes of the
132
[110]
[110]
50
Figure 5.19 A single InAs island on GaAs substrate. Reprinted with permission, from J. Stangl, V.
Figure 5.20 Shape transition of Ge or SiGe islands grown on Si [001] during growth (solid arrow),
postgrowth annealing (dotted arrow), and Si capping (dashed arrow). The solid curves represent
the critical volumes for pyramids and domes. Corresponding STM images are embedded in the
diagram. Reprinted with permission, from J. Stangl, V. Holy et al., Structural properties of
self-organized semiconductor nanostructures, Fig. 14, Rev. Mod. Phys. 76, 725783 (2004).
133
Figure 5.21 AFM images of the surface of a Ge layer grown on lithographically prepatterned Si
[001] substrates. In the sample shown on the top left, the islands are arranged in a regular array
along two orthogonal [110]-directions. In the sample shown on the bottom right, the unit vectors
of the two-dimensional array of pits are oriented along the [100]- and [110]-directions, leading
to a 45 island alignment. X-ray diffraction (top right and bottom left) demonstrates the high
degree of ordering of islands. Reprinted with permission from J. Stangl, V. Holy et al.,
Structural properties of self-organized semiconductor nanostructures, Fig. 21, Rev. Mod.
Phys. 76, 725783 (2004).
of islands can yield short-range ordered patterns with astonishing regularity; however,
this process does not result in true long-range ordering. Ordering can be realized by
using island nucleation on lithographically prepatterned substrates. Similarly to the case
for nonpatterned substrates, for prepatterned substrates the nucleation of the islands
is affected by the surface curvature and the surface stress. Thus, the substrate can be
prepared to provide initial nucleation at selective locations on the substrate. Various
methods can be used; e.g. shallow etching of trenches in Si and subsequent overgrowth
with a GeSi multilayer, followed by Ge nucleation; and direct growth of Ge in twodimensional periodically etched pits. The latter method results in long-range-ordered
island structures, as shown in Fig. 5.21. Nucleation occurs at the intersection of the side
facets within the pits. Thus, a combination of self-organizing processes in crystal growth
with prepatterning methods may be used to achieve precise placement of individual
islands, as well as high ordering throughout nanosize heterostructures.
In conclusion, for device applications, uniform islands with controlled positioning are
required. This goal is difficult to achieve using self-organization alone, but a combination
with other techniques may be successful. The following three routes are most promising:
(1) a combination with conventional lithography will allow controlled positioning of selforganizing nanostructures without losing their inherent advantages; (2) seeded growth
with catalytic nanoparticles facilitates the fabrication of structures significantly smaller
134
than the SK islands; and (3) a combination of self-assembly with the techniques of
organic chemistry and biochemistry. Nowadays, the controlled fabrication of extremely
small structures of a few nanometers in all three dimensions and their selective addressing
seems quite feasible.
5.6
135
Figure 5.22 Silicon nanoclusters absorbed on a (111) surface of Si. Several clusters, of differing
sizes, are seen on this STM image. Reprinted with permission from D. W. McComb, B. Collings
et al. An atom-resolved view of silicon nanoclusters, Chem. Phys. Lett. 251, 812 (1996).
Permission from Elsevier.
Si clusters. This explains the large sticking coefficient for Si clusters. As a result, on
being deposited onto the surface, these clusters do not diffuse onto the substrate at room
temperature.
Compound semiconductors can also be synthesized in the gas phase. For example,
laser evaporation of materials containing Ga and As atoms in an inert gas produces
GaAs clusters with dimensions of tens of nanometers. If these clusters are exposed to
air, they are capped by an As-rich oxide shell. Then, the clusters can be deposited
onto a surface (Si-(100), for instance) and manipulated by using an atomic-force
microscope.
The colloidal synthesis of almost monodisperse nanoparticles is based on controlled
nucleation and growth of clusters in a precursor-containing solution. Both metallic and
semiconducting clusters can be grown. The most studied case is synthesis of IIIV
and IIVI compounds. The method involves injection of reagents containing the cluster
constituents (for example, Cd and Se) into a hot solvent, where nucleation of CdSe
occurs. Molecules of the solvent cap and thereby passivate the nucleated cluster. For
the above example, tri-n-octylphosphide (TOPO) is used as a solvent. The reagents are
Me2 Cd and TOPSe.
A hot solution of TOPO is favorable for nucleation of CdSe. TOPO capping the
clusters significantly slows the cluster growth and makes the properties of the product
more controllable. Careful control of the solution temperature enables production of
nanoparticles with a small size dispersion. The size dispersion of nanocrystals can be
within a few percent of the average diameter. The latter is a few to tens of nanometers. The shape of nanocrystals is close to spherical. On being deposited onto a surface,
these nanocrystals form an ordered lateral structure. In Fig. 5.23, a tunneling electron microscope image is presented, which provides evidence of the formation of a
136
50 nm
Figure 5.23 A transmission electron microscopy image of CdSe nanocrystals of size 4.8 nm
grown by the colloidal method and then deposited on a surface. X-ray diffraction demonstrates
the high ordering of these nanocrystals. Reprinted with permission, from P. Moriarty,
C IOP Publishing
Nanostructured materials, Fig. 19(b), Rep. Prog. Phys. 64, 324 (2001).
Limited.
5.7
137
Figure 5.24 Catalytic growth of a (10, 10) armchair carbon nanotube using a metal atom (large
black sphere) as a catalyst. Carbon atoms are depicted by white balls. Reprinted with
permission, from J. Charlier and S. Iijima, Growth mechanisms, in M. S. Dresselhaus,
G. Dresselhaus, and Ph. Avouris (Eds.), Carbon Nanotubes, Fig. 11, Topics in Applied Physics
vol. 80 (Berlin, Springer-Verlag, 2001), 5579.
138
(a)
He
Arc-discharge
(b)
Laser ablation
Laser
pulse
(c)
CVD
CnHm
Catalyst
Figure 5.25 Schematic diagrams of arc-discharge (a); laser ablation (b); and chemical-vapor-
139
Figure 5.26 A bundle of carbon nanotubes grown by laser ablation. Reprinted with permission,
from Hongjie Dai, Nanotube growth and characterization, in M. S. Dresselhaus,
G. Dresselhaus, and Ph. Avouris (Eds.), Carbon Nanotubes, Fig. 4, Topics in Applied Physics,
vol. 80, pp. 2953 (Berlin, Springer-Verlag, 2001).
The process of growth of high-quality single-walled nanotubes takes place also during
the so-called laser-ablation (laser-oven) method illustrated by Fig. 5.25(b). The method
utilizes intense laser pulses to ablate a carbon target containing 0.5 atomic percent of
catalytic nickel and cobalt. The target is placed in a tube-furnace heated to 1200 C.
During laser ablation, a flow of inert gas is passed through the growth chamber to
carry the grown nanotubes downstream to be collected on a cold finger, where carbon
condensation occurs. The resulting tubes are primarily in the form of ropes consisting
of tens of individual nanotubes close-packed into hexagonal crystals via the van der
Waals interactions. Packaging of individual nanotubes into a bundle is clearly seen in
Fig. 5.26.
Typically, arc-discharge and laser-ablation methods lead to a number of by-products:
fullerenes (graphitic polyhedrons, some with enclosed metal particles), amorphous carbon, etc. Further purification is necessary to obtain the nanotubes. This process involves
refluxing the nanotubes in a nitric acid solution for an extended period of time, which oxidizes away amorphous carbon particles and removes metal catalyst species. Production
of single-walled nanotubes of high quality by laser ablation and arc-discharge is relatively simple and has already led to the wide availability of samples useful for studying
their fundamental properties.
Chemical-vapor deposition
A schematic experimental setup for chemical-vapor-deposition growth is depicted in
Fig. 5.25(c). The growth process involves heating a catalyst material to high temperatures in a tube furnace and flowing a hydrocarbon gas through the tube reactor for a
period of time. Materials grown over the catalyst are collected upon cooling the system to
140
room temperature. Parameters controlling nanotube growth are the hydrocarbons, catalysts, and growth temperature. The active catalytic species are typically transition-metal
nanoparticles formed on a support material such as alumina. The growth mechanism
involves the dissociation of hydrocarbon molecules catalyzed by the transition metal,
and dissolution and saturation of carbon atoms in the metal nanoparticle. The precipitation of carbon from the saturated metal particle leads to the formation of tubular
carbon solids. Tubule formation is favored over formation of other forms of carbon
such as graphitic sheets with open edges. This is because a tube contains no dangling
bonds.
Relatively high temperatures (10001300 K) are necessary to form single-walled
nanotubes with small diameters and allow the production of nearly defect-free nanotube
structures. Among all hydrocarbon molecules, methane is the most stable at high temperatures against self-decomposition. The methane chemical-vapor-deposition approach
is promising for enabling scale-up of defect-free nanotube materials to the kilogram or
even ton level.
141
Figure 5.27 Pillar tops are connected by suspended single-walled nanotubes that form (a) a
nanotube power-line-like structure; (b) a square of nanotubes; and (c) an extensive network of
suspended nanotubes. Reprinted with permission, from Hongjie Dai, Nanotube growth and
characterization, in M. S. Dresselhaus, G. Dresselhaus, and Ph. Avouris (Eds.), Carbon
Nanotubes, Fig. 9, Topics in Applied Physics, vol. 80 (Berlin, Springer-Verlag, 2001), pp. 2953.
fixation points for the growing tubes. If a waving tube contacts an adjacent tower, the
tubetower van der Waals interactions will catch the nanotube and hold it aloft. By such a
growth mode the suspended nanotubes can be made of macroscopic length. For example,
tubes longer than 150 m can be grown.
5.8
142
143
Figure 5.28 Molecular graphics of the triblock self-assembling molecule (left) and a cluster of
triblock macromolecules (center and right), shade-coded for energy in the various sectors of the
cluster. Reprinted with permission from S., Stupp, V. LeBonheur et al., Supramolecular
C 1996 AAAS.
materials: self-organized nanostructures, Science, 276, 384 (1997).
144
End group
Backbone
Head group
Substrate
Figure 5.29 A schematic diagram of a triblock polymer film; the polymer is shown standing-up.
After F. Schreiber, Self-assembled monolayers: from simple model systems to
C IOP Publishing
biofunctionalized interfaces, J. Phys.: Condens. Matter, 16, R881 (2004).
Limited.
Figure 5.30 A transmission electron micrograph of a film formed by the triblock molecules,
revealing regularly sized and shaped aggregates that self-organize into superlattice domains.
Reprinted with permission from S. Stupp, V. LeBonheur et al., Supramolecular materials:
C 1997 AAAS.
self-organized nanostructures, Science, 276, 384 (1997).
There are two ways to create nanostructures using self-assembly of block copolymers:
(i) precipitation of metal or other inorganic crystals onto a polymer mask attached to a
substrate; and (ii) using the polymer mask for the subsequent processing of the substrate,
or as a support for additional auxiliary masks. After the mask has been formed on the
surface of a semiconductor, the device-fabrication technology remains the same as with
the conventional nanofabrication and employs dry ion or selective etching, oxidation,
metallization, diffusion, selective growth, etc.
Biological methods
As examples of the use of biological methods as elements of nanoscale technologies, we
consider nanopatterning and nanoassembly techniques that employ proteins and DNA,
145
Side R Group
Amino Group N
C
H
146
NH
C
NH2
NH
CH2
OH
CH2
NH2
CH
COOH
Arginine
(Arg or R)
CH2
CH2
CH
NH2
Glycine
(Gly or G)
COOH
NH2
CH
COOH
Aspartic Acid
(Asp or D)
Such a process provides a very simple example of how chemistry may be used to selfassemble molecules and structures. By binding a few amino acids together a peptide is
formed. From this discussion it is clear that amino acids are indeed the building blocks
of many biomolecules, including peptides, proteins, and antibodies. As explained and
illustrated previously, these biomolecules peptides, proteins, and antibodies may be
formed through binding of amino and carboxylic groups at the two ends of the amino
acids. The full set of twenty common amino acids has the following members: alanine
(A), arginine (R), asparagine (N), aspartic acid (D), cysteine (C), glutamine (Q), glycine
(G), glutamic acid (E), histidine (H), isoleucine (I), leucine (L), lysine (K), methionine
(M), phenylalanine (F), proline (P), serine (S), threonine (T), tryptophan (W), tyrosine
(Y), and valine (V).
Just as the binding of NH2 and COOH groups plays a role in linking many
biomolecules, they may be used to bind a COOH-functionalized (COOH-coated) quantum dot to the amino terminus of a peptide, protein, or antibody. In this way, we mimic
a self-assembly technique found in nature as a key step in our arsenal of nanofabrication techniques. It is also possible to use another chemical bond found in nature the
thiol bond to provide a frequently used technique for assembling nanodevice structures. Specifically, the amino acid cysteine, depicted in Fig. 5.33, and several other
amino acids (leveine, valine, methionine, and serine), have special uses in chemical selfassembly since each of these amino acids contains a sulfur (S) atom in its side group, R.
Specifically, the sulfur atom has an affinity for binding to a (111) surface of gold and it
binds to quantum dots like CdS through an SS or thiol bond. These thiol bonds are also
instrumental in the phenomenon of protein folding, where they cause cysteine molecules
at various points along a given protein to be attracted to each other, thus causing protein
folding. Indeed, these SS bonds are the strongest bonds found of all possible bonds
among amino acids in protein systems.
CH3
NH2
CH3
CH
CH3
CH2
CH2
CH3
CH
COOH
CH
COOH
NH2
Leveine
(Lev or L)
NH2
147
CH3
Valine
(Val or V)
SH
OH
CH2
CH2
CH2
CH2
CH
Cysteine
(Cys or C)
COOH
NH2
CH
COOH
Serine
(Ser or S)
NH2
CH
COOH
Methionine
(Met or M)
The affinity of sulfur to bind gold is sufficient to result in the wide use of SAu
bonds to bind molecules to Au surfaces. In the field of molecular electronics, there are
efforts to use molecules as wires, and it is necessary to bind these wires to metal contacts.
By chemically binding a sulfur atom to the end of such a molecular wire, it is possible
to bind the S-functionalized ends of the wire to Au. The use of these SAu bonds in
molecular electronics is so prevalent that they are referred to as the alligator clips of
molecular electronics.
Now that we have considered basic properties of proteins, we turn to their applications
in nanofabrication. It is known that proteins form the external surface of many bacteria.
Analogously to the case of block copolymer films analyzed previously, surface-layer
proteins can be used as nanoscale biological masks. As shown schematically in Fig. 5.34,
surface-layer proteins can be isolated from bacterial cells, and subsequently reassembled
on the surfaces of a solid. The surface-layer proteins can be deposited upon various
metals and semiconductor crystals. On their surfaces, surface-layer proteins form twodimensional periodic structures of various symmetries. The sizes of the elementary cells
of these surface crystals range from 3 nm to 30 nm. The thickness of an artificially built
surface layer is about 28 nm, and it has pores of various sizes (28 nm) and shapes. In the
upper part of Fig. 5.35, one such periodic structure is shown. The figure illustrates typical
processing steps for nanopatterning: (a) deposition of surface-layer protein crystals onto
the substrate; (b) metallization by deposition of metal; (c) dry etching, which allows the
transfer of the pattern to the substrate; and (d) a plane view of the resulting nanostructure
with highly ordered holes.
S-layer protein
148
bacterial cell
envelope
isolation of S-layer proteins
reassembly
solid
liquid
Figure 5.34 A schematic drawing of the isolation of surface-layer proteins from bacterial cells
S-layer
(a)
Pore
Mask deposition
Protein
Substrate
(b)
Metal shadowing
(c)
Hole sites
(d)
Dry etching
Metal overlayer
Protein
Metal (oxide)
Protein
Hole sites
Metal (oxide)
on top of
protein
Figure 5.35 Processing steps for nanopattern transfer: (a) deposition of S-layer protein crystals
onto the substrate; (b) shadow metallization by electron-beam deposition; (c) dry etching to
transfer the pattern to the substrate; and (d) a plane view of the idealized nanostructure. After
T. A. Winningham, S. G. Whipple, et. al., Pattern transfer from a biological nanomask to a
substrate via an intermediate transfer layer, J. Vac. Sci. Technol., 19, 17961802 (2001).
Reprinted with permission from Thomas A. Winningham, Steven G. Whipple, and Kenneth
C 2001 AVS The
Douglas, Journal of Vacuum Science & Technology B, 19, 1796 (2001).
Science & Technology Society.
149
Figure 5.36 A scanning electron microscopy image of GaAs semiconductor wires obtained by
means of the surface-layer protein technology. The wires have a length of 80 nm and a diameter
of about 8 nm. Reprinted with permission, from M. Haupt, S. Miller et. al., Nanoporous gold
films created using templates formed from self-assembled structures of inorganic-block
copolymer micelles, Advanced Materials, 15, 829831 (2003), Fig. 3(a), p. 831.
Then, such a surface superlattice of holes can be used, for example, to fabricate
semiconductor nanoscale free-standing wires. Figure 5.36 depicts a scanning electron
microscopy image of semiconducting GaAs wires fabricated using surface-layer protein
technology. The wires have a length of 80 nm and a diameter of about 8 nm at the top
of each wire, while the base is thicker. High ordering of free-standing nanoscale wires
is obtained using this biology-based method of nanopatterning.
Biological methods facilitate the realization of the assembly of nanoscale inorganic
building blocks into functional materials. As an example, the biological macromolecule
DNA may be used as a synthetically programmable assembler of nanoscale structures
in order to fabricate a wide variety of different ensembles of selected nanocomponents.
Such ensembles may be assembled in a variety of architectures. This method is based on
the molecular-recognition properties associated with DNA. DNA may be prepared and
functionalized with virtually any chromophore, acceptor, donor, or an active group in
an automated fashion. It is known that the molecular-recognition properties arise due to
base pairing in DNA strands. Specifically, the commonly encountered double-stranded
DNA is composed of two single strands of DNA. These single strands are composed
of linear sequences of the four DNA bases adenine (A), cytosine (C), guanine (G), and
thymine (T). The two single strands bind together to form a single strand as a result of
the high affinities for G and T to bind together and C and A to bind together. Thus, as
a simple example, GTCAC and TGACA bind together to form a double-stranded DNA
molecule. The two single strands, GTCAC and CAGTG, are complementary to each
other. These molecular-recognition processes can guide the assembly of nanoparticles
into extended structures. In principle, the method selects nanoparticles with certain
chemical compositions and sizes, and it controls also the distance and coupling between
the particles in the resulting nanostructured materials.
150
Au nanoparticles
Modification with
3' thiol TACCGTTG 5'
B
B
A
B
Modification with
5 ' AGTCGTTT 3' thiol
B
A
5' ATGGCAAC
TCAGCAAA 5'
B A
Further oligomerization
and settling
A B A B A
B A B A B
Figure 5.37 Formation of a biomoleculeinorganic cluster hybrid. Reprinted with permission,
As nanoparticle building blocks, nanocrystals of metals, semiconductors, magnetic particles, fullerenes, etc. are currently available. The sizes of these nanocrystals/nanoparticles can be controlled precisely, in many cases from 1 nm to 20 nm in
diameter.
The formation of a biomoleculeinorganic cluster hybrid is one of the important biological methods for creating nanoscale architectures. This process can be illustrated by the
following example. Let two different sequences of non-complementary eight-base-pair
DNA be synthesized with alkanethiol endgroups. Then, two batches of Au nanoparticles are functionalized with these DNA strands. In the scheme presented in Fig. 5.37 this
functionalizing process is shown in the upper part of the figure. Two particular sequences
of DNA molecules are indicated as well. If the two batches of Au particles were simply
mixed, there would be no DNA recognition and hence no particle aggregation. However,
the addition of linker DNA strands changes the situation. Such a linker molecule is composed of a DNA molecule that has three regions: a central region of double-stranded
DNA and two end regions that are composed of unpaired single strands of DNA. These
unpaired single-stranded ends are known as sticky ends. If the linker DNA molecules
contain eight-base-pair sticky ends that are complementary to the base pairs attached
151
to the Au particles, the particles start to aggregate, as shown in the lower part of the
scheme in Fig. 5.37. The process of linking these nanoparticles with DNA molecules is
known as oligomerization since DNA molecules are also known as oligonucleotides.
In this example, the so-called sticky ends, 5 ATGGCAAC and TCAGCAAA5 are single strands of DNA bound to opposite ends of a segment of double-stranded DNA.
The double-stranded portion of this molecule is represented by the ladder symbol in
Fig. 5.37. As is illustrated in Fig. 5.37, the 5 ATGGCAAC sticky end binds to the
3 thiolTACCGTTG complementary molecule which is bound to an Au quantum dot by
this bond. Likewise, the TCAGCAAA5 sticky end binds to the 5 AGTCGTTT3 thiol
complementary molecule which is bound to an Au quantum dot by the thiol bond.
Thus, an object can be built from nanosized building blocks linked by DNA molecules.
By such a method, both two-dimensional and three-dimensional ordered nanosystems have been fabricated. Figure 5.38 depicts a transmission electron microscopy
image of a two-dimensional structure of Au nanoparticles linked by DNA molecules.
Similarly, semiconductor nanoclusterDNA hybrids have been used to construct
extended macroscopic structures of CdSe/ZnS quantum dots interconnected with DNA
molecules. The wavelength of light emitted by a quantum dot is determined by its size as
well as by the material. Table 5.1 summarizes the bandgaps of some direct-bandgap materials that are used to fabricate quantum dots. The materials represented in Table 5.1 have
wavelengths spanning the ultraviolet, visible, and infrared regions of the electromagnetic
spectrum.
When we integrate DNA with inorganic building blocks, we can exploit some of the
properties of the latter. For example, semiconductor nanocrystals hybridized with DNA
can be used as fluorescent biological labels, because of specific optical properties of
152
Bandgap (eV)
AlN
CdS hexagonal
CdS cubic
CdSe hexagonal
CdSe cubic
CdTe
PbS
PbSe
ZnS
GaN
6.15
2.4
2.55
1.75
2.17
1.49
0.41
0.27
3.68
3.36
153
Figure 5.39 A quantum-dot network assembled using chemically directed assembly techniques.
networks using chemically directed assembly with molecular links, it is necessary to use
molecular links that conduct electricity.
Since proteins (and short proteins peptides) constitute one of the major classes of
biomolecules, an understanding of the electronic properties of these general classes of
molecules is of potential importance in bioelectronics. Charge transport in peptides has
recently been studied by inserting amino acids containing a side group containing a
natural chromophore, which is a molecule that produces charge carriers when exposed
to light. In this way, charge can be introduced into the peptide by illuminating the chromophore in the side group. In addition, phenylalanine, tryptophan, and tyrosine have
been considered theoretically as possible conducting elements of peptides since their
side groups are rich in conductive -bonds. As we discussed for conducting polymers,
such -bonds provide charge conduction. Results from the chromophore-based studies of charge transport indicate that charges move along these peptide-based wires at
speeds of about 1.5 105 cm s1 , a relatively slow speed from the standpoint of Siand GaAs-based electronic devices. This relatively slow transport implies that electronic
systems of integrated nanocrystals with peptide-based interconnects must be designed
with architectures that overcome the limitations imposed by carrier-transport speeds in
peptides; perhaps, the design of massively parallel architectures offers a possible resolution of the difficulties imposed by the limited propagation speed in the peptide-based
interconnects.
Our previous discussion of the use of DNA as a self-assembling agent makes it clear
that DNA may be used to link metallic and semiconducting nanostructures. There are also
current research efforts aimed at using DNA as molecular wires that conduct electric
current. Of course, these efforts stem from the growing interest in the use of DNA
as a charge-carrying element in bioelectronic devices. In these efforts, a key parameter
determining the change of transport properties of DNA is the highest occupied molecular
orbital (HOMO). As shown in Fig. 5.40, the commonly encountered type of DNA is a
double-stranded helix-shaped molecule with each of the two strands being composed
of sequences of the following four bases: adenine (A), cytosine (C), guanine (G), and
thymine (T). As discussed previously, these sequences may contain the A, C, G, and T
bases in any order, and the adjacent bases on the two strands are pairs of either GC or
154
h+
h+
h+
h+
Light
e
DNA
Quantum Dot
Ev
h+
E HOMO
AT. For a simple double strand of DNA that contains only GC pairs, the energy levels
of the DNA exhibit a gap of the approximate magnitude of 2.0 eV between the HOMO
and the lowest unoccupied molecular orbital (LUMO).
In molecules, energy states may be filled, partially filled, or empty just as in semiconductors. As discussed earlier in this book, the valence band of an undoped semiconductor
is nearly full of electrons at low temperatures, and the conduction band is nearly empty
under these conditions. For molecules, it is traditional to refer to the HOMO simply as
the HOMO level or the HOMO band. The highest energy in the HOMO band, therefore, plays a role analogous to the high-energy edge, or top, of the valence band, E v , of
a semiconductor. Similarly, there is an energy gap between the top of the HOMO band
and the low-energy edge, or bottom, of the next (empty) molecular orbital, known as the
LUMO, which is referred to frequently as the LUMO band. The bottom of the LUMO
band plays a role analogous to the low-energy edge of the conduction band, E c .
One of the current approaches to studying charge transport in DNA is to bind DNA
to a semiconductor quantum dot and to inject charge into the quantum dot as shown in
Fig. 5.40. As shown in the lower portion of Fig. 5.40, it is desirable in such experiments
to pick a semiconducting material for the quantum dot that has some of its energy states
aligned with some of the orbitals of the DNA. In the case illustrated in Fig. 5.40, the
valence-band energy is selected so that it falls just below the HOMO of the DNA. For
a DNA molecule that is composed of a pure strand of G bases bound to a pure strand
of C bases, the top of the HOMO band has an energy of 7.34 eV as measured relative
to the vacuum level. Two readily available colloidal semiconductor quantum dots with
valence bands having energies close to the HOMO band energy are TiO2 and ZnO.
TiO2 has conduction- and valence-band energies of 4.21 eV and 7.41 eV relative
to the vacuum level, respectively. ZnO has conduction- and valence-band energies of
4.19 eV and 7.39 eV relative to the vacuum level, respectively. For either of these
semiconductors, the energy alignment of the valence-band edge with the HOMO band
is as shown in Fig. 5.40. With such a band alignment, holes in the semiconductor may
be injected into the DNA.
155
Dip-pen nanolithography
Since the invention of the scanning tunneling microscope and similar techniques, there
have been attempts to develop new nanolithography methods. In particular, STM- and
AFM-based methods were applied to oxidize, scrape, or etch nanostructures on surfaces.
Some of these methods were mentioned in previous sections. However, proposed methods
are generally limited to the growth of thin oxides on selected metal and semiconductor
surfaces, or to multistep etch procedures that cannot be generalized to parallel and
productive patterning of nanostructures.
Finally, dip-pen nanolithography (DPN) was introduced as a direct-write scanningprobe-based lithography in which an AFM tip is used to deliver chemical reagents directly
to nanoscopic regions of a target substrate, as shown in Fig. 5.41.
The DPN technique is a type of soft lithography, where the word soft refers to the
chemical composition of the nanostructures which can be fabricated. They are made
156
Molecule to be
Assembled
Water Meniscus
Writing Direction
Solid Substrate
Figure 5.41 The principal writing element of a dip-pen lithographic system is a nanoscale
structure having a tip with nanoscale dimensions. Fluids flowing over the surface of the tip are
deposited onto the surface of a substrate in the region where a meniscus is formed between the
tip and the surface. Reprinted with permission, from C. A. Mirkin, Programming the assembly
of two- and three-dimensional architectures with DNA and nanoscale inorganic building
C American Chemical Society.
blocks, Inorg. Chem., 39, 22582272 (2000).
of organic ligands rather than solid-state materials. Though it is unlikely that dip-pen
technologies can displace conventional solid-state fabrication methods, they are highly
complementary. Many interesting scientific and important practical issues pertaining to
miniaturization can be addressed as a result of having these soft lithography methods,
whereby molecules can be patterned in a controlled fashion on the sub-100-nm length
scale. For example, such a method can generate nanoscale molecule-based conducting structures and provide their contacting with macroscopically addressable electrodes
prepared via conventional microfabrication methods.
In inventing DPN, the idea was to miniaturize a 4000-year-old technology: the technology of the quill or dip-pen. The largest difference is that we wanted to do on a nanoscopic
scale what a quill pen can do on a macroscopic scale.
Considering the fundamentals of DPN, first we should mention that the simple idea
of transporting any ink through a nanoscale AFM tip to a surface via the well-known
capillary action does not work properly. Instead, the basic idea is to design a system with
an ink that would chemically react with a substrate of interest. The scheme in Fig. 5.41
represents the main DPN elements: the moving AFM tip is coated with ink molecules;
a water meniscus forms between the tip and the solid substrate; ink molecules react
with the substrate. Such an effect of reaction of a chemical with a surface is called
chemisorption. Thus chemisorption acts as a driving force for moving the molecules
from the tip to the substrate and then results in formation of stable one-molecule-thick
nanostructures. The key to making this approach successful is to select organic molecules
with low water solubilities so that the chemisorption driving force facilitates the control
of tip-to-substrate transport properties. This would prevent the uncontrolled, nonspecific
adsorption and accumulation of multilayers of molecules on the surface.
Figure 5.42 illustrates the results of chemisorption. The panel on the left shows the
coating painting of a 1-m2 area of the gold surface with octadecanethiol via the
use of DPN. The formation of a one-monolayer structure is proved by the more detailed
(a)
157
(b)
Figure 5.42 (a) The painting via DPN by octadecanethiol on gold. (b) A lattice-resolved image of
5.9
158
Substrate
Notes
Alkylthiols
Au
Ferrocenylthiols
Silazanes
Proteins
Conjugated polymers
Au
SiOx
GaAs
Au, SiOx
SiOx
DNA
Fluorescent dyes
Metal salts
Colloidal particles
Au, SiOx
SiOx
Si, Ge
SiOx
(a)
(b)
Figure 5.43 The nanoscale grid (left) and the dot array (right) generated via DPN. Reprinted with
permission, from C. A. Mirkin, Programming the assembly of two- and three-dimensional
architectures with DNA and nanoscale inorganic building blocks, Inorg. Chem., 39, 22582272
C American Chemical Society.
(2000).
were necessary in order to obtain superior electronic properties. The great achievements
and the advances brought about in mainstream electronics by these technologies can be
used for the exploration of other nanodevices commonly called nanoelectromechanical
systems (NEMSs). This class of devices includes nanomachines, novel sensors, and a
variety of new devices that function on the nanoscale.
Nanomechanical devices promise to revolutionize measurements of extremely small
displacements and extremely weak forces, particularly at the molecular scale. Indeed,
with surface and bulk nanomachining techniques, NEMSs can now be built with masses
approaching a few attograms (1 attogram = 1018 g) and with cross-sections of about
Electrical
input signal
Input
transducer
159
Electrical
output signal
Mechanical
stimulus
Mechanical
system
Mechanical
response
Output
transducer
Mechanical
perturbation
Electrical
control signal
Control
transducer
10 nm. The small mass and size of NEMSs give them a number of unique attributes that
offer immense potential for new applications and fundamental measurements. In general,
the potential applications of NEMSs are likely to be enormous and could benefit a diverse
range of fields, ranging from nanoelectronics to medicine and biotechnology. In this
section, we study the basic concepts of NEMSs, the technology for NEMS fabrication,
and the challenges arising in this field.
An electromechanical device can be thought of as a two-, three-, or, generally, multiterminal transducer that provides input stimuli (i.e., signal forces), and reads out a
mechanical response (i.e., output displacement). At additional control terminals, electric signals can be applied and subsequently converted by the control transducers into
varying forces to perturb the properties of the mechanical element in a controlled
and useful manner. The generic picture of a NEMS is shown in Fig. 5.44, where
the input, output, and control transducers/terminals are presented schematically. The
basic mechanical element of a NEMS is a nanosize suspended film, a membrane, or a
beam; in the following discussion, we will use the term beam. Easy flexural deformations of such mechanical elements provide high mechanical responsivity for a NEMS.
Electronic devices to which the beam is coupled are assumed to be of comparable
dimensions.
Let us consider the fabrication of the basic mechanical component of a NEMS. The
crystal and heterostructure growth, and processing techniques, which have been studied previously are used widely to produce suspended semiconductor structures. These
techniques can be applied to bulk silicon, epitaxial silicon, and systems based on IIIV
compounds and to other materials.
The procedure for fabricating a suspended structure is illustrated in Fig. 5.45. In
its simplest form, the procedure starts with a heterostructure that contains structural
160
(a)
(b)
(c)
(d)
Figure 5.45 A general scheme for the fabrication of NEMS suspended structures using structural
(gray) and sacrificial (dark) layers on a substrate (dark gray). (a) Three-layer base
heterostructure, (b) etching-mask deposition, (c) anisotropic etching, and (d) selective wet
etching of the sacrificial layer.
(gray) and sacrificial (dark) layers on a substrate (dark gray), as in Fig. 5.45(a). Masks
on the top of this heterostructure can be patterned by a combination of optical and
electron-beam lithography, followed by thin-film-deposition processes. The resulting
mask protects the material from beneath during the next stage; see Fig. 5.45(b). Unprotected material around the mask is then etched away using a plasma etching process
as in Fig. 5.45(c). Finally, a local chemically selective etching step removes the sacrificial layer from the specific regions to create a freely suspended nanostructure that
is both thermally and mechanically isolated; see Fig. 5.45(d). This procedure can be
repeated several times and combined with various deposition processes to produce
the necessary mechanical nanostructure for a particular device. The flexibility of the
process allows one to apply this general scheme to diverse materials and to fabricate fully suspended structures with lateral dimensions of approximately a few tens of
nanometers.
Consider, for example, the important case of silicon nanomachining by the use of
the so-called SIMOX (Separation by IMplantation of OXygen) process. The procedure
starts from a Si wafer, which is processed by a large dose of oxygen-ion implantation.
The implanted wafer is annealed at high temperature to form a SiO2 layer of 0.051 m.
Above the SiO2 layer formed, a single Si crystal layer is then grown. This top layer is 0.1
0.2 m thick. As the result, one obtains a Si/SiO2 /Si heterostructure that corresponds
161
Figure 5.46 Four different NEMSs fabricated by SIMOX method. From A. N. Cleland,
Foundations of Nanomechanics, Fig. 11.4 (Berlin, Springer-Verlag, 2003).
to the gray/dark/dark gray structure of Fig. 5.45. All stages are already illustrated in
Fig. 5.45: the top Si layer is patterned and an etching mask is created as in Fig. 5.45(b);
to the top Si layer anisotropic etching is applied as shown in Fig. 5.45(c); then, the oxide
is subjected to selective wet etching to create finally a suspended structure as depicted
in Fig. 5.45(d). The SIMOX process can be carried out with wafers of large area (4 to
6 inches in diameter) and facilitates the integration of a number of NEMSs and other
electronic devices on a chip. Figure 5.46 illustrates some suspended structures fabricated
by this method.
In conclusion, due to advanced technologies, fabrication of new nanoelectromechanical systems becomes possible. These systems have dimensions so small that their mechanical motion (vibrations) are coupled to the electrons much more strongly than in the case
of bulk-like, massive samples. A number of nanodevices can be built on the basis of
NEMSs. Chapter 8 provides additional discussion of these devices.
5.10
Closing remarks
In this chapter, we focussed on methods used for high-quality material growth and for
nanodevice fabrication. We studied the growth of perfect crystals and multilayered heterostructures. We found that there has been a considerable and persistent improvement
of traditional semiconductor technologies for material processing. We also found that
these advances facilitate the fabrication of nanostructures and nanodevices with excellent precision and high reproducibility, and with the necessary electrical, optical, and
mechanical properties.
Moreover, we analyzed novel approaches to producing nanostructures based on special
regimes of material growth (the StranskiKrastanow regime), in which nanostructures are
formed spontaneously due to the growth kinetics. The processes of self-organization and
self-ordering of nanostructures (quantum dots, quantum wires, etc.) give rise to a new and
unique way to produce high-density ensembles of nanodevices with desired properties.
Apart from the traditional methods of fabrication, those adapted for microelectronics
162
5.11 Problems
163
5.11
Problems
1. Describe the main differences between the Czochralski method of crystal growth
and epitaxial growth. Which of these approaches can be applied to grow multilayered
crystalline structures?
2. In the molecular-beam epitaxy method, the rate of crystal growth is characterized
by the flux density, J , of atoms, constituting the growing film. For binary crystals AB,
like GaAs, SiGe, etc., the fluxes of deposited components A and B are to be equal:
JA = JB = J/2. Let the crystal density AB be given. Using given J, AB , and masses of
atoms A and B, calculate the time necessary to grow a film of thickness d. Estimate numerically the growth time for GaAs film of thickness 100 nm at J = 1015 atoms cm2 s1 ;
(GaAs = 5.316 g cm3 ).
3. When a photolithographic method is applied, the effect of diffraction of light restricts
the minimal scale of an illuminated pattern necessary for further processing to fabricate
a nanostructure. Explain the advantages in the use of short-wavelength light sources in
nanolithography.
Assume that the minimal thickness of a light line dmin is related to the wavelength
of illumination as dmin /2. Calculate and compare the minimal scales of devices
fabricated by exploitation of three laser sources: a red HeNe laser ( = 0.63 m), a UV
KrF laser ( = 0.243 m), and a UV ArF laser ( = 0.19 m).
4. Apply the hydrogen model of Eq. (5.4) for energy levels of donors in GaAs and
InAs. Use the effective masses according to Table 4.5 and set the dielectric constants,
,
equal to 12.8 and 15.5, respectively. Calculate the ionization energies and estimate the
radius of ground donor states for these materials. Find the average number of primitive
cells covered by a single donor electron.
5. Explain the role of lattice mismatch for regimes of self-organizing growth of nanostructures. For which of the following heterostructures is the formation of nano-islands
respectively),
possible: GaAs/AlAs (the lattice constants are a0 = 5.64 and 5.66 A,
and Si/Ge (a0 = 5.43 and 5.65 A).
164
6.1
Introduction
In previous chapters we studied advances in materials growth and nanostructure fabrication. In the case of electrons, we paid primary attention to the quantization of their
energy in nanostructures. In fact, electronics relies upon electric signals, i.e., it deals with
measurements of the electric current and voltage. Controlling and processing electric
signals are the major functions of electronic devices. Correspondingly, our next task will
be the study of transport of charge carriers, which are responsible for electric currents
through nanostructures.
The possible transport regimes of the electrons are dependent on many parameters and
factors. Some important aspects of these regimes can be elucidated by comparing the
time and length scales of the carriers with device dimensions and device temporal phenomena related to operating frequencies. Such an analysis is carried out in Section 6.2.
In Sections 6.3 and 6.4 we discuss the role of electron statistics in transport effects.
Then, we consider the behavior of the electrons in high electric field, including so-called
hot-electron effects. Analyzing very short devices, we describe dissipative transport and
the velocity-overshoot effect. Finally, we consider semiclassical ballistic motion of the
electrons and present ideas on quantum transport in nanoscale devices in Section 6.5.
6.2
Electron transport
Electron wavelength, l ()
166
T = 300 K
2
3
4
(T = 300 K). Points 1 through 4 correspond to InSb, GaAs, GaN, and SiC, respectively.
=
=
,
(6.1)
=
0
p
m
2m 0 E
2m E
where E is the electron energy and m 0 is the mass of the electron in vacuum. In Fig. 6.1
the value is shown as a function of m /m 0 . Points 14 on the curve indicate wavelengths
for electrons in InSb, GaAs, GaN, and SiC, respectively. We have used effective masses
m /m 0 equal to 0.014, 0.067, 0.172, and 0.41, respectively, for these materials and we
have assumed that the electron energy is E = kB T . Here T = 300 K is the ambient
temperature and kB is Boltzmanns constant. We see that the de Broglie wavelength of
an electron in typical semiconductors with m in the range (0.011)m 0 is of the order
i.e., it is really much larger than the lattice constants for the materials
of 73073 A;
presented in Table 4.8. As the temperature decreases to 3 K, the de Broglie wavelength
increases by one order of magnitude. Thus the wavelength becomes comparable to the
sizes of semiconductor structures and devices fabricated by modern nanofabrication
technology.
167
Contacts
Lx
Ly
Lz
(a)
(b)
Figure 6.2 (a) Geometrical sizes of a semiconductor sample (L z < L y < L x ) and (b) a sample
with contacts; the electron transport occurs along the x-direction.
the system is free of randomness and other scattering mechanisms are sufficiently weak,
the electron motion is quasiballistic and the only length with which the geometrical sizes
need be compared is the electron de Broglie wavelength . Since only an integer number
of half-waves of the electrons can fit into any finite system, instead of a continuous energy
spectrum and a continuous number of the electron states, one obtains a set of discrete electron states and energy levels, each of which is characterized by the corresponding number
of half-wavelengths. This is frequently referred to as quantization of electron motion.
Depending on the dimensions of the system, one can distinguish the following cases.
(a) The three-dimensional or bulk-like case, when the electron spectrum quantization is
not important at all,
Lx , L y, Lz,
(6.2)
and an electron behaves like a free particle characterized by the effective mass m .
(b) The two-dimensional or quantum-well case, when the quantization of the electron
motion occurs in one direction while in the other two directions electron motion is
free:
Lz L y, Lx.
(6.3)
Such a case was discussed in Chapter 3 for the example of a potential energy dependent on a single coordinate. The electron energy for this case is given in the form of
two-dimensional subbands, as described by Eq. (3.49).
(c) The one-dimensional or quantum-wire case, when the quantization occurs in two
directions, so that the electron moves freely only in one direction along the wire:
Lz
Ly
Lx.
(6.4)
Such a case was discussed in Chapter 3 for the example of a potential energy dependent on two coordinates. The electron energy for this case is given in the form of
one-dimensional subbands, as in Eq. (3.50).
168
Electron transport
Ly
Lz
(6.5)
The simplified models for this case also were analyzed in Chapter 3. The energy
spectrum is discrete.
The last three cases also illustrate the quantum size effects in one, two, and three
dimensions, respectively. If at least one geometrical size of a device is comparable to the
electron wavelength, a quantum-mechanical treatment of the problem is strictly required.
Let us analyze the conditions when and the reasons why carriers lose their wavelike behavior so that they can be considered as classical particles. There are two major
reasons. The first is non-ideality of the system, which leads to electron scattering. The
second is related to finite temperature and electron statistics.
Electrons in solid-state devices are subjected to scattering by crystal imperfections,
impurities, lattice vibrations, interface roughness, etc. These scattering processes are
divided into two groups: elastic and inelastic. In classical physics, an elastic collision
leads to a change only in the particle momentum (wavevector), whereas in an inelastic
collision both the momentum and the energy change. An essential property of an elastic
collision is that it does not destroy the phase of the electron. Indeed, after an elastic
scattering the energy remains unchanged and the electron wavefunction (r, t) consists
of different components, which are of type eit eikj r . All components have the same timedependent phase eit . Thus, the spatial distribution of the electron density |(r, t)|2 =
|(r)|2 remains independent of time. In other words, elastic scattering does not destroy
the coherence of electron motion. The same is true for the case of two or more impurities:
the spatial wave pattern is generally complex, but it remains coherent.
Using the semiclassical language, if e refers to the mean time between two elastic
scattering events, we can define the mean free path of the electrons between elastic
scattering events as le = ve , where v is the average electron velocity. Therefore, even
for distances exceeding le the wave-like properties of electrons are coherent.
Inelastic scattering leads to a new result. This scattering produces electron waves
with different energies and the resulting wavefunction has a complex dependence on
both position and time; the beating of different wave components in time washes out
the coherence effects. Let E be the mean time between two inelastic collisions. The
distance the electron propagates between these collisions is frequently called the inelastic
scattering length, L E . The electron preserves its quantum coherence for distances less
than L E and it loses coherence for larger distances. Generally, L E > unless extremely
nonequilibrium conditions exist. Often L E far exceeds the mean free path, le . In this case,
the electron undergoes many elastic collisions before losing its energy. This process is
known as diffusion and its displacement during E is known to be
L E = DE (E e ),
(6.6)
where the diffusion coefficient D is given by D = v 2 e /, with = 3 for a threedimensional electron gas, = 2 for a two-dimensional electron gas, and = 1 for a
169
min{L E , L T }.
(6.7)
(6.8)
170
Electron transport
(6.9)
(6.10)
(6.11)
electron transport is described in terms of the classical ballistic regime, which means
that electrons can move through the device along classical trajectories without collisions.
If the dimension L x is greater than the mean free path,
L x le ,
(6.12)
171
Quantum regime
Mesoscopic regime
172
Electron transport
Low frequencies
If e 1 the electron undergoes many scattering events during one period of the
external field. Multiple scattering during the period brings the electron into a quasistationary state, which follows the oscillations of the external field. In other words, the
electron momentum oscillates in phase with the field.
In closing this section we conclude that depending on the device dimensions, the
temperature, and other conditions there is a variety of transport regimes. Each of these
regimes demonstrates peculiar properties and requires a physical description suited to
the relevant conditions.
6.3
6.3 Statistics
173
The rules and principles according to which particles occupy the energy states in manyparticle systems constitute the so-called physical statistics. In the physical statistics, to
describe the occupation of states by particles, one uses a distribution function of particles. Under equilibrium, the distribution over the energy levels determines entirely
the properties of many-particle systems. The distribution function has the meaning
of the probability of finding particles with a given energy, E. Let El be the energy
level of particles in a many-particle system, the index l numbers the energy levels.
Then, the distribution function can be thought of as a function of the energy, F(El ).
Obviously,
F(El ) = N ,
(6.13)
l
where N is the total number of the particles. It turns out that the statistical principles
in classical and quantum physics are different. The quantum physics brings statistical
features that are absent in the classical description. These features are associated with the
fact that elementary particles, including electrons, are identical and it is impossible, in
principle, to specify their coordinates and trace a given electron. In addition, an internal
characteristic of a particle, the spin, plays a very important role in many-particle physics.
Although a definition of spin is absent in classical physics, we start with a brief review
of classical statistics.
Classical statistics
In classical physics, under equilibrium the distribution function has an exponential form.
This is the so-called Boltzmann distribution
FB (E) = CeE/(kB T ) ,
(6.14)
with C and T being the normalization constant and the ambient temperature, respectively. The normalization constant can be found from Eq. (6.13). In general, statistics in classical physics does not restrict the number of particles occupying an energy
level E.
As an example of the application of the Boltzmann distribution, consider the distribution of free particles with mass m over the velocities. First, we suppose that the system
under consideration is uniform, i.e., external forces are absent. Then, the energy of these
particles coincides with the kinetic energy E = Ekin = mv2 /2, where v is the particle
velocity. For three-dimensional particles, we have v = {vx , v y , vz } and each of the velocity components can vary in the interval [, +]. Thus, by rewriting Eq. (6.14) we
can introduce the distribution function as
Fv (v) = Cv emv
/(2kB T )
(6.15)
Since the velocity is a continuous value, the meaning of the distribution Fv is the following. If we define an infinitesimally small volume d3 v dvx dv y dvz around a given
velocity v in the velocity space, then the number of particles whose velocity is inside
this volume is Fv (v)d3 v. That is, actually, Fv is the density of the distribution over the
174
Electron transport
velocity. To find the normalization constant, we can integrate Fv over all possible v, and
then we obtain the total number of the particles N . As a result, the distribution function,
Fv (v), can be written in the form
3/2
3/2
m
m
2
2
2
mv2 /(2kB T )
Fv (v) = N
e
=N
em (vx +v y +vz )/(2kB T ) .
2kB T
2 kB T
(6.16)
This is the so-called Maxwellian distribution of the particles. By using the Maxwellian
distribution, we can calculate average characteristics of the so-called ideal gas of particles. For example, the average energy per particle is
1
mv2
3
E=
(6.17)
d3 v
Fv (v) = kB T,
N
2
2
the average velocity at equilibrium is v = 0, etc.
6.3 Statistics
(a)
(b)
Spin degeneracy
Spin splitting
175
Figure 6.3 Occupation of energy levels by the electrons: (a) spin-degenerate levels and
FF (E)
T1 = 0
1
1/2
0
T4
T3
EF
Figure 6.4 How the Fermi distribution function varies with crystal temperature.
It is clear that the Pauli exclusion principle leads to a new, non-classical statistics of
electrons. Such statistics is called the Fermi statistics. Under equilibrium the occupation
of the energy levels is described by the Fermi distribution function:
1
FF (El,s ) =
,
(6.18)
(E
1 + e l,s E F )/(kB T )
where T is the temperature of the system, El,s is the energy of the quantum state characterized by the set of quantum numbers l and s, and E F is the so-called Fermi energy or
Fermi level. The evolution of the Fermi function FF (El,s ) with the temperature is shown
in Fig. 6.4, where we use the temperature parameters T4 > T3 > T2 > T1 with T1 = 0.
Importantly, the Fermi energy can be related to the total number of electrons through
the normalization condition of Eq. (6.13). Taking into account explicitly the summation
over the spin, we obtain
FF (E l,s ) = N ,
(6.19)
l,s
which gives E F = E F (N , T ).
176
Electron transport
From Eq. (6.18), one can see that in accordance with the Pauli principle the occupation
of any energy state, {l, s}, defined by Eq. (6.18), is always less than or equal to 1.
At high temperatures the second term in the denominator of Eq. (6.18) is substantially
larger than unity and the Fermi distribution is close to the Boltzmann distribution:
FF (E) e(EF E)/(kB T ) .
(6.20)
Equations (6.20) and (6.14) are the same when the normalization constant C is equal to
EF
.
(6.21)
C = exp
kB T
The corresponding curve is shown schematically in Fig. 6.4 at T = T4 .
In the limit of low temperatures, T 0, the function FF transforms into a step
function:
1, E F > E,
FF (E) =
(6.22)
0, E F < E,
i.e., FF (E) = 1 for the energy levels below the Fermi energy E F since all levels with
E < E F are occupied and FF (E) = 0 for energies above E F since these levels are empty.
In this limit, the electron system is frequently referred to as a highly degenerate electron
gas.
Now we can apply Fermi statistics to the electrons in a conduction band. Let n be
the concentration of electrons in the conduction band with energy dispersion E(k). We
accept that the energy spectrum is independent of the spin. Thus, the set of the quantum
numbers, l, is identical to the set of the electron wavevectors, k. According to the Fermi
distribution, the probability of finding an electron with the wavevector k is
1
,
(6.23)
FF (E(k)) = 2
(E(k)E
F )/(k B T )
1+e
where the factor 2 comes from the spin degeneracy. The Fermi energy, E F , and the
electron concentration, n, are related through the following equation:
1
2
N
,
(6.24)
=
n=
V
V k 1 + e(E(k)E F )/(kB T )
with V being the volume of the crystal. The summation in the latter formula can be
converted to an integration. Indeed, according to the analysis given in Section 4.4 and
Eq. (4.10), the electron wavevector takes the following values: k x L x = 2l1 , k y L y =
2l2 , and kz L z = 2l3 , with l1 , l2 , and l3 being integers. Here we introduced the crystal
dimensions L x , L y , and L z . (The crystal dimensions are related to the basis vectors of the
lattice ai and the number of primitive cells Ni as L x = ax N x , etc.) Thus, summation over
k is equivalent to summation
over li . The latter can be approximately calculated via the
&
integral: l1 ,l2 ,l3 (. . .)
dl1 dl2 dl3 (. . .). Since the distribution function depends
on E(k), it is convenient to express the latter integral in terms of integration over k. We
can use the relationships
!l1 =
Lx
!k x ,
2
!l2 =
Ly
!k y ,
2
!l3 =
Lz
!k z .
2
6.3 Statistics
177
(6.25)
V kkF
V
(2 )3
|k|kF
Evaluation of this integral gives the volume of a sphere of radius kF , i.e., 4 kF3 /3. Then, we
obtain a relationship between the Fermi wavevector kF and the electron concentration n:
kF = (3 2 n)1/3 . Finally, the Fermi energy of the degenerate electrons in a bulk crystal is
-h2 n 2/3
, at T 0.
(6.26)
E F = (3 2 )2/3
2m
In this case, E F increases as the 2/3 power of the electron concentration n. Because the
Fermi function contains the exponential factor, the low-temperature limit corresponds to
the condition E F kB T . In metals and heavily doped semiconductors, the electron gas
remains degenerate up to room temperature. For example, in the case of a GaAs crystal
with the effective electron mass M = 0.067m 0 , where m 0 is the mass of the free electron,
at the concentration n = 1017 cm3 , we find kF = 1.43 106 cm1 and E F = 11.6 meV.
This energy corresponds to a temperature of 135 K. Thus, at T < 135 K the electron gas
concentration n = 1017 cm3 in GaAs can be considered as degenerate and it is possible
to use the above estimates for the Fermi energy, EF , and the Fermi wavevector, kF .
The degenerate electron gas is an interesting and very important physical system. This
limiting case facilitates understanding a number of complex phenomena in a simple way.
Indeed, as emphasized previously, in a degenerate electron gas all states below E F are
occupied. Let us imagine that a small external perturbation is applied to such a manyelectron system. The perturbation first of all will cause a redistribution of electrons
between the energy states. However, below the Fermi level all states are completely filled
and no redistribution is possible. Instead, only those electrons that are at the Fermi level,
i.e., that have energy just equal to E F , can be affected by the perturbation. This results in
the fact that only a small portion of the electrons can participate in the crystals response
to the perturbation. One can say that these active electrons are on the so-called Fermi
surface in k space and the size of the Fermi surface determines the basic properties
Electron transport
kz
(a)
kF
ky
0
kx
kx
kF
(b)
ky
(c)
kF
kF
}
}
178
kF
+kF
kx
Figure 6.5 The Fermi surfaces for (a) three-, (b) two-, and (c) one-dimensional electron gases.
of the degenerate electron gas. For bulk-like crystals with the simple energy spectrum
considered above, the Fermi surface is just a sphere of radius kF . The Fermi surface for
a three-dimensional electron gas is presented in Fig. 6.5(a). Using Eq. (2.4) it is easy to
show that an electron on the Fermi surface has the velocity vF = -hkF /m .
We can consider low-dimensional electron systems quite similarly. As discussed in
Section 4.5, by using heterostructures it is possible to fabricate artificial potential wells,
which confine the electrons from the conduction band in such narrow layers that electron
motion across the layers becomes quantized. This results in electron energies in the form
of low-dimensional subbands given by Eq. (3.49) for the so-called quantum wells:
El3 (k|| ) = l3 +
-h2 k 2
||
2m
(6.27)
where l3 (l3 = 1, 2, . . . ) and k|| , the two-dimensional wavevector, determine the electron
motion in the plane of the layer. The Fermi distribution function in the form of Eq. (6.18)
can be used to calculate the two-dimensional electron concentration (sheet concentration
of electrons), n 2D :
n 2D = n d =
1
2
,
S l3 ,k|| 1 + e(El3 (k|| )EF )/(kB T )
(6.28)
where we have introduced the sheet concentration of electrons, n 2D (the number of electrons per unit area), the thickness d, and the area S of the confined layer; obviously
V = d S. Actually, exactly as found for bulk crystal, Eq. (6.28) establishes the relationship between the sheet concentration, n 2D , and the Fermi energy, E F .
6.3 Statistics
179
The sheet concentration of the electrons can be explicitly calculated in the limit of
a degenerate electron gas. Let us suppose that the temperature is low and that only the
lowest subband is populated by the electrons. Then, in the sum over l3 in Eq. (6.28) we
should keep only one term with l3 = 1:
1
2
n 2D =
.
(6.29)
S k|| 1 + e(E 1 (k|| )EF )/(kB T )
Calculation of the right-hand side of this equation can be simplified by replacing the
summation over k|| by integration, similarly to that of Eq. (6.25):
S
(. . .) =
(6.30)
dk x dk y (. . .),
(2 )2
k||
where k x and k y are the components of the two-dimensional vector k|| . In the limit of
T 0, the sheet concentration n 2D can be calculated by defining a two-dimensional
Fermi wavevector k||,F via the relationship
-h2 k 2
||,F
.
(6.31)
EF =
2m
The Fermi surface for two-dimensional carriers is a disk of radius k||,F in k|| space,
as shown in Fig. 6.5(b). On performing the integration of Eq. (6.30) over the disk, we
find
-h2
and
E F = n 2D , at T 0.
(6.32)
k||,F = (2n 2D )1/2
m
That is, the Fermi energy, E F , increases as the first power of the electron sheet concentration, n 2D .
For a nanostructure in which the electron motion is restricted in two directions, i.e.,
for the quantum wire, the energy spectrum is given by Eq. (3.50):
-h2 k 2
x
,
(6.33)
El2 ,l3 (k x ) = l2 ,l3 +
2m
where it is assumed that the x direction is the only direction of free motion. By applying
a procedure similar to that used above, we may see that for low temperatures the Fermi
surface shrinks to two points in the one-dimensional k space: kx = kF with kF =
1
n 1D . Here, n 1D is the linear electron concentration, which can be defined by the
2
formula n 1D = N /L, with N being the total number of electrons in the wire and L
being the wire length. Subsequent evolution of the Fermi surface of the electrons as
the dimensionality is lowered is illustrated in Figs. 6.5(a)(c). The Fermi energy of
one-dimensional electrons is given by
2 -h2 2
n , for T 0.
(6.34)
EF =
8m 1D
On comparing the Fermi energies obtained for different dimensionalities of the electron
gas, we can conclude that lowering the dimensionality of the gas gives rise to a more
rapid increase of E F with increasing concentration for low-dimensional systems.
180
Electron transport
The concept of Fermi statistics is one of the fundamental ideas of modern solid-state
physics and extremely important for nanoelectronics. The Fermi statistics is applied
widely in the description of nanoelectronic devices.
6.4
(6.35)
where E is the energy associated with the quantum state . At this point, it is useful to
introduce Diracs famous -function:
0,
for x = 0,
(x) =
(6.37)
, for x 0,
though
(x)dx = 1.
(6.38)
Diracs -function is used when performing an integration. The main rule of such an
integration is the following:
b
dx (x x 0 )(x) = (x0 ), if a < x 0 < b,
(6.39)
a
(E)
(E)
(a)
0
(c)
(E)
e11
ea
(b)
0 e1 e2
e3
(d)
(E)
eb
181
e111
ea eb ec
Figure 6.6 The density of states of electrons, (E), in systems of different dimensionalities: (a) a
bulk crystal, (b) a quantum well, (c) a quantum wire, and (d) a quantum dot. Here 1 , 11 , and
111 are the ground states in a quantum well, quantum wire, and quantum dot, respectively; 2
and 3 are higher states in a quantum well and a , b , and c are higher states in a quantum wire
and in a quantum dot.
Because E(k) actually depends on the modulus k, spherical coordinates may be used.
Integration over two angles gives 4 and the triple integral is reduced to the single
integral
2V
3D (E) =
4
dk k 2 (E E(k)).
(6.41)
(2)3
V 2m 3/2
3D (E) =
dE E(E E).
(6.42)
2
2
2
h
0
Finally, by using the rule of Eq. (6.39), we obtain
3/2
m
V
2E.
3D (E) = - 2
h
2
(6.43)
The obtained density of states for three-dimensional electrons is depicted in Fig. 6.6(a).
For electrons in a quantum well with the energy spectrum given by Eq. (6.27), the
set of quantum numbers includes a spin quantum number, s, a quantum number, l3 ,
characterizing the transverse quantization of the electron states, and a continuous twodimensional vector, k|| . Hence, {s, l3 , k|| }. There is a two-fold spin degeneracy of
each state, (s = 12 ), so that
-h2 k 2 + k 2
x
y
2D (E) = 2
E l3
.
(6.44)
2m
l3 ,k x ,k y
In order to calculate the sum over kx and k y , we can apply the replacement, Eq. (6.30),
with S being the area of the surface of the quantum well, S = L x L y , where L x and
182
Electron transport
L y are the sizes of the quantum well in the x and y directions, respectively. Calculations
of the integrals give us
m S
m S
dE (E l3 E) = - 2
(E l3 ),
2D (E) = - 2
h
h
0
l3
l3
(6.45)
where (x) is the Heaviside step-function:
1, for x > 0,
(x) =
0, for x < 0.
(6.46)
Very often the density of states per unit area, 2D /S, is used to eliminate the size of
a sample. Each term in the sum of Eq. (6.45) corresponds to the contribution from one
subband. The contributions of all subbands are equal and independent of energy. As
a result, the density of states of two-dimensional electrons exhibits a staircase-shaped
energy dependence with each step being associated with one of the energy states, l3 .
The height of each step is universal and depends only on the effective electron mass.
Figure 6.6(b) depicts the two-dimensional density of states. By comparing the densities
of states for the electrons in bulk crystals per unit volume and those of quantum wells
per unit surface we may see that the differences between the two- and three-dimensional
cases are most pronounced in the energy regions of the lowest subbands. For large l3
the staircase function lies very close to the bulk curve 3D (E) and coincides with it
asymptotically.
Similarly, we can find the density of states of a one-dimensional electron gas with the
energy spectrum given by Eq. (6.33). The result of calculations is
l2 ,l3 (E),
1D (E) =
l2 ,l3
where
l2 ,l3 (E) =
2m
1
-h2
E (E l2 ,l3 ).
l2 ,l3
(6.47)
Here, L is the length of the wire. Schematically, 1D (E) for one-dimensional electrons is
shown in Fig. 6.6(c). The characteristic feature of the one-dimensional density of states
is its divergence near the bottom of each of the one-dimensional subbands. The density
of states then decreases as the kinetic energy increases. This behavior is very remarkable
because it leads to a whole class of new electrical and optical effects peculiar to quantum
wires.
Now, we can consider the ultimate case of the density of states for zero-dimensional
electrons, i.e., for electrons in quantum dots. According to the definition of Eq. (6.36),
in the case of quantum dots or boxes the spectra are discrete. Thus, the density of states
is simply a set of -shaped peaks, as depicted in Fig. 6.6(d). For an idealized system, the
peaks are very narrow and infinitely high, as illustrated in Fig. 6.6(d). In fact, interactions
between electrons and impurities as well as collisions with lattice vibrations bring about
a broadening of the discrete levels and, as a result, the peaks for physically realizable
systems have finite amplitudes and widths. Nevertheless, the major trend of sharpening
183
6.5
m
dv
= v eF,
dt
e
(6.48)
where e can be interpreted as the momentum relaxation, or free-flight time, which was
discussed in Section 6.2. Obviously, the electron motion described by the above equation
is dissipative in nature.
For the stationary state, dv/dt = 0, we obtain
ee
v = F = F,
(6.49)
m
where we have introduced the parameter , called the electron mobility. The mobility
is one of the basic characteristics of electron transport in the case of low electric fields.
The definition of ,
ee
= ,
(6.50)
m
184
Electron transport
Figure 6.7 Chaotic diffusion motion of an electron at equilibrium (a) and when an electric field F
is applied (b).
shows that the mobility is larger for materials with small effective mass m and with
suppressed scattering, i.e., for large e . The negative sign in Eq. (6.49) reflects the fact
that the electrons move in the direction opposite to the electric field, because of their
negative charge.
Having the electron average (drift) velocity, v, and the electron concentration, n, we
can calculate the electric current density as
J = evn = enF = F.
(6.51)
= en
(6.52)
Here,
is the so-called specific conductivity, and, as we can see, it depends on both the electron
concentration, n, and the electron mobility, . The result given by Eq. (6.51) is known
as Ohms law. If the specific conductivity and the geometrical dimensions of a sample
are known, we can readily calculate the total electric current:
0
,
(6.53)
I = J S = SF, |I| =
R
where we introduce the cross-section of the sample S, the voltage drop across the sample
0 = L x F, and the sample length L x . The resistance R is
Lx
.
(6.54)
R=
S
Equations (6.49)(6.51) are valid for systems of any dimensionality, where dissipative transport occurs. These include three-dimensional, two-dimensional, and onedimensional systems. For all of them, the electron mobility is a representative characteristic of the classical transport regime.
185
To define measurement units of the mobility, we must first discuss briefly the units of
measurement of several entities introduced in the preceding discussion. (1) The charge
is measured in units of coulombs, C. The elementary charge of the electron, e, is equal
to 1.6 1019 C. (2) The electron concentration is measured in m3 , m2 , and m1 for
three-dimensional, two-dimensional, and one-dimensional systems, respectively. (3) The
electric field is measured in units of volts per meter, V m1 . (4) Accordingly, the units
of mobility are m2 V1 s1 . (5) The current is measured in amperes, A, i.e., coulombs
per second, C s1 . (6) For the current density and conductivity of Eq. (6.51), we obtain
different units for electron systems with different dimensionalities. For example, in a
bulk crystal the current density is measured in amperes per unit cross-section, A m2 ,
and the units of conductivity are 1 1 m1 with being the ohm the unit of electrical
resistance.
Returning to the discussion of mobility, we shall stress that the magnitude of the
mobility depends on the particular material and scattering mechanisms in this material.
Typically, the scattering mechanisms in semiconductors include scattering by impurities, imperfections, and lattice vibrations. If the first two scattering mechanisms can
be avoided in pure and high-quality samples, lattice or, as one frequently says, phonon
scattering can not be avoided in principle. The total scattering rate is a sum of rates of
scattering due to particular mechanisms. Each of the rates is proportional to a scattering
probability and inversely proportional to the mean free time. This leads to the conclusion that also the mobilities due to two or more scattering mechanisms should be added
inversely:
1
1
1
+
+ ,
=
ph
im
(6.55)
dn
eD r n,
dr
(6.56)
186
Electron transport
Figure 6.8 The temperature dependence of the electron mobility in Si for a system with two
where D is the diffusion coefficient, that was discussed in Section 6.2. Thus, in a nonuniform conductor, the total current is composed of both drift and diffusive contributions:
J = eFn + eD r n.
(6.57)
e
Thus, having the electron mobility , one can easily calculate the diffusion coefficient,
D.
The results just discussed are relevant to steady-state electron transport. However,
the Newton equation (6.48) can describe the behavior of electrons in an arbitrary timedependent electric field, F(t). Since any dependence, F(t), can be represented by using
a Fourier transform, without loss of generality we can analyze the case of a harmonic
dependence of the field:
F(t) = F cos(t),
(6.59)
187
where F is the field magnitude and is the frequency of field oscillations. For further
analysis, we use the well-known formula cos(t) = Re[eit ]. Here Re[. . .] means
calculation of the real part of an expression in the brackets. Thus, the physical field is
F(t) = Re[F eit ].
It is worth noting that calculations with exponential functions like eit are always
simpler than those with sine and cosine functions. For this reason, the following approach
is commonly used. Instead of the field in the form of Eq. (6.59) in the Newton equation,
one uses the complex representation of the field:
= F eit .
F(t)
When a solution to Eq. (6.48) with the complex field is found, the real part of this solution,
that has a physical meaning, can be calculated easily.
To apply this approach, we look for the solution of Eq. (6.48) with a complex field of
the exponential form v eit . On substituting this form into Eq. (6.48), we immediately
obtain
e
e
v =
F .
(6.60)
m 1 ie
In this case, the electron velocity should be calculated as v(t) = Re[v eit ]; i.e.,
cos (t)
e sin (t)
ee
.
(6.61)
+
v(t) = F
m
1 + 2 e2
1 + 2 e2
Thus, the cosine electric field of Eq. (6.59) induces electron motion, with both cosine
and sine contributions. It is instructive to rewrite Eq. (6.61) as
v(t) =
ee cos(t )
F
,
m
1 + 2 e2
(6.62)
where we introduce the phase shift of the electron velocity with respect to the phase
of the field (6.59). This phase shift can be found from the equation tan = e . Here,
> 0, which implies a delay of the electrons with respect to the field changes. Obviously,
the delay exists only for non-zero frequencies and arises because of the friction force
in the Newton equation. The second conclusion, which follows from Eq. (6.62), is that
the magnitude of velocity oscillations decreases when the field frequency increases. In
the limit of very high frequencies e 1, the alternating electron velocity vanishes.
Having the electron velocity, v(t), and concentration, n, we can calculate the timedependent electric current similarly to Eqs. (6.51) and (6.52). However, more common
is the use of the complex representation of the current density:
= ()F eit,
J(t)
where we introduce the complex conductivity:
e
1
e2 e n
+i
.
() =
m 1 + 2 e2
1 + 2 e2
(6.63)
(6.64)
This conductivity reduces to its steady-state value in the limit e 0, and, in this
limit, Im[ ] 0. At finite frequencies, both contributions, Re[ ] and Im[ ], are
188
Electron transport
1.0
0.8
0.6
s(wte)
s(0) 0.4
0.2
0.0
0
2
1
wte
9 10
Figure 6.9 Dependences of the real (curve 1) and imaginary (curve 2) parts of the complex
I =
,
Z ()
Z () =
Lx
.
()S
(6.65)
Here, in the second relation, we omit the vector designation; L x and S are the intercontact
distance and the cross-section of the sample, respectively. By comparing these results
with those of the steady-state case given by Eqs. (6.53) and (6.54), one may see that,
instead of the resistance, R, a frequency-dependent parameter, Z (), is introduced. It
is called the impedance. The impedance is a complex function that characterizes the
electrical properties of the whole sample. The impedance can be introduced for any
nonhomogeneous sample and device.
(a)
n+
(b)
V(x)
189
n+
EF
x
(c)
V(x)
virtual cathode
Figure 6.10 (a) The structure schematic of an n+ in+ diode. Potential energy profiles, V (x),
in a diode with charge-limited dissipative transport: (b) for an unbiased diode and (c) for a
biased diode.
the electric resistance and the current, according to Eq. (6.53). In short samples, another
electrical effect can contribute essentially toward the electron transport and the current.
The effect arises due to nonuniform redistribution of the electrons and, thus, of the
electric charge across the short sample. The charge induced by the current affects the
potential distribution along the sample, making it strongly nonuniform and dependent
on the current. As a result, the dependence of the total current on the applied voltage
is no longer linear. The electron transport in this regime is called space-charge-limited
transport. Since in this text we are interested particularly in very short structures, it is
instructive to review briefly the space-charge-limited transport under the condition of
Eq. (6.2), when we can introduce mobility.
Let us consider a short sample with two contacts as depicted in Fig. 6.2(b). The twocontact device is called the diode and the body of the sample between the contacts is
called the base of the diode. Assume for simplicity that the base is not doped, i.e., there
are no conducting electrons in the base. The contacts can be fabricated by heavy n-type
doping of the contact regions; they are called n+ regions. In such a case, this device is an
n+ i (insulator)n+ diode as shown in Fig. 6.10(a). Let designate the electrostatic
potential. Then, the potential energy of the electrons is V = e. In Fig. 6.10(b), a
sketch of the potential energy of an unbiased diode is presented: the electrons in the
contact regions on the left and right are separated by a high potential barrier in the base.
If an electric voltage 0 is applied, the potential relief changes, as shown in Fig. 6.10(c).
The potential energy, V (x) = e(x), decreases and some of the electrons can overcome
the potential barrier, penetrate from the emitter electrode (the cathode) into the diode
base, and contribute to the current. This bias-induced effect is referred to as electron
190
Electron transport
injection from the electrode to the base. Obviously, with increasing bias, the maximum
of the potential shifts toward the cathode and becomes lower, and the injection current
increases. This physical picture can be described by the following simple model, which
is valid at large electric biases. The current density, J , can be written via the mobility
characteristic for the undoped base, , the injected electron concentration, n(x), and the
electrostatic field, F(x) = d/dx, as in Eq. (6.51):
J = en(x)F(x).
(6.66)
Importantly, for the chosen configuration, see Fig. 6.10(a), the cathode is on the left and
the anode is on the right. The potential, , increases with x, and the electric field, F,
and current density, J , are both negative. As a result of the continuity condition, in the
stationary state the current density through the diode is conserved: J = J0 = constant,
where J0 is the absolute value of the current density. From the latter relationship, we can
determine the concentration of the injected electrons, n(x):
n(x) = J0 /(eF(x)).
(6.67)
(6.68)
where
is the dielectric constant of the base material and
0 is the permittivity of free
space. After substitution of n(x) from Eq. (6.67), Eq. (6.68) can be rewritten in terms of
the field, F:
F
J0
dF
=
dx
or
F dF =
J0
dx.
(6.69)
This equation should be supplemented by a boundary condition for the field. We can use
the fact that at the maximum of the potential barrier:
d
= F = 0.
dx
(6.70)
At a large electric bias, this maximum is shifted close to the cathode and we can set
F(x = 0) 0. This approach neglects the processes which occur in a very narrow
region between the real cathode and the potential maximum. It is called the virtualcathode approximation and it is used widely for such a simplified analysis. Integration
of Eq. (6.69) leads to the solution
1 2
2J0 1/2 1/2
J0
F (x) =
x
or
F(x) =
x ,
(6.71)
2
where the negative value of the square root is chosen for the solution in accordance
with the above-discussed sign of the electric field. Thus, the distribution of the potential,
(x), is given by
x
8J0 1/2 3/2
dx F(x) =
x .
(6.72)
(x) =
9
0
191
From the total voltage drop (L x ) = 0 , we can find the relationship between the
current density, J0 , and the bias, 0 , i.e., the currentvoltage characteristic:
J0 J0,d =
9
0
20
.
8 L 3x
(6.73)
This is the so-called MottGurney law for the diode with dissipative electron transport. We have found that the currentvoltage characteristic becomes strongly nonlinear
because of the space-charge effect. The space-charge effect can be characterized, particularly, by the average concentration of the injected electrons, n :
Lx
1
3
0
0
n =
n(x)dx =
,
(6.74)
Lx 0
2 eL 2x
which increases with the applied voltage.
Equation (6.49) and F(x) facilitate the calculation of the average time of electron
transit through the diode, ttr,d :
Lx
Lx
1
1
4
dx
dx
(6.75)
=
= t0,d ,
ttr,d =
v(x)
F(x)
3
0
0
where
t0,d =
L 2x
.
0
(6.76)
Here, t0,d is the transit time of the electrons drifting in the average electric field, 0 /L x ,
i.e., when the space-charge effects are neglected. As follows from Eq. (6.75), these effects
increase the transit time by the factor 4/3.
In general, non-stationary electrical properties of a device are characterized by the
current induced in response to a time-dependent external voltage bias. If the bias is of a
frequency , the current response is given by the impedance according to Eq. (6.65). For
a MottGurney diode biased with a steady-state voltage, 0 , the complex impedance
can be calculated exactly:
2
6Rd
1 + cos ,
(6.77)
Z () = 3 ( sin ) + i
2
where
Rd =
d0
4L 3x
=
dJ0
9
0
S0
is the differential resistance of the diode at steady state calculated from Eq. (6.73),
I0 = J0 S, S = L y L z is the cross-section of the sample, and = ttr,d ; i.e., it is
determined via the average time, ttr,d , of electron transit through the diode; see Eq. (6.75).
The dependences of Re[Z ()] and Im[Z ()] are presented in Fig. 6.11. The impedance
of Eq. (6.77) differs considerably from the impedance of a material with the Drudelike high-frequency behavior. Importantly, as the voltage bias increases, the impedance
decreases and the current response increases 0 ; moreover, the spectral width of the
response becomes larger ( 1/ttr,d 0 ). Both conclusions provide evidence that a short
192
Electron transport
1.0
0.8
0.6
Z(wttr,d)
Z(0)
0.4
0.2
0.0
wttr,d
9 10
Figure 6.11 Dependences of the real (curve 1) and imaginary (curve 2) parts of the impedance of
diode with space-charge-limited dissipative transport under a high bias can operate at
higher frequencies than can a doped diode with Drude conductivity.
To conclude this analysis of dissipative electron transport in short samples, we want
to point out that the main results obtained are valid at large biases, when the diffusive
contribution to the current can be neglected:
dn(x)
.
(6.78)
|F(x)| D
dx
(A related analysis is proposed in Problem 7.) For such conditions, the transport is
determined to a large extent by the space-charge effects, the electric field is highly
nonuniform, and the injection current increases quadratically with the bias, as Eq. (6.73)
shows. Under high bias, the transit time through the diode decreases and the device
remains electrically active in a frequency range enlarged proportionally with the bias.
Hot electrons
Now, we return to large uniform crystals to study the effect of high electric fields on
the drift velocity and the currentvoltage characteristics. Equations (6.49)(6.51) were
obtained under the assumption that the relaxation time, e , is constant; i.e., it does not
depend on the magnitude of the electric field. When the field increases, the electron gas
is far from equilibrium. In particular, the average electron energy increases. This may
be understood from the following qualitative considerations. The electron energy in an
electric field may be described through the relationship
E E eq
dE
= e(vF)
.
(6.79)
dt
E
The first term on the right-hand side corresponds to the power gained by the electron
from the electric field; the second term represents the rate of electron-energy losses.
193
The losses are proportional to the deviation of electron energy, E, from its equilibrium
value, E eq , and E is the energy relaxation time introduced in Section 6.2. The electron
energy increases in the electric field until the overall energy balance is maintained. In
the stationary case, when dE/dt = 0, Eq. (6.79) yields an electron energy, E, given by
E = E eq + e(vF)E .
(6.80)
According to Eq. (6.49), the velocity, v, is linearly proportional to the electric field, F.
Thus, the mean electron energy increases as the square of the electric field and
exceeds the equilibrium value, E eq . It is convenient and conventional in semiconductor
electronics to consider the electron effective temperature, Te , instead of the mean
electron energy. The relationship between the temperature and the mean energy can be
found at thermal equilibrium and is E = kB Te /2, where the factor is the dimensionality of the structure. Obviously, under thermal equilibrium the electron temperature, Te ,
coincides with the lattice temperature, T . Under nonequilibrium conditions the two temperatures, T , and, Te , may differ. The effective electron temperature expressed through
the mean electron energy serves as a gauge of the nonequilibrium state. If Te exceeds T
only slightly, and the electron transport still obeys Ohms law, we have warm electrons.
The case with Te T corresponds to the situation in which the electrons are far from
equilibrium. For such a situation, the electrons frequently are called hot electrons. The
electron temperature can reach magnitudes of about several thousand degrees Kelvin,
while the lattice can remain cold. For simple estimates it is accepted that transition from
the warm-electron regime to the hot-electron regime occurs at the electric field F = Fhe ,
when e(vF)E is equal to E eq = kB T . It is easy to estimate the heating electric field,
Fhe . Indeed, in Eq. (6.80) the drift velocity v can be estimated from v = F; thus, the
critical heating field is
kB T
.
(6.81)
Fhe =
eE
For hot electrons, the scattering processes themselves become dependent on the field.
The linear relationship of Ohms law of Eq. (6.51) is no longer valid and the current
voltage characteristics, J = J (F), and the drift-velocityfield dependence, v = v(F),
can exhibit strongly nonlinear behavior, which depends on the electron bandstructure
and specific scattering mechanisms. In particular, these dependences are of different
shapes in the two most important materials. In group IV semiconductor materials, the
J (F) and v(F) dependences lead to the saturation effect at large fields, as shown in
Fig. 6.12(a), whereas in IIIV compounds, after a nonlinear portion with a rise, the
current and the velocity undergo a decrease in some interval of the fields, as shown
in Fig. 6.12(b). As follows from the analysis of different transport regimes given in
Section 6.2, for high-speed devices the important material parameter is the maximum
of the drift velocity, which can be achieved in high electric fields. A comparison of the
saturated and maximum velocities for relevent materials is presented in Table 6.2.
One can see that some of the IIIV compounds (including GaAs, InP, and InSb) have
velocities several times larger than those of Si, GaP, and AlAs. For Si the drift velocity is
restricted to a magnitude of about 107 cm s1 that can be reached for electric fields above
194
Electron transport
Material
Velocity
(107 cm s1 )
Si
SiC
SiO2
AlAs
GaP
GaAs
InP
InAs
InSb
1
2
1.9
0.65
1.1
2
2.5
4.4
6.5
(a)
(b)
103
107
T = 300 K
Si
102
InP
GaAs
106
77 K
193 K
298 K
10
10
102
F 0 lines
F1/2 lines
Low-field extrapolation
103
105
104
0
0
5 10 15 20 25 30 35
Figure 6.12 Currentvoltage characteristics at large electric fields: (a) Si and (b) GaAs and InP.
Reprinted with permission, (a), from E. J. Ryder, Mobility of holes and electrons in high
C 1953 by the American Physical Society.
electric fields, Phys. Rev., 90, 766 (1953).
several kV cm1 . For GaAs, the velocity maximum of 2 107 cm s1 can be achieved
at fields 3.5 kV cm1 .
(a)
(b)
6
Si
20 kV cm1
10 kV cm1
5 kV cm1
0.5
0.00 1 2 3 4 5 6 7 8 9 10
Time (1013 s)
GaAs
v (107 cm s1)
v (107 cm s1)
1.5
1.0
195
10
kV cm1
5 kV cm1
2
1
0
0
1 kV cm1
Time (1012 s)
Figure 6.13 The overshoot effect transient response of the electron drift velocity to step-like
pulses of the electric field at room temperature. The field magnitudes are indicated for each plot:
(a) Si and (b) GaAs. Reprinted with permission, from V. Mitin, V. Kochelap, and M. Stroscio,
Quantum Heterostructures, Fig. 7.13 (Cambridge, Cambridge University Press, 1999).
momentum and energy relaxation times, etc. Now, we will consider processes that occur
in the electron system immediately following its deviation from the equilibrium state. In
this particular case, we will focus on the response to the electric field in the form of a
step-function.
In general, the momentum relaxation time, e , entering Eq. (6.49) is shorter than
the energy relaxation time, E , which determines the electron energy; see Eq. (6.79).
Therefore, the velocity response to the electric field step is faster than the energy
response for the case described by Eq. (6.79). Typically, if the scattering rate increases
with increasing electron energy, the electron velocity may exceed the stationary velocity during a time interval of the order of E . In other words, the transient velocity
is not just a function of the electric field but also a function of the electron energy.
Indeed, the velocity adjusts itself quickly to the quite slowly changing energy, and
it follows that energy until the energy reaches the steady state. Initially, when the
electron energy has not reached the stationary value, the electron velocity corresponding to the transient energy is higher than the velocity corresponding to the stationary
energy.
The overshoot effect is shown in Fig. 6.13 for Si and GaAs. The drift velocity is
presented as a function of time for several electric fields. The electric field is assumed to
be switched on at time t = 0. The overshoot effect is pronounced in high electric fields.
The maximum of the transient velocity can exceed the stationary saturated velocity by
as much as two to four times.
From a physical explanation of the overshoot effect, one can understand how it is
possible to utilize this effect. Let us imagine that cold electrons enter an active region
of a semiconductor device through a contact. If there is a high electric field in the active
region, the electrons will be accelerated. At some distance from the injecting contact the
Electron transport
15
10
1.0
0.5
5
0
0.0
0.5
x (m)
1.0
0.0
v
4
2
0
0.0
0.5
1.0
0.5
0.4
0.3
0.2
0.1
0.0
E (eV)
ND
n
v (107 cm s1)
20
(b)
n, ND (1016 cm3)
(a)
F (kV cm1)
196
x (m)
Figure 6.14 Space-charge-limited transport with the velocity overshoot effect in an n+in+
GaAs diode at L x = 0.37 m, T = 77 K, and 0 = 0.5 V. (a) The doped regions, the electric
fields, F, and the injected electron concentration, n. (b) The drift velocity, v, and the average
energy of the injected electrons, E. From A. Ghis, E. Constant, and B. Boittiaux, Ballistic and
overshoot electron transport in bulk semiconductors and in submicronic devices, J. Appl. Phys.,
54, 214221 (1982). Reprinted wth permission from A Ghis, E Constant, and B. Boittiaux,
C 1983 American Institute of Physics.
Journal of Applied Physics, 54, 214 (1393).
electrons will reach the maximum overshoot velocity and after that their velocity will
gradually decrease to the stationary value. If the active region of the device is short and
comparable to the distance over which the overshoot effect takes place, electron transit
through this active region will occur at a velocity higher than the stationary velocity and
the overall transit time will be shorter. Consequently, the device will be able to operate
at higher speed and frequency.
As follows from Fig. 6.13, the characteristic time of the overshoot effect is about
tr = 0.5 1012 s for GaAs. Estimating the average velocity under the overshoot as
vm = (24) 107 cm s1 , we obtain an estimate for the device length necessary to realize
the discussed effect: L x vm tr = 0.10.2 m. That is, the overshoot effect and ultrahigh-speed electrons are achievable in short (submicron) samples.
At high biases and currents, the space-charge effects should be taken into account
together with the velocity overshoot. In Fig. 6.14, we present the calculations of spacecharge-limited electron transport in a short n+in+ GaAs diode, where n+ are heavily
doped contacts and i is the undoped base of the diode. The step-like doping of the contact regions is marked by ND . The field, the concentration of the injected electrons and
their average drift velocity, and the average energy are shown. The electric field and the
injected electron concentration are extremely nonuniform, which is similar to the previously analyzed case of dissipative electron transport. However, for a short diode, the
velocity overshoot in the diode base is clearly observed. The average energy increases
in the base, and drops only in the receiving contact because of strong scattering in
the heavily doped region. Interestingly, the drift velocity reaches a value of above 4
107 cm s1 and the electron energy exceeds 3500 K at a crystal temperature
of 77 K.
For the low-field, hot-electron, and overshoot transport regimes studied, the processes
of dissipation play a dominant role in controlling the electron drift velocity. By scaling
197
down the sample dimensions, one can achieve an ultimately fast collisionless electron
transport.
(6.82)
The velocity, v, can be found from the Newton equation (2.8); however, we will use the
energy conservation law, which reads as
mvc2
m v 2 (x)
(6.83)
e(x) =
ec = constant.
2
2
Here, vc and c are parameters corresponding to the injecting electrode (cathode). In
the case of a large electric bias, we can simplify the latter equation by considering that
the electrons are injected over the barrier with small velocity vc 0. Then, we exploit
the virtual-cathode approximation discussed previously. It is convenient to set c = 0
and obtain
2e(x)
v(x) =
(6.84)
m
and
n(x) =
J0
,
ev(x)
(6.85)
where J0 is the absolute value of the current density through the diode. As we noted in the
section devoted to dissipative transport, J is negative and it is convenient to introduce an
absolute value of current density, J0 , which is constant throughout the entire sample. On
combining the latter formula with Poissons equation (Eq. (6.68)) we obtain the equation
for (x):
d2 (x)
m
J0
.
(6.86)
=
2
dx
0
2e(x)
On multiplying both sides of the last equation by d(x)/dx, we obtain an equation that
can be integrated. The result of integration is
1 d 2
2J0 m
+ C.
(6.87)
=
2 dx
2e
198
Electron transport
The integration constant, C, can be set to zero, since within our approach as x 0 we
have d/dx 0 and 0. Now, we can rewrite the result as
1 d
J0 m
=2
.
(6.88)
1/4 dx
0
2e
Further integration gives us the final result for the distribution of the electrostatic
potential:
3 J0 2/3 m 1/3 4/3
x .
(6.89)
(x) =
2
0
2e
Taking into account that (L x ) = 0 , we can find the currentvoltage characteristic of
the ballistic diode:
3/2
0
2e 3/2
2
J0,b =
.
(6.90)
3
L 2x m 0
This equation is known as Childs law. It was found and verified first for vacuum diodes,
where collisions of electrons were totally absent. Childs law differs considerably from
the currentvoltage characteristic of the dissipative diode given by Eq. (6.73).
Using Eq. (6.89), we can calculate other parameters of the ballistic diode. For example,
the transit time for the electrons is
2m
3
ttr,b = t0,b , t0,b = L x
,
(6.91)
2
e0
with t0,b being the transit time of the ballistic electrons in the absence of space-charge
effects. The space-charge effects increase the transit time, ttr,b , by the factor 3/2.
It is instructive to compare the results obtained for space-charge-limited transport in
the dissipative and ballistic diodes. To make this comparison we assume that the lengths
of the diodes, L x , are equal and that the two diodes are biased equally. Then, from
Eqs. (6.75) and (6.91), for the ratio of the transit times we obtain
ttr,b
9 m 0
9e e0
9le
=
=
=
1,
(6.92)
ttr,d
8L
4 2eL x
4 2m L x
x
where we used the relationship = ee /m , and introduced the maximum mean-free
(a)
(b)
Contact
14
0.1
12
0.0
10
8
6
4
vT 2
0.0
0.1
0.2
x (m)
0.3
0.4
0
0.5
0.8
x = 0.1 m
F (v, x)
0.2
Contact
199
0.6
0.4
x = 0.3 m
x = 0.05 m
x = 0 m
0.2
0.0
10
v (107 cm s1)
15
Figure 6.15 The results of numerical modeling of a GaAs diode. (a) The distribution of electron
potential energy, e(x), and average velocity, v (x), across the diode. (b) The distribution
function, F(v, x), at x = 0, 0.05, 0.1, and 0.3 m. The applied voltage, 0 , is equal to 0.47 V;
the thermal velocity, vT , is equal to 2.6 107 cm s1 . Reprinted with permission from H. U.
Baranger and J. W. Wilkins, Ballistic electrons in an inhomogeneous submicron structure:
C 1984 by the American Physical
thermal and contact effects, Phys. Rev. B, 30, 7349 (1984).
Society.
is e = 2.9 1013 s. At the given temperature, the thermal velocity of the electrons
200
Electron transport
1.0
0.8
0.6
0.4
Z(wttr,b)
Z(0)
0.2
0.0
0.2
0.4
0.6
0
2
1
wttr,b
9 10
Figure 6.16 Dependences of the real (curve 1) and imaginary (curve 2) parts of the impedance of
well-pronounced maximum. For larger distances, this maximum corresponds approximately to values given by the previously discussed model of pure ballistic transport. This
example highlights the real nature of an almost ballistic diode: some of the electrons participate in transport and there occurs some spreading over the velocity. However, the average properties of the diode are very close to those obtained in the simple model based on
the classical Newton and Poisson equations.
The previously discussed simple model allows one to investigate high-frequency properties of the ballistic diode and calculate its impedance. The real and imaginary parts of
the impedance are
12Rd
[2(1 cos ) sin ],
(6.93)
Re[Z ()] =
4
12Rd
Im[Z ()] =
[2 sin (1 + cos ) 3 /6].
(6.94)
4
Here, Rd = d0 /dJ0 is the differential resistance calculated with the use of the steadystate currentvoltage characteristic of Eq. (6.90); = ttr,b , i.e., the frequency measured
in units of the inverse transient time defined by Eq. (6.91). Both dependences, Re[Z ()]
and Im[Z ()], are presented in Fig. 6.16. According to Eqs. (6.93) and (6.94) and the
definition of Rd , the magnitudes of Re[Z ()] and Im[Z ()] decrease as the voltage
bias, 0 , increases:
d0
1 3m L 2x
.
=
Rd =
dJ0
2 e0
0
That is, the current response becomes larger at larger biases (Rd 1/ 0 ). Simultaneously, the frequency region within which the diode is still active increases with the bias
( 0 ). These features are quite similar to those obtained for the dissipative Mott
Gurney diode; see Eq. (6.77). A principally new feature is the observable oscillations
of Re[Z ()] and Im[Z ()] with and the frequency windows arising with negative
201
(a)
Lead
Device
L
Lead
R
(b)
EF
x
Figure 6.17 (a) The structure schematic of a nanostructure mesoscopic device and (b) the
potential energy profile, e(x), in the biased mesoscopic device.
values of Re[Z ()]. The first window with Re[Z ()] < 0 occurs for between
6.3/ttr,b and 9/ttr,b . It is known that a negative Re[Z ()] corresponds to an electrical
instability at frequency . Such an instability can be used for generation of highfrequency electromagnetic oscillations, and we will discuss this phenomena in detail in
Section 8.2.
In conclusion, ballistic nanoscale devices are the fastest devices based on classical
electron transport. Both a finite electric current and a finite electric resistance in the
absence of electron scattering arise because of the electrostatic effects induced by redistribution of charged electrons in the device. This is the reason why this case is associated
with charge-limited transport. The ballistic diodes are characterized also by frequency
windows within which the dynamic resistance of the device is negative, which signifies
an electrical instability and the possibility of generating ultra-high-frequency electromagnetic radiation.
202
Electron transport
In order to avoid the need for a detailed description of the leads, we assume that the
leads are reservoirs of electrons, where energy and momentum relaxation processes are
so effective that the electrons remain in equilibrium even under a given applied voltage
bias. Hence, the boundary conditions at the interface between the leads and the device
are assumed to be determined by the equilibrium Fermi distribution function of Eq. (6.18).
The electron concentration in the leads is so high that the electrostatic potential in each
lead is taken to be constant, as for the case of a metal. Let E F be the Fermi energy of the
electrons in the leads in the absence of a bias. On applying a voltage bias, 0 , the Fermi
level in one of the leads becomes E F 12 e0 , while that of the second lead becomes
E F + 12 e0 , as depicted in Fig. 6.17(b). Thus, the electron distribution functions are
1
1
and
FF E e0 E F ,
(6.95)
FF E + e0 E F
2
2
at the left and right leads, respectively. Here E is the kinetic energy of the electrons.
In studying quantum wires in Section 3.3, we found that the electron spectrum consists
of a series of one-dimensional subbands with the energies
-h2 k 2
x
E kx ,n,m = n,m +
.
(6.96)
2m
Here, n and m are integers and k x is a one-dimensional wavevector directed along the
axis of the quantum wire. The electron wavefunction can be written as
(x, y, z) = (y, z) || (x).
(6.97)
203
The complex coefficients tr , rl , tl , and rr are connected by several fundamental relationships, which do not depend on the specific design of the device. The requirement of
current continuity for both wavefunctions, ||,l (x) and ||,r (x), gives
kl (1 |rl |2 ) = kr |tr |2
and
kr (1 |rr |2 ) = kl |tl |2 .
(6.100)
tr tl = tr tl .
(6.101)
and
(6.102)
By substituting the wavefunctions of Eqs. (6.98) and (6.99) into Eq. (3.10), which
defines the particle flux, we calculate the incoming, i in , and outgoing, i out , electron flows:
i in = vl
(6.103)
where vl = -hkl /m and vr = -hkr /m are the velocities at the L and R cross-sections,
respectively. The ratio of these quantities defines the transmission coefficient for
electrons moving through the device from the left to the right:
Tlr (E) =
i out
kr
= |tr |2.
i in
kl
(6.104)
The transmission coefficient corresponding to the electrons moving through the device
from the right to the left equals
Trl (E) =
kl 2
|tl | .
kr
(6.105)
From Eqs. (6.102), (6.104), and (6.105), one can find that
Tlr (E) = Trl (E) = T (E || ),
(6.106)
where
E|| =
-h2 k 2
x
2m
(6.107)
ir
= |rl |2 = |rr |2 .
i in
(6.108)
It is obvious that
T (E) + R(E) = 1.
(6.109)
Now, we can take into account contributions to the total current from all electrons
entering the device from both leads. Consider a state of the electrons, say in the left
lead, with quantum numbers kx , n, and m. The number of electrons in this state is given
204
Electron transport
(6.111)
(6.112)
Since, for our model, the electron velocity, v|| , and the transmission coefficient, T , are
independent of the transverse quantum numbers, n and m, we can calculate the sum over
n and m. Taking into account the explicit form of the Fermi function, FF , it is convenient
to introduce the distribution function dependent only on the kinetic energy E || :
1
.
(6.113)
FF (E || ) = 2
+ n,m E F
E
||
n,m 1 + exp
kB T
Next, as in Eq. (6.30) the summation over k x in Eq. (6.112) may be replaced by an
integration:
dk x
dE ||
{. . .} L c
(6.114)
{. . .} = L c
-hv|| {. . .}.
2
2
kx
Finally, we obtain the following expression for the total current:
dE ||
1
1
I = e
T (E || ) FF E || + e0 E F FF E || e0 E F ,
2 -h
2
2
(6.115)
where the integration range runs over the kinetic energy of the longitudinal motion E|| .
Note that the electron velocity does not appear in the final expression for the total current,
I . This general result may be applied to a variety of different cases. Let us consider two
of them.
2m
205
(6.116)
where S is the cross-sectional area. The term FF (E || ) has the meaning of the number of
electrons with energy E || . Then the total current density, J = I /S, is
E F E || + 12 e0
1 + exp
kB T
dE ||
m kB T
T (E || ) ln
J = e
. (6.117)
1
-h2
2 -h
E F E || 2 e0
1 + exp
kB T
This is indeed a useful result since Eq. (6.117) allows one to calculate the currentvoltage
characteristic of a nanostructure device and its dependence on the electron concentration,
temperature, etc. Interestingly, we found a finite current in this biased device, where no
scattering occurs. That is, the device possesses an electric resistance. The resistance is
explained by two factors: (1) quantum-mechanical reflection of electron waves inside
the device and (2) the finite number of electrons which can be injected into the device.
Note that no electrical charge redistribution was considered, i.e., relatively small currents
were supposed.
T 0
(6.118)
If the applied voltage, 0 , is small, the difference between the distribution functions in
the sum of Eq. (6.112) is equal to
1
1
FF E(k x , n, m) + e0 E F FF E(k x , n, m) e0 E F = e0 (E F E),
2
2
(6.119)
where we have taken into account the fact that the derivative of the step-like Fermi
distribution function is a -function. Hence, the current is proportional to the voltage
bias, 0 .
One can introduce the conductance of a nanostructure device as the ratio
G=
I
.
0
(6.120)
From Eqs. (6.112), (6.114), (6.119), and (6.120) we obtain the conductance at low temperatures in the form
G=
e2
e2
T (E F , n, m) = 2
T (E F , n, m),
h n,m,s
h n,m
(6.121)
206
Electron transport
where the sum extends only over the electron states (n, m) with energy E < E F . The
coefficient in front of the sum of Eq. (6.121),
G0 =
e2
,
h
(6.122)
Here each channel (n, m) contributes G 0 Tn,m to the conductance, G. If the channel corresponding to the (n, m) state is transparent to electrons, Tn,m = 1, and the contribution
of this channel is equal to the quantum of conductance of Eq. (6.122).
This regime is also referred to as quantum ballistics. Thus, a nanoscale device with
quantum ballistic transport exhibits finite conductance (finite resistance). Generally, the
conductance depends both on the transmission coefficient of the device and on the Fermi
distribution functions in the leads. In particular, even if the device is entirely transparent, T = 1, the conductance remains finite and is equal to the quantum of conductance, G 0 . An increase in the occupation of the upper low-dimensional subbands gives
rise to an unusual behavior of the conductance. Indeed, at low temperature, actually at
kB T E F , only those subbands for which the energy of the subband bottoms n,m < E F
are occupied. If, on changing the Fermi energy, E F , a new subband starts to populate,
the conductance of the device increases in a step-like manner, independently of how
many electrons occupy this new subband. In conclusion, the Landauer formula describes
quantum transport in mesoscopic devices. It is valid at low temperature and small voltage
bias.
The simplest device with quantum ballistic transport is the so-called quantum point
contact. It can be fabricated in a number of fashions. The essence of the quantum point
contact is that two electron reservoirs are connected through a conducting region with
transverse dimensions, L z and L y , comparable to the de Broglie wavelength of the
electrons. In Fig. 6.18(a), a sketch of a point contact on the {x, y}-plane is depicted.
The electron transport and current occur along the x direction. For simplicity, we can
assume that the channel has a constant dimension, L x , in the x direction. If the variation
of the channel dimension in the y direction, L y (x), is adiabatically smooth, we can use
Eq. (3.50) for the one-dimensional energy subbands:
-h2 2 n 2
-h2 2 m 2
-h2 k 2
x
E kx ,n,m =
+
+
,
(6.124)
2m L 2z
2m [L y (x)]2
2m
where we used approximate quantization of electron motion in the y direction. The
resulting potential profiles are shown in Fig. 6.18(b), for a few subbands in the structure
207
(a)
y = 0.5d + ax2
Ly(x)
d
y = (0.5d + ax2)
(b)
m = 1, 2, 3
EF
x
Figure 6.18 Geometrical representations of point contact (a) and quantized energy 1,m in the
region of point contact (L z L y,min ) (b). It is shown schematically that electrons can be in all
three subbands in the region with the densest hatching. Electrons can be only in the subband 1,1
in the region shown by the least-dense hatching.
with L z L y,min . For the indicated Fermi energy, we have only one channel open for
electron motion and the second channel has a small barrier. As soon as the Fermi energy
reaches the bottom of the next subband, a new channel opens for conducting electrons
and the conductance of this point contact undergoes a step-like increase. Typically,
such narrow constrictions clearly demonstrate the quantization of conductance. Since
the potential barriers in the constriction are quite wide, tunneling processes have small
probability. As a result, each of the channels is either almost open (the transmission
coefficient T = 1) or almost closed (T = 0); i.e., one will expect steps in the conductance
close to their universal value 2G 0 .
Consider as an example the narrow constriction defined by shallow etching on the
AlGaAs/GaAs structure with two-dimensional electrons. In order to perform the shallow
etching with well-controlled edges, a 13-nm-thick Al mask was formed by electronbeam lithography and lift-off, using a field-emission scanning electron microscope at
an acceleration voltage of 2.5 kV. The sample was then shallow etched, as described in
Chapter 4. After the etching mask was removed, an Al gate electrode was deposited,
covering the constriction. By applying a voltage to the gate, one can control the electron
density and, thus, the Fermi energy. A scanning electron micrograph of a shallow etched
constriction is shown in the inset of Fig. 6.19. The shape of the etched constriction
is characterized by two parameters: (1) the width of the constriction, d, and (2) the
curvature, a, of the parabolas describing the shallow etched walls of the constriction
Electron transport
16
14
12
10
G (e2/h)
208
8
31 K
6
4
20 K
11 K
2
1.3 K
0
0.6
0.7
0.8
0 (V)
0.9
1.0
Figure 6.19 The quantization of conductance, G, at temperatures T = 1.3, 11, 20, and 31 K.
seen in the picture. If the {x, y}-coordinate system is placed in the center of symmetry
of the constriction, the etched walls are described by
y = (0.5d + ax 2 ).
(6.125)
The quantum contact presented in Fig. 6.19 has the parameters d = 50 nm and
a = 0.001 25 nm1 . Using the second term in Eq. (6.124), we can estimate the energy
distances between the lowest subbands in the narrow cross-section of the constriction;
they are 20 meV, 16 meV, 11 meV, etc. The sample exhibits well-defined conductance
quantization at low temperatures, as depicted in Fig. 6.19 for a series of measurements at
various temperatures. The rise of the conductance for each step is close to the universal
value 2G 0 .
Single-electron transport
In the previously discussed cases of electron transport, we assumed that the number of
electrons participating in this transport is so large that the discrete nature of the electrons does not matter. For small devices operating with weak currents, this assumption
209
is no longer valid. Instead, a new type of charge-dependent electron transport, singleelectron transport, develops. Below we present the basic concepts of this transport
regime.
In general, in bulk-like materials and macroscopic devices the electron discreteness is
not manifested in average characteristics such as the local electron density, total electric
current, etc. However, it is well known that this discreteness manifests itself by contributing to current noise (fluctuations) even for macroscopic samples. For example, the
so-called shot noise is entirely due to the discreteness of the electron charge; it arises
because of the random process of electrons entering the device.
When the dimensions of the devices are scaled down, the role of the discreteness
of charge increases. In the case of ultra-small devices, the discreteness of the electron
charge gives rise to principally new effects in electron transport. This transport becomes
correlated; i.e., a transfer of one electron through the device depends upon the transfer
of others. The correlation appears because of the Coulomb interaction of individual
electrons. A new class of devices, referred to as single-electron devices, is based on such
processes.
Single-electron effects rely on a charging process that occurs when electrons enter
a tiny conducting sample. When the tiny conducting sample, often called the metallic island, is extremely small, the electrostatic potential of the island significantly
increases even when only one electron enters it. In general, the charging energy of a
sample is
EC =
Q2
,
2C
(6.126)
where Q is the charge and C is the capacitance of the sample. For a spherical island of
radius r , the capacitance, C, can be estimated as
C = 4
0
r.
(6.127)
210
Electron transport
~ 10
Tunnel
junction
Figure 6.20 The simplest MIM (metalinsulatormetal) system of single electronics: a tunnel
capacitance, C, and the conductance, G. We can suppose that the capacitance is roughly
proportional to the cross-section, S, of this junction. Thus, a small S implies a small
capacitance. The conductance of the junction, G, is small enough for one to consider
the system as a leaking capacitor. Let the system have a charge, Q. Then, when a single
electron leaks through the insulator, this event changes the initial electrostatic charge
just by the elementary charge, e: Q (Q e). Hence, according to Eq. (6.126), the
electrostatic energy of the junction changes by
!E C =
Q2
e(2Q e)
(Q e)2
=
.
2C
2C
2C
(6.128)
(6.129)
2 = (Q e)/C.
(6.130)
to
Then, the single-electron transfer leads to a voltage fluctuation across the junction equal
to
! = 1 2 = e/C,
(6.131)
(6.132)
The current related to the single-electron transfer can be estimated from the following
qualitative considerations. The uncertainty relation of Eq. (3.8) between the energy and
time allows us to estimate the extreme limit of the tunneling time, t , for such a junction:
t
h
.
!E C
(6.133)
!Q
!t
(6.134)
IS
e/2
EC =
211
Q2
2C
e/2
e Q
Figure 6.21 The charging energy, E C , for a tunnel junction with the capacitance, C, and charge,
Q, of one of the plates. The solid arrow indicates energetically favorable electron-tunneling
events, the dashed arrow indicates an energetically unfavorable event, and the dotted double
arrow indicates tunneling events without a change in energy of the capacitor, E C . Inset: a circuit
with charging of a tunnel junction by a current source.
e
e3
.
t
2hC
(6.135)
If this current, It , exceeds the current fluctuation due to the voltage fluctuation, !,
namely
e
It !I = G ! = G ,
(6.136)
C
the single-electron transfer processes will control the electric current through the junction. The latter inequality leads to the criterion
e2
G.
h
(6.137)
Note that the value on the left-hand side of this criterion just coincides with the quantum
of conductance, G 0 e2 / h, introduced by the Landauer formula of Eq. (6.121).
As a result of this analysis, we can state that, if the Coulomb energy of charging of the
device is greater than the thermal energy, kB T , and the current, I , associated with the
single-electron transport is greater than fluctuations of the leakage current, then electron
transport is correlated and single-electron effects are important. The criteria for such
single-electron transport can be formulated as follows:
C
e2
kB T
and
G
e2
;
h
(6.138)
i.e., both the capacitance and the leakage conductance of the device should be small.
Now we can discuss the physics of single-electron transfer by using the electric circuit
shown in Fig. 6.21. We assume a situation in which the tunnel junction in the circuit is
charged by a current source, IS . Starting with no net charge on the capacitor plates at
time t = 0, the current source slowly begins to charge the junction. For a given charging
rate and short time scales, the excess charge on the capacitor plates Q < e. Importantly,
212
Electron transport
I
e
C
e
C
Figure 6.22 The currentvoltage characteristic under Coulomb blockade. Within the voltage-bias
macroscopic plates can be charged with a charge less than the charge of a single electron
just by shifting the conduction electrons in the plates with respect to the positive ions of
the lattice (the so-called polarization charge). When the current source has charged one of
the plates of the capacitor to a charge equal to +e/2 and another plate to e/2, the plates
have the charge difference of 1e between them. The electrostatic energy of the junction
reaches the value marked by the dot in Fig. 6.21. Now 1e charge can tunnel through the
junction as shown by the dotted arrow in Fig. 6.21. Thus, before the tunneling the initial
charges on the plates were (+e/2, e/2), whereas after the tunneling the charges are
(e/2, +e/2). We see from Fig. 6.21 that the charge transfer does not change the energy
of the junction. The two charge configurations are, actually, identical. Because of this,
nothing can stop the electron from tunneling back to its initial state and so on. However,
the current source continues to charge the plates and the tunneling shown by the solid
arrow becomes energetically favorable (!E C in Eq. (6.128) becomes negative), and it
becomes energetically unfavorable for the electron to tunnel back to its initial capacitor
plate. Thus, in reality, we obtain tunneling of a single electron and the system starts
to accumulate the charge to provide for the tunneling of the next electron, which will
happen after some time has elapsed. The succession of electron-tunneling events means
that the electron flux is highly correlated. In Fig. 6.21 all discussed tunneling events
are shown schematically. From the considerations based on Eq. (6.128) one can see that
any electron transfer is prohibited for a small initial capacitor charge: e/2 < Q < e/2
(!E C for such Q is positive). This physical effect is called Coulomb blockade. If the
conditions of Eq. (6.138) are met, the charging energy plays the dominant role in the
system, the tunneling of an electron is energetically unfavorable, and at low temperature
tunneling is not possible at all (if it is blocked). This results in the specific current
voltage characteristic presented in Fig. 6.22. The main feature of such a characteristic
is the total suppression of the current in some finite interval of external voltage biases,
e/C < 0 < +e/C.
Importantly, the manifestation of the correlated transport strongly depends on the
external circuit to which the single-electron tunnel junction is attached. Let the external circuit impedance be Z (). If the impedance is as low as |Z ()| G 1
0 for the
most important frequencies, 1/t e2 /(hC), then charge fluctuations in the circuit
are greater than the elementary charge, e, and all correlation effects are suppressed.
Q
+
e
2
e
2
213
TSET
0
TSET =
1 = e
I
fSET
Figure 6.23 Bloch oscillations, showing the dependence of junction charging on time.
1
If the external circuit impedance is in the intermediate interval G 1
0 |Z ()| < G ,
the junction exhibits Coulomb blockade in the bias range from e/C to +e/C, as
shown in Fig. 6.22, but outside this range there is no correlation between tunneling
events.
Finally, an interesting current regime occurs if |Z ()| G 1 G 1
0 , when the external circuit can be considered as a source of a fixed direct current, I . This current causes
a recharging of the junction inside the range of Coulomb blockade without electron tunneling; it corresponds to a linear change in the charge with time: dQ/dt = I constant.
When the edge of the blockade range is reached, an electron tunnels through the junction.
The system finds itself again in the blockade range near the opposite edge and the
process repeats; see Fig. 6.23. Thus, one gets temporal oscillations of the charging with
a frequency, f SET , determined by the current:
f SET =
I
.
e
(6.139)
6.6
Closing remarks
In this chapter, we have shown that a number of different electron-transport regimes
can occur in semiconductors and their nanostructures. These regimes are characterized
by different values of the electron velocity and magnitude of the current, very different
currentvoltage dependences, etc. The electrons can behave as semiclassical particles,
or as quantum ones. If the device dimension along the current is much larger than the
214
Electron transport
6.7 Problems
215
6.7
Problems
1. Consider the bulk crystal of GaAs with an electron effective mass m = 0.067m 0 ,
where m 0 is the free-electron mass, and mobility = 105 cm2 V1 s1 at T = 77 K.
Using the relationships introduced in Sections 6.2 and 6.5, and Eq. (6.50) ( =
ee /m ), calculate
the de Broglie wavelength, ;
the scattering time, e ;
the thermal electron velocity, vT ;
the mean free path, le ; and
the diffusion coefficient, D (D = vT2 e /, = 3 for bulk material).
Determine the transport regimes for devices with feature sizes L x = 0.05, 0.5, and 5 m.
2. Consider a bulk crystal, a quantum well of thickness 10 nm, and a quantum wire of
cross-section 10 nm 10 nm. For such samples, the specific electron concentrations of
216
Electron transport
1018 cm3 , 1012 cm2 , and 106 cm1 correspond to the same three-dimensional electron
density. For these three cases, by using the results of Section 6.3, calculate and compare
values of
the Fermi wavevector, kF , and corresponding de Broglie wavelength ( = 2/kF );
the Fermi energy, E F ; and
6.7 Problems
217
U (eV)
0.2
0 10
50
x (nm)
Figure 6.24 A potential barrier U (x) = U0 /cosh(x/d) with U0 = 0.2 eV and d = 10 nm.
Estimate the mean free path, le = e vT . Set the diode length L x = le ; i.e., assume
that the diode is a ballistic device.
Calculate the current density, J , and the transit time, tr , for the ballistic diode at 0 =
0.1 V and compare your result with parameters obtained for Problem 7.
9. The Landauer formula of Eq. (6.121) includes the energy-dependent transmission coefficient, T (E). It is instructive to discuss this formula by using an example for which the exact solution of the quantum-mechanical Schrodinger equation is
known. One such case is the electron motion in the potential of a special dependence:
U (x) = U0 /cosh(x/d) (Fig. 6.24). One can see that this potential barrier has height equal
to U0 and the characteristic spatial scale d. It was found that the transmission coefficient
has the form
T (E) =
sinh2 (k x d)
,
U d 2
8m
0
sinh2 (k x d) + cosh2
1
-h2
2
for
8m U0 d 2
> 1,
-h2
Electrons in traditional
low-dimensional structures
7.1
Introduction
Now, we begin our analysis of novel developments in electronics that have resulted from
the use of nanostructures in modern electronic devices. Importantly, the attributes of
nanotechnology make it possible to pursue both devices with smaller dimensional scales
and novel types of device. Though the ongoing trend of miniaturization in electronics
is extremely important, the unique properties of electrons in nanostructures give rise
to novel electrical and optical effects, and open the way to new device concepts. The
electric current and voltage in a device are determined by two major factors: the concentration and the transport properties of the charge carriers. In nanostructures, these
factors can be controlled over wide ranges. In this and the next chapter we will study
nanostructures for which these basic factors that are important for the electronics are engineered, which are being exploited intensively both in research laboratories and in practical
nanoelectronics.
To distinguish the nanostructures already having applications from the newly emerging
systems, we refer to the former as traditional low-dimensional structures.
7.2
(b)
(a)
Vb(z)
Ec
Charge transfer
EF
e (z)
Vb
+
++
+
2DEG
EF
Ec
Unstable state
219
Thermal equilibrium
Figure 7.1 The charge-transfer effect in a selectively doped single heterostructure. (a) The
step-like profile is unstable. (b) Charge transfer results in ionization of a layer of the doped
material, band bending, lining up the Fermi levels across the structure, and formation of an
electron channel at the interface.
220
To complete the discussion of this simple model, let us note that in the real situation the
side of the heterostructure with the lower conduction-band edge is usually doped lightly
with acceptors. In this case, the right barrier of the well is higher than half of the bandgap
of the narrow-bandgap material. Of course, the residual acceptors reduce the electron
channel mobility but their effect is weak.
Here, we considered a heterostructure with the Fermi level defined only by the modulation doping. These structures are called ungated heterostructures. For gated heterostructures the physical picture is slightly different. Such gated heterostructures will be studied
subsequently.
0
Here, e is the elementary electrical charge,
is the dielectric constant, and
0 is the
permittivity of free space. The total charge consists of the charges of the electrons
and impurities. Let the wavefunctions of the electrons be (r), with being the set
of electron quantum numbers. Then, | (r) |2 represents the probability of finding the
electron of the state labeled by at the point r. In order to calculate the contribution of
the electrons to the space charge we introduce the energy-dependent electron distribution
function, F(E ), which determines the probability of the electron occupying the energy
&
level E . Thus, the electron charge density is e | (r)|2 F(E ). The charge of the
nonuniformly distributed positive donors and negative acceptors is determined by their
densities, eND (z), and, eNA (z), respectively.
We assumed that the potential, (z), does not depend on the x- and y-coordinates.
Thus, (r) can be factorized as discussed in Section 3.2:
1
(r) = ei(k x x+k y y) j (z),
S
{ j, k|| },
(7.2)
where S is the area of the junction and k|| = {k x , k y }. The factorization (7.2) leads to
(r) (r) = | (r)|2 =
1
| j (z)|2 ,
S
(7.3)
221
which makes Eq. (7.1) dependent only on the z-coordinate. We also obtain the onedimensional Schrodinger equation for the z direction:
-2
h
d2
+
V
(z)
e(z)
j (z) = j j (z).
(7.4)
b
2m dz 2
Here, the total potential energy consists of two contributions: the built-in potential of
the heterostructure, Vb (z), and the self-consistent potential, e(z). In our case, Vb (z)
corresponds to the energy-band discontinuity at the junction see Fig. 7.1(a) which
depends only on the z-coordinate. We set
Vb (z) = Vb (z),
where (z) is the Heaviside step-function. In Eq. (7.4), j are the energies of twodimensional subbands. Thus, the total electron energy is given by Eq. (6.27):
E j (k|| ) = j +
-h2 k2
||
2m
j
Here, we also introduce the sheet density of electrons for the level j:
EF j
1
m kB T
n s, j (E F )
F(s, k|| , j) =
ln
1
+
exp
.
S
-h2
kB T
(7.5)
(7.6)
s,kx k y
Actually, this result repeats Eq. (6.116). The quantity n s, j is a function of the temperature,
T , and takes the simplest form as T 0:
m
n s, j (T = 0) = - 2 (E F j )(E F j ).
(7.7)
h
Equation (7.7) indicates that the level j is occupied if the Fermi level exceeds the
corresponding energy of quantization of transverse electron propagation, j .
It is necessary to formulate boundary conditions for the coupled differential equations
represented by Eqs. (7.4) and (7.1). For localized electron states, we may set j (z) 0
for z . For the electrostatic potential we suppose that d/dz 0 for z
.
In conclusion, Eqs. (7.1), (7.4), and (7.5) together with the boundary conditions complete the formulation of the self-consistent problem that describes the formation of a
two-dimensional electron gas at the interface as well as the electron quantization. By
integrating Eq. (7.1) over an infinite range of z and using the boundary conditions, one
can find a neutrality equation in the form
+
n s, j +
dz[NA (z) ND (z)] = 0.
(7.8)
j
This result implies that, despite the charge transfer, the entire system remains electrically
neutral.
222
Vacuum level
(b)
(a)
1
2
EF
Ec
Eg1
GaAs
AlGaAs
Eg2
+ +
1
Eg2
Eg1
Ev
Figure 7.2 Schematic energy band diagrams of a selectively doped AlGaAs/GaAs
heterostructure before (a) and after (b) charge transfers have occurred. Relative positions of the
valence and conduction bands are given for both materials. The electron affinities are shown
conditionally (left). The Fermi level in the AlGaAs material is supposed to be pinned on the
donor level. The narrow-bandgap GaAs is slightly p-doped. After C. Weisbuch and B. Vinter,
Quantum Semiconductor Structures (San Diego, Academic Press, 1991).
223
0.3
Al0.3Ga0.7As
GaAs
Energy (eV)
dsp
0.2
0.1 +
2DEG
EF
0
300
200
100
100
200
300
400
z ()
Figure 7.3 Calculated self-consistent potentials, energy levels, and wavefunctions of an
Al0.3 Ga0.7 As/GaAs selectively doped heterostructure. The junction is situated at z = 0. The
spacer thickness is 50 and the donor binding energy of AlGaAs is chosen as E D = 50 meV.
After T. Ando, Self-consistent results for a GaAs/Alx Ga1x As heterojunction. I. Subband
structure and light-scattering spectra, J. Phys. Soc. Japan, 51, 3893 (1982).
side. For such a doping the quantum well occurs only for electrons and does not occur for
holes, since the AlGaAs/GaAs heterojunction is of type I (according to the classification
of Chapter 4.)
In Fig. 7.3, the results of numerical solution of Eqs. (7.1)(7.5) are presented for
a selectively doped Al0.3 Ga0.7 As/GaAs heterojunction. Heavily n-doped Al0.3 Ga0.7 As
and lightly p-doped GaAs materials are separated by a thin undoped Al0.3 Ga0.7 As layer
with thickness dsp . Such an undoped layer is called the spacer layer or simply a spacer.
In calculations, the sheet electron concentration, n s , in the channel at the junction is
equal to the number of ionized donors per unit area, Ns = 5 1011 cm2 . The sheet
concentration of ionized acceptors, Ndepl , is supposed to be much smaller: Ndepl = 5
1010 cm2 . The self-consistent potentials, energy levels, and wavefunctions are shown
for two different cases. For the first case, shown by the dashed curve in Fig. 7.3, the
trial electron wavefunction is defined by Eqs. (7.9). The second case, shown by the solid
curve in Fig. 7.3, corresponds to a more sophisticated trial wavefunction, which can
penetrate under the barrier into the wide-bandgap part of the structure. In the region of
classically allowed motion, the wavefunctions for all of these approximations are similar
to each other. However, they are drastically different in the barriers. Generally, the trial
function of Eqs. (7.9) gives a reasonable value of the energy 1 , as well as the behavior of
the potential at large distances. However, the second trial function yields more accurate
results. In particular, it gives a slightly lower energy level. Results obtained for various
concentrations, n s , of electrons at the interface are summarized in Table 7.1.
In this table, the lowest energy level, 1 , the spatial scale of the electron confinement,
z, and the probability of finding electrons in the barrier, Pb , are presented for three
224
1 (meV)
z (A)
Pb (%)
38.6
116
0
56.6
99
0
86.1
82
0
II
1 (meV)
z (A)
Pb (%)
32.00
100
0.7
45.6
82.6
1.11
66.3
65.6
1.95
100 A.
225
(b)
Gated heterostructure
Metal gate
(a)
Ungated heterostructure
n+-AlGaAs
n+-AlGaAs
2DEG
p-GaAs
p-GaAs
Figure 7.4 Schematic diagrams of (a) ungated and (b) gated heterostructures with
(a)
eb
(b)
+
+
eb
+
+
+
+
+
EF
EF
EF
2DEG
+
EF
1
ld dsp 0
ld
dsp 0
Schottky voltage controls the depletion region under the metallic gate. It results in (a) a
normally-off device for a narrow barrier and (b) a normally-on device for a wide barrier with
2DEG in the potential well (60 nm or wider).
226
undoped spacer. The structure is shown in Fig. 7.5(a) for a relatively thin AlGaAs layer,
and Fig. 7.5(b) depicts a structure with a thicker AlGaAs layer. For both cases, there
exists an extended depletion region, which affects the electron channel formed at the
AlGaAs/GaAs interface and provides two possibilities of controlling the structure. The
normally-off structure corresponds to Fig. 7.5(a). The depletion region extends through
both a thin AlGaAs layer and the junction. The bottom of the quantum well shifts up.
The Fermi level lies under the lowest energy subband. Thus, there are no electrons inside
the channel and the conductivity along the heterostructure is almost zero. The donors
in the AlGaAs doped region are ionized, and the electrons have left the semiconductor
part of the structure which is charged positively. In order to turn on the conductivity
of the device, it is necessary to apply a positive voltage to the metal gate. normally-off
structures can be fabricated by using a thin AlGaAs barrier.
The normally-on structure is illustrated in Fig. 7.5(b). In this case, the built-in voltage
drops across a thick AlGaAs layer so that the Fermi level lies above the lowest subband
and electrons populate the channel without an external voltage bias. This channel has
a finite conductivity under normal conditions. This case can be realized for sufficiently
thick AlGaAs layers. Thus, in normally-on devices, one can control the conductance
of the channel by applying a negative voltage to the metal. A large voltage leads to a
depopulation of the channel and can switch the device off.
The calculated electron potential energy and quantized levels both for normally-off
and for normally-on devices are shown in Fig. 7.6 for various gate voltages, G . The
Fermi level is taken to be at the zero energy. The values for the normally-off device were
see Fig. 7.6(a). The quantum
calculated for an AlGaAs layer thickness of about 400 A;
well formed on the interface contains up to four quantized levels. A positive voltage
lowers the bottom of the conduction band of GaAs at the interface. The bottom touches
the Fermi level at G = +0.3 V and the device is turned on at a threshold voltage of
about +0.8 V when the first quantized level touches the Fermi level. It is clearly seen that
in the AlGaAs barrier layer a potential minimum occurs and tends to be lowered with
increasing gate voltage. This phenomenon can result in a negative effect: the formation
of a second channel in this layer, which will collect electrons, screen the gate voltage,
and result in loss of control of the concentration of the two-dimensional electron gas at
the interface. The normally-on device is shown in Fig. 7.6(b). It has an AlGaAs layer
The device can be switched off when a negative voltage of
thickness greater than 600 A.
about 0.5 V is applied to the gate.
The results just presented were obtained by numerical calculations. Let us consider
some experimental data related to the problem of modulation-doped heterostructures
where carrier concentrations and their mobilities have been measured simultaneously.
In Fig. 7.7 the sheet electron concentration controlled by the gate voltage is shown for
Al/AlGaAs/GaAs systems fabricated for high-electron-mobility transistors (HEMTs).
The curves correspond to various spacer thicknesses dsp . One can see that the electron
concentration can be varied over one order of magnitude. Saturation of the sheet concentration at high positive voltage is caused by transitions of electrons to the potential well
which is formed in the middle depleted barrier region, as discussed previously. Figure 7.7
(a)
227
(b)
300
G
= 0.3 V
= 1.5 V
Energy (meV)
200
100
0
G
= 0.8 V
GaAs
AlGaAs
100
0
400
800
= 0.5 V
AlGaAs
400
GaAs
800
shows that considerable changes in the properties of the structures occur with changes
in spacer thickness. Among the various structures presented in Fig. 7.7, the particular
structure with dsp = 0 is most likely to be useful for fabricating a normally-off device.
Indeed, by applying a positive voltage to the gate one can increase the electron concentration in the channel. Structures with a thick spacer are well suited for normally-on
devices; in these structures, positive voltage does not change the concentration, whereas
a negative voltage reduces it sharply.
The spacer is an important element of modulation-doped structures because it partially
prevents the scattering of channel electrons by the heavily doped side of the heterostructure and increases the electron mobility. On the other hand, there is one negative effect
of a spacer. Increasing the spacer thickness leads to an increase of the potential drop
on the spacer and consequently to a lowering of the electrostatic potential that confines
electrons near the interface. Hence, a thick spacer causes a decrease in the electron concentration. This trade-off between the mobility and the carrier concentration requires an
optimization of the structural design for each particular device application.
228
11
10
10
dsp =
0
cm2
ndepl
Al0.3Ga0.7As/GaAs
T = 12 K
45
75
180
2
0
2
1
0
Gate voltage,
1
G
(V)
electrons in Al0.3 Ga0.7 As/GaAs structures at T = 12 K and various spacer thicknesses dsp .
All samples are Si-doped with ND = 4.6 1017 cm3 , except for the sample with 180- spacer,
which has ND = 9.2 1017 cm3 . The thickness of the doped Al0.3 Ga0.7 As layer is
7001200 . From K. Hirakawa, H. Sakaki, and J. Yoshino, Concentration of electrons in
selectively doped GaAlAs/GaAs heterojunction and its dependence on spacer-layer thickness
and gate electric field, Appl. Phys. Lett., 45, 253 (1984). Reprinted with permission from K.
C 1984 American
Hirakawa, H. Sakaki, and J. Yoshino, Applied Physics Letters, 45, 253 (1984).
Institute of Physics.
Here, we have considered systems with a single heterojunction. These systems can
be fabricated by a relatively simple technology and they have numerous applications,
but they suffer from greatly limited carrier concentration in the conduction channel. As
can be seen from Fig. 7.7, the typical surface concentrations are less than 1012 cm2 for
single-heterojunction devices. Higher concentrations of carriers in the conducting channel can be obtained in a double-junction system. This system is illustrated by Fig. 7.8,
where possible double-junction heterostructures are presented schematically. For heterostructures of type I, quantum wells are formed both for electrons (two quantized
levels are shown) and for holes (one quantized level is shown), whereas for a type-II
heterostructure a quantum well arises only for the electrons.
7.3
(a)
229
(b)
Conduction
band
Vb,e
Vb,h
2,e
2,e
1,e
1,e
1,h
L 0 L
2
2
Valence
band
L 0 L
2
2
Figure 7.8 Two types of band diagrams of double heterostructures: (a) type I and (b) type II.
(a)
(b)
G
A
B
Figure 7.9 Quantum wire formed by etching (a) and split-gating (b) of the two-dimensional
electron gas.
12
10
G ( 2e2/h)
230
8
6
4
2
0
T = 4.2 K
wire length
(m)
0.2
0.4
1
2
5
7
10
0.8
0.9
1.0
1.1
1.2
g (V)
Figure 7.10 The dependence of the conductance, G, of a quantum wire versus gate voltage, g ,
for seven different lengths of wires as indicated near each curve. After L. Worschech, F.
Beuscher, and A. Forchel, Quantized conductance in up to 20 m long shallow etched
GaAs/AlGaAs quantum wires, Appl. Phys. Lett., 75, 578 (1999). Reused with permission from
C 1999
L. Worschech, F. Beuscher, and A. Forchel, Applied Physics Letters, 75, 578 (1999).
American Institute of Physics.
231
effect and allow one to determine the concentration of one-dimensional electrons. The
typical concentration and subband spacing were found to be n 1D 6 106 cm1 and
!1,2 12.5 meV, respectively. The relatively large subband spacing equivalent to
140 K implies that the one-dimensional character of electron motion should be
unchanged even at temperatures of several tens of degrees Kelvin.
For higher temperatures, at which electron motion becomes semiclassical, the transport
properties of quantum wires based on semiconductor heterostructures are similar to those
of quantum wells and pure bulk-like materials. Thus, selectively doped quantum wires
should exhibit high electron mobilities under low electric fields and high drift velocities
characteristic of hot electrons under high fields.
The situation with electron transport in carbon nanotubes is quite different. In high
fields the electron drift velocities can reach magnitudes in the range (24) 107 cm s1 ,
which are larger than those in perfect IIIV semiconductor compounds. However, the
electrical properties are affected significantly by the surrounding environment, which
influences the removal of heat from current-carrying nanotubes. For example, suspended
nanotubes display drastically different electron transport from that of those on substrates.
Figure 7.11 illustrates this difference. In the upper panel of this figure, scanning electron microscope images of two single-walled carbon nanotubes with Pt contacts are
shown. The left nanotube segment is not suspended and is in contact with a nitride-based
substrate. The right nanotube segment is suspended over a 0.5-m-deep trench. The
diameters of the nanotubes were measured by the atomic-force microscope to be in the
range 23 nm. In the lower panel, the results of measurements of the currentvoltage
characteristics for the 3-m length of suspended and non-suspended segments of the
nanotube are given. The measurements were conducted at room temperature in vacuum.
One can see that the non-suspended nanotube portion displays a monotonic increase
in the current, approaching 20 A under increasing voltage, V , while the current in
the suspended tube reaches a peak of 5 A followed by a pronounced current drop.
This strikingly different behavior is due to significant self-heating effects of the wires
carrying ultra-high current densities. Indeed, in the case of the suspended tube the Joule
heating can not be removed effectively, because thermal fluxes are possible only through
nanotube contacts. As a result, the temperature of the tube increases, especially in its
central region.
7.4
(a)
(b)
I (A)
232
V (V)
Figure 7.11 (a) Carbon nanotubes on a substrate and suspended over a trench; (b) their
voltage applied to the gate. In Fig. 7.12, three types of quantum dots are shown: a
nanosized island self-forming under a special growth regime, colloidal nanocrystals, and
a quantum box fabricated in a controlled way from a two-dimensional heterostructure.
An example of a quantum dot formed by using the technique of fabricating a gate upon
a two-dimensional heterostructure will be discussed in the next chapter; see Section 8.4.
The most notable feature of quantum dots is that all existing degrees of freedom of
electron propagation are quantized, if a confining potential is deep and the dimensions
of the structure are comparable to the de Broglie wavelength of the electron, as defined
in Eq. (6.5). According to the analysis given in Section 6.2, such a system can be called
a zero-dimensional system. The latter terminology stresses the dramatic changes of the
electronic properties in quantum dots. Indeed, in a crystalline solid, as was discussed in
Chapter 4, electrons possess continuous energy bands with the occupation, width, and
(a) As2
In
233
InAs
GaAs
AlGaAs
(b)
CdS
Molten Silicate
(c)
8 nm
AlGaAs
GaAs
AlGaAs
Figure 7.12 Schematic representations of three different approaches for the fabrication of
quantum dots: (a) self-organized growth of nanosized islands, (b) nanocrystals in a colloid, and
(c) artificial patterning and etching of a heterostructure with two-dimensional electron gas.
separation of these bands determining the fundamental electrical and optical properties
of the solid. At the other end of the length scale, for individual atoms the electronic state
density is discrete, resulting in, for example, the absence of simple electron transport
and intrinsically sharp spectral optical lines. In some respects, the electronic structure
of quantum dots might be said to fall somewhere between these two extremes. Thus,
quantum-dot structures are like large artificial atoms, macroatoms.
For the analysis of the electronic properties of a quantum dot, a model of a confining
potential may be used. In Section 3.3, we considered two examples of these models:
the quantum-box model given by Eq. (3.51) and the spherical-dot model of Eq. (3.53).
The results obtained with these models for the energy spectra and wavefunctions can be
applied to semiconductor quantum dots, if the masses are taken to be the effective masses
of the material being analyzed. The height of the confining potential is determined by
the band offsets, see Fig. 4.12.
Consider semiconductor quantum dots fabricated from a two-dimensional heterostructure by using an etching process, or by colloidal synthesis, as illustrated in Figs. 7.12(b)
and 7.12(c). In these cases, depending on the heterostructure type, it is possible to realize
different types of carrier confinement. The classification of heterostructure types was
presented in Section 4.5. If the initial heterostructure is of type I, both electrons and
holes can be confined in the same quantum-dot structure. If the initial heterostructure is
of type II, only one type of carriers, either electrons or holes, can be confined.
If electron motion is quantized in all three possible directions, we obtain a new physical
object, a macroatom. Questions concerning the usefulness of such objects for applications
234
naturally arise from the point of view of their electronic applications. A fundamental
question is the following: what is the current through a macroatom? A valid answer is
that there exists the possibility of passing an electric current through an artificial atom
due to tunneling of electrons through quantum levels of the macroatom.
Importantly, that transport of charge occurs always in quanta of the elementary charge
of the electron. This discreteness has no practical consequences for current flowing in
bulk materials, in two-dimensional structures, and even in quantum wires, because the
numbers of electrons transported are large. Charge transport in quantum-dot systems
is completely different. Indeed, it occurs as electron tunneling from a cathode through
the quantum dot to an anode. That is, the clearance space between the cathode and the
anode should be considered as a potential barrier in which a quantum dot is embedded.
In Section 3.3, while studying the tunneling effect, we found that the probability of
tunneling drastically depends on the thickness and height of the barrier: for large enough
thickness and barrier height, tunneling is essentially suppressed. However, tunneling
through a discrete state localized inside the barrier has much higher probability. The
whole tunneling process can be thought of as a sequential process: the capture of an
electron from the cathode to the localized state and then its emission to the anode. As a
result, the easiest way for the electrons to be transferred from the cathode to the anode
is for them to tunnel through the quantum dot. Then, an electron being captured to
the quantum dot blocks tunneling of other electrons. Electron transfer occurs in highly
correlated manner, one by one, at least at low temperatures. Examples of such singleelectron transport were discussed in Section 6.5. Thus, the electric current through a
quantum dot occurs in the regime of single-electron transport. Devices that use such
transport will be discussed in the next chapter.
In fact, electrical methods applied to quantum dots to realize useful devices are not
the only methods possible. The control of the electric current through the dots can also
be realized by means of light, sound waves, etc. Consider here optical control of the
dots and optoelectronic functions of zero-dimensional devices. The main peculiarities
of the optical properties of quantum dots arise due to electron and hole quantization.
In quantum dots fabricated using a type-I heterostructure, the carrier energies have the
form
E eQD = E gQD + n (n 1 , n 2 , n 3 ),
QD
QD
Eh
= p (n 1 , n 2 , n 3 ).
Here, E g is the fundamental bandgap of the material of the quantum dots. For a type-I
QD
heterostructure, E g is less than the bandgap E g of the surrounding material into which
the dots are embedded; n and p depend on sets of three discrete quantum numbers,
{n 1 , n 2 , n 3 } and {n 1 , n 2 , n 3 }, for the electrons and holes, respectively. Specific depenQD
QD
dences of E e and E h on the quantum numbers are determined by the materials used
for quantum dots and their environment, the geometry of dots, etc. The model dependences of Eqs. (3.51) and (3.53) can be used for estimation of the energy levels. Owing
to these discrete energy spectra, quantum dots interact primarily with photons of discrete
energies:
-h 2 hc = E QD + n (n 1 , n 2 , n 3 ) + p (n , n , n ),
(7.10)
g
1
2
3
235
Ec1
QW
QD
1.15
1.20
1.25
Energy (eV)
Ehh1
R
X
GaAs
InGaAs
GaAs
20 nm
100 nm
Figure 7.13 A sketch of quantum dot confining potentials for electrons and holes. Peaks in optical
spectra correspond to phototransitions between quantized states of the electrons and holes in the
quantum dot. Reprinted with permission from J. Tulkki and A. Heinamaki, Confinement effect
C 1995
in a quantum well dot induced by an InP stressor, Phys. Rev. B, 52 (11), 8239 (1995).
by the American Physical Society.
where c is the velocity of light and is the wavelength of the light. The different combinations of quantum numbers {n 1 , n 2 , n 3 } and {n 1 , n 2 , n 3 } give a series of optical spectral
lines, for which interaction between the dots and light is efficient. Importantly, the fact
QD
that E g < E g implies that the light interacting with the dots is not absorbed by the
surrounding material. These considerations are illustrated in Fig. 7.13. The potential
profiles of the conduction band, E c , and the upper (heavy-hole) valence band, E hh , are
depicted. The potential wells for electrons and holes represent the confinement potentials of the quantum dot. Quantized levels for electrons and holes are shown; arrows
indicate the possible phototransitions between different quantized states. For each such
transition, a characteristic line of the optical spectrum arises. For the example of an
InGaAs quantum dot grown inside a GaAs quantum well, the quantum-dot spectral lines
arise in the region of photon energies from 1.1 to 1.3 eV, as presented in Fig. 7.13. The
width of observed spectral lines depends primarily on two factors. First, the spectral
lines are broadened because of relaxation processes in the system. Second, it is typical that in experiments light interacts with numerous dots and there is some dispersion
of their sizes. This produces a dispersion in energy-level positions and additional line
broadening.
The optical control of the electric current flowing through a quantum dot can be
explained with the help of a device that can be called a single-quantum-dot photodiode.
InGaAs
QD
Vb
EF
EF
hv
n+-GaAs
i-GaAs
Metal
Energy
236
Depth
Figure 7.14 A schematic energy band diagram of a single-quantum-dot photodiode for
photocurrent experiments, with an n+ -contact on the left and a metal contact (Schottky contact)
on the right. After Figs. 1 and 13(b), H. Kenner, S. Sufler et al., Recent advances in
exciton-based quantum information processing in quantum dot nanostructures, New J. Phys., 7,
C IOP Publishing Limited.
184 (2005).
The diode base is an i region (undoped, i.e., intrinsic region), into which a single quantum
dot is embedded. A schematic band diagram of such a photodiode under equilibrium,
when the Fermi level, E F , is the same throughout the whole structure, is shown in
Fig. 7.14. At the semiconductor/metal heterojunction a potential barrier Vb (the Schottky
barrier) arises. An electric bias is to be applied to two terminals; in the case under
consideration they are the n+ region (on the left) and a metal contact (the Schottky contact)
on the right. Under inverse biasing, an electric current is possible, in principle, due to the
interband tunneling mechanism. The probability of such tunneling is extremely small and
the current is negligible even if a quantum dot is embedded in the base. Illumination of the
diode by light resonant with phototransitions between discrete electron and hole levels in
the quantum dot leads to the excitation of an electronhole pair inside the dot. Now, the
electron and hole can tunnel more readily from the dot into the i region and contribute
to the electric current, the photocurrent. In Fig. 7.15, we present the experimental
results obtained from excitation of the ground state (n 1 = n 2 = n 3 = n 1 = n 2 = n 3 = 1)
of a single self-assembled In0.5 Ga0.5 As quantum dot embedded into a 360-nm-thick
intrinsic GaAs layer. Since InGaAs/GaAs heterostructures are of type I, the only optically
active part is the single In0.5 Ga0.5 As quantum dot. The experiments were carried out
at 4.2 K. In Fig. 7.15, the photocurrent is plotted as a function of the electric bias
for various wavelengths of illuminating light. The wavelengths are indicated on the
photocurrent curves. One can see that photocurrentvoltage dependences have, in fact, a
pronounced resonant character. This is explained by the fact that the quantized electron
and hole energies are shifted under an applied electric field, as expected from the socalled Stark effect observable for atoms and molecules. When these energies are such that
the wavelength given by Eq. (7.10) corresponds to the illuminating light, the light excites
30
Photocurrent (pA)
25
laser=
T = 4.2 K
927.35 nm
927.62 nm
20
15
237
927.1 nm
928.03 nm
10
928.45 nm
5
0
0.4
0.6
0.8
1.0
1.2
1.4
electrons and holes inside the quantum well, which produces the measured photocurrent.
As the applied bias increases, the energies are shifted to smaller values and the resonance
wavelength increases. In Fig. 7.15, spectra for excitation of the same ground state of the
dot for different biases are shown. The observed photocurrent spectra are very narrow
because a single dot is involved. Spectral broadening becomes visible at high biases when
the electron and hole energy levels decay as a result of the increased rate of tunneling
from the dot.
A photocurrent excited resonantly with an electrically tunable optical resonance can
find numerous applications. Specifically, such a quantum-dot photodiode facilitates optical manipulation of individual quantum states tunable with an electric bias. Such a
capability is necessary for devices required as components of systems used in quantum
information technology.
7.5
Closing remarks
In this chapter, we focussed on traditional quantum structures that are already being
exploited in nanoelectronics and optoelectronics. These include quantum wells, wires,
and dots. These structures provide electron confinement in one, two, and three dimensions, respectively. We found that confinement of the electrons in any direction forbidding
free propagation in this direction leads to quantization of the electron energy.
We showed that the confinement effect can be very strong. In particular, for one of the
most used and practical cases when electrons are confined at a heterointerface we
found that the electrons are localized within a thin spatial layer of thickness ranging from
238
7.6
Problems
1. Consider a quantum-well structure with the electron energy given by Eq. (6.27).
Assume that for the lowest two-dimensional subbands the intersubband distance !21 =
2 1 is given. For T 0, find the formula for the critical concentration n c of twodimensional electrons at which the second subband starts to be populated. Using 1 , 2 ,
and !21 for an infinitely deep quantum well, estimate the critical concentration, n c , for
a quantum well of width L = 10 nm.
7.6 Problems
239
2. To obtain characteristics of the electrons confined at an interface, one can use the
electron wavefunction of the lowest quantized state in the form of Eq. (7.9), which
depends solely on the parameter b. Apply the definition of the expectation value of a
physical quantity presented in Eq. (3.12) and find an expression for the average distance
of the electrons from the interface z. Calculate the mean square deviation of the location
of electrons from its average position; i.e., calculate (z z)2 .
3. Near the semiconductor heterojunction the electrons can be confined in the direction
normal to the heterojunction. In Figs. 7.1 and 7.2, sketches of confining potentials are
presented. If the potential has high barriers, the lower energy levels may be studied with
some accuracy by applying the following triangular approximation:
V (x) =
eF x,
,
for x > 0,
for x 0.
near these minima. The minima are at the following crystalline directions: [100], [100],
[010], [010], [001], and [001] (see Table 4.4). Within these minima, the electron energy
spectrum is given by Eq. (4.17) with effective mass tensors of the type of Eq. (4.19).
Let us consider a Si quantum-well layer oriented perpendicular to the [001] direction.
the
For free electrons collected in the pair of valleys along directions [001] and [001],
240
-h2 2
-h2 2
2
+
,
2
2
2m t x
y
2m l z 2
while for the other two pairs of valleys the Hamiltonians are
-h2 2
-h2 2
-h2 2
,
2m l x 2
2m t y 2
2m t z 2
-h2 2
-h2 2
-h2 2
.
2
2
2m t x
2m l y
2m t z 2
and
These Hamiltonians imply that the electron motion along the quantum-well layer and
that in the perpendicular direction are completely independent. The wavefunctions and
quantized energies should have the form of Eq. (7.2).
Assume that the potential barriers of the quantum well are infinitely high. Employing
the results of Eqs. (3.27) and (3.28) for an infinitely deep potential, find energy subbands
for the different electron valleys. Discuss the energy splitting of the valleys in the quantum
well. Using the data for m l and m t presented in Table 4.4, determine which pair of valleys
will be of the lowest energy. In bulk Si crystals, the numbers of electrons in different
equivalent minima are equal. Can we expect an electron redistribution to be caused
by quantization in the well? Which valleys will be overpopulated and which will be
depleted?
5. A nanostructure consists of two semi-infinite two-dimensional electron gases connected through a quantum wire. The entire structure is fabricated from a single infinite
quantum-well layer. Does a potential barrier arise for electron motion from one twodimensional gas to the another? Explain why the barrier arises.
Using the adiabatic approach described by Eq. (6.124), discuss the barrier heights for
the electrons from different two-dimensional subbands.
Assume that the thickness of the quantum-well layer fabricated from GaAs is 5 nm and
that the quantum-wire width is 5 nm. Estimate the height of this barrier for the electrons
populating the lowest two-dimensional subband.
6. In IIIV compound semiconductors the energies of heavy and light holes coincide
at zero wavevector, as shown in Fig. 4.10. Energy levels exhibiting such a coincidence in
energy are known as degenerate energy levels. Quantization of the two types of holes lifts
this degeneracy. Using models with infinite-potential walls for quantum wells, wires, and
dots, calculate the splitting between the lowest states of heavy and light holes in these
low-dimensional structures.
Obtain numerical estimates of this splitting for GaAs structures with the following
geometrical parameters: a quantum well of thickness 5 nm, a quantum wire of crosssection 5 nm 5 nm, and a quantum dot of volume 5 nm 5 nm 5 nm. Parameters of
heavy and light holes are given in Table 4.5.
7.6 Problems
241
Nanostructure devices
8.1
Introduction
In previous chapters, fundamental physical processes on the nanoscale, analysis of nanomaterials, and nanofabrication methods were all discussed extensively. The knowledge
gained in these previous discussions makes it possible to consider and analyze a variety of different nanostructure devices. In this chapter, we consider electronic, optical,
and electromechanical devices. Some of these devices mimic well-known microelectronic devices but with small dimensional scales. This approach facilitates applications
to devices with shorter response times and higher operational frequencies that operate
at lower working currents, dissipate less power, and exhibit other useful properties and
enhanced characteristics. Such examples include the field-effect transistors and bipolar
transistors considered in Sections 8.3 and 8.5.
On the other hand, new generations of the devices are based on new physical principles,
which can not be realized in microscale devices. Among these novel devices are the
resonant-tunneling devices analyzed in Section 8.2, the hot-electron (ballistic) transistors
of Section 8.5, single-electron-transfer devices (Section 8.4), nanoelectromechanical
devices (Section 8.7), and quantum-dot cellular automata (Section 8.8).
As a whole, the ideas presented in this chapter provide an understanding of the future
development of nanoelectronic and optoelectronic devices that may be realized through
the wide use of nanotechnology.
8.2
Resonant-tunneling diodes
Diodes or, in other words, two-terminal electrical devices, are the simplest active elements of electronic circuits. Some applications of diodes are based on their nonlinear
currentvoltage characteristics. Another important capability required of diodes is their
operational speed. Such high-speed operation implies that the feature sizes of diodes
should be as small as possible.
In previous sections, we studied two types of short n+ in+ diode, both with spacecharge-limited classical electron transport. These diodes used homostructures, i.e., it was
supposed that they are made from the same material with nonuniform doping. Among
the cases studied, the highest speed is achieved for the ballistic diodes. According to
the classification given in Section 6.2 for the extreme scaling down of the diode size,
Vb
Lb
Lw
(a)
(b)
n+-GaAs
n+-GaAs
AlxGa1 x As
GaAs
AlxGa1 x As
GaAs
AlxGa1 x As
GaAs
AlxGa1 x As
GaAs
n+-GaAs
243
Vb
Lb
Lw
n+-GaAs
Figure 8.1 Layer designs for double-barrier resonant-tunneling structures. (a) Alternating layers:
n+ -doped GaAs (substrate and one of the contacts), undoped AlGaAs (barrier), undoped GaAs
(quantum well), undoped AlGaAs (barrier), n+ -doped GaAs (top contact). (b) The same as (a)
except for two additional undoped spacer layers of GaAs between the contacts and the barriers.
Vb is the barrier height, L b the barrier thickness, and L w the quantum well width.
quantum transport must be used to describe the carriers. A very important example of
such nanoscale diodes is the so-called double-barrier resonant-tunneling diode. In this
section, we consider this type of nanoscale quantum diode.
244
Nanostructure devices
(a)
d
1
EF
Ec
EF
Ec
z
(b)
EF
em
Ec
e
(c)
EF
em
Ec e
EF
col
Ec
1
0
EF
col
Ec
(a) equilibrium conditions; (b) in-resonance; and (c) off-resonance. The electron potential
energy and populations of the energy band in contact n+ -regions are sketched in the left and
right parts of the figure. E cem and E ccol denote the positions of the bottoms of the conduction
bands under electric bias.
subband because there exists in-plane free electron motion. The parameters of the diode
are chosen to be such that, in the non-biased state of the diode, the quasi-bound level,
1 , lies above the Fermi energy, E F , in the contacts, as in Fig. 8.2(a). By applying a
voltage bias to the contacts, one can produce a downward shift of the level in the well.
For electrons with arbitrary energies, the probability of tunneling through the doublebarrier structure is very small. The structure is designed to prevent the thermal transfer
of electrons over the barriers. Therefore, the only situation favorable for transmission
of electrons through the structure is when the quasi-bound level lies below the Fermi
energy, EF , but above the conduction-band bottoms of the contacts. In this case, those
electrons from the emitter (the contact on the left) whose kinetic energy of the in-plane
(perpendicular) motion, E = -h2 k z2 /(2m ), coincides with 1 are transmitted through
the structure with finite probability. This is the so-called resonant-tunneling process,
which has the important attribute of exhibiting negative differential resistance.
Before analyzing this effect, we recall that the electrical properties of a simple conductor with a linear currentvoltage characteristic are characterized by a resistance, R,
according to the formula
I = 0 /R,
where 0 is the applied voltage and I is the current. If a conductor has a nonlinear
currentvoltage characteristic, one can introduce the so-called differential resistance,
Rd :
dI 1
Rd =
.
d
245
I
(b)
(d)
(a)
(c)
0
(a), (b), and (c) correspond to the physical situations illustrated in the previous figure. Portion
(d) shows what happens with the current when the second quasi-bound state enters into
resonance with the electrons from the emitter.
The term negative differential resistance is used to denote that Rd < 0; this corresponds to the unusual effect of a decrease of the current when the applied voltage
increases.
The following qualitative model of resonant tunneling explains the appearance of negative differential resistance. Before the level of the well reaches the resonance position,
the current through the diode is very small because it is controlled by non-resonant tunneling and by transport over barriers, both of which have low probabilities of occurring.
When the bias corresponds to the case of resonance energies, as depicted in Fig. 8.2(b),
the transmission coefficient and the electric current through the diode increase sharply.
Further increase in the current with increasing voltage bias continues until the resonance
level passes the bottom of the emitters conduction band. There are no electrons (the
conduction band is above the Fermi level, E F ), as shown in Fig. 8.2(c), to tunnel resonantly and the current decreases in spite of the increase in voltage bias. Consequently,
the currentvoltage characteristic of the structure contains a segment that exhibits negative differential resistance. The general form of the currentvoltage characteristic is
presented in Fig. 8.3. At large bias, further increase of the current can be realized either
by shifting other quasi-bound states so that they are resonant with electron energies in
the emitter, or by substantial evolution of the potential profile and non-resonant transport
through and over the barrier.
Coherent tunneling
In this case, the electron is characterized by a single wavefunction throughout the whole
quantum structure. Traditionally, the cathode and the anode of the diode are referred to
as the emitter and collector, respectively. An electron entering the structure from the
Nanostructure devices
1.0
0.9
T (E) =
0.8
E E n
1+
E
0.7
T(E )
246
2 E = 80 meV
0.6
n=
0.5
0.1 eV
0.4
0.3
0.2
0.1
0.0
0.2
0.1
0.0
0.1
0.2
0.3
0.4
E (eV)
Figure 8.4 The full width, 2 !E, of the half-maximum of the transmission peak.
emitter is described by the wavefunction em (r, z), which has the form of an incident
plane wave,
em
z)
em (r, z) = B em ei(kr+kz
z)
= e0 .
E E
2m
2m
The coefficients Aem and B em can be found by solving the Schrodinger equation with
the potential corresponding to the double-barrier structure under voltage bias. The ratio
of the coefficients B em and Aem defines the transmission coefficient:
T (E ) =
|B em (E )|2
.
|Aem (E )|2
Instead of solving the Schrodinger equation, here we give a simplified formula for T (E ),
which allows us to introduce the basic parameters of the resonant-tunneling effect.
247
1+
1
E n
!E
2 ,
(8.1)
where 2 !E is the full width of the half maximum of the transmission peak. Equation (8.1) assumes that the probability of tunneling through the structure is unity when
the electron energy, E , exactly coincides with a quasi-bound state, n . In Fig. 8.4, T (E )
is plotted for several magnitudes of !E. One can see that the tunneling transmission
coefficient can be a very sharp function of the electron energy.
Besides the transmission coefficient, it is convenient to introduce the probability of
tunneling of the electrons out of the well per second, &; it is determined by !E:
& = !E/ h.
According to the uncertainty relation we can estimate the lifetime, , of the electrons on
the quasi-bound level between the barriers as
= 1/(2&).
The tunneling probability per unit time can be expressed as the product of the attempt
rate, vz /(2L w ), and the probability of tunneling through a single barrier with one
attempt, T :
&=T
vz
,
2L w
where L w is the thickness of the well and vz is the transverse velocity of the electron in
the well, which can be estimated from the relationship
m vz2 /2 = n .
For the purpose of making illustrative numerical estimates, we choose the following set
of parameters: the energy of the first level, 1 , is 50 meV, the height and thickness of
and the well width is L w = 100 A.
For
the barriers are Vb = 300 meV and L b = 40 A,
7
1
11 1
this structure, we obtain vz = 5 10 cm s and & = 10 s . Hence, the lifetime of
an electron in the well equals = 5 1012 s. Both and & depend critically on the
height and thickness of the barriers.
The lifetime of the quasi-bound level depends on heterostructure parameters and
material combinations. In Fig. 8.5, the lifetimes of the quasi-bound states are depicted
for five particular double-barrier structures. The structures are assumed to be made of
GaAs/AlGaAs, InGaAs/AlAs, InGaAs/GaAs, and InAs/AlSb. The widths of the quantum
while the barriers have thicknesses ranging from 10 A
to 60 A.
Nanostructure devices
248
10
Vb = 1.2 eV
Vb = 0.3 eV
1
a
Vb = 1 eV
well
barriers
GaAs
AlAs
a
AlAs
b In0.53Ga0.47As
GaAs
Al0.42Ga0.58As
c
GaAs
d In0.53Ga0.47As
e
InAs
AlSb
0.1
0.01
3
4
5
Barrier thickness (nm)
Figure 8.5 The lifetime of the first quasi-bound state in n-type double-barrier structures made
from the five material systems presented in the inset. In each structure, the quantum well width
The energy heights of the barriers are indicated next to the curves. After
is fixed at 46 A.
T. C. L. G. Sollner et al., High-frequency applications of resonant-tunneling devices, in
Electronic Properties of Multilayers and Low-Dimensional Structures, J. M. Chamberlain et al.
(Eds). (New York, Plenum, 1990), pp. 283296.
For an asymmetric double-barrier structure, we define two different transmission coefficients for the left-hand, Tl , and right-hand, Tr , barriers, respectively, so that the total
transmission coefficient, T , can be approximated by
T (E ) =
4Tl Tr
(Tl + Tr )
1+
1
E n
!E
2 ,
(8.2)
where !E -h& and & = (Tl + Tr )vz /(2L w ). For asymmetric barriers, the maximum
transmission at E = n is less than unity. It is important to recognize that these results
are obtained under the assumption that the electrons transit through the system without
phase-changing scattering, so that E remains constant.
Using the parameters introduced for the double-barrier resonant-tunneling structures
and assuming low temperatures, it is possible to obtain the estimates for the current
voltage characteristics sketched in Fig. 8.3.
The formulas (8.1) and (8.2) describe the transmission of an electron with fixed energy
E . In order to calculate the electric current, we should take into account all of the
electrons which tunnel from the emitter and collector. We suppose that in the contact
regions heavily doped regions the thermal equilibrium for electrons is established in
a very short time. Therefore, to a reasonable degree of approximation we can assume
that the Fermi distribution gives the distribution of electrons entering the double-barrier
part of the structure. If a voltage bias, 0 , is applied, the difference between the Fermi
249
250
Nanostructure devices
Table 8.1 Resonance width and collision broadening of level 1 of Al0.3 Ga0.7 As/GaAs
double-barrier structures
&r /&sc
L w (A)
L b (A)
&r (meV)
at T = 300 K
at T = 200 K
at T = 70 K
50
50
50
20
70
50
30
50
1.3 102
1.5 101
1.76
6.03
6 103
7.5 102
0.88
3.02
1.9 102
8.3 101
1.3
4.56
2.6 101
3.1
3.6
124
When the bound-state energy, n , falls below the bottom of the conduction band in the
emitter, the current drops rapidly to the value determined by off-resonance tunneling
processes. This value can be estimated, if we assume that off-resonance transmission
coefficients are constant:
em 2
E Tl Tr .
(8.9)
Iv =
2 2 -h3 F
Since for the transmission coefficients the inequality Tl , Tr 1 is valid, the resonant
current of Eq. (8.8) is much greater than the off-resonance current of Eq. (8.9). The results
obtained for the coherent mechanism of tunneling through a double-barrier structure
support the qualitative discussion given at the very beginning of this section.
Sequential tunneling
Another process responsible for the resonant-tunneling effect is the so-called sequential
tunneling process. In the sequential tunneling scheme, electron transmission through the
structure is regarded somewhat artificially as two successive transitions: first from
the emitter to the quantum well and then from the well to the collector. It is important
to highlight the main difference between the previously studied coherent mechanism
and the sequential mechanism of resonant tunneling. The first mechanism excludes any
electron collisions during the transition from the emitter to the collector. The second scenario applies even when there is electron scattering inside the quantum well. Although
the coherent and sequential processes result in the same behavior of the double-barrier
resonant structures, it is possible and instructive to separate and compare these processes. In the previous discussion, we defined a broadening, &, for the quasi-bound
state due to the tunneling process. If both the processes of tunneling and scattering
inside the well occur, the width of the quasi-bound state increases. Let us introduce
the width of the quasi-bound state, &r , as the full width of the half maximum of the
transmission peak. Let the collision broadening of this state be &sc . From the previous discussion, we conclude that coherent tunneling dominates if &r > &sc , whereas
the sequential processes dominate if &r < &sc . Table 8.1 illustrates different regimes
of tunneling for several specific Al0.3 Ga0.7 As/GaAs double-barrier structures at zero
bias. In this table, L w and L b are the thicknesses of the wells and barriers, respectively.
The ratio &r / &sc is calculated by using the estimate &sc = -h/ , where is the scattering time. From the results of Chapter 6, this time may be either calculated from first
251
300 K
80 K
Figure 8.6 Measured currentvoltage characteristics of a symmetric Al0.48 In0.52 As/Ga0.47 In0.53 As
double-barrier resonant-tunneling structure at 300 K (a) and 80 K (b). After Fig. 1, F. Capasso,
S. Sen et al., Quantum-effect devices, in S. M. Sze (Ed.), High-Speed Semiconductor Devices
(New York, Wiley, 1990). Reprinted with permission of John Wiley & Sons, Inc.
principles or deduced from experimental mobility measurements. For Table 8.1, the following values are assumed: when T = 300 K, = 7000 cm2 V1 s1 , and 3 1013
s; when T = 200 K, = 2 104 cm2 V1 s1 , and = 1012 s; and when T = 70 K,
105 cm2 V1 s1 , and 5 1012 s. If &r / &sc > 1, elastic collisions can be
neglected and resonant tunneling is highly coherent. From Table 8.1 one can see that
this type of tunneling is typical for the case of low temperatures and thin barriers. In
the opposite limit, &r / &sc 1, the tunneling processes are more likely to be sequential. It is important to emphasize that the scattering affects also the magnitude of the
transmission coefficient. The maximum of the transmission probability decreases by the
factor &r /(&r + &sc ) when scattering takes place. This explains why resonant tunneling
is washed out at high temperatures and in structures with defects and impurities.
Nanostructure devices
4.0
3.5
T = 300 K
3.0
J (10 5 A cm2)
252
InAs/AlSb
Jp/Jv 3.3
2.5
2.0
1.5
GaAs/AlAs
Jp/Jv 1.4
1.0
InGaAs/AlAs
Jp/Jv 12
0.5
0.0
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
2.0
Voltage (V)
Figure 8.7 The room-temperature current density versus voltage characteristics and peak-to-
valley ratios, Jp /Jv , for high-speed double-barrier RTDs made from three different material
systems: GaAs/AlAs, InAs/AlSb, and InGaAs/AlAs. After E. R. Brown, Resonant tunneling in
high-speed double barrier diodes, in Hot Carriers in Semiconductor Nanostructures: Physics
and Applications, J. Shah (Ed.) (Boston, MA, AT&T and Academic, 1992), pp. 469498.
253
+ ,
(8.10)
+
tr = m
&
vg
&
2m[V (z) E ]
0
where d is the total thickness of the two barriers and the well, and vg can be interpreted
as the electron group velocity. If the perpendicular energy of the incident electron, E ,
is much greater than the resonance width, &, it can be shown that the total transit time
through the structure is given approximately by tr . The first term represents the semiclassical transit time across the structure and the second term is the so-called phase time.
!
,
!I
where !I is the change in the current through the resistor when the voltage drop is
changed by !. Thus, in fact, Rd is the differential resistance that can have both positive
Nanostructure devices
(b)
104
(a)
254
Rd
C
InGaAs/AlAs
103
InAs/AlSb
102
GaAs/AlAs
101
-+
100 1
10
102
103
Frequency (GHz)
Figure 8.8 (a) The simplest electric circuit. (b) The generated microwave power per unit
and
= 0 eit ,
(8.11)
2L
LC
2L
2
For L/C >
-(Rd /2) and for a positive resistance Rd , there are oscillations with the
frequency 1/(LC) [Rd /(2L)]2 and damping characterized by
= Im{c } = Rd /(2L).
Here, is positive and Im{} denotes the imaginary part of c .
If the real part of the impedance is negative,
Re{Z ()} = Rd < 0,
(8.13)
255
it follows that is negative and the steady state of the circuit is unstable with respect to
the generation of voltage oscillations. Thus, if we want to realize voltage oscillations at
a desired frequency, , the differential resistance at this frequency should be negative.
The fact that double-barrier resonant-tunneling structures have extremely small electron
transit times means that these devices exhibit negative differential resistance up to ultrahigh frequencies. In Fig. 8.8(b), the generated microwave power per unit cross-section
of double-barrier resonant-tunneling diodes is presented as a function of the frequency
for the same three devices as were considered in Fig. 8.7. The generated power decreases
with frequency as a result of the previously discussed finite transit time of tunneling
electrons. Absolute values of the power correlate with the current densities achieved for
these devices. It is seen that oscillations up to 1 THz = 1000 GHz (i.e., 1012 oscillations
per second) have been reached for nanoscale quantum devices of this type.
Finally, the resonant-tunneling structures are the simplest quantum devices. They
exhibit strongly nonlinear currentvoltage characteristics with negative differential resistance. Since these devices have nanoscale sizes, they exhibit extremely short transit
times for carrier transport through the structures. These properties allow one to exploit
resonant-tunneling structures for the generation of ultra-high-frequency electromagnetic
oscillations. Indeed, an oscillation frequency of about 1 THz has already been reached
in resonant-tunneling diodes.
8.3
Field-effect transistors
The previously analyzed two-terminal devices diodes are the simplest electronic
devices, for which the current is controlled by the diode bias and vice versa. A useful function can be performed mainly due to nonlinearity of currentvoltage dependences. In contrast, in three-terminal devices known as transistors there exists the possibility of controlling the current through two electrodes by varying the voltage or the
current through a third electrode. Depending on the principle of operation, transistors
can be associated with one or other of two large classes: (i) field-effect transistors and
(ii) potential-effect transistors.
256
Nanostructure devices
(a)
Lx
S
(b)
D
ld
n
Semi-insulating
substrate
x
y
n
Semi-insulating
substrate
Figure 8.9 The MESFET model. The conducting channel of width l is supposed to be grown on a
semi-insulating substrate. The length of the channel is L x . The depleted region, ld (x), under the
gate, G, is marked by a shadow region. In case (a) there is no voltage bias applied between the
source, S, and the drain, D; in case (b) a positive voltage is applied to the drain. There are no
electrons in the depletion region.
There exist several different types of field-effect transistor (FET). Before discussing
the most important types of FET, including those based on nanostructures, we will explain
the operation principles by using the simplest FET, the so-called metalsemiconductor
FET (MESFET).
Figure 8.9 presents a simple model for a field-effect transistor. It is assumed that the
active region of the device is made of an n channel, which can be fabricated, for example,
by homogeneous doping. The source and drain are heavily doped n+ regions, which are
assumed to serve as Ohmic contacts. The gate upon the active layer forms a so-called
Schottky barrier, which produces a depletion region, as discussed in Section 7.2 (see
the subsection on the control of charge transfer). The formation of the depletion region
under the gate is illustrated by Fig. 8.9. Such a design is typical for commonly used
semiconducting materials such as GaAs and InP. From the bottom, the active region is
restricted by a non-conducting (semi-insulating) substrate.
If no voltage is applied to the contacts, the depletion region is almost uniform along
the active region, as shown in Fig. 8.9(a). The characteristics of the depletion region are
determined by the built-in (Schottky) voltage. If a negative voltage is applied to the gate
and there is still no voltage between the source and drain, the depletion region extends
farther into the active region and decreases the width of the channel. At some voltage
the channel is completely pinched off. Let the gate voltage be fixed and a small positive
voltage be applied to the drain. A current will flow through the channel in the Ohmic
linear regime. If the drain voltage is increased further, it will affect the distribution of
the potential in the device: the width of the depletion region increases near the drain end
of the channel, as shown in Fig. 8.9(b), as a result of the increasing potential difference
between the gate and the drain. At a certain drain voltage, the channel begins to pinch off
at the drain end. At this voltage the current saturates and remains nearly constant with
further increase in the drain voltage. This behavior is illustrated in Fig. 8.10, where typical
currentvoltage characteristics of a MESFET are presented for various gate voltages. It
is clear that, if the gate voltage is negative, the channel becomes narrow and the current
is suppressed, whereas, for positive gate voltage, the depletion region under the gate
ID
Drain current
257
Linear
region
Breakdown
region
G > 0
G = 0
G < 0
G = th
Drain voltage
Figure 8.10 The drain-current versus drain-voltage characteristics for the MESFET at different
gate voltages. The linear and saturation regions are indicated. The breakdown effect occurs at
high voltages B and is presented schematically.
Homogenous FETs
MOSFETs
Heterostructure FETs
Novel Devices
VMT
QUIT
Figure 8.11 The family tree of FETs consists of three groups of conventional FETs that have
much in common in their technologies, properties, and applications and new groups of novel
devices. The abbreviations used in this chart are defined as follows: MOSFET metaloxide
semiconductor FET; VMT, velocity-modulation transistor; and QUIT, quantum-interference
transistor.
becomes narrower, the channel opens wider, and the current has to be larger. Thus, in
comparison with a diode, the more complex configuration of the electrostatic potential
in a FET gives rise to strongly nonlinear sourcedrain currentvoltage characteristics.
Qualitatively, two operational regimes are possible, the linear region and the saturation
region, as indicated in Fig. 8.10. For both regimes, there is effective current control by
the gate voltage. At very large sourcedrain voltages, electric breakdown can occur.
In summary, the phenomenon of controlling the resistance of the conduction channel
by an external voltage (by a field) is the basic principle of any FET.
258
Nanostructure devices
is divided into three large groups of devices used in practice, as well as a group of
potentially new devices based on recent developments. The first group includes the variations of metalsemiconductor FETs discussed previously. The second group consists
of metaloxidesemiconductor FETs (MOSFETs). In contrast to the previously considered MESFETs, in MOSFETs, the gate and conducting channel are separated by a
dielectric (an oxide) instead of a Schottky barrier. The MOSFETs are typically fabricated
by using a metaloxidesilicon structure, with the oxide layer being silicon dioxide. In
Fig. 8.12 we present a sketch of a Si MOSFET. It should be said that Si MOSFETs are
microelectronic devices of the utmost importance. A third group of FETs in Fig. 8.11
incorporates heterostructures and differs in the design of the doping and conducting
channel. For example, the donor-layer heterostructure FET (HFET) subgroup has one or
several doped layers, which supply electrons to the conducting channel. If the channel
is formed on a single heterointerface (see Section 7.2) using a single donor layer the
device is called a modulation-doped FET (MODFET). Such MODFETs have the largest
electron and hole mobilities. That is why they are also called high-electron-mobility
transistors (HEMTs). Examples of material systems used for HFETs are as follows:
n+ -AlGaAs/GaAs (unstrained) heterostructures, strained layer n+ -InGaAs channels on
GaAs, and n+ -InAlAs/InGaAs heterojunctions grown on InP. Although the latter two
structures are more complex, they offer better transport parameters in the conducting
channel.
Figure 8.13(a) shows schematically one of the variants of a HFET a recessed-gate
+
n -AlGaAs/GaAs HFET. The device is fabricated on a semi-insulating GaAs substrate
upon which an undoped buffer layer of GaAs is grown. A thin undoped AlGaAs spacer
layer and an n-doped AlGaAs layer are grown on the buffer layer. Source, drain, and gate
electrodes are fabricated on the top of the structure. Under the electrodes, two heavily
n+ -doped regions serve as the contacts to the two-dimensional electron gas formed under
the heterojunction. Such a two-dimensional electron-gas channel was studied in Section
7.2 in detail. The gate length varies from 1.0 m to 0.1 m, or less, depending on the
speed needed for applications.
Source
GaAs n+
Drain
A
Gate
AlGaAs n+
(a)
AlGaAs undoped
2DEG
GaAs undoped
259
GaAs wafer
V (z)
(b)
EF
80
(c)
z
Figure 8.13 A recessed-gate n+ -AlGaAs FET. (a) The cross-section of the HFET. (b) The energy
band diagram along the line AB; the Fermi level, E F , and electron confined states are shown.
In the donor layer, there are two depletion regions for a thick barrier: one is due to the Schottky
built-in voltage; the second is due to the conduction-band offset at the junction. (c) The
sequence of layers along the line AB.
The dimensions in the vertical directions can be different. This facilitates the fabrication of two types of HFET: normally turned-on and normally turned-off devices.
(Both types of heterostructures were studied in Section 7.2.) Figure 8.13(b) illustrates
the energy-band coordinate dependence for the cross-section of the structure under the
gate. The major element affecting the physical picture is the Schottky depletion layer
under the gate. The depletion layer is different for different thicknesses of the barrier
layer. For a barrier layer of small thickness, the Schottky barrier depletes even the quantum well, leading to the normally turned-off state in the absence of the gate voltage.
For larger barrier thicknesses, the Schottky barrier does not prevent the formation of
the conducting channel, and we obtain the normally turned-on state of the HFET as
illustrated in Figure 8.13(b).
Let us briefly review the heterostructures employed for FETs based on Si technology.
Although many combinations of materials are being investigated with the goal of fabricating high-quality Si-based heterostructures, the Si/SiGe system is the most studied
and developed and has already found various device applications. Properties of SiGe
alloys and Si/SiGe heterostructures were studied in Chapter 4. These heterostructures
can be used for the creation of two-dimensional electron and hole gases and for the
improvement of device parameters.
In Si/SiGe systems, a low-dimensional n-type channel can be obtained if a strained
Si layer is grown upon a strained SiGe layer. Figure 8.14(a) depicts the cross-section
of an n-channel modulation-doped Si/SiGe FET. The device is similar to the previously
discussed AlGaAs/GaAs FET with selective doping and improved channel confinement.
260
Nanostructure devices
Source
AuSb
Si
Si1 xGex
Gate
Drain
Au
Ti
Pt
10 nm undoped
10 nm x = 0.5 0
Si0.5Ge0.5 10 nm
(a)
Si
20 nm
Sb doping layer
undoped
0.2 m
Si0.75Ge0.25
AuSb
tap layer
graded layer
buffer layer
Si substrate
106 cm
Sb doping spike
2DEG
Ec
EF
(b)
0.9 eV
1.1 eV
10 nm 10 nm 10 nm
Si
Si1xGex Si0.5Ge0.5
buffer layer
Ev
20 nm
200 nm
Si
Si0.75Ge0.25
buffer layer
Figure 8.14 The Si/SiGe-based n-channel MODFET. (a) The cross-section of the device; the
electron channel is formed in the strained Si layer. (b) Energy-band diagram of the device. After
H. Daembkes, W. Goodhue et al., The n-channel SiGe/Si modulation-doped field-effect
C IEEE.
transistor, IEEE Trans. Electron Devices, ED-33, 633638 (1986).
The electron channel is formed in the undoped strained Si layer situated between two SiGe
layers. The lower Si0.75 Ge0.25 buffer layer is grown on Si and is strained and undoped.
The upper Si0.5 Ge0.5 layer is thin and -doped by donors. Next, the Si1x Gex layer is
graded, with x varying from 0.5 to 0. The top thin Si layer is undoped. Such a structure
is chosen to create the electron channel and avoid degradation of the characteristics of
the strained layer. Figure 8.14(b) shows the energy-band diagram for this device. The
electron channel on the Si0.5 Ge0.5 /Si heterointerface is marked. The Si0.5 Ge0.5 donor
layer is -doped, which leads to the formation of a spike in the potential profile.
There are many other schemes for Si-based HFETs that combine the advantages of
heterostructure bandgap engineering and selective-doping methods. Motivated by the
261
300
AlInAs/GaInAs
MODFET
10 12
100
t (s)
AlGaAs/GaAs
MODFET
10 11
10
GaAs MESFET
Si NMOS
1
0.1
10 10
5
L, LG (m)
Figure 8.15 The cut-off frequency in GHz versus the device length L or the gate length L G in m
for three groups of FETs: MOSFETs, MESFETs, and MODFETs. The latter are represented by
AlGaAs/GaAs and AlInAs/GaInAs material combinations. The parameters are given at 300 K.
After Fig. 25, S. J. Pearton and N. J. Shah, Heterostructure field-effect transistors, in S. M. Sze
(Ed.) High-Speed Semiconductor Devices (New York, Wiley, 1990). Reprinted with permission
of John Wiley & Sons, Inc.
current wide spread use of Si-based devices, worldwide developments in these areas are
now in progress.
One of the key parameters of contemporary devices is the maximum frequency of
efficient operation the cut-off frequency. At fixed device dimensions, the cut-off frequency depends on material characteristics and device design. As a basis for making
comparisons, Fig. 8.15 illustrates representative parameters of all three of these groups
of FETs. One can see that the HFET class exhibits superb high-frequency performance.
Currently, the highest speed is reached for AlInAs/GaInAs HFETs with short gates.
This type of HFET with a nanoscale gate of the length L G = 45 nm has a high cut-off
frequency, above 450 GHz.
Nanowire FETs
The previouslyanalyzed HFETs use heterostructures with two-dimensional electrons.
The next step in reducing the number of dimensions of the device is to exploit nanowires.
Semiconductor nanowires and carbon nanotubes are attractive components for nanoscale
FETs.
For example, nanowire FETs can be configured by depositing the nanomaterial onto
an insulating substrate surface, and making source and drain contacts on the ends of the
262
Nanostructure devices
(a)
(b)
Figure 8.16 (a) A schematic diagram of a Si FET with the nanowire, the metal source, and drain
electrodes on the surface of a SiO2 /Si substrate. (b) A scanning electron micrograph of a Si
nanowire FET; the scale bar is 500 nm. The 5-nm diameter Si wire image was obtained by using
a high-resolution transmission electron microscope. Reprinted with permission, from Y. Cui,
Z. Zhong et al., High performance silicon nanowire field effect transistors, Nano Lett., 3,
C American Chemical Society.
149152 (2003).
nanowire. Figure 8.16(a) illustrates this approach. There, we show a schematic diagram
of a Si-nanowire FET with the nanowire, the metal source, and drain electrodes on the
surface of the SiO2 /Si substrate. The image of the 5-nm-diameter Si wire was obtained
by using a high-resolution transmission electron microscope. In Fig. 8.16(b), a scanning
electron micrograph of a Si-nanowire FET is shown; the scale bar represents 500 nm. This
approach may serve as the basis for hybrid electronic systems consisting of nanoscale
building blocks integrated with more complex planar silicon circuitry.
An extremely small FET may be built on the basis of carbon nanotubes. Indeed, despite
their nanosize diameters, the carbon nanotubes are sufficiently robust and may be long
enough to allow electrical connections to lithographically defined metallic electrodes,
therefore making it possible to probe directly the electrical properties of these nanometersize structures. Electrical connections to nanotubes can be achieved either by depositing
a metal electrode over the top of the tubes (end-contacted samples), or by placing the
tubes on the top of prepared metal leads (bulk-contacted samples).
An advanced nanotube-based FET is shown in Fig. 8.17. The depicted nanotube FET is
fabricated on SiO2 /Si substrates. The silicon is doped heavily to serve as a back gate. The
dielectric SiO2 of thickness 10 nm covers most of the area of the substrate and electrically
isolates the Si substrate and the wire. Two metal Pd electrodes were deposited to fabricate
the source and drain. This technology provides the nanotube FET with a back gate shown
in Fig. 8.17(a). Additionally, by using atomic layer deposition, an 8-nm-thick HfO2 film
can be deposited to cover the device from above. Then, 20 nm of Al is deposited onto
the dielectric (HfO2 ) film to create the top gate, as shown in Fig. 8.17(b). Thus, we have
presented nanowire FETs with single and double gates. For both devices, the total tube
length between metal electrodes was 2 m, and the top-gated section length was 0.5 m.
(a)
carbon nanotube
263
(c)
(b)
Figure 8.17 Schematic diagrams of a carbon-nanotube FET: (a) two metal (Pd) electrodes are
deposited onto a carbon nanotube placed on a 10-nm-thick layer of SiO2 that insulates it
from the back gate; (b) 8-nm-thick HfO2 serves as an insulator for the top gate; and (c) a
scanning electron micrograph of the carbon nanotube FET. Reprinted with permission, from
A. Jarvey, J. Guo et al., Carbon nanotube field-effect transistors with integrated ohmic
C American Chemical
contacts and high-k gate dielectrics, Nano Lett., 4, 447450 (2004).
Society.
Velocity-modulation transistors
For all of the previously analyzed FETs, the gate voltage controls the number of conducting electrons, and the speed of operation is determined by the transport time of
electrons along the conducting channel. An alternative principle of device operation
consists of modulating electron mobility, or, possibly, electron drift velocity, without
significant changes in the number of current carriers. To achieve such a modulation
of the drift velocity one can use spatially nonuniform (selective) carrier scattering
across the active channel. Then, a redistribution of the carriers across the channel in
response to a gate voltage would lead to control of the velocity. The time of such a
redistribution across a narrow channel can be considerably smaller than that of the
longitudinal transport. A three-terminal device with modulation of the drift velocity by an external voltage has been referred to as the velocity-modulation transistor
(VMT).
can
The characteristic time of charge transfer in a channel of thickness about 100 A
13
be estimated to be as short as 10
s. Thus, for ultra-high-frequency operation, the
velocity-modulation effect should be the dominant factor controlling electric signals.
264
Nanostructure devices
Gate
(a)
Source
x
Drain
Channel 1
Barrier
Channel 2
Semi-insulating substrate
E
Next subband
(unpopulated)
(b)
Ch 2 Ch1
Lowest subband
(populated)
(c)
kx,y
EF
kx,y
kx,y
t1
P1
t'1
t2
P2
t'2
x=0
x=L
Figure 8.18 The schematics of QUIT with two parallel channels: (a) the structure of the device;
(b) electron subbands in different regions of the device and the Fermi energy (under
equilibrium); (c) an illustration of the electron paths in the device; t1 , t2 and t1 , t2 indicate the
amplitudes of split and interfering waves, respectively. P1 and P2 denote factors of different
phase shifts in the channels.
Quantum-interference transistors
The remaining class of novel nanoscale FETs presented in Fig. 8.11 is the so-called
quantum-interference transistor (QUIT). It is based on quantum ballistic electron transport. According to the classification given in Section 6.2, for such a transport regime,
one has to make the coherence length, l , greater than the characteristic device scale, L:
l > L;
(8.14)
that is, this device is a mesoscopic device. The principle of operation of a QUIT is the
control of the interference pattern of conducting electrons by an external voltage.
Remember that interference is one of the most important effects in the physics of
waves. We studied the interference effect in Sections 2.4. Examples of interference
patterns are given in Figs. 2.3 and 2.7. In a QUIT, these patterns can be arranged as a
result of interference of the waves traveling through two or more channels (arms).
A schematic diagram of a two-channel quantum-interference device is shown in
Fig. 8.18(a). Basically, it is an ordinary FET with a short channel and a barrier parallel to the current. The barrier is embedded in the middle region of the device. The
barrier splits the main channel into two channels: 1 and 2. There are also two contacts
to the main channel the source and the drain. The gate is placed on the top of the
265
device. Owing to the gate, the symmetry of channels 1 and 2 is broken if a gate voltage
is applied. The distance between contacts, L, satisfies the inequality of Eq. (8.14). The
widths of the main channel and the split channels are small, so that there is a quantization
of transverse electron states in the z direction. Two-dimensional subbands are illustrated
in Fig. 8.18(b); the subbands, E(k x , k y ), and the Fermi energy, E F , are plotted for three
major device regions. For simplicity, one can assume that only the lowest subband is
populated in each of the device regions. If a voltage is applied to the gate, in the middle region, the bottoms of the subbands, 1 and 2 , for channels 1 and 2 are generally
different. The electron energy spectra are of the form
E = 1 +
-h2
-h2
2
2
2
+
k
+
+ k 2y ,
k
=
k x,2
2
x,1
y
2m
2m
(8.15)
where E is the energy of an incident electron, and kx and k y are the components of its
wavevector in the plane of the device. Obviously, the energy E does not change along
the channel. The k y -component is also conserved because it is assumed that there are no
forces along the y direction. Thus, from Eq. (8.15), it follows that only the k x -components
in the channels can be different.
Let an electron be injected from the source into the left region of the main channel.
Its wavefunction is
L = L (z)ei(kx x+k y y) ,
where L (z) is the wavefunction of the lowest subband in the left region. We can introduce the amplitudes of the waves transmitted into channels 1 and 2 as t1 and t2 . These
amplitudes determine the electron wavefunctions inside the middle device region:
M =
upper channel,
lower channel,
where M,1 (z) and M,2 (z) are the wavefunctions of the lowest subbands in channels
1 and 2, respectively, and xL is the coordinate of the left end of the channels. Since,
in general, the potential profiles in the two channels are different, the wavevectors are
different: kx,1 = k x,2 . Then, let t1 and t2 characterize the waves transmitted from the
channels into the right device region; see Fig. 8.18(c). The wavefunction inside the right
region can be written as
R = t1 P1 t1 + t2 P2 t2 R (z)ei(kx (xL xR )+k y y) ,
(8.16)
where P1 exp(ik x,1 L), P2 exp(ik x,2 L), and R (z) is the wavefunction of the transverse motion at the right end of the channels. For the sake of simplicity, we neglect
multiple reflections from the ends of the barriers.
The wavefunctions L and R are normalized to unity; thus, the total transmission
coefficient for the device is
T (E) = |t1 P1 t1 + t2 P2 t2 |2 .
(8.17)
266
Nanostructure devices
We can assume that both channels, 1 and 2, are symmetric with respect to z = 0 in the
absence of a gate voltage. Thus, for the lowest occupied subbands we find
t2 = t 1
and
t2 = t1 .
(8.18)
= (k x,2 k x,1 )L
(8.19)
where
is the relative phase shift of the two channels. If we introduce the average electron
velocity,
vx =
(8.20)
L 1 2
-h .
vx
The origin of the phase shift is obvious: if 1 = 2 , a difference between the kinetic
energies in the two channels gives rise to different phases of the waves coming into the
right device region; of course, these different phases lead to quantum interference.
For channels that are symmetric with respect to z = 0, in the absence of a gate voltage,
we find 1 = 2 = 0 , and the phase shift equals zero. If we apply a gate voltage, the
potential energy as a function of the transverse coordinate is modified:
V (z) = V0 (z) e(z).
Here, (z) is the potential induced by the applied voltage. This leads to subband energies
1 = 0 eM,1 ||M,1 ,
2 = 0 eM,2 ||M,2 .
Using Eqs. (8.20) and (8.19), we find the phase shift,
=
L e12
,
vx -h
(8.21)
where 12 M,2 |(z)|M,2 M,1 |(z)|M,1 . The value 12 represents the difference between the average potentials in the channels. This difference determines the
transmission through the device.
To calculate the electric current, one can use the results of Section 6.5, where quantum
ballistic transport was analyzed. Equation (6.120) defines the device conductance, G. At
267
low temperatures, these results lead to the Landauer formula of Eq. (6.121), which, in
the case under consideration, has the form
G=
2e2
4e2
T (E) =
|t1 t2 |2 (1 + cos ).
h
h
(8.22)
The second term in the brackets is due to electron interference. One can see that the
interference controls the device conductance. If the phase shift of split waves = 0
(12 = 0), the conductance reaches the maximum, G max = 8e2 |t1 t2 |2 / h. If = , i.e.,
-hvx
e12 =
,
(8.23)
L
the interference is destructive and the conductance vanishes.
Let us introduce the characteristic transit time through the channels, ttr = L/vx . If
and vx = 2 107 cm s1 , we get ttr = 1 ps, and for the destrucwe set L = 2000 A
tive potential difference we get 12 2 mV. The shorter channels require a larger gate
voltage. Obviously, the transit time, ttr , determines the cut-off frequency of the device,
cf = 2/ttr . Since the device channels are not doped and can be made quite short, these
devices portend operation up to the terahertz frequency region.
Our simple model of the quantum-interference transistor allows us to compare this
device with a conventional FET. If the latter is in a normally turned-on state with the
Fermi energy of electrons in the conducting channel equal to E F , we can estimate the
threshold voltage needed to deplete the FET channel as
-h2 k 2
F
.
(8.24)
eth,FET E F =
2m
For quantum devices described by Eq. (8.23), we find
-h2 k 2 2
F
-hvx
-h2 kF
F
eth,QUIT e12 =
= =
= eth,FET ,
L
m L
2m kF L
L
(8.25)
268
Nanostructure devices
L2
L3
L*
L1
Figure 8.19 The schematics of a T-shaped QUIT. The source (grounded), drain, and gate
(a)
q = 0.69 q = 0.42
(b)
(c)
Figure 8.20 Experimental results on the QUIT effect: (a) and (b) for the two-channel QUIT T =
0.03 K, (c) for the T-shaped structure, both fabricated from a high-mobility AlGaAs/GaAs
heterostructure. (a) Modulation of the device conductance by the gate voltage. (b) The phase
shift as a function of the gate voltage. (c) The same as in (a) for the T-shaped device at five
different temperatures. Reprinted with permission from K. Kobayashi, H. Aikawa et al. Fano
resonance in a quantum wire with a side-coupled quantum dot, Phys. Rev. B, 70, 035319
C 2004 by the American Physical Society.
(2004).
As a practical realization of the QUIT effect, we present the results obtained for an
AlGaAs/GaAs structure. The channels were made from high-mobility GaAs. In one of the
channels a GaAs quantum dot was embedded under the gate. This device corresponds
to the scheme in Fig. 8.18(a). If one of the channels were blocked, one would obtain
the T-shaped structure, as in Fig. 8.19. Both cases also are illustrated schematically in
the insets of Fig. 8.20. The low-temperature mobility of the two-dimensional electron
269
gas was as high as 9 105 cm2 V1 s1 , and the measured electron concentration was
2 1011 cm2 . These parameters correspond to a mean free path of about 8 m. Low
temperatures (0.030.7 K) ensure that the dephasing length l is even greater. In the
case under consideration, the lengths of the channels are about 1 m, and the quantumdot (two-dimensional) size is 0.2 m 0.2 m. In Fig. 8.20(a), the QUIT conductance
is presented as a function of the gate voltage (in units of the quantum conductance,
e2/h). One can see that the conductance is periodically modulated by the gate voltage, in
accordance with the previously developed model of the QUIT. Special experiments with
applied magnetic field, B, make it possible to deduce the phase shift of the electrons in
the QUIT. The results are presented in Fig. 8.20(b). The modulation of the conductance
for a T-shaped device is shown in Fig. 8.20(c). Measurements at various temperatures
illustrate the smearing out of the interference patterns due to the temperature dispersion
of the energy of the electrons entering the device.
Summarizing, a pronounced QUIT effect is observed for devices with submicrometer
scale at low temperatures. The scaling down of device dimensions will allow significantly
increased working temperatures.
In conclusion, the quantum-interference effect provides a new principle of operation
for three-terminal devices. These devices are now in an early stage of development.
However, these approaches portend effective control by a small voltage as well as the
development of very-high-speed transistors.
8.4
Single-electron-transfer devices
Before analyzing examples of the devices based on single-electron transport studied in
Section 6.5, we shall overview briefly a relatively simple technique for the formation of
structures with low-dimensional electron gas.
270
Nanostructure devices
(a)
Narrow
metal strip
dsp
Thin undoped
layer of AlGaAs
n+-AlGaAs
ld
2DEG
2DEG
GaAs
(b)
eb
e1
EF
eG
EF
ld
dsp 0
Figure 8.21 (a) Two-dimensional electron gas (2DEG, denoted by dashed lines) at the interface of
of different forms, one can create various configurations of regions occupied by the
electrons. It is possible to form wires, dots, rings, cavities, etc. for the two-dimensional
electrons. For example, if two closely placed parallel metal strips are fabricated on the
top of the heterostructure, then, by applying negative voltage to these two gates, we can
form two side barriers for the electrons and confine them into a channel. If the channel is
narrow enough, the two-dimensional electrons can be quantized in the second direction
and we obtain a quantum wire; such a wire is shown in Fig. 8.22. The confinement of
the electrons to the dots, wires, rings, etc. can be accomplished by a heterojunction on
one side and electrostatics on all other sides. The split-gate technique has successfully
been exploited for measurements of transport regimes in various quantum structures, for
example, in quantum point contacts and electron waveguides.
A considerably more sophisticated design of patterning of a two-dimensional gas to
a shape desirable for single-electron applications is presented in Fig. 8.23. The main
features of the design shown in Fig. 8.23(a) are the following: (i) the Ohmic contacts
(OC) to the two-dimensional electron gas (contacts to two electron reservoirs (R)) and (ii)
a system of gates, which create electrostatic tunnel barriers (TB) and confine electrons
into a quantum dot (QD). The tunnel barriers are formed when the voltages applied to
the gates are negative with respect to the voltages applied to the contacts. The barriers
should be high enough to decouple the quantum dot and the reservoirs. In Fig. 8.23(b), the
Z
Metal strips
271
dsp
n+-AlGaAs
ld
GaAs
1DEG
Thin layer of GaAs to
cover AlGaAs
Figure 8.22 A quantum wire at the interface of a GaAs/AlGaAs heterostructure with two
depletion regions under the negatively biased metal strips.
(a)
(b)
R
A
OC
R
TB
QD
TB
OC
Gates
GaAs Cap Layer
AlGaAs, n+
AlGaAs undoped
2DEG in GaAs
GaAs undoped
GaAs wafer
V(z)
R
Gate
TB
Gate
TB
QD
EF
Figure 8.23 (a) Design patterning of two-dimensional gas to a shape desirable for single-electron
applications: OC, ohmic contacts; R, reservoir of electrons; QD, quantum dot; and TB, tunnel
barrier. (b) The potential profile, V (z), along the line AB. Ohmic contacts, OC, are not shown
since they are outside of the depicted region.
Single-electron transistors
Single-electron devices like the one sketched in Fig. 8.23 are two-terminal devices, i.e.,
they are diode-type devices. It is possible to introduce an additional gate and create
Nanostructure devices
G1
(a)
G3
G2
G4
C2
C3
(b)
C1
Source
Drain
VG
Gate
Figure 8.24 (a) The design of a quantum-dot single-electron transistor; (b) an electric circuit with
a quantum-dot single-electron transistor.
Conductance (10 6 1)
272
12
12
Sample 1a
Sample 1b
4
0
12
12
273
Figure 8.26 A scanning electron microscope image of a single-electron transistor. From Y. Ono,
A. Fujiwara et al., Manipulation and detection of single electrons for future information
processing, J. Appl. Phys., 97, 031101 (2005). Reused with permission from Yukinori Ono,
Akira Fujiwara, Katsuhiko Nishiguchi, Hiroshi Inokawa, and Yasuo Takahashi, Journal of
C 2005 American Institute of Physics.
Applied Physics, 97, 031101 (2005).
states in the contacts. Though the conductance versus gate-voltage dependences are
different, i.e., not reproducible, the peak spacing is the same for both devices. It is
determined by the change in the gate voltage required to change the charging energy
of the quantum dots by one electron. The figures show clearly that the electric current
is modulated significantly by the gate voltage. Thus, for transistors with single-electron
transport, strong control of very small electric current may be possible. The problem of
fabrication of reproducible devices requires further improvements in technology.
274
Nanostructure devices
(I)
(II)
(V)
(III )
(IV)
the island potential nearly constant (state IV), and then lower the island potential so that
an electron can enter from the left channel (state V). Thus, during one period only one
electron can be controllably transferred through the device. This system can be viewed
as a single-electron pump, since it transfers the electrons one by one without applying a
bias between the source and drain.
The electron pump can have a number of applications. One of them is a current etalon.
Indeed, the average current flowing through the device is
I = e f,
where f is the frequency of variation of the voltage on the gates. Since the frequency can
be measured with very high accuracy, the device can serve as the electric-current standard. The accuracy of such a standard is much higher than the analogous characteristic
of any other current standards.
The turnstile operation is realized by making a small change in the previously described
regime of operation. The detailed procedure for the turnstile is illustrated schematically
in Fig. 8.28. In the turnstile mode, with a non-zero sourcedrain voltage, we first close
gate 2 so that an electron enters only from the left side (stage I). We next close both gates
(stage II), and then open gate 2 so that an electron is emitted to the right (stage III). Finally,
we again close both gates (stage IV), and open gate 1, reaching the initial state. This
procedure can be accomplished by applying an AC bias to each gate with the phase shift
of . Noteworthy is that, in this turnstile procedure, at least one of the channels is always
closed, which is not the case in the pump operation mode. This is advantageous because
high resistance effectively prevents the tunneling of two electrons one of the major error
sources for single-electron-transfer regimes. Figure 8.29 shows the drain-current versus
drain-voltage characteristics in the aforementioned turnstile mode at several frequencies,
f = 0.001, 0.5, and 1.0 MHz. Staircases quantized in units of e f are observed both for
275
(I)
EF
EF
( II )
(IV)
EF
EF
EF
EF
( III )
EF
EF
Figure 8.28 Schematics of a turnstile operation. From Y. Ono, A. Fujiwara et al., Manipulation
and detection of single electrons for future information processing, J. Appl. Phys., 97, 031101
(2005). Reused with permission from Yukinori Ono, Akira Fujiwara, Katsuhiko Nishiguchi,
C 2005
Hiroshi Inokawa, and Yasuo Takahashi, Journal of Applied Physics, 97, 031101 (2005).
American Institute of Physics.
0.4
0.3
0.2
1 MHz
0.5 MHz
1 kHz
0.1
0.0
-0.1
-0.2
-0.3
-0.4
-120 -80
-40
40
80
120
276
Nanostructure devices
Potential-effect transistors
Bipolar transistor
Hot-electron transistor
positive and for negative drain voltages. This is the evidence of single-electron transfer
by the turnstile operation. The levels of the current plateaus for the positive drain voltages
are exactly equal to e f to within the accuracy of the measurement system of about 102 .
In these experiments, the working currents were extremely small.
In conclusion, the main principle of the FETs current control by means of a voltage
applied through capacitively coupled electrode(s) can be successfully applied to singleelectron devices. By combining the field effect and the single-electron phenomena, one
can manipulate the device states using extremely small electric currents.
To date, single-electron experiments have been conducted at low temperatures (typically below 20 K). However, with further scaling down of semiconductor technology
to nanosizes, we may expect that single-electron effects and devices will be exploited at
higher temperatures, including under room-temperature conditions.
8.5
Potential-effect transistors
According to the classification given in Section 8.3, a second large family of electronic
three-terminal devices is that of potential-effect transistors. This family is shown in
Fig. 8.30. In contrast to the case of FET devices, potential-effect transistors are currentcontrolled. The controlling electrode is resistively coupled to the active region of the
device and the carriers are separated by an energy barrier.
The most important representative of this class is the bipolar transistor, which was
invented in 1947, and has undergone considerable and persistent transformation. Currently, the bipolar transistor provides high speed of operation in most circuit applications.
Detailed analysis of the physics of bipolar transistors is certainly beyond the scope of
this book. Instead, at the end of this chapter we provide references to books on bipolar
transistors. However, we shall consider the working principles of this transistor, at least
very briefly.
pn Junctions
The key element of the transistor is the so-called pn junction. In the simplest case it is
a junction of two regions in a semiconductor: one side is doped by donors and contains
277
Depletion
regions
Ec
eBI
Electrons
+
+
Ev - - - - - - - - ++++++
-
Ec
EF
Holes
Ev
Figure 8.31 A pn junction with abrupt coordinate dependence of doping concentration.
(a)
Equilibrium
(b)
Forward bias
(c)
Reverse bias
EFp
EFn
EFp
EFn
f
EFp
e
EFn
Figure 8.32 The scheme of a pn junction (a) at equilibrium; (b) under forward bias f ; and
(c) under reverse bias r . For non-equilibrium cases, (b) and (c), instead of one Fermi level,
there are two quasi-Fermi levels, one for holes, E Fp , and one for electrons, E Fn , as shown.
electrons; the other side is doped by acceptors and contains the holes. In Fig. 8.31, the
scheme of such a pn junction is presented. An abrupt transition between the p- and
n-doped sides of a semiconductor is assumed. The charge transfer across the junction
lines up the various Fermi levels and results in a Fermi level, E F . It depletes two narrow
regions in the p- and n-doped sides and creates a built-in electrostatic potential, BI .
The potential barriers of height eBI prevent the penetration of electrons into the p-type
side of the structure and of holes into the n-type side of the structure. No electric current
passes between p and n regions under such an equilibrium condition; see Fig. 8.32(a).
As is clear from this discussion, the potential profile which appears at the pn junction
and the possibility of controlling of it are characteristic for potential-effect devices.
278
Nanostructure devices
IS
Now, let a voltage be applied to the pn junction. The corresponding electric circuits
are shown in the upper part of Fig. 8.32. There are two quite different cases. The forward
bias of the pn junction corresponds to a negative potential, = f (f > 0), applied
to the n-type side. Potential barriers both for the electrons and for the holes decrease and
are equal to e(BI f ). This case is illustrated by Fig. 8.32(b). A positive potential
(the reverse bias r > 0 of the junction) leads to an increase of the potential barrier
between the n and p parts of the structure, as illustrated by Fig. 8.32(c). Now the barrier
height equals e(BI + r ). The applied voltage is dropped primarily across the depletion
regions because their resistances are much greater than those of neutral n and p regions
of the junction. The voltage breaks down the equilibrium; in particular, there is no longer
a common Fermi level. Instead of the true Fermi level, quasi-Fermi levels E Fn and E Fp
can be introduced for better understanding of changes in the potential relief, as shown
in Fig. 8.32. As soon as E Fn = E Fp , an electric current starts to flow between the n and
p parts of the structure. For a forward applied bias, the current increases exponentially
as the potential barrier decreases, and the junction becomes strongly conducting. For
an applied reverse bias, the current is small and saturates at large biases. The current
voltage characteristic of the pn junction is presented in Fig. 8.33. The main features of
this characteristic are a rectifying behavior and a strong nonlinearity. This nonlinearity
is used in numerous applications of the pn diode.
Bipolar transistors
A homostructure bipolar transistor consists of two pn junctions. Usually one of
them the emitter junction is forward-biased, while the other the collector junction is reverse-biased. The energy-band diagram for a bipolar transistor is shown
schematically in Fig. 8.34. This diagram clearly illustrates the formation of a potential
for electron and hole motion. The control of this potential is the key element of the
bipolar transistor. The device depicted is of the npn type, though almost all results are
applicable also to pnp devices. This diagram can be easily understood in terms of the
charge-transfer effect between the emitter (left n region), the base (middle p region), and
the collector (right n region). The voltage bias is supposed to be such that an electron
travels from the left n region across the p region to the right n region. An applied forward
Emitter
n+
Base
279
Collector
p
n
Figure 8.34 An energy band diagram of a homostructure npn bipolar transistor along the
direction of electron travel. Quasineutral emitter, base, and collector regions are indicated. The
emitter junction is forward-biased, and the collector junction is slightly reverse-biased.
IC
IB
5
4
3
2
1
CB
Figure 8.35 The collector current IC in a bipolar transistor for five different base currents, IB .
bias to the emitter junction lowers the energy barrier for an electron flowing from the
emitter to the base. Simultaneously, a similar flow of holes from the base to the emitter
appears. Those electrons that overcome the barrier travel across the base. Recombination
of the minority carriers (electrons) occurs in the base, but, if the thickness of the base
region is less than the diffusion length of the minority carriers, these electron losses are
negligible. When the electrons reach the collector junction, they are swept away into the
collector due to the high electric field within the depletion region of the latter junction
and form the collector current. In the bipolar transistor, each of the conducting regions
(the emitter, the base, and the collector) is provided by an electrical contact. Thus, this is
a three-terminal device. The operation of a bipolar transistor is based on the principle of
controlling the current by injecting minority carriers. For example, in an npn device,
the injection of minority carriers from a forward-biased np junction (emitterbase) into
the base provides the current through the base contact and the controlling function of the
collector current. In Fig. 8.35, the collector current versus the collectorbase voltage,
CB , is shown for various currents, IB , through the base contact (compare this with the
currentvoltage characteristics of the FETs presented in Fig. 8.10).
The carriers travel from the emitter to the collector perpendicular to junctions; thus,
the carrier transit time through the base determines the speed of operation and the
cut-off frequency of the device. Device scaling in order to reduce the transit time in a
280
Nanostructure devices
Emitter
Base
n++ Si
n+ Si
p+ SiGe
Base
n Si
n+ Si substrate
Collector
Figure 8.36 The schematic cross-section of an npn Si/SiGe heterojunction bipolar transistor.
n++ , n+ , p+ , and n denote high, intermediate (n+ and p+ ), and relatively low doping,
respectively. The p+ -SiGe layer is strained.
homojunction bipolar transistor can not be achieved while simultaneously realizing all
the requirements for high performance; such requirements include high doping levels
both in the emitter and in the base, and a small base region. The use of heterostructures
helps to solve these problems and to improve device performance. Thus, one can divide
this transistor class into two large and important groups: homojunction and heterojunction bipolar transistors, as shown in Fig. 8.30. As an example of the latter subclass
of bipolar transistors, consider the Si/SiGe device. In Fig. 8.36, the cross-section of a
Si/SiGe heterostructure npn bipolar transistor is shown. The device is fabricated on
an n+ -Si substrate (subcollector with doping of about 2 1019 cm3 ) contacted by an
electrode. The collector layer is doped to approximately 2 1016 cm3 and has a width
of about 0.3 m. The p+ base is fabricated of a Si1x Gex alloy with x ranging from
0.2 to 0.3, and has a width of typically 3050 nm. The base doping is in the range from
2 1018 to 5 1019 cm3 . Two-sided metallic electrodes provide the direct contact with
the base region. The emitter is made of n+ -Si with doping of about 5 1017 cm3 . A
heavily doped layer, marked as n++ -Si, is placed at the top of the emitter to provide a
good contact to the metallic electrode. Owing to the very short base and other peculiarities of the heterojunctions, for such a device an extremely high cut-off frequency of
75 GHz at room temperature is measured, whereas at liquid-nitrogen temperature, a
cut-off frequency as high as 94 GHz is reported for this type of bipolar transistor.
The very recent record results for this type of bipolar transistor were reported by
IBM: the silicongermanium transistor hits 500 GHz. More detailed analysis of heterostructure bipolar transistors can be found in references presented at the end of this
chapter.
Hot-electron transistors
Another group of potential-effect devices is represented by hot-electron transistors, as
illustrated in Fig. 8.30. These devices employ an emitterbarrierbasebarriercollector
structure. The hot electrons are injected over, or through, an emitter barrier into a
(a)
(b)
(c)
M
S
M O M
O
M
O
281
M
S
Figure 8.37 Energy-band diagrams for metal-base ballistic-injection transistors. Metal, oxide,
and semiconductor layers are marked by M, O, and S, respectively. Semiconductor layers are
doped. The diagrams are presented under operational bias conditions.
narrow base region. Injected carriers have a large velocity and transit through the
base almost ballistically; that is, they transit the base without significant scattering.
The current through the device is controlled by changing the height of a collector barrier. This principle is used in a number of different configurations and combinations of
materials.
Consider a ballistic-injection device, which is primarily a unipolar device; that is,
only one type of carrier, say electrons, is used. A ballistic-injection device also consists of an emitter, base, and collector. The role of the emitter is to inject electrons
with high velocities into the base; the second electrode should collect these electrons.
The input base voltage controls the electron injection and, therefore, the output emitter
collector current. If small changes in the input produce larger changes in the output, the
device exhibits a current gain. Generally, in ballistic devices, the electrons are injected
into the base with a high energy exceeding 0.1 eV. This should lead to a decrease in
the time of flight through the base region. Another advantage of ballistic devices is
related to their unipolar character; this means that it is possible to choose the fastest
type of majority carriers (electrons) and avoid the participation of slower minority
holes.
In order to realize a transistor faster than a bipolar or field-effect transistor, several
schemes of ballistic-injection devices have been proposed. They differ in terms of the
physical mechanisms of electron injection as well as in the materials used in the devices.
The first and, perhaps, the simplest device is a metaloxidemetaloxidemetal heterostructure. In Fig. 8.37(a), this device is shown under a bias. Other similar device
structures are the metaloxidemetalsemiconductor structure and the semiconductor
metalsemiconductor (SMS) structure presented in Figs. 8.37(b) and 8.37(c), respectively. The principle of operation is the same for these three structures. Consider, for
example, the case of the SMS structure. One can see a close analogy with the bipolar transistor: a forward-biased semiconductormetal junction serves as the emitter, a
second metalsemiconductor junction serves as the collector, and a metal layer is the
base. Both junctions are, in fact, Schottky diodes, one is forward-biased and the other
is reverse-biased. Under such bias conditions, electrons are injected over the Schottky
barrier with energies substantially exceeding the thermal energy in the base. If the base is
narrow, the electrons fly across the base region without losing their energy. Their subsequent destiny is defined by the basecollector bias: a lowering of the collector barrier
282
Nanostructure devices
increases the fraction of electrons coming into the collector electrode and, consequently,
the collector current.
Various materials have been used for the structures shown in Fig. 8.37. In particular,
the Si/CoSi2 /Si structure was investigated for the SMS scheme. An factor as high as 0.6
was reported for this case. A common disadvantage of metal-base transistors is the high
electron reflection from the metal. This reflection is, mainly, a quantum-mechanical effect
in nature and, therefore, can not be avoided. It occurs even for ideal semiconductormetal
interfaces.
Semiconductor heterostructures may be used in other ways to realize ballistic-injection
devices. Let us consider n-type devices. In semiconductor-based ballistic-injection
devices, the emitter, base, and collector are doped regions separated by two barriers.
The barriers can be formed by growing layers of materials with a positive conductionband offset as shown in Fig. 8.38(a). This structure actually uses four heterojunctions.
Another kind of barrier can be produced by planar acceptor doping in a homostructure
as illustrated in Fig. 8.38(b).
Consider the typical parameters of structures for ballistic-injection devices. If the
height of the emitter barrier is Vb , the velocity
the emitter is p||,E 2m kB T , where T is the device temperature. Owing to the lateral
translation symmetry this component does not change under electron injection through
the barrier. Thus, the characteristic angle for the velocity spreading can be evaluated as
follows:
p||,B
p||,E
v||,B
kB T
= = =
.
vB
m vB
m vB
Vb
For example, in the case of liquid-helium temperatures and Vb 0.3 eV, this equation
gives 6 .
Collisions in the base reduce the number of ballistic electrons. If the electron mean
free path in the base is le and the base width is L B , the fraction of ballistic electrons
collected by the collector is estimated to be exp(L B /le ). Thus, the base region
should be quite narrow. In this case, the base region has to be heavily doped to reduce
the resistance for the base current.
A limitation of the base-doping technique comes about as a result of the fact that a
high doping gives rise to additional electron scattering and quenches the ballistic regime
of electron motion. Specifically, for IIIV compounds, if impurity concentrations exceed
1018 cm3 electron scattering becomes very strong.
As a result, for AlGaAs/GaAs devices, base regions with doping of about 1018 cm3
and with widths of 3080 nm are used. Optimization of the structure parameters facilitates
283
(a)
emitter
n-GaAs
AlGaAs
n-GaAs AlGaAs
n -GaAs
base
collector
(b)
n+
Depleted
acceptor
layers
n+
(c)
n+
eBE
n-GaAs Graded-gap
AlxGa1xAs
GaAs
AlGaAs
eCB
Undoped
n
Figure 8.38 Energy-band diagrams for semiconductor ballistic transistors. (a) The AlGaAs/GaAs
structure with four heterojunctions and a doped base. Electrons are injected into the base due to
tunneling. This kind of device is referred to as a tunneling hot-electron-transfer amplifier. (b) A
homostructure electron device. Barriers are formed by planar p-doping. The base is n-doped.
(c) A device with an undoped base. After Fig. 12, S. Luryi, Hot-electron transistors, in S. M.
Sze (Ed.), High-Speed Semiconductor Devices (New York, Wiley, 1990). Reprinted with
permission of John Wiley & Sons, Inc.
284
Nanostructure devices
The two-dimensional electron gas in the base is characterized by a high mobility and
large two-dimensional electron concentrations of up to 2 1012 cm2 . Both effects
cause a low base resistance; thus, the base can be chosen to be very narrow, typically
This results in a high fraction of ballistic electrons coming into the collector.
100 A.
For example, a well-designed AlGaAs/GaAs induced-base transistor results in 0.96
even at room temperature. Similar induced-base transistors have been fabricated using
InGaAs/AlGaAs and p-doped Ge/SiGe heterostructures.
In conclusion, hot-electron transistors with nanoscale base regions are characterized
by a high operation speed. An electron transit time of the order of 0.10.5 ps corresponds
to a cut-off frequency in the range 210 THz.
8.6
285
E2
hw = E2 E1
E1
Figure 8.39 A two-level system interacting with light.
boson statistics, or BoseEinstein statistics, which gives the average number of photons
of some chosen mode under equilibrium in the form
N=
1
,
exp[h/(kB T )] 1
(8.26)
with T being the ambient temperature. Equation (8.26) is also known as Plancks formula.
Clearly, the BoseEinstein statistics of photons differs drastically from Fermi statistics,
which we used for electrons (see Chapter 6). In particular, it permits the accumulation
of an arbitrary number of photons in any mode. For example, from Eq. (8.26) it follows
- ) 1 the phonon number N kB T /(h
- ) 1, whereas the number of
that at kB T /(h
electrons occupying any state can not exceed 1.
After these introductory remarks on quantum properties of electromagnetic fields, we
review briefly the three major processes involving a quantum material system and light:
absorption, spontaneous emission, and stimulated emission.
(8.27)
If the lowest energy level E 1 is occupied, the wave can excite the system into the upper
level E 2 and the electromagnetic energy must decrease. One can describe this process as
the absorption of one photon because the energy of the electromagnetic field decreases
by E 2 E 1 . If the system occupies the upper level E 2 , it can make a transition to level
E 1 as a result of interaction with the electromagnetic field. Then, the electromagnetic
energy increases by E 2 E 1 . This process represents the emission of a photon with
energy -h. When activated by an external electromagnetic wave, the latter process is
called stimulated emission. It is important that, for stimulated emission, each emitted
photon has the energy, direction, polarization, and even phase coinciding precisely with
those of the stimulating wave.
286
Nanostructure devices
(8.28)
(8.29)
where n 1 and n 2 are the numbers of particles in the system occupying the levels 1 and 2,
respectively; B12 and B21 are kinetic coefficients describing these processes. The physical
meanings of these coefficients will be addressed in the subsequent discussion.
The two processes of absorption and emission are insufficient to describe the whole
picture of the interaction between radiation and matter. For example, let us apply these two
processes only for thermal equilibrium conditions, for which the ratio of the populations
of the two levels is
n2
= e(E2 E 1 )/(kB T ) = eh /(kB T )
(8.30)
n1
(see Section 6.3 and particularly Eq. (6.14)). Using Eqs. (8.28)(8.30) one can see that
Rabs = Rst.em at any temperature T . This result is in contradiction with the expected
equilibrium between the system and the field. According to the Einstein theory, there is
an additional quantum radiative transition in the system with the spontaneous emission
of a photon of the same mode. The rate of this process is
Rsp.em = A21 n 2 ,
(8.31)
where A21 is the coefficient or rate of spontaneous emission. The spontaneous process
does not depend on the intensity of the electromagnetic wave and takes place even in
the absence of this wave. According to quantum electrodynamics, the excited material
system spontaneously emits a photon as a result of the interaction between the electrons
and the zero-point vibrations of electromagnetic fields. The zero-point vibrations of
electromagnetic fields were introduced briefly in Section 3.3.
In contrast to the case of stimulated emission, a photon produced by the spontaneous
process has an arbitrary phase. Moreover, this process produces photons with different
directions of q and polarizations, but the energy is fixed, i.e., it produces photons of
different modes of the same frequency.
Now we can apply the results of Eqs. (8.28)(8.31) to thermal equilibrium. Under
equilibrium conditions, the total rate of photon emission has to be equal to the rate of
photon absorption; thus we will have
Rabs = Rsp.em + Rst.em .
(8.32)
Using the Planck formula of Eq. (8.26) and the ratio of n 2 /n 1 of Eq. (8.30), and substituting the expressions for Rabs , Rsp.em , and Rst.em into Eq. (8.32), one can find the
relation
-
(8.33)
287
Because this relation has to be satisfied at any arbitrary temperature T , one can obtain
two equalities:
A21 = B21 = B12 .
(8.34)
Thus, we have established the existence of three basic processes for the resonant interaction of radiation and matter: absorption, stimulated emission, and spontaneous emission.
Moreover, we have found the relations between the coefficients determining the rates of
these processes. It is worth emphasizing that all three processes are related to interactions
with photons of the same mode.
The sum of the stimulated and spontaneous emission rates, as determined by
Eqs. (8.29) and (8.31), gives the total emission rate of photons for a fixed mode:
Rem = A21 (1 + Nq,b )n 2 .
(8.35)
From this equation, one can see that stimulated emission dominates over spontaneous
emission for a fixed mode, {q, b}, if the number of photons, Nq,b , is sufficiently larger
than 1. However, there is spontaneous emission of a great number of other modes with
the same frequency but with different directions of q and different polarizations b. This
total spontaneous emission can be the dominant radiative process even if stimulated
emission is the most important process for a particular mode.
Now, let us compare absorption and stimulated emission by calculating the rate of
increase of the number of photons in some fixed modes:
dN q,b
R Rst.em Rabs = B21 Nq,b (n 2 n 1 ).
(8.36)
dt st
This result shows that, if
n 2 n 1 > 0,
(8.37)
288
Nanostructure devices
1
B21Nq,b (n 2 n 1 )
cm
(a)
(b)
LZ
LZ
289
Figure 8.40 Longitudinal and transverse modes in the planar FabryPerot resonator.
(a) Longitudinal modes. Strictly perpendicular light rays do not escape from the resonator.
(b) Transverse modes. Slightly inclined light rays eventually escape from the resonator and have
poor quality factors.
plane mirrors parallel to each other, which are finite in their transverse dimensions; an
example is given by the so-called FabryPerot etalon shown in Fig. 8.40. In this resonator,
most of the modes propagating through the cavity are lost in a single traversal of the
cavity since the mirrors are inclined with respect to the mode-propagation directions. This
implies that most of the modes, so-called transverse modes, have a very low quality factor.
Only the waves propagating perpendicular to the mirrors can be reflected back and travel
from one mirror to another without escaping from the resonator. These waves correspond
to the so-called longitudinal modes of the resonator. Thus, for finite dimensions of mirrors
only longitudinal modes can have a high quality factor. Their loss and diffraction are
caused by absorption by the mirrors, by transmission through the mirror, and by wave
diffraction on the sides of the mirrors. The wave-diffraction losses can be made much
smaller than can those arising from other loss mechanisms. Thus, the open resonator
provides for strong discrimination between modes. Relatively few of these modes have
a high quality factor, Q. According to the definition of Eq. (8.39), they are capable of
accumulating the light energy. The photons corresponding to these high-quality modes
can be generated effectively by the active medium.
At this point, we end our discussion of the simple two-level model of the optical
medium and start to consider more realistic systems.
290
Nanostructure devices
Absorption
Emission
absorption increases sharply in the short-wavelength region. Let E g be the bandgap of the
semiconductor. Then the material is relatively transparent for -h < E g . For -h > E g ,
the semiconductor exhibits strong absorption; accordingly, -hg = E g corresponds to
the absorption edge. The shape of the absorption edge depends significantly on the
structure of the electron bands. Remember that in Section 4.4 we defined direct- and
indirect-bandgap semiconductors. Direct-bandgap semiconductors such as GaAs have
a more abrupt absorption edge and a larger absorption value than do indirect-bandgap
materials, of which Si provides an example. We can introduce the so-called bandgap
- /E g . If E g is given in eV, the bandgap
wavelength, or cut-off wavelength, g = 2 ch
wavelength in micrometers is
g =
1.24
.
Eg
(8.40)
The values of E g and g for various IIIV semiconductor materials are apparent from the
curves plotted in Fig. 8.42. One can see that interband transitions in IIIV compounds
cover a wide range from infrared to visible spectra. Optical activity in this spectral region
is crucial for optoelectronic applications of these materials.
A photon absorbed during an interband transition excites an electron from the
valence band to the conduction band, i.e., it creates an electronhole pair as depicted in
Fig. 8.43(a). The inverse process the phototransition of an electron from the conduction
band to the valence band is referred to as the radiative recombination (annihilation)
of an electron and a hole; this is depicted in Figs. 8.43(b) and 8.43(c). According to the
general properties of phototransitions studied at the begining of Section 8.6, there exist
two such processes: spontaneous and stimulated emission, as illustrated by cases (b) and
(c), respectively, in Fig. 8.43.
From the above analysis it is clear that, to obtain intense interband emission, one
should provide nonequilibrium electrons and holes. Moreover, large concentrations of
291
Figure 8.42 Absorption coefficient versus photon energy and wavelength for interband
phototransitions in various semiconductors. After G. E. Stillman, V. Robbins et al., IIIV
compound semiconductor devices: optical detectors, IEEE Trans. Electron Devices, ED-31,
C IEEE.
16431655 (1984).
(a)
(b)
(c)
E2
Ec
Eg
hw
hw
hw
hw
Ev
E1
hw
Figure 8.43 (a) The absorption of a photon results in excitation of the electronhole pair.
both electrons and holes in the same spatial region correspond to the population inversion
necessary for stimulated emission. Indeed, according to Section 4.4, the presence of the
holes means empty electron states in the valence band. This explains the appearance
of the population inversion between the conduction and valence bands under conditions
of large electron and hole concentrations.
However, nonequilibrium carriers quickly relax due to various interband relaxation
mechanisms. When the electron and hole concentrations are highly nonequilibrium, the
characteristic lifetime of these excess carriers is small. To describe the nonequilibrium
electrons and holes, we introduce the density of the pumping (excitation) rate, Rpump ,
which represents the number of electronhole pairs excited in a unit volume per unit
292
Nanostructure devices
time. For the case when the radiation mechanism is the decay of electronhole pairs, one
can write
Rpump = Bn 2 =
n
,
R (n)
(8.41)
(8.42)
To obtain an idea of how intense this pumping is, we can perform the following estimates. Suppose that the bandgap, E g , is about 1 eV, then the pumping rate of Eq. (8.42)
corresponding to the density of the pumping power given in the previous example is
E g Rpump 16 MW cm3 . This is a huge pumping power!
At this point, an important spectral property of light amplification in semiconductor
materials will be discussed. For the previously analyzed two-level model, phototransitions and light amplification/absorption were possible only at a fixed photon energy
-h = E 2 E 1 . This implies that in such a two-level system the amplification and absorp- . On the contrary, in
tion occur in a very narrow spectral range near the value (E 2 E 1 )/h
semiconductors, the spectral range within which amplification and absorption are possi- . This results in a wide spectral band of optical
ble is restricted only from below, > E g /h
activity. In Fig. 8.44, we show the spectral dependences of the gain/loss coefficient for
GaAs for various electronhole concentrations at room temperature. The left boundary
of these dependences coincides approximately with the bandgap value 1.42 eV. The spectral range for which the gain coefficient, , is positive becomes wider and its maximum
moves to larger photon energies as the electron/hole concentration increases because the
number of inversely populated states increases with the concentration. Amplification in a
wide spectral range allows one to build a laser with considerable tuning of the frequency.
The following two methods can be applied to generate large nonequilibrium carrier
concentrations. The first method is optical excitation or pumping, whereby external light,
often incoherent, with a photon energy larger than the bandgap is absorbed and creates
nonequilibrium electronhole pairs. Then, because of the short time associated with the
intraband relaxation, the electrons and holes relax to the bottom of the conduction band
and the top of the valence band, respectively, where they are accumulated. If the rate of
optical pumping is sufficiently large, the necessary level of interband emission can be
reached. In particular, inversion between the bands can be induced and light amplification
and laser generation become possible. The optical pumping corresponds to a conversion
of one kind of radiation, not necessarily coherent, into coherent radiation with a lower
photon frequency. One employs optical pumping in cases when electric-current pumping
either is not possible or is ineffective. This pumping method is often used to test prototype
laser structures before the design of the current pumping system.
293
160
120
cm
(cm 1)
18
80
40
0
40
18
cm
80
1.42
1.46
1.50
Laser diodes
A much more convenient method that can be applied to achieve the interband emission is
electron and hole injection in devices with pn junctions. The physics of a pn junction
was studied in the previous section. There, we found that, under forward bias of a pn
junction, it is possible to inject the electrons from the n part of the junction and the holes
from the p part of the junction into the same spatial region. Such a double injection can
give rise to large nonequilibrium electron and hole concentrations. Often, the n and p
parts are separated by a narrow intrinsic (undoped) i region. The corresponding structure
is called a pin device.
In Fig. 8.45, we show the case of forward biasing of a pin structure the so-called
flat-band condition, when there are no potential barriers for the electrons and holes,
and the maximum possible carrier injection into the i region is realized. In the case of
direct-bandgap semiconductors, double injection provides for the intense emission of
light.
It is easy to estimate the electric currents which are necessary to obtain a given
concentration of carriers under double injection. We define the active region, where both
nonequilibrium electrons and holes are present. In the case of a pin structure, the
active region coincides with the i region. Let the length of this region be l. We introduce
294
Nanostructure devices
Ec
p+
Ev
n+
+h
Figure 8.45 The double injection of the electrons and holes, which leads to the formation of
a cross-sectional area A, through which the current I is injected into the pn region of a
diode. Then, at steady state, the rate of injection of electrons and holes into a unit volume
per unit time can be expressed as
R=
J
I
= ,
e Al
el
(8.43)
J,
el
(8.44)
where is the total lifetime of the nonequilibrium pairs in the active region. In general,
due to the small electron and hole lifetimes, the active region is quite narrow. For GaAs,
the length of the active region is estimated to be from 1 m to 3 m, depending on
temperature, crystal quality, etc. Now, we can use Eq. (8.43) to estimate the current
density necessary to achieve the pumping rate of Eq. (8.42): J (1.64.8) 103 A cm2 .
The current densities obtained are very large. These estimates indicate that, in order to
operate with acceptable current levels, the cross-sections of real emitting diodes have to
be very small.
From Eqs. (8.43) and (8.44), it follows that another possible way of decreasing the
pumping electric current is to reduce the thickness of the active region l. This thickness is
one of the critical parameters for injection pumping of light-emitting devices. Indeed, as
understood from the previous discussion and Fig. 8.45, an excess carrier concentration
in the active region always leads to carrier diffusion out of this region: the electrons
diffuse through the active region to the p part of device, while the holes diffuse to the n
part. In general, diffusive leakage of the carriers is limited by their finite lifetime (n).
In Section 6.2 we found that, if the diffusion
coefficient D is given, the diffusion length
during a time can be estimated as L D = D . In the case under consideration, is the
excess carrier lifetime and the length L D corresponds to the average distance of electron
(hole) transfer before recombination. Obviously, the minority carriers (the electrons in
the p part and the holes in the n part) recombine immediately with the majority carriers
(the holes in the p part and the electrons in the n part). Thus, the width of the region with
the excess carrier concentration can not be less than the diffusion length L D , i.e., d > L D .
Since the diffusion length of electrons and holes in direct-bandgap materials is of the
order of a few micrometers, it is not possible to make the active region shorter than a
295
E
Barrier
Eg1
EF
EF
h
Eg2
Eg3
Figure 8.46 The energy-band diagram of a double heterostructure for light amplification/
generation. The applied voltage 0 induces the flat band conditions; E g1 , E g2 , and E g3 are the
bandgaps in different regions of the structure. The energy states filled by the electrons are
denoted by the shaded areas. Thus, in some spatial region there are electrons in the conduction
band and depopulated states (holes) in the valence band.
Nanostructure devices
(b)
(c)
(a)
Figure 8.47 Composition profiles of (Alx Ga1x )0.5 In0.5 P heterostructures providing
simultaneously quantum confinement of the carriers and optical confinement: (a) a single
quantum well and a step-like refractive-index heterostructure, (b) a single quantum well and a
graded-index optical confinement structure, and (c) a multiple quantum well and a step-like
index structure. After P. S. Zory Jr., Quantum Well Lasers (Boston, MA, Academic, 1993).
a 0 (cm1)
296
SQW L z = 50
200
InGaAsP
= 1.3 m
100
0
20
GaAs
40
j (A cm 2)
60
Figure 8.48 The amplification coefficient 0 as a function of the pumping current density for
After P. S. Zory Jr., Quantum Well
single InGaAsP and GaAs quantum wells of width 50 A.
Lasers (Boston, MA, Academic, 1993).
and holes that are either generated by external light or injected from p and n regions
move in barrier layers and then are captured in the active region and quantum wells. The
characteristic time of this capture is less than 1 ps. Escape processes require additional
energy and have low relative probabilities of occurrence. Carriers in the quantum wells
relax to the lowest energy states available. This results in the accumulation of both types
wide or even
of carriers in an extremely narrow active region, which is typically 100 A
narrower. A similar situation can be realized if quantum wires or dots are embedded in
the active region.
A positive effect of electron confinement in quantum wells is illustrated in Fig. 8.48.
In this figure, the results for the gain factor 0 as a function of the current density J
are presented for two material systems: InGaAsP/InP and AlGaAs/GaAs heterostructures. It can be seen that the gain coefficient always increases with the current. The
current for which 0 becomes positive, i.e., a population inversion is established, can
be defined as the threshold current. The gain coefficient is shown for heterostruc quantum wells. This system has low values of threshold currents for
tures with 50-A
both heterostructures: 20 A cm2 and 60 A cm2 for InGaAs/InP and AlGaAs/GaAs,
297
Cleaved
surface
Ly
d
Lx
n
Cleaved
surface
Figure 8.49 The scheme of an injection laser with two cleaved facets that act as reflectors. After
B. E. Saleh and M. C. Teich, Fundamentals of Photonics (New York, Wiley, 1991). Reprinted
with permission of John Wiley & Sons, Inc.
respectively. For both material systems, the 0 (J ) curves increase sharply with the current density above the threshold. As discussed previously, lasing can be achieved if a
light amplifier is supplied with a path for optical feedback. For an injection laser, the
feedback is usually obtained by cleaving the crystal planes normal to the plane of the
pn junction. Figure 8.49 depicts a device with two cleaved surfaces forming an optical
resonator. For the light reflected from the crystal boundaries, we define the reflection
coefficient by
r=
Ir
,
Iin
where Iin and Ir are the intensities of incident and reflected light, respectively. The
reflection coefficient for an airsemiconductor boundary is
n ri 1 2
,
r=
n ri + 1
where n ri is the refractive index of the semiconductor material. Since semiconductors
usually have large refractive indexes the coefficients r are large enough. The intensity of
the light transmitted through this mirror is
Iout = (1 r )Iin .
Let two cleaved surfaces be characterized by two coefficients, r1 and r2 . After two passes
through the device, the light intensity is attenuated by the factor r1 r2 . We can define
an effective overall distributed coefficient of optical losses:
1
1
.
ln
r
2L x
r1 r 2
Here, L x is the distance between the cleaved surfaces. In principle, there can be other
sources of optical losses in the resonator. Let them be characterized by the absorption
Nanostructure devices
3
(b)
298
r ~ 0.3
(a)
1 r ~ 0.8
10
12
After B. E. Saleh and M. C. Teich, Fundamentals of Photonics (New York, Wiley, 1991).
Reprinted with permission of John Wiley & Sons, Inc.
(8.45)
For injection lasers, the criterion of Eq. (8.45) is a condition imposed on the magnitude
of the injection current density, J .
In order to have an idea of the order of magnitude of optical losses in laser diodes, we
consider an example. For light of visible and near-infrared ranges, the typical refractive
index can be estimated as n ri 3.23.5. Thus, the reflection coefficient is r 0.3. Let a
diode have the geometry presented in Fig. 8.49 with dimensions such that d L x L y =
1 m 200 m 200 m. Then, we can estimate the radiation losses: r 60 cm1 .
The total current through the laser diode is 12 A. The threshold currents for lasing
for the quantum-well-based structures are considerably smaller and are typically of the
order of tens of mA. In Fig. 8.50, the output light power is presented as a function of the
quantum well embedded in
injection current for an AlGaAs laser with a single 100-A
the active region. This particular laser design has a threshold current of about 8 mA for
the optical feedback due to the cleaved (uncoated) end facets (the reflection coefficient
is r 0.3). The light reflection can be improved through the use of special reflecting
coatings on the end facets. In the latter case, the threshold current decreases below 1 mA,
as shown in Fig. 8.50 for the reflection coefficient r 0.8. The same figure illustrates
299
Light output
/n
metallization
active region
multilayer
implanted region
110 m
also a typical output light power in the injection lasers: from 1 mW to tens of milliwatts
depending on the particular diode and resonator design.
Surface-emitting lasers
Thus far, in considering quantum-well lasers we have assumed that the laser is designed
for light propagation along the quantum-well layers. Another possible design uses
light propagating perpendicular to the layers in a so-called vertical geometry. The
amplification of light passing through a quantum-well layer can be defined as
Iout Iin
.
Iin
Here, Iin and Iout are the input and output light intensities, respectively. The quantity
is estimated through the gain coefficient as () ()L, where L is the width
of the quantum well. It is easy to see that is typically very small. For example, if
we get = 104 . In order to obtain laser oscillations in
0 = 100 cm1 and L = 100 A,
a vertical-geometry structure, one should employ a multiple-quantum-well structure and
provide near-perfect mirrors with extremely high reflectivities. Figure 8.51(a) depicts
schematically such a surface-emitting laser. The laser design includes an active region
providing for high light gain, dielectric multilayers, metallic contacts, and implanted
regions, which form the light output. Layered dielectric mirrors give very high reflection
while the active region contains a multiple-quantum-well structure. The lateral sizes
of this laser can be reduced to the 110-m range. A decrease in the surface area of
the diode leads to a considerable decrease in the magnitude of the threshold current.
In the case of the quantum-well structures just considered, we can assume a characteristic current density of about 100 A cm2 ; accordingly, if the lateral sizes are each
5 m, the pumping surface area is 2.5 107 cm2 and the threshold current equals
25 A.
Surface-emitting quantum-well lasers offer a new advantage of high-packing density.
Nowadays, technology allows one to fabricate an array of about 106 surface-emitting
300
Nanostructure devices
Bi l xSbx
Pbx Snl xSe
PbxSnl xTe
PbSl xSex
InAsxSbl x
CdxHgl xTe
CdxPb1-xS
Inl xGax Asy Pl y
GaAs xSbl x
InAs xP1 x
(AlxGal x`)y Inl yAs
AlxGal xAs
GaAll xPx
InxGal xAs
(AlxGal x)y Inl yP
CdSxSe l x
CdxZnl xS
0.1
0.5
10
50
100
Wavelength (m)
Figure 8.52 Spectral regions of lasers with various semiconductor media. After P. L. Derry, A.
301
Operation in the very interesting range of shorter wavelengths requires the use of
semiconductors with larger energy bandgaps. Well-known examples of such widebandgap semiconductors are representative IIVI compounds: CdS, CdSe, ZnCdSe,
etc. By using different quantum-well designs, the blue injection lasers were realized
on the basis of these materials. However, the major problems associated with this class
of quantum-well lasers are the device degradation and short lifetimes of the devices.
The electrical power dissipated in IIVI diodes is too high and causes rapid degradation as a result of the generation of intrinsic defects. Much improvement must be
realized in this field in order for the group IIVI-based devices to have wide practical
application.
Recently, another class of heterostructure materials has been studied in detail the
nitrides of group III. It is difficult to obtain these materials in the usual single-crystal
form. However, thin nanometer-scale layers can be grown successfully with good quality
by applying the growth methods presented in Chapter 5. The nitrides of group III include
InN, GaN, AlN, and all possible ternary and quaternary alloys. For InN and GaN, the
energy bandgaps are about 0.9 eV and 3.5 eV, respectively, while that for AlN is 6 eV.
Thus, by changing the In and Al content in the alloys In y Alx Ga1yx N, one can increase
the bandgap from about 1 eV to 6 eV and realize lasers with wavelengths spanning the
range from 1.3 m (near-infrared light) to 0.2 m (deep-ultraviolet light). The alloys
Inx Ga1x N produce green, blue, and violet light.
Light-emitting diodes
Although stimulated emission from the injection laser diode is very important, practically,
sub-threshold operation of the diode when only spontaneous light is emitted is in
many cases advantageous and has a number of applications. This mode of operation
does not require feedback to control the power output, which facilitates operation over
a wide range of temperatures, and is reliable and inexpensive. Diodes operating with
spontaneous light emission are called light-emitting diodes.
The important characteristic of the light-emitting diode is the spectral distribution
of emission. The spectrum of emission is determined, primarily, by the electron/hole
distributions over energy, which can be approximately described by corresponding Fermi
functions, as studied in Chapter 6. Thus, the ambient temperature, T , defines both the
spectral maximum (the peak) and the spectral width of emission. The peak value of
the spectral distribution can be estimated as
-hm = E g + 1 kB T.
2
- and is independent
The full width at half maximum of the distribution is ! 2kB T /h
of . In terms of the wavelength, , we obtain
! = 2m /(2 c) !,
or
! = 1.452m kB T,
(8.46)
302
Nanostructure devices
Orange
Yellow
Near infrared
GaAs0.35 P0.65
GaAs 0.14P0.86
Green
GaP:N
GaN
Red
Violet
GaP:ZnO
GaAs
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
1.1
1.2
1.3
Wavelength (m)
Figure 8.53 The spectra of light-emitting semiconductor diodes with different bandgaps. After
Fig. 13 on page 696 of S. M. Sze, Physics of Semiconductor Devices (New York, Wiley, 1981).
Reprinted with permission of John Wiley & Sons, Inc.
303
transition intraband absorption is allowed in an ideal crystal system with heterojunctions. Since the latter phototransitions are drastically different from interband transitions,
laser action associated with these intraband transitions should differ in a fundamental
way from that studied for the laser schemes considered above. First of all, an intrabandtransition laser should employ only one type of carrier, i.e., it is a unipolar device.
Second, it should be based on electron transitions between confined states arising from
the quantization in semiconductor heterostructures. In order to create a population inversion between two confined states, one needs to provide for (i) electron injection into a
higher lasing state and (ii) depletion of a lower lasing state. For this purpose, a vertical
scheme of electron transport has been proposed. This scheme is illustrated in Fig. 8.56(a).
The proposed heterostructure is a superlattice with a complex design for each period.
Each of the periods consists of four AlInAs barriers, forming three GaInAs quantum
304
Nanostructure devices
hw
(a)
2
1
e3
e2
e1
EFn
hw
hw
(b)
Figure 8.56 (a) Two periods of the 25-stage staircase coupled-well region of a quantum-cascade
laser under operational conditions. The laser phototransitions are indicated by arrows. They
occur between levels (subbands) 3 and 2 with the photon energy of 295 meV. Level 2
depopulates through level 1 and subsequent tunneling. The energy separation between levels 2
and 1 is 30 meV. (b) Energy dispersion for subbands 1, 2, and 3, phototransitions, and
intersubband-scattering processes (straight lines). Reprinted with permission from J. Faist,
C 1994 AAAS.
F. Capasso et al., Quantum cascade laser, Science, 264, 553556 (1994).
wells, and a graded AlInGaAs region, which is doped. Under zero-bias conditions, the
overall band diagram resembles a sawtooth structure. Under an applied electric field,
the band diagram takes on a staircase structure as shown in Fig. 8.56(a). The barriers
form three coupled quantum wells with three quasi-bound levels. These three levels are
labeled in Fig. 8.56(a) by 1, 2, and 3. Each of the confined states originates from one
of the wells. The structure is chosen so that there is a considerable overlap between the
wavefunctions of the upper state, 3, and the intermediate state, 2. The same holds for
wavefunctions of states 2 and 1. Under a voltage bias, the potential in the doped regions
is almost flat, as shown in Fig. 8.56(a). The electrons are injected from the doped regions
through the barrier in the confined state 3 of the first quantum well. From this state, they
relax primarily to state 2. There are two processes of relaxation: phonon emission and
photon emission. In Fig. 8.56(b), these processes are shown for electrons with various
values of the in-plane wavevector, k. The three indicated subbands, 1,2,3 (k), correspond
to the three confined states. The straight arrows represent intersubband phonon relaxation. The third confined state, 3, is selected to provide depletion of state 2 as fast as
possible. Thus, in this manner we have a three-level scheme whereby the upper level is
pumped by the direct injection of electrons from the doped region. The second level is
depleted due to strong coupling with the lowest level 1. From level 1, electrons escape to
the next doped region. Then the processes are repeated in each subsequent period of the
superlattice. One can say that the carriers make transitions down through such a cascade
structure.
305
To describe the processes in the cascade structure we need to define the numbers of
electrons in three states: n 3 , n 2 , and n 1 . The criterion for population inversion between
levels 2 and 3 should be
n3 > n2.
We can write simple balance equations for n 3 and n 2 :
1
n3
dn 3
,
= J
dt
e
32
n3
dn 2
n2
=
,
dt
32
21
(8.47)
(8.48)
where J is the density of the injection current, and 32 and 21 are relaxation times
between the states 3 and 2, and 2 and 1, respectively. In Eq. (8.48), we neglect the
inverse 1 2 process since state 1 can be regarded as almost empty as a result of
fast electron escape to the doped region. For the steady-state conditions, we obtain the
concentrations
1
n 3 = J 32
e
and
n2 = n3
21
,
32
21
1
.
!n n 3 n 2 = J 32 1
e
32
(8.49)
Thus, to create a population inversion, one should design the laser so that
21 < 32 .
(8.50)
In order to fabricate such unipolar laser structures with vertical electron transport, very
precise sophisticated semiconductor technology is necessary.
For a particular device structure (Fig. 8.57) with an optical path of about 700 m and
the mirror reflectivity r1 = r2 = 0.27, the laser output-current characteristics at various
temperatures are shown in Fig. 8.58. The insets of Fig. 8.58 show the currentvoltage
characteristics and temperature dependence of the laser threshold current. The laser
threshold current can be approximated by Ith = C exp(T /112), where the constant C is
about 900 mA and T is measured in degrees Kelvin. From Fig. 8.58, it follows that the
output power reaches tens of milliwatts.
The emission energy is in the range 275310 meV. Spectra of the laser output for
various currents at T = 80 K are presented in the right-hand inset of Fig. 8.58. For
this case, the threshold current is about 1.06 A. This inset clearly demonstrates a sharp
narrowing of the emission spectra above the laser threshold: the spectra reduce to a sharp
peak at I = 1.1 A > Ith .
Thus, the unipolar cascade laser is drastically different from the lasers based on
intersubband phototransitions. The properties of intraband phototransitions and, consequently, the properties of the unipolar laser are determined to a large degree by
quantum confinement; accordingly, this novel laser can be tailored for operation in
Nanostructure devices
GaInAs
AlGaInAs (graded)
1.0 1018
1.01018
670.0
30.0
AlInAs
5.0 1017
1500.0
AlInAs
1.5 1017
1000.0
1.5 1017
18.6
Active region
undoped
21.1
GaInAs
1.0 1017
300.0
14.6
18.6
Active region
undoped
21.1
GaInAs
1.0 1017
300.0
1.5 1017
33.2
AlInAs
5.0 1017
500.0
Waveguide
cladding
Contact
layer
20.0 nm
Waveguide core
GaInAs (Sn-doped)
25
Waveguide
cladding
306
Figure 8.57 A schematic cross-section of the cascade laser structure. The whole structure
consists of 500 layers. Reprinted with permission from F. Capasso, Quantum cascade laser,
C 1994 AAAS.
Science, 264, 553556 (1994).
the spectral region from the middle infrared to submillimeter waves. Quantum cascade lasers are very sensitive to the ambient temperature and typically work at reduced
temperatures.
In conclusion, we have shown that nanostructures play a key role not only for downscaled electrical devices, but also for optical devices. They facilitate improvements in
bipolar injection lasers and make it possible to realize cascade-laser structures, which
work on the basis of innovative concepts.
8.7
307
Figure 8.58 The measured optical power P from a single facet of the quantum-cascade laser with
the structure presented in Fig. 8.57 and an optical cavity length of 1.2 mm. The results are given
for four different temperatures. The insets on the left show the dependence of the laser threshold
current as a function of temperature and the currentvoltage characteristics of the device. The
inset on the right of the figure shows the output spectrum for a 1.2-mm-long laser below and
above threshold (Ith = 1.06 A) at 80 K heat-sink temperature. Above 0.4 A the luminescence
peak was found to grow superlinearly with current due to optical gain causing the line to narrow.
After F. Capasso, Quantum cascade lasers: a unipolar intersubband semiconductor laser, in
Proceedings of the International Conference on the Physics of Semiconductors (Singapore,
World Scientific, 1995), pp. 16361640.
systems (NEMSs) was described in Section 5.9. Here, we will discuss a few particular
NEMS devices.
0 S
1 + 2 .
C
d
d
d
If a time-dependent voltage V (t) is applied to the capacitor, its electrical energy becomes
E el =
1
C V 2 (t).
2
308
Nanostructure devices
Let M and K be the mass and the spring constant of our resonator, respectively. Then,
Newtons second law for the resonator plate takes the form
M
d2 z
dz
dE el
+ M
+ Kz = f =
.
dt 2
dt
dz
(8.51)
Here, we introduce the term M dz/dt describing the damping of the resonator. The
force acting on the resonator is
2z
0 S 2
;
(8.52)
f = 2 V (t) 1 +
d
d
i.e., it is proportional to the square of the driving voltage, V (t). Importantly, the force
contains a term proportional to the displacement z. We can characterize the resonator by
0 S V 2 (t)
FD (t) =
,
(8.54)
d2
we can rewrite the force equation as
0 dz
!K (t)
d2 z
2
+
+
(8.55)
+
z = FD (t).
0
dt 2
Q dt
M
The physical significance of each of the quantities !K (t) and FD (t) is obvious: !K (t)
describes a parametric modulation of the spring constant by an applied voltage, while
FD (t) is the displacement-independent driving force. The latter equation is the simplest
basic equation for the effect of parametric resonance: the parameters !K and FD have the
309
same time dependence; i.e., they are both proportional to V 2 (t). If the voltage depends on
time harmonically, V sin(t), we find that both the driving force and the spring constant oscillate with the driving frequency, which varies with time as cos(2t). In general,
a NEMS can be fabricated with two or more capacitor plates; then, different voltages
can be applied to these capacitors. Thus, to generalize the analysis we may assume
that the spring-constant modulation, !K , and the driving force, FD , have independent
modulation frequencies, P and D :
!K = !K 0 sin(P t);
FD = F0 cos(D t + ).
Here, denotes a phase shift between these two dependences. More specifically, the
basic equation now becomes:
d2 z
0 dz
!K 0
2
+
(8.56)
+ 0 +
sin(P t) z = F0 cos(D t + ).
dt 2
Q dt
M
This equation is the so-called Mathieu equation with damping. Analysis of this
equation yields the following results. First, let the parametric modulation be absent
(!K 0 = 0), then the equation describes vibrations of a resonator with frequency 0 .
For small damping (Q 1), vibrations will manifest a strong resonance at the driving
frequency, D 0 :
z(t) = A cos(0 t) + B sin(0 t),
(8.57)
F0 Q
F0 Q
sin ,
B=
cos .
(8.58)
A=
K
K
This result shows that, for an oscillator with a quality factor Q , the vibration
magnitude diverges.
If a parametric modulation is tuned by a small driving force, additional strong resonances arise at P = 20 /n, with n being an integer. That is, the parametric modulation
resonates for all submultiples of the frequency 20 . Consider, for example, the case of
n = 1, i.e., P = 20 and D = 0 . In the limit of high Q, it is possible to find a solution
in the form of Eq. (8.58) with
F0 Q
F0 Q
sin
cos
,
Bp =
.
(8.59)
Ap =
K 1 Q !K 0 /(2K )
K 1 + Q !K 0 /(2K )
Thus, the oscillations depend essentially on the parametric coupling parameter !K 0 ,
and the amplitudes of the mechanical vibrations can be controlled by these parameters.
By comparing these results with and without parametric coupling, i.e., Eqs. (8.59) and
(8.58), we can define the parametric gain,
A2p + Bp2
.
G=
A2 + B 2
Using the above formulas for the coefficients A, B, Ap , and Bp , we find
1/2
cos2
sin2
G=
+
.
[1 + Q !K 0 /(2K )]2
[1 Q !K 0 /(2K )]2
Nanostructure devices
10
310
j = p/2
0.1
0.01
j=0
0.1
Q K 0/(2K)
Figure 8.60 The gain/loss coefficient G as a function of the quality factor Q for different phase
shifts between spring-constant modulation and the driving force. From A. N. Cleland,
Foundations of Nanomechanics (Berlin, Springer-Verlag, 2003).
For !K = 0, the gain depends on the relative phase . If = 0, , 2, . . ., the amplitudes of the vibrations are suppressed and G < 1. If = /2, 3/2, 5/2, . . ., the
vibrations are amplified and G > 1. Formally, if !K 0 2K /Q, the amplitude diverges
with G . This phenomenon is called parametric resonance. Both de-amplification
and amplification regimes are illustrated by Fig. 8.60. We can conclude that, if the quality
factor, Q, is large, an amplification of the amplitude of vibrations can be obtained even
for small modulations of the driving force.
The resonator of a micromechanical parametric oscillator, shown in Fig. 8.59, is
fabricated by the SIMOX technique. The dimensions of the suspended resonator plate
are 4 m 4 m 0.2 m. The substrate is grounded. A metal is deposited onto
the resonator plate. The resonator is driven by a voltage applied between the plate
and the substrate. The fundamental frequency of the resonator is 0 /(2 ) = 485 kHz.
Thus, the amplified signal is at this frequency, while the parametric drive is at P = 20 .
These parameters correspond to the lowest resonance in the Mathieu equation (8.56). To
detect and measure displacements, one can use reflection of a laser beam from the
substrate and the resonator. Interference of these two reflected signals yields good displacement sensitivity. In Fig. 8.61, the measured square of the oscillation amplitude is
presented as a function of the pump amplitude for such a parametric oscillator. The
maximum amplification, G, achieved is about 10.
The resonance properties of NEMSs will undoubtedly be employed in a broad range of
applications. Obviously, one of the principal areas will be signal processing in the veryhigh-frequency (VHF), ultra-high-frequency (UHF), and microwave-frequency bands.
311
10
0.1
0.00
--0.05
0.10
response
fit
0.15
0.20
displacement sensor
(fiber optic interferometer)
force detector
(NEMS resonator)
gradient magnent
(nanoscale magnet)
excitation field
(rf coil or microstrip)
surfaces
of constant
magnetic field
resonant
slice
sample
H1 cos(w0t)
Figure 8.62 The principal scheme of mechanically detected magnetic resonance imaging. After
M. L. Roukes, Nanoelectromechanical systems, in Technical Digest of the 2000 Solid-State
and Actuator Workshop, Hilton Head Island, SC, pp. 110.
techniques still take about 1014 1016 nuclei to generate a measurable signal of the magnetic resonance. This means that state-of-the-art MRI in research laboratories attains,
at best, a maximal resolution (minimum voxel size) of the order of 1 m. For standard
clinical MRI, the resolution yielded by commercial instrumentation is much poorer, with
a voxel size of about 1 mm.
Mechanically detected MRI, now commonly called magnetic resonance force
microscopy (MRFM), is significantly more sensitive than conventional MRI. There are
several principal components of a MRFM instrument, which are presented in Fig. 8.62.
An antenna structure in the form of a coil or a microstripline generates a radio-frequency
field of frequency 0 . The static magnetic field created by a miniature magnet splits the
spin states of impurities within a sample and provides a resonance interaction of the
312
Nanostructure devices
Figure 8.63 The electron shuttle a device with electron transport provided through mechanical
motion of the cantilever. C is a nanosized cantilever, S and D are the source and drain contacts,
and G1 and G2 are the gate contacts, which move the cantilever. The inset depicts the electric
circuit of the structure. From A. N. Cleland, Foundations of Nanomechanics (Berlin,
Springer-Verlag, 2003).
radio-frequency field with these spins. The magnet is attached to a mechanical resonator
of a cantilever type. The interaction of the resonant spins with the magnet results in
a time-varying back-action force upon this cantilever. This force can be detected by a
sensor with high resolution of displacements, such as an optical interferometer. All of
these elements constitute a resonant force sensor. Such a sensor can detect the extremely
weak forces exerted by the resonant spins upon the mechanical system. Microscopy is
realized by scanning the cantilever with attached magnet over the sample. By correlating
the resonant mechanical response with the cantilever position, one can obtain spatial
imaging of spin density.
The resonance properties of NEMSs provide key advances for MRFM. Indeed, utilization of a nanometer-scale ultra-high-frequency mechanical resonator makes it feasible to
couple directly the spin precession and mechanical vibrations and to improve drastically
the resolution of MRI.
The field of MRFM is still very much in its infancy. Sustained efforts are required in
order to take it from a scientific demonstration to a useful technique for high-resolution
MRI. With its potential for atomic resolution, such efforts are of great potential importance, especially for biochemical applications.
313
45
300 K
I (nA)
30
15
I (nA)
I (nA)
5
0
50
100
f (MHz)
4
2
1.0
2.0
V (V)
50
100
f (MHz)
Figure 8.64 Electric current in the electron shuttle versus frequency in the gate circuit. Upper
inset: the cantilever is blocked and there is no current. Lower inset: the shuttle current as a
function of the sourcedrain voltage at a given frequency in the gate circuit. From A. N.
Cleland, Foundations of Nanomechanics (Berlin, Springer-Verlag, 2003).
electron micrograph of the electron shuttle is presented. Two gates are fabricated to
drive the cantilever electrostatically. The cantilever can be driven by the voltage, VG1 G2 ,
so that the cantilever can nearly contact each of the spatially separated electrodes. A
voltage VSD is applied to the electrodes, which can be considered as a source and a
drain. If the frequency of the voltage applied to the gates, VG1 G2 , does not coincide
with the resonance frequency of the cantilever, the amplitude of cantilever vibrations is
small. Accordingly, the cantilever does not mechanically contact the source and drain
electrodes, and electron transport is suppressed. As soon as the frequency VG1 G2 matches
the mechanical resonance, the cantilever contacts the source and drain during each cycle
of the mechanical motion. When the cantilever contacts the electrodes successively, the
metallized end of the cantilever charges and discharges, and transfers electrons between
the source and the drain. In Fig. 8.64, results of measurements of the sourcedrain current
are presented; several peaks in the current correspond to different resonant vibration
modes of the cantilever. Only for these resonances are the amplitudes of the vibrations
large enough to support electron transfer through the nanostructure. If the motion of the
cantilever is blocked, no current is observed, as seen from the upper inset to Fig. 8.64.
The lower inset shows the magnitude of one of the current peaks as a function of VSD
at a given resonance frequency. Thus, this NEMS actually presents an electron shuttle.
The number of electrons transferred depends on the dimensions of the metallized island
314
Nanostructure devices
Figure 8.65 A set of SiC NEMSs. Submicrometer double-clamped SiC beams exhibit
on the cantilever. By decreasing the size of the island, it becomes feasible to transfer a
single electron at a time.
Following these examples of NEMSs with different functions, we consider the basic
parameters that determine the performance of any NEMS. Actually, these parameters
are practically the same as for an electronic device: the response time or characteristic
frequency, the quality factor (energy loss), the operating power, the signal-to-noise ratio
(sensitivity), etc.
Frequency
We start by considering a NEMS as a mechanical vibrator at a natural angular frequency,
315
Table 8.2 Estimated fundamental frequency versus geometry for SiC, Si, and GaAs mechanical resonators
Resonator dimensions
L x L y L z in (m)
Boundary conditions
10 0.2 0.1
1 0.05 0.05
Material
Both ends clamped or free
Both ends pinned
Cantilever
After M. L. Roukes Nanoelectromechanical systems, in Technical Digest of the 2000 Solid-State Sensor and
Actuator Workshop, Hilton Head Island, SC, pp. 110.
Quality factor
Another important parameter, which characterizes the rate of energy dissipation in a
NEMS, is the so-called quality factor, Q. If we define a decay time of the flexural
vibrations of the beam in a NEMS, d , then, Q 0 d . The quality factor achieved for
a NEMS in a moderate vacuum is in the range from 103 to 105 . This greatly exceeds the
quality factor typically realized for electrical (microwave) resonators. The small degree
of internal energy dissipation in a NEMS directly relates to the high attainable force
sensitivity and low operating power levels.
316
Nanostructure devices
Quality factor, Q
100 MHz
100 MHz
1 GHz
1 GHz
10 000
100 000
10 000
100 000
40 aW
4 aW
400 aW
40 aW
necessary to operate with a NEMS using the flexural mode can be obtained as follows.
At equilibrium the average energy of such a mode is equal to the thermal energy kB T .
To perform an operation, the energy of the external input signal should be larger than the
thermal energy. The characteristic time scale for energy exchange between the mode, at
frequency 0 , and its surroundings is just the decay time, d = Q/0 . Thus, the minimum power of the signal which has to be applied to the system to drive it to an amplitude
above the thermal fluctuations is
P > Pmin = kB T,
d =
k B T 0
.
Q
In Table 8.3, we present values of Pmin for various frequencies and quality factors at room
temperature. As displayed in Table 8.3, this minimum power can be extremely small for
a NEMS. For device dimensions accessible today via electron-beam lithography, the
characteristic level is of the order of tens of attowatts (1017 W!). This is many orders
of magnitude smaller than the power dissipation in contemporary systems of similar
complexity based on digital devices that work solely with electric signals.
317
Simultaneously, the signal amplitude of vibrations for a mechanical device (in a linear
operation regime) scales downward in direct proportion to its size. It is important to estimate the acceptable level of operational displacements of the suspended structure, i.e.,
the so-called dynamic range for a linear NEMS. To estimate the characteristic dynamic
range, we need to define the displacement amplitude corresponding to the onset of nonlinearity. This nonlinearity implies that, in a power-series expansion for elastic potential
energy, the largest term beyond the quadratic term (i.e., beyond the Hookes-law term)
becomes important. For a double-clamped beam, this condition translates into the relation
!z N 0.5L z . This criterion depends only upon the beam thickness, L z , in the direction
of vibrations. The linear dynamic range for a NEMS can be defined as the ratio of the
nonlinearity onset, !z N , to the thermal displacement, !z T : DR = 10 ln(!z N /!z T ).
The following example highlights the characteristic displacements and the dynamic
range in a relatively small NEMS. Consider a suspended Si structure of dimensions
0.1 m 0.01 m 0.01 m with a quality factor Q = 104 at T = 300 K. Then, we
find !z T 0.09 nm and !z N 5 nm. For these parameters, a high value of the dynamic
range, DR, of approximately 40 is obtained.
These considerations and estimates indicate two very crucial areas for NEMS engineering necessary to provide femtowatt to picowatt regimes: (i) development of ultra-sensitive
transducers that are capable of enhanced displacement resolution with increasingly higher
frequencies as device sizes are progressively scaled downward and (ii) development of
techniques tailored to operate over the entire dynamic range of the NEMS. As we studied
previously, nowadays, probe microscopy techniques are capable of probing and measuring quantitatively estimated ultimate displacements.
In conclusion, a nanoelectromechanical system consists of a nanometer-tosubmicrometer-scale mechanical resonator that is coupled to an electronic device of
comparable dimensions. The mechanical resonator may have a simple geometry, such
as a cantilever (a suspended beam clamped at one end) or a bridge (a suspended beam
clamped at both ends) and is fabricated from materials such as silicon using lithographic and other techniques similar to those employed for fabricating integrated circuits. Because of their submicrometer and nanoscale size, the mechanical resonators can
vibrate at frequencies ranging from a few megahertz up to around a gigahertz. The quality factors of these resonators greatly exceed those of typical microwave resonators. A
NEMS operates with low power dissipation in a wide dynamic range. These properties of
NEMSs open the way to a number of applications, ranging from signal processing to novel
detectors.
8.8
318
Nanostructure devices
- cell
- physical interaction
Figure 8.66 A schematic picture of a cellular array with interconnections via physical interaction.
319
(b)
(a)
P = 1
P = +1
Figure 8.67 A quantum-dot cell consisting of five dots with two electrons in the cell: (a) the
Coulomb repulsion causes the electron to occupy the antipodal sites; and (b) two bistable states
result in different polarizations. From S. Craig, P. Lent et al., Bistable saturation in coupled
quantum dots for quantum cellular automata, Appl. Phys. Lett., 62, 714 (1993). Reused with
permission from S. Craig, P. Lent, D. Tougaw, and W. Porod, Applied Physics Letters, 62, 714
C 1993 American Institute of Physics.
(1993).
1.0
P1
0.5
0.0
0.5
1.0
1.0
cell 2
0.5
cell 1
0.0
0.5
1.0
P2
Figure 8.68 The cellcell response. A polarized cell 1 implies the polarization of cell 2 due to the
Coulomb repulsion of the electrons populating the cells. The strong nonlinearity of the cellcell
response plays the same role as the gain in a conventional digital device. From S. Craig, P. Lent
et al., Bistable saturation in coupled quantum dots for quantum cellular automata, Appl. Phys.
Lett., 62, 714 (1993). Reused with permission from S. Craig, P. Lent, D. Tougaw, and W. Porod,
C 1993 American Institute of Physics.
Applied Physics Letters, 62, 714 (1993).
The Coulomb repulsion between the electrons in a cell tends to place them at antipodal
sites in the square. For an isolated cell, there are two energetically equivalent arrangements of the extra electrons, which are denoted as cell polarizations, P = +1 and
P = 1. The polarization is used to encode binary information. For example, if P = +1
represents a binary 1, then P = 1 can represent a binary 0. The two polarization states
of a cell will not be energetically equivalent if another cell is nearby. Figure 8.68 shows
how one cell is influenced by the state of the neighboring cell. The inset illustrates two
cells where the polarization P1 is determined by the polarization of the neighbor, P2 ,
320
Nanostructure devices
1
(a)
fixed driver
(c)
1 1
corner
wire
1
(b)
(d)
fan-out
inverter
1
Figure 8.69 Examples of the simplest quantum-dot cellular-automata arrays: wire, corner,
fan-out, and inverter. From Y. Ono, A. Fujiwara et al., Manipulation and detection of single
electrons for future information processing, J. Appl. Phys., 97, 031101 (2005). Reused with
permission from Yukinori Ono, Akira Fujiwara, Katsuhiko Nishiguchi, Hiroshi Inokawa, and
C 2005 American Institute of
Yasuo Takahashi, Journal of Applied Physics, 97, 031101 (2005).
Physics.
through the interaction of the electrons in the cells. Let us assume that the polarization
P2 has a given value corresponding to a certain arrangement of the charge in the cell 2.
Owing to the charge repulsion, the response of cell 1 is given by the strongly nonlinear
dependence presented in Fig. 8.68: a small asymmetry of charge in cell 2 is sufficient
to break the degeneracy between the two possible states of cell 1 and leads to the same
configuration of cell 1. Thus, in some cell arrangements, fixing the polarization of a cell
at an edge by an external bias determines the polarization of the cells at other edges. The
polarization is dependent on the cell configuration.
In the quantum-dot cellular-automata approach, the circuit is built by forming a
tree of cells. Figure 8.69 shows some elements of the quantum-dot cellular automata.
Figure 8.69(a) is a binary wire. The polarization of the leftmost cell is fixed, which represents the input. Then, all the other cells, including the output cell, align with the same
polarity because it is the most energetically favorable. Flipping the polarity of the
input cell results in the flipping of all the other cells. During this procedure, no
direct current flows in the circuit. Cells positioned diagonally from each other tend
to anti-align. This feature is employed to construct other logical elements. A fanout, a corner, and an inverter are shown in Figs. 8.69(b), 8.69(c), and 8.69(d),
respectively.
In conclusion, quantum-dot cellular-automata systems exploit the interactions between
quantum dots on the nanoscale. They are able to perform all processes necessary for signal
processing. No current is used in the circuits built on the basis of quantum automata. The
quantum-dot cellular-automata approach presents a nanoscale alternative to conventional
microelectronics and nanoelectronics.
8.9
321
Closing remarks
In this chapter we have considered several different nanostructure devices. It is convenient
to classify these devices into two categories. The first are the devices which are based
on usual classical principles of operation but considerably scaled down. This scaling
down facilitates the improvement of device performance and solves some fundamental
difficulties, which can not be overcome in microelectronics. These devices include fieldeffect transistors and bipolar transistors.
Another category is nanostructure devices based on new physical principles. Among
these are resonant-tunneling diodes the simplest quantum devices. Since these devices
have nanoscale dimensional features, they have extremely short transit times for carrier
transport through the structures, which results in the possibility of generating ultra-highfrequency electromagnetic oscillations up to the terahertz frequency range. As examples
of new principles for three-terminal electronic devices (transistors), we analyzed the
quantum-interference and hot-electron transistors, which have great potential for ultrahigh-speed operation.
To illustrate the contribution of nanostructures to optical devices, we have focussed
on the examples of quantum-well and quantum-wire lasers with bipolar injection, and
the multilayered quantum-cascade laser. The latter is a monopolar device, i.e., it uses
only one type of carrier (electrons). Exploitation of nanostructures results in a dramatic
decrease of the threshold current necessary for laser generation and in a widening of
emission spectra.
We have analyzed some nanoelectromechanical systems with mechanical resonators
that can vibrate at frequencies ranging from a few megahertz up to 1 GHz and have
quality factors greatly exceeding those of typical microwave resonators. This implies that
nanoelectromechanical systems operate with low power dissipation. These properties
open the way for a number of applications, ranging from signal processing to novel
sensors/detectors.
Finally, we have described innovative systems for signal processing based on quantumdot cellular automata. These automata exploit interactions between quantum dots on the
nanoscale. This approach presents a nanoscale alternative to conventional microelectronics and nanoelectronics.
The description of nanostructured devices which we presented here is not complete. More information on traditional scaling down to nanosize devices, double-barrier
resonant-tunneling diodes, and lasers can be found in the following books:
M. Shur, Physics of Semiconductor Devices (Englewood Cliffs, NJ, Prentice-Hall,
1990).
S. M. Sze, High-Speed Semiconductor Devices (New York, John Wiley & Sons, Inc.,
1990).
V. V. Mitin, V. A. Kochelap, and M. A. Stroscio, Quantum Heterostructures (New
York, Cambridge University Press, 1999).
Single-electron transport is described in the following references:
322
Nanostructure devices
Name
Symbol
Length
Mass
Time
Electric current
Temperature
Amount of substance
meter
kilogram
second
ampere
kelvin
mole
m
kg
s
A
K
mol
Name
Symbol
Equivalent
Plane angle
Solid angle
Speed, velocity
Acceleration
Angular velocity
Angular acceleration
Frequency
Force
Pressure, stress
Work, energy, heat
Impulse, momentum
Power
Electric charge
Electric potential, emf
Resistance
Conductance
Magnetic flux
Inductance
Capacitance
Electric field strength
Magnetic flux density
Electric displacement
Magnetic field strength
Celsius temperature
Luminous flux
Illuminance
Radioactivity
Catalytic activity
radian
steradian
rad
sr
hertz
newton
pascal
joule
Hz
N
Pa
J
watt
coulomb
volt
ohm
siemens
weber
henry
farad
W
C
V
S
Wb
H
F
tesla
degree Celsius
lumen
lux
becquerel
katal
m/m = 1
m2 /m2 = 1
m s1
m s2
rad s1
rad s2
s1
kg m s2
N m2
N m, kg m2 s2
N s, kg m s1
J s1
As
J C1 , W A1
V A1
A V1 , 1
Vs
Wb A1
C V1
V m1 , N C1
Wb m2 , N A1 m1
C m2
A m1
K
cd sr
lm m2
s1
mol s1
C
lm
lx
Bq
kat
324
Appendix
Symbol
Value
Units
m s1
Elementary charge
Electron mass
Electron charge to mass ratio
Proton mass
Boltzmann constant
Gravitation constant
Standard acceleration of gravity
Permittivity of free space
e
m0
e/m 0
mp
kB
G
g
0
h
0 = 0 /
0
NA
2.9979 108
3 108
1.602 1019
9.11 1031
1.76 1011
1.67 1027
1.38 1023
6.67 1011
9.807
8.854 1012
1019/(36)
4 107
6.6256 1034
376.73 120
6.022 1023
C
kg
C kg1
kg
J K1
m3 kg1 s2
m s2
F m1
H m1
Js
mol1
Abbreviation
Meaning
Prefix
Abbreviation
Meaning
attofemtopiconanomicromillicentideci-
afpnmcd-
1018
1015
1012
109
106
103
102
101
dekahectokilomegagigaterapetaexa-
dahkMGTPE-
101
102
103
106
109
1012
1015
1018
Si unit
Gaussian units
Length
Mass
Force
Energy
1m
1 kg
1N
1J
102
103
105
107
cm
g
dyne = 105 g cm s2
erg = 107 g cm2 s2
Index
carbon nanotubes, 98
carboxyl endgroups, 142
cascade structure, 305
cathode, 189
charge-limited transport, 201
chemical radicals, 117
chemical-vapor deposition, 112
Childs law, 198
chiral angle, 99
classical ballistic regime, 170
classical description, 22
classical regime, 170
classical transport regime, 170
coherence length, 169
coherent tunneling, 245
coherent waves, 19
collector, 245
collisionless motion, 8
complementary, 149
complex conductivity, 187
compound semiconductors, 68
concept of hole, 76
conduction, 70
constructive interference, 19
Coulomb blockade, 212
covalent, 69
criterion for laser oscillations, 298
cross-sectional STM, 122
crystal, 68
crystalline potential, 74
cubic symmetry, 72
cut-off wavelength, 290
CVD, 112
cyclic boundary conditions, 75, 101
dangling bonds, 134
DBRTD, 8
de Broglie wavelength, 28
degeneracy of states, 57
degenerate state, 174
density of states, 180
density of the distribution, 173
dephasing length, 170
326
Index
destructive interference, 19
diagonal components, 91
diamond lattice, 72
differential resistance, 191
diffusion, 168
diffusive classical size effects, 170
diode, 189
dip-pen nanolithography, 155
direct-bandgap semiconductor, 82
discrete energy levels, 41
discrete energy portions, 24
discrete energy spectrum, 36
dispersion relation, 16
distribution function, 173
donors, 118
double-barrier resonant-tunneling diode, 8
drain, 255
DRAM, 3
Drude formula, 188
dynamic range, 317
field-effect devices,
field-effect transistors, 255
finite conductance, 206
finite lattice mismatch, 90
finite resistance, 206
first Brillouin zone, 75
flat band, 293
flexural deformations, 159
flexural modes, 315
forward bias, 278
frequency of single-electron tunneling
oscillations, 213
fullerenes, 98
Hamiltonian, 12
harmonic oscillator, 45
heavy hole, 81
helicity, 98
Hermite polynomial, 46
heterojunction field-effect transistor, 93
heterostructure field-effect transistor, 261
heterostructures, 5
high (classical) frequencies, 172
high-electron-mobility transistors, 258
highest occupied molecular orbital, 153
highly degenerate electron gas, 176
hole, 76
hot-electron effects, 165
hot electrons, 193
hot-electron transistors, 280
hybridized orbitals, 61
impedance, 188
incident waves, 20
incoherent waves, 19
indirect-bandgap semiconductor, 82
induced-base transistor, 283
inelastic, 168
inelastic scattering, 168
inelastic scattering length, 168
injection, 190
inorganic nanotube, 102
insulators, 66
interband phototransitions, 289
intraband absorption, 304
intrinsic density, 118
intrinsic material, 118
isolated islands, 130
isotropic effective mass, 79
Index
327
328
Index
Index
329