The Design of High Performance Mechatronics
The Design of High Performance Mechatronics
IOS Press BV
Nieuwe Hemweg 6b
1013 BG Amsterdam
The Netherlands
tel: +31-20-688 3355
fax: +31-20-687 0019
email: [email protected]
www.iospress.nl
LEGAL NOTICE
The publisher is not responsible for the use which might be made of the
following information.
Preface xvii
Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . xix
Summary of the contents . . . . . . . . . . . . . . . . . . . . . . . xx
v
vi Contents
7.4.3
Diffraction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 505
7.4.3.1 Amplitude gratings . . . . . . . . . . . . . . . . 506
7.4.3.2 Phase gratings . . . . . . . . . . . . . . . . . . . 508
7.4.3.3 Direction of the incoming light . . . . . . . . . 515
7.4.4 Imaging quality based on diffraction . . . . . . . . . . . 515
7.4.4.1 Numerical aperture and f-number . . . . . . . 519
7.4.4.2 Depth of focus . . . . . . . . . . . . . . . . . . . . 522
7.5 Adaptive optics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 525
7.5.1 Thermal effects in optical imaging systems . . . . . . . 525
7.5.2 Correcting the wavefront . . . . . . . . . . . . . . . . . . . 527
7.5.2.1 Zernike modes . . . . . . . . . . . . . . . . . . . . 528
7.5.2.2 Adaptive optics as correction mechanism . . . 532
7.5.3 Principle of operation . . . . . . . . . . . . . . . . . . . . . 533
Appendix 733
Recommended other books . . . . . . . . . . . . . . . . . . . . . . . . . 733
Nomenclature and abbreviations . . . . . . . . . . . . . . . . . . . . . 737
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 746
Preface
Motivation
xvii
xviii Preface
Besides much material from our own experience, this book also includes
material created by many other people.
Several university staff members and students have contributed to and
reviewed the material. Unfortunately it is impossible to mention all without
forgetting some names so as example only the three most important students
are mentioned.
The first is Ton de Boer, who accepted the impossible task as MSc-student to
write the rough material that started this book as lecture notes by following
the lectures on Mechatronic System Design. Initially, in spite of professional
advice, this writing was done in a well-known WYSIWYG program and only
later it was transferred into LATEXwhich proved indeed the only realistic
way to create a professional technical textbook.
Leon Jabben and Jonathan Ellis have been working as PhD-students at our
laboratory in Delft and parts of their theses are used in the measurement
chapter.
Our partners from industry deserve gratitude for their support, financially,
in equipment or advice, by permission to use company illustrations or by
reviewing the material. The three most important to mention are the Dutch
high-tech company ASML and the metrology companies Heidenhain from
Germany and Agilent Technologies from the United States.
From ASML especially Hans Butler, Patrick Tinnemans and Jan Mulkens
(thanks to the volcano on Iceland!) have helped in reviewing some chapters.
We further thank all other companies and individuals that kindly gave
permission to use their illustrations. These all are separately mentioned at
the related figures.
It is true to say that this textbook is based on the knowledge of many others
as laid down in books, patents and journal articles. For reason of readability
we decided not to include references in the text but instead we included a
list of the most relevant books that we found to be applicable.
Finally also a word of respect and gratitude should be given to the many
contributors of Wikipedia. Even though this huge source of information is
not always as consistent and flawless as might be required by the scientific
community, Wikipedia has proven to be very useful to quickly find the
right physical and mathematical terms or derivations. It also provided
information about small trivia like the date of birth or the full name of a
famous scientist from the past.
xx Preface
This book is written in such a way that it is useful both for a high-level
student who wants to learn about advanced mechatronics and for engineers
in the high-tech industry who want to learn more about adjacent special-
isations. To accommodate this dual approach, the first and last chapter
determine the environment that makes use of the material of the theoretical
chapters in between. It is not a surprise that this first and last chapter are
connected by the wafer scanners of ASML as these might well be the most
advanced mechatronic systems that are ever designed.
The nine chapters are summarised as follows:
The introduction in Chapter 1 gives the context of mechatronics in the
Dutch high-tech industry with the historical background, some general
observations on the international differences in approach towards mecha-
tronics and the close link with “Systems Engineering”. Subjects include
the development of the optical Video Long Play (VLP) disk and the wafer
stepper at Philips Research Laboratories. These developments have strongly
determined the dominant foothold of high-precision mechatronic system
design in the Netherlands and are exemplary for the specific photon-physics
oriented approach in this country, so quite different from the machining
oriented approach in most other countries. The overview on systems en-
gineering and design introduces some functional design and development
methodologies that have proved to be crucial for the success of the high-tech
industry. These methods are based on industrial practice where complex
multidisciplinary designs have to be realised. Systems Engineering is a
field closely related to mechatronics and the corresponding principles are
used in structuring the design of a mechatronic system.
Chapter 2 is the first of a series of chapters on the basic theory that is applied
in controlled motion systems. It consists of a short overview of the principles
of electricity, frequencies, waves and signal responses. The chapter
starts with basic electricity, the linking element in a mechatronic system.
Followed by signal theory this chapter explains the reason why the properties
of mechatronics are so often described in the frequency domain next to
the more mechanical oriented time-related step and impulse responses.
The chapter also introduces different graphical representations of these
responses as this material is used throughout most chapters in this book.
The hard-core of a mechatronic system is still the mechanics that represent
the real, dynamic, hardware world that has to be mastered when position-
ing objects in a controlled way. In most cases, the dynamic properties of
Preface xxi
Mechatronics in the
Dutch high-tech industry
This introductory chapter places the subject of this book in the context of
the rapid development of the high-tech industry in the Netherlands. This
industry has become a main driver of the economic growth in the industrial
region around Eindhoven with the multinational company “Royal Philips
Electronics” as the most prominent original source of technical innovation.
Based on their wide market scope with both consumer and industrial prod-
ucts Philips gave birth to several high-tech spin-off companies among which
ASML would become the global market leader in exposure equipment for
semiconductor manufacturing. The first part of this chapter will pay tribute
to the achievements within the research and development departments of
“Philips Gloeilampen fabrieken”, as the company was named at that time,
because these have determined the success of high performance mechatron-
ics in both high-tech and consumer equipment. After this short historical
overview, the second section will position the field of mechatronics on the
international playground where precision machining and manufacturing
appear to be the main application fields. The last section deals with systems
engineering as a design and development framework for the highly complex
mechatronic systems that are applied in the high-tech industry. It will also
give a general overview on the related development processes.
1
2 Chapter 1. Mechatronics in the Dutch high-tech industry
Figure 1.1: The Philips “Nat Lab” research laboratories in Eindhoven where the
real breakthrough in mechatronics was initiated.
(source: Philips Technisch Tijdschrift Vol.43 nr 2/3/4)
1.1. Historical background 3
Figure 1.2: The VLP Optical disk player had a contact-less optical read-out principle
with light from a Helium-Neon laser that was focused on the track of
the disk.
(source: Philips Technisch Tijdschrift Vol.43 nr 2/3/4)
In those days, only the gramophone record player and cassette recorder
were known for music playback and home registration, while for video
only the video cassette recorder (VCR) existed. Even though pre-recorded
videotapes would eventually become very popular, the general opinion was
at that time that tapes would always be too expensive for the consumer
market, due to the large number of parts and expensive material in one
tape-cassette. There were already several people in the world with the vision
that a gramophone record for video would fundamentally solve that problem,
because moulding a gramophone record is not costly at all. Unfortunately,
the registration of video images in analogue technology would have required
a bandwidth of more than 4 MHz for normal Television signals and that
is more than a hundred times higher than needed for the reproduction of
sound with a maximum frequency of 20 kHz in two-channel stereo. Based
on the fact that even this 20 kHz was quite difficult to achieve with a contact
stylus with diamond tip, including the unavoidable contamination of the
grooves, it was not considered possible to register video in a comparable
way! The breakthrough in thinking, that was required for the future success,
was based in the understanding that the contact method would need to be
replaced by a contact-less read-out principle with light. With the then newly
developed laser as a light source and the application of precision optics, a far
higher density of information could be registered than would ever be possible
4 Chapter 1. Mechatronics in the Dutch high-tech industry
Figure 1.3: The signal on a VLP disk is coded in pits on a spiral wound track with
a variable length and frequency at a fixed width and mutual distance.
The frequency of the pits determined the video signal and the length of
the pitsdetermined the audio signal.
For reason of efficiency, it was decided to directly write the analogue video
signal on the disk as a series of pits, tiny stripes with a variable frequency
and length. While the frequency contained the video information, the sound
was registered in the length of the pits as shown in Figure 1.3.
The pits were written in a spiral-wound track with a width of 0,4 μm at
a mutual distance of 1,6 μm on a transparent disk with the same size
as an ordinary gramophone record, a diameter of 0.3 m. The rotating
speed would be a factor ten higher however, with up to 30 revolutions per
second, in order to be able to handle the high frequency. Also different to
1.1. Historical background 5
Laser Lens
Semi-transparent
mirror
Out of focus scratch
Figure 1.4: The basic optical read-out principle of an optical disk by focusing a laser
beam on the track that is embedded in the optical disk and measuring
the reflected intensity with an optical sensor. A scratch on the surface
of the protecting layer is not detected because it is not in focus.
the gramophone record was the location of the track, that no longer was
embedded in a groove, but was hidden inside the disk and detected by means
of a small spot of light through the transparent covering layer, as shown
in Figure 1.4. This really appeared to be a revolutionary way of thinking.
Some of the competitors, like RCA, still tried to read the information written
in a groove on the surface with the help of a stylus. In this case the stylus
was only used to follow the track, while the high frequency signal was
detected with a local capacitive sensor on the tip of the stylus. With this
splitting of functions, following and detecting, they did succeed in getting
the mechanical system operational and it has even been some time on the
market with well protected disks in cassettes to prevent contamination.
This product was in itself a significant achievement for that time but when
compared with the contact-less optical read-out system of Philips it could
not survive. Especially the lack of sensitivity for surface scratches appeared
to be one of the big advantages of optical detection. The scratches are not
detected because they are not in the focal point of the detecting laser beam. A
problem with this principle was however the fact that no mechanical means
were available to follow the track and this created the need to look for ways
to actively control the position of the optical pick-up unit that was designed
to read the information on the disk. It was the solution to this problem
that determined the crucial breakthrough of precision mechatronics in the
Netherlands. From the signal of a special segmented photo-diode with some
additional optics, the relative position of the optical pick-up unit to the
track could be measured and with tiny linear moving-coil motors, similar
to the ones that are used in loudspeakers, it became possible to keep the
6 Chapter 1. Mechatronics in the Dutch high-tech industry
Opcal disk
Opcal Controller Drive
pick-up unit
Posion
Posion error
Loudspeaker Measurement
coils
Magnec
system
Figure 1.5: The active position control of the optical pickup-unit by measuring the
position from information in the reflected light and correcting position
errors by means of moving coil actuators.
pick-up unit right on track by means of a real fast position control system, as
shown in Figure 1.5. This so called servo-system was capable to correct the
different disturbances and imperfections in the track. Today this correction
is achieved even in the cheapest CD-players with special digital processors
in one IC that perform all the control tasks, but at that time this control
system could only be realised with operational amplifiers, RC-networks and
adjustable resistors.
The VLP was the first product that was brought on the market with optical
recording but in spite of the cooperation with MCA for the movie software,
the introduction under the name of “Laservision” was eventually not success-
ful. This was mainly caused by the lower than envisaged cost of the video
cassette, while the VCR-player could also record video signals directly from
television. The missing, at that time not sufficiently developed, possibility
to record optical disks on a consumer product, appeared to be the most
important drawback of the Laservision disk. This problem would be solved
some time later with recordable CDs but that came only after the second
breakthrough that reached almost every human being in the world, the in-
vention of the audio Compact Disc player of which one of the first commercial
products is shown in Figure 1.6. With many successful and less successful
successors like CD-Interactive, CDROM, CDRAM, DVD Superaudio CD
and Blu-Ray disk, this development really has made a major difference to
the entertainment industry. With the CD three major developments were
1.1. Historical background 7
Figure 1.6: The CD 303 was the first real high-end CD player of Philips. The
complete optical read-out mechanism with the rotating motor for the
CD and the rotary arm that carried the optical pick-up unit, was built
into a separate drawer that would enable to load the disk from the front.
This innovation would not have been easily possible with a normal
gramophone record player.
united, the digitisation of analogue signals, the optical disk principle of the
VLP and the invention of the semiconductor laser, that enabled a far smaller
design of the optical system. As an example of the miniaturisation, that
accompanied these developments, the complete mechanism of a CD-player
and the optical pick-up unit from a Blu-ray player are shown in Figure 1.7.
The design principles that are applied in these systems with the long-stroke,
short-stroke splitting of precision and range will return in the last chapter
of this book that describe the more recent developments in wafer scanners
that started as described in the following section.
8 Chapter 1. Mechatronics in the Dutch high-tech industry
Figure 1.7: The optical pick-up unit in the CD-mechanism on top of this picture is
carried by a simple and inexpensive linear moving stage that is driven
with a rotating motor and a screw-nut/gearwheel transmission. This
long-stroke linear drive roughly positions the pick-up unit to within the
capture range of the precision track-following servo-system with the
moving-coil motors of the pick-up unit. This basic long-stroke, short-
stroke positioning concept is refined in later developments like the
Blu-ray disk of which the optical pick-up unit is shown below the CD-
mechanism. This unit is designed to be compatible with all previous CD
and DVD related formats. For that reason three different laser sources
are integrated in one unit, working at three different wavelengths
while all light is optically combined to one spot by means of several
semi-reflecting cubes and lenses.
1.1. Historical background 9
Figure 1.8: The OM 200 was the first integrated circuit made by Philips in 1965.
The size was 0.75×0.75 mm2 and it integrated three transistors and
two resistors.
(source: Philips Technisch Tijdschrift Vol.43 nr 2/3/4)
Around the same time as optical recording was invented, Philips produced
its own semiconductors. To support this activity, within the Natlab several
research groups were active in the global battle to decrease the size of
the details in semiconductor based Integrated Circuits (ICs). The first
commercial integrated circuit of Philips is shown in Figure 1.8 with only a
very small number of components, but gradually the number of components
increased dramatically with sometimes millions of transistors, resistors
and capacitors that have to be realised on a single silicon substrate. These
electronic elements are all connected by a multitude of different layers with
wiring on top of the active elements, the lighter regions in Figure 1.8.
The technical support and design group within the Natlab supported this
research with in-house designed and manufactured precision equipment,
like the “Opticograph” that could write a mask for an IC by means of a
scanning focused light beam. Among these machines, the Silicon Repeater, of
which two generations are shown in Figure 1.9, was designed for exposing the
pattern of an integrated circuit. The first generation (Mark 1) was designed
together with the engineering department of Philips Semiconductor in
Nijmegen, while Silicon Repeater Mark 2 was purely made inside the Natlab.
10 Chapter 1. Mechatronics in the Dutch high-tech industry
Figure 1.9: The Silicon Repeaters Mark 1 and Mark 2 were the first wafer steppers
of Philips and achieved an imaging resolution in the micrometre range.
They were designed as laboratory tools to enable advanced research on
IC production technology.
(source: Philips Technisch Tijdschrift Vol.37 nr 11/12)
Both for the realisation of the active electronic components in the IC and
the related wiring, it is necessary to work the silicon by means of etching,
oxidising or changing the properties locally by different chemical elements.
The related process is called lithography which name is based on the ancient
Greek words “λιϑ oς” = stone and “γραϕειν” = writing.
The first step in this process is to “print” the desired pattern by means of
an optical exposure of a light sensitive resist layer. This exposure is done
by an optical system that images the mask (also called a reticle) on the
resist. Subsequently, after the development of the resist, the silicon can be
chemically treated on the spots where the resist was illuminated, as shown
in Figure 1.10. It is essential to note that the same step has to be repeated
up to around thirty times, where each different layers has to be defined in
one exposure cycle. All exposed layers have to be positioned with a high
accuracy relative to the previous layers. In Figure 1.11 a schematic overview
of this IC manufacturing process is shown.
1.1. Historical background 11
2: Develop resist
UV light
Recle
3: Etch Oxide
Resist
Si oxide
Si wafer
Figure 1.10: The lithographic process is a local chemical treatment of the silicon
substrate. The treated areas are determined by a resist layer that
contains a pattern that is previously imaged from the pattern on a
mask by an optical exposure system.
Exposure Developing
Photoresist and baking
Slicing
Polishing
!
"
"
#""
Figure 1.12: Schematic drawing of the main components of a wafer stepper. The
mask is imaged on the wafer with a demagnifying lens. Due to the
high opening angle that is required for the high resolution only small
areas can be exposed at one exposure at the same time. This requires
the wafer to be exposed in steps by means of a wafer stage.
that projects light through a transparency and a lens to the wall, a wafer
stepper projects light through a reticle and lens to a silicon wafer. The only
difference is the demagnification of the image on the silicon wafer which
is related to the required high resolution. To realise a high resolution, the
opening angle of the lens needs to be very high as will be explained in
Chapter 7 on optics. For this reason the image has to be very close to the
lens and light should come from all directions. Exposing the entire wafer in
one step would require a lens with a very large diameter which would be too
expensive to manufacture. It was concluded that only a small area could be
exposed in one exposure at the same time. This means that the wafer has
to be exposed in steps while it should be positioned very accurately between
the different exposures by means of a mechanism, the wafer stage. This
stepping motion has to be done extremely fast to not lose time and keep the
productivity on an acceptable level. For that reason the speed of this highly
accurate movement became one of the key aspects that drove the need for
perfection of mechatronics in these machines.
The most important and difficult part from a mechatronic perspective was
and still is the wafer stage. The accuracy of the positioning of the wafer in
the Silicon Repeater was in the same order of magnitude as the accuracy
of the previously described Video Long-play Disk with an error of signifi-
cantly less than a micrometre over a range of several tens of centimetres.
Instead of following a fast moving track, the wafer in a waferstepper had to
be positioned in steps followed by a perfect stand-still during exposure. This
standing-still had to be realised with a position repeatability of better than
0.1 μm on a wafer of at that time only 5 inch (≈ 12 cm), which is a better
accuracy than one over a million.
The first wafer stages were driven by hydraulic linear motors as schemat-
ically shown in Figure 1.13. Precision hydraulics was a known field of
expertise within Philips, originating from the design of high precision ma-
chining tools that were needed in the laboratory to manufacture for instance
optical parts. Hydraulic cylinders and servo-valves were a proven technology
for high precision positioning under several strict conditions. Friction had to
be avoided in the hydrostatic bearings and the temperature of the hydraulic
oil had to be actively controlled to around 100 mK in order not to influence
the measurement of the position by laser interferometers. With this wafer
stage a repeatability of 0.1 μm was realised by using special valves that
control the differential pressure over the hydraulic motors without any back-
14 Chapter 1. Mechatronics in the Dutch high-tech industry
Figure 1.13: The first wafer stages had hydraulic linear motors to achieve the
required stiffness, speed and accuracy. (source: commercial leaflet
Philips)
Figure 1.14: With electric linear motors the wafer stage became “cleanroom-
friendly”.
Figure 1.15: Essentially mono disciplinary solutions to a problem. They work, but
is the solution optimal?
18 Chapter 1. Mechatronics in the Dutch high-tech industry
Many definitions exist for the field of mechatronics. The first example finds
its origin in Japan where the term mechatronics was first officially deposited:
to optimise production.
A synergistic combination of
This definition is however very wide and covers about every engineering
subject ranging from controlling chemical processes to air planes. The rela-
tion with precision engineering is clear but precision is a relative concept.
Present modern industries are all working towards the maximum of their
technological capabilities. It is probably true to say that these definitions in
fact define mechatronics to be the modern version of mechanical engineering.
Mechanical engineering has in its history always integrated new emerging
technologies, like for instance thermodynamics and electrical power sources.
While the term “mechatronics” originated in the seventies of the last cen-
tury, it clearly appears to have become an inextricable part of the present
mechanical engineering discipline.
The above definitions and the international developments in the field unveil
some interesting observations regarding the differences in approach on
1.2. Definition and international positioning 19
system became much lower than would be the case with using piezoelec-
tric actuators, as would have been the common approach in the machining
industry.
The signatures of these two different applications are still very visible in the
present mechatronic playground. In the USA, where the competition in the
development and manufacturing of consumer electronic products is almost
completely vanished, the machining industry determined the mechatronic
research and development. Only in the last several years fast tool servo
systems with reduced mounting stiffness and high bandwidth for increased
precision are introduced, specifically for the manufacturing of high quality
optical parts.
In Japan a mixture of both approaches is practised, as in that country both
precision machining and consumer electronics are developed. Nevertheless
the more conservative machining approach is frequently observed which
sometimes hampers progress in their developments.
Europe is in this respect interesting in its own way. The largest part of
Europe followed along the lines of the developments in the USA and fo-
cused on precision machining and related mechatronics instead of consumer
products related technology. The Netherlands however went 100 % into
the other direction. First of all there is hardly any machining industry left,
while several world class companies in the high-tech industry are active in
that country applying precision mechatronics in professional printing, in
electron optic and x-ray imaging and in optical lithographic equipment. The
related photon-physics oriented mechatronic approach has been instrumen-
tal in the success of these industries and for that reason it is unavoidable
that this photon-physics oriented Dutch school of mechatronics resonates in
this book.
1.2. Definition and international positioning 21
Informaon Energy
Feedforward Electrical
Power Heatloss
Mechanical
Error Power
Posion +
setpoint Controller Amplier Actuator Mechanism
-
Measured posion
Sensor
Control electronics + so$ware Hardware
Figure 1.16: A mechatronic motion system consists of several elements that are all
required for the total functionality. The left side (yellow) only deals
with information. Although fully analogue electronic realisations are
still used, presently this function is mostly implemented in software
with digital electronics. At the hardware side (blue) both information
and energy is manipulated to achieve the final goal.
Figure 1.16 shows these elements in an overview. The arrows indicate the
flow of information and energy, most of which takes place in the electrical
domain. This connecting function of electricity within mechatronics is also
one of the connecting elements in this book.
22 Chapter 1. Mechatronics in the Dutch high-tech industry
Breadth of
knowledge
Generalist
Generalist
Specialist
Specialist
Specialist
Specialist
Specialist
Specialist
Specialist
Specialist
knowledge
Depth of
Figure 1.17: Both specialists and generalists are required to design a complex
system.
(Courtesy of G. Muller, Gaudi site)
1.3. Systems engineering and design 23
more a real issue as the capability to act as a real generalist with sufficient
knowledge to effectively communicate with specialists in a multitude of
disciplines is something one can hardly learn from books, but first and for
all by experience.
Based on this trend, a continuous effort is taking place to find methods that
help to keep the problems under control that are inherent to this increasing
complexity. In the following, some basic methods are described that are
widely accepted with clear benefits for a successful completion of a complex
system design.
1 www.incose.org
2 www.gaudisite.nl
24 Chapter 1. Mechatronics in the Dutch high-tech industry
Feedback loops
System design
Subsystem design
Figure 1.18: The V-model of systems engineering divides the design process of a
complex system both in time and in levels of complexity, in order to
organise the effort over specialists and generalists.
• In the “Detailed design” layer, the requirements for the detailed ele-
ments are translated into drawings, software code and realised parts
with a verified performance that can be assembled into functional
subsystems.
In very complex systems, the subsystem layer itself often consists of dif-
ferent layers. Each has its own requirements, specifications and realised
functionality.
The work in each layer has a different character. The deeper one goes to-
wards the bottom of the V-model the more detailed and concrete the work
will be. But also within one layer the work is different between the left and
the right leg, due to the fact that definition and realisation require different
skills. For this reason, often the people in the team are changed during
the project. This is not preferable from a learning perspective, because
experience with results of the realisation phase will improve the decisions
taken in the definition phase of a next project. Nevertheless, the work in
the left leg is in practice done primarily by innovator and creator types of
designers, while the right leg is realised by more completer and finisher
types of people. In any case, frequent feedback is necessary within a project
to overcome non compliances with the specifications that show up in the
verification stage. Communication is also necessary between projects where
experience from the past is used for continuous improvement over time.
These feedback loops over different people need continuous attention as they
do not happen automatically.
The subsystem layer is the place where many mechatronic systems are
designed, like motion stages and actively controlled optical imaging systems.
In the following section the work in these layers is described in the context
of a regular development according to the product creation process.
Figure 1.19 shows the time division of the product creation process that is
commonly used in industry. The phases 0, 1 and 2 correspond with the left
leg of the V-model while the phases 3, 4 and 5 correspond with the right leg.
From practical experience it is known that many mistakes are made during
the feasibility and definition phase, when not sufficient time and effort is
given to determine all factors that are at stake. In this early phase the
1.3. Systems engineering and design 27
0. 1. 2. 3. 4. 5.
Feasibility Denion System Engineering Integraon Field
design & test monitoring
Requirements
Specicaon
Design
Vericaon
Engineering
Most informaon
Core informaon
Legend: in dra$
50% available in heavier change control
concept
full under development preparing or updang work
Timeline
Figure 1.19: This standard time line of a product creation process is closely related
to the V-model and is more directly suitable for planning purposes.
The lower graph shows the yearly cost related to the effect of decisions
on the financial results of the project and emphasises the need to
especially control the first phases.
(Courtesy of G. Muller, Gaudi site)
decisions with often the largest impact on the success of a project are made
at the lowest relative cost. This relation is schematically shown in the lower
graph of the figure. Especially in this phase the systems engineer needs to
have a good overview on market demands and technological possibilities.
In most cases he closely cooperates with a product manager who takes
care of marketing issues and represents the end user of the product. This
combination of often only two people is to a large extent responsible for the
success or failure of a product. One might say that a badly started product
creation process can hardly be repaired while of course a good start can still
end up in a mess due to a bad execution. This duo of systems engineer and
product manager has to deal with the tension from the external market
in timing, product margins and functional requirements. They will define
the final set of requirements in a process as shown in Figure 1.20 that are
used as input for the development activities. They also have to keep these
requirements closely under change control. After the integration and test of
28 Chapter 1. Mechatronics in the Dutch high-tech industry
top-down
Key drivers
(customer, business)
Operaonal drivers
(logiscs, producon, et cetera)
Roadmap
(posioning and trends in me)
Compeon
(posioning in the market)
Connued
Regulaons Requirements Product Creaon
Process
Ideal reference design
Prototyping, simulaon
(learning vehicle) Feedback
Boom-up
(technological opportunies)
Exisng systems
boom-up
the product, this team is often also responsible for dealing with the market
response on the product.
At all levels in the V-model, a trade-off has to be made between specifications
of subsystems that have to work together. In the next section the related
process of requirement budgeting will be explained a bit further.
Balls
15 μm
Bearings
18 μm
Tolerances
Overall Actuaon Guiding 20 μm
60 μm 40 μm 30 μm
Stability
10 μm
Other Other
20 μm 20 μm
Other
10 μm
Figure 1.21: Even a simplified accuracy budget of a positioning system with many
independent error sources already shows the choices that have to be
made in close cooperation with the design teams that have to realise
the targeted budget.
1.3.0.6 Roadmapping
Figure 1.22 shows in a graphical way, that the process of developing prod-
ucts takes place in a continuous flow of changing external influences, the
increasing demands of the market together with the also increasing techno-
logical developments. In most cases these external influences can hardly
be influenced by the design team. A good example is the development of
the Personal Computer. Initially the combination of IBM, Intel and Mi-
crosoft could determine their market approach without the need to take the
competition into account. Inevitably however, at some moment in time, the
competition like Apple and the Unix based Linux community had arisen forc-
ing the seemingly monopolistic PC league to listen to their customers. This
observation has as consequence that in general a lot of marketing research
30 Chapter 1. Mechatronics in the Dutch high-tech industry
Market demands
Products
Technological developments
is needed to get and keep a good feeling of the market dynamics and to make
reliable estimates of future needs and possibilities. These are key tasks of
the marketing people in close cooperation with their technological counter-
parts. For several practical reasons, developing products can never be as
continuous as the market would like. To keep the cost down and the quality
high, the diversity in parts and products has to be limited. For this reason
products are generally developed in product families, also called platforms,
that will stay in production as long as possible. As a consequence, when
the time needed for the development itself is added, the initial decisions
on requirements and specifications need to remain valid for a long time.
One of the tools to systematically support these long term decisions is the
process of road mapping. This is a strategic process that should be done at
regular intervals to act as a basis for investments in new technologies and
developments. Figure 1.23 shows a typical example of such a roadmap just
for illustration of the principle. It is a highly simplified roadmap dealing
with the engine drive-train of a car.
Of course there is a large freedom in methods of road mapping and the shown
example is just one of the many possibilities to draw a suitable roadmap
for the planning of future development activities. The roadmap consists
of several layers where, like in the V-model, the upper layer deals with
the total overview of all product introductions and the lower layer with the
actual detailed plans on subsystem level. The definition of these roadmaps is
real teamwork, where people from marketing, development, manufacturing
and other sectors all need to have their input. An additional benefit of the
process is the resulting commitment of all involved people to realise what
has been agreed upon. It is in this way also clearly communicated and
understood that some developments need more time than others, like the
electrical drive in the example that is related to the applied batteries.
1.3. Systems engineering and design 31
Innovaon Drivers
Maximum speed (km/hr) 150 160 180 150 130
Technologies
Diesel Improved turbo (A)
Diesel Improved turbo (B)
Hybrid Hybrid technology
Electrical All electrical
Planned acvies
Enhanced Turbo development
Development hybrid technology
Development electrical drive in H and E
Figure 1.23: Roadmap that was drawn in 2007 as university lecture example, indi-
cating the different engine-drive principles that were expected to be
developed by the car industry in the following years.
As noted in the previous part a (system) designer has to deal with several
contradictory demands:
• Concurrent Engineering.
The drive for total quality in the second half of the last century has forced
development activities to become a more and more rigidly planned process.
Total quality means that not only the products are expected to function
exactly like they are promised to do, but also the process to create and
produce them needs to be fully determined. One item that was high on the
priority list at that time was the reliability of the development timing and
the reduction of scrap due to unplanned repairs. This process resulted in a
strict planning with milestones and go/no-go decisions in order not to start
with a next step until all questions of the previous step were answered. This
approach had several advantages:
• The planning became continuously more solid towards the end of the
process.
Product denion
Concept design
Detailed design
Long-lead item order and delivery
Integraon and test
Producon
Product denion
Concept design
Detailed design
Long-lead item order and delivery
Integraon and test
Producon
The process to allocate all activities essentially in series over time often
resulted in practice in a development lead time of more than three years
for a general consumer product. To partly solve this problem, the process
of concurrent engineering was introduced as shown in Figure 1.24. Under
pressure of market needs, people started to make use of the fact that in
each phase of the development process some elements will take more time
than others to be finished. It might be useful to treat these long-lead items
as off-the-shelf parts that have to be used just as they are. This approach
means that the long-lead items should be the first to design and the other
parts have to adapt to the interfaces chosen for these long-lead items. In
this way a hierarchy of parts is created that enables the designers to start
earlier with the next step in the process.
An important condition to make this method work is the use of agreed
interfaces. From the computer industry several well established interface
standards are known like PCI, USB and (serial-) ATA to name a few and
these have helped this industry to become as successful as it is. For the
34 Chapter 1. Mechatronics in the Dutch high-tech industry
design of mechatronic systems the situation is less easy due to the low
flexibility of the solid hardware but even in that field standard interfaces
are quite common with as best example the standardisation in connective
parts like bolts and nuts
In spite of the reduction of the development time, this more parallel concur-
rent engineering approach also has some drawbacks that need attention:
In the industries where concurrent engineering became the “de facto” stan-
dard of operation like in semiconductor manufacturing equipment, several
methods are used to avoid these mentioned problems. First of all a formal
and intensive communication network is created, focused on exact interface
definitions. Specifications are only adjusted in a disciplinary process accord-
ing to the rules of systems engineering. Secondly it appeared to be of crucial
importance to work with a modular design that is based on a technology
platform with a lifetime of more than one product introduction.
Recle stage
Illuminator
Figure 1.25: The main modules in a wafer scanner that is based on the “Twinscan”
platform of ASML. All modules are individually upgradable giving
the platform a virtually infinite lifetime in a market that demands a
continuous increase in performance.
(Courtesy of ASML)
and reliability in the new design. While the power supply and display are
controlled with embedded software (firmware), much flexibility is available
in showing or hiding functions depending on the market position of the
specific product. Most functions, sometimes including the camera, are
integrated and depending on the execution they are switched on or off in the
firmware or just covered by the shell. The total cost of such a design is often
even lower when some hardware in the product is not used, than would be
the case if a special hardware version without that specific function has to
be developed. This is mainly due to the lower logistic cost related to the
reduced diversity.
The last example of a modular design is the ASML wafer scanner as shown
in Figure 1.25. The “Twinscan” platform with a dual wafer stage became
the leading technology in this industry. In retrospect, many reasons for this
success can be mentioned like the team spirit, commitment and drive of
the people, the partnership with Zeiss and the strong mechatronic roots in
36 Chapter 1. Mechatronics in the Dutch high-tech industry
• All modules can be mounted and dismounted from the machine without
affecting the other modules.
37
38 Chapter 2. Electricity and frequency
Charged particles like electrons and protons have as property that they
either attract each other, when their charge has an opposite sign, or repel
2.1. Electricity and signals 39
+ _
+ _
Figure 2.1: Charges have an electric field that is directed away from the positive
charge and towards a negative charge. The electric field is graphically
represented by field lines. The density of the field lines determines the
force acting on other particles in the vicinity and the arrows of the field
lines give the direction of the force in case of a positive inserted charge.
The combined field of two opposite charges results in curved lines that
start at the positive charge and end at the negative charge.
each other, when their charge has the same sign. For example an atom with
a lower number of electrons than its proton count will attract electrons until
its charge is neutralised again. The related electrostatic force is the driver
behind all important electric phenomena. The magnitude and direction of
this force is represented by the electric field E. Mathematically the electric
field at a distance r in the direction of unit vector r̂, with its origin in the
centre of a single charged particle with charge qx , is represented by the
following equation:
1 qx
E(r) = r̂ V/m (2.2)
4π r 2 0
tion of the electric field as arrows and their density is proportional to the
magnitude of the field. This also means that the field is constant orthog-
onal to the field lines. By the definition of unit vector r̂, the direction is
outward for a positive charge and inward for a negative charge. Though
the drawing is two-dimensional, in reality the space around the charge is
three-dimensional, which corresponds with the decrease in the electric field
and the corresponding density of the field lines to the distance squared. In
the lower drawing of the figure the effect on the course of the field lines of
two opposite charges is shown. The connecting curved field lines are the
result of the superposition principle of electrostatic fields, which means that
fields can be simply added together. As a consequence, the resulting field
at the equidistant plane from both charges is zero and the field lines will
cross this plane perpendicular, leading to the shown curved lines from one
charge to the other.
The magnitude and direction of the force that is acting on a charged particle
with charge qx , inserted in a field E(r) that originates from several other
charged particles, can be calculated quite straightforward as follows:
This force F will be directed in the direction of the field in case of a pos-
itive inserted charge and in the opposite direction in case of a negative
inserted charge. This direction corresponds with the known phenomenon
that negative charges are attracted in the direction of a positive charge.
The vectorial nature of the electric field and the related force is very useful
when determining physical effects in for instance electron microscopes,
where charged particles have to be controlled in velocity and direction. In
electric systems, where electrons are guided in conductive material like
resistors, wires and other elements, it is more easy to work with the scalar
magnitude of the related potential energy of a charged particle that is
inserted in an electric field. What happens can best be explained when
looking at two points in the electric field at a different distance from the
first charge. The potential energy is determined in the same way as with
gravitation. When a particle with charge qx moves from location r1 to r2 , the
work exerted by the electrostatic force will change its potential energy. The
energy (E) is given in the unit Joule [J] named after the English physicist
James Prescott Joule (1818 – 1889) for his important exploratory work on
2.1. Electricity and signals 41
r2
ΔE p = − F · dl
r1
r2
= − qx E · dl [J] (2.4)
r1
Due to the sign convention this means that a negative charge will have a
positive potential energy in the vicinity of a positive charge which increases
with the distance. This reasoning corresponds with the potential energy of
a mass under the gravity force on earth. From this difference in potential
energy between location r1 and r2 , the potential difference can be defined
in order to get a number that is independent of the charge of the inserted
particle, by dividing the previous equation by qx .
ΔE p
ΔV =
qx
r2
= − E · dl [V] (2.5)
r1
These are positive numbers, when the E field is created by a positive charge,
as this field is directed outwards. These expressions make clear that the
42 Chapter 2. Electricity and frequency
V1 Potenal di<erence
r1 (V1 - V2)
I
_
r2 V2
+
Figure 2.2: Two points at a different distance from a charge have a difference in
potential. When the charge is positive, the potential V1 is higher than
the potential V2 , which means V1 − V2 is positive. A negative charge,
trapped in a conducting wire between r1 and r2 , will be driven by
the electrostatic force in the direction of r1 , which corresponds with a
positive electric current flowing in the other direction.
Moving charges are known as electric current, which is given the variable-
symbol I with unit Ampère [A] named after the French physicist and math-
ematician André-Marie Ampère, because of the high value of his research
on electromagnetism. The current is equal to the flow of charge C per unit
of time through a surface C/s .
Electromotive Force Fe .
When Ei is the internal electric field, e 1 and e 2 are the electrode locations
and dl is the inner path between the electrodes, the electromotive force
becomes:
e2
Fe = Ei · d l [V] (2.7)
e1
• The electromotive force from the source is equal to the potential differ-
ence of the external electric field and is mostly just called “voltage”.
When a charge moves through an electric field, its potential energy E p and
its electric potential V changes. The power (P ) of a system is defined as
the time derivative of energy with unit Watt [W], named after the Scottish
mechanical engineer James Watt, because of his famous work on steam
engines. With this definition the power can be expressed as follows:
dE p
P= [W] (2.8)
dt
The electric power can be expressed in the electric potential of the charges
as given in Equation (2.5), and in the current: I = d q/d t [A]. This gives the
familiar expression for electric power:
dE p d q
P = ·
dq dt
= V · I [W] (2.9)
1 In solid material positive charge can not move. The only examples of a material flow
corresponding to a positive current is in ionised gases and the movement of positrons but the
latter do not play a role in the real engineering world.
46 Chapter 2. Electricity and frequency
Current I
+
Voltage
+
source
V Resistance R
Figure 2.3: Basic electric circuit consisting of a voltage source and a resistive load.
The current is proportional to the voltage and inversely proportional
to the resistance, following Ohm’s law. The notation of the potential
difference is given with a double tip arrow where the signs are noted to
define the direction from + to −.
The signs of the previous equations are very important. It was shown,
that an electron as being a negative charge moves towards a decreasing
negative potential while a positive charge on its turn would experience a
force in the direction of a decreasing positive potential. This means that the
electric potential energy of a charge decreases due to this movement and
the corresponding power becomes available in another form, for instance
the dissipated heat in a resistor or in mechanical work in an electro motor.
In this book the positive direction of a current is defined by the pointing
direction of a single arrow while the positive direction of the potential
difference is given by a + and − sign adjacent to a double arrow to indicate
the relative potential difference.
It should also be noted that the units after the equations will most often be
omitted when it is obvious what is meant.
Figure 2.3 shows the most basic electric circuit possible. It consists of a
voltage source and a load, connected between the electrodes, that pulls
current from the source. In this basic form of an electric circuit the load is a
resistor R with unit Ohm [Ω]. In the next section the load will be extended
into a complex load, but originally the German physicist Georg Simon Ohm
(1789 – 1854) based his well-known law on the research he did using the
electric batteries of Volta. With the constant, not time dependant, voltage
of batteries the current in the circuit is only dependent on the resistivity in
the electric circuit. For this reason Ohm’s law gives the relation between
2.1. Electricity and signals 47
V2
P = IV = I 2 R = (2.11)
R
This squared, non-linear relation is an important phenomenon in mecha-
tronic systems. As an example of this phenomenon, an increase of the force
of an actuator requires an increase of the current. This increase results in
a squared increase of the power dissipated in the series resistance of the
actuator. A configuration that was just safe from a thermal point of view,
can become suddenly overheated, when only a seemingly limited increase of
the current is applied.
• nV : Unmeasurable.
In practice the term nV is only found in expressions like nV/ Hz, that relates
to the density of noise in a certain frequency band.
Practical values of current show an even wider range:
• kA : Power distribution.
• fA : Unmeasurable.
Electric voltages and currents are generally not constant over time. The
variability of these voltages or currents contain information about their
origin, for which reason they are called electric signals.
In physics a signal represents any value with a variability over time and/or
space, the temporal or spatial variability. An example of temporal variabil-
ity is sound that changes over time in amplitude and tonal character. In
mechanical engineering a position dependent force is an example of spatial
variability. In optics, spatial variability represents the difference in inten-
sity of light as function of the location on a surface. In a mathematical sense
both spatial ( f ( x)) and temporal ( f ( t)) variability can be treated the same
way, when x is exchanged with t or vice versa in the relevant formulas.
Electric signals in mechatronic systems are always temporal even though
the signal can be derived both from temporal and spatial physics phenomena.
The latter are translated into the time domain by for instance a scanning
movement with a constant velocity and measuring with a clear time refer-
ence. For this reason, the following part of this chapter will mainly deal with
temporal varying signals while in Chapter 7 on optics the spatial variability
will mainly be used.
2.1. Electricity and signals 49
The DC value of an electric signal equals the average value of the current over
time. For an alternating signal the term temporal frequency is defined as
the number of occurrences of a certain event per unit of time. Consequently
one can define the time period related to this frequency as the time taken for
one event to happen. In SI units the frequency with variable-symbol ( f ) is
expressed in the unit Hertz [Hz] named after the German physicist Heinrich
Hertz (1857 – 1894). The time period of one cycle, with variable-symbol (T )
in seconds [s] is inversely proportional to f .
1
T= [s] (2.12)
f
In physics and engineering the term angular frequency with variable-symbol
(ω) is often used. This frequency is directly related to the mathematical
description of harmonic oscillations, defined by the sine or cosine of an angle
changing with a constant angular velocity ω and an amplitude A , as shown
in Figure 2.4
When the sine and cosine function are presented in a graph as function
of the angle, this results in a graphic wave shape as shown in Figure 2.5.
Though this is not a real physical wave, the term “waveform” is often also
used for this graphic representation of a frequency because of its shape.
When ω is constant, the horizontal axis is also a time line where 2π equals
the period T . This means that the angular frequency is directly related to
the temporal frequency, because one period T equals a full circle being an
angle (φ) of 2π radians:
ω = 2π f [rad/s] (2.14)
A sin(ωt) A
ωt t=0
A cos(ωt) x
of magnitude and can easily result in wrong conclusions. In this book both
units will be used. The [Hz] units are applied with the practical examples
and graphical representation, where the relation with periodicity and mul-
tiplicity of events is more clear. It is often more related to the real world
and human understanding. The rad/s unit is mostly used for mathematical
analysis, equations of motion and the relation with physical effects.
When two signals with the same frequency like the sine and cosine of Fig-
ure 2.5 are compared, than the sine is shifted with an angle φ = π/2 in respect
to the cosine. This time relationship is called phase and is one of the most
important parameters in a controlled motion system because a negative
phase shift represents time delay, that can cause instability when applying
negative feedback. For practical reasons this phase shift is expressed in
degrees (◦ ), because of the small numbers involved. A 30◦ phase shift can
have a large impact while it is less than π/4. Though the degree is not an
official SI unit, it is widely accepted to be used in this context.
A simple thought experiment can be done on position, velocity and acceler-
ation with a body that moves with a sinusoidal movement x( t) = x̂ sin(ω t) =
1 Sine
0.8
0.6 Cosine
0.4
0.2
0
> 0.2
> 0.4
> 0.6
> 0.8
>1
0 π π 3π 2π 5π 3π 7π 4π
2 2 2 2
φ (ωt)
1 ?
0.8
0.6 Velocity
0.4 @
0.2
0
> 0.2
> 0.4
> 0.6
> 0.8
>1
0 3 2 5 3 7 4
2 2 2 2
Figure 2.6: Two periods of the position, velocity and acceleration of a body with a
sinusoidal movement: x(ω t) = x̂ sin(ω t) with x̂ = 1 and ω = 1.
x̂ sin(2π f t) with an amplitude x̂ of 1. The blue line in Figure 2.6 shows that
the position at π/2 is stationary. This means that the velocity is zero, cor-
responding to the green line which was drawn at the cosine function. Also
when observing for instance the steepest down slope of the position at π it
shows that then the velocity is maximum negative, which is logical. This
reasoning can be repeated at any position in the graph. For the relation
between velocity and acceleration the same reasoning results in the red line
which was drawn as −1 times the sine function.
In fact by just looking at the graph, it is visually verified that the mathe-
matics are correct. After all, when the position ( x) can be described as:
x( t) = x̂ sin(ω t) (2.15)
then the speed (v) equals the derivative of position over time:
d x( t)
v( t) = = ẋ( t) = x̂ω cos(ω t) (2.16)
dt
and the acceleration (a) equals the derivative of velocity over time:
dv( t) d2 x
a( t) = = = ẍ( t) = − x̂ω2 sin(ω t) (2.17)
dt d t2
The visual representation is however not fully representative for the real
situation, because it only gives the right answer on the amplitude of velocity
and acceleration for a specific frequency when ω = 2π f = 1. These amplitudes
at another frequency can be found by straightforward reasoning, as a higher
frequency means a faster change. For a constant position amplitude, the
amplitude of the velocity increases proportional with the frequency and the
amplitude of the acceleration with the frequency squared.
As a second step, the phase relationship between these three parameters is
52 Chapter 2. Electricity and frequency
t1
x( t 1 ) = x( t 0 ) + v( t) d t (2.18)
0
The third kind of signals next to DC and AC is noise. Noise is a real random,
non-deterministic signal which means that the signal value at any time
of observation can have any value within a certain range. The range is
described by means of statistical distributions that give the probability that
the value is within that range. In Section 2.3 it will be explained that random
signals in the time domain consist of an infinite amount of frequencies in
the frequency domain. Also the distribution of these frequencies, called
a frequency spectrum, can only be described in statistical terms. Noise is
sometimes named with a colour term to indicate their frequency spectrum
like white noise with an equal presence of all frequencies over the entire
frequency spectrum or pink noise when the low frequency components are
more stronger present than the high frequency components
High performance Mechatronic systems exist by virtue of predictable and
well controlled dynamics of the motion systems and to that respect random
signals are by definition a part of the possible disturbances that impair the
functionality. The only thing one can do is avoid or suppress the effects of
noise but it can never be cancelled as that would require knowledge of future
values and then the signal would no longer be random. The reduction of
noise has always been an important area of research by determining the
mechanisms behind the noise in order to improve the predictability and at
least filter out the systematic effects that occur at regular intervals.
Noise and its statistical analysis is examined more in depth in Section 8.2
on Dynamic Error Budgeting where it will be explained how to deal with
random disturbances in precision measurement and positioning systems.
2.1. Electricity and signals 53
The term “rms” refers to Root Mean Square (RMS), named from the action
of taking the root of the mean value of the squared function. The RMS value
is a well-known term in electricity to characterise the useful value of the
mains supply with a sinusoidal voltage V = V̂ sin(ω t).
The equivalent DC voltage becomes:
T T
1 1
Vrms = (V̂ sin(ω t)) d t = V̂
2 sin2 (ω t) d t
T T
0 0
T
1 1 − cos(2ω t) 1 t sin(2ω t) T
= V̂ d t = V̂ − (2.21)
T 2 T 2 4ω 0
0
means that the sine term will average to zero and the final result becomes:
1 t T T V̂
Vrms = V̂ = V̂ = (2.22)
T 2 0 2T 2
The 230 V AC mains supply in the Netherlands is the RMS value of the
voltage so the peak value equals V̂ = 2 · 230 = 325 V.
Similar to the voltage, the RMS value of a sinusoidal current will equal Î/ 2
and the combined power will equal VRMS I RMS = 0.5V̂ Î as is proven with the
following seasoning, when I (ω) = Î sin(ω t) and V (ω) = V̂ sin(ω t):
1 − cos(2ω t)
P = I (ω)V (ω) = P̂ sin(ω t) sin(ω t) = P̂ sin2 (ω t) = P̂ (2.23)
2
with the peak power P̂ = Î V̂ . The result is a positive number and averages
out over time to P = P̂/2 because the average value of a sinusoidal function
is zero.
Im
Z(M )
A
j sinM
M
cosM Re
Also the relation for power becomes less trivial as a phase shift between
current and voltage has impact on the related power. Without a phase
shift, the power was shown to average out over time to P = P̂/2. In case of
a π/2 phase shift, where the current advances over the voltage, the Power
becomes:
1
P = I (ω)V (ω) = P̂ sin(ω t − π/2) sin(ω t) = P̂ cos(ω t) sin(ω t) = P̂ sin(2ω t) (2.26)
2
Current I()
+
Voltage
+ source
Figure 2.8: Generic representation of Ohm’s law. All variables can be frequency-
dependent. The load becomes a “complex” impedance resulting in a
possible phase shift between current an voltage.
56 Chapter 2. Electricity and frequency
1
Sine Sine
1
Power Cosine
JQ8 JQ8 Power
JQ6 JQ6
JQ4 JQ4
JQ2 JQ2
0 0
>JQ2 >JQ2
>JQ4 >JQ4
>JQ6 >JQ6
>JQ8 >JQ8
> >
0 1 2 0 1 2
Period Period
vp
λ = vp T = (2.27)
f
In the following, two essentially different types of waves will be presented,
the real and observable mechanical waves, that are based on movements
of material, and the physical-model-based electromagnetic waves. The
examples are aimed to only create a basic idea of these phenomena that are
sufficient to use it in this book. For that reason a full mathematical analysis
with wave-equations is not presented.
k2
k1 k n-1
m1 m2 mn
m n-1
Transversal wave
k1 k2 kn-1
m1 m2 m n-1 mn
Longitudinal wave
Figure 2.10: Transfer of energy by waves in a mechanical body can be either lon-
gitudinal or transversal. Transversal waves can be in any direction
orthogonal to the propagation direction which is always longitudinal.
This equation is valid for gases and fluids and to a lesser extend for solid
objects where it can be used only as approximation. The shape of the object
also plays a role and the velocity for transversal and longitudinal waves is
different. As an example with steel, B ≈ 160 · 109 M/m and ρ ≈ 8 · 103 kg/m3
resulting in a calculated velocity of approximately 4500 m/s. In reality the
velocity in stainless steel varies between 3500 m/s for transversal waves and
5500 m/s for longitudinal waves. With for instance half a metre of steel this
gives a delay of about one tenth of a millisecond which results in a phase
delay of almost thirty-five degrees at one Kilohertz which can be significant
from a control point of view!
Acoustic waves that relate to sound are a special subdivision of mechan-
ical waves. In principle they can be visualised with the help of the same
Figure 2.10. However with sound the medium is gas and this means that
2.2. Energy propagation and waves 59
B
t/x
Magnec eld
Electric eld
Figure 2.11: Electromagnetic waves consist of a coupled periodic electric and mag-
netic field that propagate as function of time and position. In vacuum
the velocity is equal to the speed of light c, exactly 299,792,458 m/s.
the mass elements are very small (molecules), while the elastic elements
are also very small as they consist of the interaction between the moving
molecules. On a macro-scale, the energy in acoustic waves is transferred
by pressure and velocity where in practice only longitudinal acoustic waves
exist because of the low viscosity, which could be the only mechanism to
transfer transversal movements.
The example shown is still very much one dimensional and it is worthwhile
to consider the three dimensional case at this point as the inserted energy
is in reality transferred in all directions. To take this into account the
transferred energy is defined as the energy per unit of surface. This value
is often called intensity. Based on the law of conservation of energy this
means that the intensity value will decrease as function of the enlarging
surface which often in practice corresponds to something like a half sphere.
When motion energy is inserted at the surface of an object, the intensity will
decrease proportional with the squared distance from the point of insertion.
A special type of waves is the electromagnetic wave, because it does not use
material for its propagation. Electromagnetic waves as a physical model
were first postulated by James C. Maxwell (1831 – 1879) to describe phenom-
ena like radio transmission and light. At first, people who were accustomed
to the mechanical wave theory thought that electromagnetic waves should
also have a carrier to propagate the energy. For this reason they invented
an unknown medium called “Aether” as a wave carrier. Present theory
60 Chapter 2. Electricity and frequency
It has been mentioned that waves represent the transfer of energy by means
of an oscillating medium. In the mechanical wave this energy is alternatingly
present in the form of kinetic energy or as potential energy. When looking
at the maximum kinetic energy E k = 1/2 mv2 at any point in a mechanical
wave it is clear that the related intensity is proportional to the amplitude
squared. Also with electromagnetic waves the intensity is proportional to
the amplitude squared, but with these waves generally the term irradiance
is used rather than intensity because in radiology the term intensity or
radiant intensity is also used as the radiated power per unit of solid angle
2.2. Energy propagation and waves 61
Figure 2.12: Electromagnetic waves are observed over a vast spectrum of frequen-
cies, ranging from radio waves to gamma rays. The spectrum of visible
light covers only a very small part.
originating at the source of the wave. It is typical for waves that start at a
singular spot. The term irradiance will be frequently used in the chapters
on optics and measurement.
2
Sequence
Figure 2.13: A man holds a rope with a loose end and inserts a half wave by an
upwards movement followed an equal downward movement. The
reflection of the wave at the loose end has the same sign because of the
zero force boundary condition at the end point. The effect of gravity is
neglected.
as the original half wave, because it first starts with an upward momentum,
originating from the extreme upward motion of the end, followed by a down-
ward movement.
Another situation occurs when the end of the rope is connected to a rigid
stationary object like a wall. This can in realty be visualised also with
presence of gravity and is shown in Figure 2.14. This zero movement bound-
ary condition causes the force at the connection point to increase until all
energy in the wave is absorbed in the local elasticity in the rope. That
potential energy is inserted back in the rope such that its movement gains
a maximum momentum passing the centre line until its slowed down again
at the downward side where its energy will be transferred in the backward
direction. The reflection in this situation leads to a sign reversal, because
it first starts with an downward momentum, originating from the extreme
force at the connection point, followed by an upward movement.
Of course these two examples can also be calculated with real mathematics.
This would enable to precisely calculate the transfer over a discontinuity and
show the minus sign in the reflection at a rigid connection but as mentioned,
is not necessary for a sufficient comprehension of the phenomena in the
2.2. Energy propagation and waves 63
2
Sequence
scope of this book. This phase reversal is also observed with the reflection
of electromagnetic waves when going from a low density to a high density
medium like with reflecting on a mirror.
Figure 2.15: A standing wave occurs with a sinusoidal excitation when the reflected
wave is in phase with the excited wave. This occurs at a frequency
equal to f = nvp /4d with n = 1,3,5,7..... The situation for n = 3 is shown
in the upper graph and n = 5 for the lower graph.
the value is a function of the angle θ instead of time and the interval of one
period T is represented as an angle from −π to π. When F (θ ) represents the
Fourier transform of f (θ ) then:
∞
F (θ ) = a 0 + [a n cos( nθ ) + b n sin( nθ )] (2.31)
n=1
The sine and cosine terms define the phase relation between the different
components. The Fourier coefficients a n and bn are the amplitudes of the
corresponding sine and cosine terms belonging to the different frequencies
and are calculated over the mentioned interval by the next relations:
π
1
a0 = f (θ ) d θ (2.33)
2π
−π
This first coefficient represents the average value and is zero for a periodic
signal without a DC offset. The other terms become according to the above
mentioned trigonometric identity:
π
1
an = f (θ ) cos( nθ ) dθ (2.34)
π
−π
and:
π
1
bn = f (θ ) sin( nθ ) dθ (2.35)
π
−π
In equation (2.32) the frequency terms are all an integer multitude of the
fundamental frequency. This interrelation is called harmonic because in
music a waveform that consists of only harmonically related frequencies does
sound like a harmonic (not out of tune) tone. The fundamental frequency
equals the first harmonic and likewise the frequencies corresponding with
n = (1, 3, 5, ...) are called odd harmonics and the frequencies corresponding
with n = (2, 4, 6, ...) are called even harmonics2 . To illustrate and underline
2 The terms “odd” and “even” should not be confused with odd and even functions in math-
ematics that deal with symmetry relations of functions. A function is called even when the
graphic representation in the x − y plane is unchanged when mirrored around the y axis. The
graphic representation of odd functions remain unchanged when rotated 180◦ around the origin.
For example cos x is an even function while sin x is an odd function.
2.3. Mathematical analysis of signals and dynamics 67
>1
1
2
Magnitude
0 0.9
0.8
0.7
Magnitude
0.6
>1
"! 0.5
2 X
0.4
X
Z
0.3
X
\
0.2
!"
>1 0.1
>π > π 0 π π 0
2 2 0 1 2 3 4 5 6
Angle Harmonic
Figure 2.16: Triangle waveform, approximated by the sum of the first three har-
monic terms of the corresponding Fourier series, already comes close
to the ideal waveform.
this theory on Fourier series of periodic functions the following three wave-
forms are taken as example, the triangle, sawtooth and square waveform.
>1
1
2
Magnitude
0 0.7
"! 0.6
X
Magnitude
0.5
X
^
>1 X
Z 0.4
2
X
0.3
X
\
X
` 0.2
>1 !"
0.1
>π > π 0 π π 0
2 2 0 1 2 3 4 5 6
Angle Harmonic
Figure 2.17: Sawtooth waveform, approximated by the sum of the first six harmonic
terms of the corresponding Fourier series, is still hardly recognisable
as a real sawtooth.
sharp tip if all terms would be included. It is also nice to see that between
for instance θ = π/4 and π/2 the combination of the second and third term
leads to a flattening of the curved slope of the first term in that region.
The mathematics also show, that only odd harmonics are present. Due
to the quadratic term in the numerator of the amplitude coefficients the
higher order terms are rapidly becoming negligible. The image in the figure
consists of only three terms from the Fourier series and the sum already
looks like a triangle. Nevertheless due to the sharp edges an infinite amount
of harmonics would be needed to create an ideal triangle waveform.
The sawtooth waveform is shown in Figure 2.17. While it might look like
another kind of triangle it is not that simple. A theoretical ideal sawtooth
waveform has an infinitely steep slope around θ = 0 which represents a
discontinuity. As a consequence many terms are required to approximate a
sawtooth waveform to an acceptable level of accuracy. Like with the triangle
a 0 is zero but in this case the fundamental frequency of the range is a sine
function as the sawtooth is like the sine an odd function. The mathematical
relation that describes the sawtooth waveform as function of the angle θ is
2.3. Mathematical analysis of signals and dynamics 69
as follows.
1
f (θ ) = −1 + θ for − π < θ < 0 (2.38)
π
1
f (θ ) = 1 − θ for 0<θ<π
π
f (θ ) = 0 for θ=0
This means that both odd and even harmonics are present. Furthermore, the
amplitude coefficients of the successive harmonics only decrease proportional
with n which underlines that many terms are required to create a reliable
sawtooth. This is due to the above mentioned fact that the infinitely steep
ramp has to be created by summation of only finitely steep sine wave slopes.
The image in the figure consists of only six terms and though one faintly
recognises the final shape it is still far from a sawtooth waveform.
Calculating the coefficients the Fourier series of this function in time nota-
tion (θ = ω t) gives the following infinite series of frequencies:
4 1 1
F ( t) = sin(ω t) + sin(3ω t) + .... + sin( nω t) n = 1, 3, 5, 7, ......
π 3 n
(2.41)
70 Chapter 2. Electricity and frequency
>1
1
2
Magnitude
0 1.3
1.2
"! 1.1
Magnitude
X
1
>1 0.9
2 X
Z 0.8
X
\ 0.7
X
{ 0.6
0.5
X
| 0.4
>1
!"
0.3
0.2
0.1
>π > π 0 π π 0
2 2 0 1 2 3 4 5 6 7 8 9 10
Angle Harmonic
Figure 2.18: Square waveform, approximated by the sum of the first five harmonic
terms of the corresponding Fourier series starts to look like a real
square.
This means that like with the triangle waveform only odd harmonics are
present. Furthermore, like with the sawtooth waveform, the amplitude
coefficients of the successive harmonics only decrease proportional with n,
which underlines that also in this case many terms are required to realise
a reliable square wave for the same reason as an ideal theoretical square
waveform has discontinuities that can only be approximated. The fact that
the summed first six terms of the Fourier series already looks more like
a real square waveform than the previous example is due to the missing
even harmonics leading to a higher frequency of the last terms that were
included in this approximation.
Based on these three examples the following observation can be made.
A periodic signal that is symmetrical around an axis in the y direction at
π/2 ± nπ will create only odd harmonics. The even harmonics all represent an
asymmetry around these axes as can be seen with the sawtooth waveform.
This is especially important with spatial frequencies in optics where in
principle only odd harmonics are present.
From the previous part it was demonstrated, that a periodic signal can be de-
composed into harmonically related frequencies. The process of calculating
the terms in the equation focused on one period only as being representative
2.3. Mathematical analysis of signals and dynamics 71
A B C D
Magnitude
Time
for the entire continuous function. In reality signals often look more like
the graph of Figure 2.19 showing a random signal that was registered over a
short period of time. A real random signal consists of an infinite amount of
different sinusoidal frequencies with different (and changing!) amplitudes
and phase. When only a limited time span is observed, like shown in the
figure one can recognise areas with a low frequency like in area A – B and
with a high frequency like the area C – D. It is possible to approximate such
a random signal by a limited amount of frequencies when observing only
a limited time span. Figure 2.20 shows an example that is created by a
further execution of the above mentioned Fourier transform methodology
and it works as follows:
A sample of the signal is taken over the defined time span and this sample is
multiplied with a series of sine and cosine frequencies covering the spectrum
of interest. For example in mechatronic systems a spectrum of 0 – 1000 Hz
could be taken with steps of 1 Hz resulting in two thousand multiplications.
The outcome of each multiplication results in the amplitude of each of these
frequencies and can be represented graphically in the form of a frequency
spectrum. It is clear that a lot of computation power is necessary to do this
with a large resolution in frequencies especially when real-time information
of fast changing signals with many samples is needed. Even with the fast
computing systems of today this discrete Fourier transform (DFT) is not
used anymore as the same result can be achieved by means of a more
efficient algorithm called Fast Fourier Transform (FFT). This method of
which the mathematical details go beyond the scope of this book has become
the standard for signal analysis and is for instance applied in instruments
like a Spectrum Analyser. Next to the digital FFT operation, Spectrum
72 Chapter 2. Electricity and frequency
JQ
0.3
0.2
0.1
Magnitude
0
0.12
>JQ
0.1
Magnitude
>JQ^ 0.08
>JQZ 0.06
"! 0.04
>JQ
!"
0.02
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0
0 10 20 30
Time Harmonic
Analysers can also be based on analogue mixing of the unknown signal with
a known frequency and creating a multiplication by non-linearity in the
amplification. This method is used in very high frequency analysers.
s = σ + jω (2.43)
The same transform can be applied to an integration action which gives the
following result:
⎧ t ⎫
⎨ ⎬ x( s) x( s) x( s)
L x( t) d t = = = −j (2.45)
⎩ ⎭ s jω ω
0
it can be concluded that the s term due to the differentiation over time of
a variable results both in a proportional increase of the magnitude of the
variable with increasing frequency by the multiplication with ω and in a
positive phase shift of 90◦ corresponding with the positive imaginary term
j . Following the same reasoning, the integration of a variable over time
gives both a proportional decrease of the magnitude of the variable with
increasing frequency because of the division with ω and a phase shift of
−90◦ corresponding with the negative imaginary term − j .
The frequency domain, when written with the s variable, is also frequently
called the Laplace domain. It is important that equations from the frequency
domain are not confused with equations from the time domain. When it
is necessary to clearly distinguish the different domains, the notation of a
function like f ( t), with the variable t between round brackets, strictly refers
to the time domain. The notations with the variable (s) for the Laplace
domain, (ω) for the radial frequency or ( f ) for the temporal frequency all
strictly refer to the frequency domain. The addition of the distinguishing
variable terms ( t), (s), (ω) or ( f ) will in many cases be only done once in every
equation at the term(s) before the equal sign in order to avoid too many
74 Chapter 2. Electricity and frequency
The step response is a typical time domain related phenomenon and gen-
erally the most straightforward of all responses. It is often quite easy to
understand or to visualise what happens when a force on an object increases
from zero to a certain value. For instance when putting a heavy load in
a car one sees the car sagging with a certain speed and depending on the
suspension and the shock absorbers (dampers), a slow periodic movement
with decreasing amplitude is observed. In Figure 2.21 a typical response
of a dynamic system to a sudden change of force (step) from value 0 to 1
2.4. Dynamic system response to a stimulus 75
1.5
1
F
0.5
x
0
>^ 0 ^ 4 6 8 10
t
Figure 2.21: Typical response of a damped mass spring system to a force on the
mass that changes from 0 to 1 at t = 0. The vertical axis is normalised
to 1 at t = ∞.
is shown. The dynamic system in this case is a damped mass spring sys-
tem like presented in Chapter 3. This response, if considered upside down,
corresponds to the described response of the car.
Even with the origin of the step response in the time domain its equivalent
in the frequency domain can be derived by the Fourier transform. This
shows that a step function consists of a continuous spectrum of sinusoidal
frequencies starting at 0 Hz that all are in phase at the moment of the step
and which have an amplitude that is inversely proportional to the frequency.
This can be imagined when one considers that all those frequencies will only
be correlated (in phase) at t = 0 but at any other time their correlation is
lost and the combination will average out to zero. In a mathematical sense
a step function is a special case of a square waveform with a fundamental
frequency of 0 Hz. Starting with the Fourier series of a square waveform
according to Equation (2.41), the limit of ω → 0 results in a continuous
spectrum of frequencies starting at 0 Hz.
Because of this wide frequency spectrum, a step function is a very suitable
stimulus for a passive mechanical system to investigate the occurrence
of resonances.It is important to note however that the step response is
less applicable as stimulus in an actively controlled dynamic system with
actuators as with most controllers the infinite slope will force the actuator
into saturation and cause other non-linear effects that are not observed
76 Chapter 2. Electricity and frequency
Step 2
l h
Step 1
t
JQ}
JQ`
F
JQ
JQ^
x
>JQ^
>^ 0 ^ 4 6 8 10
t
Figure 2.23: Typical response of a damped mass spring system on a force impulse
at t = 0. The vertical axis is normalised to 1 just after the impulse.
This means that the length becomes longer than zero and the peak value pro-
portionally lower than infinite while simultaneously the edges are rounded
off. To a certain extent one can compensate the corresponding lack of high
frequencies by convolution3 of the measured response with the real spectrum
of the stimulus but at higher frequencies the noise will limit the reliability
of the outcome.
The impulse response is often used for modal analysis of a mechanical sys-
tem. The impulse is applied by means of a standardised “hammer”. The
hammer is provided with a rubber tip to avoid damage by the ”infinite“
force and as a consequence, the impulse will be rounded off with a lower
amplitude of the high frequencies. By measuring the exerted impulse itself
directly on the hammer, the deviation of the ideal pulse can be taken into
account.
One important drawback of this stimulus is that all the energy is inserted
in the system in a very short time with a high force over a short time. In-
evitably this will give erroneous information in case of non-linear systems.
As with the step response, for this reason also the impulse is less suitable
for active controlled dynamic systems.
functions.
78 Chapter 2. Electricity and frequency
Frequency domain related stimuli avoid the extremely high forces that are
related to the “single event stimuli” of the previous section, because they
present a more or less permanent continuous spectrum of frequencies with
sufficiently reduced amplitudes. These signals enable longer measurements
without errors due to nonlinearity.
Sinusoidal waveforms are quite easy to create in electronics. For that reason,
initially stimuli were used that consisted of a sinusoidal waveform with a
constant amplitude and a gradually changing frequency. This “frequency
sweep” was suitable for fast electronic systems. In mechanical systems
with inherent lower frequency resonances the frequency sweep method
sometimes induced errors due to the time needed for resonances to gain
sufficient energy to manifest themselves. A fast sweep would cause these
resonances to remain unnoticed. One way to overcome this problem is using
a very slow sweep frequency if time allows and this process could be made
interactive to zoom in on critical areas. A better method is to use a stimulus
containing all frequencies over a longer time than with an impulse. The first
example is to use white noise as stimulus of which the power (signal value
squared) is constant for any fixed frequency range over the total frequency
band. In practical situations, similar to the impulse, ideal white noise does
not exist due to high frequency limitations in the physical embodiment and
only the frequency band of interest is excited.
The second example of a more continuous frequency spectrum is created in
this era of high speed digital electronics and is based on the synthesis of a
stimulus signal by means of multi-sines. In principle this approach allows to
create any spectrum of signals, with correlated phase and frequency, random
phase relations or any other combination. Frequency areas of no interest can
be avoided, different amplitudes can be chosen for different frequency areas
to avoid overheating or non-linear behaviour and the frequency spectrum can
be refined in the areas of most interest. As the signals are fully deterministic,
all phase and amplitude relations in the stimulus are known and it becomes
fully straightforward to determine the dynamic system response.
2.4. Dynamic system response to a stimulus 79
• The Bode-plot to show the magnitude and phase response upon a con-
tinuous frequency stimulus in two graphs as function of the frequency,
one for the magnitude and one for the phase.
2.4.1.1 Bode-plot
The Bode-plot, named after the American engineer Hendrik Wade Bode
(1905 – 1982), visualises the frequency response of the output of a dynamic
system to an input stimulus and is originally known from analogue electronic
signal analysis. Figure 2.24 shows an example of a Bode-plot of a fourth
order coupled mass spring system as will be introduced in Chapter 3 on
dynamics.
In principle a Bode-plot consists of two parts, one above the other, that both
share the same horizontal axis with the frequency as parameter. The upper
part is the magnitude Bode-plot with the ratio between the magnitude of
the response and the magnitude of the stimulus on the vertical axis. The
second part shows the phase shift on the vertical axis relative to phase of the
stimulus. Several important aspects have to be recognised in a Bode-plot:
• In a typical Bode-plot both the frequency and the amplitude scales are
logarithmic to base 10.
• The frequency scale can be both in rad/s or in Hz. The latter is often
preferred in the mechatronic field, mainly because of the more “natural”
character of the temporal frequency.
• The magnitude scale can be different for each application because the
units of the stimulus and the response can be different. For instance
the stimulus can be a force [N] while the response can be a position
[m]. The numbers have only a meaning when this relation is known.
>^
>}
10 *`J
>J
10 *^JJ
0
>\
Phase [deg]
>|J
>Z\
>}J
1 ^ 3
10 10 10 10
Frequency (Hz)
Figure 2.24: Example Bode-plot of a motion system. The upper graph gives the
magnitude of the position response divided by the magnitude of the
force stimulus. The lower graph gives the phase relation between
the position response and the force stimulus. Starting at the low
frequency side first the response is frequency independent but at 15
Hz a damped resonance determines the start of a negative slope where
the magnitude decreases as function of the frequency. Around 3 kHz a
typical resonance and anti resonance indicate the effect of a decoupling
mass as will be explained in Chapter 3.
• The Bode-plot represents the response of the system when all fre-
quencies are continuously present at the stimulus. It is a station-
ary representation.
The double logarithmic scales are used because most frequency responses
are either proportional or inversely proportional to the frequency f or ω
to the power n, where n is an integer. With double logarithmic scales the
function log ωn = n log ω becomes a straight line for all values of n, because
one scale shows log ωn and the other log ω. The resulting line will have an
upward slope when n is positive and a downward slope when n is negative.
It is common to talk about the order of the slope in terms of n. For example
a transfer function f (ω) = 1/ω2 , which means that n = −2, results in a line
with a slope of −2 in the Bode-plot.
2.4. Dynamic system response to a stimulus 81
In Section 2.3.2, where the Laplace transform was introduced, a +1 slope was
shown to be representative for a differentiating action with a corresponding
phase shift of +90◦ and a −1 slope is representative for an integrating action
with a corresponding phase shift of −90◦ . This direct link between phase
and slope might lead to the conclusion that the phase plot gives redundant
information. In simple “minimum phase” linear systems, the relation be-
tween the phase and amplitude is indeed unambiguously determined.
Often, however, this relation is more complicated and the same phase might
occur at different frequencies and magnitude-slope values, especially with
higher order systems. The influence of sampling, quantisation, hysteresis,
backlash and other complicating factors is another reason to not leave the
phase plot out in the analysis of the open-loop response of modern feedback
controlled mechatronic systems. The open-loop phase plot is often even more
important for dynamic system analysis than the amplitude. This is the main
reason that the Nyquist plot has been developed. Nevertheless, due to its
clear relation with the standard applied integration and differentiation
operations in motion control systems, the Bode-plot is a frequently used
graphical representation both in practice and in this book.
Table 2.2: The relation between the order of the slope of a Bode-plot and the mag-
nitude ratio both in deci-Bel and in amplitude and power ratios with a
frequency difference of a factor two or ten.
as incremental unit.
P1
X = 10 log [dB] (2.46)
P2
V2
P = V I = I2R = [W] (2.47)
R
For that reason the relation in dB between two magnitude levels of electrical
signals is:
I1 V1
X = 20 log or X = 20 log [dB] (2.48)
I2 V2
This means that in expressing the relation between different levels of power,
the multiplication factor is ten and between different levels of signal ampli-
tude the multiplication factor is twenty.
In terms of deci-Bel, the -1,-2,+1,+2 etc term for the slope in the Bode-plot
becomes equal to a certain number of dB per frequency ratio as shown in
Table 2.2. The values are given both for a decade, representing a factor ten
between the frequencies, and for an octave with a factor two difference, as
commonly used in sound systems and music.
While the dB is frequently used for the magnitude scale of Bode-plots,
probably because it avoids the “power of ten” term with large numbers, it
is more practical to use the absolute number when analysing the dynamic
properties of mechatronic systems. The reason for this preference is that the
2.4. Dynamic system response to a stimulus 83
magnitude level is the reference for the magnitude of the different control
parameters.
In practice the use of dB can even cause confusion when analysing the
dynamic properties of a system at different frequencies at a ratio that is
neither a factor two or ten. With for instance a factor three frequency
difference one might think this is equal to 0.33 of a decade and conclude that
a -2 slope (40 dB per decade) would then give an attenuation of −0.33 · 40 =
−13, 2 dB. In reality the magnitude attenuation would be 20 log 1/32 = −19 dB.
This error is often made by students at examinations.
In control terminology the 0 dB level is often used to indicate the frequency
where the magnitude of the transfer function of a system becomes equal to
one. In case one prefers to avoid the use of dB, like in parts of this book,
this frequency can be named the unity-gain cross-over frequency.
With the Bode-plots in this book the horizontal scale is always in Hz and
the magnitude scale is mostly absolute with the exception of the electronics
chapter where both notations are used at different sides of the Bode-plot.
The main reason for this exception is the possible application of this book
by audio engineers for whom it is more logical to use the dB notation and
electronic filters are frequently used in their field.
Even though the absolute number is the preferred notation with active
control of dynamic systems, also in this field the dB notation is still fre-
quently used. For that reason a mechatronic design engineer should always
be aware of these different notations in spite of the apparent logic behind
using absolute numbers.
The Nyquist plot, named after the Swedish electrical engineer Harry Nyquist
(1889 – 1976), is a polar coordinated two dimensional vector plot where
the magnitude M is represented by the length of the vector starting in the
origin. The phase angle (ϕ) is represented by the angle of the vector relative
to the positive horizontal axis, while its sign is positive moving against the
direction of the clock. Because most dynamic systems have a response with
a phase delay, most Nyquist plots show a phase angle in the direction of the
clock, corresponding with the negative phase that is caused by the delay.
When the different vectors for all frequencies are connected, a curved line is
obtained with the frequency as parameter along the line. This plot is often
used for the stability and robustness analysis of feedback controlled systems
as will be presented in depth in Chapter 4 on motion control. A typical
84 Chapter 2. Electricity and frequency
^
0 dB
^
Q
QQ^\
QQ` 6 dB
Q^ QZQ^
J
Imaginary Axis
0
(-)
>
M
>^
>^Q\
~>^QZ{
!X^{\
>Z
>\ > >Z >^ > 0
Real Axis
Figure 2.25: Nyquist plot of the open-loop transfer function of a controlled mass
positioning system. The blue line connects the polar vector points
(black arrow) defined by the phase angle relative to the positive real
axis and the magnitude as length of the vector for different frequencies.
It starts at low frequencies following the blue arrow to the origin. The
phase angle of this example is negative. The Nyquist plot is used to
show the stability of a system when the feedback loop is closed.
Dynamics of motion
systems
Introduction
All mechatronic systems in the context of this book represent controlled
motion systems. This statementh implies, that they are all inherently
dynamic in nature. The dynamic properties of the separate parts within
mechatronic systems each have their own impact on the performance with
respect to response speed and precision. It is fully justified to say, that
the mechanical part is often in practice the most determining factor for
the final performance of the total mechatronic system. For this reason it
is more than anything else of paramount importance to fully understand
the physical properties, that are linked with items like mutually connected
springs, bodies and dampers.
The chapter starts with the relation between the stiffness of a mechanical
construction and its inherent relative position accuracy under external loads.
This will show the relation between accuracy and natural frequency and
numerical examples will give a feeling on practical design constraints. It
includes an example of an active feedback controlled positioning system as
used in a CD-player, in order to link the feedback mechanism with stiffness,
as a short introduction towards Chapter 4 on motion control.
This introduction is followed by a more theoretical analysis of the response
to a force of a mass-spring1 system with different levels of damping, the com-
1 The term “mass-spring” or “mass-spring-damper” system is a regularly used term although
85
86 Chapter 3. Dynamics of motion systems
a spring and damper are objects, while the mass is a property of a body. To avoid confusion,
the regular naming will be used for the complete system, while the element will be named a
body with mass m as its property.
3.1. Stiffness 87
3.1 Stiffness
Most people have a rather good qualitative feeling about what is stiff and
what is not, but it becomes different when it is necessary to work with
quantitative data based on the SI units. The unit of stiffness is called
“spring constant” k and is defined in Hooke’s law of elasticity by the English
scientist Robert Hooke (1635 – 1703). It equals the incremental change in
the amount of force ( dF ), that an elastic element would produce in reaction
to an incremental deformation ( d x). When the incremental deformation is
pointing in the positive x direction the corresponding incremental force in
that same direction2 would equal:
dF
d F = − k ( x) d x =⇒ k ( x) = − N/m (3.1)
dx
The negative sign is the result of the fact, that the force is directed in the
opposite direction of the deformation, indicating that F is negative for a
positive d x resulting in a positive number for k( x).
With small deformations the stiffness is mostly independent of the defor-
mation. In that case Hooke’s law becomes linear and can be written in the
simple notation:
F
F = − kx =⇒ k=− (3.2)
x
where x is defined as the displacement from the position where no force is
exerted on the spring.
Based on the third law of the English scientist Sir Isaac Newton (1643 –
1727), stating, that the forces of interaction between two bodies are equal,
opposite and collinear, an equal force F = kx must be applied in the direction
of the deformation to the elastic element. This equation is also called Hooke’s
law, which can cause confusion when it is not clearly defined which force is
meant, the action or the reaction. For this reason in this book the second
version of Hooke’s law is called the “Hooke – Newton” law even though these
two eminent scientists would probably not have liked to be named in one
tern, in view of their not too well personal relation.
In general the term “spring constant” is associated with the spring as a
separate item. The term stiffness is preferred in the technical context
2 The force in all drawings in this book is noted with a double arrow because it acts between
two bodies according to the third law of Newton. The direction of the force at the point of
insertion is usually defined to be positive in the positive direction of the corresponding position
coordinate system defined by one or more arrows, depending on the number of directions in
the coordinate system that are presented.
88 Chapter 3. Dynamics of motion systems
Rotang workpiece
Figure 3.1: Schematic view of a turning lathe. The force that is exerted by the
rotating work piece in the negative direction on the cutting tool will
deform the spring with stiffness k resulting in a displacement − dx from
the intended position of the tool tip.
The typical stiffness of a turning lathe is about 107 − 108 N/m. When a maxi-
mum error of one micrometre is specified, the change in the cutting force is
not allowed to be larger than ten Newton.
Another example, where high precision is required, is the inspection micro-
scope shown in Figure 3.2. In this example, the desired maximum measure-
ment error is 10 nm. This implies, that the maximum deviation xmax from
the ideal position between the microscope and the object, has to remain
smaller than that value. This requirement directly results in a minimum
value for the stiffness between the object and the microscope, because of the
influence of vibrations of the table top that the microscope is placed on.
The following assumptions are chosen for this example:
The upper part of the microscope has a mass of 5 kg and it is used in a
laboratory, where the measured vibrations on the table top show acceleration
levels up to 0.01 m/s2 . These accelerations have to be followed by the mass
of the upper part of the microscope which requires a certain force F ( t) that
will cause a deformation of the stiffness k that is not allowed to be larger
than the maximum error of x̂e = 10 nm. The required stiffness (k), to achieve
this maximum position error under these circumstances, can be calculated
using the second law of Newton:
d2 x
m
k≥
F ( t)
=
d t2 = 5 · 0.01 = 5 · 106 N/m (3.3)
x̂e ( t) x̂e 10 · 10−9
This value is directly related to the undamped natural frequency ω0 , as
90 Chapter 3. Dynamics of motion systems
Support frame
k
m Microscope
head
xe
Object
Vibraons xf
Table top
Figure 3.2: Schematic view of an inspection microscope. Vibrations in the table top
act on the entire instrument. Due to the mass and the limited stiffness
the microscope head will not be able to perfectly follow the accelerations
causing an alignment error xe with the object.
d2 x d2 x
Fa ( t) + Fd ( t) = m + kx = 0, =⇒ m = − kx (3.4)
d t2 d t2
The sign can be checked by the reasoning, that a deformation of the spring
in the positive x direction causes a force and a corresponding acceleration
in the opposite direction of the deformation. It is known, that in resonance
the body follows a sinusoidal movement x( t) = x̂ sin(ω0 t) with x̂ being the
amplitude.
With this information the following equation can be formulated:
From the calculated value of the required stiffness in Equation (3.3), the
related natural frequency of the system can be derived:
5 · 106
ω0 ≥ = 1000 [rad/s], corresponding with:
5
1
f0 ≥ 1000 = 160 [Hz] (3.7)
2π
From Equation (3.3) it can also be concluded, that a larger mass of the
upper part of the microscope requires a proportionally increased stiffness,
to achieve the same accuracy. This would mean that the natural frequency
is kept constant. It appears, that the natural frequency is directly related
to the performance with respect to the suppression of external vibration
disturbances. To show this in mathematical form, the behaviour of the
upper part is observed. This microscope head has to follow the sinusoidal
motion of the table top vibrations with position x( t) and acceleration ẍ( t)
equal to:
x( t) = x̂f sin(ω t) =⇒ ẍ( t) = − x̂f ω2 sin(ω t) (3.8)
The amplitude of the force F̂ acting on the following measurement head is
determined by the second law of Newton:
F ( t) = m ẍ( t) =⇒ F̂ = m x̂f ω2 (3.9)
The resulting maximum deformation of the connecting spring equals:
F̂ m x̂f ω2
x̂e = = [m] (3.10)
k k
With equation (3.6) k/m can be replaced by ω20 , resulting in the following
expression:
ω2
x̂e = x̂f (3.11)
ω20
The minimal value of ω0 to achieve a certain precision becomes:
x̂f x̂f
ω0 ≥ ω [rad/s], and f0 ≥ f [Hz] (3.12)
x̂e x̂e
This means, that the minimum natural frequency is proportional to the
frequency of excitation and to the square root of the ratio between the
excitation amplitude and the allowable error. In the next section this same
relation is shown in the performance of a feedback controlled positioning
system defined by the maximum tracking frequency, called bandwidth and
the position error ε. It will show, that these values are related to this
natural frequency in passive systems and is a preview to Chapter 4 on
motion control.
92 Chapter 3. Dynamics of motion systems
Radial tracking
Opcal disk
Focus
Hr Hf
kf Opcal
m
Pick-up
kr Unit
Figure 3.3: Schematic view of an Optical disk system with a “virtual” spring, cre-
ated by the position control system that connects the pick-up unit in
six orthogonal coordinate directions (translations and rotations) to the
track. From these directions two translation stiffness values are shown,
the radial stiffness kr and the focal stiffness kf that, together with the
mass of the pick-up unit, determine the tracking errors εr and εf under
the impact of eccentricity of the disk and external vibrations.
frequency with the same amplitude as the eccentricity. The forces due to
the eccentricity can be neglected, because of the quadratic relation between
acceleration amplitude and frequency.
For the calculation it is further assumed, that the movements caused by
the shocks can be approximated by a sinusoidal shape. The maximum
acceleration of these movements is then described by:
When the mass of the pick-up unit is 10−2 kg, the amplitude F̂ of the disturb-
ing force becomes 0.05 N and with this number and the required maximum
error εr of 0,2 μm the minimum radial stiffness kr can be calculated:
F̂ 0.05
kr ≥ = = 2.5 · 105 [N/m] (3.14)
εr 0.2 · 10−6
This would result in a natural frequency of:
ω0 1 k 1 2.5 · 105
f0 = = = ≈ 800 [Hz] (3.15)
2π 2π m 2π 0.01
When checking this frequency with equation (3.12), the same result is ob-
tained:
x̂ 200 · 10− 6
f0 ≥ f = 25 ≈ 800 [Hz] (3.16)
εr 0.2 · 10− 6
• The measurement system, that translates the deviation from the set
point into a voltage signal, Vm = G m εr .
94 Chapter 3. Dynamics of motion systems
Io Vc
Io GpVc Vc GcVm
Power amplier Controller
x F Vm
ms 2 Hr x
m Hr x
Moving body Vm G mH r
Measurement
Actuator kr
F Ga I o
• The control system, that converts this voltage into a negative feedback
voltage, Vc = −G c Vm .
The negative feedback causes the force to be opposite to the direction of the
deviation, just like a passive spring opposes a deformation. This means
that a position controller with a constant gain G t , acting on a body, creates
a virtual spring that behaves fully comparable with a spring in a passive
mass-spring system.
As will be shown in the following sections, a simple system with a body and a
spring needs damping to return to stand-still after excitation by a stimulus,
which is equally true for an active system with only a virtual spring. In
Chapter 4 on motion control it will be shown, how a differentiating action
can act as damper to stabilise a feedback controlled positioning system.
3.2. Mass-spring systems with damping 95
m x
x0
c F k
Staonary reference
Figure 3.5: A damped mass-spring system with an external force stimulus. The
reaction x on the force F is determined by the combined compliance of
the body (Cm ∝ 1/m), the spring (Cs ∝ 1/k) and the damper (Cd ∝ 1/c).
In the mass-spring system with damping from Figure 3.5 the applied force
is distributed over the three elements while these share the same position x
referenced to the position x0 , where the system is at rest when F = 0. Starting
with a spring, its compliance Cs is a simple expression assuming a position
independent spring constant:
x 1
Cs = = [m/N] (3.18)
F k
Note, that the sign is positive, as the displacement is in the same direction
as the externally applied force conform the Hooke – Newton law. The
compliance of a spring is a constant factor, so it is independent of time
and frequency. This means, that an amplitude Bode-plot would show a
straight horizontal line with magnitude Cs . This is called the spring-line. In
the phase plot the spring-line shows a phase of 0◦ , because the compliance
equation of the spring does not contain any (s) term in case one would
perform a Laplace transform on this constant.
For the damper the force equals in the time domain:
dx
F ( t) = c (3.19)
dt
After Laplace transform to bring the function to the frequency domain:
This results in the following expression for the compliance of the damper as
function of the radial frequency (ω):
x 1
C d (ω ) = = [m/N] (3.21)
F jcω
3.2. Mass-spring systems with damping 97
Total force
x
Ft (Z )
Body Ct Spring
x
Fm (Z ) mZ 2 x x Fs
x
kx
Cm Cs
Damper
x
Fd (Z ) jcZ x
Cd
Figure 3.6: In a mass-spring-damper system the force is divided over the three
elements according to the inverse of their compliance while sharing the
same position.
For the magnitude, the absolute value of the compliance is taken, which
means, that the damper has a compliance, inversely proportional to the
frequency. In a magnitude Bode-plot this would show up as a straight
line with a down slope of −1, the damper-line. This also corresponds with
what is understandable by pure reasoning, because the force of a damper
is proportional to the velocity and with a harmonic movement x = A sin ω t
the related differentiation of the position introduces the additional ω term
in the amplitude. This means, that with an increase of the frequency the
velocity increases if the position amplitude is kept constant. This also means,
that a proportional higher force is needed at higher frequencies to move
the damper. Special attention should also be given to the j term in the
denominator, which means to say, that the compliance of the damper has
an imaginary value and as a consequence the damper-line in the phase plot
shows a phase shift of −90◦ .
The third element, the body reacts on a force according to the second law of
Newton in the time domain:
d2 x
F ( t) = m (3.22)
d t2
After Laplace transform this equation becomes in the frequency domain:
This gives the following expression for the compliance of a body as function
of the radial frequency (ω):
x 1
C m (ω) = =− [m/N] (3.24)
F m ω2
98 Chapter 3. Dynamics of motion systems
Table 3.1: Overview of the dynamic properties of body, spring and damper in the
time and frequency domain. The Laplace transform is indicated by a
sign and the variable terms (t), (s) and (ω) are omitted for reasons of
simplification.
dx d2 x
External force Fs = kx Fd = c csx = jcω x Fm = m ms2 x = − mω2 x
dt dt2
x 1 x 1 1 x 1 1
Compliance Cs = = Cd = = = Cm = = =−
Fs k Fd cs jcω Fm ms2 m ω2
[m/N]
1 ◦ 1 1
Magnitude [m/N], ,0 , −90◦ , −180◦
k cω m ω2
Phase angle [◦ ]
1 1 k
Z0
k mZ02 m
Log |x/F|
x 1 Mass- line (-2)
Spring-line Cs
Fs k Damper- line (-1) x 1
Cm
x 1 Fm mZ 2
Cd
Fd cZ
0q
Spring-line 0q
Phase
Figure 3.7: The response of body, damper and spring, each represented as separate
elements by a straight line in a Bode-plot with amplitude and phase.
The red dashed line shows the combined compliance plot. The natural
frequency ω0 where the response will roll-off is found at the intersection
of the spring and mass magnitude lines.
This corresponds with the statement, that at any frequency the element
with the least compliance determines the total compliance.
To illustrate the individual effect on the Bode-plot of each element, each
response is drawn separately in the amplitude and phase part of a Bode-plot
as shown in Figure 3.7. In this Plot the frequency is noted in the angular
frequency (ω), because of the clear relationship with the mathematical
expression for the compliance terms for each element.
The magnitude of the compliance of the combined mass-spring-damper
system can now be derived by taking the lowest value of the individual
elements depending on the frequency region. At low frequencies, the spring
determines the response of the system with the horizontal spring-line at
a compliance magnitude-level of 1/k. At high frequencies the behaviour is
fully determined by the body, according to its mass-line with a magnitude
level of 1/mω2 . The roll-off starts at the intersection of these lines at the
natural frequency ω0 . This can be concluded from the following reasoning.
100 Chapter 3. Dynamics of motion systems
Log |x/F|
Log |x/F|
0q 0q
Phase
Phase
180q 180q
0 0
Log Log
a: No damping b: Low damping
Log |x/F|
Log |x/F|
0q 0q
Phase
Phase
180q 180q
1 0 2 1 0
Log Log
c: High damping d: Extreme damping; first order system
Figure 3.8: The Bode-plots of the response of a mass-spring system with different
levels of damping shows the effect on the resonance at the natural
frequency ω0 .
In the mid frequency range, the damper determines the behaviour of the
system. Depending on the level of the magnitude (1/cω) of its corresponding
damper-line, the phase shows a gradual combination of the phase shift of
the different elements.
To get an initial idea about the effects of different levels of damping four
examples are shown in Figure 3.8.
Bode plot 3.8.a shows the response of mass-spring system without damping.
This is the situation where the assumption on the upper limitation of the com-
pliance is no longer true. At the natural frequency ω0 the amplitude of the
system increases by absorbing energy and loading it in alternating potential
energy (spring) or kinetic energy (mass), resulting in an infinite compliance.
At this natural frequency the phase jumps from 0◦ to −180◦ without gradual
transition. In the next section this effect will be better explained with the
equations of motion.
3.2. Mass-spring systems with damping 101
d2 x dx
F ( t) = m +c + kx (3.28)
d t2 dt
From this equation the following relation for the total compliance Ct (s) can
be derived:
x 1
C t ( s) = =
F ms2 + cs + k
1
= m
k [m/N] (3.30)
2 cs
s + +1
k k
To qualify this expression, three entities are introduced, the damping ratio ζ,
the spring compliance Cs and the un-damped4 natural frequency ω0 . Of
these the damping ratio is a new term. It is a dimensionless number as
4 The un-damped natural frequency is mostly named just the natural frequency, which is
not completely correct as at higher damping values the peak in the compliance response will
shift slightly to a lower frequency as will be presented a bit further on in this chapter.
3.2. Mass-spring systems with damping 103
it gives the ratio between the actual damping coefficient c and the critical
damping coefficient c 0 defined by the level where the step response will
not show overshoot anymore, as will be shown in Figure 3.12 in the next
section. This is the situation when this second-order mass-spring system
just behaves like a combination of two equal first-order systems. It will be
shown later, that this critical damping coefficient equals 2 km.
To summarize the derived variables:
c
ζ= (3.31)
2 km
1
Cs =
k
k
ω0 =
m
Using these terms in Equation (3.30) gives the following:
x Cs
C t ( s) = = 2 [m/N] (3.32)
F s s
+ 2ζ +1
ω20 ω0
The amplitude is the absolute value of the vector, being the square root of
the sum of the squared vales of the real part and the imaginary part, as
shown in the Nyquist plot of Figure 3.9. This means, that the amplitude
equals:
x Cs
|C t | (ω) = = (3.34)
F 2
2 ω 2
1− ω + 2ζ
ω20 ω0
O Real axis
Imaginary axis
x
Ct
F
The angle for a negative real value and a positive imaginary value equals
180◦ minus the above calculated arctangent value. A negative value for both
gives an angle of 180◦ plus the above calculated arctangent value while a
positive value for the real term and negative value for the imaginary term
gives a phase angle of 360◦ minus the calculated arctangent value.
For this damped mass-spring system the numerator of Equation (3.32) is
constant without imaginary part. As a consequence the phase angle of the
numerator is zero which means that the phase angle is only determined by
the denominator with a negative sign added to the above calculation. This
all is summarised in the following equation which works in two quadrants
because the imaginary term is always positive and only the real term has
two signs, positive at a frequency below ω0 and negative at a frequency
3.2. Mass-spring systems with damping 105
>^
10 JQJJ\JJ
JQJ\J
JQ^\^
>Z
10 JQ\
JQ\
Compliance [m/N]
>
^JQ^\
10
>\
10
>`
10
>{
10
0
Phase [deg]
>|J
>}J
1 ^ Z
10 10 10
Frequency [Hz]
above ω0 :
⎛ ⎞ ⎛ ⎞
ω ω
⎜ 2ζ ⎟ ⎜ 2ζ ⎟
0 ⎜ ω0 ⎟ ⎜ ω0 ⎟
φtot = φnum − φden = arctan − arctan ⎜
⎜ 2
⎟ = − arctan ⎜
⎟ ⎜
⎟ (3.37)
⎟
Ct ⎝ ω ⎠ ⎝ ω2 ⎠
1− 2 1− 2
ω0 ω0
To show the amplitude and the phase of this transfer graphically, an example
damped mass-spring system is defined, that will be used throughout most
of this section. The spring has a spring constant k of 1 · 104 N/m and the body
has a mass m of 0.025 kg, giving a natural frequency of:
1 k 1 1 · 104
f0 = = = 100 [Hz] (3.38)
2π m 2π 0.025
The maximum value of the peak at for instance ζ = 0.25 is not at ω0 , but
at a slightly lower frequency, which behaviour will be explained in the
following section. It is also shown, that Equation (3.35) is correct, as for
instance the resonance peak at ζ = 0.005 has a magnitude of a factor hundred
above the magnitude of the spring-line. This factor hundred is equal to the
variable Q , the quality factor, that equals Q = 1/2ζ. This variable will be
further presented later, after the next section.
d c ( s) = ( s − p 1 )( s − p 2 ) (3.39)
where p 1 and p 2 are the two poles. Generally these poles are complex
numbers and come as two conjugate complex terms, written as:
p 1 = −σ + j ωd and p 2 = −σ − j ωd (3.40)
where σ is the real part of the pole and the imaginary part contains ωd ,
the real resonance frequency in the damped situation. When these are
combined in the generalised description of the denominator, it becomes:
x C s ω20
C t ( s) = = (3.44)
F csω20
s2 + + ω20
k
For the pole locations only the denominator part is relevant:
csω20
d c ( s) = s2 + + ω20 (3.45)
k
determine the poles the mid term is changed by replacing one factor ω0
To #
by k/m:
$
k
cs ω0
m csω0
d c ( s) = s2 + + ω20 = s2 + + ω20 (3.46)
k km
When the middle and the last term of this equation are compared with the
corresponding terms in the expanded version of equation (3.41):
then the following equations are obtained that relate the real and imaginary
value of the pole to ω0 :
c ω0
σ= and ω20 = σ2 + ω2d (3.48)
2 km
The second relation is a circle equation that can be written as follows to
obtain the imaginary term ωd of the pole:
%
σ2
ωd = ω0 − σ = ω0 1 − 2
2 2 (3.49)
ω0
108 Chapter 3. Dynamics of motion systems
Im
]< 1
]= 1
]> 1
]> 1 Re
]< 1
Figure 3.11: Pole locations of a damped mass-spring system in the complex plane.
Depending on the damping ratio the poles are either real numbers,
that represent an overly damped system consisting of two first-order
systems, or complex conjugate numbers with less damping. The mag-
nitude of the real part relative to the imaginary part determines the
amount of damping in the system.
>
x 10
1.6
0.6
0.4
0.2
0
0 0.005 0.01 0.015 0.02 0.025 0.03
Time [s]
Figure 3.12: Position response of a mass-spring system in the time domain with
k = 1 · 104 N/m and m = 0.025 kg to a step Force of 1 N with different
levels of damping.
From the above equations it follows that increasing the damping will shift
the real resonance frequency ωd below the un-damped natural frequency ω0 ,
until the real resonance frequency becomes equal to zero at ζ = 1. As a direct
consequence, the overshoot at a step response is zero, as can be seen in
Figure 3.12.
110 Chapter 3. Dynamics of motion systems
3.2.3.3 Quality-factor Q
T
1 2 1 2π
E l ( t) = v̂2 sin2 ω0 t d t = c v̂ T = c v̂2 (3.55)
2 2 ω0
0
#
With ω0 = k/m and using above equations the energy ratio (R e ) equals:
1
Es m v̂2 m ω0 km
Re = = 2 = = (3.56)
El 1 2 2π 2π c 2π c
c v̂
2 ω0
#
Because Q = 1/2ζ = km/c the following conclusion is valid:
Maximum energy stored
Q = 2πR e = 2π (3.57)
Energy lost per cycle
Note, that the mentioned “maximum energy stored” is the energy that
is present in the system at each specific cycle and this value decreases
continuously as a consequence of the dissipated energy in the damping per
cycle.
112 Chapter 3. Dynamics of motion systems
1
10
= 0.005 Q = 100
= 0.05 Q = 10
JQ^\^
0
Velocity response [m/Ns] 10
= 0.5 Q = 1
=1 Q = 0.5
^JQ^\
>
10
>^
10
>Z
10
Phase [deg]
>|J
1 ^ Z
10 10 10
Frequency [Hz]
time of the time domain transfer function of the position. In the frequency
domain this differentiation is equal to the multiplication of the system
compliance Ct (s) with s:
v sx( s) sC s
( s) = = sC t ( s) = 2 (3.58)
F F s s
2
+ 2ζ +1
ω0 ω0
The peak level at for instance Q = 100 can be checked by taking the earlier
found motion amplitude of 10 mm and multiply that value with ω to get
the maximum velocity. At 100 Hz = 628 rad/s this results in a maximum
velocity of 6.28m/s, that corresponds with the level of the modelled peak in
the Bode-plot.
As a closure of this part on damping and energy it is illustrative to draw a
plot of the velocity squared over a small area around the natural frequency.
Because the stored kinetic energy equals E = 0.5mv2 such a plot gives the
relative energy levels at different values of Q . In Figure 3.14 the relative
energy level is shown of the discussed system with f 0 = 100 Hz. The energy
114 Chapter 3. Dynamics of motion systems
4
10 10000
Q = 100
Q = 10
Q=1 9000
3 8000
10
7000
''
''
6000
2
10
5000
4000
1
10
3000
2000
0
10 1000
Figure 3.14: Relative energy level as function of Q at a small band around the
natural frequency of 100 Hz. The narrow peak at high levels of Q is
especially prominent, when the phenomenon is presented on a linear
scale.
m xm
c k
Table xf
Staonary reference
Figure 3.15: The transmissibility of a dynamic system reflects the sensitivity for
external movements of one body, in this case the table, to another
body.
After the Laplace transform this force equation equals in the frequency
domain:
xm ( ms2 + cs + k) = xf ( cs + k) (3.61)
116 Chapter 3. Dynamics of motion systems
10
= 0.005 Q = 100
= 0.05 Q = 10
10 = 0.25 Q=2
Transmissibility (abs) = 0.5 Q=1
0
10
10
10
10
10
0
Phase [deg]
>|J
>}J
> 0 1 2 3
10 10 10 10 10
Frequency [Hz]
The mechanics in real precision positioning systems are far more complex
than the one-dimensional single mass-spring-damper configurations, that
were presented in the previous section. When systems must be modelled
in multi dimensions, all equations need to be written in a six coordinate
direction vector and matrix notation, where also cross couplings between the
different directions can be taken into account. These equations often become
too complicated for straightforward analytical calculations and in practice
they are always done by using computer simulation software like MATLAB
and ANSYS. As explained before, this book focuses on the physical under-
standing of observed phenomena with their basic mathematical modelling,
rather than detailed multidimensional calculations. For this reason the use
of vectors is restricted to the bare minimum. That being said, even in one
direction realistic systems generally consist of a multitude of bodies and
springs and it is very important to be able to qualify an quantify the effects
of these coupled dynamics from a mechatronic design perspective. First of
all, elastically coupled bodies heavily influence the possibility to create a
stable controlled motion system, due to their related resonances. Secondly,
the measurement position in an active positioning system is hardly ever
at the same location as the actuator. For these reason this chapter will be
rounded off by presenting two methods to describe the behaviour of a higher
order mechanical system that consists of more than one body connected by
springs and dampers. The first method builds on the previous analytical
approach with equations of motion, that describe the movement of the bodies.
It is shown how these interact with each other. It also becomes clear, that
the methodology complicates quickly up to the level, that a real analytical
analysis without making errors is not realistic anymore. To cope with this
problem, the second method based on the addition of eigenmodes, with their
corresponding mode-shapes and eigenfrequencies is introduced. This has
proven to be a very powerful method to describe the dynamic properties
of real complex machines in such a way, that it more easily connects with
the design space of mechanical engineers. To illustrate the relation with
measurement even further also an example is shown, that works in two
dimensions.
3.3. Multi-body dynamics and eigenmodes 119
x
x1 x2
k
F m1 m2
Staonary reference
Figure 3.17: Mass-spring system with two bodies with mass m 1 and m 2 connected
by a spring with stiffness k. A force F excites the first body and the
resulting motion is measured at both bodies.
d 2 x1 ( t )
m1 = F ( t) − k( x1 − x2 ) (3.65)
d t2
Note, that the force from the spring acts on both bodies in the opposite
direction and the external force only works on the first body. For the second
body the force balance according to the second law of Newton equals in the
120 Chapter 3. Dynamics of motion systems
time domain:
d2 x2 ( t)
m2 = k( x1 − x2 ) (3.66)
d t2
After Laplace transform these differential equations become in the frequency
domain respectively:
m 1 s2 x1 ( s) = F − k ( x1 − x2 ) (3.67)
2
m 2 s x2 ( s) = k ( x1 − x2 ) (3.68)
From these two equations the following transfer functions in the frequency
domain between x1 respectively x2 and F can be derived by first using
Equation (3.68) to write x2 as function of x1 and use that result to solve
Equation (3.67):
x1 m 2 s2 + k
( s) = (3.69)
F m 1 m 2 s4 + k ( m 1 + m 2 ) s2
x2 k
( s) = (3.70)
F m 1 m 2 s4 + k ( m 1 + m 2 ) s2
This means, that at these high frequencies the first body will respond
according to the mass-line of only its own mass, while the second body will
respond with a slope of −4 corresponding with its inability to follow the
movement of the first mass. This is called the decoupling of the second body.
The transfer functions are written in a polynomial form, that is common in
control engineering environments using dedicated mathematical software
like MATLAB. To investigate the response at the intermediate frequency
range in an analytical way, it is better to write the equations in a different
form.
3.3. Multi-body dynamics and eigenmodes 121
• m 1 < m 2 : This is for instance the situation of a body with a large mass
( m 2 ), actuated by a less heavy actuator (m 1 ), that is connected by
means of a flexible mount.
*^ *^
>Z >Z
Compliance [m/N] 10 10
> >
10 10
>\ >\
10 10
11 *
>` >`
10 10
>{ >{
10 10
0 >}J
Phase [deg]
>|J >^{J
>}J >Z`J
1 ^ Z 1 ^ Z
10 10 10 10 10 10
Frequency [Hz] Frequency [Hz]
Figure 3.18.b where the magnitude of the transfer function of the second
body at f a = 100 Hz equals 10−4 , which is equal to the compliance of the
connecting spring.
At f = f 2 both bodies will resonate while the movement of the first body
is 180◦ out of phase with respect to the movement of the second body as
they move in opposite directions. When f > f 2 the slope of the Bode-plot
of the first body will continue at −2, but at a higher level than found by
extrapolating the slope at f f a . This is caused by the decoupling of the
second body, that does no longer join the movement of the first body. As a
consequence the slope of the response of the second body becomes twice as
steep (−2) as the slope of the first body. The compliance of the first body
124 Chapter 3. Dynamics of motion systems
>Z >Z
10 10
Compliance [m/N]
> >
10 10
>\ >\
10 10
>` >`
10 10
>{ >{
10 10
0 >}J
Phase [deg]
>|J >^{J
>}J >Z`J
1 ^ Z 1 ^ Z
10 10 10 10 10 10
Frequency [Hz] Frequency [Hz]
Figure 3.19: Bode-plot of the response of a dual mass-spring system with bodies
having an equal mass m 1 = m 2 = 25 · 10−3 kg and k = 1 · 104 N/m. These
values result in a natural frequency of 141 Hz where both bodies move
in the opposite direction with equal amplitude. The ’anti-resonance’
measured at the first body is not changed compared with Figure 3.18.
>Z >Z
Compliance [m/N] 10 10
> >
10 10
>\ >\
10 10
>` >`
10 10
>{ >{
10 10
0 >}J
Phase [deg]
>|J >^{J
>}J >Z`J
10 1 10 ^ 10 Z 10 1 10 ^ 10 Z
Frequency [Hz] Frequency [Hz]
Figure 3.20: Bode-plot of the response of a dual mass-spring system with the mass
of body one being ten times larger than the mass of body two. For this
example m 1 = 0.25 kg, m 2 = 25 · 10−3 kg and k = 1 · 104 N/m, resulting
in a natural frequency of 105 Hz, very close to the ’anti-resonance’.
Note, that the overall compliance is decreased due to the increase of
the total mass.
k
F m1 m2
Staonary reference
a: Mode shape 1
F m1 m2
Staonary reference
b: Mode shape 2
Figure 3.21: The two eigenmodes with the corresponding mode-shapes of a dual
mass-spring system with a connecting spring. The first eigenmode is
the linear uniform motion in one direction at the first “eigenfrequency”
f 1 = 0 Hz. The second eigenmode is the elastic movement of the two
bodies opposite to each other at the second eigenfrequency f 2 . The
drawn mode direction is only one of the possible directions as the
movement is periodically reciprocating.
that depends on the mass ratio. One can imagine the movements of these
eigenmodes as stable oscillations without external forces to keep them going.
A constant movement will always continue at the absence of external forces
and the same goes for a once excited resonance without damping. It is this
straightforward imagination possibility, that makes their use so very valu-
able in practice. This is even more so because these modes are independent
as long as the system behaves linear. Under that condition their individual
response to a stimulus can be simply added to give to response of the total
system. In practical designs this linearisation is often allowed because of
the small deformations involved in precision mechatronic equipment.
As a last example of the analytic equations of motion, the related two transfer
functions are derived to illustrate this superposition principle. Starting
with a slightly different notation of Equation 3.73 the following equation is
obtained for the position of the first body in the frequency domain:
x1 1 1 m 2 s2 + k
( s) = (3.77)
F ( m 1 + m 2 ) s2 Ms2 + k
To convert this into two terms with 1/s2 , that can be added, as a first step Ms2
3.3. Multi-body dynamics and eigenmodes 127
will both be added and subtracted from the numerator of the last equation:
x1 1 1 ( Ms2 + k) + ( m 2 − M ) s2
( s) =
F ( m 1 + m 2 ) s2 Ms2 + k
1 1 m2 − M
= +
( m 1 + m 2 ) s2 Ms2 + k
⎛ ⎞
m2
1 ⎜1 − 1 ⎟
= ⎜ + M ⎟ (3.78)
(m1 + m2 ) s ⎝ 2 k ⎠
2
s +
M
With M = m 1 m 2/( m
#1 + m 2 ) and the natural angular frequency of the second
mode-shape ω2 = k/M , the total transfer function becomes written as the
addition of the transfer functions of two separate mass-spring systems:
x1 C 1 C2
= 2 + (3.79)
F s s + ω22
2
With:
1 1 1 1 m2
C1 = and C2 = − = (3.80)
m1 + m2 m1 m1 + m2 m1 + m2 m1
a: Body 1 b: Body 2
>^
10 10
>Z >Z
10 10
Compliance [m/N]
> >
10 10
>\ >\
10 10
>` >`
10 10
Mode shape 1
Mode shape 2
>{ >{
Total system
10 10
0 0
Phase [deg]
>|J >}J
>}J
1 2 Z
>Z{J 1 2 Z
10 10 10 10 10 10
Frequency [Hz] Frequency [Hz]
Figure 3.22: Bode-plot of the response of the same dual mass-spring system of
Figure 3.18 with m 1 = 2.5 · 10−3 kg, m 2 = 25 · 10−3 kg and k = 1 · 104 N/m,
resulting from the combination of the two mode-shape responses. The
response of the first body (a:) is the result of the addition of both
mode-shape responses, while the response of the second body is the
result of the subtraction of the second mode-shape from the first mode-
shape. The difference between the responses of both bodies in the
spring-line of the second mode-shape is the direct consequence of the
difference in mass. The largest mass m 2 will get the smallest part
of the spring corresponding with a lower compliance. Also the ’anti-
resonance’ appears to be no resonance at all, as it is the result of the
combination of two equal amplitudes with 180◦ phase difference.
and thus the two contributions will add to zero. This makes clear, that the
term ’anti-resonance’ is badly chosen as it is only the result of two equal
signals in counter phase. Similarly the transfer function x2/F of the second
body can be determined out of the two separate transfer functions. In the
second mode-shape the second body moves in the opposite direction from
the first body, which means, that the response of the second mode-shape
has to be subtracted from the response of the first mode-shape to achieve
the total response. Due to this phase inversion the anti-resonance does not
occur. At higher frequencies beyond f 2 the phase of the second eigenmode
is 180◦ delayed. Because then both components have approximately the
same magnitude and an opposing phase, the combined magnitude goes to
zero, leading to the −4 slope for high frequencies with the corresponding
−360◦ phase relation.
The example of the previous section consisted only of a simple two body
mass-spring system, without a connection to the stationary world. Generally
the dynamics of a positioning system are more complex. When for instance a
spring is added in Figure 3.17, that connects the first body to the stationary
& '
reference, the transfer function of the first mode-shape becomes C1/ s2 + ω21 ,
similar to the second mode. When measuring the real dynamic transfer
function of a mechanical system, one could obtain a Bode-plot, like shown
in Figure 3.23. The first mode corresponds with the mass of 2.5 kg of
this example system, connected by a spring to the stationary world with
a stiffness k = 104 . At higher frequencies several other resonances are
visible, each corresponding to a different eigenmode, caused by several parts
connected to the main body by supports with a certain stiffness. All of these
eigenmodes add another two orders to the analytical differential equation
and this complicates the analysis considerably. Even without a thorough
higher-order analysis it is often possible to estimate the cause of these
eigenmodes, given the mass of the different parts in the structure. With
this information the mechanical designer can adapt the dynamic behaviour
to the required properties.
Unfortunately, frequently the resonances are due to three dimensional
deformations of the body itself, like shown for instance in Figure 3.24. In
such cases Finite Element Modelling of the dynamics, with software like
ANSYS, is necessary to calculate the eigenfrequencies and mode-shapes of
each eigenmode. Practical verification is done with a method called modal
analysis. In this method the response to a dynamic stimulus is measured
130 Chapter 3. Dynamics of motion systems
>^
10
>
10
Compliance [m/N]
>`
10
>}
10
>J
10
>^
10
0
Phase [deg]
>|J
>}J
0 1 ^ 3
10 10 10 10 10
Frequency [Hz]
Figure 3.24: One of several mode-shapes of a solid body as calculated with Finite
Element Modelling software. The red lines indicate the undeformed
shape. Excitation of this torsional eigenmode is often extremely diffi-
cult to avoid by other means than a careful lay out of the actuator and
sensor locations.
In order to link this chapter with the following chapter on motion control, the
negative effect of eigenmodes on the dynamic performance of a controlled
motion system will be elaborated a little more from the aspect of phase
in relation to the position of actuators and sensors. These positions have
their influence on the observability and controllability of an active controlled
system which refers to the possibility to measure and control the eigenmodes
of a dynamic system.
In the previous sections on the two body mass-spring system it was shown,
that measuring the position at the second body results in a maximum phase
shift of −360◦ . This is far above the maximum of −180◦ , that is required
for stability in a feedback loop, as will be shown in the next chapter. Also
Figure 3.23 might look all right from a phase perspective but as soon as the
measurement system is connected to one of the decoupling masses instead
of directly at the actuator, the resulting phase delay would be significant.
This phase problem by eigenmodes is also observed in Figure 3.24, as the
shown torsional mode represents a movement, that is different for all parts
of the body both in phase and amplitude.
With the more simplified example of Figure 3.25 it will be illustrated, how
these eigenmodes and their corresponding frequency response measurement
are impacted by the location of the actuators and sensors. It emphasises the
132 Chapter 3. Dynamics of motion systems
F F
x x
Figure 3.25: The mode-shapes of the two main, in plane, eigenmodes of a body,
connected to the stationary reference by means of a leaf-spring. The
first mode-shape consists of a swinging movement around a pole close
to the stationary reference and the second mode-shape is a reciprocat-
ing rotation around the centre of mass. Depending on the location of
actuation (F ) and sensing ( x), the observed dynamic behaviour of the
system is different, observed primarily with the second mode-shape.
importance of the right choice of this location in an actual design. This two
dimensional model consists of a rigid body connected to the stationary refer-
ence by means of a mass-less ideal leaf-spring with only two mode-shapes
of interest, a “swinging” movement around a pole close to the connection of
the leaf-spring with the stationary world and a rotation around the centre
of mass of the body. The method to combine the response of eigenmodes
of a single body is identical to the method of the combined eigenmodes of
a multiple body system, because in both cases the mode-shapes of the dif-
ferent eigenmodes are independent. This means, that they can be linearly
combined to give the resulting overall behaviour, as long as the coordinate
system of all mode-shapes is identical. This does not necessarily mean, that
the results are identical and it will be shown, that especially with different
locations of actuator and sensor the response becomes different.
A periodic force stimulus F with frequency f is exerted on the top side of the
body. To illustrate the difference in perceived behaviour, the measurement
of the movement is done at different positions between the bottom side and
the topside of the body.
For the calculation of the eigenfrequencies and the Bode-plot, some values
are chosen as example for the different parameters. The mass (m) of the
body equals 0.05 kg and the bending stiffness (k) of the leaf-spring for the
3.3. Multi-body dynamics and eigenmodes 133
>Z >Z
Compliance [m/N] 10 10
> >
10 10
>\ >\
10 10
>` >`
10 10
Mode shape 1 Mode shape 1
^ ^
>{
^ >{
>^
10 10
0 0
Phase [deg]
>|J >}J
>}J
1 ^ Z
>Z{J 1 ^ Z
10 10 10 10 10 10
Frequency [Hz] Frequency [Hz]
Figure 3.26: Bode-plot of the response of the two mode-shapes of Figure 3.25. The
left graph (a:) shows the measured behaviour when sensing at the
same location as the force, while the right graph (b:) shows the mea-
sured response when sensing at the bottom of the body. In that situa-
tion the ’anti-resonance’ is disappeared but both resonances remain
present.
above for the position of the actuator. As long as in the example this position
is in line with the centre of mass, the second eigenmode will not be excited
by the actuator. This optimal position from the point of view of avoiding
resonances in eigenmodes comes at a price in a reduced controllability
of the system. Even if the controller would have information about this
eigenmode it will not be able to control it because it is missing an actuator
at the right location. This can be solved with a separate actuator, only
intended to control this eigenmode, but more often very strict measures are
taken to prevent any other source of interference. This is one of the many
important attention points, when designing complex mechatronic systems
with extreme precision.
3.3.2.3 Summary
In this chapter some important lessons can be learnt, that are summarised
as follows:
• The position of the actuator and the sensor determine the observability
and controllability of the different eigenmodes.
136 Chapter 3. Dynamics of motion systems
Motion Control
Introduction
137
138 Chapter 4. Motion Control
Informaon Energy
Guidance
Output disturbance
signal
Input
Feedforward disturbance Process
disturbance
Output
Pre-lter Feedback D/A Plant
-
Sensor disturbance
A/D sensor
Figure 4.1: Block diagram of a motion control system, including feedforward and
feedback control. The plant consists of the power amplifier, actuator
and the mechanical dynamics. The diagram clearly shows the different
places where interfering disturbances impact the system.
Figure 4.1 shows a basic control loop of a positioning system where the
control part consists of a feedforward path and a feedback path. Control
systems could in principle consist only of a feedforward or feedback path, but
generally a combination is applied as each principle has its own advantages
and disadvantages, that are separately discussed in the following sections.
For simplicity this chapter is limited to linear dynamics which fortunately
holds for most practical precision motion control systems. In most cases also
a non-linear motion system can be linearised around a chosen operating
point.
The plant is a control engineering term for the uncontrolled physical system.
In a positioning system this consists of the power amplifier, actuator, and
the mechanical structure, as shown in Figure 4.2.
Each of these elements has its own inherent dynamic properties. They inter-
act in both directions in such a way that each element not only determines
the input of the next element but also influences the previous element by
its dynamic load. Often the plant is called just simply the “system”, which
can be confusing when it is not specified, as it can mean the controlled or
uncontrolled system, the control system or whether it includes the sensor
or not.
Also in this book the term ”system“ is more generally used and hence not
precisely defined. It can for instance be the mechanical mass-spring system
4.1. A walk around the control loop 139
Process
disturbance
v Mechanical x
Amplier Actuator
structure
Figure 4.2: In a mechatronic system the plant consist of the amplifier, the actuator
and the mechanical structure. Sometimes also the sensor is included
with the related electronics. The elements that transfer energy interact
with each other in two directions. For instance the amplifier determines
the input of the actuator, while the actuator determines a dynamic
“load”, that influences the behaviour of the amplifier. The same goes for
the actuator and the mechanical structure.
from the previous chapter, the control system or even the full controlled
mechatronic system including the sensor. Its meaning will however be clear
from the context.
As shown in Figure 4.1, the functional blocks that determine the plant with
the related disturbances are represented in red and the functional blocks
forming the control-system are represented in yellow.
It is necessary to clearly distinguish the domain of each functional block,
because everything that is located at the left side from the D/A and A/D-
converters, including the converters themselves, is only involved with in-
formation exchange. This means that these blocks can be interconnected
in serial or parallel configurations, without changing the properties of the
individual block. On the other hand however, the blocks that are shown
in red represent physical systems or sub-systems where energy exchange
is involved. This implies that changes in one sub-system (or block) may
also change the dynamics of the blocks that it is interacting with. A simple
example is a motor that behaves differently whether a mechanical load is
connected to it or not. These aspects of system thinking have to be consid-
ered carefully when interconnecting mechatronic sub-systems, including
the control system.
For analysis and discussion of the control part, first a short section on poles
and zeros will determine the red thread through the chapter, then feedfor-
ward and feedback control will be presented separately.
140 Chapter 4. Motion Control
In Section 3.2.3 of the previous chapter the pole of a transfer function was
introduced for those values of the Laplace parameter s = σ ± j ω where the
denominator of the transfer function becomes equal to zero. It was used to
explain the relation between the location of the poles in the Laplace plane
and the damping ratio ζ of the mass-spring system. It was shown that, when
the poles are located in the left-half of the Laplace plane, the ratio between
the real part and the imaginary part will determine the amount of damping
in the mass-spring system.
Many passive mechanical systems, like the damped mass-spring systems
from the previous chapter, are stable in the uncontrolled, open-loop situation.
In dynamic systems stability is defined by the ability of the system to be
insensitive to external stimuli or to return to a defined equilibrium situation
after the application of a stimulus.
There are also examples of passive mechanical systems that are not stable,
like an inverted pendulum or a piece of iron between two permanent magnets.
These systems can not be stably positioned at any location between two (or
more) extreme positions as they show a negative stiffness. To illustrate the
effect in the frequency domain the transfer function can be derived from the
general compliance transfer function of a standard mass spring system.
The compliance without damping was shown to be equal to:
x 1
C t ( s) = = 2
(4.1)
F ms + k
A spring with a negative stiffness implies that k has a negative value and as a
consequence
# the two poles of this transfer function are both real with a value
of ± k/m. One of these values is positive which means that the location of
this pole is on the real axis in the right-half of the Laplace plane, indicating
an instable pole. The related instability is manifested by a increasingly
accelerated movement away from the zero position as is shown in Figure 4.3.
With feedback control such an unstable system can be stabilised by shifting
the positive pole from the right to the left-half of the Laplace plane. This
effect can be explained with the example of Chapter 3 where proportional
negative feedback is shown to create a spring with a positive stiffness for the
positioning of an optical pick-up unit of a CD player. When this principle of
negative feedback is applied on a mechanical system with a negative stiffness
4.1. A walk around the control loop 141
like the magnetic bearing that will be presented later in this chapter, this
positive stiffness can compensate the negative stiffness when the total loop
gain is sufficiently high. When the positive stiffness by the feedback is even
higher than the negative stiffness of the mechanical system, the resulting
positive stiffness creates a normal mass-spring system with two conjugate
complex poles on the imaginary axis and this transformation implies that
the originally positive pole is shifted towards the left-half of the Laplace
plane by the negative feedback. The addition of damping by differentiating
the feedback signal, as will be explained later, will move the poles even
further to the left and such a magnetic bearing will behave just like any
other well-controlled positioning system.
With active dynamic systems the presence of elements that add energy to the
plant like amplifiers and actuators can also induce instability in a normally
stable passive system, as can also be explained with a simple example.
In the previous chapter it was demonstrated that a negative proportional
feedback loop creates a system with the same behaviour as would occur with
a mechanical spring. This means that by only reversing the sign (= 180◦
phase) of this feedback loop, a negative stiffness would be created with the
inherent instable pole in the right-half of the Laplace plane.
In dynamic systems the phase in the loop is a function of the frequency and
in mechanical systems phase lag is present due to the influence of masses
and damping and higher order eigenmodes. Ultimately these effects result
in a large enough phase lag to turn the system into an unstable situation
even with negative feedback. For this reason the analysis of the poles is
of prime importance when dealing with active control of dynamic systems.
Figure 4.3 gives an overview of the impulse response of a dynamic system
for different locations of the poles in the Laplace plane. It clearly underlines
the observed phenomena. The location on the real axis corresponds with
non-oscillating gradually changing responses, either to a stable position for
poles in the left-half-plane or to infinity for poles in the right-half-plane. The
imaginary part of the pole determines the oscillatory behaviour in relation
to the real part. The magnitude of the imaginary part ( j ω) determines the
oscillation frequency and the ratio between the imaginary part and the real
part determines the damping ratio.
142 Chapter 4. Motion Control
Pole-Zero Map
10
4
Imaginary Axis
-2
-4
-6
-4 -3 -2 -1 0 1 2 3 4
Real Axis
Figure 4.3: The impulse response of a dynamic system as function of the pole
locations in the Laplace plane. In the left-half-plane the response
returns asymptotically to a zero deviation value while in the right-half-
plane the response grows exponentially to infinity. For simplicity only
the positive imaginary axis is shown but any pole with an imaginary
part also has a conjugate complex counterpart mirrored around the
real axis.
The counterpart of the poles are the zeros. They are not detrimental to
stability as they correspond with a value of s where the numerator of the
transfer function becomes equal to zero. As an example, the anti-resonance
of the coupled mass-spring system is a pair of conjugate complex zeros of the
transfer function, as shown in Equation (3.73) in the previous chapter. The
drawback of a zero is the lack of reaction to a stimulus at that frequency.
It can, however, be beneficially used in feedforward control because zeros
can compensate poles as can be seen at the following generalised transfer
function written in pole-zero notation where p represent the poles and z
represent the zeros:
( s − z1 )( s − z2 )( s − z3 )( s − z4 )( s − z5 )....( s − z m )
G ( s) = (4.2)
( s − p 1 )( s − p 2 )( s − p 3 )( s − p 4 )( s − p 5 )....( s − p n )
4.1. A walk around the control loop 143
Input disturbance
r Feedforward u v Plant y
Cff (s) G(s)
Figure 4.4: Block diagram of a feedforward-controlled motion system with one input
and output (SISO). The transfer functions Cff (s) of the feedforward
controller and G(s) of the plant are frequency dependent, which is
denoted by the Laplace variable s.
Figure 4.4 shows the typical basic configuration for feedforward control,
which sometimes is also called open-loop control. This example has only one
input and output variable which is in control terms a Single Input Single
Output (SISO) system.
It is important to note that in mechanical engineering a SISO positioning
system is also called a single degree of freedom system, relating to a specific
direction in an orthogonal three dimensional spatial coordinate system.
Unfortunately the term “degree of freedom” can lead to confusion in position
control of mechanical systems, because in control terms, feedforward and
feedback are considered as two degrees of freedom even though it relates to
one direction. For that reason the term is avoided further in this chapter.
The reference or guidance signal1 r is applied to the controller that has a
frequency dependent transfer function Cff (s). The output u of the controller
is connected to the input of the motion system that has a transfer function
1 According to the domain notation that was defined in Chapter 2 the ( t), ( s), (ω) and ( f ) terms,
that define whether the equation is in the time or the frequency domain, are only mentioned
once before the equal sign. One exception to this rule is applied: When transfer functions are
included without expanding them, these functions will also show an indicating term in order
to distinguish them from time invariable functions. The terms are never shown with variables
because these are variable by nature according to the domain that the function describes.
144 Chapter 4. Motion Control
C ff ( s) = G ( s)−1 . (4.4)
The feedforward problem can be more complicated as not always all plant
dynamics can easily be inverted. For example a system with a low pass
characteristic, a property of all positioning systems that are limited by the
mass-line, would require a feedforward controller with high-pass character-
istics, having a very high gain at high frequencies. From this reasoning it is
obvious that feedforward control of motion systems is applicable for a certain
frequency range only, depending on the system dynamics. A second problem
is that an unstable system cannot be stabilised with open-loop control by
cancelling the unstable pole of the system. Further also unstable transfer
zeros, non-minimum phase zeros with a not unique phase to magnitude
relationship, must not be inverted as they would lead to an unstable pole in
the feedforward controller.
This all is included in the following drawbacks and limitations of feedforward
control:
Output disturbance
Measurement
M(s)
Sensor disturbance
Figure 4.5: Block diagram of a SISO feedback controlled motion system. Each func-
tional part has its own, mostly frequency dependent transfer function.
assumed to include the measurement sensor as part of the plant and in that case the output
disturbance and sensor disturbance are combined. In real systems with a not ideally linear
sensor it is sometimes better to show the dynamic properties and limitations of the sensor as a
separate transfer function.
4.1. A walk around the control loop 147
In control design one has the freedom to choose F ( s) and particularly Cfb (s)
such that the total transfer function fulfils the desired specifications. In the
pure error-feedback case, like with the positioning of the optical pick-up unit
of the CD player, the reference is constant and no pre-filter is needed. In
the next section, it is shown with this example, how the feedback controller
C fb ( s) can be designed to shape the dynamics of the feedback loop.
Just as the feedforward control approach has its advantages and disadvan-
tages, also feedback control offers some benefits and potential pitfalls:
Although feedback control provides some very good features, it has of course
also some pitfalls that have to be dealt with:
Cf C f ω21
G ( s) = = . (4.7)
s2 s s2 + 2ζf ω1 s + ω21
2
+ 2ζ f +1
ω1 ω1
s2 + 2ζf ω1 s + ω21
C ff ( s) = (4.8)
ω21
150 Chapter 4. Motion Control
>
10
Magnitude (abs)
>`
10
>}
10
>J
10
3 4
10 10
Frequency [Hz]
0
Phase [deg]
>|J
In order to realise this controller first two poles have to be added, placed
at the same frequency as the resonance but with a higher damping ratio.
Typically a damping ratio between aperiodic and critical (0.7 < ζ < 1) is
applied to avoid oscillations. For ζ = 1 this results in the following transfer
function:
s2 + 2ζf ω1 s + ω21
C ff ( s) = . (4.9)
s2 + 2 · 1 · ω1 s + ω21
4.2. Feedforward control 151
When this controller is connected in series with the scanning unit, the anti-
resonance of the controller and the resonance of the piezo-scanner cancel
each other out. This pole-zero cancellation is the only manipulation on poles
that can be achieved with feedforward control and results in this example
in a well damped third order transfer function:
>
10
Magnitude (abs)
>`
10
>}
10
>J
10
3 4
10 10
Frequency [Hz]
0
Phase [deg]
>|J
>}J
>^{J
3 4
10 10
Frequency [Hz]
4.2.2 Input-shaping
-2
-4
-2
-4
step, the same steady-state response would be obtained as with the full step
stimulus after all oscillations are damped out. If one of these half-height
steps is delayed by half the period of the scanner’s resonance frequency, the
oscillations that are caused by each individual step are 180◦ out of phase,
and cancel each other out.
This splitting of the reference signal into two (equal) signals and delaying
one of them by half the period of the system’s resonance is a typical example
of input-shaping. This method clearly is very different from pole-zero can-
cellation as it is time domain instead of frequency domain based filtering.
In the frequency domain these sampled adaptations to the input create a
frequency spectrum with a multiple of notch filters at the harmonics of the
frequency that these adaptations are applied. This effect is related to real
sampling that will be explained in Chapter 8 on measuring.
With the previous example of the feedforward controller with pole-zero can-
cellation it was shown that a triangular scanning signal to the scanning
unit “triggers” oscillations after each change of slope of the triangular signal.
These are caused by the discontinuity in the derivative and the correspond-
ing higher harmonics of the triangle waveform as presented by Fourier
154 Chapter 4. Motion Control
1 1
?
Q
0.5 0.5
0 0
-0.5 -0.5
-1 -1
0 0.3 0.6 0 0.3 0.6
1 1
Scan signal (norm.)
?
Q
0.5 0.5
0 0
-0.5 -0.5
-1 -1
0 0.3 0.6 0 0.3 0.6
Time [ms] Time [ms]
analysis in Chapter 2.
Applying input-shaping to this triangular scanning signal results in the
introduction of a plateau instead of the sharp peak, where the width of the
plateau corresponds to half the period of the scanner’s resonance as can be
seen in Figure 4.9. When at a positive slope the scanning signal reaches the
plateau and stops raising, the scanner starts to oscillate at its natural fre-
quency, so with a sinusoidal motion. After half the period of this oscillation,
the scanner is arrived at the same position as where the oscillation started,
but it is moving with almost the same speed in the opposite direction, due to
the low damping. When at that moment the scanning signal is changed into
a negative slope, a smooth transition is realised, without further exciting the
scanner’s resonance, and the actual motion follows the desired triangular
scanning motion.
4.2. Feedforward control 155
In the previous section it was shown how feedforward control can work
for an open-loop stable system with known properties and circumstances.
Striving for a maximum of predictability of the dynamic behaviour of a
mechatronic system is an important part of the design because of the ben-
efits of feedforward control. In spite of these efforts, even with the most
sophisticated models and by suppressing external disturbances to the bare
minimum, almost always some remaining errors need to be corrected by
feedback control. Next to these systems with remaining random errors,
motion systems with inherent instability, like a magnetic bearing, with
poles in the right-half-plane require feedback control for stabilisationn.
Feedback control is more complex and critical to design than feedforward
control. For that reason it receives generally more attention in the scientific
world and in the last decades several approaches have been introduced to
design an “optimal” feedback control system according to several optimi-
sation criteria. In most cases these innovations were driven by the new
possibilities of fast computer systems that enabled “real-time” modelling
and control of higher-order systems. Still the classical PID-control principle,
where PID stands for a proportional-integral-differential feedback loop is
predominantly used in industry because of its relatively straightforward
mode of operation combined with generally a sufficient performance. Knowl-
edge about the different properties of PID-control is crucial for the design of
high performance mechatronic systems. Without mastering this knowledge
all further refinements in control theory are of no practical value. For this
reason this section is fully devoted to PID-control with a full emphasis on
its representation in the frequency domain.
In spite of this dominance of PID-control in mechatronic systems it is ex-
pected that modern control methods will gradually be used more frequently.
For that reason some of the promising new elements of modern control
engineering will be introduced after this section, based on the state-space
representation in the time domain.
Before explaining the more generic methodology, PD-feedback control, shortly
named PD-control of an optical pick-up unit for a compact disk player is
presented to create an initial idea about the principle. This unit was intro-
duced in Chapter 3 on dynamics. In control terms this is a servo-system.
The word “servo” stems from the Latin word “servus”, which means “slave”
and refers to a control system that is designed to follow a moving target.
4.3. PID feedback control 157
Radial tracking
Optical disk
Focus
Hr Hf
Optical
kf Pick-up m
Unit
kr
Figure 4.10: The optical disk pick-up unit of Chapter 3 connected to the track by
means of a “virtual” spring, created by the position control system.
In the introduction of the optical pick-up unit for a compact disk player the
relation between the control task and the maximum position error of the
lens in respect to the track of the CD was explained. The control system
creates a virtual connection between the track and the optical system as
indicated in Figure 4.10 by a mechanical spring. It was shown that the
required radial stiffness kr amounts to a value of 2.5 · 105 N/m for a moving
mass m = 10−2 kg and a corresponding natural frequency f 0 of 800 Hz. This
stiffness appeared to be equal to the total loop gain G t of the feedback loop
around the moving body that consisted of the combination of the gain of the
measurement system, the controller, the power amplifier and the actuator, as
shown schematically at the left side of Figure 4.11. It was further explained
that a spring and a mass need a damper to control the resonance at its
natural frequency. For this reason in this section also a virtual damper will
be added.
At the right side of Figure 4.11 the equivalent feedback loop for the position
control mechanism is shown. It is a simplified version of Figure 4.5 without
a reference signal and pre-filter. The plant consists of the amplifier, the
actuator, the mechanical moving body and the measurement sensor.
The loop will be traced “step by step” from within the plant. The input of the
mechanics is the force that is generated by the actuator of the positioning
system that holds the lens. The output of the (already perturbed) motion
system is the radial tracking position error εr between the lens position and
the centre of the track. In the equivalent feedback loop this radial error is
denoted by the control error e after the measurement.
158 Chapter 4. Motion Control
Io Vc Feedback e
Io GpVc Vc GcVm Cfb (s)
Power amplier Controller
F Vm u Output
x
ms 2 Hr x disturbance
Moving body Vm G mH r
Measurement Plant Measurement
Actuator G(s) M(s)
F Ga I o Hr y
Figure 4.11: Mass-spring system with a virtual spring that is created by closed-loop
control. The plant consists of the power amplifier, actuator, moving
body and measurement system. In combination with the controller
the total system can be described by the simple feedback loop as shown
at the right side of the drawing. The disturbance, consisting of the
movement of the track due to eccentricity and external vibrations,
directly interferes with the output of the plant.
The positioning system with the lens is modelled as a single body with mass
m and its corresponding compliance transfer function C m ( s) is equal to:
x 1 1
C m ( s) = = =− (4.14)
F ms2 0.01ω2
The transfer function shows a −2 slope in the Bode-plot and a phase lag of
180◦ over the entire spectrum. The magnitude of the compliance shows a
gain of one at a unity-gain cross-over frequency of ≈ 1, 6 Hz, as shown in the
lower (blue) mass-line of Figure 4.12.
^
10
Magnitude (abs)
1 *^
>^
10
*^
>
10
mass line, open-loop
>`
?>*
10
0
Phase [deg]
>|J
>}J 0 1 ^ 3
10 10 10 10 10
!X
Figure 4.12: Open-loop Bode-plot of the CD-player lens with a mass of 10−2 kg. P-
control with a gain kp = 2.5·105 proportionally shifts the total frequency
response of the combined system upwards, such that the unity-gain
cross-over frequency becomes equal to 800 Hz.
2.5 · 105 . The related control term of this proportional gain is proportional
control or P-control. In the loop this means that the feedback control transfer
function Cfb (s) becomes Cp = kp , without the Laplace variable as there is no
frequency dependency of the gain value.
Before the loop is closed, the open-loop Bode-plot of the P-control feedback
system is shown in the upper (black) mass-line of Figure 4.12. The open-
loop transfer function is the total transfer function of the system from the
input of the controller following all functional elements until the measured
sensor signal e, without connecting this signal with the feedback input of the
controller. This open-loop transfer function L p (s) in the frequency domain
is equal to:
kp 2.5 · 105
L p ( s) = G ( s)C p = C m ( s)C p = =⇒ L p (ω) = − . (4.15)
ms2 0.01ω2
P-control shifts the mass-line of the lens-system upwards in the Bode-plot
because it is only a simple gain factor. This shift in magnitude also causes
a shift of the unity-gain cross-over frequency (ωc = 2π f c ) to a higher level,
corresponding with the previously determined required value of the natural3
frequency f 0 = 800 Hz, which was directly related to the required maximum
3 The natural frequency ω = 2π f relates primarily to the first resonance frequency of a
0 0
passive dynamic mass-spring system. Later in this chapter an example is shown with a passive
160 Chapter 4. Motion Control
^
10
*^
Magnitude (abs)
1
10
0
10
>
10
?>* *^
>^
?> *
10
0
Phase [deg]
>|J
>}J
^ 3 4
10 10 10
!X
Figure 4.13: Open-loop and closed-loop Bode-plot of the feedback controlled CD-
player lens with P-control at a gain of kp = 2.5 · 105 . The resulting
undamped resonance at the natural frequency of 800 Hz is clearly
visible.
position error.
When the feedback loop with only P-control is closed, the transfer function
becomes in the frequency domain:
L p ( s) 1 1
T p ( s) = = m =⇒ Tp (ω)
2 . (4.16)
1 + L p ( s) s2 + 1 ω
kp − +1
ωc
The resonance peak is of course not acceptable for the closed-loop operated
system and in control engineering terms this effect is represented by the
location of the poles of the system in the Laplace plane. The closed-loop
poles are a conjugate complex pair on the imaginary axis which means that
the system is marginally stable as the phase-lag of the open-loop transfer
function at the cross-over frequency is 180◦ . In order to achieve an acceptable
control performance, some damping and phase-lead around the unity-gain
cross-over frequency needs to be added.
In order to add damping to the positioning system of the lens, a force com-
ponent must be created, that is proportional to the velocity. This force can
be created by a signal that corresponds with the derivative of the position
error over time. With this derivative signal the feedback controller transfer
function Cfb (s) becomes equal to the Proportional Differential control or
PD-control transfer function Cpd ( s):
C fb ( s) = C pd ( s) = k p + k d s (4.17)
^
10
*^
Magnitude (abs)
1
10
0
10
*
>
10 ?>*
?>* *^
>^
?> *
10
0
Phase [deg]
>|J
>}J ^ 3 4
10 10 10
!X
Figure 4.14: Bode-plot of the response of the feedback controlled CD player lens
with PD control without limitation of the differentiating action at high
frequencies. The unity-gain cross-over frequency is increased and the
closed-loop response has a −1 slope at high frequencies, which implies
a reduced attenuation of noise and resonances due to high-frequency
mode-shapes.
In Figure 4.14 the response of this transfer function is shown in red with a
−2 slope at the frequency range where k p is dominating. In the range where
the differentiation dominates, a −1 slope is visible in the amplitude-plot
with a corresponding less negative phase in the frequency range of the unity-
gain cross-over frequency. The resulting phase lag is approximately 120◦ as
compared to the original 180◦ . For this reason a differentiating action in the
control system is also often called a lead-network as it introduces a phase
lead in the open-loop response, reducing the original phase lag of the mass
response at the unity-gain cross-over frequency.
Closing the feedback loop for this system results in the following transfer
function of the closed-loop system:
L pd ( s) kp + kd s
Tpd ( s) = = (4.20)
1 + L pd ( s) ms2 + k p + k d s
This equation can be written in the frequency domain in the form as pre-
sented in the previous chapter for passive dynamic systems with the ratio
4.3. PID feedback control 163
ω/ωc as variable:
kd s
s+1 +1
kp ωd
Tpd ( s) = = (4.21)
ms2 k d s2 s
+ s+1 2
+ +1
kp kp ωc ωd
ω
2 jζ + 1
ω
Tpd (ω) =
2 c (4.22)
ω ω
− + 2 jζ +1
ωc ωc
#
with ωc = 2π f c = kp / m and ωd = kp/kd = 1/τd = ωc/2ζ.
This transfer function corresponds with the characteristic transfer function
of the mechanical damped transmissibility of a movement of the support to
a body, connected by a spring and a damper as expressed in Equation 3.64
in the previous chapter. The change of the −2 slope to a −1 slope at higher
frequencies is caused by the zero in the numerator of the transfer function.
This result is not surprising as the optical lens from the CD player is also
following the movement of the track, but in this case it is connected by a
virtual spring and a virtual damper. Likewise with the transmissibility
example, this configuration has a drawback due to the dominant coupling of
the damper at higher frequencies but in active feedback this effect can be
reduced by limiting the virtual damping at these frequencies.
1
10
Magnitude (abs)
1 1
ft Zt
+1 2S 2SW t
1
fc Zc
2S
1 1
0 fd Zd
10
2S 2SW d
90
Phase [deg]
45
0
1 2 3 4
10 10 10 10
Frequency [Hz]
of the combined controller becomes zero and the gain becomes proportional
again. This is shown in Figure 4.15 where f d = ωd/2π and f t = ωt/2π. It will
be shown later that an additional low-pass filter at f t is in practice even
more preferred in which case the slope becomes −1 and a real roll-off with
phase lag is obtained at higher frequencies.
Another issue of the PD-control action can also be observed in Figure 4.14
as the cross-over frequency is increased significantly due to the magnitude
increase by the differentiating term. With the typical “rule of thumb” value
ωd = 0.33ωc it is necessary to reduce k p with the same factor of 0.33 to achieve
an unchanged cross-over point. With that correction, the total open-loop
transfer function of the system becomes in the frequency domain:
1
s 1+
ωd ωd kp ω d k p 1 + τd s
L pdt ( s) = G ( s)C pdt ( s) = = . (4.23)
ωc ms 2 1 ωc ms2 1 + τt s
1+ s
ωt
with ωt = 1/τt being the radial frequency where the differentiator is “tamed”
at approximately three times ωc according to the “rule of thumb” which is
also approximately ten times ωd . The Bode-plot of the resulting response
with ωd = 0.33ωc = 0.1ωt or f d = 0.33 f 0 = 0.1 f t is shown in Figure 4.16 both
4.3. PID feedback control 165
^
10
*^
Magnitude (abs)
1
10 *^
0
*
10
?>*
>
10 ?> * *^
?>*
?> * *^
>^
10
0
Phase [deg]
>|J
>}J ^ 3 4
10 10 10
!X
Figure 4.16: Open-loop and closed-loop Bode-plot of the feedback controlled CD-
player lens for both P- and tamed PD-control with ωd = 0.33ωc = 0.1ωt .
It shows the reduction of the loop gain at lower frequencies and the
reduction of the attenuation at higher frequencies due to the necessary
phase compensation at the unity-gain cross-over frequency.
in open- and closed-loop, for comparison together with the response with
P-control only.
The unity-gain cross-over frequency of the open-loop transfer function has
become the roll-off frequency of the closed-loop system. The closed-loop
frequency response shows a nicely damped behaviour at the roll-off frequency
with a −2 slope, but at a sacrifice. First the total loop gain at low frequencies
is reduced by a factor 3, corresponding to a proportional reduction of the
virtual stiffness. As a direct consequence also the capability to suppress
errors at those frequencies is reduced. Secondly also at high frequencies
the attenuation of resonances and noise is reduced with a factor 3. It
is important to notice that this is an area of optimisation as ωd and ωt
can be placed closer together to achieve better error reduction at low and
high frequencies with a reduced damping around the unity gain cross-over
frequency ωc .
As another method to solve the reduction of the loop gain at lower frequencies,
an integrating control (I-control) can be introduced eventually forming the
well-known PID-control principle. First some more control theory will be
presented before further introducing this useful principle, because I-control
has no simple comparable element in the mechanical world.
166 Chapter 4. Motion Control
1.8
1.6
''
1.4
1.2
0.8
0.6
0.4
0.2 ?>
?>
0
0 1 2 3 4 5
>Z
Time [s] x 10
Figure 4.17: Step response of the closed-loop controlled CD-player lens under PD-
control and P-control, clearly showing the beneficial damping effect on
the resonance at the unity-gain cross-over frequency ωc .
As was demonstrated in the example of the tracking controller for the CD-
player lens, the dynamics of the controlled system can be directly modified
by closing the feedback loop. Feedback control allows to directly place the
system poles at values that are more useful for the operation of the motion
system than their natural locations. This enables a faster response of
the system with adequate damping. This property is in clear contrast to
feedforward control, where only pole-zero cancellation can be applied by
compensation with the inverse transfer function.
One of the other main advantages of feedback control was the possibility to
reduce the effect of disturbances. To illustrate this a simplified version of
the generic feedback loop is shown in Figure 4.18. The measurement system
is assumed to have a unity-gain transfer function. This means that for this
thinking model the reference and the input of the controller work in the
same position domain as the output of the system. The disturbance in the
measurement system is included in the output disturbance n. The process
disturbance can be anything occurring between the output of the feedback
controller and the output of the plant. For reason of simplicity it is inserted
between the plant and the controller. Disturbances from within the plant
can be calculated to this location by using the mathematical model of the
plant. With this simplified model, the transfer functions of the different
inputs of the system to three relevant output variables in the loop are written
down in a set of equations. The input variables include the reference and
the two sources of disturbances. The first output variable is x for the output
of the plant, being a subset of the system-states that will be introduced in
168 Chapter 4. Motion Control
Section 4.4. The second and third output variables are y for the output of
the total system and u for the output of the controller. In these transfer
functions for simplicity the Laplace variable (s) is omitted and C = Cfb :
G GC GCF
x = d− n+ r (4.24)
1 + GC 1 + GC 1 + GC
G 1 GCF
y = d+ n+ r (4.25)
1 + GC 1 + GC 1 + GC
GC C CF
u = − d− n+ r (4.26)
1 + GC 1 + GC 1 + GC
It should be noted that these transfer functions are written in a purely single
dimensional notation with scalar values. When more degrees of freedom are
controlled the different upper-case letters become matrices and the lower-
case letters become vectors. With multiplication of matrices the successive
order is important and the shown order is valid for the defined feedback
loop from Figure 4.18. When for instance disturbance d is neglected, the
vector notation would be x = Gv and v = Ce which gives x = GCe.
Although these transfer functions consist of a combination of nine transfer
functions, one for each input-output combination, some of them are the same.
When these doubles are omitted, six different transfer functions remain as
shown in Table 4.1. These functions are also called the gang of six, a term
introduced by the Swedish control scientist Karl Johan Åström from Lund
University, because this set of transfer functions gives an interesting insight
in how the feedback controlled system reacts to the different system inputs.
The transfer functions of the first column shows the influence of the reference
signal on the output of the plant and on the output of the feedback controller.
The first transfer function of the second column gives the influence of the
output disturbance on the output of the plant. while both transfer functions
in the second column give the influence of the two disturbances on the output
of the controller.
The first transfer function in the third column shows how the output of the
plant reacts to the process disturbance. The second transfer function of the
third column gives the influence of the output disturbance on the output of
x y GCF x u GC x y G
= = − =− = = =
r r 1 + GC n d 1 + GC d d 1 + GC
u CF u C y 1
= = =
r 1 + GC n 1 + GC n 1 + GC
4.3. PID feedback control 169
x y x u GC x y G
= =− =− = = =
r r n d 1 + GC d d 1 + GC
u u C y 1
= = =
r n 1 + GC n 1 + GC
Magnitude (dB)
Magnitude (abs)
Magnitude (abs)
0 JQ{
T(s) Bandwidth S(s) -3
10
-5
> <
10
slope -10
>^
10
>Z -15
10 Low gain
10
>
0.1 1 *^J
^ 3 4
10 10 10 10
Low frequency Cross-over High frequency Frequency [Hz]
region region region
>|J
r e u y
Phase [deg]
C(s) G(s)
>}J -
Stability
L(s)
>^{J
1 ^ 3 4
10 10 10 10
Frequency [Hz]
The gain margin determines by which factor the open-loop gain addition-
ally can increase before the closed-loop system goes unstable. It is
defined by the distance between the loop-gain L(s) and unity-gain at
the frequency where the phase-lag of L(s) becomes more negative than
−180◦ . The gain margin can have values between zero and infinite.
With first and second order transfer functions where the phase does
never become more negative than −180◦ the gain can be increased
172 Chapter 4. Motion Control
^ Q
QQ^\
J
^
QQ`
Q^ QZQ^
`
J
~
@
0
?
~
"!
>
>^
>Z
>\ > >Z >^ > 0
@
Figure 4.20: The phase, gain and vector margin can be easily derived in a Nyquist
plot. Stability is guaranteed when the −1 point on the real axis is kept
at the left hand side of the response-line upon passing with increasing
frequency. The dashed circles determine the magnitude peak (Q ) after
closing the loop at the frequencies where the response-line crosses the
circles. The blue response-line represents an example PID-controlled
mass-positioning system where the I-control action together with the
mass creates a phase of more than −180◦ at low frequencies.
The phase margin determines how much additional phase lag at the
unity-gain cross-over frequency is acceptable before the closed-loop
system becomes unstable. It is defined by the difference between the ac-
tual phase-lag of L(s) and -180◦ at the unity-gain cross-over frequency.
The following rules for stability can be derived from the gain- and phase-
margin in relation to the Nyquist plot:
• The system will become unstable upon closing the loop if the phase of
the open-loop transfer function passes the negative real axis (−180◦ ) at
an amplitude larger than one and does not return below the negative
real axis before the gain gets smaller than one.
4.3. PID feedback control 173
• A stable system after closing the loop is recognised in the Nyquist plot
when the −1 point on the real axis is kept at the left hand side upon
passing with increasing frequencies.
It is also interesting to observe in this example plot that the phase at low
frequencies is more than −180◦ shifted. This situation occurs when the
I-control action, that will be presented in the following subsection, is applied
with a purely mass based positioning system like the CD player pick-up unit,
so without a spring to the stationary reference. Even though such a system is
stable in the closed-loop situation, it is not readily known what the amount of
damping will be when the feedback loop is closed. To indicate the magnitude
of the closed-loop frequency response at different frequencies, circles are
shown that determine regions with a different maximum magnitude of the
closed-loop response above unity gain. The smaller the circle that the graph
will intersect, the higher this magnitude will be. These circles do not have
their centre at the −1 point on the real axis but on a shifting point to higher
real numbers for a lower magnitude. These circles are derived from the
magnitude of the complementary sensitivity function in the Nyquist plot
for a certain level of Q :
#
|GC | Re2 + Im2
|T ( s)| = =# =Q (4.29)
|1 + GC | (1 + Re)2 + Im2
Which is a circle with its centre on the real axis at a distance D from the
origin and with a diameter d :
Q2 Q
D=− and d= (4.31)
Q2 − 1 Q2 − 1
0
10
Magnitude (abs)
>^
10
k x 10
>
100 N m 10
>`
Driven body
m 10
>}
Phase [deg]
0.25 kg 0
>|J
>}J
>^ 0 ^
10 10 10 10
Frequency [Hz]
Figure 4.21: The undamped mass-spring system and the Bode-plot of its transfer
function that is used as example to illustrate PID-control with some
real data.
t
d e ( t)
u ( t) = k p e ( t) + k i e(τ) dτ + k d (4.32)
dt
0
ki
u ( s) = e ( s) k p + + k d s =⇒
s
u ( s) ki
C pid ( s) = = kp + + kd s
e ( s) s
1
G ( s) = (4.33)
ms2 + k
1
= . (4.34)
0.25 s2 + 100
1
# frequencies the spring-line /k = 0.01 defines the transfer function. At
At low
ω0 = k/m = 20 rad/s the system has a resonance with an infinite high peak
due to the absence of damping. At ω ω0 the transfer function is dominated
by the mass-line (1/ms2 ) with a −2 slope and a phase lag of 180◦ .
For this system the PID feedback controller will be designed with the fol-
lowing control objectives:
4.3.4.1 P-control
4.3.4.2 D-control
The differentiating action (D-control) is used to add phase lead and damping
to the system around the unity-gain cross-over frequency ωc . The differ-
entiating term of the PD-controller (kp + skd ) starts to dominate at ωc/3.
Rewriting this PD-controller as
kd
C pd ( s) = k p (1 + s ), (4.35)
kp
kp 3 · 3.3 · 104
kd = 3· = ≈ 160. (4.37)
ωc 628
6
10
Magnitude (abs)
-1
Tamed-D (=P)
I-control
5
+1 D-control
10
P-control
4
10
|J
Phase [deg]
45
>\
>|J
> 0 1 2 3 4
10 10 10 10 10 10
Frequency [Hz]
Figure 4.22: Bode-plot of the PID controller. From left to right first the −1 slope of
the I-control line is followed until it intersects with the horizontal P-
control line at 10 Hz. At the intersection of the P-control line with the
+1 slope of the D-control line at 33 Hz, the magnitude increases again
until the intersection at 330Hz with a second horizontal line, located
a factor 10 above the P-control line, determined by the ratio of the
differentiating and “taming” frequency where the controller transfer
function becomes proportional again. At 100Hz this controller has
the required gain of 1 · 105 with a phase lead of around 55◦ , giving an
equal phase margin for robustness.
4.3.4.3 I-control
4
-1
10
Magnitude (abs)
^
10
*^
-1
0
10
>^ *^
10
>\
>|J
Phase [deg]
>Z\
\\
>}J
>^^\
>^{J
> 0 1 ^ 3 4
10 10 10 10 10 10
Frequency [Hz]
Figure 4.23: Bode-plot of the open-loop transfer function of the total system with
the PID controller in series with the mass-spring system. Even though
the eigendynamics of the mechanical system show a clear resonance
at around 3 Hz the phase lag at the 100 Hz unity-gain cross-over
frequency remains well below 180◦ .
Therefore:
These controller gains for kp , kd , and ki result in the Bode-plot of the transfer
function of the controller shown in in Figure 4.22. The integrating action
up to 10 Hz and the differentiating action from 33 Hz until 330 Hz with the
corresponding phase behaviour are clearly visible.
Figure 4.23 shows the Bode-plot of the open PID-control loop, consisting
of the PID controller in series with the mass-spring system, as defined in
Equation 4.33. At low frequencies the integrating action is providing a
−1 slope for a high gain. At 100 Hz the gain of the open-loop system is
equal to one as required. Beyond the natural frequency of the uncontrolled
mass-spring system at ≈ 3.3 Hz the system shows after the sharp peak at the
resonance a −2 slope which is reduced to a −1 slope by the differentiating
action around the unity gain cross-over frequency. The phase margin of this
system is about 55◦ , being the distance between the phase lag of the system
(≈ 125◦ ) at the cross-over frequency of 100 Hz and the maximum limit of
180◦ phase lag. The amplitude margin of this system is infinite because in
4.3. PID feedback control 179
0
10
Magnitude (abs)
^
>^ -1
10
+1
>
10
T(s) reference following
>`
S(s)
!
10
0
Phase [deg]
>\
>|J
>Z\
>}J
> 0 1 ^ 3
10 10 10 10 10 10
Frequency [Hz]
Figure 4.24: Bode-plot of the closed-loop PID feedback controlled mass-spring sys-
tem. Both the complementary sensitivity function T(s), representing
the capability to follow the reference as the sensitivity function S(s),
representing the disturbance-rejection capability, are shown. The orig-
inal resonance has disappeared and is turned into a strong rejection
of disturbances at that frequency.
this ideally modelled example the open-loop system reaches the 180◦ phase
lag asymptotically only at infinitely high frequencies.
Figure 4.24 shows the Bode-plot of the closed-loop system. The phase plot is
generally not meaningful anymore in the closed-loop frequency response as
it does not give any further information on stability. For precision systems
that have to move synchronously, like with the wafer scanner of Chapter 9,
the closed-loop phase gives information on the time delay of the movement
and in that case it is still important information.
In this case the phase plot just shows the standard shape of a well damped
mass-spring system with a natural frequency at 100 Hz, corresponding with
the blue magnitude line of the complementary sensitivity function T (s). At
low frequencies the closed-loop gain is equal to one which means that the
system follows the reference signal with only a very small error. Beyond
the control bandwidth the feedback controlled system rolls off to higher
frequencies with a −2 slope. The fact that the feedback controlled system is
well damped can be observed by the absence of a resonance peak.
As mentioned with the previous example of the CD player, the roll off beyond
180 Chapter 4. Motion Control
Magnitude (abs)
1.5
0.5
0
0 100 200 300 400 500 600 700 800 900 1000
Frequency [Hz]
Figure 4.25: The sensitivity function S(s) of the PID-feedback controlled mass-
spring system plotted in a linear scale. This representation clearly
shows the increased sensitivity of this actively controlled system at
frequencies above the bandwidth of 100Hz. When the shaded area
below the magnitude of one is increased for better rejection of low
frequencies, the shaded area above a magnitude of one also increases.
This waterbed effect is inherent to feedback systems and can not be
avoided.
ln |S (ω)| dω (4.40)
0
This integral is zero when the open-loop transfer function L(s) of the dynamic
system has at least two more poles than zeros and when all poles are located
in the left half of the Laplace plane. This condition is always true with
stable feedback controlled positioning systems.
The reason why the waterbed effect is not so clearly visible in a normal Bode-
plot is due to its logarithmic scales. Figure 4.25 shows the magnitude of S (s)
on a double linear scale. With this representation the frequency range above
100 Hz with an increased sensitivity appears to be much larger than the
frequency range below 100 Hz, while in a normal Bode-plot this difference
is not as pronounced. Also the magnitude is relatively reduced with double
logarithmic scales. In the linear representation the benefit of feedback seems
to have vanished but linear scales are also not representative for the real
value of feedback in the low-frequency area. A reduction of a factor 100 or a
factor 1000 of the disturbances at low frequencies can not be distinguished
on a linear scale. In most motion control systems the disturbances occur
more in the low-frequency area where the process disturbance d (s) has
typically a sort of 1/ f spectrum over the frequency range. This means that
in practice the beneficial effect of feedback in the low-frequency region by
far outweighs the disturbance amplification at higher frequencies due to
the waterbed effect. It is also for this reason that logarithmic scales are
preferred as long as the mechatronic designer is aware of the negative effect
on disturbance rejection just above the unity-gain cross-over frequency for
those cases that significant disturbances are also present in that frequency
region.
182 Chapter 4. Motion Control
At first sight the magnitude plot looks like a well damped mass-spring
system but the phase plot is fully different with −180◦ phase over the entire
frequency band, caused by the negative stiffness value. This negative sign
implies that the position is always in the opposite direction of the force.
With increasing frequency the compliance of the mass-line will take over
the total compliance from the negative spring-line at their intersection
frequency4 f 0 :
1 1 kn 1 104
f0 = ω0 = = ≈ 50 [Hz] (4.43)
2π 2π m m 2π 0.1
4 This intersection frequency is not named ”natural-“ or ”eigenfrequency“ because no reso-
>Z
10
Magnitude (abs)
> '
*
10
>\ *
10
-2
>`
10
>{
10
0
Phase [deg]
>|J
>}J
10
0
10
1 f0 2
10
Z
10
Frequency [Hz]
Figure 4.26: The Bode-plot of the transfer function of a magnetic bearing shows a
frequency response without resonance and with a constant phase of
−180◦ due to the negative stiffness value.
The phase remains constant over the entire frequency band because the
mass-line also has a phase of −180◦ . Next to the negative sign a second ob-
served difference with a regular mass-spring system is the lack of resonance.
This is caused by the fact that the denominator does not become zero at
any value of ω and this corresponds
# again with the location of both poles on
k
the real axis at a value of ± /m = ± 105 ≈ 320. With this information the
design of a PID controller can be realised according to the same steps made
in the previous part of this section. Still some things will show to be quite
different because of the negative stiffness.
First P-control is applied in a magnitude that is at least larger than the
magnitude of the spring-line. In mechanical terms this means that after
closing the negative feedback loop the positive stiffness by the P-control
action will be larger than the negative stiffness of the actuator. For reasons
of robustness against variations in the negative stiffness the positive control
stiffness kp should be chosen minimally as large as the maximum value of
the negative stiffness that can occur. A practical value is at least a factor
two larger than the average value of the negative stiffness. This choice
results in a total positive stiffness that is always equal to or larger than the
magnitude of the negative stiffness.
The effect on the poles is illustrated in Figure 4.27 and it shows that with
only P-control, the closed-loop response would become identical to a regular
undamped mass-spring system. The equivalent natural frequency of this
system would correspond with the unity-gain cross-over frequency f c of
the combined negative-stiffness actuator with the P-control gain kp at a
184 Chapter 4. Motion Control
400
kp > kn
Stable damped
300
system Increase k d
200
Increase k p
100
Increase k p Increase k p
kp= kn
Imaginary Axis
0
' k p < kn kp < kn '
-100 <kn <kn
Increase k p
-200
Increase k d
-300 Stable damped
system kp > kn
-400
-300 -200 -100 0 100 200 300
Real Axis
Figure 4.27: The poles of the instable magnetic bearing system with a moving mass
m m = 0.1 kg and a (negative!) stiffness of k n = −104 −N/m are located
at ±320 on the real axis. The poles are brought to the left-half of the
Laplace plane in two steps. First P-control with an increasing value
of is added until at kp = 2.3 · 104 an undamped system is created with
both conjugate complex poles at ±360 on the imaginary axis. In the
second step D-control is added to create damping by shifting the poles
over a circle to the left. Note that the circle is an ellipse because of
the difference in scales.
The second step is the addition of D-control according to the rules of thumb
for D-control. In this case the proportional gain should however not be
reduced because then the total loop gain at the start of the differentiating
action would become lower than one with a phase of more than −180◦ , lead-
ing to instability. D-control will shift the poles to the left over a circle that
has been defined for a damped mass-spring system in Equation (3.48) of
Chapter 3. As a further consequence the D-control action will increase the
unity-gain cross-over frequency and in practice the open-loop unity-gain
cross-over frequency will become approximately twice the value of the in-
tersection frequency f 0 of the mass- and the spring-line of the magnetic
bearing.
This effect leads to an important conclusion regarding magnetic bearings:
^
10
Magnitude (abs)
0
10
>^
10 Open-loop
Closed-loop
PD - control
> PID - control
10
0
Phase [deg]
>|J
>}J
>^{J fc
> 0 1 ^ 3
10 10 10 f0 10 10 10
Frequency [Hz]
Figure 4.28: The open-loop and closed-loop Bode-plot of the magnetic bearing with
negative stiffness with PD-control (dashed lines) and PID-control
(solid lines). In closed-loop the frequency response shows a magnitude
larger than one Q = 2 that stretches over a wide frequency range until
f c . With PD-control starting at 0 Hz and with PID-control starting
around 20 Hz.
This statement is in principle valid for any system that is unstable in open-
loop, like an inverted pendulum or a rocket.
As a last addition I-control is applied in order to to increase the rather
limited open-loop gain below ωi = 0.1ωc . Even though it may seem that
a significantly higher proportional gain than the negative stiffness value
would create a very stiff system, this higher proportional gain only creates
a parallel stiffness after closing the loop. In the open-loop situation,
the multiplication results in a loop gain that is equal to the number of
times that the proportional gain is chosen larger than the negative stiffness,
which is often not sufficient for disturbance suppression at lower frequencies.
Furthermore a very high kp far above the negative stiffness would give a very
high unity-gain cross-over frequency with an increased risk on instability
to high-frequency resonances by eigenmodes of the mechanical system.
186 Chapter 4. Motion Control
JQ8
JQ4
Imaginary Axis
JQ^
QJ
^JX
J
>JQ^
>JQ4
JJX
>JQ6
>JQ8 PD-control
PID-control
>
>^Q5 >^ >Q5 > >JQ5 J
Real Axis
Figure 4.29: The Nyquist plot of the open-loop frequency response of the mag-
netic bearing with PD- and PID-control explaining the magnitude
of the closed-loop frequency responses by the position relative to the
Q -circles.
Figure 4.28 shows the open-loop and closed-loop frequency response of the
combined PD- and PID-controller with the plant with negative stiffness. In
this case k p = 2.3 · 104 , which is 2.3 times the value of the negative stiffness.
In feedback this value adds to the negative stiffness to a total value of
4
k t = 1.3 · 10
# and the poles of the closed-loop system have an imaginary
value of ± kt/m = ± 1.3 · 105 ≈ 360, as shown in Figure 4.27. The D-control
action goes from 30 Hz until taming at 300 Hz giving a unity-gain cross-
over frequency of 100 Hz. I-control works below 10 Hz. The closed-loop
bandwidth is 100 Hz and the resonance peak is less damped (Q = 2) than
with the previous system with a positive stiffness value as the phase lead is
only around 30◦ . With PD-control this effect is even more prominent as it
stretches until 0 Hz. This effect is fully caused by the −180◦ of the negative
stiffness and can be better analysed with the Nyquist plot of Figure 4.29.
With PD-control the magnitude starts at kp × kn = 2.3 and runs around the
Q = 2 circle. Over the frequency band ranging from 0 Hz until just more
than 100 Hz the graph remains between the Q = 1.6 and the Q = 2 circle
which means that in closed-loop the magnitude will be around 1.8 in that
frequency range, which is observed in the Bode-plot. With PID-control the
graph runs just inside the Q = 2 circle from 20 Hz to 100 Hz, which results
in the observed magnitude of the closed-loop frequency response in that
frequency range in the Bode-plot, while the closed-loop response below 20 Hz
4.3. PID feedback control 187
measurement
k1 x1 x2
100 N m 2.5 10 N m
5
Figure 4.30: When the sensor is connected to the driven body with a compliant
mounting, the mass of the sensor will decouple at high frequencies
due to the second eigenmode, where the two bodies vibrate in opposite
directions by the elastic coupling of k2 .
approaches a value of one because of the high loop gain due to the integrator.
A reduction of the proportional gain below the example value k p = 2.5 · 104
will cause the graph on the Nyquist plot to approach the −1 point on the real
axis, resulting in a higher Q over a wider range of frequencies until at-half
the presently chosen value ( k p = 1.15 · 104 ), the system will start to resonate
at 20 Hz. Optimal tuning of such a magnetic bearing is clearly not trivial as
the optimisation depends on many parameters, like the expected frequency
spectrum of the disturbances and the frequency range of operation.
6
10
Magnitude (abs)
4
10
^
10
0
10
>^
10
0
Phase [deg]
>|J
>}J
>^{J
>Z`J
> 0 1 ^ 3 4
10 10 10 10 10 10
Frequency [Hz]
5
3
Imaginary Axis
1
Q=1.6 QZQ^
0
>
>^
> >^ 0 ^ 4
Real Axis
Figure 4.31: Bode and Nyquist plot of the open-loop transfer function L(s) of the
PID-controlled mass-spring system when the sensor is mounted on a
body that decouples dynamically at 500Hz with Q ≈ 20. The graph in
the Nyquist plot passes the −1 point on the real axis at the right hand
side and instability will occur after closing the loop. It illustrates the
necessity to keep the amplitude of the resonance below one in order to
maintain stability.
a body with a mass of 25 · 10−3 kg with a mounting stiffness of 2.5 · 105 N/m,
as shown in Figure 4.30. In Section 3.3.1 of Chapter 3 on the dynamics of
multiple coupled bodies the configuration appeared to result in a second
eigenmode with an eigenfrequency of ≈ 500 Hz. Because in practice such
situations show a very limited damping, a Q of ≈ 20 is used in the modelling.
4.3. PID feedback control 189
Figure 4.31 shows both the Bode-plot and the Nyquist plot of this situation.
The Nyquist plot is especially useful in this situation as it shows in clear
detail, what happens around the −1 point on the real axis with respect to
closed-loop stability in this more complex dynamic feedback system. The
phase at 500 Hz due to the second eigenmode changes from almost −150◦ ,
the mass-line combined with the D-control action, to below −180◦ . This
phase shift occurs in the frequency range where the magnitude becomes
larger than one due to the resonance peak of the second eigenmode and as
a consequence instability will occur.
The Nyquist plot clearly indicates that the −1 point on the real axis is kept at
the right hand side when passing. Only when the magnitude of the second
eigenmode is kept below one by a reduction of the loop gain this instability
condition would not occur, but then the resulting bandwidth of the controlled
system would be reduced to below ≈ 20 Hz.
1
10
Magnitude (abs)
0
10
0
Phase [deg]
>}J
>Z`J
>\\J 1 2 Z
10 10 10
!X
\
magnitude
~
@
Z
?
$
2
1
QQ`
QZQ^
0
>
>^
> >^ 0 2
@
unity-gain cross-over frequency, but this is a detail that can be adapted when
a further optimisation of the system is required. For the understanding of
the principle it is useful to treat them first as two separate filters.
Figure 4.32 shows the effect of the additional low-pass filter on the open-loop
transfer function of the example PID-controlled mass-spring system. Next
to the situation without the additional low-pass filter, also the open-loop
response of the system with a low-pass filter at 350 or 650 Hz is shown. The
4.3. PID feedback control 191
1
10 ?`\JX
?Z\JX
Magnitude (abs) 10
0
>
10
-2
-3
'
T(s)
1
10
Magnitude (abs)
0
10
>
'
S(s)
10 2 3
10 10
Frequency [Hz]
Figure 4.33: The closed-loop Bode-plot of the sensitivity functions of the PID-
controlled mass positioning system with decoupling mass show the ef-
fect of the low pass filter frequency on the magnitude of these functions
at the cross-over frequency and the eigenfrequency of the decoupling
mass. At 350 Hz these magnitudes are equally balanced. This optimal
setting is also confirmed by the sensitivity plot at the lower side.
Bode-plot clearly demonstrates that such a filter has only a limited negative
effect on the phase margin at the cross-over frequency of 100 Hz, while the
phase at the resonance has definitively come below −180◦ . In this more
complex PID-control situation the Nyquist plot is an indispensable tool as
it more clearly shows the effects regarding stability. With the additional
first-order low-pass filter, stability is achieved as the −1 point on the real
axis of the Nyquist plot is passed at the left hand side, while without a low-
pass filter the system would be unstable. An optimal situation is created
when the filter frequency is at 350 Hz resulting in a symmetrical left hand
passage of the Q = 1.6 circle, indicating an equal magnitude of the closed-
loop response for the two frequencies that are closest to the circle. It results
in a maximum amplitude rise of 60 % both at the cross-over frequency and
at the second eigenmode. This optimum is confirmed in the complementary
sensitivity T ( s) and sensitivity S (s) Bode-plot of Figure 4.33.
192 Chapter 4. Motion Control
6
I-control
10
Magnitude (abs)
-1
+1
Phase
5
10
$
D-control
-1
Noise
P-control
!
4
10
|J
Phase [deg]
45
>\
>|J
> 0 1 2 3 4
10 10 10 10 10 10
Frequency [Hz]
Figure 4.34: The PID-control transfer function with additional phase shifting low
pass filter can be seen as an optimal motion controller of a mechanical
system.
A clear benefit of this method is that the influence due to the resonance
of a second eigenmode with an amplitude larger than one is effectively
suppressed by the feedback. The original Q value from the uncontrolled
system is reduced from more than twenty to less than a factor two by tamed
PID-control with an additional first order low-pass filter. Another advantage
of the additional low-pass filter is the −3 slope of the sensitivity function T (s)
at higher frequencies which means that the resonances of higher eigenmodes
and the high-frequency sensor noise will be even better suppressed.
used to find a control algorithm for a given system, such that a certain
optimisation criterion is achieved. Optimality of this PID-control scheme
with second-order low-pass filter is understood as the best combination
of robustness and performance so corresponding with the “practicality”
criterion of dynamic positioning systems.
This section has clearly demonstrated the creation of a virtual spring and
damper with an additional integrating function by PID-feedback control.
It is unfortunate that a mechanical equivalent for the integral gain ki is
not readily available. The integrating action provides a high controller gain
C at low frequencies. For that reason the closed-loop frequency response
becomes equal to one in that low-frequency region, independent of the
transfer function of the plant. For that reason the integral action is acting
like a “super-spring” that increases the loop-gain at low frequencies and
gradually reduces its distortion to zero. As a result the steady-state error of
the feedback controlled system in response to a constant reference signal
also becomes equal to zero.
These virtual elements are virtually placed in parallel to the physical spring
and damper of the plant and the combination acts as if the virtual and
real stiffness values and damping coefficients are simply added. It is very
important however to realise that this effect only occurs upon closing the
loop. In the open-loop situation, when the output y is no longer connected
to the summing point with the reference signal r , the controller is placed
in series before the plant and in that case the transfer functions of the
controller and the plant are multiplied. This difference often leads to
confusion as can be illustrated by examining only the proportional branch
of the system as example, consisting of the stiffness of the spring and the
proportional gain.
From the previous chapter it is known that the transfer function of a physical
spring with stiffness k acts open-loop as a compliance with a value of k−1 .
In the open-loop situation the total compliance of this transfer function
can be increased with P-control with a value needed to bring the total
compliance to unity gain (⇒ kp = k). After closing the loop this value of
k p = k would result in an additional spring with a value of k giving a total
stiffness of 2 k in combination with the physical spring. At first sight there
appears no relation whatsoever between the unity gain open-loop value
and the double stiffness in closed-loop but when calculating the closed-loop
frequency response, the open-loop transfer function of G (s)C ( s) = 1 results in
194 Chapter 4. Motion Control
ẋ = Ax + Bu (4.44)
y = Cx + Du
196 Chapter 4. Motion Control
u x x y
B ³ dt C
These equations show how the state-variable x of a dynamic system and the
system output y evolve over time as a function of the matrices A, B, C, D, the
input u and the initial conditions x0 of the state vector x. The matrices are
the eigendynamics matrix A, the input matrix B, the output matrix C and a
feed-through gain D.
Figure 4.35 shows the structure of this standard state-space model in a
graphical way. Like described at the figure, the different elements of these
matrices all represent a certain physical property or function.
containers in the system, like a spring (E = 0.5 kx2 ) and a body (E = 0.5 m ẋ2 ).
A damper does not contain energy. When the state-space model is used for
controlling the plant, the order of the differential equations that describe
the plant does not only include the mechanics but also the dynamics of the
applied sensors, amplifiers and actuators. A first-order filter at the sensor
introduces an additional state and the dynamics of the actuator also create
corresponding states.
First the different matrices will be examined from the viewpoint of motion
control:
• The input matrix B contains the simple gains or scaling factors of the
actuator or power amplifier, but not their dynamics as these have to
be included in matrix A, as stated above. The number of columns of
the input matrix is equal to the number of inputs ( i ) and the number
of rows equals the number of states n.
These matrices and the states in the state-space model are illustrated with
two examples, a damped mass-spring system, with and without a classical
PID-controller.
198 Chapter 4. Motion Control
u F 1 x2
x x2 x x1 x y x
B ³ dt ³ dt 1C
m
c
m
k
m
A
Figure 4.36: Graphical representation of the scalar, single input and output (SISO),
state-space representation of a mass-spring system with damping.
The loops of the damper (velocity) and the spring (position) are shown
separately.
With this SISO example the input u would be the force with a scalar value
u = F . The output y is equal to the position x, so it is also a scalar. The
input matrix B in this case is the conversion from force to acceleration with
a mass factor and the eigendynamics matrix A contains the damper and
spring relations which is shown in the following expansion of the state-space
equations:
+ ⎡ * ⎤ * + ⎡ ⎤
ẋ 0 1 x 0
ẋ = Ax + B u =⇒ = ⎣ −k − c ⎦ · +⎣ 1 ⎦F
ẍ ẋ
m
* + m m
x
y = Cx + D u = [ 1 0 ] · + [ 0 ]u = x (4.46)
ẋ
When this is written out for the acceleration the following equation is ob-
tained:
−k c F
ẍ = x − ẋ + =⇒ F = m ẍ + c ẋ + kx (4.47)
m m m
which is the standard force equation that was used in Chapter 3 to derive
the frequency response of the system. Also with more elaborate systems
4.4. State-space control representation 199
This means that by changing the terms in the A matrix, the eigenvalues
can be adapted such that the poles become located at a different position in
the Laplace plane. This is exactly what happens with feedback control and
the state-space model offers a very straightforward method to achieve an
optimal pole location, as will be shown in the following examples.
200 Chapter 4. Motion Control
y x r
1 x2
x x2 x x1 x
m
B ³ dt ³ dt 1C
³ dt
e r-x
c
u F m
k
m
A
d
D - control
kd
dt
kp
P - control
I - control
ki
the internal stiffness loop with element −k/m. In combination with the
input element 1/m by B, the value of kp is added to k. The P-control gain
therefore corresponds to a modification of the stiffness of the system, just as
presented in the example of the CD player. The similar relation exists for
the differential gain kd that corresponds to the damping coefficient c in the
mechanical system. The red dash-dotted loop over the differentiator block
( d/d t) converts the measured position into a velocity that serves as the input
to kd . In the D-control feedback loop this calculated velocity is equal to the
real velocity ẋ. In summary, kp and kd are used to change the stiffness and
damping of the system.
A special word of attention has to be given to the reference input and the I-
control loop with the additional integrator. In this PID-control example, the
reference is indicated at the same position as where it was located with the
analytical approach of the previous section on PID-control. With state-space
control this location is more free to choose. In theory the reference signal
can be inserted at any place in the control part because it is an externally
definable signal. Often the reference is inserted at the output of the feedback
controller. This is the same place where feedforward control is inserted, as
was indicated at the beginning of this chapter in Figure 4.1.
The integration of the error introduces a third feedback loop. There is
no equivalent element to the integrator in the uncontrolled second-order
mass-spring system. In closed-loop, the action of the integrator results
in a reduction of the steady-state error to zero and due to this effect it is
sometimes called a super-spring.
Regarding the placement of the poles it is clear that the feedback constants
directly determine the pole locations as kp + k determine the undamped nat-
ural frequency with two poles on the imaginary axis while kd + c shifts these
poles over a circle back to the real axis. The fact that kp and kd are added to
the terms in the A matrix indicates that another A matrix is obtained with
different eigenvalues and this effect is further used with full state feedback.
202 Chapter 4. Motion Control
ẋ = Ax + Bu
y = Cx + Du
In most systems the direct feed-through D is not present or very small and as
simplification D is further assumed to be equal to zero for this explanation
of state feedback. As explained, in state feedback it is assumed to have full
information of the state vector x. This means that all states of the system
are directly measured or calculated from a limited set of measurements.
The feedback-loop is closed with a feedback matrix K, with n columns and i
rows. The resulting output of the feedback controller equals u = −Kx. When
the reference is not taken into account, as that only adds an additional
setpoint term to u without changing the closed-loop system dynamics, the
state-space notation of the resulting dynamic system becomes:
As a result a new system is created, looking the same as the open-loop system,
but with a new eigendynamics matrix A = A − BK. So to achieve the desired
performance of the closed-loop controlled system under state feedback, the
desired pole locations have to be defined for stable operation and the required
4.4. State-space control representation 203
Feedforward
r e u x2 x2 x1 y
k3 ³ dt B ³ dt ³ dt C
a2
a1
k2
k1
I - control loop K
Figure 4.38: State feedback on a SISO positioning system. The position x1 and
velocity x2 are both measured. The derivative of the velocity, ẋ2 equals
the acceleration and is in this case not measured nor required for full
state feedback as this second order system is fully described by two
states. By applying proportional feedback for the position and the
velocity, the poles can be placed at the appropriate locations in the
Laplace plane. The I-control loop is additional to reduce steady-state
errors.
In state-space control the integral action is a special case as one might say
that this creates another state by the controller itself without any direct
link to a similar state in the system. This state is not needed for controlling
the system dynamics as two states are in principle sufficient for a second
order system. As was previously explained, the I-control action is added
to a feedback control system in order to reduce the steady-state errors by
means of an increase of the loop-gain at low frequencies. For this reason
it is customary in state-space control to introduce the I-control action as a
separate negative feedback loop from the system output y to the input after
comparing with the reference r . The integration will only reduce errors in
the low-frequency area so high-frequency components of the reference signal
will not be controlled and even attenuated by the I-control action (1/ω). This
effect is solved by adding the high-frequency part of the reference to the
feedforward input of the system. In that case the state-feedback loops will
control this part of the frequency spectrum.
Side note: A critical designer of this control system also could say here:
“Why should I use two sensors. One position sensor is enough as I can
apply D-control to derive the velocity”. While this statement is fully correct,
one must not forget that it is always necessary to deal with sensor noise.
If the position sensor is noisy, the derivative action would amplify this
noise especially at high frequencies and this noise would be inserted in
the feedback loop. For higher order systems with for instance multiple
springs and bodies, where only the position of the last body is measured,
even multiple derivatives of the sensor signal would be needed to apply full
state feedback and this would amplify the noise even more. This is another
example of the often economic trade-off that a mechatronic designer needs
to make.
methodology.
With System Identification the model of the plant, including the sensor, is
obtained by fitting the parameters of the differential equation to a measured
set of input and output data. When the structure of the expected model is
known, only the model parameters are identified, which is called grey-box
system identification. If also the structure of the system is unknown, this is
called black-box system identification and the identification algorithm then
also has to estimate the model-order and structure. The system identifica-
tion directly gives the state-space form or the transfer function of the system,
but is always limited to the minimal realisation of the system, which means
that from the system dynamics of the plant only those eigenmodes are con-
sidered that can be observed as well as controlled. This is in general not a
problem when no other forces act on the system that excite the un-observed
and un-controlled modes and it can even be desired as it avoids excitations of
these modes by noise in the control loop. Nevertheless it is important to be
aware of this fact because the un-observable or un-controllable eigenmodes
might very well be a main cause for problems in a complex mechatronic
system that consists of many actuated subsystems.
In the previous section it was shown how a suitable controller can be realised,
if all states are measured. It is however not always possible to measure all
positions, velocities and related derivatives in a mechatronic system, while
precision sensors are also quite expensive. As mentioned before, using the
derivative of the position signal in order to obtain the velocity signal, like in
a D-control feedback system is possible, but may be too sensitive to noise.
One way to get the full information of the state vector is to build a real-time
estimator that is based on the model of the plant including the eigendy-
namics. Such an estimator is also called an observer while it observes the
behaviour of a system by comparing it with the modelled behaviour. Such
an observer also allows a trade-off between the bandwidth (speed) of the es-
timation and the noise performance. An observer with an optimal trade-off
between these two important properties is called a Kalman-filter, named
after the Hungarian mathematician and electronic engineer Rudolph Emil
Kálmán.
Figure 4.39 shows the configuration of a state observer in combination with
state-feedback control of the observed system. The blocks in the dashed
box represent the real mechatronic system. The blocks in the dotted box
represent the mathematical model that is implemented on a computer to
206 Chapter 4. Motion Control
u x x y
B ³ dt C
Real
A system
e -
L
r u x obs xobs yo
B ³ dt C
-
A Real-me
model
K
Model-based controller
vector will quickly follow changes in the system but will also respond more
to noise, originating for instance from the sensor. A slow response of the
state observer introduces some “smoothing” or masking of fast disturbing
signals, for instance when the measurement is corrupted by sensor noise.
Like with the name of an optimal observer, the observer gain that is provid-
ing the optimal trade-off between speed and noise performance for a known
co-variance of the sensor noise is called the Kalman-gain.
~*
1.8e6 ³ dt
@!
*
3.3e4 1/0.25 ³ dt ³ dt
Limit
! d
160 @! 0.001
dt "
?
by choosing the right components, limitations that are present in the plant
are a frequent source of errors and can even cause damage to a system when
being overlooked. One reason for such errors in practice is that standard
modelling software packages, such as Matlab and Simulink, are allowing
extremely large control action, error inputs, and system states, that may
not have any relation with capabilities of the real system. The limitations
(saturation) of actuators and sensors have to be taken into account and must
be integrated in the Simulink-models by adding appropriate function blocks,
like shown in Figure 4.40, in order to keep the model close to the actual
physical system. This means, that a practical insight in the limitations of
sensors, amplifiers and actuators is necessary to prevent costly mistakes in
modelling mechatronic systems.
As an example to visualise the effect of such limitations, Figure 4.41 shows
the behaviour of the PID-controlled system from the previous section (see
Equation 4.33) in case of a saturation of the actuation force, putting a
limit to the maximum possible control effort. The reference signal is a
trapezoidal signal with a fundamental frequency of 5 Hz and a peak-to-peak
amplitude of 0.2 m (dotted black line in the first panel of the figure). For
the unconstrained system the actuation force ranges up to about 1600 N
(blue solid line), while the control effort for the constrained system is limited
in the simulation to 1 kN (yellow dashed line) and 300 N (red solid line),
respectively.
While the control action exceeds the actuation force limit, the control-loop is
broken and the system does not react to changes in the control action as the
maximum actuation force is already applied (and with this the maximum
energy that can be supplied to the system per unit of time). This means
that due to this saturation the linear control system becomes non-linear
4.5. Limitations of linear feedback control 211
0.2
?
0
-0.2
0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1
@!"
1000
0
-1000
-0.1
0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1
1000
~*
-1000
2000
*
-2000
0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 1
Figure 4.41: Behaviour of a mass positioning system under PID-control with vary-
ing limitation of the maximum actuation force: unconstrained (blue
line), limited to 1000 N (dashed yellow line), and limited to 300 N
(solid red line). The first plot shows the controlled position of the
mass in response to a trapezoidal reference signal (black dotted line).
The second plot shows the output of the PID-controller, equalling the
actuation force. The third plot denotes the error signal, which with
a scaling factor also equals the P-action. The fourth and fifth panel
show the I-action and D-action. The mismatch between the actuation
expected by the linear (unconstrained) PID-controller, given by the
blue lines, and the actual actuation causes a significant degradation
in control performance for the case where the actuation force is limited
to 300 N, and the system starts to fiercely oscillate.
and the system does no longer react as the PID-controller would expect.
For the case where the actuation force is limited to 1 kN, the saturation
level is reached only for a very short moment (dashed yellow line) due to the
D-gain when the reference signal changes. This is one of the reasons why
for practical systems one should use smooth reference signals rather than
steps and impulse-like signals. As this saturation is limiting the control
action only very little, the increase of the control error is also very small
212 Chapter 4. Motion Control
and the system remains in saturation only slightly longer for the time that
the unsaturated force would exceed this value of 1 kN. When the system
re-enters the linear regime, the PID-controller easily can compensate for
this slightly larger error during saturation.
For the case where the actuation force is limited to 300 N, however, the
energy per time unit that can be put into the system for control is much lower
than the PID-controller would demand for the given reference signal. The
mismatch between the linear system and the actual system is too large and
leads to a larger control error and a significant degradation of the system
performance. The main problem here is, that the I-gain of the controller
continues to accumulate the error during the entire time that the system
is in saturation, which takes significantly longer and consequently results
in much larger accumulation of the I-action as would be the case for the
unconstrained system. This effect is called integrator windup. This longer
integrated and larger error cannot easily be compensated when the system
re-enters the linear regime, as the I-action only starts to decrease once
the error changes its sign like when the system is moving in the opposite
direction. This results in strong oscillations of the system and even may
lead to limit-cycling as the too large accumulation of integral action can lead
to a bang-bang action where the control signal is just switching between the
positive and the negative saturation of the actuation force.
In real systems measures have to be taken in the controller to prevent these
phenomena. One of the possibilities is to reduce or even cancel the I-control
action until the system is in the linear range again within the saturation
limit of the actuation force. This is called anti-windup control and should be
present in all PID-control systems, and is often sufficient when long range
sensors are used.
This example demonstrates that linear control, as powerful as it is, also has
its limitations that have to be known and considered in the design of the
control system.
Another limitation, not shown in the figure, would be the limited measure-
ment range of a sensor. When the sensor is out-of-range, the control-loop is
not broken towards the actuator but the controller is "flying blind" once the
sensor signal clips. This means that the actual error may be much larger
than the controller observes, but does not get compensated. With very short
range sensors and long range actuators, like in a close tracking system, one
can reduce the P-gain and increase the D-gain until stability is achieved.
The auto focus of a photo camera is a good example of such a problem, where
often the capture range of the focal sensor is much smaller than the focal
4.5. Limitations of linear feedback control 213
setting range of the lens and outside the range no information is available
about the direction. As a result the camera keeps scanning past the sharp
spot without stopping in time.
As a last example for the limitation of linear control backlash needs to
get some attention as this non-linearity is often present in less advanced
mechatronic system where the cost of direct drive actuators and air bearings
is prohibitive. With a positioning system that consists of a rotary DC
motor with a gearwheel transmission, the backlash and friction of the
transmission disturbs the transfer between the motor and the positioned
mass. This non-linearity limits the possibility to achieve a fast and well
damped feedback controlled system with precise positioning of the mass
because during reversal the transmission is broken by the backlash. This
problem can be partly solved by mounting a tacho-generator directly at the
DC-motor shaft. This sensor provides reliable state information about the
velocity of the motor and can be used for stabilisation of the (high-bandwidth)
inner position/rotation feedback loop. Non-linear errors of the transmission
will however not be measured by the tacho-generator and as a consequence
they will not be reduced in the positioning of the load-mass.
This section of the book should for sure not be misunderstood as linear
control is extremely powerful and important in motion control systems. In
most applications the performance of linear control is however limited by
the boundaries of the physical plant and these have to be considered in the
controller design. As stated above we would like to refer to the references
on linear and non-linear control given in the appendix for further reading,
in order to get an insight how modern non-linear control methods may allow
to achieve a control performance beyond the limits of linear control.
214 Chapter 4. Motion Control
Electromechanic
actuators
The eigen dynamics of a mechatronic system are not only determined by its
mechanical dynamic performance. Like mentioned before, the incorporated
subsystems such as amplifiers, actuators and sensors have their own impact
on the full system dynamic performance. It requires a thorough understand-
ing of the interaction between the mechanical and electrical engineering
domains regarding energy conversion, in order to design a well balanced
system. This interaction is covered in this chapter on electromechanics.
Although the subject of electromechanics describes the transition from elec-
trical to mechanical engineering domains and vice versa, this chapter will
only cover the part that is related to actuation. The reversed phenomena
will be presented in Chapter 8 on measurement.
After a general introduction, the physics that are related to electromag-
netic forces is covered including the applied Maxwell equations and the
calculations on magnetic fields. Starting with Section 5.2 the behaviour of
linear actuators like moving-coil Lorentz actuators and the stronger but
non-linear reluctance actuators will be presented together with the optimal
combination of both in the hybrid actuator. These basic configurations are
sufficiently representative for the design of most electromagnetic actuators.
With the derived simplified equations, reasonable calculations can be made
on concept designs. Further attention is given to the electrical properties,
that are important for the integration in mechatronic systems, especially
related to the amplifier that has to deliver the current. Electromagnetic
actuators still determine the main-stream actuation principle in precision
215
216 Chapter 5. Electromechanic actuators
Figure 5.1: Electric motor that was designed by Thomas Alva Edison to drive a
cooling fan. The motor contained no permanent magnets while the
magnetic flux was only created by current. A commutator with sliding
contacts allowed it to be used with direct current electric power that
was his preferred source of electrical energy.
(Courtesy of John Jenkins, www.sparkmuseum.com)
5.1. Electromagnetics 217
5.1 Electromagnetics
has resulted in two different directions. The first originated in the machine
tool industry and mostly used the newly discovered piezoelectric properties
of certain materials to use them as actuators. The second direction was
based on positioning of systems without strong external forces and relied
mainly on electromagnetic energy conversion principles.
Electro magnetics is the physics area that describes the phenomena, associ-
ated with electric and magnetic fields and their interaction. Although this
book focuses on comprehension, rather than mathematics, it is not possible
to escape from the fact, that the physics in electro magnetics are governed
by stipulated laws and models. These laws in itself can not be understood
by definition, but have to be accepted for reason of their capability to predict
the behaviour of electromagnetic systems. Though this complicates the
matter for people, who want to know why things work as they do, these
laws are taken as a starting point and then the subject is presented as if its
understood what causes these laws to be true.
These basic laws are called the “Maxwell equations” and will be presented
in the following to be used in the further practical implementation.
The Scottish physicist and mathematician James Clerk Maxwell (1831-1879)
formulated his equations partially based on the work of previous scientists,
but the laws got his name because of the way he combined them. The
Maxwell equations are a set of four equations, with the status of physical
laws, stating the relationships between the electric and magnetic fields and
their sources being charge density and current density.
Table 5.1 defines the variables that play a role in this chapter. For a complete
overview also the electrical variables, that were defined in Chapter 2 are
mentioned. Like the previously defined electric field E, the electrical current
density J is a vector field representing the movement of free, or unbound
charges in space. The electric permittivity 0 has also been previously
defined and the electric charge density ρ q represents the amount of unbound
charges in a volume.
The magnetic field B is a vector field. Like with the electric field, it is
graphically represented by the density and direction of magnetic flux lines.
In the design of electromagnetic actuators, the magnitude of the magnetic
field in the direction of interest is called the flux density B. It directly
relates to the quantitative nature of the magnetic flux Φ in that direction,
calculated by integrating the magnetic field over a surface perpendicular to
5.1. Electromagnetics 219
that direction.
The magnetic permeability μ represents the possibility of a magnetic field
to pass through an object. The word permeability stems from the verb “to
permeate”. The permeability in vacuum is called μ0 and is a reference for
the permeability of other materials.
The magnetising field H and its related magnitude, the magnetic field
strength H , are directly connected to the magnetic field by the magnetic
permeability: B = μH and B = μ H .
The Maxwell equations can be written either in the integral or differential
form. The integral form is more easy for explaining the meaning and will
be used to derive the equations in this chapter. The differential form is
better suited for mathematical modelling. In order to keep this differential
form more easy to write down, the mathematical terms divergence (div) and
rotation (rot) are used. These are respectively the dot (div) and cross (rot)
product between the differential vector nabla (∇) and the vector of interest.
In the three dimensional space ∇ equals:
∂ ∂ ∂
∇= , , (5.1)
∂x ∂ y ∂z
220 Chapter 5. Electromechanic actuators
n̂
dS dl n̂ L
S ddSS S
Figure 5.2: Definitions of the vectors n̂ and dl, surface S and closed loop L that
are used for the Maxwell equations. The left drawing shows surface S ,
that encloses a volume and is used with both Gauss’s laws. The right
drawing shows the closed loop L with its enclosed surface S and is used
with Faraday’s and Ampère’s law.
With the previously defined variables and using the definitions of Figure 5.2,
the first Maxwell equation is written as follows:
This first law was originally postulated by the German mathematician and
scientist Johann Carl Friedrich Gauss. It states that the surface-integral
of the electrical field over any closed three dimensional surface S , like for
instance a sphere, equals the charge qenc enclosed within the closed surface,
divided by the electric permittivity (0 ). It is directly related to Equation (2.2)
of Chapter 2. With electromagnetism this law is not used, because in
electromagnetic actuators all electrical charges are bound, which means
that they are not free and always in equilibrium with the positive charge of
protons of the wires that are used to carry the current. Nevertheless it is
mentioned for completeness and also to demonstrate the difference between
electric fields and magnetic fields as given in the second Maxwell equation.
div B = ∇ · B = 0 (5.5)
5.1. Electromagnetics 221
This second law of Gauss states that the surface-integral of the magnetic
field over a closed surface S is always zero. With any closed surface, the
magnetic flux entering the volume within the closed surface is equal to
the magnetic flux that exits that volume. When represented by flux lines
this can only be true when all flux lines form a closed loop. Gauss’s law
on magnetic fields is based on the observation that magnets always act as
dipoles, a north and south pole where the flux flows internally from south-
to north pole and externally back from north- to south pole. This is the main
difference with Gauss’s law on electric fields where electrical charges can
exist without a monopole counterpart. In case of a single charged particle,
the equivalent electrical field lines originate in the charge and just go to
infinity. Integrating over a surface surrounding that charge would result in a
finite value according to the first law of Maxwell. Insight and understanding
of Gauss’s law on magnetic fields is necessary for calculating the flux in
magnetic circuits.
The third Maxwell equation gives the relation between a change in the
magnetic field and the resulting induced electrical potential difference in a
wire that surrounds that field.
• Faraday’s law:
1 2
d
E · dl = − (B · n̂) dS (5.6)
dt
L S
∂
rot E = ∇ × E = − B (5.7)
∂t
This law was originally postulated by the English chemist and physicist
Michael Faraday (1979 – 1867). It states that the line-integral of the elec-
trical field over a closed loop L equals the change of the flux through the
open surface S bounded by the loop L. In Chapter 2 the electromotive force
Fe was introduced as this integral over the electric field inside a voltage
source. For this reason the third Maxwell equation is the foundation under
the theory that describes the creation of electricity by magnetism. A voltage
source is created inside the windings of a coil by a changing magnetic field
surrounded by that coil. It explains also the phenomenon of self-inductance,
as will be defined in Section 5.3. An important element in this law is the
minus sign that indicates, that the direction of the electric field is opposite
to the vector dl. It explains several properties of electromagnetic actuators
that will be presented in this chapter, including the damping effect that is
observed when the actuator is supplied from a low impedance source.
The conversion from mechanical into electric energy implicitly also leads
222 Chapter 5. Electromechanic actuators
• Ampère’s law:
1 2
d
B · dl = μ0 I + 0 μ0 (E · n̂) dS (5.8)
dt
L S
∂
rot B = ∇ × B = μ0 J + 0 μ0 E (5.9)
∂t
+
B
I
_
Figure 5.3: The magnetic field, generated by a current through a wire is directed
clockwise, when looking in the direction of the current. The magnitude
of the magnetic field decreases proportional with the distance to the
wire.
When the loop is a circle with radius x around a wire with a current I as
shown in Figure 5.3, the magnetic field B has a constant flux density B over
the circle and directed along the circle. This means that the left integral
term becomes equal to 2π xB. As a result the flux density becomes equal to:
μ0 I
B ( x) = (5.11)
2π x
This equation is valid under the condition that the surrounding material
is vacuum, for which Ampère’s law was postulated. In words this equation
tells us, that the magnetic flux density is proportional to the current in the
wire and the magnetic permeability in vacuum and inversely proportional to
the distance. This dependence of the magnetic permeability automatically
224 Chapter 5. Electromechanic actuators
Figure 5.4: Three currents running in parallel wires, directed towards the observer,
each generate a magnetic field that is represented by the dashed field
lines. These fields add vectorial, according to the superposition principle
of magnetic fields, resulting in a larger total field that is represented
by the solid red field lines. It shows the straightening of the resulting
field lines at some distance parallel to the orientation of the wires.
B ( x) I
H ( x) = = (5.12)
μ0 2π x
This means that the magnetic field strength is only determined by the
current and the distance to the wire. For this reason the vector field H,
corresponding with the magnetic field strength H was named “ magnetising
field” in order to indicate an electric current as the source of the magnetic
field B1 .
In practical electromagnetic systems wires are wound in coils with a certain
number of windings n. As can be seen in Figure 5.4 the magnetic fields
created by a multiple of electrical wires running in parallel, add vectorial,
because of the superposition principle of magnetic fields. As a result, by
winding the wire in a coil, the total field generated by the current increases
per winding added. After integration of the magnetic field of all windings
using Equation (5.11) over the cross section A c inside a coil with a winding
1 The real answer to the philosophical question of “what comes first” or “what causes what”,
the H or the B field, is beyond the needs of this chapter as both fields are equally present and
can be used where appropriate.
5.1. Electromagnetics 225
height hc , the following expression for the flux density (Bw ) and magnetic
field strength ( Hw ) inside the coil of Figure 5.5 would be obtained in vacuum
or air.
μ0 nI Bw,i nI
Bw ≈ =⇒ Hw = ≈ (5.13)
hc μ0 hc
This relation is a rough approximation because of the fact that the magnetic
field is not really homogeneous. The effects of the edges of the coil are
fully neglected, which limits this approximation to coils with a height to
diameter ratio of larger than one. Nevertheless it is useful in a qualitative
manner, as it shows the impact of the height of the coil. The reduction of
the magnetic field as function of the winding height hc , under further equal
conditions regarding current and number of windings, is caused by the
reduced addition of the separate flux levels of each winding at an increased
mutual distance. The diameter is cancelled out of the equation, again under
the condition, that it is approximately the same value or smaller than the
height. This can be reasoned from the fact, that in the mid position the
contributions of the different windings is equal, while closer to one side the
contribution of that side is proportionally increased, where the contribution
of the the other side is decreased. In approximation this means, that the
flux density is constant over the cross section.
The direction of the magnetic flux, relative to the current direction, directly
follows from the definitions for the Maxwell equations from Figure 5.2.
Following the direction of the current, the magnetic flux is rotating clockwise
around the wire as was shown in Figure 5.3. For a coil with multiple
windinngs this means that the direction of the flux inside the coil corresponds
with the current flowing around it in the clockwise direction. The place
where the flux leaves the magnet is defined as the “North pole” and the
other side as the “South pole”. Even though this seems to correspond with
their counterparts of the magnetic field of the planet Earth, the poles of the
earth are reversed as the definition of North pole in a magnet was made
to tell which side of a compass needle points towards the North pole of the
planet Earth. Because opposite poles attract each other this definition has
caused the reversal.
With the found relations between the current in a wire and the generated
magnetic field it is possible to derive a simplified model, that is useful for a
first order analysis of magnetic systems. Similar to Ohm’s law in electronics,
226 Chapter 5. Electromechanic actuators
Ao
hc Ai Ai Ac
Southpole B Northpole
n windings
I V
_
+
Figure 5.5: The magnetic field from a current carrying coil consists of the vectorial
addition of the magnetic fields of all windings contributions. This
results in an approximately uniform field inside the coil that is shared
by all windings.
the British physicist and electrical engineer John Hopkinson (1849 – 1898)
stated that a magnetic system can be described as a combination of a source
of magnetism that, together with a magnetic load, determines the magnetic
“current”, the flux Φ.
It was shown in the previous section that an electric current acts as a source
of magnetism. In Hopkinson’s law of magnetics this source is defined as the
multiplication of the current with the number of windings. It is called the
magnetomotive force, familiar to the electromotive force for a voltage source
of electricity:
Fm = nI [A] (5.14)
For the magnetic resistance the term Reluctance (ℜ) is introduced because
of the English word “reluctant” in relation to the unwillingness of a material
to be permeated by a magnetic field. The magnetic reluctance is defined by
means of the following relation:
Φ
ℜ= (5.15)
μA
where Φ equals the length of the path, that the magnetic flux has to follow
and A equals the cross section of the path. This is logical, when considering
that the reluctance ℜ represents the amount of effort that is needed to
5.1. Electromagnetics 227
Generally neither the cross section A , the path Φ nor μ is constant over the
entire system, so in practice the reluctance is calculated as a summation
over the path of the flux of the reluctance contributions of each region with
different properties. This means that each reluctance element adds to the
total reluctance just like with their electrical counterparts.
To check this law the induced magnetic field by a current in the coil from
Figure 5.5 in vacuum is determined, using Hopkinson’s law of magnetics.
This results in a flux density Bw inside the windings of the coil of:
Φw Fm nI
Bw = = = (5.17)
Ac Acℜ Acℜ
The reluctance can be modelled to consist of the part ℜ i with length i inside
the coil and the part ℜ o with length o outside the coil:
i o
ℜ = ℜi + ℜ o = + (5.18)
A i μ0 A o μ0
where A i and A o equal the cross sections of the flux path inside and outside
of the coil. The first term of the reluctance inside the coil can be determined
quite straightforward, because of the constant cross section A i = A c and the
height of the coil (i = hc ). The second term of the reluctance outside the
coil is more complicated, as the length o for each flux line will be different
ranging from hc at the exit of the coil to infinite. Fortunately the cross
section A o is also infinite, as it consists of all space outside the coil. As a
result the reluctance of the outside part becomes small in respect to the
inner part. This means, that the reluctance can be approximated by only
taking the inner part:
i
ℜ≈ (5.19)
A c μ0
Figure 5.6: A living frog floating inside a coil with 20T magnetic flux density that
was demonstrated by Nobel prize winner Andre Geim. The diamagnetic
properties of the water in the frog stabilises its position inside the coil.
(Courtesy of Radboud University Nijmegen)
Fm
Hw ≈ =⇒ Fm ≈ H w i (5.21)
i
One method to increase the magnetic flux, without increasing the current, is
the use of a material with a higher permeability. Because μ0 is the reference,
materials other than vacuum have a relative permeability μr in respect to the
permeability in vacuum. This value equals around 1 for non-ferromagnetic
materials like air, glass, many metals and plastics but with ferromagnetic
materials μr can reach values of about 100 for iron and nickel to as large
as 100,000 for materials like Superpermalloy. Most non-ferromagnetic
materials show some very limited effects and are called paramagnetic if
they show a μr of just above 1 or diamagnetic if they show a μr of just
smaller than 1. The latter have the “strange” property of being repelled by
a magnetic field. A famous example is the levitated living frog shown in
Figure 5.6. It was demonstrated by the Dutch/Russian Nobel prize winner
Andre Konstantinovitsj Geim (1965) at the High Field Magnetic laboratory
of the Radboud University in Nijmegen, the Netherlands. The frog is stably
5.1. Electromagnetics 229
ferromagnec yoke
I
I V
+ V +
_ Ay
n windings
_ Ay
Figure 5.7: Magnetic flux generated by a current carrying coil wound around a
ferromagnetic yoke is much larger than without the yoke, because of
the high magnetic permeability of the ferromagnetic material.
floating inside a very strong magnetic field of about 20T, because of the very
small diamagnetic properties of water, which is the main constituent of a
frog.
The μr values of diamagnetic materials are extremely small. For this reason
they are still of little practical use in electro magnetic systems and will
therefore not be further presented in this chapter. On the other hand real
ferromagnetic materials are frequently used for creating strong magnetic
fields.
When a yoke of ferromagnetic material is added to a coil, the flux and flux
density will increase significantly as the ferromagnetic material determines
a low reluctance path for the flux, both inside and outside the coil, as shown
in Figure 5.7. By using Hopkinson’s law of magnetics the flux in the yoke
can be calculated.
The magnetomotive force of the current equals:
Fm = nI (5.22)
Bw Saturaon
Bw P0 P r H w Bw P0 H w
In vacuum or air H w (I )
Saturaon
Figure 5.8: Relation between the flux density Bw and the magnetic field strength
Hw inside a current carrying coil, with and without a ferromagnetic
yoke. Because Hw is proportional to the current in the coil, an increase
in the current will result in an increase of the flux density at a rate
determined by the magnetic permeability. With a ferromagnetic yoke,
the initially strong effect of μr will reduce above a certain level of the
flux density, due to saturation. At even higher levels the curve will
asymptotically continue parallel to the line with μ0 only.
with y and A y being respectively the length and the cross section of the flux
path inside the yoke. Note that in spite of the two dimensional case theses
values are approximated one dimensional where A c is assumed orthogonal
to the flux and c is the average path length.
The flux Φy in the yoke is equal to the flux Φw in the coil:
Fm A y μ0 μr nI
Φy = Φw = = (5.24)
ℜ y
The corresponding flux density and magnetic field strength inside the coil
equal:
Φy μ0 μr nI Bw nI
Bw = = =⇒ Hw = = (5.25)
Ay y μ0 μr y
increase of the flux and the flux density. In ferromagnetic materials the
relative permeability μr is not constant but reduces at higher levels of
flux density. This is visualised in the magnetisation curve, a graphical
representation of the flux density Bw as function of the field strength Hw . The
horizontal axis is also proportional with the magnetomotive force and the
current in the magnetising coils, because the field strength is independent
of the permeability.
In Figure 5.8 the magnetisation curve of a ferromagnetic material in the
configuration of Figure 5.7 is compared with the situation without a ferro-
magnetic yoke.
The current is increased from zero starting at the origin. This current causes
a proportional magnetising field with a magnitude equal to the magnetic
field strength Hw . This also corresponds with an increasing magnetic field
of which the magnitude, the flux density, depends on the level of the relative
magnetic permeability of the material.
Above a certain value of the flux density, the material becomes saturated.
This is caused by small magnetic areas in the materials micro-structure,
that are responsible for the relative permeability and have to direct them-
selves to the magnetic field. These areas are called Weiss domains after
the French Physicist Pierre-Ernest Weiss (1865 – 1940), who discovered the
magnetic orientation of these elementary “building blocks”. A Weiss domain
has a typical size of 10−6 to 10−8 m and contains approximately 106 to 109
atoms. The orientation to the external magnetic field takes place inside
the Weiss domain and can be imagined as an elastic effect. Initially the
domains only partly orient themselves but with higher levels of magnetisa-
tion they gradually become completely oriented. As soon as this happens,
the beneficial effect to the magnetic field is reduced. When the current is
increased beyond this level the flux density increases further proportional
to μ0 only so parallel to the line corresponding with the situation without a
ferromagnetic yoke.
Bw
Br
Hc
H w (I )
Virgin
curve
Figure 5.9: Hysteresis of a permanent magnet. After the first magnetisation cycle
the material will retain some magnetic energy, resulting in a magnetic
field that is present, even when the magnetizing field caused by the
current is zero. The corresponding flux density Br is called the remnant
flux density and the negative field strength that would be required to
cancel the magnetic field due to the hysteresis, is called the coercive
force Hc .
Br,n | 1.4 T
NdFeB
Stabilized Bm
D
Br,f | 0.35 T
Ferrites
Bm P0 H m
H c,f H c,a
Hm
magnetisation direction again. The virgin curve will only be followed once.
For a permanent magnet only the upper left (or lower right) quadrant of
the magnetisation curve is used. Without an external magnetising field,
the flux density of a magnetised permanent magnet will be equal to the
remnant flux density Br . The field strength of the internal magnetising field
is equal to the external field strength, that would be necessary to cancel
the magnetic field. The corresponding magnitude of this field strength is
called the coercive force Hc . In search for ever stronger magnets, material
science has delivered a wealth of combinations of several, sometimes quite
exotic, permanent magnetic materials with very different characteristics.
Examples of the demagnetisation graph of three different permanent magnet
materials are shown in Figure 5.10.
One of the graphs belongs to a magnet material that has been rather popular
in the past. It was an alloy of Cobalt, Nickel and Aluminium with some
other additives, carrying trade names like Alnico and Ticonal. They showed
a large remnant flux density but also a curved graph. This is caused by the
Weiss domains that already start to re-orient themselves to the negative
magnetic field, giving a relative permeability μr > 1. Because of the high
hysteresis in this material this re-orientation of the Weiss domains becomes
234 Chapter 5. Electromechanic actuators
In this equation the length m of the permanent magnet is equal to the length
i of the flux path inside the equivalent coil as noted in Equation (5.21).
Southpole S B N Northpole
Ferromagnec
Ferromagnec (iron) yoke
core/yoke
Ay
Am Ag lg Ay
S N
Am Ag
Figure 5.12: Magnetic field in the air-gap caused by a permanent magnet when
neglecting flux loss. The reluctance of the magnetic flux path is a
combination of the reluctance of the permanent magnet, the air-gap
and the ferromagnetic yoke.
By combining this with Equation (5.27) the following expression for the
magnetic flux of the permanent magnet is obtained:
Br A m
Φm = (5.29)
A m y A m g
1+ +
A y m μr A g m
The flux in the air-gap equals only a part of the total flux of the magnet.
The other part is the leakage flux or stray flux, that follows a path outside
238 Chapter 5. Electromechanic actuators
Air -
S N gap
Figure 5.13: Stray flux reduces the useful flux in the air-gap. With FEM modelling
software this is clearly made visible in a comparable configuration, as
used in Figure 5.12. By counting the flux lines inside and outside the
gap the loss factor λ appears to be about 0.45. More than half of the
total flux is lost.
the useful area of the air-gap. This stray flux with its corresponding loss
factor λ is caused by the fact, that all materials conduct magnetism, even
vacuum and air. Magnetic insulation is not possible and this means that
the magnetic field will seek the lowest energy situation by using all space
available. The magnitude of this effect is shown in Figure 5.13 where in the
example configuration a magnet with a Br of 1 T creates a flux density in the
air-gap of only around 0.3 T and the loss factor λ appears to be approximately
0.45 as can be checked by counting the field lines.
For this reason in this roughly approximating calculation the following
relation for the flux Φ g in the air-gap is allowed:
In practice the value for λ will be between 0.25 and 0.75 depending on the
configuration. Next to the shown example with λ ≈ 0.45 two examples with
a lower and higher loss factor will be shown as soon as the mathematical
analysis is finished.
When Equation (5.30) is combined with Equation (5.31) the magnetic flux
density in the air-gap can be calculated:
Φg λΦM Am λB r λB r
Bg = = = = (5.32)
Ag Ag Ag A m g Ag g
1+ +
A g m A m m
This relation reads as follows, by first noting that the flux density in the air-
5.1. Electromagnetics 239
In order to reduce the stray flux and optimise the use of expensive magnet
materials it is often preferred to use a different configuration than the
240 Chapter 5. Electromechanic actuators
Bm
1
2 Br
Hc 12 Hc
Hm
one shown in Figure 5.12 by choosing the cross section of the magnets
large in respect to their length. It also appears to be useful to not include a
ferromagnetic part between the magnet and the air-gap as the ferromagnetic
part would determine a low reluctance path to the sides of the air-gap even
for the flux coming from the inner part of the magnet. This is shown in
Figure 5.15, where at the right side the effect of ferromagnetic pole pieces
is shown and at the left side the optimal configuration with permanent
magnets directly adjacent to the air-gap.
In this configuration the path length though air is often equal to the path
length through the magnets and the flux density in the air-gap using Equa-
tion (5.33) becomes:
λB r
Bg = (5.35)
2
When applying magnets with a Br of 1 T this second example shows a flux
density in the air-gap of approximately 0.4 T. The flux density in the perma-
nent magnet can be determined by using Equation (5.31) with a loss factor
λ = 0.75, as found by counting the magnetic field lines. Because A m = A g
this calculation results in a flux density of the magnet Bm of approximately
Br /2, which means that the permanent magnet is used at its optimal energy
point.
5.1. Electromagnetics 241
N N
S S
Air-gap Air-gap
N N
S S
Figure 5.15: With flat permanent magnets, directly interfacing the air-gap, the
stray flux is kept at a minimum. Often ferromagnetic pole pieces
are suggested to spread the magnetic field more uniform, however,
as shown in the right figure, this pole piece significantly increases
the stray flux, due to the low reluctance path to the edges of the
magnet. Even with a smaller air-gap the loss factor λ reduces from
approximately 0.75 to 0.6 as can be checked by counting the flux lines.
Air-gap
N N
S S
Figure 5.16: Concentration of flux by ferromagnetic pole pieces. This rotation sym-
metric structure with a large but inexpensive magnet, as commonly
applied in loudspeakers, creates a strong magnetic field directed in
the radial direction in the air-gap. A large part of the magnetic flux is
however lost outside the air-gap.
When a very high flux density has to be created in a relatively small air-gap,
it is possible to concentrate the flux by using ferromagnetic pole pieces that
have a smaller surface at the air-gap than at the surface of the magnet. Even
though this creates an increased loss it is often used with very inexpensive
242 Chapter 5. Electromechanic actuators
magnets like the ferrites that are usually applied in loudspeakers. The
effect on the flux is shown in Figure 5.16.
With Equation (5.32) this concentrating effect can be calculated, although
due to the high amount of stray flux (λ < 0.3), the magnet needs to be consid-
erably larger than would be the case with the flat magnets of Figure 5.15.
Still in this example an inexpensive magnet with a Br of 0,4 T creates a
flux density in the air-gap of approximately 0.7 T , which is fully acceptable
when low cost is the primary specification and size is no issue.
In the next section the magnetic field in the air-gap is used to create a
Lorentz actuator by inserting a current conducting wire in the magnetic
field.
5.2. Lorentz actuator 243
The Dutch physicist and Nobel prize winner Hendrik Antoon Lorentz (1853 –
1928) formulated the Lorentz force as a completion to the Maxwell equations.
The law of Faraday describes the effect of a changing magnetic field on
electrical charges hence generating electricity from kinetic energy. Based
on energy conservation laws creating electrical energy from motion is fully
complementary to creating motion energy from electrical energy so the laws
of Lorentz and Faraday are strongly related.
In vectorial notation the formulation of Lorentz describes the force on a
moving charged particle as:
F = q(E + v × B) (5.36)
with v [m/s] equals the instantaneous velocity of the particle. The first
part of the Equation qE is the electrostatic force and the second part is the
electromagnetic force. This second term is used in electromagnetic actuators.
Next to the force on a moving particle it equally represents the force on a
current flowing through a wire with length w [m], inserted in the magnetic
field. For this situation the moving charge equals the current times the
length, qv = w I, and with this relation the electromagnetic Lorentz force is
equal to:
F = w I × B (5.37)
244 Chapter 5. Electromechanic actuators
&
I
& &
F B
Figure 5.17: Determining the direction of the Lorentz force with the corkscrew rule.
When the corkscrew is rotated right handed, from the direction of
the positive current to the direction of the magnetic field (arrow), the
movement of the point of the corkscrew determines the direction of
the force.
F = BI w sin α (5.38)
F = BI w (5.39)
where w,t equals the total length of the wire inserted in the magnetic field.
This equation is widely used as a general equation for linear actuators based
5.2. Lorentz actuator 245
Permanent Magnets
Ferromagnec yoke
F
Magnec Field
F
A
Detail A
Figure 5.18: Basic flat type Lorentz actuator. The Force F is proportional to the
current and the amount of winding length of the wire in the coil
inserted in the magnetic field B. Due to the large amount of coil
outside the air-gap this is not a very efficient configuration. Detail
A shows the forces acting on a wire segment at the centre of the coil,
clearly indicating the inefficiency.
on the Lorentz principle. It clearly shows that the force only depends on
the current, the total wire length and the magnetic flux density. Because
there is no direct relation with the position, the actuator ideally has zero
stiffness. This is the main reasons why a Lorentz actuator is preferred in
precision positioning systems as it avoids transmissibility of vibrations from
the stationary part to the moving part. Later it will be shown that this
is the ideal situation. In reality at higher current levels some stiffness is
observed due to non-linearity and position dependency of the B value.
Figure 5.18 shows a basic flat type Lorentz actuator configuration for il-
lustrating the principle only as it is not the best design possible. It uses
the flat high efficiency permanent magnet configuration as described in the
previous section and the coil is flat wound and inserted partly in the mag-
netic air-gap. The force of this actuator can in principle be calculated using
Equation (5.40). The permanent magnet flux is directed perpendicular to
the current. When the curvature of the wires is neglected, the force equals
approximately BI w,t , where w,t is the part of the total length of the wire
inside the air-gap. The inner windings contribute little to the force, because
of the curvature. This is shown in Detail A that indicates the spread force
contributions along a wire segment at the centre. Only the component of
246 Chapter 5. Electromechanic actuators
Permanent Magnets
Ferromagnec yoke
F=0
Magnec Field
Figure 5.19: Non operating actuator with mechanically connected magnets and
coils. The force is created between the magnetic flux and the current
and not between the magnet-coil combination and the iron part of the
system.
Useless windings
Round coil
Race track
Figure 5.20: Flat type Lorentz actuator with improved Force to current ratio. By
increasing the width and adding two additional magnets like shown
in the upper drawing, the force is doubled at the same power loss in
the windings. Using an oval shape of the coil like a “race track” as
shown in the lower drawing, will improve the force to power ratio even
more.
Using this equation it is obvious that the actuator of Figure 5.19 is not
working, because with that configuration there is no change of the flux
through the coil as function of the displacement ( dΦw / d x = 0).
In the previous section it was demonstrated that the windings of the Lorentz
actuator of Figure 5.18 are not very well utilised. While the resistive power
248 Chapter 5. Electromechanic actuators
The most widely applied version of the Lorentz actuator is the moving-coil
loudspeaker actuator. This type is a fully rotation symmetric configuration
with a round coil that is inserted in a magnetic field that is directed radial
towards the centre of the structure, like shown in Figure 5.16. Even though
the magnetic field in the air-gap is three dimensional the orientation is
orthogonal to the current of the windings. For this reason the approximating
scalar expression for the force can be used also in this example. A version
with permanent magnets directly adjacent to the air-gap with the coil is
shown in Figure 5.21. In this configuration the coil is completely surrounded
by the permanent magnet, resulting in a maximum efficiency.
Coil
A A-A
Figure 5.21: moving-coil loudspeaker motor with permanent magnets directly ad-
jacent to the air-gap. In the middle position the coil is fully inserted
in the magnetic field giving a maximum force to power ratio.
and is caused by the fact, that off the centre, the coil is not completely
surrounded by the same magnitude of the magnetic field. When used in
a closed loop positioning system this implies a change in the control loop
amplification (gain) and a non-linear positive or negative stiffness depending
on the current level and the position relative to the centre position x0 .
This can be explained as follows. Assume the coil carries a current with
a direction that corresponds to a force in the positive x direction. When
the coil is pushed by an external force F x in the negative x direction from
outside into the air-gap, at first the force needed to move the coil increases.
This is due to the increased part of the height hc,g of the coil inside the
air-gap and continues until at x0 the maximum force is reached. As a
consequence, this trajectory showed a positive stiffness, dF/d x ≥ 0. When
the coil is moved further in the negative x direction the force decreases
again, which corresponds with a negative stiffness, dF/d x ≤ 0.
250 Chapter 5. Electromechanic actuators
hc (u) hg hc hc (u)
hc hg (o) hg (o)
F/I(x)
hc,g(x)
hg
x0 x
x x
hc,g
Figure 5.22: A Lorentz actuator has a limited stroke, determined by the dimensions
of the air-gap and the coil. When the coil is only partly inserted with
an effective height hc,g that is inserted in the air-gap, the Force to
current ratio (F/I ) is reduced. By choosing different values for the
height hc of the coil and the height hg of the air-gap, the Force to
current ratio as function of the position x can be made more constant
over a certain range of x. The gradual decrease of the magnetic field
(stray flux) at the edges of the air-gap softens the transitions.
The only method to increase the range of a Lorentz actuator, while pre-
serving a constant force to current ratio, is by combining several coils and
magnets into a electronically commutated actuator. The principle is shown
in Figure 5.23. A set of an even number of alternately magnetised perma-
nent magnets creates an alternating field in the air-gap. The coil is divided
in three equal separate sections. In Figure 5.22 it was shown that the force
factor B of each coil segment is dependent on the position. By choosing an
optimal height relative to the permanent magnets, the force factor of each
coil segment becomes an almost ideal sinusoidal function of the position.
When the centres of the segments have a distance dc equal to one third
of the total height h4 of four magnets the spatial sinusoidal forces have a
spatial phase difference of 120◦ .
This is a three-phase actuator configuration. The relation between the sinu-
soidal functions is completely comparable to three-phase power distribution
networks where three temporal sinusoidal voltages, called R, S and T with a
phase difference of 120◦ , are used to transport electrical energy. One reason
to use this method is the fact that the three sinusoidal functions add to zero
252 Chapter 5. Electromechanic actuators
h4
R S T
F
dc
2
Q5
[N]
JQ5
>JQ5
>
R S T
?
Figure 5.23: The range of a Lorentz actuator can be extended by electronic com-
mutation, using more coils and magnets. Each coil segment has a
force factor that changes sinusoidal as function of the position with a
120◦ spatial phase difference with the other coil segments. By chang-
ing the direction of the current at the zero force positions of each coil,
an almost constant force is obtained. The values are modelled for a
current per coil of 1 A and a value of (B)max = 1.
With power distribution this means that the average voltage of three wires
carrying these voltages is zero at any time. This prevents electromagnetic
5.2. Lorentz actuator 253
radiation from the power lines. Another important value is the possibility
to directly drive rotating AC inductance motors for traction and machining
centres. In mechatronic positioning systems it appears to be also very useful
in long range linear motors.
When all coil segments in the configuration of Figure 5.23 would get the
same DC current the resulting force would be zero. This might seem useless,
but when the current of each coil is reversed at the exact position ,where
the force in that segment is zero, the forces of all three segments add to
an almost constant value of two times the maximum value of BI per coil
section, with a variation as indicated in the figure.
In rotating DC motors this current commutation process is mostly done by
means of mechanical sliding contacts, although presently this commutation
is increasingly achieved electronically in order to avoid sparking at the
commutation points and wear of the sliding contacts.
The application of electronic commutation can also help solving the problem
of the not perfect continuous force by exchanging the hard-switching by a
more continuous current change.
R S T
JQ8
JQ6
JQ4
JQ2
[N]
J
>JQ2
>JQ4
>JQ6
>JQ8
>
?
JQ8
JQ6
JQ4
JQ2
I [A]
B[N\A]
J
>JQ2
>JQ4
>JQ6
>JQ8 R S T
>
?
was derived that was successfully used for that purpose, the figure of merit.
The basic function of the actuator is to generate a certain acceleration of the
optical element. Driving with higher currents can deliver more acceleration
but the penalty is a higher dissipation. The final “Figure of Merit” was
called: “G’s per square root of Watt’s”. How many G’s one would get per watt
of power. The higher the figure the better. If the figure of merit is called Q m
the relation is:
G
Qm = (5.45)
W att
Although a bit unconventional, unscientific and at least not according to SI
units this number proved to be quite suitable and has been in use for about
10 years. When analysing the figure, using the actuator design aspects, the
reason becomes clear. As a first step, the figure of merit is written as follows
according to SI rules:
F
a m F
Qm = = = (5.46)
P P m P
With a as acceleration and P as power. In such a mechanism this power
almost completely consists of the resistive power loss (P = Pl ), as the move-
ments that are made are very small. This relation already makes clear
that a small mass is favourable. A large force at low dissipation also leads
to a higher score. One step further F can be eliminated, using the simple
Lorentz actuator formula F = BI w,t with w,t being the total length of the
wires inside the magnetic field of the permanent magnet.
BI w,t
Qm = # (5.47)
m Pl
The second step is related to geometric aspects. When the part of the length
of each winding in the air-gap (active part) is called w,a and the length
of the part of each winding outside the air-gap (passive part) w,p and the
number of windings is n, it is allowed to say:
BI nw,a
Qm = # (5.48)
m Pl
The resistance R is the total resistance, including the active part and the
passive part of each winding. With n windings of a resistivity ρ r and a cross
section per wire A w , the resistance of the coil becomes:
( )
nρ r w,a + w,p
R= (5.50)
Aw
With γ being the fill factor of the windings and A c,w the cross section of the
coil windings ( bc hc ), the cross section per wire becomes:
γ A c,w
Aw = (5.51)
n
With this value the resistance equals:
( )
n2 ρ r w,a + w,p
R= (5.52)
γ A c,w
The active part of the volume of the coil within the magnetic field Vc,a
is equal to A c,w w,a . When substituted together with Equation (5.52) in
Equation (5.49), this leads with a bit algebra to the following expression for
the figure of merit:
B γVc,a w,a
Qm = ( ) (5.53)
m ρr w,a + w,p
It is demonstrated, that the actuator figure, “G’s per square root of Watt’s”
has been changed into clear understandable and far more logical engineering
choices. By maximising B, γ, Vc,a and minimising m, ρ r and w,p relative to
w,a , it became possible to realise an optimal design at such a low price,
that this part of the CD player changed from a costly part as in the first CD
players to the mass produced cheap commodity it is today.
5.3. Reluctance actuator 257
Ferromagnec yoke
d' y
'e r I e r
dt
Figure 5.25: The reluctance force in a Lorentz actuator is a force that acts indepen-
dent of the permanent magnet flux. It is caused by the attraction of
the iron part by the magnetic field of the coil. For high frequencies
this force can be reduced by a conductive ring, that counteracts the
change of the magnetic flux in the iron part, as shown at the right.
Non-ferromagnec material
Figure 5.26: Lorentz actuator without a ferromagnetic part to cancel the reluctance
force. The return path of the permanent magnet flux goes only through
air.
For those more critical applications where the linear performance is far
more important than the cost of a bit more magnet material, one can decide
to entirely leave the ferromagnetic part away. Figure 5.26 shows such a con-
figuration, where the magnets are connected by means of non-ferromagnetic
material. A second ring of magnets has been added to compensate for the
higher reluctance at the inside of the magnetic system. The increase of the
reluctance for the permanent magnet flux at the outside is less dramatic for
260 Chapter 5. Electromechanic actuators
the same reason as was presented with the air coil from Figure 5.4, where
the reluctance of the path outside the coil could be neglected in respect to
the reluctance inside the coil.
This configuration is used in the wafer stage that is presented in Chapter 9
on wafer scanners.
As will be explained with the Reluctance actuator in the next section, Equa-
tion (5.43) is not valid under all circumstances as it is limited to the situation
where dΦw/d x is only depending on the permanent magnet and not on the
current. This means that it is necessary to use a different approach to
determine the total force that acts on an electromagnetic actuator including
the reluctance force. A suitable method is based on the law of conservation
of energy in a closed system.
In an electromagnetic actuator, electrical energy is converted in three differ-
ent kinds of energy:
Φw,t
L= [H] (5.54)
I
5.3. Reluctance actuator 261
I
+ 'w I
VR R +
+ _ Ef r
d' w
VL L
dI
Current
source + Va
dt dt
_ VL L L
n2
}
_ _
+
Vm Inductor L with n windings
The total flux Φw,t is equal to nΦw , because a coil consists of n windings,
and each winding encloses the same flux Φw . The logic behind this relation
of the self-inductance is based on Faraday’s law, stating that the induced
Electric field Ef in a winding is equivalent to a change in flux over the closed
surface inside the winding. This creates a proportional electromotive force
over the winding. When each of the windings in a coil gets the same change
of flux, the total coil would show an induced electromotive forced, that is
the sum of the electromotive force of each individual winding as they are
placed in series. Due to the minus sign in Faraday’s law, the electromotive
force within the coil is directed from the negative to the positive electrode,
as explained in Chapter 2. This direction is opposite to the current direction
so, when the electrodes are defined as in Figure 5.27, the induced voltage
VL at the electrodes of the inductor is equal to this electromotive force:
dΦ w dΦw,t
VL = Fe = n = (5.55)
dt dt
With the defined relation of L, the total current induced flux equals Φw,t = LI ,
resulting in:
dI
VL = L (5.56)
dt
262 Chapter 5. Electromechanic actuators
Step 1: Power input Using the equivalent circuit diagram of Figure 5.27,
at any moment in time the electrical power is equal to:
Pin ( t) = Va ( t) I ( t) (5.58)
Va = VR + VL + Vm (5.59)
VR = IR (5.60)
The induced voltage over the self-inductance due to the change of the current
equals:
dI
VL = L (5.61)
dt
The motion voltage is the induced voltage due to the change of the flux by
the movement only. In fact the change of the flux Φw consists of two parts.
One part is related to a change in the current in the coil and matches with
the self-inductance. The other part is related to a change in the position:
∂Φw d x
Vm = n (5.62)
∂x dt
so that:
dI ∂Φ w d x
Pin = I 2 R + IL + nI (5.63)
dt ∂x dt
5.3. Reluctance actuator 263
Step 2: Power storage Energy can be stored in only one element, the
self-inductance. The stored magnetic energy is equal to the electric energy
needed to create it, so it is calculated by integrating the power that was
necessary to insert a current in the self-inductance. This power is equal to
the current times the voltage caused by the change of current:
dI
PL = I ( t)VL = I ( t)L (5.64)
dt
If the current at t0 equals 0 and at t1 equals I 1 the stored magnetic energy
E L at current level I 1 for a certain value of L is calculated by integrating
the power over the time from t0 to t1 :
t1
t1
I1
dI 1 2
EL = P L ( t) d t = I ( t) L dt = L I ( t) d I = LI (5.65)
dt 2 1
t0 t0 0
Step 3: Power output Power can exit the system in only two ways: useful
as mechanical power or useless as dissipated heat.
Pout = Pdiss + Pmech (5.68)
The dissipated heat equals:
Pdiss = I 2 R (5.69)
The mechanical power is determined in the next step.
In this equation the first term is the linear relation of the force to the cur-
rent, that is always present in any actuator. The second term is the squared
relation of the force to the current and is caused by the magnetic energy that
is stored in the self-inductance. In the Lorentz actuator of Section 5.3.1.2
the self-inductance is independent of the position because there is no sur-
rounding ferromagnetic material that can influence the reluctance of the
magnetic field from the coil. In that configuration the second term is zero
and the general equation reduces to the previously derived Equation (5.43).
dΦw
F = nI (5.74)
dx
In the following it will be shown that for a reluctance actuator without any
permanent magnets the situation is quite different, because in that case the
second term becomes exactly half of the first term which means that half
the useful electrical energy (excluding resistive losses) is used to “charge”
the coil with magnetic energy and only the other half is used for mechanical
energy.
mover
A y,m x
dx
Ag Ag Ag
F
A y,s
stator
'w
I
n
Figure 5.28: The variable reluctance actuator only works by the principle of the
reluctance force. The moving part is attracted with a force proportional
to the current squared and inversely proportional to the distance
squared.
people know it from the magnet that pulls-up a car with a helicopter in
a crime movie. To calculate the force, Equation (5.73) is used, where the
relative movement of the mover to the stator equals d x:
∂Φ w 1 2 d L ( x)
F = nI − I (5.75)
∂x 2 dx
In this case, without a permanent magnet, ∂Φw /∂ x is only determined by
the current and directly related to the self-inductance, that in its turn is
only a function of the position:
∂Φw d LI
= (5.76)
∂x dx n
Now these two equations combine to:
d L ( x) 1 2 d L ( x) 1 2 d L ( x)
F = I2 − I = I (5.77)
dx 2 dx 2 dx
The self-inductance is determined by the series reluctance of the two ferro-
magnetic parts, the stator and the mover, with a total length y = y,s + y,m
and the two air-gaps with a total length of 2g , because the magnetic flux
has to cross the air-gap twice.
Φt n2 n2
L= = = (5.78)
I ℜc + ℜ g y 2 g
+
A y μ0 μr A g μ0
266 Chapter 5. Electromechanic actuators
A g μ0
L ≈ n2 (5.79)
2 g
With this value for the self-inductance, Equation (5.77) with x = g gives
after a bit of algebra:
2
nI μ0 A g
F ≈− (5.80)
g 4
The minus sign indicates that the force on the mover is directed in the
negative x direction. This corresponds with the known “pulling” direction
of an electromagnet. It is also clear from this equation that the reluctance
actuator is extremely non-linear. First of all the force only works in one
direction and the “gain”, that equals the force to current ratio, increases with
higher currents. But also the dependence on a movement in the x direction
is high and non-linear which implies a significant stiffness. The magnitude
of the force decreases when the displacement increases. This means, that
the actuator has a negative non-linear stiffness. When saturation effects are
neglected, an infinitely small gap could create an infinite force. In reality
the flux density is limited to the saturation of the applied ferromagnetic
material, which corresponds for ferromagnetic (soft) iron to a value in the
order of 2 T. The resulting force can then be calculated with the equations
of the next section.
Elasc hinge
With the force Equation (5.80) of a reluctance actuator also the magnetic
flux density B g in an air-gap by a permanent magnet, or any other source,
can be related to the force that is exerted between the two sides of the air-gap.
In the reluctance actuator, the relation between the magnetic flux density
in the air-gap and the current is derived from the total flux that passes the
air-gap. It has a clear relation with the self-inductance L:
Φw LI
Bg = = (5.81)
A g nA g
The self-inductance value L of the reluctance actuator was obtained with
Equation (5.79):
A g μ0
L ≈ n2 (5.82)
2g
This gives with Equation (5.81):
nI μ0 2 B g g
Bg ≈ ⇒ nI ≈ (5.83)
2g μ0
This can be combined with Equation (5.80):
2
nI μ0 A g
F ≈− (5.84)
g 4
268 Chapter 5. Electromechanic actuators
B2g A g
F ≈− (5.85)
μ0
F B2g
Pm ≈ ≈− (5.86)
2 Ag 2μ 0
This means for example that a magnetic flux density of 1 T creates a pulling
force per unit of surface (negative pressure) of:
1
P1T ≈ ≈ 0.4 [MPa] (5.87)
8π · 10−7
This equals a negative pressure of 4 times vacuum, which immediately
give the reason why permanent magnets are often used for a contact-less
pre-load of air-bearings as applied in precision positioning systems.
x dx
I1
F1 I2
F2
C1 C2
A g,1 A g,2 F
Figure 5.30: A double reluctance actuator with a central mover shows a linear
current to force relationship in the middle position. This is caused by
the balancing of the force of the two non-linear actuators.
force becomes:
2
2
nI 2 μ0 A g nI 1 μ0 A g n 2 μ0 A g I 22 I 12
F = F2 − F1 = − = − (5.88)
g,2 4 g,1 4 4 2g,2 2g,1
This shows as a first conclusion, that in the mid position, where g,1 = g,2 ,
the force is zero when the currents are equal. As a next step it is interesting
to see what happens if the currents in both halves are modulated, while
keeping the sum of the currents constant. This means that current I 1
becomes I a − Δ I and current I 2 becomes I a + Δ I , while I a is the average
current.
The calculations start with Equation (5.88), with the mover in the mid
position, g,1 = g,2 = g .
n2 μ0 A g I 22 − I 12
F= (5.89)
4 2g
With this result, the force to current ratio of this actuator for the current
change Δ I can be written as:
F I a n 2 A g μ0
= (5.91)
ΔI 2g
270 Chapter 5. Electromechanic actuators
This means that the force to current ratio is depending on the level of the
current level that is common for both halves. It gives the possibility to
control the differential gain by this common current level! The second
conclusion is that the force is linearly related to the differential current
at the same mid-position! This means that this actuator can be used in a
linear control system with a separately controllable gain. Unfortunately
this is only the case at the mid position. At other positions the actuator
half with the smallest gap will dominate the other half, and the linearity
gradually disappears. Still for small displacements with a high force this
configuration can be useful.
Another drawback of this configuration is the negative stiffness that depends
on the common current. This effect can be explained from the fact that at
any position away from the mid-position the mover will mainly be attracted
to the position where it is closest to the stator. This makes this actuator less
suitable for situations, where a direct coupling between stator and mover
is undesired like in a vibration isolation system. In principle this negative
stiffness can be compensated with the positive stiffness of a mechanical
spring, but in practice it is difficult to achieve a reliable reduction of more
than a factor 10 due to tolerances.
With the reluctance actuator it was shown that by combining two reluctance
actuators the actuator can be linearised by providing it with a current
that is common for both halves. This principle has as drawback that a
DC current will contribute to the power losses in the actuator and also
the negative stiffness is quite large. It is a logical step to replace the flux
created by this common mode current by the flux of a permanent magnet
and thus combine the best of both worlds: the high force of the reluctance
actuator with the linearity of the permanent magnet actuator. To introduce
a permanent magnet in the double reluctance actuator it is necessary to
carefully consider the working principle as explained previously. First
of all the force is generated by the flux in the air-gap. Because of the
squared relation between flux and force, a force is obtained that is equal
to the multiplication of the flux related to the common current and the
flux related to the differential current. This means that a perfectly linear
very strong actuator is obtained under the following two conditions. First it
must be possible to create a common flux must in the air-gap by means of a
permanent magnet that is independent of the position of the mover. Secondly
the permanent magnet is not allowed to introduce a significant additional
reluctance for the flux from the current in the coil. In the following section
it is shown how this can be achieved.
5.3. Reluctance actuator 271
dx dx
I1 I1 I2
C1 C2 I2 C1 C2
F F
Figure 5.31: The flux in the double reluctance actuator can be changed by inverting
the current direction in one of the coils. Because of the squared relation
in the reluctance force this inversion has no influence on the forces.
Nevertheless the flux path is completely changed. This creates the
possibility to change the material properties of the mover without
impacting the reluctance force.
The first step is to be aware that the double reluctance actuator of Figure 5.30
works independent of the current direction in the coils due to the squared
relation between current and force. Though it has no impact on the force, the
flux is different if the current in one coil is reversed as shown in Figure 5.31.
If the flux in both coils is in the same direction (upwards or downwards)
the flux of both coils has to flow back vertically through the mover. When
the current in one of the coils is reversed, as shown in the right picture,
the flux of both coils share the same path which only crosses the gap. The
vertical path in the mover is avoided which means that the middle part
of the mover can be replaced by another non-ferromagnetic element like
a permanent magnet. This is shown in Figure 5.32 where the flux of the
permanent magnet perfectly combines with the induced flux of the current
without adding additional reluctance to the latter. In the right picture a
simplified version is shown where one side of the mover is connected to the
stator by means of an elastic hinge that enables rotation in combination
with a low reluctance path for the permanent magnet flux. This elastic
hinge can be used to partly compensate the still not completely avoidable
inherent negative stiffness of the actuator as will be explained later.
It is clear that the permanent magnet creates an unstable situation because
272 Chapter 5. Electromechanic actuators
x F x F
dx dx
I1 N I2 I1 N I2
PM PM S
S
C1 C2 C1 C2
Elasc hinge
Figure 5.32: By inserting a permanent magnet in the mover of the double reluctance
actuator, an additional flux is created, that can replace the flux that
was generated by the average current of the original double reluctance
actuator. This replacement strongly reduces the power consumption
of the actuator.
the magnetic circuit has its lowest reluctance at one of the two positions
as shown schematically in Figure 5.33. With the mover in the middle
position the flux is equally divided over the two half’s of the actuator and the
attraction force in both air-gaps cancel each other out. As soon as the mover
moves in one of the two directions the flux will increase in that half and
decrease in the other which will cause the negative stiffness. Figure 5.34 is
used to calculate the flux of the permanent magnet. While the reluctance
for each gap is proportional to their length x, the total reluctance of both
gaps combined is calculated by taking the reluctance of both gaps in parallel
according to the following relation.
1 1 1
= + (5.92)
ℜ ℜ1 ℜ 2
With d x the displacement of the mover, g,1 = g + d x and g,2 = g − d x being
the length of both gaps, A g the cross section of the gaps perpendicular to
the flux and the ferromagnetic material having an infinite μr , the following
equation is obtained:
1 2g − ( d x)2
ℜ=
= (5.93)
1 1 2μ0 A g g
μ0 A g +
g + d x g − d x
Figure 5.33: The permanent magnet flux division in a hybrid actuator depends on
the position of the mover.
lesser extent than with the double reluctance actuator from the previous
section. This is caused by the fact that the reluctance of the permanent
magnet field is dominated by the permanent magnet material itself which
limits the flux density to below Br . Remember that a modern permanent
magnet acts like an air coil with a very large current. With the reluctance
actuator a small gap will lead to a steep increase of flux density that would
only be limited by the saturation. Because of this lower and more linear
negative stiffness of the hybrid actuator it can more easily be compensated
by a positive spring, which in this configuration is determined by the elastic
hinge. In Figure 5.34 the effect of a current through the coils is shown. The
flux from each coil will follow the path of lowest reluctance, according to the
yellow arrows, showing that it will not pass the permanent magnet but go
round in the ferromagnetic part. As mentioned the winding direction of the
coils is such that their fluxes positively add together.
Depending on the current direction, the combination of the flux of the coils
with the flux of the permanent magnets results in a lower flux density in one
air-gap and a higher flux density in the other air-gap, causing an attracting
force in the direction of the increased flux density. As this happens in two
directions, the force is depending on the current direction and the effect is
linearised according to the same reasoning as with Equation (5.91) for the
double reluctance actuator.
x
Ag F
Flux increase dx Flux decrease
A g,1 N A g,2
I2
I1
n n
Coil 1 S Coil 2
Figure 5.34: The flux in a hybrid actuator is a combination of the flux of the per-
manent magnet and the flux of the coils. In one air-gap the flux of
the coils has the same direction as the flux of the permanent magnet,
increasing the total magnetic field, while at the other air-gap it is
just opposite. This results in a strong net force in the direction of the
strongest magnetic field.
ℜ = ℜ1 + ℜ 2 (5.95)
When one gap gets smaller the other gets bigger so the second term can be
neglected and the known relation from the Lorentz actuator is used:
dΦw
F = nI (5.96)
dx
In this equation it is necessary to replace n by 2n because the flux of the
windings of both coils are added together as they are working in the same
direction. The flux through the windings Φw consists of the flux of the per-
manent magnet and the flux caused by the current through the coils. Under
condition of a non-saturated iron part, as mentioned in Equation (5.95), the
flux caused by the current is determined by the reluctance of the air-gaps
in series. The total reluctance of these gaps is equal to:
g,1 + g,2
ℜ= (5.97)
μ0 A g
5.3. Reluctance actuator 275
This is a constant because the sum of both gap lengths is constant. This
means dΦw / d x is only determined by the permanent magnet flux. The
force can be approximated by taking the two extreme positions when the
mover is just hitting the stator. In both situations the iron determines a
low reluctance path for the permanent magnet, which means that the flux
density Bm of the permanent magnet will approach Br . So by changing the
position from one side to the other the average flux changes approximately
from zero to Br A m . This gives:
dΦ w Bm A m Br A m
= ≈ (5.98)
dx g,1 + g,2 g,1 + g,2
With the two equal coils (2nI ) this results in an estimated force to current3
ratio of:
F dΦw Br A m
≈ 2n ≈ 2n (5.99)
I dx g,1 + g,2
Like all actuators, also the hybrid actuator has a position dependency of the
force, not only related to the mentioned negative stiffness, but also due to the
position dependent force to current ratio. In the hybrid actuator this second
effect is caused by the changing total reluctance of the permanent magnet
circuit as function of the position. As discussed earlier, the reluctance for
the permanent magnet at the mid position equals the the reluctance of the
two air-gaps in parallel. In the outer positions one of the air-gaps is very
small, resulting in a lower total reluctance. This means that the flux of the
permanent magnet increases at the outer positions, giving a higher value
of dΦw / d x than in the mid position.
It is clear that these calculations are very approximative due to the complex
nature of the system, with potential saturation of the iron part and the
approximation of the permanent magnet flux in the extreme positions.
To check this approximation, the force in the mid position can be calculated
in another way, by means of Equation (5.86) with the cross section of the
air-gaps and an estimation of the flux of the permanent magnet.
Like with the double reluctance actuator two counteracting forces are com-
bined. The force in air-gap 1 works in the negative x direction and the force
in air-gap 2 in the positive x direction. The flux density in the air-gaps (B g )
equals the combination of the flux density by the permanent magnet (Bg,m )
and the flux density caused by the coil (Bg,c ). With the current direction as
defined in Figure 5.34 the total flux density in air-gap 1 equals Bg,m + Bg,c
3 Here the current ( I ) represents the variable current to control the force in a position
and in air-gap 2 it equals Bg,m − Bg,c and the following equation is obtained:
Ag Ag 2 A g Bg,m Bg,c
F = F2 − F1 = (Bg,m − Bg,c )2 − (Bg,m + Bg,c )2 = − (5.100)
2μ 0 2μ0 μ0
The minus term is due to the fact that a positive current in the shown
direction results in a force on the mover to the left which is the negative x
direction.
In case the air-gap cross section is large with respect to the length of the
air-gap, the leakage flux will be moderate (λ > 0.80). With a well designed
system, the flux density of the magnet is chosen to be Br /2 and the flux is
divided over two halves which means that a flux density in each gap is equal
to:
0. 8 · B r A m
Bg,m = (5.101)
2 2 Ag
When the reluctance of the air-gaps is large in comparison with the reluc-
tance of the ferromagnetic parts, the flux induced by the current equals
(two coils with n windings!):
Φw 2 nI 2 nI μ0
Bg,c = = = (5.102)
Ag A g ℜ g,1 + g,2
Combining these equations for Bg,m and Bg,c with Equation (5.100) gives the
following force to current ratio of the hybrid actuator:
F Br A m
= 0.8 n (5.103)
I g,1 + g,2
When this result is compared with Equation (5.99) it clearly shows more
than a factor two decrease of the force in the mid position which is caused by
the reluctance of the air-gaps for the permanent magnet flux and the choice
to use as little as possible magnet material (Bm = Br /2). In practical designs
often a larger magnet is chosen which brings the force to a higher value and
reduces the position dependency of the force to current ratio because the
flux density will be more constant closer to Br .
It can also be concluded that with increasing complexity of the magnetic
circuit the need for more exact calculations using FEM computer simulations
is more strong. Nevertheless the presented calculations provide a good sense
for the order of magnitude the real forces in a practical actuator.
Figure 5.35: Five degrees of freedom homo-polar magnetic bearing. The flux by the
current in the coils (red arrows) is guided via a different route than the
flux from the permanent magnets (blue arrows). In the air-gaps they
combine to higher or lower values depending on the current direction,
thereby enabling a change in the force in different directions.
d
b
h
b
Figure 5.36: Two different coil configurations with their basic dimensions shown
as an example of the wide amount of possibilities. Next to these
rotation symmetrical shapes, also square, race-track and even three
dimensional shapes are used.
In order to get an idea of real values, the resistance from the windings is
calculated with the following relation:
n w
R = ρr (5.105)
Aw
with
When designing a coil of a certain size, it mostly starts with the dimensions,
giving the average length of each winding (w = π d ) and its cross section
( A c,w = bh) as shown for two typical coil configurations in Figure 5.36.
A c,w relates to A w in the following manner:
γ A c,w
Aw = (5.106)
n
With γ being the fill factor which reduces the useful volume of the coil due
to round windings and the insulation. Practical values for γ range between
approximately 0.5 for thin round wires to 0.9 for flat wires. With these
factors the resistance becomes:
ρ r n 2 w
R= (5.107)
γ A c,w
5.4. Application of electromagnetic actuators 281
Assuming γ and ρ r are constant for a given coil size, the resistance is pro-
portional to the number of windings squared.
To estimate the effect of the self-inductance, Equation (5.104) is used in the
mechanical stationary situation, which means that Vm = 0
dI
Va ( t) = IR + L (5.108)
dt
After applying the Laplace transform to bring the equation to the frequency
domain the following relation is found:
j ωL
Va ( s) = I (R + sL) =⇒ Va (ω) = IR 1 + (5.109)
R
To give a value to the ability to change the force rapidly, the electrical time
constant τe is introduced, defined as:
1 L
τe = = (5.110)
ω0 R
This shows that τe can only be tuned by the dimensions of the coil, the
resistivity and the reluctance.
It Is Im
Rs Rs Rs
Ra Ra Ra
+ +
_
Vs
L = _
Vs
L + L
+ +
Amplier Actuator Vm Amplier Actuator Amplier Actuator Vm
(source) (load) _ (source) (load) (source) (load) _
In principle the effect of the two voltage sources can be combined linearly
and the total current in the circuit is equal to the sum of the currents caused
by both voltage sources.
Vs Vm Vs − Vm
It = Is − Im = − = (5.113)
R s + R a + j ωL R s + R a + j ωL R s + R a + j ωL
This means that it is allowed to analyse the behaviour of an actuator for each
voltage source separately, while the other source is replaced by a conducting
wire. The current caused by the amplifier voltage creates the force that
was described in the previous sections. The current caused by the motion
voltage is the new factor. It creates the damping effect by the resulting force
in the actuator, counteracting the velocity. The related damping coefficient
c can be derived as follows for a Lorentz actuator.
Using Figure 5.38 for the directions, it starts with the motion voltage:
with w,t being the total length of the windings inside the magnetic field.
To determine the resulting current, the impedance of the self-inductance is
neglected in respect to Ra + Rs . This is mostly allowed with Lorentz actuators
working at relatively low frequencies.
Vm (Bw,t )v
Im = = [A] (5.115)
Rs + Ra Rs + Ra
284 Chapter 5. Electromechanic actuators
dx
v
dt
BA w,t
2
v
Fd BA w,t v
Ra Im
Ra
Figure 5.38: A Lorentz actuator shows a velocity dependent damping force when the
two terminals of the coil are connected by means of a low impedance
circuit (Rs = 0). Following Faraday’s law, the induced electric field by
the movement-related flux-change creates a current I m in the direction
as shown, due to the minus sign. This current generates a Lorentz
Force Fd that is directed in the opposite direction of the movement
according to the corkscrew rule.
As the current will flow in the same magnetic field, a Lorentz force will
occur:
(Bw,t )v
Filling in for I m gives:
Rs + Ra
(Bw,t )2 v
Fd = [N] (5.117)
Rs + Ra
Fd (Bw,t )2
c= = [Ns/m] (5.118)
v Rs + Ra
The conclusion of this equation is, that the damping is proportional to the
B-factor squared. This factor is also related to the force to current ratio.
This means, that it is not preferred to reduce this B-factor when less damp-
ing is needed. In that case it is better to control the damping by adapting
the resistive value of the circuit.
In loudspeaker systems it is necessary to create damping, in order to sup-
press the resonance of the loudspeaker cone with the stiffness of its support
and the air cabinet. For that reason amplifiers for music need to have a
low output impedance. For precision mechatronic systems it is more often
required to avoid the motion related damping as much as possible, because
5.4. Application of electromagnetic actuators 285
dc=30 mm
hc=15 mm
bc=3,3 mm
Figure 5.39: Dimensions of the coil, used for the comparison of the three different
actuator types.
• Moving mass
The main force properties of the Lorentz, reluctance and hybrid actuator
types, will be compared by means of a small calculation example, all with
the same electrical power. For comparison, a coil is chosen with a fixed
configuration and total cross section of the windings as shown in Figure 5.39.
As a next step the number of windings for this design needs to be determined.
It was already shown in Equation (5.112) that the number of windings has no
influence on the dynamic properties of the actuator. The winding volume is
fixed, which means that also the mass is constant. Also the power dissipation
I 2 R is no factor in the design decision as the current is inversely proportional
to n and the resistance is proportional to n2 . This means that the only item
that determines the number of windings is the electrical power source. More
windings imply a higher voltage and a lower current, which means that the
number of windings are adapted to the amplifier. In motion systems with
very high power also the maximum wire thickness of the supply wires is a
5.4. Application of electromagnetic actuators 287
constraint, but for this comparison study this plays no role. This coil has
an average length per winding w of ∼ 100 mm and a cross section A c of 50
mm2 . In order to simplify the example, a look up table as shown in Table 5.2
is used for the wiring properties. A practical value of wire is chosen of
0.75 mm, including insulation, enabling easy calculation for this example
case. With this wire diameter, the volume can be filled with approximately
100 windings, giving a total length w,t of ten metres, which results in a
resistance of around 0.5 Ohm. One Ampère of current will then give half a
Watt of Power. With this coil the performance of the three different actuators
will be evaluated.
With the Lorentz actuator, the diameter of the chosen coil limits the diameter
of the yoke. This also limits the capability of this yoke to transfer a high
amount of magnetic flux. For this reason the chosen coil dimensions are
more wide than long. In order to avoid saturation of the yoke, the flux
density should remain below 2 T. With this value the surface of the air-gap
times the flux density in the air-gap should be lower than the maximum flux
density times the surface of the yoke. With a small calculation this means
that the uniform magnetic field in the air-gap can be maximum 0.7 T. The
height of the air-gap is equal to the coil and in the mid position F = BI t
results in a current to force ratio of 7 N/A. In reality this value will be lower
because of stray flux.
When the same coil is applied in a reluctance actuator Equation (5.80) can
be used:
2
nI μ0 A g
F =− (5.119)
g 4
If the length of the air-gap equals g = 10−3 m, which is quite large, and
A g = 6 · 10−4 m2 , then the resulting force to current ratio would amount up to
only 3.75 N/A, which is not yet impressive. When the air-gap is reduced to
0.3 mm, the force with 1 A current already becomes 37.5 N. At full closure of
the gap the iron might ultimately saturate with a flux density of around 2
Tesla. With Equation (5.85) the maximum attainable force would be equal
to:
B2g A g 4 · 6 · 10−4
Fm = − =− ≈ 2000 [N] (5.120)
μ0 4π · 10−7
This clearly shows the value of a reluctance actuator but it is necessary to ad-
mit that a large error is made due to the approximation on the permeability
of the iron (μr = ∞). This is far too optimistic when the flux density ap-
proaches saturation. By using Hopkinson’s law of magnetics, the minimum
permeability to realise this value can be determined with the stated flux
density of 2 T, using the length y of the flux path through the ferromagnetic
part and assuming a fully closed airgap g = 0:
Φ nI μ0 μr nI
B= = = =2 [T] (5.121)
Ag Agℜ y
5.4. Application of electromagnetic actuators 289
With the known values for this example, nI = 100 and y ≈ 0.1 m, the required
minimum relative permeability becomes:
2y
μr ≥ ≈ 1.6 · 103 (5.122)
μ0 nI
The same key dimensions as with the other actuators are chosen:
These values result in a linear! current to force ratio of about 30 N/A, which
clearly shows the combined performance of “the best of both worlds”, a high
force with a linear force to current relation over a moderate range.
The next criterion for the right actuator choice is the ability to change the
force rapidly. This is reflected by the electrical time constant and the related
necessary limitation of the jerk. For the same three actuators with the
example coil as shown in Figure 5.40, the electrical time constant is purely
determined by the reluctance of the flux resulting from the current in the
coil. The higher the reluctance, the better it is.
When drawing the field lines of the magnetic field, caused by the current
through the coil, in the three actuators, it is clear that the reluctance is
quite high for the Lorentz actuator because it includes a large air path. This
would even be better when no ferromagnetic part is applied. The reluctance
290 Chapter 5. Electromechanic actuators
Figure 5.40: Difference in reluctance, self-inductance and mass of the three actua-
tor types scaled to the same coil dimensions.
of the Lorentz actuator is in any case considerably smaller than with the
Reluctance and hybrid actuator because of their small air-gap. For this
reason, a Lorentz actuator is better suited for high speed precision actua-
tion with fast changing currents, while the other actuators can be applied
where high forces in a semi static situation are required. Nevertheless,
as mentioned in the previous section, the better force to current ratio of
the reluctance and hybrid actuator compensates the negative aspect of the
higher self-inductance to some extent because of the lower required current
and especially the hybrid actuator is very promising for future applications.
A detailed FEM analysis should determine the real optimum as a smaller
gap influences the dynamic properties of the hybrid and reluctance actuator
in a clearly interrelated way.
The last difference is the moving mass. As Figure 5.40 has been scaled to
the same coil size, it might be concluded, that the Lorentz actuator is at
an advantage when the coil would be the moving part. This however is not
always the case. In a precision system the wires to transport the current
to the coil determine a relatively high stiffness, that is not preferred for
the reason of transmissibility of external vibrations. This can be avoided
by choosing the magnetic circuit as the moving part. Furthermore, due to
the lower force to power ratio, more current is needed for the same force,
which implies a higher power loss. In high power precision systems, the
resulting heat has to be taken away, sometimes even by water cooling, before
5.5. Intermezzo: electric transformers 291
it radiates to other sensitive parts. The necessary cables and hoses and
cooling plates considerably increase the stiffness of connection to the coil
section and necessitate to take the permanent magnet part as mover.
In fact the Lorentz actuator has the largest moving mass of all, when looking
only to the force related to the power. Furthermore, when considering that
the variable reluctance- and hybrid actuator both have the possibility to add
more coil windings, it is clear that these actuators are absolutely preferred
from a high force to power ratio point of view in applications where the
dynamic properties and linearity are less important.
Transformer core )w
Source Load
Ii Io
+ +
I V
_
+ +
Vi Vo
n1 n2
_
_ _
Figure 5.41: In an electric transformer two coils are wound around the same ferro-
magnetic yoke, called the transformer core. Because they share the
same flux, the voltage of both coils relate to the ratio between the
windings of each coil.
In order to transform the electric current and voltage to another value, the
number of windings per coil is different and a higher number of windings
corresponds to a higher voltage with a lower current level. The primary
windings are connected to the electrical source and the secondary windings
are connected to the load. In principle a transformer can transform electric
energy in two directions and as such the term primary and secondary
windings is not logical, but this distinction has been made for practical
reasons. It is for instance not advisable to reverse the connections of a
transformer that is designed to transform the dangerous mains voltage
into a safe lower voltage, as the result would be an even higher voltage
and eventually the transformer will burn out when the fuses are not blown
before.
The figure also illustrates the galvanic insulation between the primary and
secondary windings as they each consist of insulated wires.
To determine the primary and secondary currents, the flux in the core is
first calculated. From Equation (5.124) the change of flux in the core should
equal:
dΦw V̂i
= sin ω t (5.127)
dt n1
V̂i
Φ w (ω ) = − cos ω t (5.128)
ωn1
The primary current can be calculated from the relation between flux and
current, corresponding with Hopkinson’s law of magnetics:
n1 I i
Φw = (5.129)
ℜ
In the situation without an external load the secondary current is zero and
the unloaded primary current I i,0 equals
Φw (ω)ℜ V̂i ℜ
I i,0 (ω) = =− cos ω t (5.130)
n1 ω n21
A low reluctance of the core results in a low primary current in the unloaded
situation. In an ideal transformer with μr is infinite, I i,0 will be equal to
zero. As soon as a load is present at the secondary windings, the resulting
secondary current I o creates an magnetic flux ΦI,o in the core that adds to
Φw . A primary current will start to flow with a corresponding flux ΦI,i that
compensates the additional flux caused by the secondary current, because
294 Chapter 5. Electromechanic actuators
the total flux has to remain unchanged, corresponding with the primary
voltage. In simplified form without ω t this reads as follows for an ideal
transformer:
The directions of the voltages and currents as defined in Figure 5.41 follow
from Ampère’s and Faraday’s laws. The flux from the output current should
point into the other direction of the flux by the primary current. With the
defined directions the equation becomes:
n1 I i − n2 I 0
ΦI,i + ΦI,o = =0 =⇒ n1 I i = n2 I o (5.132)
ℜ
As a consequence the currents from the primary and secondary windings
relate as follows:
I o n1
= (5.133)
I i n2
The result can be verified by a sanity check, based on the fact that an ideal
transformer does not dissipate power. This means:
P = Pi = I i Vi = Po = I o Vo (5.134)
L1 Transformer L
Source 2
Load
+
Coupled
Vi Vo
windings
R1 R2
Because the flux density has to remain below the saturation level Bs , the
cross section of the core needs to be larger than:
V̂p
Ac ≥ (5.136)
Bs ω n 1
The conclusion that can be drawn from this equation is that a transformer
can not be used to transform DC voltages as that would require an infinitely
large core. For AC voltages of low frequencies the number of the primary
windings could be increased, but that will automatically lead to an increase
in the resistive losses. This means that the design of a transformer is a
compromise between power, efficiency and size.
Because of the relation with the frequency it is unavoidable that high power
transformers that have to operate at 50 Hz are very heavy. To solve that
problem, the switched-mode power supply has been created. In this principle
first the low-frequency mains voltage is converted electronically into a very
high frequency of several kHz, before it is transformed to a lower voltage by
means of a small high-frequency transformer. The operation principle is
comparable with the switched-mode power amplifiers, that will be presented
in Section 6.3.3 in Chapter 6.
296 Chapter 5. Electromechanic actuators
5.6.1 Piezoelectricity
The word “piezo” is derived from the Greek word “πιζ o”, which means
“to press”. The piezoelectric effect was first observed by the Pierre Curie
together with his brother Paul-Jacques Curie (1856 – 1941) who also was
a physicist. They discovered that under compression a quartz crystal is
generating an electric potential, which is called the direct piezoelectric effect.
It later has been discovered that this effect also is reversible, which is called
the converse piezoelectric effect or inverse piezoelectric effect: when an electric
field is applied to the piezoelectric material by applying a voltage to the
electrodes at the piezo-surface, the material expands or contracts depending
on the direction of the electric field.
Besides quartz, several other materials are found to show a piezoelectric
behaviour:
Pb (+)
+ O (-)
Ti,Zr (+)
5.6.1.1 Poling
"
!
@$
- - - - -
+ + + + +
- - - -
+
- + + + +
- - - -
+
+
-
+ + + +
- - - -
+ + + +
+
E
Figure 5.44: Poling process of a piezoelectric material. Without poling the Weiss
domains are not aligned. During poling a strong external electric field
is applied and also the temperature is elevated to the Curie tempera-
ture. After cooling down the material, the electric field is switched off
and the material is “frozen” into a state where the piezoelectric effect
is present on the macroscopic scale as the Weiss domains stay almost
perfectly aligned.
voltage at the electrodes of the piezo. The right side of the figure illustrates
the converse piezoelectric effect where an external voltage, generating an
electric field across the piezo-material, causes the piezo-material to expand
or contract, depending on the direction of the voltage.
D = εE, (5.137)
S = CT (5.138)
5.6. Piezoelectric actuators 299
A0 + 'A
A0
V
polarizaon
n ?
d0 + 'd
F d0
<'
<
Combining these for all directions of the material into the so-called cou-
pled equations, according to the IEEE standard on piezoelectricity, the
constitutional laws for a piezo4 -material in strain charge format are:
S = CE T + dT E (5.139)
D = dT + εT E, (5.140)
with the following variables, listed with the corresponding units and dimen-
sions:
2
CE in [m /N] 6 × 6 compliance matrix with constant electrical field
F = T·A (5.141)
V = E · 0
Δ = S · 0
q = D · A,
The material equations for the stiffness kpz in [N/m] and capacitance C in [F]
are as follows:
A
k pz = (5.142)
s E · 0
εT · A
C = .
0
Δ( t) = k−1
pz · F ( t) + d i j · V ( t) (5.143)
q ( t) = d i j · F ( t) + C · V ( t).
5.6.3.1 Creep
5.6.3.2 Hysteresis
8
6
-2
-4
-6
-8
-10
0 25 50 75 100 125 150
@!
Figure 5.46: Measured hysteresis loops of a piezoelectric stack actuator for various
amplitudes of the input signal over a 150 V range. The strain gauge
signal is proportional to the measured elongation of the actuator.
5.6.3.3 Aging
5.6.4.1 Piezo-stiffness
In the data sheet of each actuator some typical parameters are specified.
Next to the geometric properties and maximum actuation voltage, important
parameters for the mechanical design are the blocking force F max and the
displacement range for the unloaded and unrestrained actuator Δ0 . Often
also the actuator stiffness is given, denoting the small signal stiffness kpz ,
relating to the other parameters via
The blocking force Fmax denotes the force that a piezo-actuator generates
when it is confined on both sides by an infinite stiff support. This means
that the entire force generated by the converse piezoelectric effect presses
against the clamping support structure as no elongation of the piezo occurs.
When the piezo-actuator is not confined or restrained, the force generated
by the actuator is available for elongation of the piezo, which means that
the actuation force is in balance with the intrinsic spring stiffness of the
piezo-actuator, resulting in a displacement Δ0 as shown in Figure 5.45.
The given value of kpz is only valid for small signal conditions. For larger
signal amplitudes, another term is superimposed on kpz that is linked to
polarisation effects of the material, where the large signal stiffness can be
up to a factor of two smaller than the small signal stiffness. Furthermore
the output impedance of the power amplifier driving the piezo-actuator has
an influence on the actuator stiffness, as the charge that is built up in the
material due to external forces, if not drained out of the piezo-material,
generates a counter-force. This is the reason why a piezo-actuator with open
electrodes appears stiffer than a piezo-actuator with shortened electrodes .
with the tube geometry of length , wall thickness d , and inner diameter
D i , and actuation voltage v.
Actuation in z−direction is done by applying a voltage on all electrodes
on the entire circumference causing a uniform elongation with a vertical
306 Chapter 5. Electromechanic actuators
displacement Δ z of
V
Δ z = d 31 · · . (5.147)
d
be in the order of several kV, which is more difficult to achieve with standard
electronics and also poses a larger safety risk. As the poling and actuation
direction is in the stacking direction, the number of layers n chosen will
define the unloaded length 0 and the positioning range of the stack-actuator.
0 eventually is n times the unloaded and unactuated thickness of the indi-
vidual layer h plus the thickness of the electrodes, glue lines and potentially
a layer of unactuated ceramic material at both ends of the stack-actuator
for isolation. With the actuation voltage V the actuator displacement Δ
amounts to:
V
Δ = n · h · d 33 · . (5.148)
h
k tot = k pz + k f . (5.149)
While the loading mass has no influence on the (static) positioning range
of the piezo-actuator, the loading spring reduces the maximum actuation
range as the actuation force now has to work against the piezo-stiffness
as well as the loading spring. So a loading spring that is as stiff as the
308 Chapter 5. Electromechanic actuators
x x0 + x
A0 + 'A
b
a
piezo itself kf = kpz will reduce the actuation range in half. What may be
desirable in some cases is to have a constant pre-loading force that is not a
function of the position but ensures that a constant pre-loading force of the
actuator is present. This can be achieved by an elongated spring with a low
stiffness. Without pre-loading, piezo-actuators can de-laminate when being
contracted too quickly, where the high tensile forces can cause micro-cracks
or even total fracture of the piezo-material.
b
r = (5.150)
a
Δx = r · Δ (5.151)
(5.152)
Due to the mechanical amplification r the effective stiffness k tot on the load
side as well as the resonance frequency f res , which depends on the total
positioned mass and the effective system stiffness, are reduced according to
5.6. Piezoelectric actuators 309
the equations
k pz
k tot = (5.153)
r2
f0
f res = . (5.154)
r
Sliding surface z
x !
y x
y z
Mechanical ground
Mechanical ground
simpler, a better noise performance can be achieved, and the system shows
less drift. The design of the power amplifier, which in this case has a low
output impedance, and the better noise performance may give an advantage
involving the system costs. A disadvantage of a voltage controlled piezo-
actuator, however, is certainly the hysteresis which in this case may have to
be compensated by other means.
Another interesting design aspect in voltage controlled piezo-actuators is
that the output impedance of the power amplifier forms together with the
capacitance of the piezo-actuator a first-order low-pass filter. On the one
hand this means that a potential bandwidth limitation may have to be
considered in the system design, on the other hand this also gives a design
freedom as adding a resistor in series with the piezo-element not only limits
the maximum current, protecting the power amplifier and the actuator, but
it also allows to shape the frequency response and introduce a roll-off before
the resonance frequency of the actuator, which may be desirable particular
in some open-loop positioning applications.
Piezoelectric
transducer
+ +
Cpz
C1 Vpz
q di j F (t )
Vi -
+
+ - Vm Vs Vr +
Vr C2 Cs
Vs
- - -
Figure 5.50: Electrical model of a piezoelectric actuator and capacitive bridge cir-
cuit to measure the charge that is generated by external forces on the
actuator.
LMS
C(s)
Figure 5.51: Schematic of the feedback control for active damping of the piezo-
actuator via self-sensing actuation. The gain γ and the LMS-block
form an adaptive balancing circuit to cancel gain variations due to the
piezo-hysteresis, but have no direct effect on the active damping.
Magn
de (abs)
0
-1
10
-2
10
102 103 104
0
Phase [deg]
Figure 5.52: Bode plot of the measured and modelled frequency response of both
the undamped and the actively damped piezoelectric tube-actuator.
second-order model that has been fitted to the measured data. Based on
the fitted model a feedback controller C (s) has been designed to dampen the
resonance peak of the tube scanner, resulting in the simulated frequency
response of the controlled model as given by the black line in Figure 5.52.
The green line of the measured frequency response of the transfer G x (s) with
active damping via the self-sensing actuation circuit confirms the reduction
of the resonance peak by almost a factor ten, as predicted in the simulation.
Chapter 6
Analogue electronics in
mechatronic systems
315
316 Chapter 6. Analogue electronics in mechatronic systems
Informaon Energy
Electrical
Feedforward Heat loss
Power
Mechanical
Error
DÎA Power
Posion +
setpoint Controller Amplier Actuator Mechanism
-
Measured posion
Sensor
Control electronics + so$ware DÍA Hardware
Electronics
best realised directly by analogue electronics. With the focus on these mainly
analogue processes within a mechatronic system, analogue electronics de-
termine the proportional behaviour of a dynamic system by amplification
and filtering of signals.
Also another factor justifies the focus on analogue electronics in this book.
The overwhelming amount of electronic products on the market, from serious
systems to gadgets, work mostly with digital electronics. This abundance
has resulted in a conviction with many people, that electronics are “ready”
and that it is not necessary to investigate this technology anymore in order
to be able to use it. Furthermore, the availability of computer simulation
software has notably resulted in a reduction of the basic understanding
of analogue electronics, because of lack of critical attitude. This has often
resulted in mistakes when designing mechatronic systems. This observation
is underlined by the fact, that even with the mentioned ample availability of
electronics in our society, only very few people, outside the confined field of
electronic expert, are able to combine the knowledge of electronics with the
less abstract mechanical dynamics. It appears to be a real different domain
for many mechatronic designers.
This large chapter consists of three sections. Section 6.1 on passive electron-
ics concentrates on signal manipulation by filtering without amplification.
The section includes impedances, sources, complex elements and passive
filters. Also a parallel is drawn between the dynamics of an electronic filter
and the dynamics of a mechanical system to show the similarity, in order to
enlarge the mutual understanding of electronic and mechanical engineers.
Section 6.2 consists of the important non-linear and active building blocks
of an electronic circuit, followed by the design of low power amplifiers, both
for small signal amplification and active filtering. Section 6.3 presents both
6.1. Passive electronics 317
In Chapter 2 the voltage source was introduced. An ideal voltage source has
an internal electromotive force Fe that is equal to the external voltage, even
if a current is delivered to the load. As an example from real life, the mains
power supply socket at home in most countries in Europe behaves almost
like an ideal voltage source of approximately 230 V alternating current (AC)
with a frequency of 50 Hz. This voltage remains almost constant even when
more lighting or stronger appliances are switched on.
The word “almost” refers to the fact that “ideal” does not exist in the real
world. Just imagine a voltage source of any arbitrary voltage that could
deliver less than 1 mA or more than 1 GA without any effect on its voltage.
318 Chapter 6. Analogue electronics in mechatronic systems
_ _ _
+ + +
Figure 6.2: The different symbols of a voltage source, that are used in schematic
diagrams of electronic circuits, show the sign of the voltage by means
of a plus and minus. Sometimes the minus is left away. The most right
version of these symbols is used in this book, as it clearly emphasises
the zero source impedance of an ideal voltage source.
This would require the source to have the capability of delivering infinite
power and that is unfortunately impossible. Nevertheless the example of
the mains supply is a very close approximation of an ideal voltage source as
long as the currents are kept within reasonable limits.
In a real voltage source, the external voltage is not fully equal to the elec-
tromotive force, due to an internal series impedance, originating from its
physical properties. One can think of the internal wiring, the size of chemi-
cal electrodes and storage components and aspects related to the chemical
separation of charge. This series impedance shows an internal electric field,
opposite to the electromotive force, just like inside an external load. This
field depends on the current and results in a change of the voltage differ-
ence over the electrodes of the source. In case of a resistive impedance, the
voltage will be lower at an increased delivered current.
An ideal voltage source would have no internal series impedance and it can
be approximated as a conducting wire when the influence of another electric
source in the circuit has to be examined. This was shown in Chapter 5 about
the interaction between an actuator and an amplifier, where the current
from the two voltage sources could be derived separately by short-circuiting
the other voltage source.
For this reason the version of the symbol that is located at the right in
Figure 6.2 is used in this book when drawing a schematic diagram of an
electronic circuit.
Most electronic circuits can be modelled with a voltage source, but sometimes
it is more practical to work with the other extreme source of electricity, the
current source. An ideal current source delivers a positive or negative
current that is independent of the impedance of the conductive load and as a
consequence the current is independent of the voltage. This functionality is
less easy to understand from reasoning, when using the model of an internal
6.1. Passive electronics 319
Figure 6.3: The different symbols of a current source show the direction of the
current by means of an arrow. The most right version of these symbols
is used in this book because of the clear semantics of the “blocking” line,
representing the infinite internal impedance.
electrical field. Instead one might use the thinking model of a voltage source,
that adapts its electromotive force actively to the load in order to keep the
current at the same level. It will be shown that this indeed is a method to
create a current source by active feedback control. When a current source
is applied in a circuit, another external voltage source, like the movement
induced voltage of an actuator, will not influence the current anymore. This
implies that an ideal current source acts like an open wire for the current
that is induced by other electric sources. In other words, a current source
has an infinite output impedance. It is indeed true that the basic model
of a current source is a voltage source with an infinite electromotive force
and an infinite series resistance. This very artificial model is the reason for
the preferred symbol as shown at the right side of Figure 6.3, as a current
source can be replaced by an open connection in a circuit when the influence
of another electric source has to be examined.
It is also obvious, that an ideal current source can not exist either. An ideal
current source would imply for example that a continuous lightning flash
over an infinite distance could be created. Nevertheless an almost ideal
current source can be created that operates over a limited voltage range and
it is a very important element in mechatronic systems in two ways. First of
all, several sensors in measurement systems behave like a current source
when the voltage is kept within a certain range. But even more important
is the need for a current source when supplying an actuator with electrical
power. In Chapter 5 it was shown, that a low impedance of the source results
in damping and unwanted transmissibility of external vibrations through
the actuator. With the application of a current-source power amplifier as
will be presented in Section 6.3 that effect can be prevented.
A real electric source can be modelled in two ways. The first consists of
a voltage source in series with a finite impedance and is postulated by
320 Chapter 6. Analogue electronics in mechatronic systems
Zs Il Il
+
+
+
Zl
VTh Vo Zl IN Zs
Vo
_
_ _
Figure 6.4: Equivalent representation of real electric sources. The Thevenin model
at the left consists of a voltage source with a series impedance. The
Norton model at the right consists of a current source with a parallel
impedance with the same value Zs as with the Thevenin model. The
relation between VTh and I N follows from Ohm’s law: VTh = I N Zs .
I1 I3
node
Z1 _
+
_ V2
+
V1 V3 Z2 I2 Z3
_ loop
+
Figure 6.5: The laws of Kirchhoff. The currents at a node in a circuit add to zero
( I 1 + I 2 + I 3 = 0). Also the voltages over a closed-loop add to zero (V1 + V2 +
V3 = 0). As long as the voltage and current directions are well defined,
these laws help to calculate voltages and currents in complex electrical
networks.
The second of Kirchhoff’s laws states that the sum of voltages over any
closed-loop in an electronic circuit can only equal zero.
Vloop = 0 (6.3)
With the presented laws the currents and voltages in several electric circuits
can be calculated. One example is the situation where different impedances
are combined in series or parallel as shown in Figure 6.6. In the situation
with two impedances in series, the current is shared and the impedance of
the combination becomes:
Vt V1 + V2 I t Z1 + I t Z2
Zserial = = = = Z1 + Z2 (6.4)
It It It
With two parallel impedances, the voltage is shared and the impedance of
the combination becomes:
Vt Vt Vt 1
Zparallel = = = = (6.5)
It I1 + I2 Vt Vt 1 1
+ +
Z1 Z2 Z 1 Z 2
322 Chapter 6. Analogue electronics in mechatronic systems
Vt
V1 V2 Vt
Z1 Z2 Î Zs
It It
serial
Vt Vt
I1
Z1 Î Zp
It It
Z2
parallel I2
Figure 6.6: Impedances in series simply add to the total impedance, as they share
the same current, while the total voltage equals the sum of the voltage
over each impedance. Parallel impedances share the same voltage,
while the currents are divided over the impedances.
Vi Z2 Vi
I= Vo = I Z2 = (6.6)
Z1 + Z2 Z1 + Z2
The relation between the output voltage Vo and the input voltage Vi equals:
Vo Z2
= (6.7)
Vi Z1 + Z2
To find the Thevenin impedance a special insight is used, stating that this
impedance is equal to the impedance of all paths inside the electric circuit
6.1. Passive electronics 323
I
I
Z1 ZTh
+
+
R1
+
+
+
Vi I VTh Vo I
Z2 Vo Vi
_
+
_ _ Vo
R2
_ _ _
Figure 6.7: A voltage divider consists of two serial impedances and is based on
the principle of the shared current. The voltage Vi gives a current I
determined by the sum of both impedances, while the output voltage
Vo is determined by the current and Z2 only. A special version of the
voltage divider is the potentiometer, that can vary the ratio between
its resistive impedances R1 and R2 to control the output voltage by a
slider.
With these values also the Norton equivalent of the voltage divider can be
modelled according to Figure 6.4. A special execution of a voltage divider
is the potentiometer as shown at the right drawing of Figure 6.7. A poten-
tiometer consists of a resistor from the input to ground with a conductive
slider that contacts the resistor on a location that is determined by an exter-
nal rotating or translating element. The potentiometer creates a variable
voltage at the output as function of the position of the slider.
324 Chapter 6. Analogue electronics in mechatronic systems
Real
'
0.8
Normalised power '! load
! Ps
Ri
0.6 Output power Po
+
~Pi Io
+
0.4
Vi Io Ro Vo
0.2 _
0
0 2 4 6 8 10
Ro R
i
Figure 6.8: The maximum power from a real voltage source is 25 % of the intrinsic
power capability of the source and this maximum occurs when the
external resistive load is equal to the resistive value of the source
impedance. The efficiency is then only 50 %.
This expression goes to zero for Ro = 0, which corresponds with the fact
that then the output voltage is zero, and when Ro = ∞ because then the
output current is zero. The output shows a maximum when Ro = Ri which
can be proven by finding the value of Ro that equals to a zero value of the
6.1. Passive electronics 325
The above reasoning should not be misunderstood. For a given real source
the maximum power is obtained when the connected load impedance equals
the internal impedance of the source. The efficiency is then however only
50 % and for a higher efficiency the load impedance needs to be significantly
smaller than the source impedance. This means that from an efficiency point
of view it is better to not use a power supply at its theoretical maximum
power capability. With for example a battery it implies that the total energy
should be drained over an extended period of time, which is mostly the case.
When the battery is loaded at is maximum output power the output voltage
would be half and then half the energy would be wasted in heat inside the
battery.
An example where the maximum transfer of power is aimed for is in trans-
port of signals over a long cable that is terminated with its characteristic
impedance, a term that will be explained in more detail in Chapter 8. With
a cable of for instance a characteristic impedance of 50 Ω, the source before
the cable and the load after the cable should both have an impedance of the
same 50 Ω and in that case the maximum signal power is transferred that
the source could deliver. This maximum power level guarantees a maximum
signal to noise ratio.
326 Chapter 6. Analogue electronics in mechatronic systems
Resistor (R)
+
Capacitor (C)
Inductor (L)
6.1.2.1 Resistors
Pl = IV (6.12)
6.1. Passive electronics 327
To get an idea about the variety of resistors, Figure 6.10 shows an overview
of several types of resistors that are used in practical electronic circuits.
Resistors exist in an enormous range of resistance values from less than
1 Ω to more than 1 GΩ. This range is available in almost the same size,
which makes them suitable in almost any application. Resistors are realised
with different technologies, ranging from winding a metal wire with a high
resistivity (wire-wound resistor) to the application of a thin carbon- or
metal layer on a non-conductive ceramic round-or-rectangular body. In the
latter case, the correct resistance value is obtained by trimming part of the
resistive layer away with a laser. These carbon or metal film resistors are
for instance recognisable in the figure by the colour code rings that indicate
the value.
Over the years, the electronic industry has made their best effort to achieve
almost ideal properties of all electronic elements. Nevertheless always some
parasitic effect is present, that needs to be considered in critical applica-
tions. A resistor always has some capacitance and self-inductance, while
requirements on these properties differ per situation. Next to the parasitic
effects, the maximum dissipated power and the allowable temperature have
given a boost to the enormous diversity in resistors. High power resistors
are mainly wire wound, because of the insensitivity of the metal to high
temperatures. These high power resistors are recognisable by their size, the
often visible winding structure and the possible mounting to a heat sink, like
shown with the green, the white and gold resistors on the photograph. As a
328 Chapter 6. Analogue electronics in mechatronic systems
+
+
q
+
+
+
E
+
d _
V _
_
_ _ q
_ _ _
_
_
_ _A _ _ _
_ _ _ _
6.1.2.2 Capacitors
point parallel to the plates will cancel each other out and as a result the
remaining field lines all run orthogonal to the surface of the plates.
With a given electric field between the two electrodes, the potential difference
increases with the distance of the electrodes. This implies, that with a given
potential difference, the magnitude of the corresponding electric field is
inversely proportional to the distance of the electrodes. This also implies,
that the amount of charge difference between one electrode to the other is
inversely proportional to the distance. In a capacitor, the charge difference
is the result of the external voltage and it is not difficult to imagine that
with a larger surface ( A ) proportionally more charge needs to be transported
between the electrodes to retain the same electric field over the entire
surface.
As a last factor that determines the behaviour of a capacitor, the insulating
material between the electrodes has its influence, due to the polarisability
of the material. This polarisability is related to the property of certain
materials, to create an electric dipole. By directing itself along the electric
field, with the positive side directed towards the negative electrode, these
dipoles counteract the electric field and as a consequence they increase the
amount of charge displacement needed to sustain the electric field. This
is in some way comparable with magnetism, where the Weiss domains of
a material with a high relative permeability μr direct to the magnetising
field H, reducing that field relative to the magnetic field B = μ o μr H1 . The
electric property, that corresponds with μ is called the electric permittivity
(ε) with units As/Vm. In vacuum the value is called ε0 and it has a direct
relation with the magnetic permeability of vacuum (μ0 ), that was presented
in Chapter 5 and the speed of light c in vacuum:
1
ε0 μ0 = (6.13)
c2
From this relation, when using the known values of c and μ0 it follows that:
1
ε0 = ≈ 8.8541878176 · 10−12 [As/Vm] (6.14)
c 2 μ0
When the material between the plate is different from vacuum the total
permittivity equals ε0 εr with the relative permittivity or dielectric constant
εr ranging from ≈ 1 for air to < 100 for most solid materials and even higher
for some ceramics and certain special polymers.
1 Due to the different definitions this is somewhat confusing, as the Weiss domains help to
increase the magnetic field B relative to the magnetising field H. It becomes comparable when
the magnetic field is taken as the source of the magnetising field. As mentioned in Section 5.1.2,
the cause and effect between both fields is a philosophical discussion as they simply co-exist.
330 Chapter 6. Analogue electronics in mechatronic systems
1
Sine
Cosine
JQ8 Power
JQ6
JQ4
JQ2
>JQ2
>JQ4
>JQ6
>JQ8
>
0 1 2
Period
Figure 6.12: The power of two multiplied sinusoidal functions with 90◦ phase differ-
ence averages to zero over time. The momentary power is alternating
positive and negative. It represents the energy storage in the reactive
impedance.
t1
1
V ( t) = V0 + I ( t) d t (6.17)
C
0
6.1. Passive electronics 331
t1
t1
t1
V1
dV 1
EC = PC ( t)d t = V ( t) I ( t)d t = V ( t)C dt = C V ( t)dV = CV12 (6.18)
dt 2
t0 t0 t0 0
for the initially high capacitance and for that reason electrolytic capacitors
are mostly applied in power supplies to store energy.
Recent developments on a variation on electrolytic capacitors with an elec-
trochemical double layer, have increased the maximum capacitance to over
one Farad. Although their maximum voltage per cell is limited, the high en-
ergy density of these super-capacitors makes them applicable in high power
applications as soon as all technological issues, regarding the combination
of multiple capacitors for higher voltages, are solved.
Other smaller capacitors are made by winding thin foils of conductive ma-
terial with intermediate non-conductive foils of a synthetic material with
favourable dielectric properties, (the other round capacitors in the figure).
Their capacitance ranges from around 100 pF to 100 μF. The smallest ca-
pacitor values are realised with single- or multilayer ceramic bodies with a
metal layer as plate material. They can range between 1 pF to 1 μF.
Due to the winding and the thin foil, the electrolytic and foil capacitors both
suffer from a parasitic resistance and some self-inductance. The parasitic
resistance is represented by its equivalent series resistance (ESR), that
limits the minimum impedance at high frequencies and induces a loss of
power. For high-frequency applications, the ceramic capacitor is preferred as
it shows the least parasitic inductance and resistance and it is also available
in an SMD shape.
6.1. Passive electronics 333
6.1.2.3 Inductors
dΦw dΦw,t
V ( t) = n = (6.21)
dt dt
Where Φw is the flux per winding and Φw,t is the flux integrated over all
windings (nΦ). The self-inductance was defined as the total amount of flux
generated per Ampère:
Φw,t nΦw n2
L= = = ⇒ Φw,t = LI (6.22)
I I ℜ
From this equation, combined with Equation (6.19), the Voltage as function
of time becomes:
dΦw,t d I ( t)
V ( t) = =L (6.23)
dt dt
Also previously presented was the stored magnetic energy E L in an inductor.
Like with the capacitor, it can be calculated by integrating the power over
the time from t0 , where the current is zero until t1 , where the current equals
I 1 for a certain value of L:
t1
t1
I1
dI 1 2
EL = P L ( t) d t = I ( t) L dt = L I ( t) d I = LI (6.24)
dt 2 1
t0 t0 0
This means that just like with the capacitor, also the impedance of an induc-
tor is a complex number. In this case however, the phase of the voltage leads
90◦ to the current and the impedance increases with increasing frequency.
Like with the capacitor, the average power over an inductor is zero, when
supplied from an AC source, as was illustrated in Figure 6.12. In every
period, the energy stored in the magnetic field in the inductor alternately
334 Chapter 6. Analogue electronics in mechatronic systems
increases in the positive power cycle and decreases in the negative power
cycle.
In reality, inductors are all made by winding wires around a core as can be
seen in Figure 6.14. This core can consist of a material with a high relative
permeability to achieve a high self-inductance or a low permeability for
reason of linearity. For very high frequencies even a core can be omitted
and only a section with a spiral track on a printed circuit board can suit the
purpose. With some numbers in a real coil with n = 100 windings, a core
diameter ( A ) of 10*10 mm, a core length ( l ) of 100 mm and a μr of 100, the
following relatively low value of the self-inductance is obtained:
The most basic filter configuration is the RC-filter that is derived from the
voltage divider of Figure 6.7, where one of the impedances is a frequency
dependent component, like for instance a capacitor. The first filter to explain
this principle is a first-order low-pass filter, that can be used for instance to
reject noise above the cut-off frequency or corner-frequency ω0 of the filter.
This filter consists of a capacitor and a resistor as shown in Figure 6.15.
As the impedance of the capacitor decreases with increasing frequency it can
336 Chapter 6. Analogue electronics in mechatronic systems
J
Magnitude (abs)
Magnitude [dB]
>J
*
J >^J
>ZJ
*^
Z1=R J >J
+
>\J
J
+
Phase [deg]
Z2
Vi jZC Vo >\
_
>|J
_ J
J
J
J
^
J
Z
J
Frequency [Hz]
Figure 6.15: An RC low-pass filter network and the Bode plot of its transfer function
with R = 15 kΩ and C = 0.1 μF, giving a time constant τ = 1.5 ms. The
increasing impedance Z2 of the capacitor reduces the magnitude of the
output signal at frequencies above the corner-frequency f 0 = ω0/2π ≈
100 Hz, where the magnitude of Z2 becomes larger than Z1 .
be reasoned, that this filter reduces the amplitude of the higher frequencies
above the frequency, where the impedance of the capacitor becomes smaller
than the impedance of the resistor. To derive the transfer function F (ω) of
this filter in the frequency domain Equation (6.7) of the voltage divider is
used:
Vo Z2
F (ω ) = = (6.7)
V i Z1 + Z2
With Z1 = R , Z2 = 1/ j ωC , the transfer function becomes:
1
Vo j ωC 1
F (ω ) = = = (6.27)
Vi 1 j ωRC + 1
R+
j ωC
At this point, the time constant τ = RC also called the RC -time can be intro-
duced. Because the transfer function of Equation (6.27) is dimensionless,
RC is inversely proportional to the frequency ω, so it has the unit of time.
It is also known from control theory, that the time constant τ defines the
step response of a system starting at zero:
& t
'
V = Vst 1 − e− τ (6.28)
J
Magnitude (abs)
>J
Magnitude [dB]
* >^J
J
>ZJ
>J
*^
1
+
J
>\J
Z1 |J
jZC
+
Phase [deg]
Vi Z2=R Vo \
_
_ J
J
J
J
J
^
J
Z
J
Frequency [Hz]
Figure 6.16: An RC high-pass filter network and the Bode plot of its transfer func-
tion with R = 15 kΩ and C = 0.1 μF, giving a time constant τ = 1.5 ms.
The increasing impedance Z2 of the capacitor reduces the magni-
tude of the output signal at frequencies below the corner-frequency
f 0 = ω0/2π ≈ 100 Hz, where the magnitude of Z2 becomes larger than
Z1 .
The Bode plot of this filter is shown in Figure 6.15 for an example with
R = 15 kΩ and C = 0.1 μF. This gives a time constant of τ = 1.5 ms and a
corresponding corner-frequency of ω0 = 1/τ ≈ 660 rad/s and f 0 = ω0/2π ≈ 100 Hz.
The −1 slope of the magnitude with -20 dB per decade is as expected with a
first-order system.
It is important to note that the characteristics of a passive filter are influ-
enced by the load after the filter. When for example the filter of Figure 6.15
would be loaded with a resistive load, equal to Z1 , two effects would be ob-
served. Firstly the gain of this filter in the pass-band would have a value of
0.5, because the load and Z1 would act like a frequency independent voltage
divider. Secondly the cross-over frequency would be a factor 2 higher. Both
effects can be calculated by first determining the equivalent Thevenin source
voltage and impedance of Z1 , combined with the load and then combine that
system with the capacitor.
To prevent this effect, these filters are commonly combined with buffer am-
338 Chapter 6. Analogue electronics in mechatronic systems
plifiers with a very high input impedance and low output (source) impedance.
These will be presented later.
The second filter of this kind is the high-pass filter, as used for instance to
reject the DC value of a signal. Its basic design consists also on a capacitor
and a resistor, but their location is exchanged as shown in Figure 6.16. It
is clear, that at low frequencies the high impedance of the capacitor will
prevent low-frequency current to flow into the output resistor.
Also with the high-pass filter Equation (6.7) is used to derive the transfer
function F (ω):
Vo Z2
F (ω ) = = (6.31)
Vi Z1 + Z2
The difference with the low-pass filter is the j ω term in the numerator,
corresponding with the differentiating Laplace variable s, which gives +1
slope with 90◦ phase lead and a 0 dB corner-frequency at ω0 = 1/τ. This term,
combined with the transfer function of the low-pass filter, results in the
Bode plot of Figure 6.16 for the same values of the resistor and the capacitor.
Also with this filter a loading impedance over V0 will influence its character-
istics. Even though the high-frequency pass-band gain will not be changed,
the load impedance comes parallel to Z2 resulting in an increase of the
cross-over frequency with more attenuation at lower frequencies.
+
1 1 1
+
Z2 Z4 Z6
jZC1 jZC2 jZC3
Vi Vo
_
R3 10 R2 100 R1 C1 10 C2 100 C3
Figure 6.17: Creating a third order low-pass filter by cascading three first-order
filters is possible with minimal mutual load. It is however better
to introduce buffer amplifiers, as the output impedance of the filter
becomes very high.
the effective output impedance of the combined filter would become very
high. For that reason it is preferred to insert intermediate buffer amplifiers
with a gain of one, like shown in Figure 6.18. How these amplifiers are made
will be presented later in this chapter. For the explanation of the principle,
it is sufficient to accept that these amplifiers enable to combine the separate
sections without mutual influence, because they combine a very high input
impedance with a very low output impedance. In that case the combined
transfer function becomes a simple multiplication of the transfer functions
of the separate filters:
1
F ( s) = (6.33)
( j ωτ1 + 1)( j ωτ2 + 1)( j ωτ3 + 1)
+
+
1 1 1
Vi Z2 Z4 Z6
jZC jZC jZC Vo
_
1 J
Magnitude (abs)
Magnitude [dB]
-1
J >^J
*^
J >J
-3
J >`J
*
J >}J
J
Phase [deg]
>|J
>}J
>^{J
1 ^ 3
J J J
Frequency [Hz]
Figure 6.18: Cascading three first-order low-pass filters with intermediate buffer
amplifiers results in a very gradual transition between the pass band
below the corner-frequency and the attenuation band above f 0 . At the
corner-frequency the attenuation is already 9 dB. The Bode plot is
drawn with R = 15 kΩ and C = 0.1 μF, corresponding with f 0 ≈ 100 Hz .
without phase and magnitude effects in the pass band and it will be shown
that these electronic dynamics have a close relationship in behaviour with
mechanical dynamics.
When it is not possible to use amplifiers, higher order filters can also be
created by applying the other complex impedance, the inductor in an LCR-
filter. When the resistor in the previous filters is replaced by an inductor,
the filter effect will be enlarged, because the impedance of the inductor
increases as function of the frequency.
In Figure 6.19 this second-order filtering effect is shown for a low-pass filter
configuration, where the capacitor and inductor have a value of respectively
100 μF and 25 mH.
6.1. Passive electronics 341
The same approach as with the first-order filters is used to derive the transfer
function:
Vo Z2
F (ω) = = (6.34)
Vi Z1 + Z2
With the defined value of ω0 the transfer function in the frequency domain
is written as:
1
F (ω) = 2
(6.37)
ω
− +1
ω20
3
J `J
^
Magnitude (abs)
J
Magnitude [dB]
J
1
J ^J
1 J
-1
J >^J
*^
+
J >J
Z1 jZ L J
+
>\
Phase [deg]
1
Vi Z2 >|J
jZC Vo
_ >Z\
_ >}J
J
1
J
^
J
3
Frequency [Hz]
Figure 6.19: A second-order LC low-pass filter network and the Bode plot of its
transfer function with C = 100 μF and L = 25 mH, giving a corner-
frequency f 0 =100 Hz. Because of the lack of damping, the magnitude
at the corner-frequency becomes infinite.
Again the same approach can be taken as with the previous filters to derive
the transfer function, but in this case the impedance to ground consists of
two parallel impedances Z2 and Z3 . The parallel configuration is noted with
symbol (∥):
Vo Z2 ∥ Z3 1 1
F (ω ) = = = =
(6.38)
Vi Z1 + Z2 ∥ Z3 Z1 1 1
+ 1 Z1 + +1
Z2 ∥ Z3 Z2 Z3
1 C
Q= =R (6.40)
2ζ L
With these defined values the the transfer function in the frequency domain
becomes:
1 1
F (ω ) = 2
= 2
(6.41)
ω 2ζω ω ω
− +j +1 − +j +1
ω20 ω0 ω20 ω0 Q
6.1. Passive electronics 343
J
Magnitude (abs)
1
Magnitude [dB]
J
>J
*
J >^J
>ZJ
+
*^
Z1 jZ L J
0
>J
+
>\
Phase [deg]
1 Z3
Vi Z2
jZC R Vo >|J
_ >Z\
_ >}J
^ Z
J J J
Frequency [Hz]
Figure 6.20: A second-order damped LC low-pass filter network and the Bode plot
of its transfer function with 100 μF, L = 25 mH and R = 16 Ω. These
values result in a corner-frequency f 0 = 100 Hz and a damping with Q =
1, ζ = 0.5. A lower value of the resistor corresponds with an increased
level of damping.
Vo 1 −ω2 LC
F (ω) = =
= (6.43)
Vi 1 1 1 L
+ +1 −ω2 LC + j ω +1
j ωC j ω L R R
Also the same relation
# for the corner-frequency ω0 = 1/ LC and damping
1
ratio Q = /(2ζ) = R C/L is used and the transfer function in the frequency
344 Chapter 6. Analogue electronics in mechatronic systems
10
Magnitude (abs)
1 0
Magnitude (dB)
>J
-1
10 >^J
>ZJ
-2
10
+
>J
1 180
Z1
jZC
+
135
Phase (deg)
Z3
Vi R Vo 90
_ Z2 jZ L 45
0
_ 10
1
10
2
10
3
Frequency (Hz)
Figure 6.21: A second-order damped LC high-pass filter network and the Bode
plot of its transfer function with 100 μF, L = 25 mH and R = 16 Ω.
These values result in a corner-frequency f 0 = 100 Hz and a damping
with Q = 1, ζ = 0.5. A lower value of the resistor corresponds with an
increased level of damping.
domain becomes:
ω2 ω2
− −
ω20 ω20
F (ω ) = = (6.44)
ω2 2ζω ω2 ω
− +j +1 − +j +1
ω20 ω0 ω20 ω0 Q
Like with the first-order filters, the only difference with the low-pass version
is the numerator, where in this case the frequency terms are squared,
corresponding with s2 . This results in a 180◦ phase shift (the minus term).
The resulting Bode plot is in a dynamic sense completely comparable to the
low-pass version with the same component values.
Series-resonant LCR-filters
Z1=R
+
+
Vi Z2 jZ L
Vo,HP
_
_
+
1
Z3
jZC
Vo,LP
_
domain becomes:
1
Vo,LP Z3 j ωC 1
FLP (ω) = = = = (6.45)
Vi Z1 + Z2 + Z3 1 −ω2 LC + j ωRC + 1
R+ + j ωL
j ωC
Vo,HP Z2 j ωL −ω2 LC
FHP (ω) = = = = (6.46)
Vi Z1 + Z2 + Z3 1 −ω2 LC + j ωRC + 1
R+ + j ωL
j ωC
When comparing these equations with Equation (6.39) and Equation (6.43),
the difference in the damping term RC versus L/R in the previous filters is
clear. The quality factor for the series resonant filter can be determined by
replacing the term in the parallel filter by the following steps.
The parallel quality factor equals
C R
Q=R = CL (6.47)
L L
When exchanging L/R by RC the quality factor for the series resonant filter
becomes:
1 1 L 1 R C
Q= CL = with the damping ratio ζ = = (6.48)
RC R C 2Q 2 L
346 Chapter 6. Analogue electronics in mechatronic systems
When these terms are applied, together with the unchanged ω0 = 1/ LC
the same equations in the frequency domain are obtained as previously
determined in Equation (6.41) and Equation (6.44).
+
Z1 jZ L m
I Z3 Vo x
1
Z2
jZC c F k
_
x V
Zm Ze
F I
Damper Resistor
x 1 V
ZD R
FD c I
Figure 6.23: The analogy between a mechanical and electronic dynamic system,
where voltage corresponds with velocity and current with force.
equals:
Vo = I ( Z1 ∥ Z2 ∥ Z3 ) (6.49)
As a next step, the values for ω0 and the damping ratio ζ for both the electrical
parallel resonant filter and the mechanical configuration are determined by
mutually interchanging the mentioned terms:
1 k 1 1 L c
ω0 = = and ζ = = = (6.51)
LC m 2Q 2R C 2 km
With these terms, Equation (6.50) gives the following transfer functions for
both the electrical and mechanical configuration:
1
jω
Vo j ωL ẋ k
(ω) = and (ω) = (6.52)
I ω2 2 j ωζ F ω2 2 j ωζ
− 2+ +1 − 2+ +1
ω0 ω0 ω0 ω0
348 Chapter 6. Analogue electronics in mechatronic systems
^
10
= 0.005 Q = 100 R = 1/c = 60
= 0.05 Q = 10 R = 1/c = 6
= 0.5 Q=1 R = 1/c = 0.6
1
10
Magnitude Zm , Z e
0
10
>
10
>^
10
Phase [deg]
>|J
^ 3 4
10 10 10
Frequency [Hz]
Figure 6.24: Bode plot of the equivalent electrical impedance Ze = V/I and the
mechanical impedance Zm = ẋ/F for 1/k = L = 10−4 and m = C =
2.5 · 10−4 . Below the corner-frequency of 1 kHz the line follows the
spring/inductor line with a +1 slope and 90◦ phase lead. At the corner-
frequency the impedance is real, which means that the level is only de-
termined by the dissipative element, the resistor or the damper. Above
1 kHz the line follows a −1 slope, according to the mass/capacitance.
Mechanical Electrical
Diode
Bipolar transistor
Figure 6.26: Two different symbols of a diode and a bipolar NPN (left) and PNP
transistor. The arrows point towards the current conducting direction.
one single piece of Silicon, called a chip. These chips are packaged in mostly
plastic housings with terminals, like shown in Figure 6.25. In the last few
decades many versions of semiconductor-based discrete components have
been created, ranging from the semiconductor diode and the bipolar tran-
sistor to special functional parts, like avalanche and zener diodes, the diac
and triac, the field effect transistor (FET), the Metal Oxide Semiconductor
FET (MOSFET) and manymore.
Integrated circuits are also diversified over many different applications,
with a clear distinction between analogue and digital ICs. The best example
of an analogue IC is the operational amplifier because of its applicability in
a wide variation of electronic functions. Digital ICs are used for logical oper-
ations, digital processing and computer memory. In the past decades, their
complexity has grown tremendously and presently they sometimes contain
hundred million transistors or more, all connected by a huge network of
integrated wiring. It would be impossible to give a complete overview of all
elements and diversity of electronics in the context of this book. For that
reason only the three most common examples of the discrete components
are explained more in depth, the Silicon diode, which is a basic non-linear
element, the Silicon bipolar transistor, that is used to create active amplify-
ing electronics and the MOSFET that is used in digital electronics and in
switched-mode power amplifiers.
The electronic circuit symbols for a diode and a bipolar transistor are shown
in Figure 6.26. A diode allows an electric current to flow in one direction,
but blocks it in the opposite direction. It can be seen as an electronic version
of a check valve. Like a check valve that needs a pressure to open and shows
a potential leakage in the blocking direction, also electric diodes do not
352 Chapter 6. Analogue electronics in mechatronic systems
Figure 6.27: N- and P-material, created in a Silicon crystal structure, where one
of the atoms is exchanged by Phosphor, to create N-material with
one free electron or by Boron, to create P-material with one missing
electron, a hole.
Semiconductor physics
• Although the P-material has free holes and the N-material free elec-
trons, those materials are not charged as they originate from a not
charged doping atom added to the Silicon crystal. This is fully identical
to an uncharged metal that contains many free electrons.
Figure 6.28: Combining N- and P-material creates a depletion layer, because elec-
trons of the N-material recombine with the holes of the P-material.
As a result, the N-material becomes positively charged and the P-
material becomes negatively charged, according to the large plus and
minus sign. This results in a measurable potential difference between
both materials. The shown small plus and minus signs represent the
remaining free moving charges.
When N- and P-material are brought into electrical contact, the electrons of
the N-material and the holes of the P-material will diffuse into each other
and the electrons will fill the holes at the interface area. As a result an
intermediate layer is created without free electrons and holes. This process
is called the recombination of the electrons and the holes and this layer is
called the depletion layer, as shown in Figure 6.28. This layer behaves like
an insulator due to the lack of free movable charge carriers.
Because the P-material loses positively charged holes and the N-material
loses negatively charged electrons, the P-material becomes negatively charged
and the N-material becomes positively charged, creating an electromotive
force, that can be detected by its external potential difference.
In Figure 6.29 it is shown what happens, if an external voltage source is
applied between the combined N- and P-material. At the left the external
voltage is in the same direction as the electromotive force, caused by the
recombination of the charge carriers. The external source adds electrons
to the P-material and takes electrons from the N-material. This results
in even less free charge carriers, and because of this enlarged depletion
layer no current can continue to flow. The thickness of the depletion layer
becomes proportional to the external voltage and the internal electromotive
force that corresponds to this large depletion layer is in equilibrium with
the externally applied voltage.
At the situation of the right of Figure 6.29, the external voltage is applied in
a direction opposite to the electromotive force of the recombined depletion
6.2. Active electronics 355
Figure 6.29: A semiconductor diode conducts current in one direction, when the
external voltage is directed opposite to the internal electromotive force
and blocks it in the other direction. In the conductive mode both the
electrons and the holes contribute to the current.
layer. In that situation electrons are added to the N-material and taken
from the P-material, compensating the effect of the recombination. As a
result the depletion layer disappears and a conductive path is created where
both the holes and the free electrons contribute to the current.
It is obvious that these effects don’t happen instantaneous without any
transition phenomena between conducting and blocking. This is illustrated
in Figure 6.30, that shows the real characteristics of a typical Silicon semi-
conductor diode. In the conductive direction it is to be expected, that the
internal electromotive force of the depletion layer determines a certain
threshold level, that the external voltage has to surpass in order to make
the diode conducting. Furthermore, due to non-ideal material properties,
always some leakage current is present in the blocking direction. This
leakage current also reduces the external voltage that can be measured
at the terminals of a not connected diode below the electromotive force
caused by the depletion layer. For a silicon based diode the threshold voltage
Vth is around 0.6 – 0.7 V in practical situations. This value is strongly
temperature dependent, because it is determined by the mobility of the
charge carriers, that cause the depletion layer. This mobility is increased at
elevated temperatures.
Another side effect is caused by the fact, that the depletion layer is a non-
charged (insulating) layer between two parallel plates, which acts like a
parasitic capacitor. The need to charge this capacitor determines the current
needed to change the diode from the conductive state into the blocking
356 Chapter 6. Analogue electronics in mechatronic systems
8 103 2 109
4 103 1 109
Id [A]
Id [A]
2 103 5 1010
0 0 Ideal
Real
2 103 5 1010
-0.2 0 0.2 0.4 0.6 0.8 -0.2 -0.1 0 0.1 0.2 0.3
Vd [V] Vd [V]
Figure 6.30: Real versus ideal characteristics of a silicon diode. A threshold voltage
Vth of approximately 0.6 V is present in the conductive direction at
moderate current levels above the mA range, as shown at the left.
At extremely low current levels in the nA range also some leakage
current occurs with a voltage in the reverse direction.
state and vice versa, when the current is reversed. This is an important
characteristic, because diodes are frequently used to rectify alternating
currents into direct currents in power supplies and high-frequency signal
demodulators.
Figure 6.31 shows an example of the most frequently used rectifier circuit,
the bridge-rectifier, that consists of 4 diodes and a storage capacitor. As can
be seen in the drawing both the positive and the negative voltage from the
input results in a unidirectional positive voltage at the output. The large
storage capacitor will flatten the pulsating voltage into a DC voltage with a
ripple that depends on the current in the load and the capacitance value of
the storage capacitor. This can be filtered further with a passive low-pass
filter, as was presented in the previous part.
In the previous chapter the electric transformer was presented. One of the
conclusions was that a high-frequency AC voltage is necessary to be able to
reduce the size. The rectification of a high-frequency AC voltage requires
very fast switching diodes in order to reduce the power dissipation, related to
the continuous charging and discharging of the intrinsic parasitic capacitor
from the depletion layer. This dissipation is caused by the unavoidable
parasitic source resistance in the circuit, that causes the power loss. Another
application of fast diodes is in the switched-mode and resonant amplifiers
that will be presented in the section on power amplifiers.
Apparently even this single component can be optimised for many different
applications.
6.2. Active electronics 357
+
+ R Vo _ R Vo
_ _
Vi Vi
_
+
posive cycle negave cycle
with capacitor
+
+ without capacitor
R Vo
+
C
_
Vi
_
Figure 6.31: Diode bridge rectifier for AC voltage. Both the flow of current at
the positive and the negative cycle of the input voltage result in a
unidirectional current in the load, because of the diode configuration.
By adding a storage capacitor a DC voltage is obtained with only a
limited AC ripple. The capacitor is charged in the positive slope of the
rectified input signal and discharged by the load in the negative slope.
Figure 6.32: Combining N-, P- and N-material gives an NPN transistor. Without
base current I b the transistor is not conducting. A base current in
the conductive direction of the base-emitter junction causes a large
number of electrons to pass the collector-base barrier, resulting in a
collector current that is larger than the base current.
Ic
β = h fe = (6.53)
Ib
The example was given for an NPN transistor, where the electrons play the
main role. For that reason in that configuration the electrons are called the
majority carriers and the holes are the minority carriers as they only play a
role in the base current.
With a PNP transistor it is just the other way around. The holes are the
majority carriers and all currents run in the opposite direction of the NPN
6.2. Active electronics 359
Ic Ic
N: Collector P: Collector
P: Base N: Base
Ib Vc-e Ib Vc-e
N: Emier P: Emier
6.2.1.3 MOSFET
The name MOSFET means Metal Oxide Semiconductor Field Effect Tran-
sistor. It is a transistor that operates because of a externally controllable
electric field, while it uses a metal oxide to insulate the input where the
controlling signal is applied. The electrodes of a MOSFET have different
names than with a bipolar transistor. The input electrode is called the Gate
and is equivalent in function to the base in a bipolar transistor. The Drain
is equivalent in function to the collector as it drains the electrons in an
N-channel MOSFET, which is the most common version and the Source is
equivalent in function to the emitter as it is the source of the electrons in
an N-channel MOSFET. in order to distinguish it from a voltage or current
source, the Source, Gate and Drain of a MOSFET will be written as a name
starting with a capital.
MOSFETs are the most common semiconductors in integrated logic circuits,
like microprocessors and memory, because of their speed, size and possibility
to work with very low power supply voltage levels. The speed is high because
the current flow takes place by majority carriers only, which are electrons
in N-channel MOSFETs and as mentioned, charge transport by electrons is
360 Chapter 6. Analogue electronics in mechatronic systems
p-Substrate 0 S
0 VT VGS
Gate + + + + +
Drain N + Channel + N Source N N
++ + + + + P
P + + + P P
+ + + +
+ + + +
+ + + +
O< On
significantly faster than the indirect charge transport by holes. Next to the
small signal type MOSFETs, also high power versions have been developed,
that consist of millions of separate small MOSFETs connected in parallel.
They are applied in high power switching applications and will be further
presented in Section 6.3.
The structure of one type of N-channel MOSFET, the horizontal enhancement-
mode type is shown in Figure 6.34. Many different types of MOSFETs exist,
the depletion-mode MOSFETs are normally conducting, but the normally
non-conducting enhancement-mode version is most frequently used. The
horizontal type is the most easy to use for explaining the working principle,
while the vertical type is used to create power MOSFETs.
The substrate of a horizontal N-channel enhancement-mode MOSFET con-
sists of P-doped Silicon. The Drain and the Source are made of N-doped
Silicon, separated by the substrate. The area in the substrate between
the Drain and the Source is called the channel. Unlike with the bipolar
6.2. Active electronics 361
transistor, the depletion layer between the P-and N-material plays no other
role than to insulate the Drain from the substrate when the voltage on the
Drain is positive relative to the substrate. Normally the junction between
the Source and the substrate is in the conductive direction during operation
and for that reason they are mostly internally connected.
The real functionality takes place in the channel area around the Gate. The
Gate is made of metal and is electrically insulated from the substrate, the
channel and the Source by a metal-oxide layer. This gave the name “MOS”
to the device, although often the metal of the Gate is replaced by high doped
poly-crystalline Silicon and the oxide by Silicon oxide, because of the better
compatibility with the Silicon manufacturing process. This makes however
no difference for the working principle.
The oxide layer between the Gate and the channel is very thin and when a
positive voltage is applied to the Gate, relative to the Source, the holes in the
channel are pushed away due to the electric field by the Gate voltage, while
the electrons are attracted. As a result, a large number of free electrons
become available in the channel. Effectively the original P-doped substrate
becomes a virtual N-material, with equal properties as the material of the
Drain and the Source. This way, the channel becomes a conductive path
between the Drain and the Source, with a width that is determined by the
voltage level on the Gate. The resistance of the channel depends on the
width and with this principle a MOSFET behaves like a controllable resistor.
The depletion mode MOSFET works in the opposite way. In that case the
channel is in principle conducting by a different doping, and with N-material
of the channel a negative voltage on the Gate will repel the electrons, creat-
ing a depleted area with an increased resistance. Because this type is not
used in amplifiers for mechatronic systems it will not further be treated.
In spite of the quite simple functionality of MOSFETs, bipolar transistors
are still more often used in analogue amplifiers and for that reason first
amplifiers with bipolar transistors will be presented. The power MOSFETs
will return when switched-mode amplifiers are presented because of their
excellent switching performance.
principle of these integrated amplifiers, first the basic building blocks are
explained in this section.
The first circuit is the emitter follower from Figure 6.35. It is also called
a common collector configuration as the collector is directly connected to
the common power supply. The principle of the emitter follower is based on
the current-amplification of the applied NPN transistor. It can for instance
be used as the buffer amplifier in the cascaded passive first-order filter of
Figure 6.18 to match a high impedance input with a low impedance output.
In principle its operation is quite straightforward. The collector is connected
to a positive voltage with in this example a value of +15 V. The emitter is
connected to a more negative voltage via an emitter resistor. For the negative
voltage a value of −15 V is chosen to be able to amplify alternating signals.
The input voltage is applied on the base. When this input voltage has a
value in between the positive and negative supply voltage, the base emitter
junction is in the conductive mode. Like with a Silicon diode, the conducting
base-emitter junction has a threshold voltage Vb−e,th of ≈ 0.6 V. For that
reason, the emitter voltage will be approximately 0.6 V more negative than
the input voltage.
The difference between the emitter voltage and the negative supply voltage
determines the emitter current I e through the emitter resistor.
Ve − Vp− Vi − 0.6 + 15
Ie = = (6.54)
Re Re
This current is equal to the sum of the collector current and the base current:
1
Ie = Ib + Ic = Ic +1 (6.55)
β
For high values of β, the collector current becomes almost equal to the emit-
ter current and for this section the base current will further be neglected.
The current in the emitter resistor is necessary to be able to deliver both
positive and negative currents to a load, as shown in Figure 6.35.b, and is
called the idle current. When the input voltage at the base is an alternating
sinusoidal signal, like shown in Figure 6.35.c, then the emitter voltage
follows the input voltage with a constant difference of ≈ 0.6 V. The external
load will pull either a positive current or a negative current from the node
between the emitter and the emitter resistor. The positive current is deliv-
ered by the transistor, on top of the current flowing into the emitter resistor.
6.2. Active electronics 363
Ib
5 V Vth | 0.6 V
Vi
Ie | Ic Io ! 0
+
Vb-e,th
0.6 V Vo
+
+
_
Vi Vi 0V
Ie | Ic
+
+
_ Ve _
Vi 0.6_
Rload
Io 0 Vo
Re
Re
Vclip
_ 5 V
Figure 6.35: The NPN transistor emitter follower is a current amplifier used to
achieve a low output impedance. The emitter voltage follows the
input voltage that is applied to the base with a difference of ≈ 0.6 V.
When a load is connected, the positive output current is delivered by
increasing the emitter current of the transistor. The negative output
current comes from the emitter resistor Re by a reduction of the emitter
current of the transistor. Ultimately the emitter current is zero and
the maximum negative voltage is reached, showing a phenomenon
called “clipping” at a level that is determined by the load in relation
to the emitter resistor and the negative power supply voltage.
R load
Vclip = Vp− (6.56)
R e + R load
Vp 15 V
5 V Vi Vo with Vbias
Io ! 0
Vi
+
+
0V
Vbias
+
_ _
| 1.2 V Vo without Vbias
Rload
Io 0 Vo
_ 5 V
Vp 15 V
Figure 6.36: A symmetric push-pull emitter follower. The positive current is de-
livered by the NPN transistor and the negative current by the PNP
transistor. A bias voltage is added to avoid non-linearity at the transi-
tion between a positive and a negative current, that would otherwise
be caused by to the threshold voltage of the base-emitter junction of
the transistors.
Based on the findings with the emitter follower, two other configurations
are shown in Figure 6.37, the variable current source and the voltage ampli-
6.2. Active electronics 365
Rload
Rload
Rc
+ Io Io
_
_
Vp
+
Ic
+
Vb-e,th Ie
0.6 V Vo
+
+
_
Io
Vi Vi
+
+
_ _
Re
Re
Vi 0.6 Vi 0.6
_ _ _
Figure 6.37: Two configurations where the load is applied to the collector.
a: With the variable current source, the collector current is only
determined by the input voltage and the emitter resistor. It has a very
high output impedance.
b: A voltage amplifier is created from a variable current source, by
inserting a resistor between the positive power supply and the collector.
The voltage over the resistor is proportional to the current and the
resistor value. With a high value for Rc , voltage amplification is
obtained at the collector output.
fier. Both circuits belong to the same type and are called collector follower
or common emitter configurations, because the output is taken from the
collector and the emitter is either directly or almost directly connected to
the common ground.
Like with the example of the emitter follower, the NPN transistor is used as
the representative example. For the PNP transistor only the signs of the
voltages and currents need to be reversed to get the same results.
The circuit of Figure 6.37.a. acts as a variable current source with a negative
current and works as follows: As long as the voltage of the collector is kept at
any value more positive than the voltage at the base, the emitter current is
only determined by the input voltage and the emitter resistance like with the
emitter follower. With a high value of the current-amplification ratio β, the
base current can be neglected and the collector current becomes equal to the
emitter current. As a consequence of this reasoning, the collector current is
not dependent on the collector voltage, which is the defined behaviour of a
current source.
The current source will prove to be very useful in the design of a differential
366 Chapter 6. Analogue electronics in mechatronic systems
(Vi − Vth )R c
Vo = Vp − I c R c = Vp − I e R c = Vp − (6.57)
Re
For the AC part of the input signal the constant values of Vth and Vp don’t
matter, which means that this configuration gives an AC voltage amplifica-
tion of:
Vo Rc
Ga = =− (6.58)
Vi Re
With an ideal transistor this gain is independent of the values of the resistors.
The minus term means that the sign of the signal is inverted. Although this
looks equal to a 180◦ phase shift this effect is not related to a phase delay
but only with a sign reversal.
Like all components that were presented, also the transistor is not ideal.
Its current-amplification ratio is not infinite nor constant and the character-
istics are non-linear because of the base-emitter junction, as was shown for
a diode in Figure 6.30. Furthermore, like the diode, also the transistor has
its dynamic limitations, that will be dealt with after the section about the
operational amplifier.
Finally, the emitter voltage does not follow the base voltage perfectly, but it
shows a certain output resistance Rs,e with a value of approximately:
0.025
R s,e = [Ω] (6.59)
IC
the expense of a reduced gain. In the past a lot of attention of electronic cir-
cuit designers was spent in optimising the settings of these single transistor
amplifiers in order to avoid the use of multiple costly transistors.
The common part of the signal is called the common-mode signal and this
part results in an equal emitter current value I e,1 = I e,2 for both transistors at
a value of half of the total current I t . This total current is determined by total
resistance to the negative supply voltage Re,t = Re,c + Re,1 ∥ Re,2 = Re,c + 0.5Re ,
because both emitter voltages are equal.
The part of the current in both transistors that is caused by the common-
mode input signal can than be calculated as:
Vi,c − 0.6
I e,1 = I e,2 = 0.5 I e,t = 0.5 (6.61)
R e,c + 0.5R e
In this equation the threshold voltage Vth ≈ 0.6 V has been taken into account.
The output voltage that is caused by the common-mode part of the input
voltage is equal for both collectors, because of the equal collector resistances:
Vi,c − 0.6
Vo,1 = Vo,2 = 0.5R c (6.62)
R e,c + 0.5R e
368 Chapter 6. Analogue electronics in mechatronic systems
Vp Vp
Rc,2
Rc,2
Rc,1
Rc,1
Vo,1 Vo,2 Vo,1 Vo,2
T1 T2 T1 T2
+ Vi,1 Vi,2 Vi,1 Vi,2
+
+
+
_ _ _ _
Re,1
Re,2
T3
+
_
Vc,set
Re,c
Rcs
Vp Vp
di<erenal pair long tailed pair
Figure 6.38: The differential amplifier consists of a pair of equal transistors with
the emitters connected either directly or via a pair of emitter resistors.
a: The differential amplification is determined by the collector re-
sistances and the total resistance between the emitters while the
common-mode amplification is determined by the collector resistors
and the total resistance from the emitters to the negative supply volt-
age.
b: With the high impedance current source T3 , the common-mode
amplification is minimised.
~?
OPA
627
<
1 8 NC
–In 2 – 7 +VS
OPA
627
+In 3 + 6 !!
–VS 4 5 <
~
drawn in the figure but all mentioned values are noted in respect to the reference which in
general is ground or 0V from the power supply. This is quite common in electronic circuits.
6.2. Active electronics 371
Vp+
T4 T5
T7
T1 T2 Vo
V i+
T8
T3
V i- T6
Vp-
Di<erenal amplicaon Voltage gain Current gain
Figure 6.40: Basic Operational amplifier with bipolar transistors, diodes and re-
sistors. Three functional stages can be distinguished, a first stage
that amplifies only the differential input voltage, a second stage with
a maximum voltage gain and a third stage with a maximum current
gain.
design any component in a control-loop as linear as possible in order to avoid residual errors.
372 Chapter 6. Analogue electronics in mechatronic systems
This means that the amplification is only depending on the ratio of the
resistors R1 and R2 . It also means that the difference in input voltages
becomes zero at infinite G o . In other words: “The output will do anything to
make the voltage at the minus terminal, the feedback point, equal to the
voltage on the plus terminal, the reference point”.
This understanding is used in most simplified designs of electronic circuits,
like presented in the following section, where different amplifier and filter
combinations are shown based on operational amplifiers.
5 The power supply is omitted for simplicity in all further circuit diagrams with operational
amplifiers. Unless otherwise mentioned a symmetrical power supply with an equal positive
and negative supply voltage of sufficient magnitude is assumed to be present.
6.2. Active electronics 373
Input + Gf
+
Output
+ Go
+
_ _
R2
Vi Vo
R1
_ _
Even though in reality all these statements are not true, in many less
critical applications general operational amplifiers behave sufficiently ideal
to justify the use of the configurations in this section.
The following rules for designing linear circuits with operational amplifiers
directly follow from the above mentioned understanding of the working
principle of a feedback controlled operational amplifier and is used through-
out all electronic circuits in this book, even when non-ideal amplifiers are
considered:
374 Chapter 6. Analogue electronics in mechatronic systems
These rules are applied in the following steps to determine the function of
most operational amplifier circuits:
2. Calculate the voltage on the positive input as this is in most cases only
determined by the input voltage and not by the feedback voltage.
+
+
G ()
+
_
I2 R2
_
Vi Vo
G ()
+
+
+
I1 R1
Vi Vo
_ _ _ _
Figure 6.42: The non-inverting amplifier has a gain larger than one, determined by
the voltage divider in the feedback path. When the output is directly
connected to the minus input, giving a feedback gain of one, the unity-
gain buffer amplifier is obtained.
This equation is equal to the previously obtained Equation (6.67) and the
rules are verified.
When the resistors in the feedback path are chosen such, that R1 = ∞ and
R 2 = 0, the gain G a becomes equal to one. As a result, this configuration acts
as a unity-gain buffer amplifier. A very important property of a non-inverting
amplifier is the high input resistance of the circuit, which is only determined
by the common-mode input resistance of the operational amplifier at the +
input multiplied by the open-loop gain of the amplifier as the negative input
is made to follow the positive input. This open-loop value of this resistance
is already high and with feedback it is almost infinite even with a real
operational amplifier. The differential input impedance does not play a role
as the voltage on the − input is equal to the voltage on the + input. This
configuration is especially useful when very weak sensor signals with a low
voltage and high source impedance need to be amplified.
On the other hand, the output resistance of this amplifier is very low, because
of the ideal operational amplifier characteristics. Even if there was a small
output resistance, the feedback will reduce the internal output resistance
by the total loop gain, according to the sensitivity function of the feedback
system.
376 Chapter 6. Analogue electronics in mechatronic systems
R2
_
I2
R1
+
I1 G ()
+
+
Vi Vo
_ _
Figure 6.43: In an inverting amplifier the output has a different sign as the input.
Vi − V − Vi
I1 = = (6.70)
R1 R1
I 2 relates to the output voltage as follows:
V − − Vo Vo
I2 = =− (6.71)
R2 R2
The current I 2 through R2 is equal to I 1 , because of rule two, resulting in
the following closed-loop gain of the circuit:
Vi Vo Vo R2
I1 = I2 = =− =⇒ Ga = =− (6.72)
R1 R2 Vi R1
The minus sign indicates, that the output signal is both amplified and
inverted, as it has a different sign than the input signal. Contrary to the
non-inverting amplifier, the closed-loop feedback gain can also be lower than
one by choosing R2 < R1 .
The main drawback of the inverting amplifier is its relatively low input
resistance, that equals R1 . For that reason this configuration is applied in
those situations, where the low input resistance gives no problems and the
inverted functionality is needed. It is also useful, when signals have to be
added or subtracted.
6.2. Active electronics 377
R4
I1 I4
R1 _
+
I2 G ()
+
+
R2
Vi,1 + Vi,2 I3 Vo
R3
+
_ _ _ Vi,3 _
Figure 6.44: Adding signals is achieved by adding the currents of different inputs
at the negative input, that acts like a virtual ground.
When R1 = R2 = R3 = R the output signal is equal to the inverted sum
of all input signals, amplified with a factor R4/R .
The calculation of the amplification is as follows. Again the rules are applied.
There is negative feedback and the positive input voltage is zero and equal
to the negative input voltage, V + = V − = 0.
Because of rule two, no current can flow into the amplifier inputs and the
currents through the resistors relate according to Kirchhoff’s first law:
I1 + I2 + I3 − I4 = 0 =⇒ I1 + I2 + I3 = I4 (6.73)
With Ohm’s law these currents are calculated from the different voltage and
resistor values:
R4 R4 R4
Vo = G a,1 Vi,1 + G a,2 Vi,2 + G a,3 Vi,3 = − Vi,1 − Vi,2 − Vi,3 (6.75)
R1 R2 R3
In the inverting and the adding configuration, the negative input acts like a
virtual ground, because the output keeps the negative input at the same
378 Chapter 6. Analogue electronics in mechatronic systems
I2
R2
I1
Inverng R1 _
+
G ()
+
Non +
R1
inverng
+
Vi,1 Vi,2 R2 Vo
_ _ _
level as the grounded positive input. This is the reason, that the signal at
one input terminal is not influenced by the signal at another input terminal.
This is especially useful, when signals are combined from sources that are
so sensitive that other voltages on its output can cause interference by non-
linearity.
This is different from the non-inverting amplifier.
When signals need to be subtracted, the circuit as shown in Figure 6.45 can
be used. It is a combination of a non-inverting and an inverting amplifier
configuration within one operational amplifier. The standard rules are
applied also for this calculation of the amplification.
As the first step it is checked that negative feedback is applied via R2 .
The next step is to calculate the voltage at the positive input and directly
apply rule one:
R2
V − = V + = Vi,2 (6.76)
R1 + R2
Using rule two, the relation between the output voltage Vo , the first input
voltage Vi,1 and V − can be determined by using Ohm’s law and Kirchhoff’s
first law:
Vi,1 − V − V − − Vo
I1 − I2 = − =0 (6.77)
R1 R2
Note the minus sign in I 2 that is caused by the defined direction of the arrow
for I 2 in Figure 6.45.
6.2. Active electronics 379
With a little bit of algebra, this equation gives the relation for V − :
R1 R2
V − = Vo + Vi,1 (6.78)
R1 + R2 R1 + R2
Resulting in:
R2 ( )
Vo = Vi,2 − Vi,1 (6.80)
R1
_
I2
+
G ()
+
I1 R I1 +
Vo Vo
_ _
Figure 6.46: A current to voltage converter creates an output voltage from the input
current. A load resistor to ground acts like a passive I to V converter.
With an operational amplifier one can create a converter that both
adds energy to the signal and avoids a voltage over the current source.
the sign is inverted and a positive current will result in a negative output
voltage Vo = − I 1 R .
This active configuration acts like an impedance that the current has to
pass through (trans) from the input to the output, for which reason this
amplifier is called a transimpedance amplifier.
+
G ()
R1 _
R3
Io
Load
Vi
R2
Ra
+ Load Io
G () _ Vcs I o R3
R1 _
+
Ra
Io R
R2 + R
Vi
G ()
Vcs I o R3 _ R
R3
Io
_
R
!
Figure 6.47: A voltage to current converter creates an output current that is deter-
mined by the input voltage. By measuring the output current with
a small resistor and using that signal in the feedback loop a current
source characteristic is realised. With a floating load this is simply
achieved as shown in the left drawing. A grounded load requires a
floating measurement resistor with a separate differential amplifier
to create the current sensing voltage Vcs for current-feedback.
This simple configuration has one drawback in the fact that the load itself
is not grounded at one of its terminals. This is called a floating load with
a common-mode voltage relative to ground equal to the voltage over the
current sensing resistor.
In case the load needs to be grounded the configuration at the right side
of Figure 6.47 can be applied where the load and current sensing resistor
are reversed. With this reversal the current sensing resistor is floating
with a common-mode voltage that is equal to the voltage over the load. To
cancel this common-mode voltage an additional subtracting amplifier is
added that gives an output voltage to ground equal to the voltage over the
current sensing resistor. The remaining part is equal to the floating-load
configuration.
6.2. Active electronics 383
For most of the filters, the inverting amplifier of Figure 6.43 is used as a
starting point. The integrator and the related first-order low-pass filter are
both shown in Figure 6.48, because they differ with only one resistor.
Ohm’s law is also valid for complex impedances, so the transfer function of
an inverting amplifier according to Equation (6.72) can be written as follows:
Z2
Ga = − (6.83)
Z1
inverting filter and amplification steps. In those situations eventually one additional inverter
can be added if the resulting signal is still inverted.
384 Chapter 6. Analogue electronics in mechatronic systems
Z2
R2 ( or15k)
Z1
R (10 k)
C (0.1F)
R1 (15k)
_ R (10 k)
_
+
G () G ()
+
+ +
Vi Vo
_ _
^ J
J
Low pass
Magnitude (abs)
Magnitude [dB]
J
1 Integrator ^J
1 J
-1
J >^J
*^
J >J
J
Phase [deg]
>\
>|J
J 1 ^ 3
J J J J J
Frequency [Hz]
In the Bode plot this non-inverting integrator gives a line with a slope
of -1, that intersects the 0 dB level at the unity gain cross-over frequency
ω0 = 2π f 0 = 1/R 1 C and a continuous phase of −90◦ over the entire frequency
band.
When the resistor R2 over C becomes smaller than infinite, it will limit
the amplification at low frequencies and as a result a first-order low-pass
filter is created. The transfer function is determined by calculating Z2 as
the impedances of R2 and C in parallel and including the gain of −1 by the
6.2. Active electronics 385
second inverter:
Vo Z2 1 R2 1
F (ω) = = =
= (6.86)
Vi Z1 1 R 1 (1 + j ωR 2 C )
R1 + j ωC
R2
The differentiator and the almost identical first-order high-pass filter are
created with the same inverting amplifier configuration, but in this case a
capacitor is placed in the forward path as shown in Figure 6.49.
When the value of R1 is zero, a pure differentiator is created. When the gain
of −1 of the second inverter stage is included, the transfer function is:
Z2
D ( s) = = sR 2 C , D (ω) = j ωR 2 C (6.87)
Z1
In the Bode plot this results in a line with a slope of +1, that intersects the
0 dB level at the unity-gain cross-over frequency ω0 = 1/R2 C and a constant
386 Chapter 6. Analogue electronics in mechatronic systems
Z2
Z1
R2 (10k) R (10k)
R1 (0 or 10k) _ R (10k)
_
+
+
+ +
Vi Vo
_ _
^
J J
Magnitude (abs)
High pass
Magnitude [dB]
1
J
< ^J
1 J
-1
J >^J
*^
J >J
|J
Phase [deg]
\
J
J 1 ^ 3
J J J J J
Frequency [Hz]
compensate the low gain of the plant. The circuit can be adapted to this
gain by changing the resistors of the second inverting amplifier.
At this point the relevant time constants of the controller are chosen accord-
ing to the corner-frequencies, defined in the previous part. The first one, τi
defines the integrator corner-frequency f i . The second one, τd defines the
differentiator corner-frequency f d and the third one τt defines the taming
corner-frequency f t .
1 1
τi = = = R2 C2 (6.91)
ωi 2π f i
1 1
τd = = = (R 1 + R 3 ) C 1 ≈ R 3 C 1
ω d 2π f d
1 1
τt = = = R1 C1
ωt 2π f t
The first term is a proportional term, the second term the integrator and
the third term the tamed differentiator. As will be shown further on, the
proportional term needs a correction factor, to become equal to the P-control
gain kp
6.2. Active electronics 389
Z1 Z2
C2 (1F)
R1 (820) _ R (3k3)
_
+
+
+ +
Vi Vo
_ _
2
10
Magnitude (abs)
1
10
0
10
>
fi fd ft
10
|J
Phase [deg]
\
>\
>|J
> 0 1 2 3 4
10 10 10 10 10 10
Frequency [Hz]
Figure 6.50: Analogue PID controller and its Bode plot. The indicated values of
the resistors and capacitors are chosen for a PID controller, according
to the rules of thumb for a mass control system, with a targeted
unity-gain of the controller at the 200 Hz bandwidth. In reality the
proportional gain should be adapted to the real gain of the plant at
200Hz.
In the Bode plot the frequency response of this controller is shown with the
values as derived from the given components, τi = 8.2 · 10−3 , τd = 2.7 · 10−3 and
τt = 2.7 · 10−4 .
As presented in Chapter 4, the positive phase in the frequency band where
the lead network is working, creates an effective damping in a motion control
feedback configuration, like the servo system in a CD player. The integration
at low frequencies reduces steady state errors, due to constant disturbing
forces like gravity.
By tuning the values for the capacitors and resistors, the controller can be
adapted to the plant by means of loop shaping.
390 Chapter 6. Analogue electronics in mechatronic systems
The control gains kp for P-control , kd for D-control and ki for I-control,
can be related to the time constants and passive component values of this
controller. For this derivation the taming factor 1/(τt s + 1) is not taken into
account as this represents just an additional pole, that is applied without
impact on the standard control gains.
To determine the control gains, Equation (6.92) is written in the additive
way by multiplication of the terms. This gives the following result, that
corresponds with Equation (4.32) from Chapter 4:
R2 τd R2 1 R2 ki
C pid ( s) = g i 1+ + gi + gi τd s = k p + + kd s (6.93)
R3 τi R 3 τi s R3 s
Even though digital controllers have taken over many control actions in
modern equipment, still one important advantage of analogue controllers
over digital ones remains and that is the lack of delay due to sampling.
In the digital world differentiating is done at any sampling moment by
taking at least two samples, the present and the previous one, and dividing
the difference over the sampling time. The result of this operation can
only be supplied to the plant at the next sampling moment so at least one
sample period later and as a result, part of the beneficial phase lead will be
lost. One of the solutions is to use a model based controller, to predict the
states, as explained in Chapter 4, but in unpredictable circumstances it is
6.2. Active electronics 391
Z3
I3
Z1 Z2 +
+
I1 I2 G ()
+
_
Z4
Vi Vx V+ Vo
_ _ _ _
Figure 6.51: Sallen-Key active second-order filter configuration with positive feed-
back.
always better to avoid delays in the D-control part. This requirement has
pushed sampling periods to ever smaller values, but in extremely fast control
situations, like with a piezoelectric scanner for a video-rate Atomic Force
Microscope, with bandwidths that exceed the Mega Hertz level, analogue
controllers might still represent the best solution possible.
In Section 6.1.3 on passive filters it was shown, that inductors are required,
to achieve higher order electronic filters with an adequate transfer function.
Especially in the low-frequency area, the required high values of the self-
inductance lead to large sizes of the components and inductors in general
are rather expensive to manufacture. In this section on electronic filters
it will be shown with one example how, by using an operational amplifier,
these filters can be created with only small size capacitors and resistors. In
principle, these filters are designed to be applied with low power signals
and not between amplifiers and actuators, so they can be used only in the
path before the power amplifier.
The example that is presented here is a Sallen-Key filter named after R.P
Sallen and E.L. Key, two engineers of MIT Lincoln laboratory in 1955. It is
built around a non-inverting unity-gain amplifier, as shown in Figure 6.51,
where the output signal is positively fed back to the input to create complex
poles in the transfer function with imaginary parts.
The rules are applied again to calculate the transfer function. The positive
input is equal to the voltage at the negative input and also equal to the
392 Chapter 6. Analogue electronics in mechatronic systems
V + = Vo (6.97)
With Kirchhoff’s first law on currents, the sum of all currents at the common
node of Z1 , Z2 and Z3 equals zero. This means with the defined directions
according to the arrows:
I1 − I2 − I3 = 0 (6.101)
Using the previously found equations for the different currents, with some al-
gebra the following generic transfer function of this configuration is derived:
Vo 1
=F= (6.102)
Vi Z1 Z2 Z1 + Z2
+1
Z3 Z4 Z4
With this equation the transfer function of the second-order low-pass filter
of Figure 6.52 will be determined as the first example.
The following values for the impedances are chosen for this example:
1 1
Z1 = Z2 = R 1 = R 2 = R , Z3 = , Z4 = (6.103)
sC 1 sC 2
C1
R1 R2 +
+
G ()
+
_
C2
Vi Vo
_ _
Vo R 2 C 1 C 2 s2
= F ( s) = 2 (6.106)
Vi R C 1 C 2 s2 + 2RC 2 s + 1
J
J
!
Magnitude (abs)
>J
*
J >^J
|J
J
^ Z
J J J
!X
Figure 6.53: Three typical characteristics of a second-order high-pass filter with the
same corner-frequency but different levels of damping. For the But-
terworth characteristics the −3 dB bandwidth is equal to the corner-
frequency of 100Hz of this example. The Chebyshev characteristics
result in a −3 dB bandwidth at 70 Hz with the steepest slope below
the pass band. The Bessel characteristics result in a −3 dB bandwidth
at 200 Hz and a very gradual slope below the pass band.
1846). The Bessel characteristic shows a constant group delay in the pass
band, representing a linear phase to frequency relationship dφ/d f .
The same characteristics can be shown for low-pass filters and these exam-
ples are only a limited set of the entire range of different filter configurations,
that are designed for very specific purposes. Their design belongs to the do-
main of the specialist and with digital control even more transfer functions
can be realised, including variable delay and other tricks. Nevertheless the
mentioned configurations are frequently applied and already very suitable
for the design of general mechatronic systems.
The most important limitation of all is determined by the finite gain and
speed of an operational amplifier. Already at the presentation of a differ-
entiator it was mentioned, that an infinite gain at infinite frequencies is
impossible. This speed limitation is caused mainly by the parasitic capacitor
over the PN-junctions of the applied transistors. Especially the parasitic
396 Chapter 6. Analogue electronics in mechatronic systems
OPA627
PARAMETER CONDITIONS MIN TYP MAX UNITS
OFFSET VOLTAGE (1)
Input Offset Voltage 40 10 μV
AP, BP, AU Grades 100 250 μV
Average Drift 0.4 0.8 μV/° C
AP, BP, AU Grades 0.8 2 μV/° C
Power Supply Rejection VS = ±4.5 to ±18V 106 120 dB
INPUT BIAS CURRENT (2)
Input Bias Current VCM = 0V 1 5 pA
Over Specified Temperature VCM = 0V 1 nA
SM Grade VCM = 0V 50 nA
Over Common-Mode Voltage VCM = ±10V 1 pA
Input Offset Current VCM = 0V 0.5 5 pA
Over Specified Temperature VCM = 0V 1 nA
SM Grade 50 nA
NOISE
Input Voltage Noise
Noise Density: f = 10Hz 15 40 nV/√Hz
f = 100Hz 8 20 nV/√Hz
f = 1kHz 5.2 8 nV/√Hz
f = 10kHz 4.5 6 nV/√Hz
Voltage Noise, BW = 0.1Hz to 10Hz 0.6 1.6 μVp-p
Input Bias Current Noise
Noise Density, f = 100Hz 1.6 2.5 fA/√Hz
Current Noise, BW = 0.1Hz to 10Hz 30 60 fAp-p
INPUT IMPEDANCE
Differential 1013 || 8 Ω || pF
Common-Mode 1013 || 7 Ω || pF
INPUT VOLTAGE RANGE
Common-Mode Input Range ±11 ±11.5 V
Over Specified Temperature ±10.5 ±11 V
Common-Mode Rejection VCM = ±10.5V 106 116 dB
OPEN-LOOP GAIN
Open-Loop Voltage Gain VO = ±10V, RL = 1kΩ 112 120 dB
Over Specified Temperature VO = ±10V, RL = 1kΩ 106 117 dB
SM Grade VO = ±10V, RL = 1kΩ 100 114 dB
FREQUENCY RESPONSE
Slew Rate: OPA627 G =
–1, 10V Step 40 55 V/μs
OPA637 G =
–4, 10V Step 100 135 V/μs
Settling Time: OPA627 0.01% G –1, 10V Step
= 550 ns
0.1% G =
–1, 10V Step 450 ns
OPA637 0.01% G =
–4, 10V Step 450 ns
0.1% G –4, 10V Step
= 300 ns
Gain-Bandwidth Product: OPA627 G=1 16 MHz
OPA637 G = 10 80 MHz
Total Harmonic Distortion + Noise G = +1, f = 1kHz 0.00003 %
POWER SUPPLY
Specified Operating Voltage ±15 V
Operating Voltage Range ±4.5 ±18 V
Current ±7 ±7.5 mA
OUTPUT
Voltage Output RL = 1kΩ ±11.5 ±12.3 V
Over Specified Temperature ±11 ±11.5 V
Current Output VO = ±10V ±45 mA
Short-Circuit Current ±35 +70/–55 ±100 mA
Output Impedance, Open-Loop 1MHz 55 Ω
TEMPERATURE RANGE
Specification: AP, BP, AM, BM, AU –25 +85 °C
SM –55 +125 °C
Storage: AM, BM, SM –60 +150 °C
AP, BP, AU –40 +125 °C
θJ-A: AM, BM, SM 200 ° C/W
AP, BP 100 ° C/W
AU 160 ° C/W
R
dVcb Ic
I cb Ccb
dt t Vcb
Ccb
Ib Ii I M
Vp
Ii
t
Figure 6.55: Speed limitation due to the collector base capacitor Ccb . A voltage
swing at the collector causes a current in Ccb , that has to be supplied
by the input voltage. With a voltage amplifier, this is perceived at the
input as a Miller capacitor with a larger value than Ccb .
capacitor Ccb between the collector and base is important, as it reduces the
current-amplification at higher frequencies. The effect of this capacitor on
a stepwise change in the input current is shown in Figure 6.55).
This input current ( I i ) would normally be equal to the base current ( I b )
and cause a proportional change in the collector current ( I c ). The collector
current will cause a proportional drop of the collector voltage over resistor
R , that induces a current ( I cb ) in C cb , that in its turn reduces the current in
the base:
dVcb
I b = I i − I cb = I i − C cb (6.107)
dt
To achieve a high speed, additional base current is necessary to compensate
this effect depending on the capacitor value. Because of the relation of the
current in Ccb with the voltage swing at the collector, this parasitic capacitor
is perceived at the input as a much larger capacitor, the Miller capacitor
CM .
C M = C cb (G v + 1) (6.108)
VCC+
36 pF
IN+ T5 T7
T4 37 pF
14 pF
15 Ω
T1 T2 OUT
7 pF
IN–
15 Ω
T3
T6 T8
460 Ω
VCC–
Figure 6.56: Schematic circuit drawing of the NE5532, a general purpose audio am-
plifier with bipolar transistor inputs. The transistors from Figure 6.40
are made recognisable and several more transistors are added for con-
trolling the internal currents and voltages and for protection like the
current protection transistors between T7 and T8 . The dominant pole
is created by a feedback capacitor of 37 pF around the high gain stage
by T4 and T5 . The other capacitors are added for further fine-tuning
the open-loop frequency response.
(Courtesy of Texas Instruments)
OPA627
®
7
Trim +VS
Trim
1 5
C1
C2
Output
6
+In –In
3 2
–VS
4
Figure 6.57: Schematic circuit drawing of the OPA627, a high precision amplifier
with Field Effect Transistor (FET) inputs to reduce the input current.
The dominant pole is created by increasing the pole of the parasitic
capacitors around the large gain amplifier stage between C1 and C2 .
(Courtesy of Texas Instruments)
2π f = 1/RC for each capacitor with its source resistance. This effect is even
increased with the collector-base capacitance of each transistor as explained
with the Miller capacitor. When two or more of these first-order poles are
located in the frequency range where the gain is larger than one, the closed-
loop transfer function becomes marginally stable or even unstable when
some additional phase lag is introduced by a third pole.
To illustrate this dynamic effect, an example open-loop frequency response
with marginal stability is shown in Figure 6.58. The red line shows the
frequency response of an amplifier with an open-loop gain of 106 and two
poles below 1 MHz. The first pole is located at 100 Hz and the second pole at
10 kHz, where the gain is still 104 . The phase margin approaches zero even
below 1MHz, where still some gain is present. This can create problems
when the closed-loop gain is < 100 as a phase margin of approximately 45◦ is
preferred for a well damped behaviour.
This problem can be solved by limiting the open-loop gain and creating one
dominant pole at a lower frequency. This causes the amplification to go down
with a slope of −1 from that frequency. The pole can be realised by placing
an internal larger capacitor in the part of the operational amplifier, that is
most prone to parasitic poles, like those related to the Miller capacitances
in the high voltage gain stage. This capacitor can be placed parallel to
an existing parasitic collector-base capacitance to move the main pole to
400 Chapter 6. Analogue electronics in mechatronic systems
6
10 120
10
5 -1 100
Magnitude (abs)
Magnitude [dB]
4
10 -1 }J
3
10 60
2 -2
10 40
1
10 20
10
0 Uncompensated 0
Compensated
0
Phase [deg]
>\
>|J
>Z\
>}J
0 2 4 6
10 10 10 10
Frequency [Hz]
Figure 6.58: Poles in the transfer function of an operational amplifier can be can-
celled by one dominant pole. This example shows two poles in the
uncompensated situation, one at 100 Hz and one at 10 kHz. It is clear
that the phase margin past 100 kHz is very small. With the dominant
pole at 1 Hz in the compensated situation the phase margin becomes
45◦ .
Gain-bandwidth product
The creation of only one dominant pole in the frequency range where the
gain is larger than one is reflected by a continuous −1 slope in the frequency
response above the corner-frequency that is determined by this pole. As a
consequence the product of the gain and the frequency is constant at any
frequency between the corner-frequency and the unity-gain cross-over fre-
6.2. Active electronics 401
Figure 6.59: The slew-rate of an operational amplifier limits the speed of change of
the output voltage. At high frequencies the output can not follow the
sinusoidal waveform anymore and tracks the signal with its maximum
dV/dt.
quency. This constant value is called the gain-bandwidth product and with
this value a designer can immediately determine the achieved bandwidth
at a certain closed-loop amplification. Ideally the gain-bandwidth product
is infinite.
Slew rate
The input offset voltage is the voltage needed between the + and - inputs, to
achieve 0 V at the output. This is caused by small differences between the
transistors in the differential gain stage of the operational amplifier. The
presence of an offset voltage means in a feedback circuit, that the first rule
in reality is a bit different. It should have been said that the amplifier will
402 Chapter 6. Analogue electronics in mechatronic systems
R2
I bias
_
+
G ()
+
+
+
V- V+ R1 Vo
I bias
_ _ _
Figure 6.60: The bias current in both inputs of an operational amplifier causes a
voltage drop over the source resistors R1 and R2 . When both resistors
are equal the effect on the output voltage is zero.
The input bias and offset current represent the currents that flow into or
out of the input terminals and that are required to operate the amplifier. In
electronics biasing refers to adding a voltage or current to the useful signal
in order to make the system work. The presence of this current implies,
that also the second rule about zero currents in the inputs is not completely
true. In a bipolar transistor input stage, like with the NE5532, the bias
current is necessary to drive the first transistor with a current level in the
order of several nA. With special transistors like the FETs in the OPA627,
the input current can be reduced to several pA. A low value of the bias
current is important when the path from the inputs to ground has a high
resistive value. This current would otherwise influence the output voltage to
compensate the related voltage drop over the resistors. The influence of the
6.2. Active electronics 403
bias current can be reduced by designing the amplifier in such a way that
the source (Thevenin) impedance of the circuit around the + input equals the
impedance of the circuit around the − input. This is shown in Figure 6.60
where the output voltage is zero in case of an equal bias current on both
inputs and when both resistors are equal:
V + = 0 − R 1 I bias
+
(6.109)
− −
Vo = V + R 2 I bias
V+ = V− =⇒ −
Vo = R 2 I bias +
− R 1 I bias =0
It is clear that an offset in the bias current will result in an offset voltage at
the output even when the resistors R1 and R2 are equal.
It is especially important to be aware that a capacitor as is applied in the
feedback path of an integrator, is not able to conduct DC-currents like the
bias current.
Ideally the bias current is zero.
Input resistance
The input resistance is the value that corresponds with the part of the input
current that is dependent on the input voltage so excluding the bias cur-
rent. This property is due to the transistors and resistors around the input
circuits. Differential-mode is the resistance between both input terminals
and common-mode is the resistance of both inputs to ground. In case of
a feedback system, the differential-mode resistance plays a very limited
role as the voltage between both inputs is kept close to zero. In reality the
differential input resistance is reduced with a factor equal to the loop gain.
With an inverting amplifier also the common-mode resistance is of little
404 Chapter 6. Analogue electronics in mechatronic systems
importance as both input voltages are kept at ground level. Only with a non-
inverting configuration, the common-mode input resistance has influence
on the input currents.
Ideally the input resistance is infinite.
In most operational amplifiers, the inputs are not allowed to exceed the
power supply voltage level and normal operation is only guaranteed over
an even smaller range. Often the inputs are protected against occasional
higher voltages by diodes but these will also reduce the input resistance
and for that reason, protective input diodes are not used in very critical
designs. Exceeding these values results in clipping of the output voltage to
the maximum level and it can even cause damage to the device.
Ideally the input voltage range is infinite.
Noise and level variations such as ripple of the power supply does preferably
not have have influence on the output voltage. The power supply rejection
ratio (PSRR) is defined as the ratio between a power supply voltage change
and an equivalent input voltage giving the same effect on the output.
Ideally the PSRR is infinite.
This determines the change of the output voltage as function of the output
current. In a feedback system this value is reduced by a factor equal to
6.2. Active electronics 405
the loop gain. Because the loop gain is frequency dependent, due to the
main pole, also the effective closed-loop output impedance will be frequency
dependent. As it increases with frequency, this can be modelled as a very
small series inductor.
Power consumption
These and several other issues make the electronic field to an important, ex-
citing and rather difficult specialisation, where a vast amount of knowledge
is gained in the last decades.
6.3. Power amplifiers 407
Combined with the actuators and the mechanics, the power amplifier deter-
mines the dynamic behaviour of the mechatronic plant. Power amplifiers
differ from the amplifiers in the previous section in their focus on reliable
electrical power delivery and efficiency. As will be demonstrated, this fo-
cus has invoked several special technologies, that are only partly or hardly
required in low power signal amplification. With the example of a modern
tube amplifier according to ancient technology as shown in Figure 6.61 it
is already clear that power is equal to size and heat but even with more
efficient semiconductor devices like the ones shown in Figure 6.62 a power
amplifier will remain always recognisable by the large components that
have to sustain elevated voltages, currents and heat. It is also true to say
that the design of power amplifiers will appear to be quite similar in certain
ways, in spite of these differences. Especially the presented topology of the
operational amplifier can be directly used for the design of linear power
Figure 6.61: Only 50 years ago proportional electronic control of high power could
only be achieved by means of electron tubes working in vacuum at very
high voltages with glowing wires to create free electrons. The shown
audio power amplifier can deliver two times 80 Watt while producing
about the same amount of heat. Fortunately semiconductors can do
the job with less effort although also with less beauty.
408 Chapter 6. Analogue electronics in mechatronic systems
Figure 6.62: The high power semiconductors as shown on the right can be distin-
guished from signal transistors as shown left by their mechanical
means to connect them with a heat sink in order to keep the tem-
perature of the semiconductor material below the level where the
semiconductor stops working (≈ 150 − 200◦ ).
amplifiers.
This section starts with an overview of the requirements of power amplifiers
with an emphasis on the application with electromagnetic actuators. This
will be followed by a subsection on linear amplifiers for moderate power
levels with both a voltage and a current source output. The last subsection
will present pulse-width Modulated switching output stages, that are used
to reduce the dissipation at high power levels.
• Dynamic properties.
• Output impedance.
• Efficiency.
• Linearity.
6.3. Power amplifiers 409
A large power implies automatically a high value of the current and volt-
age. As it appears to be difficult to integrate high voltage electronics, the
power stages in these amplifiers are often built with active and passive
discrete components. These large coils, resistors, capacitors and high power
semiconductors generally produce a lot of heat. Silicon semiconductors can
withstand temperatures on the chip of 150◦ – 200◦ , but due to the heat
transfer resistance of the packaging, mostly the temperatures at the outside
of the device have to be kept at a more reduced temperature level.
These thermal issues automatically require efficient methods to remove
the heat from the sensitive electronics. The mechanical design of signal
amplification electronics with thin printed circuit boards is a rather two
dimensional exercise but the lay-out of a power amplifier is a real three
dimensional mechanical challenge. The thermal design with cooling plates
and the use of large parts requires a design, that takes the mass and the
environmental mechanical dynamics under serious consideration. Solder
joints for instance are not capable of sustaining high mechanical loads and,
while this is a frequent cause of malfunctioning electronics, this is just an
example of the special skills needed when designing power amplifiers.
Another issue of the high power is the potential interference by magnetic and
electric fields, the Electro Magnetic Compatibility (EMC), caused by rapidly
changing high voltages and currents. Also this aspect requires a careful
three dimensional lay-out with shielding plates and conscious routing of the
wiring.
Dynamic properties
Output impedance
Efficiency
Figure 6.63 shows a basic design of a linear power amplifier with a volt-
age source output stage. At a first glance this design looks very familiar
to a low power operational amplifier. There are however at least two sig-
nificant differences. First of all the gain stage is designed to be able to
deliver a high output voltage, significantly above the regular power supply
voltage of ≈ +/ − 15 V of the differential input stage. This is achieved with
a level-shifter that consists of a discrete pair of high voltage transistors
in voltage amplification mode. The second difference is the high-current
power-output stage, that consists of a set of transistors with a very high
current-amplification ratio, like the Darlington pairs that are named af-
ter the American electro technician Sidney Darlington (1906 – 1997), who
invented the configuration. These pairs are in fact two cascaded emitter
followers and can deliver currents up to several Ampères with a current-
amplification of more than 1000. Also sometimes high-power MOSFETS
are used as these do not require a continuous input current.
The function of the level-shifter is as follows. The input of transistor T3 is at
a low voltage level and gets its input voltage from a low-power operational
412 Chapter 6. Analogue electronics in mechatronic systems
Vp 50 V
68
Da1
T1
27 k
10 k
50 pF
0.1
I idle | 10 mA
_
+
Vbias
+
0.1
G () T3 _ | 2.7 V Vo
Rload
1k
+
_
+
Vi T2
_
10 k
Da2
Vp 50 V
Figure 6.63: Basic design of a linear power amplifier with a voltage source output
stage and a closed-loop gain of ≈ 30 dB, showing the main components.
The transistors T1 , T2 and T3 determine a level-shifter to achieve a
large output voltage range. The push-pull output stage consists of two
Darlington pairs Da1 and Da2 , that can handle a large output current
up to 10 A. The operational amplifier serves as differential input and
gain stage and the capacitor C creates the dominant pole for stability.
+
G ()
+
+ R1 _
Ra
Actuator L
a
Io -1
Vm
+
1 Ra
Loopgain
Z0
Vi _ Vo We La
R2 -2
Vcs I o R3 Z0 Z
R3
_ _
Figure 6.64: Current-feedback is achieved by measuring the current with the series
resistor R3 and using this voltage as feedback signal. Because of
rule one the current through the actuator is only dependent on the
input voltage and not on the output voltage. Unfortunately the self-
inductance introduces an additional pole that can cause instability.
Current-feedback
+
G ()
+
R1 _
+
Ra
Actuator L
a
Io -1
Vm
+
1 Ra
Loopgain
Z0 R2C
Vi _ Vo We La
R2 -2
Z0 Z
R3
C
_ _
Figure 6.65: Pole compensation by a parallel capacitor over the feedback resis-
tor cancels the effect of the self-inductance but impairs the current
measurement and reduces the closed-loop gain at higher frequencies.
This can in principle be compensated by an additional differentiating
lead-network in the controller.
voltage. Unfortunately this is only true when the feedback loop has sufficient
closed-loop gain and phase margin. With a real electromagnetic actuator
the self-inductance of the actuator-windings causes problems, because above
a certain frequency the impedance of the self-inductance will determine
the impedance of the actuator as was explained in Section 5.4.1.1. This
frequency equals:
1 Ra
ωe = = (6.110)
τe La
Above this frequency, the feedback path from the output of the amplifier
to Vcs will have a low pass characteristic with one pole and a −1 slope that
adds to the internal −1 slope of the operational amplifier. This results in a
marginal stability of the feedback system as is visible at the combined −2
slope in the simplified amplitude Bode plot of Figure 6.64. This effect could
be compensated by creating a differentiating action, a lead-network in the
loop. This is possible by adding a parallel capacitor over R2 with the value:
La
τe = R 2 C = (6.111)
Ra
This would indeed increase the phase margin as shown in Figure 6.65 but
it would also change the closed-loop current source characteristics of the
6.3. Power amplifiers 415
Vp 50 V
+
Vbias
1
| 1.4 V _
100
9k1
_
Da1
G () T1
1k
+
+
100
Vbias _
+
Vi
_ | 1.4 V
Rload
Vo
T2
Da2 _
100
+
Vbias
1
| 1.4 V _
Vp 50 V
Figure 6.66: By reversing the Darlington pairs of Figure 6.63 a current source
output is obtained. This amplifier delivers 10 A output current at 1 V
input voltage. For the positive half of the signal, transistor T1 acts
as a level-shifting voltage amplifier with a gain of one, Da1 converts
the voltage of the collector of T1 into an output current. The negative
cycle is taken care of by T2 and Da2 with their surrounding resistors.
The operational amplifier only acts as a non-inverting amplifier with
a gain of 10, without feedback over the total amplifier.
voltage amplifier with a gain of ten and a low output impedance and its
output voltage is used as the input for the two level-shifter voltage amplifying
transistors T1 and T2 . The current in these transistors is determined by
the resistor of 100 Ω, that is shared to linearise the transition between
the positive and the negative cycle. In the positive cycle of the signal, the
current in T1 will increase, resulting in an equal decrease of the voltage at
the base of the Darlington pair Da1 , because of the 100 Ω collector resistor.
This results in an increased voltage over the 1 Ω resistor in the emitter of
Da1 , with a corresponding increase of the positive output current. For the
negative cycle T2 and Da2 act the same.
A special aspect in this configuration is the equal value for the shared
emitter resistor and the collector resistors of the level-shifters. This is done
to linearise the amplification of the level-shifters, because there is no further
feedback that could correct any non-linearity. This is also the reason for the
relatively high value of the emitter resistors of the Darlington pairs. The
low value of 100 Ω around the level-shifters is chosen such that the current
in the level-shifters at the maximum output current of 10 A is more than
a factor ten higher than the input current of the Darlington pairs, again
for linearisation reasons. At 10 A output current, the corresponding 10 V
voltage over the 100 Ω resistors requires a current in the level-shifters of
100 mA. The input current of the Darlington pairs is then 10 mA because of
the current-amplification ratio and that is only 10 % of the current in the
level-shifters.
It is clear from this example that all these measures come at a price in power
dissipation and performance. The relatively high currents and resistor
values both limit the output voltage and require significant measures to
keep the temperatures of the semiconductors and the other parts at an
acceptable level.
For that reason a more optimal solution will be to combine the current-
feedback configuration with grounded load of Section 6.2.4.6 with the output
stage from Figure 6.66, while using smaller emitter resistors to limit the
power dissipation. The current-feedback will reduce any non-linearity and
the collector follower configuration will reduce the problem of the additional
pole by the self-inductance of the actuator.
At the end of this chapter, when presenting three-phase resonant-mode
amplifiers, an alternative loss-less current sensing method will be shown
which gives an almost ideal amplifier configuration and can be used also
with linear power amplifiers.
6.3. Power amplifiers 417
Vp+
Current
I,V Voltage
V1
0
Vp- t1 t
Figure 6.67: With a complex load, the current from the amplifier will have a phase
difference with the voltage. This means, that the amplifier needs to
have a four quadrant capability by delivering positive and negative
currents at any sign of the output voltage. This requirement implies
a very high power dissipation.
Figure 6.68: The switched-mode power amplifier from Figure 6.69 that was de-
signed for the first electric wafer stage for the Silicon Repeater wafer
stepper.
p+ Channels p+ D
n+ p n+
n- Epi Layer
G
n+ Substrate S
Drain
Drain
symmetric structure, where the current can flow in both directions. Indeed
that is the case and although these devices are optimised to work with
currents running in only one direction, especially in power MOSFETs this
phenomenon is used to advantage in switched-mode amplifiers. This op-
timisation to work in only one direction has a direct connection with the
practice to connect the Source directly to the substrate, as they will have the
same potential, when the current runs from the Drain to the Source in an
N-channel MOSFET. As a consequence, the junction between the N-material
of the Drain and the P-material of the substrate determines a Silicon diode
between the Source and the Drain. This diode helps conducting current
from the Source to the Drain, opposite to the normal current direction and
it will be demonstrated, that this is a very useful property in switching
inductive loads.
Like all other elements in electronics, also power MOSFETs suffer from
several parasitic capacitances and for that reason they are not really ideal.
The most important is the capacitance of the Gate, that amounts in the
order of 10 nF for a power MOSFET that can switch more than 200 A. Also
the capacitance between the Drain and the Gate plays a similar negative
424 Chapter 6. Analogue electronics in mechatronic systems
Pulse-width modulator
+ 1 Vo
+
G ()
+
_ Vi
R1
~
LC- Vo
+
_
Vi 2
!!
_
C
R2
_
Vi
2
Vo
In reality the balanced switching output stage creates a high power version
of the square-wave from the output of the Schmitt trigger. By measuring the
output before the filtering, a signal is obtained that in case of an ideal power
stage would be proportional to the output of the Schmitt trigger. Because
nothing is ideal, this feedback of the real switched output signal will help to
correct errors in this stage because of the high gain of the integrator.
The Schmitt trigger in the example amplifier of Figure 6.69 is realised with
positive feedback around the second operational amplifier (IC2).
428 Chapter 6. Analogue electronics in mechatronic systems
To investigate the relation between current, voltage and power, Figure 6.72
gives a detailed insight in the behaviour of a switching power output stage,
including the reactive components.
To illustrate the effects with real numbers this example shows the signals
with the actual component values of the example amplifier, a 150 kHz
switching frequency with an output filter consisting of an inductance of
1 mH and a capacitor of 0.33 μF. An equivalent single-ended output stage is
chosen with equal positive and negative supply voltages of 30 V. A duty-cycle
of 50 % results in an average output voltage of 0 V.
One cycle of 6.66 μs is shown in the figure, equally divided in four steps.
The starting point of the first step is taken when the current in the inductor
is zero and the first power MOSFET M1 is just switched on, transferring
the full positive power supply voltage of 30 V to the voltage Vb at the bridge
between the switches. As will become clear after the last step, the voltage Vo
at the output of the filter is then at its maximum negative value of ≈ −0.1V
and never grows bigger than an alternating value of this same magnitude.
Because of this low value of the output voltage at this duty-cycle, the total
voltage difference over the inductor is only determined by the bridge-voltage
Vb = 30 V and as a result the current in the inductor of 1 mH will increase
in the first 1.66 μs to a maximum level Î L determined by:
τ
dI Vb ( t) Vb
Vb = L =⇒ Î L = dτ = d t ≈ 0.05 [A] (6.114)
dt L L
0
This current flows in the capacitor of 0.33 μF and the voltage change is
calculated by integrating the current over the 1.66 μs:
τ
dVo I L ( t) Î L
IL = C =⇒ δVo = dτ = 0.5 d t ≈ 0.1 [V] (6.115)
dt C C
0
Vp+ Vp
+
Io Io
M1 Bridge L M1 L
+ + +
IL Io Io IL Io
Vb IC Vo IC Vo
C C
- - -
M2 M2
- -
Vp Vp
Step 1: M1 switched on. Step 2: M1
<2 switching on.
IL
!
Q IL
!
! 2 and decreases.
Vo
'Q Vo
'Q
+ +
Vp Vp
Io Io
M1 L M1 L
+ +
Io IL Io IL Io
IC Vo IC Vo
C C
- -
M2 M2
- -
Vp Vp
Step 3: M2 switched on. Step 4: M2
<1 switching on.
IL
!
Q IL
!
! 1 and decreases
Vo
'Q Vo 'Q
Vb + 30 V Vb Vb Vb
0 0 0 0
+ 0.05 A - 30 V
IL IL IL IL
0 0 0 0
3.33 s
- 0.05 A
Vo Vo + 0.1 V Vo Vo
0 0 0 0
t0 - 0.1 V t t0 t t0 t t0 t
Step 1: Step 2: Step 3: Step 4:
+30 V +30 V
Vb 0V Vb 0V
+3.05 A
IL +3A
+0.05 A
IL 0A +15.1 V ; +3.02 A
Vo+15 V
Io +3 A
+0.1 V ; +0.02 A
Io 0A
Vo 0V
t0 t1
Transion from 50 % to 75 % duty cycle at t1 A$er stabilisao
$ '
Figure 6.73: When the amplifier is loaded with a resistive load of 5 Ω a change in
the duty-cycle from 50 % to 75 % will cause an average output voltage
of +15 V with a corresponding average output current of 3 A. At the
transition moment t0 first the current and the voltage will increase
until an equilibrium is reached. The ripple remains almost constant
with only a change of waveform.
M2, the current will first flow through the diode of M2 before M2 is switched
on completely. This proves the usefulness of this diode as without the diode
the voltage Vb could be forced to more negative values than Vp− , leading to
damage of the MOSFET.
As a result of the negative voltage over L the current I L will decrease again
until it is zero at the end of the second step.
The third and the fourth step are identical to the first and second step. Only
the signs are different and as a result the end of the fourth step is identical
to the beginning of the first step.
In this still quite ideal situation it is clear than no power is dissipated in any
element. The switching MOSFETs either conduct current without a voltage
between their terminals and the reactive elements only store energy.
The reasoning from Figure 6.72 is true for the situation with 50 % duty-cycle
and zero average output voltage. In that case also the average output current
I o to a resistive load is zero. This amplifier was designed for a 5 Ω load
and Figure 6.73 shows what happens with the current, when the duty-cycle
is changed to 75 % and the load of 5 Ω is applied. At that duty-cycle the
6.3. Power amplifiers 431
This corresponds with the value found in the previous analysis and is about
-50 dB below the maximum voltage level of the switching output. It is
important to note that this ripple voltage is always present, even without a
low-frequency output signal. The only way to reduce this ripple would be by
either increasing the switching frequency, by taking a higher order filter,
or by lowering the corner-frequency of the filter. In one way this illustrates
the need for very fast switches when these amplifiers are applied in high
precision positioning systems. On the other hand a signal of 100 mV at
150 kHz is not expected to give much acceleration in a heavy positioning
system and it is also not audible too.
Like with all decisions in mechatronic design, for every specific situation it
has to be examined, whether these artefacts would give a problem or not.
432 Chapter 6. Analogue electronics in mechatronic systems
• The interaction between the output filter and the load impedance.
These issues are all addressed in the following subsections to complete this
chapter on power amplifiers.
Po
' MOSFET switching branch Vb +12V Linear
Vb +15V Vp+
regulator
Linear Enable
regulator circuit D
Opto-coupler +5V
Po
'
1 4 G
Gate-driver
2 3
MOSFET
S
Vb LF output
Bridge Output lter
'FET switching branch Vp- +12V Linear
Vp-+15V
regulator
Linear Enable
regulator circuit D
Opto-coupler +5V
Neg'
1 4 G
Gate-driver
2 3
MOSFET
S
Vp-
Figure 6.74: Gate drivers for the switching power MOSFETs need to relate the
Gate voltage to the Source voltage, that is either at the negative supply
voltage Vp− or at the bridge-voltage Vb . These voltages are different
from ground and Vb is even fast switching between both supply voltages
which means that the Gate driving circuits must be insulated from
the control part. This is done by means of opto-couplers.
(Courtesy of ASML)
6.3.3.7 Charge-pumping
The presented single configuration with two power supplies appears to give
a problem with these energy sources, because the load is connected between
its output and ground. The problem occurs when the amplifier delivers a
low-frequency or DC current to the load and is explained with the help of
the black arrows in Figure 6.72.
When a DC current is delivered to the load, this DC current is superimposed
on the high-frequency AC currents as shown with the red arrows. When this
positive current is larger than the amplitude of the high-frequency current,
the total current in the inductor has become unidirectional in all steps of
the switching process. For step one and four that is no problem because
this positive current is supplied by the positive power supply. But in step
two and three this positive current will be supplied from the negative power
supply to ground. This means that the negative power supply delivers a
negative power. Or in other words, the negative power supply receives power.
This is also imaginable when recollecting the fact that the positive current
equals a flow of negatively charged electrons and in step two and three these
electrons move from ground towards the negative power supply. As a result
of this process, the negative power supply gets more charge.
The energy involved in this process originally came from the positive power
supply in step one where it was stored in the inductor and for that reason
this process is called charge-pumping. With a positive DC output current
of the amplifier, the negative supply should be capable to accept all this
negative charge by delivering this positive current. Like with a linear power
amplifier, most standard power supplies do not have this four-quadrant
capability where both the positive and negative supply voltage can handle a
positive and negative current. Take for instance the power supply that is
made with the diode-bridge rectifier and storage capacitor from Figure 6.31
of the previous section. A positive current from the negative power supply,
that is caused by charge-pumping, will result in a continuous increase
of the negative supply voltage, because the diodes of the rectifier can not
conduct current in the reverse direction. This increase of the negative supply
voltage will continue until damage occurs in the storage capacitor or in other
components.
The same effect but in the opposite direction happens with a negative DC
current, in which case a continuous charge is transported from the negative
to the positive power supply.
When only alternating currents are to be delivered to the load like in audio
6.3. Power amplifiers 435
power amplifiers, the storage capacitor can handle the short excess of charge
in each signal period. Mechatronic systems often require a more extended
period with a DC current component and in that case a solution for this
charge-pumping phenomenon has to be found.
One solution can be to use a power supply with four-quadrant capability.
In that case the resulting energy can for instance be dissipated somewhere
inside the power supply in a resistor, although then a large part of the benefit
of the switched-mode amplifier is lost. The best solution is to transfer the
energy back to the mains supply. This is achieved in special switched-mode
power supplies, that operate with the same principles as a switched-mode
amplifier. By switching currents and voltages over reactive elements and
the application of high-frequency transformers to change the voltage levels,
they are optimised for DC voltages only, with as much as possible a voltage
source output impedance. With additional switches the conversion from the
mains power to the DC output power can be reversed but power supplies
with that capability are more complex than power supplies that work in only
one direction.
Another good example of a power supply with four-quadrant capability is a
rechargeable battery, as long as the maximum charge level is not exceeded.
When the power supply is not able to handle these currents that run contin-
uously in the opposite direction of the voltage, the problem can be solved in
the amplifier itself, by using the H-bridge or dual-ended configuration as is
applied in the example amplifier.
Vp+ Vp+
M1 M3
Load
M2 M4
Vp- Vp-
Vp+ Vp-
Step 1: M1 and M3
Q
!"Q
Vp+ Vp+
M1 M3
Load
M2 M4
Vp- Vp-
Vp- Vp+
Step 2: M1 and M3
<Q
!"Q
Source of the power MOSFET at the negative side avoids the need for an
insulated Gate drive circuit for that power MOSFET.
The second advantage of a dual-ended configuration in a switched-mode
amplifier is the absence of charge-pumping. In Figure 6.75 it is shown that,
in the event of a positive DC current to the load at step one, the current
flows from the positive supply voltage to the negative supply voltage. Both
supplies are loaded with the same current and both deliver energy to the
inductors and the load. At step 2 the current flows from the negative supply
to the positive supply, while both supplies regain energy from the stored
magnetic energy of the inductors.
6.3. Power amplifiers 437
Vp+
Filter Load
1 mH
M1
0.33 1 F 8 mH
F
M2
Vp-
This is a very low value and if this load was purely resistive in reality, a filter
with an inductor of 0.1 mH and a capacitor of 3.3 μF would have been more
appropriate. The load is however an electromagnetic actuator with a series
inductance of 8 mH. This gives an electrical time constant of τe = 1.6 ms and
as a result the actual impedance rises above ≈ 100 Hz with a slope of +1 to
become approximately 430 Ω around the corner-frequency of 8.6 kHz of the
LC-filter. With that almost purely inductive value the LC-filter would have
almost no damping anymore and noise on the input of the amplifier would
be strongly amplified at that frequency.
438 Chapter 6. Analogue electronics in mechatronic systems
Hard-switching
+
Vp
C r1
Gate-driver 1
M1 L
Bridge
+
IL Io
IC Vo
C r2 C
Gate-driver 2 -
M2
-
Vp
Figure 6.77: A resonant switching amplifier uses two additional capacitors Cr1 and
C r2 parallel to M1 and M2 that reduce the transient speed in a
non-dissipative way to a sufficiently low level that the dissipa-
tion in the MOSFETs caused by their Drain-Gate capacitors
is largely avoided. The additional parallel diodes are faster
and have a slightly lower threshold voltage than the intrinsic
diodes of the MOSFETs.
(Courtesy of ASML)
In Figure 6.78 the basic principle of the control of the resonant-mode output
stage is shown for a single-ended output stage with two supply voltages.
Because the problem originates in the current, this controller is essentially
a current-feedback system. As a direct consequence of this controller, the
amplifier becomes a current source amplifier, like is preferred in a mecha-
tronic positioning system.
The controller continuously monitors the current I L through the inductor
and switches the MOSFETs when I L has reached certain predetermined
reference levels. These levels are always positive for the MOSFET that
switches the positive voltage and negative for the MOSFET that switches
the negative voltage. In case of a positive output current, the positive ref-
erence level is increased and with a negative output current, the negative
reference level is increased. The MOSFETs are switched on again as soon
as their current is reversed at the moment of zero current.
6.3. Power amplifiers 441
Current Controller +
-
Iset +
IL
-
+
Io
- +
IL Ilim
I
Io -
+ Ilim t
Ilim
- Iset
Ilim
Figure 6.78: The controller of a resonant power output stage measures the current
through the inductor. When this current has reached a certain limit,
that is defined by the output current plus an offset, the MOSFETs are
switched off. When the current is zero, the MOSFETs are switched
on. This results in a controlled output current.
(Courtesy of ASML)
The switching behaviour with this controller is shown in Figure 6.79. In this
figure some simplifications are applied. First the two capacitors from the
bridge point to the power supplies are replaced by one capacitor to ground.
This is allowed as power supplies have a voltage source characteristic with
an equivalent impedance to ground of 0 Ω. The second simplification is the
replacement of the capacitor of the output filter by a voltage source. This is
allowed as the voltage over the capacitor will change only very little within
one switching cycle of the high-frequent current, as was demonstrated with
the normal switching output stage in Figure 6.72.
The main difference with the switching cycle of Figure 6.72 is the moment
that the MOSFETs are switched off. This is determined uniquely by a pos-
itive current level for M1 and a negative current level for M2. After the
MOSFETs are switched off, the voltage Vb of the bridge will not change ex-
tremely fast, because of the additional capacitor Cr . The slope is determined
by the current level and the capacitor value. During this slope the energy
in the inductor flows partly into the capacitor and the total behaviour is
determined by the capacitor and the inductor only.
In fact it is this short moment in the natural resonating mode of the LC-
combination that gave the name of “resonant-mode” to this configuration.
442 Chapter 6. Analogue electronics in mechatronic systems
Vp+ Vp
+
Vp Vp
L Vb L Vb
M1 Bridge M1
+
IL + IL +
Vo Vp Cr Vo Vp
Cr
- -
Vb IL I lim IL I lim
-
M2
M2 I lim I lim
- -
Vp Vp
+ +
Vp Vp
Vp
Vb Vp
M1 L L Vb
M1
Vp
IL + IL +
Vo I
I lim Cr Vo Vp
Cr L
- -
I lim IL I lim
M2
M2 I lim
- -
Vp Vp
Step 3: M1< Step 4: M2 switched on,
IL
!!
Q !'
'Q
IL
Vp+ Vp
+
Vp
Vb Vp
L L Vb
M1 Bridge M1
IL + V p IL +
Cr Vo Cr Vo Vp
IL I lim
- -
IL I lim
I lim
M2 M2
I lim
- -
Vp Vp
Step 5: M2
< Step 6: M2<
!!
Q
IL !!
Q
IL
Figure 6.79: In a resonant switching power stage after switching one MOSFET off,
the current is taken over by the capacitor and without any dissipation
the bridge-voltage changes in a controlled way. The MOSFETS are
switched on as soon as the current reverses.
6.3. Power amplifiers 443
IS
+ IC
_
VH 0 IIPP
VH
B
IC IC
+
_ VH CH B
IS
+
Figure 6.80: A Hall sensor and a current sensing module that measures the current
indirectly by measuring its related magnetic field with a Hall sensor.
(courtesy of LEM)
The second difference is that the MOSFETs will only be switched on, when
the current reverses in the branch, consisting of the MOSFET and his par-
allel diode. That switching moment effectively cancels all dynamic issues in
either the diodes and the parasitic capacitors. In principle this alleviates
the need for the additional parallel diodes but imperfections in timing will
always remain so they are retained for additional robustness of the design.
One additional interesting observation can be done with this current-controlled
power stage. Like with a normal switching output stage the duty-cycle of
the switching needs to relate to the output voltage. This is true, because
the output voltage of the LC-filter equals the average of the high-frequency
signal before the filter and this average is determined by the duty-cycle.
As a consequence of the current source behaviour of this output stage, the
duty-cycle becomes a function of the load. In Figure 6.79 the duty-cycle
is 50 % and with a short-circuited output of 0 V, this duty-cycle remains
unchanged, even when the average current is not zero. The situation with a
resistive load was shown in Figure 6.78. In that case both the duty-cycle
and the frequency change.
The detailed description of all properties and signals in such an amplifier
would go far beyond the goal of this book, unfortunately, but it is interesting
to see how the current sensing can be done in such an amplifier.
444 Chapter 6. Analogue electronics in mechatronic systems
In very high power positioning systems, where these resonant power am-
plifiers are applied, the voltage levels are very high up to almost 1000 V,
with current levels of several tens of Ampères. The simple inclusion of
a series resistor to measure the current, would become a bit of a hazard.
Furthermore, the three-phase configuration, that will be presented in the
next section, requires the current to be measured at the high voltage output
of the amplifier. For this reason a lossless current sensor is used that works
with magnetism and active control.
The need for ever higher accelerations and velocities of advanced mecha-
tronic equipment has resulted in linear motors based on the Lorentz prin-
ciple with three-phase electronic commutation, that have a large range of
6.3. Power amplifiers 445
Three phase
magnitude set point R current R
transformaon
-
Phase
- current S 5 kW
?
set point S
controller actuator
serial link
(data & control) current T
The basic scheme of the second amplifier example of this chapter, that
was designed for driving a 3 phase motor with 5 kW power, is shown in
Figure 6.81. The digital controller provides a force setpoint via a serial
data link to the amplifier, while status information is transferred back to
the controller for diagnostic purposes. The force setpoint consists of the
magnitude and the position of the actuator as this is necessary to guarantee
that the currents are in phase with the actual B force value of each actuator
446 Chapter 6. Analogue electronics in mechatronic systems
coil segment.
With this information a mathematical phase transformation is executed that
creates two 120◦ shifted set point signals R and S with a sinusoidal relation to
the position. Because R+S+T=0, the third setpoint signal can be determined
by subtracting R and S from one. The three setpoint values are used as
inputs for a current amplifier where, after digital to analogue conversion,
each phase of the actuator is driven by a single-ended current-controlled
power stage with a single power supply.
Figure 6.82 shows how the three-phase actuator is connected to the three
high-power resonant-mode switching output stages. In this configuration the
three phases of the actuator coils are connected in a star configuration, with
one central connecting point and the other terminals of the coil sections are
connected to their corresponding phase terminal of the amplifier. Because
of the application of only one power supply, this amplifier becomes a special
version of the dual-ended amplifier from Section 6.3.3.8. The central point
will have a voltage of an almost constant level, equal to approximately half
the supply voltage. This voltage is constant as all currents at this node add
to zero and the first of Kirchhoff’s laws is fulfilled.
One additional advantage of using three-phase amplifiers is, that the same
rule of R+S+T=0 means that at any moment in time a positive low-frequency
current in one phase is compensated by the currents in the other phases.
This means also that the charge-pumping from one phase is compensated
by the charge-pumping of the other phases and as a result the net value of
charge-pumping is zero.
6.3. Power amplifiers 447
Vp+
M3
Phase R Vp+
Actuator
L
L Phase T
M5
Vp+ M4
L
M1
M6
Phase S
M2
1
JQ8
JQ6
JQ4
JQ2
!
J
>JQ2
>JQ4
>JQ6
>JQ8 R S T
>
?
Figure 6.82: A three-phase actuator only needs three single-ended output stages.
charge-pumping is avoided because all currents flow between the
positive and negative power supplies, just like with the dual-ended
configuration.
Optics in mechatronic
systems
449
450 Chapter 7. Optics in mechatronic systems
Section 7.3 deals with geometric optics. This is a practical method used to
calculate the imaging properties of an optical system, where the light is
approximated by ideal rays (ray tracing) and only the properties of materials
on reflection and refraction are taken under consideration. This method is
useful for a mechatronic engineer when designing simple optical systems
that might for instance be used in laboratory test setups, made from “off
the shelve” components.
For precision mechatronics the physical optics from Section 7.4 are the most
important. Physical optics deals with the effects that are based on the wave
character of light, including interference and diffraction. These effects limit
the ultimate imaging properties of an optical system and are also used for
position measurement with interferometers and encoders, as presented in
the next chapter.
Section 7.5 gives a short introduction on adaptive optics, where active con-
trol is used to correct imaging errors, enabling near-to-perfect imaging
properties of for instance terrestrial telescopes.
7.1. Properties of light and light sources 451
with:
When filling in some real numbers, the extremely low number of roughly
E = 3.5 · 10−19 J is obtained for 1 photon of visible light (λ ≈ 600 nm). A
low level of one μW (μJ/s) of visible light power would hence be equal to
about 3 · 1012 photons per second. This still immensely high number implies
that it is not always necessary to take this quantum character into account.
Nevertheless, with very sensitive measurements, the noise level is limited
by the photon character and it partly explains why a larger image sensor in
a photographic camera shows a better noise performance.
At several occasions it will appear to be practical to use the photons and
electrons for the explanation of the physical phenomena that are related
with light.
452 Chapter 7. Optics in mechatronic systems
Visible light
12,000 K
5,500 K
2,000 K
Many sources of light exist, ranging from thermal radiation by heated bodies
to secondary emission of photons by electrons that change momentum or
change state within an atom. Thermal radiation is often related to the
visible light spectrum and is caused by the increasingly fast vibrations of
charged particles inside atoms at increasing temperature. These vibrations
of charge create electromagnetic waves, following the Maxwell equations.
Due to this thermal nature of origin, the induced light consists of a wide
range of wavelength values with a peak at a wavelength, which is determined
by the absolute temperature in ◦ K according to “Wien’s displacement law”,
named after the German Physicist Wilhelm Carl Werner Otto Fritz Franz
Wien (1864 – 1928). He stated that the shape of the wavelength distribution
of a black body, defined as having a surface with a frequency independent
emissivity, is independent of the temperature, while the maximum emission
occurs at a wavelength equal to:
b
λ= [m] (7.2)
T
7.1. Properties of light and light sources 453
With b being the Wien’s displacement constant of ≈ 2.9 · 10−3 [mK]. With this
relation a pure black body starts radiating visible red light at a temperature
of about 2000 K. This corresponds with a peak radiation in the wavelength
area of about 1500 nm, so most of the light is still in the invisible infra
red region. At higher temperatures the peak emission will move to shorter
wavelengths, while a black body temperature of 5500K has its peak emission
around 530 nm, which equals the mid of the visible spectrum, like the
daylight as emitted by the sun. A higher temperature will shift the peak to
even shorter wavelengths into the invisible ultraviolet region. Figure 7.1
shows the spectra of these three examples.
365 436
405
546
Relave Spectral Power level
579
Figure 7.2: The light power spectrum of a mercury arc gas discharge lamp shows
peaks corresponding with different energy states of the electrons around
the mercury atoms in the plasma.
the process. These plasma gas discharge lamps radiate light at very specific
wavelengths, determined by the gas atoms inside the plasma. An industrial
mercury gas discharge lamp radiates with a spectrum as shown in Figure 7.2
with different narrow peaks of which the peak at 365 nm (I-line), at 405 nm
(H-line) and at 436 nm (G-line) were used in the illumination of the early
versions of wafer steppers.
The well-known fluorescent tube is also a Mercury type gas discharge lamp
that works a bit differently. It does not work with a real plasma but the
electrons are emitted by heated electrodes at the start and move to the
other electrode through vacuum that is slightly doted with mercury atoms
of which the outer electrons will be excited by the moving electrons. The
230 V AC voltage of the mains is not sufficiently high to start the lamp. For
this reason a coil with a large self-inductance is used to both create the high
starting voltage by interrupting the current through the coil by a “starter”
switch and to heat the two electrodes by this starting current, in order to
emit the first electrons that have to start the process. The peaks at 546 and
579 nm in the visible light region of Mercury are not sufficient to give an
acceptable light spectrum for domestic use and most of the light is emitted
in the invisible ultraviolet frequency range. For this reason a fluorescent
7.1. Properties of light and light sources 455
tube is coated at the inside with special fluorescent powder that converts the
UV light into a wider spectrum in the visible light range, increasing both
the efficiency and improving the colour. Although widely used in general
lighting in buildings, these fluorescent tubes are of limited importance as
light sources for precision mechatronic purposes due to their low radiance,
a term that will be defined later and is related to the large surface that
radiates the light with these lamps.
Much more interesting for the mechatronic engineering field is the laser,
which is the subject of the next section, but also the Light Emitting Diode
(LED) is shown to be of increasing importance. LEDs are already used
for many years as indicator in electronic equipment and only in the last
few years they became known for illumination purposes. The principle of
working of an LED is also based on electrons that change their excitation
state, but now not within a gas but in a solid material. For that reason
LEDs are also called solid-state lamps.
An LED is a special kind of semiconductor diode, of which the working prin-
ciple was presented in Chapter 6. P and N material are combined causing
either a barrier or a conductive path for the electric current, depending on
the direction of the voltage.
In case of conduction, the electrons recombine with the holes at the transi-
tion zone from n to p material. This recombination implies the filling of an
empty place around an atom, which corresponds to a change of the energy
level. This would result in emission of a photon, were it not that Si or Ge
are indirect band-gap materials, showing no radiation at recombination.
This means that a normal diode will not emit light nor require the power
that corresponds with that light emission. While this is not a problem for
“normal” semiconductor diodes it does not create light. Fortunately other
materials, composed from materials of adjacent valence bands (III and V) of
the periodic system like Ga,As,In,P and N, have a direct band-gap and they
do result in diodes that emit photons with a wavelength depending on the
material, at the cost of additional electric energy. For instance GaAs gives
infra red light, AlGaAs gives red light and GaAsP gives yellow light.
It has long been quite difficult to create shorter wavelengths, because that
requires a wider band gap and not all material combinations are easily
to manufacture. Eventually GaN appeared to be the right combination to
create blue light.
The single colour of an LED and its rather low power of initially only a few
456 Chapter 7. Optics in mechatronic systems
The ultimate light source is the laser. Laser is an acronym for Light Amplifi-
cation by Stimulated Emission of Radiation and is based on the phenomenon
that an electron at its excited state can be stimulated to drop to its lower
state by a photon. This photon needs to have the same frequency as the
frequency of the photon that the electron would emit by its change of energy
state. In other words, one photon will create another identical photon by
“triggering” an excited electron to drop to a lower energy state.
Like shown in Figure 7.3 one photon could stimulate two or more electrons
resulting in three or more photons with the same frequency and phase!.
This phase relationship is called coherence.
This physical process would of course come to an end, when all electrons
have reached their lower state E 0 . The process of emitting photons, when
lowering the energy level of an electron, is fortunately reversible. This
means that another light source, not necessarily coherent, like a normal gas
discharge lamp, can be applied to excite the electrons to an energy level E 2
above the excitation state E 1 needed for the stimulated emission.
This higher energy level E 2 is even less stable than E 1 , so the electrons will
drop to E 1 , while sending a photon of a frequency that is not used in the
lasing and can be neglected in the considerations. The additional external
light source for exciting the electrons is called an “Optical Pump”.
It can be concluded that the complete system amplifies the incoming photons
with a gain that depends on the frequency of the photons. This frequency
is not infinitely sharply fixed, but has some tolerance due to uncertainty
7.1. Properties of light and light sources 457
E2
E1
Electron - - Photon
Photon
Photon
E0
considerations on the energy levels of the electrons of which the deep treatise
exceeds the scope of this book.
To accommodate this effect, the term “gain bandwidth” is used, which is
quite different from the gain-bandwidth product that was defined for an
operational amplifier. Optics designers often use the term Full-width at
half-maximum (FWHM), the width of the frequency band that covers half
the optical output power, as shown in Figure 7.4. This corresponds with the
well-known -3 dB point on a logarithmic scale of electronic engineers.
The gain bandwidth tells something about the capability of the laser to
amplify photons of a specific frequency within the bandwidth range and it
automatically determines the bandwidth of the emitted light spectrum from
a laser. Other words in literature for this quality are “line width” of a single
frequency laser and “optical bandwidth”.
The bandwidth of a laser is one of its determining factors in the suitability
as a light source for technical applications. For imaging purposes a high
bandwidth can result in chromatic aberrations due to the frequency depen-
dent properties of many lens materials. On the other hand a very small
bandwidth can give problems due to the occurrence of scattered speckle,
caused by interference. This phenomenon is known from the light of a laser
458 Chapter 7. Optics in mechatronic systems
Frequency
Figure 7.4: Definition of the full width at half maximum power gain bandwidth of
a laser light source. The smaller the bandwidth is, the better is the
temporal coherence of the laser.
pointer which shows tiny spots over the illuminated area that change as a
function of displacement and angle of the incident light. These spots could
impair the image of an object that is illuminated by a laser.
An other important property of a laser, partly related to this bandwidth
and the occurrence of speckle, is the coherence. Two kinds of coherence
exist namely spatial coherence and temporal coherence. The spatial coher-
ence relates to the correlation between the electromagnetic field inside the
laser beam at a certain moment in time as function of the location in
the beam. It is a measure for the perfection of the shape of the emitted
wavefront, a term that is explained later. The spatial coherence determines
the divergence of the laser beam and hence the capability to keep a fixed
diameter over a long range.
The temporal coherence of a laser relates to the correlation between the
electromagnetic field on a certain location inside the laser beam as func-
tion of time. It is related to the bandwidth of the light from a laser and
decreases as a function of the distance from the laser source. The correla-
tion between the temporal coherence and the bandwidth is illustrated in
Figure 7.5 where 7 graphs are shown of frequencies with a little difference
between each frequency. In case all frequencies are in phase at the light
source, which means that all values of the field are equally zero at t = 0, then
the coherence disappears at some distance from the light source, at a time
t = t c that relates to the distance with the propagation speed of light. This
time depends on the bandwidth of the source. With a wide bandwidth, the
time is very short and with a single frequency source, this time is very long.
The time that passes, until coherence is lost, is called the coherence time.
Because light propagates at a constant speed in vacuum, this coherence time
7.1. Properties of light and light sources 459
value
t= 0
Time
Figure 7.5: Electromagnetic field waves as function of time for light of different
frequencies, starting all in phase. It shows the loss of phase relationship
after a certain propagation time that corresponds to the distance from
the light source.
directly relates to the more well-known and for technical applications more
useful term coherence length, being the coherence time times the velocity of
light.
In laser interferometry measurement systems as presented in the next chap-
ter, the required coherence length is often several metres. As an example to
get a feel for the required bandwidth of the light, a coherence length of ten
metres implies that the lowest frequency in the bandwidth will need to be
less than one period different from the highest frequency in the bandwidth
over this length of ten metres. One period of for instance 500 nm relates
to 10 m as 5 · 10−8 , so the bandwidth can not be more than 5 · 10−8 times the
frequency.
This kind of precision is not achieved by the electron-photon physics in the
laser only. Lasers, as used for high precision measurement systems, often
consist of a combination of a laser tube with an optical resonance chamber,
according to the Fabry-Perot principle as will be presented in Section 7.4.2.1.
This resonator consists of two parallel mirrors at a very stable distance,
such that a standing wave occurs between the mirrors. An example of a
laser utilising this principle is the Helium Neon (HeNe) laser as shown
in Figure 7.6 which radiates red light at 632.8 nm. This type of laser is
already for many years used for distance measurement purposes, because
the wavelength can be additionally stabilised by means of Iodine gas that
shows a very narrow natural frequency and enables active control of the
460 Chapter 7. Optics in mechatronic systems
Glass envelope
- High Voltage + High Voltage
Output
Figure 7.6: Cross section of a Helium Neon Laser. A high voltage creates a plasma
where the electrons of the gas are continuously excited to their higher
energy state and emit photons of different wavelengths, while falling
back from that state. Lasing takes place induced by photons of only
one wavelength, leading to a maximum emission at 632.8 nm. The
two reflectors create an optical resonator that further enhances the
frequency stability of the laser.
distance between the mirrors. It enables the use of this laser as a reference
for the standard metre. Next to this precision HeNe-laser, a large amount of
different laser types are introduced of which the semiconductor diode-laser
has been instrumental to the optical registration of signals on a disk like
the red laser diode for the CD-player and the blue lase diode for the Blu-Ray
player. These diode-lasers work according to the same principle as an LED
but with additional internal lasing and small mirror facets on the exit points
of the semiconductor material to create a Fabry-Perot resonance chamber.
Besides the frequency and phase of a light source, also the radiated energy,
the size of the source and the direction of radiation are of importance. These
all influence the amount of light that can be captured by an optical system
and transmitted to the area of interest.
The illumination power Pi [W] of the radiated light, sometimes also called
the radiant flux [Φ], is not sufficient to define the usefulness of a light
source. First of all it contains all frequencies. But even when a light source
is monochromatic, the pointing direction, spread and the radiating surface
are undefined. Many terms are introduced to narrow down these undefined
parameters of a light source in relation to the optical system, of which the
following are explained here:
• The radiance.
The radiant emittance M from a light source equals the illumination power,
radiated by the source, divided by its radiating surface W/m2 .
The radiant emittance closely relates to the term irradiance that was defined
in Chapter 2. The irradiance I r equals the power density of the received,
incident light on a defined surface W/m2 . Often the term intensity is
also used for this term but the definition is not very clear as in optics and
radiology the term intensity is also used for the energy flow per unit of solid
angle. For this reason the term irradiance is preferred in optics.
The irradiance determines all observations in optics as it is the only property
of the light that can be measured with a sensor or the human eye. A photo
detector with a given surface converts the light power, being the irradiance
times the surface, into an electric current.
With a given value for the radiant emittance of a source, the irradiance,
received at a surface in its neighbourhood, decreases proportional to the
squared distance from the source. Contrary to the terms from the next part,
the irradiance can be increased again by concentrating light on a spot with
an optical system.
7.1.3.2 Radiance
The size of a light source relates to its spatial coherence and for that reason
a point source, with an infinitely small radiating surface, pointing into the
right direction, appears to be the most ideal in respect to the performance of
an optical system. In that respect, a perfect parallel laser beam fulfils that
demand, as it can be modelled as a point source located at infinity, pointing
in only one direction.
The term radiance is used to describe the extent to which a light source
behaves like such an ideal point source. The radiance is proportional to the
radiant emittance and for that reason it is also often called the brightness
of a light source. As will be shown, the radiance also includes the radiated
direction of the light. With a real point source, the radiance would by
definition be infinite.
In physical terms the radiance L, see Figure 7.7, is defined within the next
expression for the illumination power, radiated from an infinitesimal part
of a light source in a certain direction:
With:
462 Chapter 7. Optics in mechatronic systems
Electrode
Plasma arch
dA
Normal on dA
ϑ
Glass bulb
Electrode
• dΩ: The solid cone angle that defines the radiating direction of an
infinitesimal radiating surface element d A . [sr]
• ϑ: The angle between the surface normal and the direction of the
radiating solid cone. [rad]
When the radiance is constant over the total surface A and the solid cone
angle is pointing in the direction of the normal on the surface, the expression
of the total illumination power of a light source into a certain direction,
defined by the solid cone angle Ω, can be simplified into:
Pi,t = LA Ω [W ] (7.4)
optics in lithographic exposure systems, the radiance can be increased at the image because of
the increased refractive index.
464 Chapter 7. Optics in mechatronic systems
7.1.3.3 Etendue
From the previous it can be concluded that the radiance is maximum when
the light is radiated from a very small surface at a very small angle. Unfor-
tunately only an ideal laser is capable of creating such a perfection and in
reality the optical system can only collect a part of the light.
For this reason another quality term is introduced, the optical throughput,
also often named with the French word etendue. The etendue is a measure
for an optical system of its ability to collect as much as possible light from a
source and retain as much as possible of the original radiance.
The full calculation of the etendue requires an integration of the light sent
by all infinitesimal surface elements of the source and received by all in-
finitesimal surface elements of the entry-pupil2 of the optical system. In
the practical case of a light source with a small surface, relative to the
entry-pupil of the optical system, where also both surfaces of the radiating
source and the entry-pupil are essentially parallel (ϑ = 0), the etendue G
can be approximated as:
2 As will be explained later, the entry-pupil is the cross section of an optical system that
But what does it do, how does it find out? Does it smell the nearby
paths and check them against each other? The answer is yes, it
does, in a way.
B
O
A
i r
C D E Mirror
B’
Figure 7.8: The path of the photons, reflecting from the surface of a mirror derived
by the principle of least time, results in equal angles of incidence ϑi and
reflection ϑr .
lines as long as they remain in the same medium, the shortest time cor-
responds with the shortest distance. This shortest distance is a straight
line that can be drawn directly from A to B, but that path is obstructed by
screen O and the photons can only arrive at B via the mirror.
Evidently the path via C is not taking the least time and also via D it is
not optimal yet. To determine the exact right place to reflect, a little trick
is used and a mirrored place B’ is drawn. When drawing a straight line
between A and B’ it is known that this is the shortest path so, when the
photon is reflected at the intersection point E, this is the shortest path to B.
With a little bit of trigonometry it becomes clear that the angle of incidence
ϑi equals the angle of reflection ϑr , which corresponds with what was taught
at high school.
This relation is not too difficult to derive for simple reflection, but the real
value of Fermat’s model is in predicting the path, when the photons go from
one medium to another. For this derivation Figure 7.9 is drawn, where
photons from the same source A located in a low density medium (air) now
have to reach position B located in a high density medium like glass.
First of all it was stated by Fermat that a more dense medium will slow
down the light. The speed of light ( c m ) in any medium compares with the
7.2. Reflection and refraction 467
i r
F
Air n1 C E
Glass n2 D
G t
Figure 7.9: The path of photons refracting at the interface between two different
media, derived by the principle of least time, results in unequal angles
of incidence ϑi and refraction ϑt . The shortest path is not the fastest!
Note that also some light is reflected.
speed of light in vacuum ( c) with a factor equal to its refractive index (n).
c
cm = (7.8)
n
This means that it takes more time to propagate in a high density medium,
than it would take in a low density medium for the same distance. To
accommodate this difference in speed, the term Optical path length (OPL)
is often used, where OPL equals n times the real geometric path length.
In terms of time the method of Fermat is equal to stating that photons
search for the shortest optical path, hence taking the refractive index into
account3 .
The shortest path from A would be a straight line intersecting the interface
between the media at C. This geometric path does however not correspond
with the shortest optical path with the least time, as the part from C to B
would add considerable to the time.
One might say that the photons recognise that it is better to stay a bit longer
in the air and find a more optimal path, the shortest optical path.
When moving the intersection point from E to the right, first the gain
3 One might conclude that photons slow down in a high index medium, but photons always
travel with the same constant speed. The observed delay is caused by interaction with the
electrons in the medium, where the original photons are exchanged for new photons with a
small delay.
468 Chapter 7. Optics in mechatronic systems
〈EF〉 equals 〈DE〉 times the sine of the angle between points F, D and E.
From trigonometry it follows that this angle is equal to the angle of incidence
ϑi .
Likewise 〈DG〉 equals 〈DE〉 times the sine of the angle between points E,D
and G, being equal to the angle of the transmitted and refracted light ϑt .
From this all follows:
After cancelling 〈DE〉, this equation gives the law of Snell, named after the
Dutch mathematician and astronomer Willebrord Snel van Royen(1580 –
1626), also known with the Latin name “Snellius”. When light goes from
a medium with refractive index n1 to a medium with refractive index n2
Snell’s law becomes as follows:
When calculating the refracted angle from a known incident angle, Snell’s
law is written as:
n1
sin ϑt = sin ϑi (7.12)
n2
7.2. Reflection and refraction 469
The above reasoning from the least time also shows that reversing the
direction of the photons, so going from glass to air will result in the same
shortest path, followed in the reverse direction. In this reversed situation,
the angle of incidence in the original law of Snell becomes the angle of
refraction and vice versa, resulting for the refracted angle as function of the
incident angle:
n2
sin ϑt = sin ϑi (7.13)
n1
Because sin ϑt can never be larger than one, above the critical angle ϑi =
arcsin n 1/n 2 no light will refract any more and total reflection occurs. This
effect is known from light guiding in fibres and flat glass or acrylic plates,
when illuminated from the side.
This effect does of course not occur only at or above the critical angle and
in reality at any optical surface a combination of reflection and refraction
occurs. For reflected light the irradiance ratio to the incident light is given
in the following equation:
Source
Figure 7.10: Wavefront model of the propagation of light. Concentric spheres rep-
resent photons that simultaneously originated at the source in the
centre of the sphere. The distance between the spheres represents
either the wavelength of the light or a proportional value thereof.
Before applying these rules on reflection and refraction, first the concept of
a wavefront is introduced. When observing a number of photons in vacuum,
all originating from a light source at t = 0, these photons will move with the
same constant speed of light in different directions away from the source.
At any moment in time, they will all be at the same distance from the source
and their position is defined by a sphere with its centre at the source.
The wave equivalent of this reasoning is as follows. At any moment in time,
light from a monochromatic coherent light source in vacuum will be in phase
on any concentric sphere around the source. For that reason these spheres
are called a wavefront, which is a mixing of the motion character of the
photons and the continuity character of the waves.
When working with this thinking concept of a wavefront, it is necessary to
remain aware that it is a very artificial concept as in reality light never is
completely monochromatic nor completely coherent over long ranges, like
was explained with the laser. A coherence length of metres is large, so
light from a star is never coherent. Nevertheless it is a very useful concept,
because it visualises different optical effects.
It is used to illustrate phase relations and the local direction of the light is
always orthogonal to the wavefront. A curved wavefront corresponds with
diverging or converging light rays and a flat wavefront corresponds with
parallel rays of light. This is illustrated in two dimensions in Figure 7.10,
7.2. Reflection and refraction 471
Figure 7.11: Distortion of a flat wavefront caused by the reduced propagation speed
of part of the wave, when passing through a medium with a higher
refractive index.
where the spheres are approximated by circles. The waves originating from
the point source are depicted, as if they have only one frequency with a
wavelength equal to the distance of the solid or dashed lines, while the
solid lines represent the positive maxima and the dashed lines the negative
maxima of the sine wave.
Often the dashed lines are shown only when the phase relationship is im-
portant. When only the direction is investigated, they often are omitted.
In reality the spheres would need to have a very small distance, equal to the
wavelength of the light and they would need to move with the speed of light.
This means by definition that a drawing of a wavefront only shows a snapshot
at a distinct moment of a propagating travelling wave. It does by no means
imply that a wavefront drawing represents a standing wave.
When examining interference at a very small scale, like in Section 7.4, the
distance between the spheres is equal to the wavelength. At large scale
optics however, when only the direction of the light matters, often drawings
on wavefronts are used with much larger distances of the lines, where their
distance is only proportional to the real wavelength.
From Figure 7.10 it is clearly observable that the wavefront becomes more
and more flattened out with increasing distance from the source, which
explains the almost parallel rays of light coming from the sun.
Several qualitative statements can be derived with the wavefront model
472 Chapter 7. Optics in mechatronic systems
regarding the effects of changes in the refractive index of the medium where
the waves pass through. In case these media are not homogeneous, the
waves will partly slow down resulting in a distorted wavefront, like shown
in Figure 7.11. In fact the medium in blue acts like a kind of lens and it
illustrates the observed effects, when we look through non homogeneous
gas or fluid, like the air above a heated surface.
In Figure 7.12 the previously described reflection and refraction is shown
with flat wavefronts and a shorter wavelength (lower propagation speed) in
the glass. From the fact that at the connection points both reflected and
refracted waves should have a fixed phase relationship, the law of Snell
can also be derived in a graphical way. On purpose, the term “fixed phase
relationship” is used, instead of “the same phase”, as at reflection from
a less dense medium to a more dense medium a 180◦ phase shift occurs,
comparable with the rigidly connected chain of mass spring systems in
Chapter 2. More details on this aspect can be found in specialised books,
dealing with the Fresnel equations.
Air
Glass
Figure 7.12: Reflection and refraction at the interface between two different me-
dia derived by using wavefronts. A higher refractive index causes a
shorter wavelength that at the interface is in phase with the incom-
ing wavefronts. The reflected wavefront shows 180◦ phase reversal,
because of the reflection from a low density to a high density medium.
7.3. Geometric Optics 473
Parallel rays of light approaching a piece of glass with two parallel planar
surfaces will be refracted and continue their path with the same parallelism
and distance. This is the case, because the refraction at the point of insertion
is compensated by the refraction at the exit point of the glass. This can be
seen in Figure 7.13 that also shows what would happen with a piece of glass
with two curved parallel surfaces with the same curvature. Even in that
474 Chapter 7. Optics in mechatronic systems
A
B
Figure 7.13: Refraction of two pieces of glass with parallel surfaces, one flat and one
curved, shows that only a shift of the lines will occur. The refraction
at point A is reversed at point B
.
case the rays would propagate approximately parallel after the passage and
only their distance will be changed.
In order to achieve an optical image it is necessary to use a piece of glass with
a varying thickness, slowing down the rays depending on their incidence
angle and position.
The invention of a lens is already more than three thousand years old. By
coincidence of nature it is far more easy to achieve a curved surface than
a flat surface with optical quality. When polishing a piece of glass by a
circular motion against a relatively soft counter surface with abrasive paste,
automatically a curved surface will result. The abrasion takes place most
at the outer part of the piece of glass, because the outer part makes a larger
movement in this process and the force exerted near the edges is also larger,
resulting in an essentially spherical surface.
As a first step into the modelling of the imaging properties of a lens, the
refraction at a spherical surface with radius R is shown in Figure 7.14.
First the optical axis is defined as being the axis of rotational symmetry of
the optical system. A ray parallel to the optical axis, entering the surface at
point A, is refracted in the direction of the normal towards the centre of the
surface, according to the law of Snell. A ray drawn at the optical axis would
have an angle of incidence of 90◦ in the Vertex. As a consequence this ray
would continue its path with the same direction and intersect the first ray
at F.
It can be reasoned that all parallel rays at the same distance from the optical
axis as the first ray at A will after refraction cross the optical axis at the
7.3. Geometric Optics 475
ϑi A
ϑr
Opcal axis Vertex F
Focal point
R
Figure 7.14: Refraction at a spherical surface results in converging rays that inter-
sect at the focal point F .
same point F, because of the rotational symmetry and the point F is called
the focal point.
As a an approximation also rays entering closer to the optical axis will bend
towards the same focal point F with only a small error causing the “spherical
aberration” of a lens. This aberration is small, when only those rays are
considered that are very close and almost parallel to the optical axis, the so
called paraxial rays.
• F i , the distance of second focal point to the lens is positive when located
at the right side of the lens.
Figure 7.15: Different lenses exist with names that are determined by the curvature
of their two surfaces.
• R , the radius of the surface of the lens is positive if the centre is located
at the right side of the lens.
A real lens consists of two surfaces with different curvatures where the
same law of Snell can be applied.
A wide variety of lenses exist, as shown in Figure 7.15. The different names
are derived from the curvature shape. A rounded surface is either convex,
flat (plano) or concave and all combinations can be applied.
A biconvex lens is called a positive lens as it will bend the rays towards
the optical axis (converging). Consequently a biconcave lens is a negative
(diverging) lens, and any other combination is positive or negative depending
on whether the optical path through the lens is largest near the optical axis
or at the outer part of the lens.
A combination of a convex and a concave surface is called a meniscus because
of the similarity with the shape of the human meniscus and it is the standard
configuration for eye-glasses.
The first example of imaging optics that will be shown in this section, is
the positive biconvex lens from Figure 7.16. This lens refracts the rays in
7.3. Geometric Optics 477
R2 R1
f
?
'
d
R2
R1
f
'
Figure 7.16: A positive lens, consisting of two convex surfaces that both converge
the refracted rays to the optical axis, gives a shorter focal length, than
with one convex surface only.
A negative lens, consisting of two concave surfaces that both diverge
the refracted rays, creates a “virtual” focal point at the same side as
the object.
a converging direction towards the optical axis. It can be seen that both
surfaces refract the rays that come from the left in the same direction,
resulting in a focal point closer to the lens than would be the case with one
surface only. With a negative biconcave lens the opposite result is obtained.
In this case, the rays are refracted away from the optical axis and seem to be
originating from a point located at the left side of the lens. This non-existing
point is called a virtual focal-point, because it is not the real origin of the
rays that are observed at the right side of the lens.
The same method of “the least time” by Fermat can be used to derive the
equation for the focal length of a lens but it is also possible to use the law of
Snell directly.
478 Chapter 7. Optics in mechatronic systems
So Si
Object
Fo Fi
Image
f f
Figure 7.17: A positive thin lens gives a real image, when the object is further away
than the focal length f .
Without showing the total derivation, in both cases the following equation
can be derived for the focal length of the lens:
1 1 1 ( n − 1) d
= ( n − 1) − + (7.15)
f R 1 R 2 nR 1 R 2
with:
With a very thin lens, this equation simplifies into the thin-lens equation
also called the lensmakers equation:
1 1 1
= ( n − 1) − (7.17)
f R1 R2
assumed always to come in from the left en exit from the right, according to the sign convention.
Further the optical axis is always represented by the centre line like in the previous figures.
7.3. Geometric Optics 479
figure, a real image can only be realised under the condition that S1 is larger
than f . With thin lenses the location of the image can be approximately
determined by using two rays of which the path or trace is followed. This
process is called ray tracing. The first ray is drawn through the middle of
the lens and does not change direction nor position. This is allowed as long
as the lens is very thin. The middle of the lens will then behave like two
parallel surfaces.
The second ray is drawn parallel to the optical axis and will be refracted to
the focal point of the lens. The image is found at the intersection of the first
and second ray.
The relation between S1 and S2 is determined by the focal distance f , fol-
lowing a simple relation that can be derived by trigonometry. This relation
is called the Gaussian lens formula:
1 1 1
+ = (7.18)
S1 S2 f
A simple sanity check proves this formula, as an object at infinity will result
in an image in the focal point and vice versa. It also is imaginable that a
distance of the object from the lens at twice the focal length results in full
symmetry of object and image.
7.3.1.3 Magnification
In the previous example it was shown that the size of the image differs
in most cases from the size of the object, resulting in a magnification M
that depends on the position of the object relative to the focal point of the
lens. This magnification can also be derived by means of straightforward
trigonometry, showing that the magnification is proportional to the ratio
between the distances. As a consequence of the sign conventions that were
defined previously, the magnification M of a lens has to be written as follows:
S2
M=− (7.19)
S1
The minus sign represents the image location at the opposite side of the
optical axis as the location of the object.
As a next step, it is interesting to see what happens when the object comes
closer to the lens. At the image location this will result in an increasing
distance to the right until the object is located at the focal point, resulting
in an image at infinity. When the object approaches the lens even further,
480 Chapter 7. Optics in mechatronic systems
So
Si
Virtual Object
Image Fo Fi
?
' f f
So
Si
Object
Fo Fi
Virtual
Image
' f f
Figure 7.18: A positive lens will give a virtual image, when the object is closer to
the lens than the focal length. A negative lens always gives a virtual
image, independent of the position of the object.
first the image jumps from +∞ to −∞ and then the image will approach the
lens again, but now from left as a virtual image. This is illustrated in the
upper drawing of Figure 7.18, where the resulting diverging set of rays at
the right side of the lens seem to originate from the virtual image.
The rules for magnification and location of the object and image are still
valid in this situation and because of the negative value of S2 , the sign of
the magnification becomes positive. This corresponds with the position of
the image at the same side of the optical axis as the object. Also the position
of the image can be derived from the Gaussian Lens formula, where S2
becomes negative if S1 < f .
Because of the virtual focal point, it is to be expected that a negative lens
7.3. Geometric Optics 481
will give a virtual image. This is indeed confirmed in the lower drawing of
Figure 7.18, where the image is derived with the same rays, one parallel to
the optical axis and one through the centre of the lens.
It is however not possible to create a real image with a negative lens by
shifting the object closer to the lens. This can be concluded by examining
the two rays in the figure, when the object would shift to the right. Even
past the focal point, the rays will remain diverging at the right side of the
lens, while the virtual image always remains closer to the lens then the
object.
7.3.2 Aberrations
to reach the same focal point as where a beam closer to the optical axis is
directed. Ultimately, with the most outer rays, the angles become so large
that the angle of incidence at the exit surface becomes larger than the angle
of total reflection and the light can not escape anymore.
Although the pure sphere is an extreme example, all lenses with spherical
surfaces show spherical aberration, because the radius of curvature at a
larger distance is too small to refract the outer rays to the same unique focal
point. This reasoning implies that an increase of that radius as function of
the distance can solve the problem. This deviation of the spherical surface
is called asphericity with a departure value d as shown in the figure.
An aspheric surface can not be produced by the standard production meth-
ods for polished optical surfaces. Computer controlled milling, grinding
and polishing machines are used to create these aspherical surfaces. An
example of such a machine is the magneto-rheological polishing machine of
the company QED shown in Figure 7.20. It uses the property of fluids that
stiffen by a magnetic field to concentrate the polishing force very precisely
on the surface of the lens where material has to be removed. The fluid is
continuously refreshed and filtered, which means that the process parame-
ters are not influenced by wear or floating glass particles.
Due to the high cost level of these machines the departure magnitude is a
7.3. Geometric Optics 483
Spindle
Rotaon
Lens Lens
sweeps
through
MR Fluid Nozzle Sucon
Sheared Pressure
uid distribuon Wheel
MR uid condioner
7.3.2.2 Astigmatism
F2
F1
IS
IT
Figure 7.21: Astigmatism is the effect that is observed, when the rays in one plane
have a different focal point than rays in a perpendicular plane. This
is caused by a cylinder shape superimposed on the spherical shape of
the lens. Astigmatism in the image of an object that is not located on
the optical axis is recognised by different focal distance in two planes,
the sagittal plane with image (IS ) and tangential plane with image
(IT ) .
plane, is defined by the point on the object and the optical axis. The second
plane, the sagittal plane, is orthogonal to this plane and also includes the
point on the object that is analysed. This means that the sagittal plane does
not include the optical axis. With a single lens element it intersects the
optical axis at the centre of the lens.
Rays in the tangential plane are called tangential rays and consequently
rays in the sagittal plane are called sagittal rays. Astigmatism occurs when
the sagittal and tangential rays have a different focal point and this can even
occur with a rotation symmetric lens. In that case however the astigmatism
for an object point on the optical axis would be zero.
A lens with a partly cylinder shape causes even astigmatism on the optical
axis as was shown in the upper drawing of Figure 7.21 because of the non
rotation symmetric shape.
7.3. Geometric Optics 485
Image
Figure 7.22: Coma only occurs for non paraxial rays and is caused by a difference
in focal length for rays that enter the lens at different positions. As a
result the image looks like a “comet-tail”.
7.3.2.3 Coma
While spherical aberration and astigmatism can both occur even with parax-
ial rays and objects at the optical axis, other aberrations only occur with
rays that correspond with an image that is not located on the optical axis.
An example of this kind of aberrations is the coma as shown in Figure 7.22.
Coma is caused by the same phenomenon as spherical aberration. A larger
angle of incidence at the entry surface of a lens results in an excessively
refracted beam at the exit surface. This means that the rays that enter the
lower part of the lens, where they are almost perpendicular to the surface,
will be directed to a different focal point at a larger distance than the rays
entering the upper part of the lens.
From the figure it can be seen that in this case also no real sharp focal point
can be found. The image will look like a comet shape, depending of the
position of the chosen image plane. Coma is recognised by a rather sharp
and intense spot with a flare of decreasing intensity but increasing size in
the direction of the optical axis.
Figure 7.23: Field curvature is the effect that the part of the object that is further
away from the optical axis is imaged at another distance from the lens
than the part that is close to the optical axis. Although again caused
by the same mechanism as spherical aberration and coma it is not the
same phenomenon.
Figure 7.24: Barrel and Pincushion distortion are caused by a difference in mag-
nification between parts of the object at a different distance from the
optical axis.
defined later.
Distortion is an error in the magnification of a lens. When the magnification
of the lens is depending on the distance of the object to the optical axis, the
shape of the image of a large subject becomes distorted. Figure 7.24 shows
the two most frequently observed distortions in for instance photo cameras,
the barrel distortion where the magnification decreases for objects further
away from the optical axis and the Pincushion distortion with the opposite
effect. These two examples are only first-order linear types of distortion and
in reality also higher-order distortion effects can occur.
Chromatic aberration is caused by the difference in refractive index of a
transparent material for different wavelengths. This property is called
7.3. Geometric Optics 487
The mentioned aberrations all have impact on the quality of the image
and hence need to be reduced to an acceptable level, depending on the
application.
By introducing an aspheric surface it was demonstrated how spherical
aberration can be avoided and asphericity can also be used to reduce other
errors like coma.
With low cost optics and in case of severe size limitations, like with the
photo camera in a cell phone, this is the only way to solve these problems.
Another example of such a limited optical system is the lens for the optical
pick-up unit of a CD-player. These lenses are often made from plastic,
molded in the desired aspheric shape.
In high performance imaging systems, this aspheric single-lens solution
is not sufficient and a multitude of lens elements, both spherical and as-
pherical, are often combined to create an imaging system that is minimally
488 Chapter 7. Optics in mechatronic systems
Crown Flint
Figure 7.26: By combining a positive lens made of Crown glass and a negative lens
made from Flint glass an achromatic doublet is obtained, where the
chromatic aberration of a single lens element is compensated.
From the wide variety of combinations with positive and negative lenses,
only the combination of two positive lenses will be explained in this section
as representative example.
Figure 7.27 shows this configuration, where the distance between the positive
lenses is smaller than the sum of their focal lengths.
The determination of the image location is done in two steps. First the
image I1 is determined that would be created by the first lens L1 only.
As explained previously, this is achieved by drawing two rays. The first ray
7.3. Geometric Optics 489
f2 f2
L1 d L2
1
3
O 2 F1,i
I2
F1,o 4 F2,o F2,i
I1
f1 f1
Figure 7.27: Optical system, consisting of two positive lenses positioned close to-
gether. The image is determined by first determining the image by
L1 via ray 1 and 2 and use that intermediate image to construct the
image after L2 by means of an additional ray 3 drawn from I1 through
the centre of L2 combined with ray 4 through both focal points.
is drawn parallel to the optical axis and will be refracted through the focal
point at the image side F1,i . The second ray propagates through the optical
centre of L1 so it is not refracted. At the cross section of these rays, image I1
is drawn in a dashed shape, because in reality the rays will be intercepted
by lens L2 .
With the help of image I1 , the third ray can be drawn, which is the one that
would pass through the optical centre of L2 and image I1 .
As a last step to determine the image, the fourth ray propagates from the
object through the focal point F1,o , resulting in a refracted beam after L1
parallel to the optical axis, that will in its turn be refracted by L2 through
its focal point F2,i .
The image of the combination of the two lenses is found at the intersection
of these last two rays.
The two lenses appear to behave as one single lens and with a bit of trigonom-
etry the following approximative relation for the focal length of the combi-
nation can be derived:
1 1 1 d
= + − (7.20)
f f1 f2 f1 f2
Which for very small values of d relative to f 1 and f 2 becomes:
1 1 1
= + (7.21)
f f1 f2
It is also clear that the image is inverted, like with a single positive lens.
490 Chapter 7. Optics in mechatronic systems
L1 L2
I2
O
F1,o F1,i F2,o
I1 F2,i Ft,i
f1 f1 f2 f2
Figure 7.28: Optical system, consisting of two positive lenses positioned far apart.
The image is also determined by first determining the intermediate
image by L1 . The final image is determined straightforward, because
the intermediate image is positioned between the focal points of both
lenses.
A very different situation is shown in Figure 7.28, where the lenses are
positioned further apart such that their distance is larger than the sum of
their focal lengths.
With this configuration, the image of the object by L1 is positioned between
both lenses and is called the intermediate image. The second image is simply
determined by taking the intermediate image as object for L2 and use the
same rules as with a single lens system.
From the figure it appears that the inverting effect of L1 is reversed by L2 ,
resulting in an image with the same orientation as the object.
By tracing the ray from the object, that was parallel to the optical axis, the
focal point of the combination Ft,i is found. When L2 is positioned closer to
L1 in Figure 7.28, it appears that Ft,i will move more to the right. This can
be concluded from the position of the image when the object is kept at a fixed
distance from L1 . The three rays that converge at I2 will rapidly diverge.
As a result the magnification of the combined two lenses is increased and
the focal point is shifted. This effect indicates that with two positive lens
elements a zoom lens can be created.
A special situation occurs when L2 is shifted so close to L1 that F2,o becomes
located at the same position as I1 . In that situation the second image I2 will
be located at infinity as I1 is at the focal point of L2 .
When L2 is moved even further towards L1 , I1 will enter the area between
F2,o and L2 resulting in a virtual image at the left side of the system, similar
7.3. Geometric Optics 491
Entry-pupil Exit-pupil
Stop
O I
L3
L1 L2
Figure 7.29: An optical system with an aperture stop that limits the outer rays in
the system. The entry-pupil is the image of the aperture stop observed
at the entry of the system, while the exit-pupil equals the image of
the aperture stop as observed at the exit of the optical system.
to the effect that was demonstrated a few pages back in Figure ??. It is true
that at the very moment of crossing the focal point of L1 , the image swept
from positive infinity to negative infinity and will again come closer from
the left, when L2 is moved even further.
Ultimately both focal point will coincide which results in the double-telecentric
lens that will be presented after the introduction of the important aperture
stop and pupil.
All optical imaging systems have one important limitation that is caused
by their size. In Section 7.4 on physical optics it will be shown, how the
ultimate imaging properties of an optical system are determined by the
maximum capture angle of the rays that can pass the system. This quality
of capturing as much as possible light is also directly related to the law of
conservation of radiance, as introduced in Section 7.1.3.
A geometrical measure for this dimensional property is the pupil that defines
which rays are captured and transmitted by the optical system and which
rays are blocked.
With a single lens element, this pupil is determined by the diameter of the
element itself. With a more complicated system, consisting of a multitude
of lens elements, the outer rays are in principle limited in a not well defined
manner, depending on the direction of the rays and the position and sizes of
the different elements.
492 Chapter 7. Optics in mechatronic systems
L1 L2
Aperture stop
O Chie
f ra
lo y
F1,o F2,o F2,i
F1,i I
li
f1 f1 f2 f2
Figure 7.30: When combining two positive lenses with coinciding focal points, a
double-telecentric optical system is obtained, where an object at F1,o
will be imaged at F2,i with a magnification equal to the ratio of the
focal lengths. The chief ray through the joint focal centre is parallel to
the optical axis both at the object and at the image site. The position of
the object or the image sensor has no influence on the magnification. A
symmetric cone of light from the object around the chief ray, indicated
by the dashed rays, will also create a symmetric cone of light at the
image.
7.3.5 Telecentricity
When the focal points of two positive lenses coincide, the configuration
obtains a special property, called double-telecentricity and is explained with
the dual lens-element system of Figure 7.30.
A paraxial ray from O will be refracted by L1 through the joint focal point
and continue its path towards L2 , where it will be refracted parallel to the
optical axis again.
The location of the image will be at F2,i , because all rays of any point of the
object will be refracted into a set of parallel rays in the space between the
lenses. As a consequence they themselves will be refracted to the focal point
of the second lens element L2 .
The aperture stop in a telecentric system is located at the joint focal point.
The ray through the joint focal point is called the chief ray. Officially a chief
ray is defined as any ray under an angle with the optical axis that passes
through the centre of the aperture stop.
With a double-telecentric system a paraxial ray from any point on the object
will be a chief ray and become also a paraxial ray at the image side of
the system. As a consequence, the magnification of the system is only
determined by the ratio of the focal lengths:
li f2
M=− =− (7.22)
lo f1
This means that a displacement of the object in any direction left or right
from the drawn position in Figure 7.30 will not change the size of the image,
as the path of the drawn paraxial ray will not be influenced by this displace-
ment.
It should be noted that even with this constant magnification, the focal
position of the image will change with a displacement of the object along the
optical axis. This means that these kind of lenses are specially chosen when
the requirements on magnification are more severe than the requirements
on focal distance. This is the case with for instance the projection lens of
the wafer stepper that was introduced in Chapter 1. When the wafer with
photo-sensitive resist is placed at the image location with a slight error in
its distance to the lens, this error will only result in a less than perfectly
sharp image but not in a change of the dimensions.
In the described configuration, this magnification is maintained both for a
position change of the image sensor and for a position change of the object,
which makes it a double-telecentric optical system. Also examples with
single-telecentricity exist as will be presented later.
494 Chapter 7. Optics in mechatronic systems
L1 A B L2
2
C
O Chief r
ay
F1,o F2,o F2,i
F1,i
3 I
C
A B
1
Figure 7.31: Different positions of the aperture stop have different effects. Locating
the aperture stop at A instead of the preferred position B at the joint
focal point will result in ray 1 to be blocked at an undefined location.
A smaller aperture stop at C clearly defines the cone of light bounded
by ray 2 and 3.
Also with non-telecentric optics it is preferred that the rays from any spot
on the object propagate parallel at the plane where the aperture stop is
located. This is the case with all optics that need a controllable aperture
stop, like photographic camera lenses. This can be seen from Figure 7.31
where for instance a smaller aperture stop at position C will only influence
the amount of light. As each point on the object will still be imaged by
means of light rays through the smaller aperture the overall image shape is
not altered by shadowing a part of the image. For that reason also with non
telecentric lenses this plane where the aperture is located is called a pupil
plane.
With the physical optics of the following section, this pupil plane will further
prove to be a very important area to determine the quality of an imaging
system.
The shown example with only two positive lenses represents an almost ideal
situation. When a different amount of elements are used like in Figure 7.29,
there is not always a joint focal point defined. In those cases still single-
telecentricity can be created, depending on the location of the aperture stop
inside the lens system.
The first possibility is an object-space telecentric system, where the aperture
stop is located at the first focal point inside the system when entering from
the object side. In that case the entry-pupil is located at infinity.
An example of such an object-space telecentric lens is the measurement
microscope from Chapter 3. A small displacement in the vertical direction
is not allowed to cause an error in the measurement of the object size and
object-space telecentricity prevents this error.
The second possibility is an image-space telecentric system, where the aper-
ture stop is located at the first focal point inside the system when entering
from the image side. This causes the exit-pupil to be located at infinity.
Image-space telecentric lenses are preferred in digital photography cameras.
Non-telecentricity at the image sensor has as consequence, that the cone of
light to the image is not symmetric around a paraxial ray but tilted under
an angle with the optical axis. This can lead to image artifacts, due to the
stack of a colour filter and multi-lens array before the image sensor. Only
an image-space telecentric lens guarantees that for instance no red light
ends up at a blue sensor element.
Due to size constraints, practical photographic lenses can only approximate
496 Chapter 7. Optics in mechatronic systems
One of the models to describe the properties of light was shown to be based
on the theory of electromagnetic wave propagation.
In Chapter 2 it was explained that electromagnetic waves consist of a com-
bination of two oscillating orthogonal vector fields, an electric E and a
magnetic B field. In Chapter 5, the interrelation between magnetic and
electric fields was defined by Faraday’s and Ampére’s law of the Maxwell
equations and these laws are used in dedicated modelling software for the
calculation of the real properties of light in optical systems. In this section
the wave character of the light is used to explain the performance of optical
systems without extensive calculations. Instead, graphical representations
will be used of only one of the two interrelated vector fields. The electric
E-field is often chosen for this purpose and is presented graphical as a sine
wave.
An unfortunate fact that is encountered in analysing the physical properties
of optics, is caused by the extreme frequency of the related electromagnetic
waves. Electric or magnetic oscillating fields with frequencies in the order
of 1015 Hz are impossible to measure directly. As mentioned before, only the
average optical irradiance I r can be detected by means of photo detectors
that convert the related flow of photons into an electric current.
The irradiance of an electromagnetic wave is proportional to the power of
the sinusoidal electric field, given by the following expression:
cn0 2
I r = cn0 E2rms = Ê [W/m2 ] (7.23)
2
In the following subsections, first the concept of polarisation is introduced,
because this concept is widely used in optical measurement systems. This
is followed by the explanation of the interference effects of the interaction
between different waves and the theory on diffraction and gratings. The
section will be completed by deriving the diffraction limited resolution of an
optical system, where the influence of the capture angle on the quality of
the image will be demonstrated.
7.4.1 Polarisation
z
y
z
Figure 7.32: The electric field of linear polarised light, represented in an orthogonal
coordinate system, defined by x in the direction of propagation of the
wave and y, z under 45◦ with the polarisation direction of the electric
field. The projection on the y – z plane is a line. in the upper part
the y and z components are in phase while in the lower part they are
180◦ out of phase. Shifting the phase of one of the components with
180◦ results in a rotation of the polarisation direction of 90◦ on the
y – z plane.
7.4.1.1 Birefringence
z
Figure 7.33: Elliptical polarised light. When the y component relative to the z
component of the electric field of an electromagnetic wave is slightly
different from 180 or 0◦ the projection of the electric field in the y – z
plane becomes an ellipse.
O
x
v
z
Figure 7.34: Circular polarised light. When the phase relationship between the y
and z component of the electric field of an electromagnetic wave equals
90◦ , the projection of the electric field in the y − z plane becomes a
circle. The spiral can rotate left or right handed depending whether
the phase difference is positive or negative.
7.4.2 Interference
φ=0
I r,c= 4 I r
φ = 0.5 π
I r,c= 2 I r
φ = 0.75 π
Ir,c= 0.6 I r
φ= π
Ir,c= 0
Figure 7.35: Interference between two waves with the same wavelength and ampli-
tude but different phase. The resulting electric field amplitude can
range between zero and twice the amplitude of each wave giving a
combined irradiance I r,c of maximum four times the irradiance I r of
each wave.
When measuring the resulting electric field, only the irradiance can be
observed. The irradiance can be determined by using the amplitude term of
Equation 7.24 in Equation (7.23):
cn0 2 cn0 & & ϕ ''2 &ϕ'
I r,c = Êc = 2Ê cos = 2 cn0 Ê2 cos2 (7.25)
2 2 2 2
After expanding the squared cosine term with its trigonometric identity,
this equation becomes:
1 + cos ϕ ( )
I r,c = 2 cn0 Ê2 = cn0 Ê2 1 + cos ϕ (7.26)
2
The resulting irradiance of the combined electric field varies between 2 cn0 Ê2 ,
when ϕ = 0 + n · 2π, and zero, when ϕ = π + n · 2π, with the integer n ≥ 1.
The irradiance of the separate fields is equal to:
cn0 Ê2
I r,1 = I r,2 = I r = (7.27)
2
This irradiance level is a factor four below the irradiance level of the com-
bined fields at full constructive interference. This is caused by the squared
relation between the double amplitude of the combined fields and the ir-
radiance. Although this result seems to be contradicting to the law of
conservation of energy, in reality an optical system that shows constructive
interference, also shows an equal amount of destructive interference at
another location. The energy from the destructive interference area will
add to the energy at the other area and the total energy within the system
is not changed. This effect will be mentioned at different occasions in the
following pages.
Mirror 1 Mirror 2
rO O
d
2
100 %
Transmission
Wavelength
show constructive interference with the light just entering the cavity and
as a result a standing wave occurs.
This standing wave manifests itself as if the interferometer becomes fully
transparent for that wavelength as shown in the upper half of the figure.
Other frequencies, that do not fulfil the requirement λ = 2 d /r , will result
in less constructive or even destructive interference and can not pass the
interferometer, depending on the amount of destructive interference. This
system acts like an optical notch filter.
The capability of this filter to distinguish different wavelengths is called
finesse and is determined by the ratio between reflection and transmission
of the mirrors of which the interferometer is made. When the transmission
is large also many other frequencies will pass, because there is hardly any
internal reflection, which would cause destructive interference. This is for
instance the case with glass in a normal window. On the other hand, the
more the mirrors become ideal reflectors the less other frequencies will pass.
A special version of a Fabry-Perot interferometer is the dichroic coating that
is used to prevent reflections on a lens or increase reflections on a mirror.
It consists of many thin layers of material with a different refractive index
that each determine a very small cavity tuned to a specific frequency. By
combining a multitude of layers it is possible to either create an optical
filter with a very high finesse or a more wide-band behaviour like with
photographic lenses.
7.4.3 Diffraction
0th order
-1st order 1st order
-2 nd order 2nd order
ϑ1
λ
p p
of the electromagnetic field of light. It is used in all examples of diffraction only to show the
phase relationship between the different light waves
7.4. Physical Optics 507
light at the slits with equal amplitude and phase. Orthogonal to the plane
of sight these sources create a line source so the resulting wavefront will be
like a cylinder of which only the cross section is shown in the figure.
The resulting wavefronts are found by connecting the wavefronts of the
different line-sources that are mutually in phase. Several favourable ver-
sions of these tangential lines can be found as is shown in the figure. The
orthogonal directions to each of these tangential lines are directions where
the light from all sources interfere in a constructive way and light will be ob-
served in those directions. The other directions in between show destructive
interference as both light with a positive and a negative phase is combined.
As a result, the diffraction at this grating manifests itself as a series of
separate light beams under an angle ϑ that is determined by the wavelength
of the light and the distance of the slits in the grating. These separate
beams are called diffraction orders with increasing number when starting
from the optical axis. In principle the diffraction is mainly observed at some
distance from the slits, the far field. At a closer distance the amplitude of
the field of each slit at the location of observation will be different and at a
distance in the order of the period of the grating, the light of only one slit
will be observed to be propagating in all directions.
From the figure it can be concluded that the angle between the orders will
be larger with a larger wavelength and/or a shorter periodic distance of the
slits. This relation can be derived by trigonometry which results in:
Nλ
ϑN = arcsin (7.28)
p
Amplitude gratings have one disadvantage, as they reduce the total amount
of light by the opaque or absorbing part of the surface. This property limits
the maximum reflectivity or transmissivity to approximately 50 %.
One might decrease the loss of light by making the slit wider than the
opaque part, but then the interference will be less. Ultimately the object
becomes just transparent or reflective, which was not the aim of the grating.
Fortunately the 50 % loss of light can be avoided by using a phase grating.
In its reflective form, this kind of grating consists of a row of connected
reflective stripes alternating at two different heights.
Incoming light
0.25λ p ϑ1
by destructive interference.
Knowing that energy can not get lost and no dissipation takes place, the
light should show an equal constructive interference in another direction
and indeed, at a certain angle, the travelled distances will be equal again
for all light coming in orthogonal and parting in that diffraction direction
as can be seen with the tangential lines of equal phase in the figure.
Identical to the previously explained transmission grating this diffraction
angle is equal to Equation (7.28):
Nλ
ϑN = arcsin (7.29)
p
This means that also with a phase grating higher order diffraction angles
occur, but in practice the geometry of the grating is often chosen such that
these higher orders are limited to N = ±1 or only have a very low irradiance.
Although only the effect of one point in the middle of each surface is shown,
this point is representative for the entire grating as for every point on the
low area a corresponding point at the high area can be found that combines
in the same way. By integration the total effect can be determined and
510 Chapter 7. Optics in mechatronic systems
B
O
A
1 2 3
Figure 7.39: With a reflective phase grating, light can be reflected by a different
location than determined by the law of reflection, as long as the phase
is equal for the different paths (1,2,3) of the grating.
with the second method to explain the working principle of the transmissive
version of a phase grating, the total surface is used.
Because of the fact that with a phase grating no absorbing surface is present,
the total amount of light is retained.
Several readers might wonder if reflective phase gratings can work at all
with a real mirroring surface. With real measurements on these gratings
it indeed appears that the reflected light will act different than would be
expected by applying the law of reflection that was derived in Section 7.2.
Figure 7.39 shows the reflection at a mirror and a grating. At first sight only
the middle part of a large mirror is needed for reflection and the photons
emitted by the source in other directions are not useful and can never reach
the point of observation.
With a grating surface however, this is no longer true as the photons can
be reflected in the diffraction orders even with a mirrored surface. This
is due to the fact that the simple wave theory is not sufficient to explain
everything. Although the method of least time is useful, another method is
needed to explain what really happens at a mirrored grating. Again, like
also mentioned in Section 7.2, the theory of light acting as particles helps
for this explanation.
In a nutshell this theory states that photons in principle can follow all kind
of different paths to a target point of observation. The probability of a
certain path to this location is however determined by the question whether
other photons from the same source, that arrive at that location via different
7.4. Physical Optics 511
n = 1.5 p Glass
paths, all have an equal time relation to their temporal periodicity (read,
the same phase). For a reflective grating that means that a photon that
is reflected at one high step should have the same phase at the point of
observation as any other photon from the same source reflected by another
high area.
This again is an example of a theory that as such is not understandable,
because how can a photon know anything about its colleagues, before it is
arrived at the point of observation?.
Nevertheless the theory is useful as it perfectly predicts their behaviour.
to create the necessary λ/2 phase difference between the photons that pass
through the area with a shorter optical path length and the photons that
pass through the area with a longer optical path length. This means that
the height is depending on the refractive index of the transparent material.
Figure 7.40 shows such a grating and its principle is explained by using the
second method with the full surface. With this method, the transmitted
wavefronts are drawn as if they remain flat, instead of a point source with a
circular wavefront. This is the extreme other side of the approximation and
shows to give the same result. In this example, the step size is chosen such
that its height corresponds with one wavelength in air and 1.5 wavelength
in the high index material, like glass with n = 1.5. When observing the
0th order the wavefronts from the high and low areas have a 180◦ phase
difference and at the far field these wavefronts will cancel each other out
by destructive interference. This is fully comparable with the effect of the
reflective phase grating.
Phase shifting
n = 1.5 Glass
a: Stepsize 1·λ in air, 1.5·λ in glass.
Air
?
'
$ n=1
< 90 o
n = 1.5 Glass
b: Stepsize 0.66·λ in air, , 1·λ in glass.
?
'
$ Air
n=1
n = 1.5 Glass
c: Stepsize 0.33·λ in air, 0.5·λ in glass.
Figure 7.41: By varying the step size of the phase grating, the magnitude and
phase of the 0th and 1st order is also varied. A reduced step height (b:
and c:) results in a phase advancement of both orders. The irradiance
of the orders shifts from 100 % in the 1st order at a: to ultimately
100 % in the 0th order, when the step size would be zero.
514 Chapter 7. Optics in mechatronic systems
When the step size is changed, as shown in Figure 7.41 b: and c:, the phase of
the wave through the high area is less delayed in respect to the wave through
the low area. As a result both orders get a partial positive phase shift.
With a smaller step size, the phase difference between the interfering waves
in the 0th order direction will be less than 180◦ . Consequently the magnitude
of the irradiance of the resulting wave will no longer be zero, while the phase
delay will be less then 90◦ . At the same time also the waves in the 1st order
direction will be less perfectly in phase, resulting in a phase difference of the
resulting wave and a smaller magnitude of the irradiation. Ultimately when
the step size is zero the phase grating will be just a transparent plate again,
only transmitting light in the same direction as the incoming light. With
the reflective version of the phase grating this ultimate situation would
simply be a flat mirror.
As a result of this reasoning, the phase of the 0th order will always remain
delayed relative to the phase of the 1st order diffracted wave. While the delay
of 90◦ from the 0th order relative to the incoming light gradually reduces to
zero, the phase of the 1st order will gradually shift to a positive 90◦ difference
with the incoming light. Regarding the magnitude, following the law of
conservation of energy, the sum of the irradiance of the different orders
always remains equal to the irradiance of the incoming light as the size of
the beams is not affected by the grating. By choosing the right height it is
for instance possible to create a grating with the same irradiance for the 0th
and the two 1st order beams. In that situation the phase shift of both orders
should be equal. A small approximating calculation shows the effect using
Equation (7.26) in a simplified form:
1 + cos ϕ
I r,o = I r,i (7.30)
2
with I r,o equaling the irradiance of an output order and I r,i being the irra-
diance of the incoming light. When the light going into higher orders is
estimated to be about 25 % of the total irradiance, which is a realistic figure,
then 25 % is available for each of the three main orders. This level requires
the cosine term to be equal to 0.25 for all three orders, resulting in a phase
angle of:
This means for the 0th order that the phase difference between the interfering
waves equals 120◦ and as a consequence the (negative!) phase shift of the
0th order after interference becomes half of that value equaling −60◦ relative
to phase of the incoming light. For the 1st order the phase is shifted in the
7.4. Physical Optics 515
The shown examples all had a strictly defined direction of the incoming
beam of light, orthogonal to the surface and one may wonder what happens,
when the light approaches the grating from another direction.
In principle, with a reflective grating, a beam with an angle of incidence ϑ
results in a 0th order directed with a same angle ϑ opposite to the normal
fully according to the laws of reflection. In fact as a first-order approxima-
tion the geometric effect just linearly combines with the diffraction effect.
In reality a large angle of incidence of the incoming light will cancel out
certain diffraction orders that would otherwise be directed into the reflecting
material while other orders appear at the opposite side of the normal. The
interested reader can derive this when drawing the different graphs for
other light directions.
One consequence of this reasoning is that the effect works in two directions.
With for instance a reflective amplitude grating, light that enters under an
angle corresponding with the 1st order will be reflected mainly in the direc-
tion of the 0th order. This reverse effect can also be explained by reasoning
that the optical path lengths for photons going in the opposite direction are
equal, resulting in equal travelling time and an equal phase for their wave
property.
With transmission versions of the different gratings in principle the same
reasoning is valid al long as the transparent part is planar, without optical
strength or wedge form. Otherwise the angle should be corrected for these
effects.
1st
p ϑ 0th order
Illuminaon -1st
-2nd
-3rd
Object Image
Figure 7.42: Imaging of a transparent grating with a square wave irradiance profile
illuminated perpendicular to the grating from the left. It shows that
the higher diffraction orders that represent the spatial high frequency
content of the image are located at the pupil plane further away from
the optical axis.
Pupil plane
3rd
Irradiance 2nd Irradiance
1st
0th order
Illuminaon -1st
-2nd
-3rd
Object Image
Figure 7.43: Imaging of a transparent grating with a square wave irradiance profile
with a small aperture stop at the pupil plane showing the spatial low
pass filtering of an optical system when blocking the higher diffrac-
tion orders.
0.8
0.6
0.4
0.2
0
1
0.5 1
0 0.5
0
-0.5
-0.5
-1 -1
Figure 7.44: Airy-disk irradiance profile of the image of an ideal point-source cre-
ated by a diffraction limited optical system. The fringes around the
centre peak are caused by the sharp filtering of the optical system
which is equivalent to a higher order dynamic system.
• The high spatial frequencies represent the details of the object and
they are “encoded” in the area in the pupil plane that is the most
distant from the optical axis.
In other words, an optical imaging system acts like a spatial low pass filter
with a cut off frequency that is determined by the capture angle. A small
capture angle results in a lower optical resolution and less sharp images.
An optical system of which the performance is only determined by this
diffraction related limitation is called diffraction limited. In a diffraction
limited optical system all aberrations are smaller than the smallest details
in the image. An example of a diffraction limited image is the airy-disk of
7.4. Physical Optics 519
Figure 7.44 which is an image of an ideal point-source. The point source with
its infinitely small size and infinitely high radiance is the spatial equivalent
of an impulse, having an infinite amount of frequencies. In the airy-disk the
higher order response of the spatial low pass filtering by the optical system
is visible by the shown oscillations around the irradiance peak in the centre,
a spatial “dynamic” phenomenon. As soon as the performance of an optical
system is not ideally diffraction limited, the airy-disk of the image of a point
source will show both a wider and lower middle irradiance peak. The ratio
between the peak height of a real optical system and the peak height of a
diffraction limited version of this optical system is called the Strehl ratio,
named after the German physicist Karl Strehl (1864 – 1940). The Strehl
ratio is a measure for the quality of an optical system.
Two closely related terms are used in optics to relate the above presented
capture angle with the resolution of the imaging system, the Numerical
Aperture (NA) in professional imaging systems and the f-number 〈 f /#〉 in
photography. With the help of Figure 7.45 both terms will be explained.
The f-number is the simplest term of the two as it does not take the refractive
index into account and equals just the ratio between the diameter of the
entry-pupil and the focal length.
f
〈 f /#〉 = (7.32)
d
This term is perfectly usable in photographic systems that work in air with
objects that are mostly far away and with an image location near the focal
plane. The maximum entry-pupil of a photographic lens is almost equal
to the diameter of the front lens. Usually the units, also called stops differ
a factor 2 because the captured light from the object is a function of the
surface area of entry-pupil, being squared proportional to its diameter. This
means that a stop is a factor two in light irradiance.
Regular photography lenses have an f-number between f /2.8 and f /22. Often
a practical lens design is optimised for aberrations at an f-number around
f /8. This already indicates that these lenses are generally not diffraction
limited with the exception of very expensive professional photographic lenses
that can be used wide open and sometimes even show f-numbers up to f /1. It
is obvious that these lenses need to be vary large to achieve such an opening
angle, like those used by sports photographers around football stadiums. It
also points convincingly to the inherent limitations in resolution of the small
520 Chapter 7. Optics in mechatronic systems
f
Entry-pupil
d
Figure 7.45: Definition of the Numerical Aperture (NA= n sin ϑ) and the f-number
( 〈 f /#〉 = f /d ) in an optical system. A large NA or a low 〈 f /#〉 correspond
to more captured diffraction orders giving a better resolution. For the
illustration the entry-pupil as used for the f-number with photographic
lenses is drawn just before the lens. With a telecentric lens the pupil
is located at infinity for which reason the capture angle is taken as
reference for the NA.
NA = n sin ϑ (7.33)
This term includes the refractive index n which is related to the fact that the
wavelength of light with a specific frequency is shorter at a higher refractive
index. As a consequence the angle between the diffraction orders is smaller.
In other words an optical system can capture more orders in a high refractive
index medium like water or oil than in air.
This is the reason for oil-immersion in microscopes and water-immersion in
wafer scanners6
The use of the sine instead of the tangent as with the f-number is more
practical as it clearly limits the value in air to one as the unattainable
6 The application of immersion fluids in an existing air-based optical design does not au-
tomatically increase the NA but only enables the designer to achieve a higher NA with an
adapted design that could otherwise not be realised in air.
7.4. Physical Optics 521
goal. With an NA in air that is equal to one, the object or image should be
positioned inside the lens element which is not possible. With small angles
and an refractive index of n = 1 the relation between f-number and the NA
can be approximated as follows:
d 1 1
NA = n sin ϑ = n sin arctan ≈ or 〈 f /#〉 ≈ (7.34)
2f 2〈 f /#〉 2NA
with λ the wavelength of the light in vacuum because the refractive index
has already been taken into account in the NA value and λ is mostly known
from the light source. The term k1 is named just like that, the k-one factor
and equals around 0.5 for a not optimised system.
With special “tricks” that mainly consist of illumination methods at the
object side, like shown in Figure 7.46, the system is able to image at a higher
resolution. The figure shows as an example the effect of illumination from
a direction under an angle with the optical axis, causing a tilting of all
the diffraction orders. With a perpendicular illumination the higher orders
would be blocked at the aperture stop (the dashed lines) but with this angular
illumination both the 0th and one of the higher orders is able to pass the
aperture, giving enough optical information to create an image.
When the object would be illuminated from two directions under the same
angle with the optical axis or even from all directions in three dimensions (a
cone) under the same angle, called annular illumination even both 1st orders
would be captured. Further also contrast enhancement methods at the
image side help to lower this value. The real minimum value for k1 is 0.25
as this is the situation where not even a fraction of the 1st order is captured
any more with whatever illumination method.
The mentioned methods for achieving a low k1 are applied in waferscanners
as these are not aimed at reproducing a nice representative image of the
object like in photography. Instead, they are designed to produce an image
of a certain structure by optimising all related elements. In that case the
7 The term (NA) and (CD are not written as a variable in one italic symbol and strictly it
should be given a different real variable name but the used notation is quite common in those
industries where it is relevant. For this reason the NA and the CD are noted with straight
capitals.
522 Chapter 7. Optics in mechatronic systems
Pupil plane
Irradiance Irradiance
0th order
Illuminaon
-1st
Object Image
Figure 7.46: Illumination under an angle with the optical axis enables imaging
in case the aperture is too small for capturing both 1st order angles
(dashed lines). The changed angles enable the capturing of the 0th
and one of the 1st orders.
object, also called the mask or reticle, can be adapted to make optimal use
of the possibilities of the entire optical system. The contrast enhancement
techniques at the image are realised in the photosensitive resist that sharply
distinguishes between different levels of irradiance within only a few percent.
It is needless to say that this requires a tight control of the average light
level, called Dose-control.
Image
sensor
c
d
|f
dS So
Figure 7.47: The depth of focus (dS ) of a photographic camera depends on different
parameters: the distance of the subject to the lens (S o ), the minimal
resolution ( c) and the f-number ( 〈 f /#〉 = f /d ).
With the f-number ( 〈 f /#〉 = f /d ) the following relation for the maximum
allowable focal error is derived:
c S 0 〈 f /#〉
dS = ± (7.37)
f
ϑ
dS F
Figure 7.48: Depth of focus with diffraction limited optics. The enlarged picture
at the right shows the irradiance at the focal point. The profile of an
airy-disk can be observed at the cross section of the focal point, with
dark and light areas next to the central spot. Further it also shows
that the size of the spot remains about the same over a larger range
along the optical axis than expected from the crossing lines, resulting
in a depth of focus ( dS ) of about 2 – 3 times the spot size.
diffraction limited image. In that case it is not possible to simply derive the
tolerance from geometry of the two crossing most outer rays as around the
focal point the wave character shows a very interesting effect.
It was shown in the previous section that the size of the image at the focal
point can never have an infinitely small size because of the size of the airy-
disk that is determined by the numerical aperture.
In spite of this minimum size of the spot in the centre, the maximum
irradiance area spreads over a larger area in the direction of the optical axis
than would be expected from purely geometrical analysis. This results in a
larger focal range according to the following relation:
nλ
dS = ± (7.38)
NA2
In Chapter 9 it will be shown that because of this phenomenon an optical
resolution in the order of 50 nm can be achieved with a depth of focus in the
order of 100 nm, when using immersion optics with a very high numerical
aperture of ≈ 1, 35.
7.5. Adaptive optics 525
In the past decades, the principles of active control are increasingly applied
in complex imaging systems to achieve the ultimate performance. This
mechatronic field of adaptive optics has proven its value in astronomy and
high precision optics for IC lithography but also in medical instrumentation
it became a method to achieve results that were hitherto deemed impossible.
In this section one of the main causes of a less than optimal optical per-
formance of an imaging system is presented which is the generally strong
temperature dependency of the optical path of the light.
It will be shown that the resulting problems can be reduced by inserting
active optical elements such as deformable mirrors with a suitable measure-
ment and control system.
At first sight it might seem possible to select a glass type with such a positive
d n/ dT that it compensates the effect of the increasing radius by the larger
lens size at a higher temperature.
In reality however often this choice is mostly not available while the effect
on geometry and refractive index of otherwise preferred materials can be
526 Chapter 7. Optics in mechatronic systems
Figure 7.49: The car looks as if standing in water while being in the middle of the
Mojave desert at 40 ◦ C. This mirage is caused by a lens effect that
refracts light from the sky towards the observer. This lens effect is
created by the hot air with a low density close to the ground. Also the
turbulence caused by the thermally induced motion of the air is clearly
visible at the telephone poles while the image of the approaching car
at the horizon is even hardly distinguishable.
quite different.
With for instance the professional lenses for IC lithographic exposure equip-
ment working at the Deep UV wavelength of 193 nm, it is necessary to use
Fused Silica as lens material that shows an large temperature dependency
of the refractive index dn/ dT = 15 · 10−6 while the thermal expansion coeffi-
cient is only ≈ 0.5 · 10−6 .
This means that one ◦ K of temperature could shift the focal plane of a lens
with a focal length of 10 mm with 150 nm which is not acceptable for such a
precision optical system. Even worse is the situation where the temperature
is not constant over the optical system due to absorption of light causing
additional aberrations.
Another well-known example of thermally induced optical errors is the effect
of temperature differences in air like shown in Figure 7.11 in Section 7.2.2. It
is the cause for the famous phenomenon of a mirage as shown in Figure 7.49.
7.5. Adaptive optics 527
The pupil plane inside an optical system is shown to contain all the spatial
information about the object that is imaged. This information is stored in
the different diffraction orders that all propagate parallel for any point on
the subject.
The wavefront at the pupil plane should be flat because of these parallel
rays. This means that at the pupil plane aberrations can be distinguished
as deviations from this flat wavefront. It was further also shown that the
information on the details is spread over the entire pupil plane with the finer
structures more distant from the optical axis. Adaptive Optics is based on
the principle to introduce a controllable optical element in the optical path
that corrects these deviations from the ideal wavefront after measurement
by a wavefront sensor.
The first application of adaptive optics has been in terrestrial telescopes
where the turbulence and temperature effects in the atmosphere determine
changes in the refractive index leading to wavefront errors.Especially with
the ever larger size of these telescopes, necessary to capture more light
and increase the resolution, the influence of the air becomes dominant. As
an example Figure 7.50 shows the E-ELT, the Extreme Large Telescope of
the European Southern observatory that is planned to be finished around
2018. The largest mirror has a diameter of ≈ 40 metres. It is impossible
to manufacture such large mirrors in one piece and it became customary
for telescopes to compose such a large mirror from a multitude of smaller
elements. The main mirror of the E-ELT consists of around 1000 smaller
mirror elements of 1.4 metres wide. Each of these mirror-elements is pre-
cisely positioned relative to the other mirrors, such that the light from each
element is in phase with the light of the other elements. Only under that
condition the combined mirrors will act like one large mirror with one con-
sistent wavefront.
The precision of this matching needs to be better than λ/20 which is about
15 nm, because light with wavelengths as short as 300 nm has to be imaged.
It is well understandable that a wavefront that passes more than 50 km of
atmosphere can easily become distorted by more than this 15 nm and for
that reason two of the four other mirrors of the telescope have an actively
controlled surface shape.
In the following two sections, first the Zernike modes are introduced as a
mathematically description of the wavefront aberrations, while in the second
section the principle of operation of adaptive optics is further explained.
528 Chapter 7. Optics in mechatronic systems
Figure 7.50: The Extremely Large Telescope of the European Southern Observatory
(E-ELT) has a main mirror with a diameter of just less than 40 metres
that consists of ≈ 1000 segments with a diameter of ≈ 1.4 metres that
all have to be aligned with an accuracy of λ/20 ≈15 – 50 nm. Two
adaptive secondary mirrors are used to correct for wavefront errors
by atmospheric disturbance. (courtesy of ESO)
as:
q q
Z p (ρ , ϕ) = R p (ρ ) cos( qϕ) (7.40)
where both p and q are non negative integer numbers, p − q are even and
p ≥ q.
The angle ϕ is the azimuthal angle, between a reference vector and another
vector that originates in the centre and points towards the location of interest.
The value ρ is the normalised radial distance on the unit disk with diameter
of 1.
The radial polynomials R qp (ρ ) are given by:
( p−
q)/2
q (−1)k ( p − k)!
R p (ρ ) = ρ p−2k (7.42)
k=0 k!(( p + q)/2 − k)!(( p − q)/2 − k)!
This equation results in the following list of radial polynomials for the first
21 Zernike modes, where p + q is the order of the Zernike mode8 :
R 00 (ρ ) = 1 (7.43)
R 11 (ρ ) = r
R 20 (ρ ) = 2ρ 2 − 1
R 22 (ρ ) = ρ 2
R 31 (ρ ) = 3ρ 3 − 2ρ
R 33 (ρ ) = ρ 3
R 40 (ρ ) = 6ρ 4 − 6ρ 2 + 1
R 42 (ρ ) = 4ρ 4 − 3ρ 2
R 44 (ρ ) = ρ 4
R 51 (ρ ) = 10ρ 5 − 12ρ 3 + 3ρ
R 53 (ρ ) = 5ρ 5 − 4ρ 3
R 55 (ρ ) = ρ 5
8 Several other definitions of these polynomials exist that differ in details, for instance by
normalising on maximal values at the edge of the circle, like 1 as used here. Also the units
can relate to metric values like [nm] or wavelengths. Further the order is often mentioned
as ” p“ equalling the radial order and ” q“ equalling the azimuthal order and the sequence
of numbering is either simply based on an increasing radial order or on a more complicated
expansion with spherical terms for all quadratic numbers. All these definitions are correct as
long as maintained consequently in all communication.
530 Chapter 7. Optics in mechatronic systems
Piston
Tip Tilt
Z00
@ @
Z1-1 Z11
Defocus
Trefoil Trefoil
Z2-2 Z20 Z22
! "
! "
Z3-3 Z3-1 Z31 Z33
Figure 7.51: Height map of the first 21 Zernike modes describing deviations of an
ideal flat surface in a systematic manner. Generally deviations of
wavefronts can be modelled as a combination of several Zernike modes
with different magnitude. In this chart blue corresponds with 1, green
with zero and red with -1.
(courtesy of Claudio Rocchini)
high area looks like a single sine wave. When comparing with Figure 7.22
this is less easy to imagine and with the higher order terms it is not useful
to even try, but one thing is certain, the higher the Zernike terms the more
refined the correction mechanism should be.
Several of the higher order terms can be seen as high spatial frequency
versions of the described primary aberrations. For instance Z42 and Z4−2
represent secondary astigmatism and Z51 and Z5−1 is secondary coma.
The other Zernike modes also have their own name based on their shape
like trefoil for Z33 and Z3−3 , quadrafoil for Z44 and Z4−4 , pentafoil for Z55 and
Z5−5 and consequently Z53 and Z5−3 are called secondary trefoil.
532 Chapter 7. Optics in mechatronic systems
Wavefront
analyzer
Shack -Hartmann
wavefront sensor
Actuator array
Image
Controller
Deformable mirror
Acve mirror
Distorted wavefront
Controllerr
Image
Wavefront sensor
Figure 7.53: Principle of adaptive optics with feedback control. The mirror is con-
trolled such that the error on the wavefront sensor is kept as close as
possible to zero.
|{J
Membrane
suspension c material
?
'!'!
|J
Permanent magnet Planar coil
Figure 7.54: Actively controlled mirror with “Hybrid” permanent magnet biased
actuators. The actuator at the left has no current, in the middle
the current direction increases the permanent magnet flux and at
the right the current direction decreases the flux of the permanent
magnet. (courtesy of Roger Hamelinck, TU/e)
Measurement in
mechatronic systems
The precision of mechatronic systems can not be better than the measure-
ment accuracy of all relevant parameters that are needed to control and
adapt the behaviour of the system. The other elements in the system are
never capable of correcting unknown measurement errors.
The field of metrology deals with measurement in general and is in itself
quite immense, even when only dealing with mechanical quantities. One of
the reasons for this wide scope is the economic need for undisputed quan-
tities when trading goods. Metrology has been practised from the very
moment that people started to exchange valuable items and this has re-
sulted in agreements and rules on a global scale regarding traceability, the
possibility to relate a measurement value to an agreed standard like the
kilogram and the metre. For the same reason as with the other chapters,
this chapter has to remain limited to the most relevant measurement items
for precision motion systems, being the measurement of forces, position and
motion. This includes sensors for stress, velocity and acceleration, based on
physics principles like piezoelectricity, electromagnetism and optics.
This chapter is divided in seven sections.
Section 8.1 is a more general introduction of the basic principles and mod-
elling of measurement systems. Also some definitions from the traceable
metrology field will be introduced, especially regarding measurement uncer-
tainty.
Section 8.2 presents the statistics around random errors by introducing
an important statistical method, called Dynamic Error Budgeting. This
537
538 Chapter 8. Measurement in mechatronic systems
Many years ago scientists like Laplace were convinced that somewhere in
the future all physical processes would be exactly predictable.
Quantum mechanics however, has given us other insights with for instance
the Heisenberg uncertainty principle, postulated by the German physicist
and Nobel prize winner Werner Karl Heisenberg (1901 – 1976). His prin-
ciple states that the standard deviation in the measurement errors of two
parameters, like place (σx ) and impulse (σp ) or energy (σE ) and time (σt ),
can not be smaller in combination, than given by the following expression:
h h
σx σp ≥ and σx σp ≥
4π 4π
Mostly the used terms are limited to only those that contribute to the
significance of the output value o, as determined by the uncertainty level of
the measurement.
Next to the non-linearity, external influences can impair the functionality of
the element in two ways. Firstly, the external disturbance input can result
in an additional value of the output of the element irrespective of the mea-
surand input value. This disturbance input is called the interfering input,
with a transfer gain K i . The second way of impairing the measurement
output is by changing the proportional gain of the measurand input transfer
function and is called the modifying input, with a transfer gain K m . In prin-
ciple, the interfering input can have a similar non-linear transfer function
as the measurand input, but often the higher-order terms of the related
polynomial can be neglected, as the sensitivity of the interfering input is
mostly much less than the sensitivity of the measurand. In high precision
systems, however, this linearisation is not always allowed, especially when
these effects need to be compensated.
Many external influences show a combination of both effects. Sometimes
even the gain factors K m and K i depend on the actual value of the input or
output signal, which makes it even more awkward to model. After compen-
sation of the systematic errors, a precision system generally mainly shows
random interfering errors.
Two other factors that influence the output of the element are shown in the
model, the setpoint and the dynamics. The setpoint (a) is an intentional
offset-value to define the range of output values that belongs to a certain
8.1. Introduction to measurement systems 543
Mulplied Added
M (i, em ) K m i em I (ei ) K i f ein n t 1
Non linearity
N (i ) f in n t 2 Setpoint (a)
p x1 ,x2 = p ( x) d x (8.2)
x1
The probability that the measurement can have any value, is by definition
equal to one. This implies that the integral from −∞ to +∞ of p( x) always
equals one. The related Cumulative Probability Function (CPF) is defined
by the integration of p( x) from −∞ to x. Often the PDF and the CPF are
represented in a graphical way, like shown in the example of the following
section. This representation clearly illustrates these statistical properties
of the system. On the horizontal axis the range of values is shown while the
2 This section is derived from the Phd thesis of Leon Jabben from our laboratory in Delft.
For random signals also a third term is introduced, the signal power Ps that
proves to be useful in combining several error signals. For random signals
the three terms are calculated as follows:
T
1
mean : x = lim x( t) d t
T →∞ 2T
−T
T
1
power : Ps = lim x( t)2 d t (8.4)
T →∞ 2T
−T
T
1
variance : σ2x = lim ( x ( t ) − x )2 d t
T →∞ 2T
−T
0.5
x 3Vx x 2Vx x Vx x x Vx x 2Vx x3Vx x 3Vx x 2Vx x Vx x x Vx x 2Vx x3Vx
x x
?
!!'
Figure 8.3: The probability density function and cumulative probability function of
a normal distributed random signal with a mean value x̄ and a standard
deviation σx . The probability that a measurement value is located in a
certain range is equal to the surface enclosed by the graph of the PDF
and the horizontal lines that correspond with the range boundaries.
These calculations imply that only 68 % of the measured values can be found
in a range of ±1σ around the mean value. A range of ±2σ already contains
8.2. Dynamic error budgeting 547
more than 95 % of all measurements, while at ±3σ, less than 0.3 % is found
outside the range. For this reason measurement systems are often specified
with either 2σ or 3σ values for the measurement error.
o = K i + a + N ( i ) + K m ie m + K i e i (8.8)
For small deviations the non-linear term N ( i ) can be combined with the
linear term L( i ) and the equation becomes:
∂o ∂o ∂o
δo = δi + δem + δei (8.9)
∂i ∂em ∂ei
where:
∂o ∂o ∂o
≈ K, ≈ Km i and ≈ Ki (8.10)
∂i ∂em ∂ei
In most cases the different inputs are independent and their impact can be
statistically combined using the rule of “the root of the sum of squares”. The
combined standard deviation of the element becomes:
2
2
2
∂o ∂o ∂o
σo = σ2i + σ2e m + σ2e i (8.11)
∂i ∂em ∂ei
And the median can be derived by simple addition:
ō = K ī + a + N ( i ) + K m ie m + K i e i (8.12)
548 Chapter 8. Measurement in mechatronic systems
P f1 , f2 = P( f ) (8.13)
f1
@!
!
M!
\JX
!
Object
Table to
2 Hz
Floor vibraon
4
10 100
3 ~90 nm
8 .10 80
0
CPS [nm ]
P( f ) [nm 2 /Hz]
10
2
CAS [nm]
3
6 .10 60
3
4 .10 40
total
>\
10 '
3
2 .10 20
@!
! CPS ~30 nm
! CAS
> 0 1 2 3 0 > 0
0 1 2 3
10 10 10 10 10 10 10 10 10 10
Frequency [Hz] Frequency [Hz]
?
!!'
! !
Figure 8.4: The power spectral density, the cumulative power spectrum and the
cumulative amplitude spectrum of an inspection microscope with dif-
ferent error sources, the floor vibrations, acoustic disturbance forces
acting on the measurement head and the dynamic disturbance from
the measurement head itself. The resonance of the vibration isolation
system at 2 Hz and the resonance of the microscope head at 160 Hz
show to be the main error contributors.
(Courtesy of Leon Jabben)
the measurement head shows the typical dynamic decoupling effect of this
eigenmode.
The second error source originates from acoustical disturbances by for in-
stance the air conditioning unit of the room, where the microscope is used.
The power spectral density of the combination of these disturbances with the
sensitivity of the instrument shows a maximum at the resonance frequency
of 160 Hz.
The third error source is caused by the dynamic properties of a feedback
controlled positioning system that moves the measurement head within
550 Chapter 8. Measurement in mechatronic systems
of a signal is calculated, but the standard deviation or the RMS value in the case that the
mean value is zero.
552 Chapter 8. Measurement in mechatronic systems
the heat flow is not constant. Most probably this direct thermal disturbance
on system accuracy will always remain a limiting factor in the precision of
high performance mechatronics.
Thermal effects have their dynamic impact also on the small scale as they
are the root cause for the noise in electronic components like resistors.
The large scale deformations by thermal effects mainly occur in the low-
frequency area. In the following a short overview is given of some important
sources of higher frequency dynamic noise in the mechanical and electronic
domain:
Electronic noise has many origins, which are named after their behaviour,
like thermal, shot, excess, burst and avalanche noise. The most relevant
sources of electric noise that impair precision mechatronic systems is pre-
sented in the following short overview:
Thermal noise
Any resistor will have a fluctuating potential difference across its ends that
is superimposed on the voltage caused by the current through the resistor.
8.2. Dynamic error budgeting 553
>\
10
ASML
BBN C
BBN D
>`
>{
10
>}
10 0 1 2
10 10 10
frequency [Hz]
Figure 8.5: Power spectral density of floor vibration specifications from ASML and
BBN.
(courtesy of Leon Jabben)
with k the Boltzmann’s constant (1.38 · 10−23 J/K), T the temperature and
R the resistance. To give an example, a resistor of 1 kΩ, at 20◦ C will show
noise with a RMS value of 0.13 μV from 0 Hz up to 1 kHz.
Shot noise
Shot noise results from the random passage of individual charge carriers
across a potential barrier. This is often seen with junctions in a transistor.
The noise has a normal probability density function and has a white spectral
density:
2
NS = 2 qI DC [A /Hz], (8.15)
554 Chapter 8. Measurement in mechatronic systems
with qe the charge of an electron (1.6 · 10−19 [C]), I DC the average current
[A]. An average current of 1 A will introduce noise with an RMS value of
18 nA from zero up to one kHz.
Excess noise
The noise in excess of the thermal and shot noise when a current passes
through a resistor or a semiconductor, is called excess noise. Other names
are flicker noise or one-over-f (1/ f ) noise. This noise source results from
fluctuating conductivity due to imperfect contact between two materials.
This is the reason why carbon composition resistors, which are made up
of many tiny particles molded together, show more excess noise than wire
wound resistors. The power spectral density of excess noise increases when
the frequency decreases:
Kf 2
NE = [V /Hz], (8.16)
fα
where K f is dependent on the average (DC) voltage drop over the resistor
and the index α is usually between 0.8 and 1.4, and often set to unity for an
approximate calculation. For resistors the excess noise is proportional to
the average voltage drop V over the resistor, which is why manufacturers
typically specify the excess noise as a noise index CR for one frequency
decade:
σ∗V · 106
CR = [μV/V] (8.17)
V
with σ∗V the being the standard deviation over one decade frequency range
of the voltage. For standard resistors the noise index CR typically ranges
from 1 to 10. For example, if the noise index equals 10, an average voltage
drop of one volt introduces noise with an RMS value of 17 μV in a frequency
range from 1 up to 1000 Hz. Note that a frequency range 1 mHz up to 1 Hz
introduces an equal amount of noise because of the low-frequency range!
+ + +
Vp Vp Vp
Sensing
Signal processing
Zs Zo,a
+
+
+
+
A-D Data
Vs Vi,a Z Va = GVi Vi,p Z Converter
_ i,a _ i,p
_ _
Vp
- Vp
- Vp
-
Figure 8.6: The electronic model of a measurement system shows the connection
between the three elements and the power supplies. Every successive
element determines a load for the preceding element. The output of
the first two elements is shown in the Thevenin model with a voltage
source. Depending on the properties of the system also a current source
Norton equivalent model can be applied.
8.3. Sensitive signals in measurement systems 557
Not all sensors are capable of directly converting a physical quantity into
electricity. They first need to create one of the primary effects from the list
558 Chapter 8. Measurement in mechatronic systems
In the previous section it was shown that sensing principles can be based on
a changing electrical impedance. Before this information can be treated in
the same way as a directly generated electrical signal, it is first necessary
to convert this impedance into an electrical signal4 .
The most simple way to achieve this conversion is by applying Ohms Law
and either supply the impedance with a well-known current and measure
the resulting voltage over the impedance or do just the opposite by applying
a voltage source and measuring the current. An AC source must be used
with a complex impedance like a capacitor or an inductor, but apart from
that the reasoning is the same.
Although modern electronics can be made rather precise, often special mea-
sures are needed to prevent errors that are related to the the low sensitivity
of many sensing impedances. When a potentiometer is used to measure
the displacement of the slider, the output voltage ranges from the supply
voltage Vi to zero. This is a sufficiently large signal, when compared with
interfering signals, and generally no further measures are needed. Another
example of a very sensitive resistive sensor is the NTC, a resistor with a
negative temperature coefficient based on semiconductor properties. Its
resistance can for instance range between 50 and 1000 Ω for a temperature
change from 0 – 100 ◦ C. With 1 mA current this results in a sensitivity of
approximately 10 mV/◦ C, which is also sufficient for most practical measure-
ments.
Unfortunately most variable impedance sensors have a much lower sensitiv-
ity and the noise in the supply current or voltage will play a much larger role
with those sensors. Take for instance a strain gage, a resistor to measure the
strain in a material by an increase of its resistance, when elongated. With
4 Often this conversion is considered as a part of the signal conditioning element. This is
however not logical with the aforementioned definition of the sensing element
8.3. Sensitive signals in measurement systems 559
Z3
+
Z1
3
_
+
Vp Vp
+
Vo V1 V2
+
Z2 V1 Z4
+
V2
+
_ _
4
2
V1 V2
_ _ _ _
Figure 8.7: A Wheatstone bridge has two branches each consisting of a voltage di-
vider. Only the differential voltage between the outputs of both branches
is used as measurement signal. Frequently the Wheatstone bridge is
drawn in a tilted square configuration as shown right to emphasise
the differential output voltage Vo . When all resistors are equal the
differential voltage is zero and the common-mode voltage is half the
supply voltage.
those elements, the practical resistance change upon load δR/R is often less
than 1 %. When this strain gage is supplied with a constant current source
I p , it is very difficult to keep the noise in this source below 10−6 × I p . With
an average value of R = 100 Ω, this noise causes an error with a constant
RMS value in the measurement of 10−6 × 100 Ω. With δR/R ≤ 0.01, the signal
to noise ratio is less than 104 and because the error signal has a constant
RMS value this is unacceptable for any precision measurement system.
Also the temperature has a large effect on most resistive sensors and this
influence directly interferes with the resistance change of the measurement.
To reduce the negative effects associated by the noise in the voltage source
of impedance measurement electronics, sensors like strain gages are mostly
connected in a Wheatstone bridge, named after the British scientist Sir
Charles Wheatstone (1802 - 1875), who popularised its use after the inven-
tion by the British scientist Samuel Hunter Christie (1784 - 1865).
The thinking model behind the usefulness of the Wheatstone bridge is best
explained by means of Figure 8.7. In principle the bridge consists of two
voltage dividers that share the same voltage source. These voltage dividers
560 Chapter 8. Measurement in mechatronic systems
are also called the two branches of the Wheatstone bridge. The first branch
consists of Z1 and Z2 with a corresponding output voltage V1 . Similarly
the second branch consists of Z3 and Z4 with output voltage V2 . The total
measurement signal is obtained by using an ideal differential amplifier to
amplify only the difference voltage Vo = V1 − V2 without loading the bridge.
The benefit of this configuration becomes clear when starting in the situa-
tion that all four resistors are equal. This would result in equal voltages
V1 = V2 = 1/2Vp and the voltage difference Vo would then be zero. This is
called an ideally balanced bridge and in that case the influence of the power
supply voltage Vp would be cancelled in the differential voltage. It would
only create a common-mode noise voltage and this will be rejected by the
differential amplifier.
As soon as one of the resistors, for instance Z2 , changes its value, it will
cause V1 to change, resulting in a non-zero voltage difference Vo . In this
new situation the noise of the voltage source will also be observed in the
differential voltage signal but at a much smaller value, proportional to
the measurement signal. With the same sensor and source voltage of the
previous example, this would result in a relative RMS noise level of 10−6 of
the signal value, which is far better than the previously obtained constant
RMS noise level of 10−4 of the maximum signal value.
In theory it is not really necessary that all resistors are equal to realise a
balanced bridge, as long as V1 remains almost equal to V2 . This condition
is always met, when the ratios of both branches of the Wheatstone bridge
are equal ( Z1 : Z2 = Z3 : Z4 ). For example one of the branches can consist
of resistors with much higher values than those in the other branch, like
Z1 = Z2 = 2 kΩ and Z3 = Z4 = 200 Ω.
In principle also Z1 can be different from Z2 as long as their ratio is equal
to the ratio between Z3 and Z4 but that is in most cases not a preferred
situation. With most sensors the impedance variation is only a small fraction
of the nominal impedance of the sensor, proportional to the measurand i ,
so δ Z/Z ∝ i 1. For those sensors a maximum sensitivity is obtained when
Z1 = Z2 and Z3 = Z4 .
This statement can be proven by calculating the sensitivity of for instance
V1 as function of its measuring impedance Z2 = R + δR for different values of
the other impedance Z1 = xR . With δR/R = c m i , where c m is a constant, δR
can then also be written as R c m i and the voltage V1 becomes:
R + δR R (1 + c m i ) 1 + cm i
V1 = Vp = Vp = Vp (8.18)
xR + R + δR R ( x + 1 + cm i) x + 1 + cm i
0.25
'
"cs )
0.2
0.15
0.1
0.05
0
0 1 2 3 4 5 6
~ x
Figure 8.8: The sensitivity factor of a wheatstone bridge is maximum when the
impedance values in each branch are equal ( x = 1), resulting in a working
point of half the supply voltage.
the corresponding value of the linear gain K becomes equal to the derivative
of V1 over i :
dV1 ( x + 1 + c m i )( c m ) − (1 + c m i )( c m )
K = = Vp
di ( x + 1 + c m i )2
xc m x
= Vp ≈ Vp c m = Vp c m c s (8.19)
( x + 1 + c m i )2 ( x + 1)2
with c s being the sensitivity factor of the Wheatstone bridge. The mentioned
approximation is allowed when c m i 1. The maximum sensitivity is found
when the derivative of c s over x becomes zero:
d c s ( x + 1)2 − x(2 x + 2) ( x + 1)2 − 2 x( x + 1) 1− x
= 4
= 4
= (8.20)
dx ( x + 1) ( x + 1) ( x + 1)3
This derivative is zero for x = 1 and x = ∞, but the second value of x also
gives a zero value for the sensitivity K . This means that only x = 1 is a
useful answer, proving the statement that the values of the impedances in
each branch of the Wheatstone bridge should be approximately equal for a
maximum sensitivity.
This conclusion is further illustrated in Figure 8.8, emphasizing the non-
linear relation between the sensitivity and the ratio of the impedances. Only
in a small area around x = 1 the sensitivity is almost constant.
As a last variation on the Wheatstone bridge, the two branches might differ
in the kind of impedance that is applied. For instance one branch can
consist of two resistors, while the other branch can consist of two complex
562 Chapter 8. Measurement in mechatronic systems
+
Vp Vp Vp Vp
R R R R R RR R RR R R
1
3
3
Vo _ Vo _ Vo _ Vo _
+
+
4
4
2
2
R RR R RR R RR R R RR
a: Single b: Double, opposite sign c: Double, same sign d: Two double, opposite sign
impedances of the same kind, like two capacitors or two inductors. Other
combinations with for instance only one capacitor are not allowed as then the
output voltages would show a relative phase shift for AC voltages because
of the filtering. This would result in a measurable voltage difference with a
corresponding noticeable noise level, even without measurement signal.
The application of complex impedances will be demonstrated more in detail
in Section 8.6.2 and thereafter with the presentation of capacitive and
inductive proximity sensors.
When all resistors in a balanced resistive Wheatstone bridge share the same
temperature and the same temperature sensitivity, the output voltage is
also no longer affected by the temperature. This seems attractive at first
sight, but these conditions are not easily met, because the temperature
of all elements are hardly ever equal. The not-sensing impedances of the
bridge are in most cases located inside the measurement instrument, at
some distance from the measurement location. In principle the requirement
for an equal temperature can be alleviated a bit as only each branch needs
to be isothermal. This is based on the reasoning that in that case the ratio
of both branches is not affected by the temperature. This means that one
of the branches can remain inside the measuring instrument at an equal
temperature, while the other branch is fully located at the measurement
site.
8.3. Sensitive signals in measurement systems 563
With the given values, the output voltage of the single sensor version with
Z4 = R + δR and Z1 = Z2 = Z3 = R becomes:
1 R + δR
Vo = Vp − (8.22)
2 2R + δR
After combining the terms with a common denominator, this equation can
be written as:
δR δR
Vo = −Vp ≈ −Vp (8.23)
4R + 2δR 4R
The approximation is allowed as long as δR 2R . In the not approximated
equation the δR term in the denominator causes a small non-linearity in
the sensitivity. Also the thermal effects in Z4 are not compensated.
The situation becomes however quite different with two sensing elements
with an opposite sign of the impedance change. This is shown in the second
drawing of Figure 8.9, where the second sensing impedance is located at
the other impedance of the same branch, so Z3 = R − δR . With this double
sensing principle the output voltage becomes:
1 R + δR δR
Vo = Vp − = −Vp (8.24)
2 2R 2R
In this case the thermal effects are still compensated as these work equal in
both branches but the non- linear terms in the not approximated equation
still underline the optimal location of both sensors in the same branch.
The negative effect of using two sensors in different branches is also demon-
strated in the third configuration of Figure 8.9. This configuration is unfor-
tunately the only option, when the second sensing element has the same
sign for its sensitivity as the first sensing element. In that case for instance
Z1 and Z4 are replaced by the two equally acting sensing impedances R + δR .
The calculation of the output voltage gives the following result for this
situation:
δR δR
Vo = Vp ≈ Vp (8.26)
2R + δR 2R
Although the sensitivity is also approximately twice the value of the single
sensing principle, the non-linearity is not cancelled. The main disadvantage
is however that the thermal problem is not solved, because of the different
location of the impedances in their branch. A simultaneous equal change
in the impedance by the temperature will cause a differential interfering
voltage adding to the voltage of the useful measurement signal.
It should be noted that the same result but with a different sign would be
obtained when Z2 and Z3 are replaced by the two sensing elements instead
of Z1 and Z4 .
Thinking further on this path, it appears that the most optimal configuration
is created, when all four impedances of the Wheatstone bridge would be
replaced by sensing elements. This results in a total signal of:
R − δR R + δR δR
Vo = Vp − = −Vp (8.27)
R + δR + R − δR R + δR + R − δR R
In this configuration, both the temperature effects and the impact of the
noise of the source is minimised, the sensitivity is maximum, the sensor is
linear around the working point and only 4 connections are needed to the
measurement electronics.
It will be shown in Section 8.7.1 how this can be accomplished in an inte-
grated strain gage element.
Remark on non-linearity: The argumentation regarding non-linearity
has become less important over the years with the introduction of digital
8.3. Sensitive signals in measurement systems 565
Like mentioned before, in most cases the electrical signals that are gener-
ated in a sensing element, are still susceptible to interfering signals from
external sources of energy, because of their small amplitude and high source
impedance. The sensitivity for these disturbances demands a careful con-
sideration of the interconnection from the sensing element to the signal
conditioning element. Although this interconnection is the most critical in
a measurement system also the other connections can cause problems but
first the purely analogue most sensitive interconnection will be examined.
All elements have to be interconnected with minimal two wires, because
electricity flows in a loop, and both wires are susceptible to interference.
Figure 8.10 gives an overview of the interconnection between two elements
of a measurement system with the different external disturbance sources
that can act on this interconnection.
+
_
Vn,c
Capacive interference Magnec interference
Sensing Vn,s+
Signal condioning
+
_
Zs Z c1 Zo
+
d's
+
+
dt Vi
Vs Z4 GVi
_ _
Signal loop
_
+
_
Z c2
Vn,s-
Zg1 d'g Zg2
Ground loop
dt
1 Ground 2
_ Vn,g
+
Figure 8.10: Overview of different sources of interfering signals acting on the con-
nection between two elements in a measurement system. Changing
electric fields insert a capacitive current in the connecting cables and
changing magnetic fields induce a voltage in closed loops. Grounding
at different places includes the voltage differences of these places in
the measurement.
+ + + + +
+ + + +
d'
Unshielded twisted pair (UTP) dt
~!
Shielding, return signal or ground
Signal
Coaxial cable
Grounded shielding
'
"
+
_
Vn,c
+
_
Zs Z c1
+
Zi
+
+
Vs Vi Z4 GVi
_ _
Signal loop
_
+
_
Z c2
Sensing
case the situation for both wires is often different because of the different
locations of the wires.
Fortunately it is very well possible to shield the sensitive element by means
of a conductive layer, creating a Faraday shield around the element that
blocks out electric fields as shown in Figure 8.12. This shield can be con-
tinuous but also small perforations are often allowed for cooling purposes
depending on the disturbance frequency. Very high frequencies can however
even penetrate in very small holes, depending on their wavelength!
Capacitive coupling can cause a problem with coaxial cables. In principle
the signal wire is well shielded by the return wire but this return wire
receives all capacitive disturbance signals and the resulting voltage appears
at the corresponding input of the next element. A first solution is often to
choose that input as the grounded input of the measurement system, short
circuiting these disturbance currents to a constant reference voltage. This
is mostly sufficient for less critical situations but sometimes the impedance
Zg of this zero Volt ground point is not low enough, resulting in a small
detectable disturbance on the input.
An even worse situation occurs when the two sides of the connecting cable
are both grounded. At first sight one might think that this will short circuit
the capacitive disturbances better but this additional connection creates a
ground loop with a very low impedance, where magnetic disturbances will
create a voltage that induces significant currents in the loop impedance.
Reducing Zg,1 would even make things worse, because the increased current
by the reduced loop impedance will cause an increased voltage over Zg,2 .
These negative effects are even present inside an electronic instrument,
where the loops are small. Experienced electronic designers are very keen
on avoiding ground loops even on printed circuit boards, by grounding all
electronics at one location.
Especially when the distances between the elements in a measurement
system become large like in a chemical plant, with cables up to hundreds of
metres another problem becomes prominent, the difference in potential of
the ground at different locations. This potential difference can be caused by
leakage currents of large grounded high-power electrical systems, lightning
and magnetic fields.
For these reasons double grounding should in principle always be avoided.
Safety regulations however often require the housing of mains fed electronic
570 Chapter 8. Measurement in mechatronic systems
Zs Z c1
Signal
Z4
Sensing condioning
+
_
Z c2
Vn,s-
Zg1 d'g Zg2
Ground loop
dt
1 Ground 2
_ Vn,g
+
Figure 8.13: Grounding more than one element of a measurement system will cre-
ate ground loops that introduce common-mode magnetic disturbances
and voltages due to other sources. Even though differential amplifiers
can reduce the problem, the related voltage levels can be very high,
especially with large distances, so ground loops should be avoided.
I2
R2
I1
Inverng R1 _
+
G ()
+
Non +
R1
inverng
+
Vi,1 Vi,2 R2 Vo
_ _ _
Figure 8.14: A simple differential amplifier with one operational amplifier suf-
fers from different loading of the input signals. The current of the
non-inverting input is determined by R1 + R2 , while the current at
the inverting input is determined by R1 and the voltage at the non-
inverting input.
elements this is however not the case and the input impedance has to be
increased by adding additional buffer amplifiers before the actual differential
amplifier.
Another reason for introducing additional amplifiers is the need to apply
only small resistor values, because of noise. The noise in a resistor depends
on the resistor value and the temperature as was shown in Section 8.2. Also
the input-current noise of an operational amplifier results in a noise voltage
at its output that is determined by the feedback resistor value. In practice
both inputs of an operational amplifier should “see” a resistive impedance
of less than 1 kΩ to achieve the minimum noise level and such a low value
of the load impedance is not preferred for many sensing elements.
A third reason to adapt the simple differential amplifier configuration is
based on the common-mode rejection ratio. In practice it is very expensive
to apply resistors with a better tolerance than 0.1 %. As a consequence,
with these resistor tolerances, the amplification of the inverting and non-
inverting input of the differential amplifier will also differ approximately
this same amount. As a result. the common-mode amplification will be
only a factor 103 below the differential-mode amplification, which is equal
to only −60 dB. This is far worse than the standard CMRR of an operational
amplifier, which is often better than −100 dB.
Figure 8.15 shows an amplifier that uses two additional operational ampli-
fiers at its inputs to solve these problems. This configuration is called an
8.4. Signal conditioning 573
I
R1,2
_ I
_ G ()
+
A2
+
R3
RG I
Vi(d) R2 _
+
I
Vb(2) G ()
+
Vi(2) R1,1 R2 +
+
I
A3
_
Vo
+ +
G () Vb(1) R3
A1
+
Vi(1)
_ _ _ _ _
and:
R1
Vb(2) = Vi(2) − IR 1 = Vi(2) − Vi(d) (8.29)
RG
This means that the common-mode voltage of Vb(1) and Vb(2) is equal to
the common-mode voltage of Vi(1) and Vi(2) while the differential voltage
Vi(d) = Vi(1) − Vi(2) is amplified with a factor R G/R 1 for both Vb(1) and Vb(2) .
As a consequence the common-mode rejection ratio of this combination is
improved with this factor RG/R1 . For that reason most of the gain of an
instrumentation amplifier should be realised in these non-inverting input
amplifiers, while the resistors of the differential amplifier part are often
chosen to be equal resulting in a unity gain of the second part. With this
equal resistor setting the output voltage becomes equal to:
R1 R1
Vo = Vb(1) − Vb(2) = Vi(1) − Vi(2) + 2Vi(d) = Vi(d) 1 + 2 (8.30)
RG RG
Even with a careful design it is not always possible to reduce all sources
of random errors to an sufficiently low level. In that case it is preferred
that these errors are first reduced by filtering, before the signal is further
processed in the measurement system. Simple filtering can be very use-
ful if the disturbance signal has a different frequency spectrum than the
useful signal. In that case the active filter configurations can be used like
described in Section 6.2.7 of Chapter 6. Examples are the reduction of
the high-frequency part of wide-band noise of electronic components and
single frequency disturbances can be filtered by a selective band reject filter.
The second-order “taming” low-pass filter in the PID-controller that was
presented in Chapter 4 also serves this same purpose.
Unfortunately often the disturbance signal occurs in the same frequency
band as the measurement signal. This is especially the case with low-
frequency excess noise because most mechanical measurements have also
8.4. Signal conditioning 575
+
< '
<
+
+
+
+
_ _
Vi Va Vg Vo
_ _ _ _
Vi
Va
Vg
Vo
t1 t2 t3 t4 t5 t6 t7 t8 t9 t10
Time
In radio transmission the sine wave carrier signal is multiplied with a com-
bination of a DC bias voltage and the music signal, as shown in Figure 8.17.
This method has the advantage that demodulation is fairly easy and for that
reason it is also useful in measurement systems, when the measurement
signal does not contain a DC value.
As a first step, the measurement signal is added to the DC bias voltage,
large enough to keep the voltage always unidirectional. By multiplying this
combined DC+AC signal voltage with the carrier signal, the resulting high-
frequency signal will never get a zero amplitude. This modulated signal can
be amplified with a selective amplifier with a band-pass transfer function
around the carrier frequency.
Demodulation is achieved by simple rectification, followed by a second band-
pass filter around the measurement signal frequency. This filter cancels the
DC bias voltage and the high-frequency remains of the carrier frequency.
To determine the transfer function of the selective amplifier it is useful to
look at the simple mathematics around this modulation principle. When the
8.4. Signal conditioning 577
+
+
+
+
+
Vi Va Vg Vr Vo
Vc
_ _ _ _ _
Vi
Vc
Va
Vg
Vr
Vo
Figure 8.17: Amplitude modulation by multiplying the input signal with the carrier
signal. The input signal consists of the measurement signal and a
DC bias voltage. As a result, the amplitude of the multiplier output
will never be zero. After amplifying and filtering, the demodulation is
done with a simple rectifier and a band-pass filter.
input signal with the DC bias voltage equals Vi = V̂i (1+sin ωi t) = Vi (1+sin 2π f i t)
and the carrier signal equals Vc = V̂c sin(ωc t) = V̂c sin(2π f c t) the multiplication
results in:
This means that the resulting signal contains three frequencies, the carrier
signal frequency f c and two frequencies equally spaced at both sides of the
carrier frequency at a frequency difference equal to f i . These frequencies
are called the side-bands around the sampling frequency and they both
contain the information of the measurement signal at f i .
With this result it can be reasoned that an input signal that contains differ-
ent frequencies up to f max requires the bandwidth of the selective amplifier
to range from f c − f max to f c + f max . In principle one of the two side-bands
could be filtered out and restored with special measures afterwards, but
that procedure is too specialistic for the purpose of this book.
A drawback of this simple scheme with rectifier demodulation is the fre-
quency limitation to AC signals. The synchronous demodulation of the first
578 Chapter 8. Measurement in mechatronic systems
Interference
Modulator Demodulator
!
' !
x
x
+
+
+
+
+
Vi Va Vg Vr Vo
Vc Vc
_ _ _ _ _
Vi
Vc
Va
Vg
Vr
Vo
example with a square wave signal did not have that drawback. Fortunately
it is also possible to apply synchronous demodulation with a sine wave
carrier signal, as shown in Figure 8.18. It results in the amplification of
frequencies from 0 Hz and above.
The input signal is not added to a DC bias voltage, because also any DC
content of the input signal should be maintained. The resulting signal after
modulation has a zero amplitude at Vi = 0 and shows a phase reversal, when
Vi < 0. Demodulation by rectifying would result in a frequency doubling
with a DC offset, so the demodulation is done by multiplication with the
same carrier signal.
The average of the resulting signal is equal in shape as the input signal
and can be recovered by a low-pass filter. This method is also called phase
selective detection or homodyne detection and will return later in this chapter
with different sensing principles.
8.5. Signal processing 579
After the signal conditioning element, a strong and robust signal is present
that can be used in an analogue control system. Modern control electronics
work predominantly in the digital domain, which makes it necessary to
convert the momentary values of the electrical signal into a sequence of
data that sufficiently represents the measured value.
Digital systems are characterised by the fact that they work with very
strictly timed events, synchronised by means of an accurate clock frequency
generator, mostly based on a quartz crystal oscillator.
When only the frequency of a periodic signal is important, the analogue
signals can be directly converted into the digital domain. This frequency-
to-digital conversion process is achieved by means of a comparator, a one
bit bistable element that outputs either a “zero” or a “one” depending on
the voltage level at the input. In chapter 6, a special version of such a
comparator, the Schmitt trigger, was used in the pulse-width modulator of
a switched mode power amplifier. In measurement systems the Schmitt
trigger is used to suppress disturbances.
With real analogue-to-digital conversion the magnitude of a value needs to be
digitised and that is done by taking a momentary value of the measurement
signal and generate a number to the magnitude of that sample. In the
following sections, first the Schmitt trigger will be presented, followed by
the process of creating sampled numerical data from the magnitude of a
measurement signal.
Vo +
Vi
Vo
Schmi trigger 0
t0 t
Vo
+
Vr
Vi Vo -
Vr - Vo +
Vi Vr +
Vo 0
Vr t0 t
Vr -
Vo -
Digital output
Digital output
LSB
Lig
Decimal Gray Biary
llimatr ls
0 0000 0000
1 0001 0001
2 0011 0010
3 0010 0011
4 0110 0100
5 0111 0101
6 0101 0110
7 0100 0111
8 1100 1000
9 1101 1001
10 1111 1010
LSB 11 1110 1011
12 1010 1100
13 1011 1101
14 1001 1110
MSB 15 1000 1111
Pht tectr
Figure 8.21: With direct measurement of digital data, a binary code can cause
problems by not synchronised changing of all related bits. This is
solved by using a Gray code that switches only one bit at any increment,
like in this example of an absolute rotary optical encoder. At the right
a 4-bit coding is shown as example.
(Courtesy of Heidenhain)
Vi Sampling Hold Vh
_
Vi S0 S1 S2 S3 S4
Vh S5 S6 S7 S8 S9
Figure 8.22: Sampling a signal is done with a switch, while a capacitor will hold
the value constant until a new sample is taken. A buffer amplifier
before the sampling switch prevents excessive loading of the input
signal by the hold capacitor.
clock cycles to determine all bits of the digital value of an analogue sample.
During this process each determined bit is stored in the latch and only when
all bits are found, this number is transferred to the digital processor in the
last clock cycle.
The incremental nature of digital signals is both related to the value and
to the time. Numbers can not change gradual and are only valid over a
certain period. This fact implies that digital numbers represent samples
of a momentary value of the analogue signal. Depending on the method
of analogue to digital conversion, the sampling frequency can be constant,
synchronous with the clock frequency of the digital controller or variable
depending on the signal value. The latter can be synchronised afterwards
by means of a memory where the data are written asynchronous but the
readout is synchronised.
Samples are by definition only exact at the moment of sampling and this
automatically means that intermediate information is lost. In case of a
varying analogue signal, it seems logical that the sampling frequency should
at least be larger than the largest frequency in the signal.
It is important to see, what the limitation really is.
584 Chapter 8. Measurement in mechatronic systems
With this rule in mind, first the sampling process itself is considered. Fig-
ure 8.22 shows how sampling can be done in practice. The samples of the
signal are taken with a switch that shortly connects a capacitor to the signal
voltage at the output of a buffer amplifier. During this sampling moment
the capacitor is charged to the momentary signal voltage. After opening
the switch, the sampled voltage remains stored in the capacitor until the
next sample is taken. As will be shown with the A to D converters further
in this chapter, this sample-and-hold functionality is necessary for those
converters that need the input signal to be constant during the conversion
process. Other converters do not have that requirement as they average the
input signal over each sample.
One observation that can be made from the figure is the time delay that is
caused by the “hold” functionality. In principle this delay corresponds with
the fact that information is lost between the samples. The average time
delay by sampling is half the sampling period and unfortunately this delay
can never be restored! This in itself is an important determining factor for
the minimum sampling frequency when the measurement is used to control
a process.
Even more important is the effect of sampling on the frequency spectrum.
In principle, without the hold function, sampling alone is equal to amplitude
modulation of a repeated impulse signal, while the holding action results in
a kind of square wave modulation. With reference to the previous section on
signal conditioning with amplitude modulation, the frequency spectrum of a
sampled signal also shows “side bands” around the sampling frequency. With
a sampled impulse signal this pattern is repeated at n-times the sampling
frequency due to the wide frequency spectrum of an impulse.
This phenomenon is illustrated in Figure 8.23, where also the effect of the
"hold" functionality is shown with a low-pass filter character, reducing the
magnitude of the spectrum at higher harmonics of the sample frequency.
The upper graph in the figure shows the frequency spectrum in case the
sampling frequency is much higher than the maximum signal frequency.
The original signal spectrum could be simply recovered by filtering away all
8.5. Signal processing 585
Magnitude
Filter
Filter
Figure 8.23: Sampling a signal will create mirroring side bands of the frequency
spectrum of the signal around n−times the sample frequency, where
n is an integer > 0. When f s is smaller than two times the maximum
signal frequency, overlap between the spectra will occur, impairing
the possibility to reproduce the original signal. The effect of the “hold”
action at higher frequencies is shown in red.
Aliasing
Figure 8.24 shows what happens when the sampling frequency is for instance
equal to the maximum frequency of the input signal spectrum. The side
bands around the sampling frequency overlap with the frequency spectrum
of the analogue input signal and any frequency of this spectrum will get
a corresponding alias frequency that is lower than the original frequency,
equal to the frequency difference f d = f s − f i . When using these samples for
reconstruction two frequencies would result, the original and its alias, which
has given the name to this effect. In case the sample frequency is higher
than two times the signal frequency this difference frequency remains larger
than the maximum signal frequency and when all frequencies above the
original signal frequency are filtered out, only the original frequency would
remain.
586 Chapter 8. Measurement in mechatronic systems
Magnitude
Alias of fi
fs - fi fi fs fs + fi
-Vr
R 2 3
C
1 _
R Set
Integrator
+
Dual-level Micro- Data
+ processor
Comparator
_ V
i
Reset System
0V clock
-Vr
1
0
2 -Vr
3
0
in less critical situations like with controllers for central heating of buildings.
This converter does not require a separate sampling and hold function and
can be realised with only few components around a microprocessor at a
very low cost. The second converter is the frequently applied successive-
approximation AD converter. It combines a high precision with an average
speed and has long been a standard in A-to-D conversion. This converter is
not as fast as the third converter, because it requires a separate sampling
and hold function and the quantisation error is maximum 1 LSB. The third
converter is the Delta-Sigma converter. Its principle is derived from the
dual-slope ADC, but adapted such that it creates synchronous samples of
alternating voltages at a very high sampling frequency.
8.5. Signal processing 589
A more precise and relatively fast conversion procedure is achieved with the
successive-approximation analogue-to-digital converter. With this method,
the output of a digital-to-analogue converter (DAC) is compared with the
input signal, while an efficient algorithm is used to get the right digital
number in the least possible steps. Figure 8.26 shows this working principle
with a DAC based on a R-2R ladder-network.
The R − 2R ladder-network is best understood when starting at the first sec-
tion with the least significant bit (LSB) switch S1 . The source impedance at
point (1) is two times 2R in parallel, which results in a Thevenin impedance
of RTh = R . This is indifferent of the setting of switch S1 . With both settings
a low impedance path to ground is created at the switch, as a voltage source
has a zero source impedance.
The Thevenin voltage Vth(1) of this first section at point (1) equals either zero
or 0.5Vr , depending on the setting of S1 . The Thevenin impedance of the
first section is loaded by the second section and creates a voltage divider
with the resistances R and 2R of the second section, where the 2R resistor
is connected to ground at S2 , either directly or via the voltage source. This
voltage divider results in an attenuation of the Thevenin voltage of the first
section with a factor two, so either 0 V or 0.25Vr .
The equivalent Thevenin source impedance of this first and second section
combined also equals R as two 2R resistors are connected in parallel to
ground like with the first section. When switching S2 to Vr the resulting
Thevenin voltage at point (2) will be the sum of the voltage from S1 (0 or
0.25 ×Vr ) and the voltage from S2 (0 or 0.5 ×Vr ) resulting in a total voltage
of 0, 0.25, 0.5 or 0.75 ×Vr , depending of the settings of S1 and S2 .
This reasoning can be repeated for the next stages to get the total voltage at
point (3) and point (4). While in the figure only 4 bits are shown, an accurate
multi-bit DA-converter can be designed with only two values of resistors,
which is very suitable for realisation in a precision integrated circuit.
The logic circuit for the AD-converter works as follows:
• Then the second significant bit is switched on and the same decision
process takes place: “Keep bit setting if output of DAC is smaller than
8.5. Signal processing 591
Comparator
S1
+ 1/0 S2
+ Logic processor
_ S3
_ V
i S4
2R 2R 2R 2R 2R
+
S1 1 S2 1 S3 1 S4 1
0 0 0 0
LSB MSB _ V
r
Analogue voltage
reset
reset
reset
keep
keep
keep
keep
keep
set
set
set
set
set
set
16 voltage levels
reset
reset
keep
keep
set
set
set
set
set
set
1 0 1 0 0 1 0 1 0 1 1 1
MSB (8) LSB (1) MSB LSB MSB LSB
Time (8 clock pulses) Time Time
that this stabilisation of the voltage at the DAC is always necessary because
of any parasitic capacitances that are always present in any electronic circuit.
This is indicated in the figure by the gradual slope after the (re-)setting of
any bit. This timing requirement inherently leads to a the drawback of this
method as it requires many steps for a high number of bits. During these
steps the input voltage has to remain constant until the entire procedure
is finished. This means, as was stated before, that after the moment of
sampling the momentary value of the input voltage has to be “remembered”
as long as necessary with a sample and hold circuit. Also this process
needs time as the conversion process can only start, when the hold value is
stabilised.
The consequence of this reasoning is that very high speed electronics are
needed, which in practice always implies an increase in cost.
8.5. Signal processing 593
• A latch that transfers the momentary value at its input tot the output
at the positive edge of the clock.
• A digital processor that creates digital data from the output of the
latch.
The non-inverting integrator integrates (Σ) the difference (Δ) between the
momentary value of the input signal and the output of the 1-bit DA-converter.
The output of the non-inverting integrator will rise when this difference is
positive or fall when this difference is negative. Like with the pulse-width
modulator, the comparator is used to detect the moment that the output
of the integrator crosses a reference value, but in this case the hysteresis
594 Chapter 8. Measurement in mechatronic systems
*
'
m 5 Bim
_
2 _
3
+ _ +
+
+ 4 Dii D
+
_
L
P
_ V
i
7
6 iD@
Vr+/ Vr*
+
Vr
Vi
~!'
_
Vr
+
2 2Vr
<
!
!*
@
_
2Vr
3 ¦J
!!
~ §J
4
!!
0
5
!!
0
+
6 Vr
!!
*
@ _
Vr
7
L L L L L L
C C C C C C
Figure 8.28: A cable with two wires, like a coaxial cable, is equivalent to a series
of small inductors and capacitors. This combination represents a
characteristic resistive impedance, equal to the required resistive load
to get Q = 1.
Even though the signal conditioning element serves to create a robust signal,
it is necessary to transport this signal with care in a coaxial or twisted pair
cable. Also the digital data after the A to D conversion should be transported
with a careful choice of the connecting cables. Next to the reason to prevent
interference, especially with very long distances, another reason for a well
defined connection are the high-frequency properties of a connecting cable.
Any combination of two wires will have a certain total capacitance and
inductance value. When examining a connecting cable with two wires more
in detail, such a cable can be modelled as an infinite series of infinitesimal
LC -filter elements. Such a combination behaves like a transmission-line
or wave-guide, an electrical equivalent to the travelling wave in the rope
as was presented in Chapter 2. Just like with the mechanical example, a
signal at the input of the cable will arrive with some delay at the output,
but what is more important is that the signal will be reflected back towards
the input, where it will again be reflected. This can cause standing waves
and resonances in the signal.
It was shown in Chapter 6 that an LC -filter can be damped by means of
a resistor and in the same way the reflection of a connecting wire can be
prevented when it is terminated by means of a resistor with a value that
would result in a quality factor Q = 1. With Equation (6.40) the value of this
8.5. Signal processing 597
This value is equal for any length of the same cable as both L and C depend
proportional to this length. It can be mathematically proven that a cable
with infinite length represents an input impedance that is equal to this
characteristic impedance. By terminating a non-infinite cable with its char-
acteristic impedance, the cable will behave dynamically like it is infinitely
long. This means that an optimal connection is achieved, when both the
output impedance of the preceding element and the input impedance of the
succeeding element are chosen equal to the characteristic impedance of the
cable.
When that condition is not met, a short cable might possibly not raise much
problems as it will only give an undamped response at a very high natural
frequency. For example a coaxial cable with a characteristic impedance of
50 Ω has a capacitance of approximately 50 pF per metre. Based on these
values the inductance over one metre would be equal to L = R c2 C = 125 nH.
When all capacitance is approximated to be concentrated at the end of the
cable and all inductance is in series, this combination would act as an LC
filter with a natural frequency of ω0 = 1/LC = 4 · 108 , which is approximately
60 MHz. No reflections will occur as long as the signals that are transported
do not contain any frequencies at or above ω0 . Actual wired data connec-
tions, however, already transmit frequencies above 1 Gbit per second so
in that case even one metre of cable should be terminated with the right
impedance in order to prevent reflections and high-frequency attenuation.
Longer cables will increase the problem by lowering the natural frequency
proportional to the length of the cable.
The full derivation of the related equations is beyond the scope of this book
but some observations are interesting enough to be added here.
At first sight the equal output and input impedance will result in a factor
two attenuation of the signal. This however is not bad at all as first this
level corresponds with the maximum power that the preceding element can
deliver, as was shown in Chapter 6. This is optimal from an disturbance
point of view as that is related to the interfering energy relative to the
energy in the useful signal. Furthermore, because of the transmission line
character, even frequencies above the original natural frequency will be
transmitted equally well because at any position in the cable the signal
“sees” only the resistive characteristic impedance in two directions and the
resistance is not frequency dependent.
598 Chapter 8. Measurement in mechatronic systems
Ii +
Vp
Io
?
Figure 8.29: Transfer of a digital signal with a fibre coupling prevents electric
disturbances and errors by ground loops. It consists of a light emit-
ting diode that transmits light through the optical fibre towards a
phototransistor.
Although well defined wired connections are very suitable for many ap-
plications it still suffers from the problems with ground loops and other
disturbances, that were previously presented. A real solution of all these
problems is based on the transfer of information by means of light. In Sec-
tion 6.3.3.6 of Chapter 6 the opto-coupler was introduced, consisting of a
light emitting diode (LED) and a light sensitive phototransistor that are
electrically insulated from each other. A current in the LED will give a pro-
portional light output that creates a proportional current in the transistor.
The current amplification ratio of the transistor results in a current level in
the same order of magnitude as the current in the LED. Also light sensitive
diodes can be used. They have a higher switching speed but also a reduced
current because they fail the current amplification of the phototransistor. A
clear advantage of the opto-coupler is the galvanic insulation that completely
cancels any kind of ground loops, although at very high frequencies some
capacitive coupling still remains. Because of the significant temperature
dependent non-linearity, the application of opto-couplers is mainly limited to
the digital domain, so after the signal processing element. A special version
of the opto-coupler is the fibre coupling of Figure 8.29. The insertion of an
optical fibre between the LED and the phototransistor completely cancels
the capacitive coupling but more important is the possibility to transport
the signal over very long distances without any electromagnetic interference.
When necessary even a strong semiconductor laser can used as the light
source. This technology has become the de-facto standard for data transfer
over very long distances like with internet.
8.6. Short-range motion sensors 599
The first category of position and motion sensors are those sensors that are
used to determine the relative position of two objects at short ranges and
these sensors are often called a proximity detector. Typical applications
include the boundary detection for long-range actuated systems. Also the
sensors for fixating a position by means of a servo system like in a magnetic
bearing belong to this category. Often only the accuracy at the zero position
is relevant while the sensitivity over the total measurement range is relevant
for the gain of the servo-loop.
The second category consists of those sensors that register motion like
velocity and acceleration. They are applied in many different fields like in
geophysics to detect earthquakes and in precision equipment for vibration
control.
Because of this wealth of possibilities only a few examples are selected as
these are representative for the entire field
work with an opposite sign. This means that a stable system working on one slope will be
unstable due to phase sign reversal at the other slope. This necessitates special precautions,
when the system is out of range and the stable point has to be retrieved.
600 Chapter 8. Measurement in mechatronic systems
Detector
(Photo-Transistor)
?!
z
Sensor 0
LED. This means that the current in the LED is both a modifying input as
it changes the slope (sensitivity) and an interfering input as it changes the
value at any position. The effect of this can be reduced by comparing the
current with a reference current that is proportional to the current in the
LED. Also supplying the LED with a constant current source as presented
in Chapter 6 is a suitable method to reduce the influence of the LED.
Other error sources include the reflectivity of the object, the angle of the
object, noise in the detector and ambient light that is detected by the photo-
transistor. The influence of the ambient light can be reduced by using an
optical filter in order to only detect the wavelength of the LED light. To limit
the noise of the detector an amplifier with a low input noise level must be
located as close as possible near the sensor. Figure 8.31 shows an example
of such a circuit which is also optimised for high-frequency behaviour as the
transistor is directly connected to the virtual ground at the minus input of
the amplifier. This means that the current from the transistor is transformed
into a voltage at the output of the amplifier without causing a voltage change
over the transistor itself, thus avoiding high-frequency current from the
collector to the base by the internal parasitic Miller capacitor.
I
_
G
+
_
Vo
Vb
+
Figure 8.31: A photo-transistor works optimal with reduced current noise, when it
is supplied with a constant bias voltage Vb . The virtual ground at the
minus input of the operational amplifier cancels the AC voltage over
the transistor which gives a better high-frequency behaviour.
(Courtesy of Jasper Wesselingh)
By dividing the difference of the currents by the sum of all four currents
the resulting value is independent of the irradiance of the light beam which
would otherwise be a modifying input. In reality, the relation between the
position and the found values also depends on the size of the light beam,
the distribution of the light inside the beam and the character of the sensor.
602 Chapter 8. Measurement in mechatronic systems
Beam of Beam of
light light
y
I1 I2 I1 I2
I3 Quad-cell I4 I3 Proporonal I4
Figure 8.32: The four quadrant position sensitive detector either consists of 4 sepa-
rate photo diodes as shown at the left side or of one large photosensitive
area with high resistive paths to the output wires as shown at the
right.
dd
Mirror surface
ds
Oscillang
mirror
Laser
PSD
Scan
lens
z y
Movement direcon
x
Figure 8.34: Optical deflectometer that measures the shape of a surface by a scan-
ning beam and suitable optics that images the spot of the oscillating
mirror to the PSD via the surface. Local angular variations of the
surface will result in position differences of the spot on the PSD.
the image will be positioned at the same distance6 right of the optical axis,
where the PSD sensor is located. Because the spot of the rotating mirror
is imaged on the PSD, the position of this image is not influenced by the
angle of the mirror as long as the surface that must be measured is perfectly
flat. When that is not the case the parallel incident rays on the surface
will not be reflected parallel anymore and an angular deviation at a certain
location on the surface will result in a change of position of the spot on the
PSD. When the corresponding electrical signal is synchronised with the
oscillation of the mirror, this position signal of the PSD gives information
about the slopes at different locations on the surface and by calculation the
shape of that surface can be derived.
6 This is a 1:1 optical system as both image and object are in the focal plane and a positive
lens inverts the sign of the position relative to the optical axis.
8.6. Short-range motion sensors 605
I (~)
w x x
Metal plate
+ A d
x
d 1 2
_ V (~) ε d
s l l l l dH
Distance §¨© dx ·¸¹ Area §¨© dx ·¸¹ Dielectric §¨© dx ·¸¹
dd dA
Dielectric
Figure 8.35: Three types of capacitive position sensing by changing the position of
two plates of a capacitor in two directions or changing the dielectric
properties as function of the displacement.
For air εr equals approximately one but as will be shown further in this
chapter, this assumption is a cause for errors.
This inverse relationship of the capacitance with the distance is the reason
that this physical property is used to determine displacements between
two objects that are positioned close together. An AC source is necessary
to determine the capacitive value with the complex version of Ohm’s law
(V = Z · I ). In Figure 8.35 it is shown that not only the mutual distance
between the plates can be used to measure displacements but also the
other two terms in the simple relation can be made variable corresponding
with a displacement. In case the displacement of one of the plates is in
its own plane the distance remains the same but in this case the shared
surface is changed while in the third example a separate part with another
dielectric value is inserted between the plates which also gives a change in
the capacitance according to the position of this part relative to the plates.
Each solution has its own advantages. A variation of the distance gives
generally a higher sensitivity than the parallel movement of the plates but
the latter has a better linearity which can be concluded from the respective
606 Chapter 8. Measurement in mechatronic systems
E1
C1
V1 V2 _ dx E2 x
+
Vo
+
Vp (~)
+
+
_ dx E3
V1 V2 C2
_ _
Figure 8.36: A differential capacitive sensor has a linear relation between the
displacement x and the voltage Vo .
From the previous part it can be concluded that the errors in capacitive
measurement sensors are caused by many external influences, summarising:
It is clear that the electric source has a large impact if the values are not
exactly known. Fortunately these electrical parameters can be controlled
to high levels of precision and the measurement result can in most case be
compensated. This is less the case with the error by the noise of the sensing
608 Chapter 8. Measurement in mechatronic systems
+
Vp (~) Io
_
R
G () Vo(~) _
+
+
Figure 8.37: By using a guard ring the electric field at the sensing electrode is
homogenised. The capacitance is measured by virtually grounding the
sensing electrode and measuring the current with a transimpedance
amplifier.
Moving part
Io
R
_
+
Vp (~) Transming Guard Ring Sensing electrode
_ electrode G ()
+
Vo(~) _
+
Figure 8.38: By using two electrodes and a guard ring, the distance to a moving
conducting object can be measured without the need to directly contact
the moving part. In this case the guard ring reduces the current that
flows directly over the capacitance between the transmitting electrode
and the sensing electrode.
frequency is filtered out with a selective filter. In case this sensor is applied
in a high bandwidth real time feedback controlled system, all measures for
shielding must be applied, while avoiding loops in the wiring. Furthermore,
the sensing electronics should be located as close as possible to the sensor.
With extreme critical applications this might even requires measures for
cooling the heat that relates to the often high power consumption of the
required low noise amplifiers in order to avoid thermal errors.
n windings
Ac
Ac,1
Ac Ag d
Ac,2 Ag
Figure 8.39: Distance and velocity sensor based on the variable reluctance of an
air gap.
Note that the flux has to pass the air gap twice and that c consists of two
parts, one part is the core with the windings and one part is the return path
to the object of which the distance d is measured.
The self inductance was defined as follows:
Φw,t nΦw n2
L= = = (8.45)
I I ℜ
612 Chapter 8. Measurement in mechatronic systems
L1
V1 V2 _ dx x
+
Vo
+
Vp (~)
+
_ dx
V1 V2
_ _
L2
Figure 8.40: A differential reluctance sensor has a linear relation between the
displacement x and the voltage Vo .
where Φw,t equals the sum of the flux Φw of each winding. It is further
assumed that Φw is equal for all windings. Using this definition, the self
inductance becomes:
n2 L0 c1 n2
L= = with: c 2 = and: L 0 = (8.46)
ℜ0 + c 1 d 1 + c 2 d ℜ0 ℜ0
The voltage over a self inductor is proportional to the change of the total
flux Φw,t over time. This leads to the following relation:
dΦw,t d dI dL
V= = LI = L +I (8.47)
dt dt dt dt
This means that an inductive sensor can use two sensing principles: In
case of an AC current through the coil the voltage is directly related to
the position. With a DC current the voltage is proportional to the relative
velocity. This second principle is hardly used for measurement as using
permanent magnets is far more efficient to detect a relative velocity in a coil
as will be explained later. For that reason this principle will not be pursued
any further.
The first term however is important as the self inductance is inversely
proportional to the distance, similar to a single capacitive sensor. The
related non-linearity can be solved in the same way as with a capacitive
sensor by using a Wheatstone bridge and two sensors in a balanced bridge
as shown in Figure 8.40. The impedance of an inductor is proportional to
the frequency ( Z = j ωL). For that reason the other branch of the Wheatstone
bridge is taken as with the capacitive sensor in order to get a positive output
voltage with a movement in the positive x−direction. The voltage can be
calculated with Equation (8.46) in the same way as with the differential
capacitive sensor:
L0 L0
L1 = , L2 = (8.48)
1 + c 2 ( d − x) 1 + c 2 ( d + x)
8.6. Short-range motion sensors 613
50
0 100
Di<erenal
transformer
Figure 8.41: A Linear Variable Differential Transformer can measure very accu-
rately small displacements around the centre position.
The voltage Vo = V1 −V2 becomes, when applying the rules of a voltage divider:
Z2 Z4
Vo = Vp − (8.49)
Z1 + Z2 Z3 + Z4
+
Alternang V1 Phase sensive detector
magnec ux _
+
+ Amplitude of V1 V2
Vs(~) Value Vo
_ Vo
(V1 V2 ) u Vs Sign Vo
_
+
V2
_
Top
Middle
Boom
Vs V1 V2 V1 V2 (V1 V2 ) u Vs Vo
Figure 8.42: Signals in an LVDT. The secondary coils in an LVDT partly share the
flux of the primary coil. In the mid position the induced voltages are
equal. A position change of the core will increase the flux at one coil
while decreasing the voltage in the other. The difference voltage is
then proportional to the displacement while the phase corresponds
with the direction.
transformer the core is not closed but consists only of a small ferromagnetic
rod with about the same size as the primary coil. In the mid position the
ferromagnetic rod is located just inside the primary coil. The generated
magnetic flux from the primary voltage will only partly be coupled to the
secondary coils. The induced voltage in these coils therefore is small and
equal for both coils. The subtraction of these voltages gives a value of 0 V,
corresponding with this mid-position. When however the core is moved in
the direction of either one of the secondary coils, the coupling with that
coil will increase while the coupling with the other coil will decrease with a
corresponding increase and decrease of the induced voltage. In that case
the subtraction of both voltages results in a voltage with an amplitude that
is proportional to the displacement. Depending on the direction, whether
8.6. Short-range motion sensors 615
AC primary current
Ie
Induced AC eddy-current
moving towards coil A or B, the phase of the resulting signal will be inverted
or non-inverted giving the direction of the position. By using a phase se-
lective detector a DC voltage is obtained with a sign that corresponds to
the direction of the displacement. Phase selective detection can be achieved
by taking the sign of the signal obtained by multiplying the input with
the output and use the rectified and low-pass filtered output signal V1 − V2
to get the amplitude. This principle was also explained in Section 8.4.2.2
regarding synchronous demodulation of an amplitude modulated signal.
suppress the magnetic field. This causes the current in the primary coil to
increase just like with a regular transformer and by measuring this current,
a value for the distance to an object is obtained. Information on the distance
is available both in the amplitude and in the phase of the primary current
relative to the primary voltage which is explained as follows. At a large
distance from the conductive object the coil acts as an ideal self inductance
giving a current with a low amplitude that is 90◦ delayed. As soon as the
distance is smaller the eddy-current will induce a real primary current that
adds to the imaginary current level because the eddy-current is in phase
with the secondary voltage and consequently with the magnetic flux and the
primary voltage. This also means that the consumed power of the sensor
increases and the total current will be less than 90◦ out of phase.
Eddy-current sensors are not very precise but measurements with μm accu-
racy around a fixed setpoint are possible like in magnetic bearings.
8.7. Dynamic measurements of mechanical quantities 617
Mechatronic systems are ruled by dynamic, time varying signals that are
determined by forces acting on the system. The measurement of these forces
and the related velocity and acceleration of a positioning system is most
important in controlling a positioning system for all states. Also in the
dynamic modal analysis of a mechanical part inside a mechatronic system
these dynamic sensors are frequently applied.
F
A
B
Figure 8.44: A force sensing bending beam with different locations for the strain
gages. The gage at location A is elongated while the gages at locations
B and C are compressed when applying a force F in the downward
direction.
618 Chapter 8. Measurement in mechatronic systems
Vi+
Z1 Z3
Vb + Vb-
Z2 Z4
Vi-
Figure 8.45: A full bridge strain gage consisting of 4 strain sensitive resistors.
A strain gage is a piece of thin conductive material that changes its resistance
as function of a change of its dimensions. An elongation in the direction of
the current flow will increase the resistance both by the longer trajectory
and by the corresponding contraction of its diameter by the stretching effect.
These sensors are mainly used in non-destructive measurements with very
limited strain levels. As a consequence the resistive change is also very
limited in the order of less than 1 %.
A strain gage is a passive sensor and needs an external source of electricity
to function. As explained before, the Wheatstone bridge is the best choice
for this kind of sensor and in the following the Wheatstone bridge is taken
as example. It was shown that the temperature effect in the Wheatstone
bridge can be minimised by using a thermally balanced branch with two
sensors with an opposite sign in the sensitivity. In Figure 8.44 it is shown
how this can be realised with strain gages by mounting the second strain
gage close to the first strain gage on a location that either shows an opposite
strain like the reverse side of the bending beam or on the same side but
orthogonal to the first strain gage where the poisson ratio of the material
will cause the surface to show an opposite strain.
The most optimal configuration of a strain gage where both temperature
and source noise is minimised relative to the sensitivity is the full Wheat-
stone bridge with four strain gages. This can be realised on one monolithic
part when using the Poisson ratio effect according to the configuration of
8.7. Dynamic measurements of mechanical quantities 619
Material K factor
Platinum-Iridium (Pt 95 %, Ir 5 %) 5.1
Platinum-Tungsten (Pt 92 %, W 8 %) 4.0
Isoelastic (Fe 55.5 %, Ni 36 % Cr 8 %, Mn 0.5 %) 3.6
Constantan / Advance / Copel (Ni 45 %, Cu 55 %) 2.1
Nichrome V (Ni 80 %, Cr 20 %) 2.1
Karma (Ni 74 %, Cr 20 %, Al 3 %, Fe 3 %) 2.0
Advance (Cu 54 %, Mn 1 %, Ni 44 %) 7 2.0
Armour D (Fe 70 %, Cr 20 %, Al 10 %) 2.0
Monel (Ni 67 %, Cu 33 %) 1.9
Manganin (Cu 84 %, Mn 12 %, Ni 4 %) 0.47
N or P type Silicon 120-175!
d n B
Broadband
light z B
B
0
10
Intensity (abs)
Transmission
>^
10
>
10
Figure 8.46: A fibre Bragg grating is created inside an optical fibre by means of an
area with a longitudinal periodic variation of the index of refraction.
A standing wave will occur at a wavelength λB of which an integer
number just fits inside the grating period d . The light corresponding
with that standing wave is reflected.
(courtesy of Technobis Fibre Technologies)
G1 G2 G3 Gn
Analiser
λ1 λ2 λ3 λn
λ1 λ2 λ3 λ1 λ2 λ3
~!
#'
'
!
t0 t1 t2 t3 tn
'
!
Figure 8.47: A fibre Bragg grating can be used to measure strain at long distances
simultaneous at different locations on the fibre, distinguished by the
wavelength. Also the temporal response of the different reflections
gives information of the strain.
(courtesy of Technobis Fibre Technologies)
The measurement of velocity has long relied on the interaction of the mag-
netic field of a permanent magnet with a moving conductive element. Before
the onset of the electronic era the standard speedometer in cars and mo-
torbikes all worked according to the principle that a moving conductive
disk inside a magnetic field experiences a force that is proportional to the
velocity.
Presently still the majority of cars are equipped with speedometers with
needle pointers but they are no longer driven by moving disks in a magnetic
field but by a moving-coil Lorentz type rotating actuatorthat is controlled
by electronics that receive the velocity information from different sensors.
These sensors often consist of systems with an incremental character, re-
sulting in a frequency proportional to the velocity.
Incremental velocity sensors can be based on many principles. One example
is using a light source, a photo sensor and a wheel with holes that intermit-
tently blocks the light as function of the angle.
Another example is a rotating permanent magnet that induces an AC voltage
in a stationary coil with a frequency and amplitude that is proportional to
the speed according to the law of Faraday.
These principles will be presented more in depth in Section 8.8 with long
range optical encoders and this subsection will focus on the velocity sensing
that is needed in feedback systems to control damping.
It is often not necessary to use separate velocity sensors in precision posi-
tioning systems for damping in a full state feedback configuration. In most
cases damping is achieved with PD-control by differentiating the position
measurement signal as was explained in Chapter 4. In certain situations,
however, it is better to use a separate sensor, especially when the reference
for the damping action is undefined. This is the case with active vibration
isolation systems like will be presented in Chapter 9. In those systems a
damper connected between the isolated part and the vibrating environment
introduces an increase in the transmissibility of the vibrations at higher
frequencies. By using an active control system with a low stiffness actuator
and a velocity sensor, damping can be created without an increase of the
transmissibility as long as the sensor measures the velocity relative to a
reference that is completely free from vibrations.
Such a quiet reference is realised with an inertial velocity sensor like the
geophone that was originally designed to measure low-frequency vibrations
like earthquakes (Geo-phone = sound of earth).
624 Chapter 8. Measurement in mechatronic systems
Casing
N
Permanent Coils
Magnet =
Seismic
S
mass
Membrane
Figure 8.48: A geophone has a moving coil system supported by a compliant mem-
brane around a permanent magnet that induces a magnetic flux
through the coil. The movement of the coil will create an electro-
motive force, proportional to the velocity of the movement.
(©Aaronia AG, www.aaronia.com)
8.7.2.1 Geophone
Case mc
Figure 8.49: The dynamic model of a geophone as used for deriving the transfer
function.
and no voltage will be induced. Above the resonance frequency the seismic
mass will decouple and its amplitude will rapidly drop resulting in a velocity
difference and a measurable voltage. With the standard motion equations,
the influence of the internal damping on this system can be determined.
This damping is caused by eddy-currents in the coil former that often is
made from solid copper for increased mass and behaves like a short circuited
coil.
The relation between the velocity difference Δ ẋ = ẋc − ẋm and the velocity of
the casing ẋc is defined by the following equation using the rigid body model
from Figure 8.49:
d
( xc − xm )
Δ ẋ ẋc − ẋm dt s( xc − xm ) xc − xm
= = = = (8.51)
ẋc ẋc d sxc xc
xc
dt
With the second law of Newton the force balance on the moving mass m m is
defined:
^
10
0
10
Magnitude (abs)
>^
10
>
10 Ns/m
c `
c = 30 /mNs
>`
10
180
Z\
Phase [deg]
90
\
0
>^ > 0 1 ^
10 10 10 10 10
Frequency [Hz]
Figure 8.50: The Bode-plot of a geophone with a moving mass m m = 1 kg, a spring
with a stiffness of k = 1 · 102 N/m and two values of the damping co-
efficient c = 6 Ns/m (green) and c = 30 Ns/m (blue) shows the effect of
damping on phase and amplitude and the low-frequency bandwidth
limitation.
When the voltage sensitivity of the moving coil equals Vg = c g ( ẋc − ẋm ), where
c g in Vs/m equals the sensitivity constant of the geophone, the total transfer
function becomes:
Vg ẋc − ẋm m m s2
= cg = cg (8.56)
ẋc ẋc m m s2 + cs + k
Depending on the application, these two aspects can be optimised such that
a flat response above the resonance frequency is obtained with an acceptable
phase response.
It is clear from this example that the values of the mass and stiffness be-
come impractical when the bandwidth is extended to even lower frequencies.
Under gravity conditions this spring would already need to elongate with
0.1 m to compensate the static gravity-force on the mass, which means that
a pre-stressed spring must be used.
This practical consideration limits the use of this geophone to a fixed orien-
tation in respect to the earth gravity as otherwise a very long coil would be
necessary, that would not be preferable because of the increased noise by
the increased resistance.
In practice the LF-bandwidth limitation can be compensated by a suitable
filter with integrator characteristics to increase the LF-gain, but this is
limited because of the resulting amplification of LF noise (drift).
The main benefit of this geophone for the application in vibration isolation
systems is the reference of the velocity to the inertial reference of a seismic
mass as the inertial universe is by definition free from vibrations.
An alternative method to obtain this inertial velocity signal is based on
using a sensor that even more directly refers to this inert reference by
measuring acceleration. The velocity can then be derived by integration of
the acceleration signal.
628 Chapter 8. Measurement in mechatronic systems
8.7.3 Accelerometers
One way to measure a force is by compensating that force with the same force
by means of an actuator. When the force to current ratio of that actuator
is known, the current that is necessary for the compensating force is a
Posion
sensor
I
Controller Amplier R _ N
+
Permanent
Vs Magnet
S
Seismic
mass
Figure 8.51: Closed loop feedback accelerometer. The controller suppresses the
movement of the seismic mass relative to the casing with a current
proportional to the acceleration.
8.7. Dynamic measurements of mechanical quantities 629
reliable measure for this force. In Figure 8.51 this principle is used in the
closed-loop feedback accelerometer. It is build with the same components as
the geophone with the addition of a position sensor and a control system.
In this configuration the coils function as a linear Lorentz actuator instead
of a sensing element. The position sensor measures the position of the
moving coils relative to the casing. This position sensor can be any reliable
proximity detector as previously presented. The controller and the amplifier
are tuned to keep the displacement as small as possible by supplying a
current to the coils. This position feedback system creates an additional
stiffness by the proportional gain. The necessary current for that action is
proportional to the force on the mass due to the acceleration. By measuring
the current, a value is obtained for the acceleration
In Chapter 4 it was shown how such a position controller should be made in
order to remain stable by placing the poles of the system on the right location
in the complex plane with an additional D-control action. The system is a
servo-feedback system as the controller has to control the seismic mass to
follow the case just like with the optical pick-up unit of the CD player and
the same PID-control principle can be used here.
Because of the fact that this servo controlled accelerometer is a zeroing
measurement system, the position sensor inside the accelerometer only
needs to be accurate around zero. This implies that a relatively simple
sensor can be used with only a low level of the signal drift around zero
when static accelerations over a longer period must be measured. A second
benefit of this configuration is the additional virtual control-stiffness of the
spring due to the feedback that reduces the influence of any non-linearity
in the stiffness of the support structure in the frequency area where the
closed-loop gain is high.
The feedback principle with a high gain at low frequencies makes this
closed-loop feedback accelerometer especially suitable for the measurement
of static or low-frequency accelerations like gravity and the slow motion of
ships or air planes.
It is also a suitable option for low-frequency damping in vibration isolation
systems when the sensor is combined with an integrator to obtain the
inertial velocity. The complete sensor is however quite costly when compared
to the two other accelerometers so the application of a closed-loop feedback
accelerometer is limited to professional use at very low frequencies.
630 Chapter 8. Measurement in mechatronic systems
Shear Design:
Cover
Seismic mass
Piezo ceramics
Post
Socket
Shear force
Base
Compression Design:
Cover
Spring
Seismic mass
Piezo ceramics
Bolt
Socket
Compression force
Base
Bending Design:
Fricon coupling
Cover
Piezo ceramics
Spring
Seismic mass
Bending force and damping piston
Base
Seismic mass mm xm
'x xc xm xc
c k
Piezo xc
Case mc
+
IN Va VTh Va
C _ C
Piezo Piezo
Case mc Case mc
Norton Thevenin
the damping. In practice the damping is quite low which means that these
mechanical dynamics limit the high-frequency bandwidth of a piezoelectric
accelerometer to around 0.3× the resonance frequency. For the shear and
compression design the support stiffness is rather high, giving practical
values for the usable maximum frequency of several kHz, depending in the
size. The bending design is more compliant. With the same size and mass
as with the other design principles the application of the bending design is
limited to high-sensitive low-frequency measurements until an approximate
maximum of a few hundred Hertz.
For situations where only a very low mass is allowed at the measurement
location, the bending design can still be a better alternative because with a
very small size the resonance frequency can become acceptably high again.
+
amplier
1nF
100pF
+ Vo
Rp
VTh Va
Ri
_
_
Figure 8.55: The parasitic capacitance and resistance of a cable and the input
resistance of a buffer amplifier affect the sensitivity and frequency
response of a piezoelectric sensor.
Charge amplifier
10nF
100 M
_
1nF G
+
100pF
+
Rp
VTh
_ Vo
capacitance that acts as a load to the sensor and gives a proportional reduc-
tion of the sensitivity, depending on the ratio between the cable capacitance
and the sensor capacitance. Although this is a not frequency dependent
attenuation, these values should be exactly known in order to take this
attenuation into account with accurate measurements. This is only possible
up to a certain level due to the temperature dependence of these values.
Another problem is related to a possible parasitic resistance over the con-
nector that might be caused by humidity in combination with salt residues
due to manual handling.
Together with the input impedance of the amplifier and the capacitive source
impedance of the sensor this circuit forms a first-order high-pass filter. When
for this example the input resistance Ri equals 50 MΩ, this low-frequency
limit would be:
ωLP 1 1
f LP = = = ≈3 [Hz] (8.63)
2π 2πRC 2π · 5 · 107 · 1.1 · 10−9
because the total capacitance equals the impedance of the 1 nF and 100 pF
capacitor in parallel.
A second problem is caused by the inherent non-linearity of a piezoelectric
crystal.
As was shown with piezoelectric actuators these crystals exhibit a certain
level of hysteresis between the voltage and the deformation. Fortunately
this hysteresis is far less present in the relation between the charge and the
636 Chapter 8. Measurement in mechatronic systems
Although this does not look impressive from a magnitude point of view, the
effect of the parasitic impedances is cancelled as the amplifier will keep the
voltage at the minus input equal to 0 V, being the voltage at the grounded
plus input. This means that the sensor is virtually grounded and no voltage
is present over the parasitic impedances, while all current of the sensor
flows into the feedback impedance.
As this impedance is merely capacitive the voltage at the output is equal to
the stored charge in the feedback capacitor, divided by its capacitance value.
This charge is equal to the displaced charge of the piezoelectric accelerometer
for which reason this configuration is called a charge amplifier. This might
seem strange as the charge itself is not amplified and a “charge-to-voltage
converter” would probably have been a better name.
The reduction of the voltage gain to 0.1 is the direct result of a design
choice which is related to the maximum attainable value of the parallel
resistor over the feedback capacitor. In theory this resistor is not needed but
the bias current of the operational amplifier requires a non-infinite source
8.7. Dynamic measurements of mechanical quantities 637
^
10
Magnitude (abs)
0
10
>^
10
>
10
|J
Phase [deg]
>|J
>}J
>^ > 0 1 ^ 3 5
10 10 10 10 10 10 10 10
Frequency [Hz]
impedance at 0 Hz. The bias current level determines the maximum value
of the parallel resistor together with the allowable DC offset error voltage at
the output. Special amplifiers with MOSFET inputs are developed to achieve
extremely low bias currents in the order of 1 pA which would give a DC error
of 0.1 mV with the indicated 100 MΩ resistor. Compared with the voltage
amplifier situation parasitic impedances parallel to this resistor can more
easily be avoided by careful assembly of the circuit, avoiding contamination
in handling and when necessary, even a protective coating can be applied,
to keep humidity away from the sensitive input of the amplifier. In this case
the low-frequency limitation of the bandwidth is only determined by the
feedback impedances resulting in the following:
ωLP 1 1
f LP = = = ≈ 0.16 [Hz] (8.65)
2π 2πRC 2π · 1 · 10 · 10 · 10−9
8
This is a factor 20 better than with the buffer amplifier. The sacrifice of
the attenuation is easily solved by an additional voltage amplifier after the
638 Chapter 8. Measurement in mechatronic systems
Strain gage
Leaf spring
Seismic mass
Cutout
_
+
+
+
+
_ _ _
device ever. It is applied in safety devices like the air bag for a car, in
photographic cameras and Personal Digital Assistants (PDAs) to determine
the orientation towards gravity.
Like with the other accelerometers, the MEMS accelerometer applies a
seismic mass that is supported by a compliant mechanism, consisting of
several flexure leaf springs. The position of the mass can be measured both
with capacitive sensors and with integrated piezoresistive strain gages on
the supporting springs. The capacitive sensing principle is frequently used
in MEMS devices because of the simplicity. With comb structures the capac-
itance sensitivity can be increased and manufacturing these structures is
relatively easy. With capacitive sensing linear accelerations can be detected
in three orthogonal directions.
Measuring strain with integrated piezoresistive strain gages is an alterna-
tive that requires a bit more processing but it gives a very high sensitivity
640 Chapter 8. Measurement in mechatronic systems
VS DD I/O
A D XL 3 4 5 Power
Management
Control INT1
Sense ADC and
Electronics Digital Interrupt
3- Axis Filter Logic INT2
Sensor
GND CS
Vc
ω,ϑ
Vc
Figure 8.61: Incremental inductive rotation sensor giving a sinusoidal output volt-
age with a frequency depending on the rotation speed and the number
of teeth.
642 Chapter 8. Measurement in mechatronic systems
lamp
ω,ϑ
Ir
ϑ
Figure 8.62: Optical version of the incremental rotation sensor. The magnitude
of the irradiance I r from the lamp is modulated by the teeth of the
rotating wheel.
Optical encoders are widely applied both for angular and linear measure-
ments. This section will restrict to the linear versions, because in precision
mechatronic systems direct linear measurements are most commonly ap-
plied. Nevertheless, the same principles are applied in angular measure-
ment systems.
The most basic example of a linear optical encoder is shown in Figure 8.638 .
The working principle is originally based on the idea to compare the position
8 This figure, together with several of the other figures and the background information in
Collimator lens
Scale
Scanning
Reference mark
Photo detector
Current
level
‘Dark’ Current
Figure 8.63: Basic configuration of a linear optical encoder. A scanning reticle with
a grating is illuminated with parallel rays of light. The shadow of the
reticle is projected on a scale with an equal grating peiod connected to
the moving object. A photo detector detects the resulting irradiance,
that alters periodically as function of the displacement.
of one grating with well defined equidistant slits with the position of another
grating by means of shadow projection.
A parallel beam of light, originating from a light emitting diode and a colli-
mator lens, is projected on the first grating. The shadow of this grating is
projected on the second grating which has equal periodicity and is designed
such that it obscures the light when shifted with half a period in respect
to the first grating while transmitting the light when the gratings are in
phase. This causes the irradiance to be modulated as function of the position
644 Chapter 8. Measurement in mechatronic systems
Collimator lens
Scale
Figure 8.64: Four field incremental optical encoder giving robustness, directional
and absolute position information with high resolution by interpola-
tion.
4 1 2 3 4 1
A C Scanning
D recle
B A
1
Measuring
scale
B
2
Reference
mark
C
3
D
4
Figure 8.65: Sensor signals in the four field encoder as function of the relative posi-
tion of the scales. At the left side the four positions of the measuring
scale are shown relative to the scanning reticle, corresponding with
the numbered vertical lines in the signal plot at the right side.
slits to a binary coded pattern that is read out by a CCD camera. As long
as a transition of the reference mark uniquely points into a specific period
of the incremental encoder signal, the absolute position can be deduced by
setting the incremental counter accordingly.
The single field encoder has several problems that need to be addressed.
First of all it is impossible to determine the direction of movement. Further-
more, the influence of electrical noise (dark current) and contamination of
the gratings becomes large at the low signal level positions.
Figure 8.64 shows an enhanced version of this principle that solves these
issues by the following measures:
• Redundancy by using two sections per phase for the same information
but working in counter phase to have high signal levels at all positions.
I90° I90°
M I0°
0° 90° 180° 270° 360°
I0°
I90q
M arctan
I 0q
Figure 8.66: Interpolation with two sine functions at 90◦ phase shift shown graphi-
cally by means of a Lissajous figure. The angle ϕ is proportional to the
position within one cycle and can be calculated from both coordinate
values.
8.8.1.1 Interpolation
O<set I 90 o
O<set I0 o
Amplitude error
I0 o / I90 o
Phase angle
error
Signal wave
error
Figure 8.67: Deviations of the signals in their shape, magnitude and average value
cause errors in the interpolation.
where I 0◦ equals the combined signal current of the sensor parts with their
extreme values at 0◦ + n · 180◦ while I 90◦ equals the combined signal current
of the sensor parts with their extreme values at 90◦ + n · 180◦ .
Originally this calculation was done in the analogue domain by electronic
addition of both signals with different amplification ratios, followed by a
comparator. Nowadays the interpolation is far more easily achieved in the
digital domain by using high resolution AD converters for digitising the
signals follwed by a direct calculation of the angle.
Under the condition of a fairly well defined sinusoidal shape of the signal
an interpolation factor of 1024 until 4096 can be achieved.
Figure 8.67 shows a qualitative overview of the errors that are caused by
deviations from the ideal waveforms. A low interpolation error can only be
achieved by means of careful mounting and positioning of all the optical
parts in the measurement system.
648 Chapter 8. Measurement in mechatronic systems
Structured
Scale Window photo detector
Scanning
Condenser lens
Figure 8.68: The same properties can be obtained with a reflective scale as with a
transparent scale. This alternative allows to use a very long flexible
metal band scale that can be glued to the moving object. Directional
information is obtained at the photo detector by a slight difference in
periodicity of the gratings, causing a Vernier effect.
Figure 8.69: Using Vernier gratings to obtain both directional and interpolation
information with a one field CCD camera sensor.
calliper, this effect increases the position sensitivity and by reading out
the signal with a structured photo detector like a CCD camera sensor, both
directional and interpolation data are obtained.
The Vernier effect is illustrated in Figure 8.69. The upper half of the figure
shows the effect of two gratings that differ one line over the total width of
the grating of 40 lines. The corresponding Vernier ratio is 40 with a Vernier
period that equals the shown size of the grating. The upper right image
shows the visual effect when the two gratings are on top of each other in
two situations that differ in a relative shift with half a grating period. It
is clear that the dark and light zones have moved half the Vernier period
giving a factor 40 more sensitivity over the movement of the grating. The
moving direction of the dark and light zones is related to the relative moving
direction of the gratings which enables to determine that direction from the
signal of the CCD camera sensor.
Several Vernier regions can be combined to increase the robustness against
contamination as shown in the lower half of Figure 8.69. As the entire
surface of a sensor can only be used once, the designer can decide either for
a very large Vernier ratio or for more Vernier regions for robustness. This
650 Chapter 8. Measurement in mechatronic systems
"
<
-1 0 +1 '
Scale
Lig
diode or
Collimatorlens solid state laser
3 * Photo
detectors
Figure 8.70: A phase grating with a periodicity in the order of the wavelength of
the light result in a very high resolution and a three phase position
signal because of the interference at the detectors.
+1st − and −1st −order at the scale. Secondly the ±1st −order directions from
the scanning reticle will be reflected back orthogonal to the surface of the
scale according to the fact that light, entering under an angle corresponding
with the 1st −order, will be diffracted in the direction of the 0th −order.
Because of the symmetry and the equal diffraction angles, the resulting four
beams are recombined at the scanning reticle where they will undergo a
third diffraction with equal 0th - and ±1st −orders. Only two of these will be
used further in the system. The third is not shown in the drawing and will
in reality be projected by the collimator lens on an area without a sensor.
The collimator lens that originally created the parallel rays from the light
source will create one focused spot from all returning parallel rays on a posi-
tion that is determined by the angle of these rays according to the principles
of optical imaging. Depending on the angles the rays will come together at
three photo detectors.
In the previous chapter on optics, Equation (7.26) showed that the radiance
of the interference of two beams with an equal irradiance is proportional
to 1 + cos ϕ with ϕ being the phase difference of the two beams. This means
that the phase differences between all beams at the point of interference
need to be determined.
652 Chapter 8. Measurement in mechatronic systems
-1st +1st
order order
1 λ Phase shi$ @
1 period displacement
?
'
displacement
x
λ/4 Grang period x
p ' Posive
phase shi$ φm(x) phase shi$ φm(x)
Figure 8.71: A displacement of the phase grating causes a phase shift of the
diffracted 1st and −1st −orders with a sign equal to the sign of the
order. One grating period displacement gives one wavelength phase
shift of the diffracted orders.
For this reason in the second step of reasoning the effect of a displacement of
the reflective phase grating on the phase of the reflected 1st - and −1st -order
light is determined with the help of Figure 8.71.
During the displacement of the grating with one period the phase of the
diffracted orders have also shifted exactly one period of the light. This means
that the incremental phase shift in the refracted orders as function of an
incremental displacement Δ x is equal to:
Δx
Δϕm ( x) = 360 [deg], or:
p
Δx
= 2π [rad], (8.67)
p
even more optimal for interpolation, than the 90◦ shifted signals of the four
field encoder. There is even less need for inverse sensors to overcome the
noise at the low irradiance areas because at any position sufficient signal
with a high irradiance level is available to increase the signal to noise ratio.
There is also no need for complicated structured sensors with Vernier en-
hancement. Furthermore the sensitivity for contamination is acceptable,
as contamination will only cause a reduction of the irradiance by scattering
of light but the phase relationship is not affected. The irradiance levels
of the different beams will however influence the interpolation and a very
strict adherence to narrow tolerances is required when the interferometric
encoder is used at extreme levels of interpolation.
Photo detector (R)
r,R v1+cos(2(
I m
(x)-120))
3 orders 25 %
o +60- m(x)+60
«*`J
$ =2( m(x)-120)
x -60+ m(x)-60
-60+ m(x)+60
Plane
wavefront +60 =2 m(x) x
-60+ m(x)
-60
Photo detector (S)
-60- m(x) Ir,Sv1+cos(2 m(x))
=2 m(x)
+60
-60- m(x)+60
$
+60+ (x) +60+ m(x)-60
m
-60- m(x)-60
'
' +60+ m(x)+60 =2( m(x)+120)
+/- 1 order 50 %
r,T v1+cos(2( m(x)+120))
$ I
Photo detector (T)
Beam delivery
Interferometer
Laser source
Moving object
Reference
Signal cable
Plane
Opcal Fibre Opcal
mirror
Part
Receiver
Electrical
Part
Receiver
Reference mirror
parallel to wavefront
Measurement mirror
Plane wavefront parallel to wavefront
Incident light
50 %
100 %
50 %
50 % mirror x
225 % Fringes
Photo detector
of the light is lost and transmitted back to the source with only 25 % of the
light left to be reflected at the mirror and reach the photo detector. When
all optics are well aligned, the wavefronts of both beams are still parallel
and interference takes place on the photo detector, either constructive when
both beams are in phase or destructive when both beams are out of phase
or partial in the situation in between. The reference mirror is stationary so
only the phase of the reflected beam from the measurement mirror is mod-
ulated by the distance. A displacement results in a sinusoidal irradiance
modulation of the interference at the photo detector according to the 1 + cosϕ
relation from Equation (7.26) in the previous chapter on interference.
Because of the double-passing of the trajectory from the interferometer to
the measurement mirror and back the incremental phase shift related to
an incremental movement Δ x of the measurement mirror equals:
2π N 2π f p N n
Δϕm ( x) = Δx = Δx [rad ], (8.68)
λ c
where N equals the interferometer constant defined as the number of single
trajectories that the measurement beam passes to and from the object, in
this case N = 2. λ, f p and c are respectively the wavelength, the frequency
and the propagation velocity of the light in vacuum while n equals the
refractive index.
During a movement, the irradiance at the photo detector equals zero in
8.8. Optical long-range incremental position sensors 659
every period of the modulated interference. Apparently the energy from the
source has disappeared at those positions. In reality the law of conservation
of energy abides. It appears that the light of both beams that is sent back
to the source shows constructive interference when the light at the detector
shows destructive interference and the other way around. This can be
checked when realising that reflection at a surface from a low refractive
index medium to a high refractive index material introduces a sign reversal
of the phase. This happens at the reference mirror, the measurement
mirror but only at one side of the 50 % reflecting mirror, the side on
the mirror glass with the reflective aluminium coating. When counting
the phase sign reversals at the mirrors, tracing both beams, it shows that
the beam at the photo detector that came from the reference mirror has
undergone two sign reversals while the beam from the measurement mirror
had undergone one sign reversal. This means that the sign of the phase
difference between both beams due to the displacement is reversed.
The beams that are reflected back to the source have either one sign reversal
from the measurement mirror while the beam from the reference mirror got
a total of 1.5 sign reversals. This means that both beams have the same sign
reversal and as a consequence the sign of the phase difference between
both beams due to the displacement is not reversed. This means that the
phase difference in the beams to the sensor and to the source differ exactly
one sign reversal. As a consequence, constructive interference in one of the
two beams corresponds with destructive interference in the other beam.
right side of Figure 8.74. These are caused by differences in path length
due to the tilt11 and their spatial frequency is proportional to the tilt angle.
With a single photo detector these fringes would result in a constant average
irradiance over the surface of the sensor when the fringe period is smaller
than the diameter of the sensor. In a normal interferometer this would
require a smaller sensor at the sacrifice of sensitivity but it is far better
to use a structured sensor like a CCD camera. In that case the movement
direction can be detected as these fringes will move over the surface of the
detector when the measurement mirror moves in the x-direction. A dis-
placement of Δ x = λ/N will cause a shift of the fringes with one fringe period.
This enables a good resolution with sufficient interpolation capability of the
spatial sinusoidal signal. An interpolation factor of hundred already yields
a resolution of 3 nm with a Helium-Neon laser of 632 nm.
Although with further improvements on interpolation even better results
are obtainable, this principle still has some additional drawbacks that have
challenged people into different concepts to solve these. First of all there is
the sensitivity for the tilt in the mirrors and secondly the signal frequency
ranges from 0 Hz until very high frequencies depending on the movement
velocity.
Sensitivity for the tilt of the reference mirror can be avoided by a very stable
mounting of this mirror in the interferometer. Tilt of the measurement
mirror will, however, directly influence the observed effect on the CCD
sensor. This would limit its use to systems with only one degree of freedom
on a perfect straight guiding mechanism that does not introduce angular
movements.
This problem can be avoided by replacing the measurement mirror by a
cube-corner retro-reflector, also called just a cube-corner or corner-cube. Also
the cats-eye can serve this purpose. Both optical systems have in common
that they reflect the light back (retro) in the same direction as where it came
from without any optical path difference for parallel rays, thus keeping the
reflected wavefront flat and parallel to the incident light. Figure 8.75 shows
both optical systems.
A cats-eye consists of a positive lens with a spherical mirror in its focal point.
The radius of the spherical mirror is equal to the focal length of the mirror
11 This effect of these fringes is primarily used when measuring surfaces with interferometers.
The fringe shape and distance represents the value of the surface slope relative to the reference.
8.8. Optical long-range incremental position sensors 661
R=f
Cats-eye Cube-corner
causing the chief ray through the centre of the lens to always be reflected
back in the same direction. As the reflecting surface is positioned in the
focal point of the lens all parallel rays will be focused on one point on the
spherical surface and reflected back under the same angle with the chief
ray which is always orthogonal to the spherical surface.
After the lens the reflected rays will be re-collimated parallel to the incident
rays. Their position is however mirrored in respect to the point of incidence
of the chief ray. For this reason this system is called point-symmetric. The
main drawback of a cats-eye is the fact that a simple two element cats-eye
can only be used with small apertures and approximately paraxial rays as
otherwise spherical aberration and coma will cause wavefront errors. With
aspheric optics or multiple optical elements these errors can be avoided at
662 Chapter 8. Measurement in mechatronic systems
increased cost.
The cube-corner has less problems in that respect because it does not apply
curved surfaces. It consists only of a monolithic piece of transparent material
with three mutually perpendicular mirror surfaces that internally reflect
all rays back in the same direction. Like the cats-eye, the cube-corner is a
point-symmetric retro-reflector.
also work with fully separated beams as long as they have a known phase
and frequency relationship.
The term “heterodyne” refers to the frequency difference. This split-frequency
f s can be in the order of several Megahertz. One method to generate a laser
source with such a dual frequency spectrum is based on the Zeeman effect,
named after the Dutch physicist and Nobel prize winner Pieter Zeeman
(1865 – 1943) who discovered the effect. When the laser cavity of a Helium
Neon laser is inserted in a permanent magnetic field, the normally single
laser-frequency radiation is split in two radiation parts with an equal irra-
diance but with two different frequencies and polarisation states. The two
frequencies are symmetrically spaced around the original frequency, called
the centre-frequency f c . The split-frequency is proportional to the magnetic
flux density. The polarisation direction of both radiation components is
circular with opposite directions. This circular polarised light is converted
into two orthogonal linear-polarised radiation components by a birefringent
plate with one quarter of a wavelength delay between the orthogonal polari-
sation directions. This quarter wave plate or λ/4-plate plays an important
role in the further explanation of the heterodyne interferometer.
A disadvantage of the Zeeman method to create a laser beam with two
frequencies is the limited optical power. This is due to the bandwidth of the
Fabry-Perot cavity from the laser, which is too small to optimally resonate
at both frequencies. This effect is especially prominent when a very high
split frequency is needed. As will be shown later, measuring displacements
with a very high velocity demands such a high frequency.
In case more power is needed at such a high split-frequency, a suitable
light source can be created by modulation of the light of a single-frequency
laser by means of an acousto-optic modulator. An acousto-optic modulator
contains a piezoelectric actuator that creates running sound waves in a
material like quartz. The resulting density differences act like a modulated
moving refractive-index phase grating. Depending on the intensity of the
modulation, the magnitude of the light in the diffraction orders is modulated
in a similar way as was presented on modulation by the depth of a phase
grating. More important is the frequency shift that is caused by a Doppler
effect at the moving sound waves that creates different frequencies for the
different diffraction orders. This frequency difference equals the frequency
of the sound times the order number.
Figure 8.76 shows the principle of the heterodyne interferometer. The dif-
ferent polarisation directions of the two-frequency beam components are
used to guide the light to the measurement and reference optical path in
the following way:
664 Chapter 8. Measurement in mechatronic systems
Referene mirror
parallel to wavefront
Measurement mirror
p - polarised parallel to wavefront
/4-plates Interferene
f1
pol f2 x+y
x
y
f2 x
pol f1
s - polarised Polarising
Polariser Polariser
beam splier
Photo detetor
The combined beam enters the system from the left. The beam component
with frequency f 1 is p-polarised in the plane of incidence of the polarising
beam splitter and the beam component with frequency f 2 is s-polarised.
S-polarised light will be reflected by the polarising beam splitter and p-
polarised light will be transmitted. As a consequence the s-polarised light
will serve as reference beam. It is reflected towards the reference mirror and
passes a quarter wave plate consisting of a birefringent material. When ori-
ented in the right way to the polarisation direction of the beam, this λ/4 plate
creates a phase lag of 90◦ in one of the polarisation directions relative to
the other direction as explained in Section 7.4.1.1 in the previous chapter
and changes the polarisation from linear into circular. After reflection at
the reference mirror the second pass through the λ/4 plate will convert the
circular polarised light again into linear polarised light but orthogonal to
the original direction. As a consequence the now p-polarised beam will be
transmitted through the polarising beam splitter towards the polariser and
the photo detector.
The p-polarised component of the light from the laser source serves as mea-
surement beam. It is first transmitted by the polarising beam splitter and
after a passing through a second λ/4 plate, reflection at the measurement
8.8. Optical long-range incremental position sensors 665
E( f1)
1 0+n·2π
Ir
E( f1)+E( f2)
Phase
0 π+n·2π
2π+n·2π
E( f2)
Phase
Field
Irradiance Time
Figure 8.77: Two interfering beams with different frequencies result in an irradi-
ance signal that is proportional to one plus the cosine of 2π( f 1 − f 2 ).
The phase between both frequencies shifts linear over time.
mirror and a second pass through the λ/4 plate it has become s-polarised and
is reflected towards the polariser and the photo detector where it recombines
with the reference beam.
The polariser fulfils an important requirement as the vectorial addition of
orthogonal polarised electromagnetic fields will not show destructive or con-
structive interference. With an accurately oriented polariser under 45◦ with
both polarisation directions only the component of each beam in the polari-
sation direction of the polariser will pass as shown in the right drawing on
Figure 8.76. This means that after the polariser the fields of both beams
have an equal direction and the vectorial addition will show interference at
the expense of a factor 2 loss of amplitude which corresponds with a factor
two of the radiance. It can be reasoned that also in this case constructive
interference at one location corresponds with destructive interference at
another place, hence corresponding to the physical law of conservation of
energy. In case x and y are in phase in the direction of the polariser as
drawn, the components of pol f 1 and pol f 2 in the orthogonal direction are in
counter phase.
666 Chapter 8. Measurement in mechatronic systems
The resulting interference field at the detector consists of the two fields with
different frequencies, like shown in Figure 8.77, that add according to the
following12 equation:
From this equation the sine term is the very high-frequent centre frequency.
The cosine term is the much smaller frequency difference and can be seen
as the amplitude modulation of the high-frequency term.
The irradiance is proportional13 to the field magnitude squared. Because
the high-frequency term is unmeasurably high it becomes a constant RMS
value of 2 that can be left out in the proportional equation, resulting in
the following expression for the irradiance:
I r ∝ Ê 2 cos2 (π f s t)
1 + cos(2π f s t)
∝ Ê 2 ∝ Ê 2 (1 + cos(2π f s t)) (8.71)
2
ϕs = 2π f s t (8.72)
from the Phd thesis of Jonathan Ellis from our laboratory in Delft. His thesis can be downloaded
for more details.
13 The constant cn is avoided in these expressions as it does not contribute to the under-
0
standing of the principle.
8.8. Optical long-range incremental position sensors 667
change in the optical path difference between the measurement and refer-
ence beam which enables the measurement of moovements with only the AC
part of the signal, thereby avoiding induced errors by low-frequency noise.
The approximation is valid when the motion velocity is very small in respect
to the propagation velocity of the wave which is in practice true for the
speed of light. N equals the previously defined interferometer constant and
presents itself as a multiplication factor for the Doppler effect, correspond-
ing with the number of times that the measurement beam travels the path
to the moving mirror. Furthermore the centre frequency f c is in most cases
allowed to be taken as reference as the split frequency is with even 5 MHz
only a small part ≈ 10−8 of f c . If necessary this small deterministic factor
668 Chapter 8. Measurement in mechatronic systems
p1
Reference
0V
signal
p2
Measurement
signal 0V
π
?
<
(rad) 0
Figure 8.78: The velocity and position of the measurement mirror can be detected
by measuring the difference in the timing period of the reference and
measurement irradiation signal or by measuring the relative phase.
can be taken into account. More important is the refractive index n that
shows to be a factor that directly influences the observed frequency shift. It
will be shown later that the refractive index induces a serious measurement
uncertainty when measuring in air with extreme requirements on precision.
The Doppler shift is equally present as an offset in the frequency difference
at the measurement sensor f m = f s + f d . This means that f d can be detected
by comparing the detector signal from the interferometer with a reference
signal equal to the split frequency f s of the beam components at the entry of
the interferometer. With a Zeeman laser this difference signal is obtained
by creating an interference signal from a small portion of the light via a
beam splitter, a polariser and a detector, similar to the detection after the
interferometer. With an acousto-optic modulated laser the frequency of the
modulator can directly be used as reference.
The detection can be done in several ways. One method uses a comparator
like a Schmitt trigger to detect the zero voltage crossings of the AC part of
the detector signal and the reference signal. When the timing period be-
tween the zero crossings of both signals is precisely measured, as indicated
in Figure 8.78, the difference between both signal periods is an accurate
measure for the frequencies and the corresponding velocity.
When this measurement is done at regular intervals, the incremental po-
sition change over that interval is determined by multiplying the interval
8.8. Optical long-range incremental position sensors 669
ϕt = 2π f m t = 2π( f s + f d ) t = ϕs + ϕd (8.74)
2π f c N n
Δϕm ( x) = ± Δx (8.76)
c
This relation is equal to the phase relation that was found with the homo-
dyne interferometer. This is not without logic as the heterodyne frequency
difference only introduces an additional phase shift to the phase shift caused
by the displacement. For this reason the second detection method measures
directly the incremental phase shift as function of an incremental displace-
ment by means of quadrature detection with a lock-in amplifier.
I LPF
M BPF
×
atan unwrap gain x
Rc
R BPF PLL
Rs ×Q
LPF
Figure 8.79: Schematic of a lock-in amplifier for phase measurements. Two signals
are detected and initially filtered. The reference signal R is sent to a
phase-locked loop (PLL) to generate matched sine and cosine signals.
Those are then multiplied with the filtered measurement signal M
to produce in-phase ( I ) and quadrature (Q ) outputs. Those are then
low-pass filtered and sent to an arctangent function. The phase is
then unwrapped and a gain is applied to determine the displacement
x from the unwrapped phase.
(Courtesy of Jonathan Ellis)
its possibility to generate two signals with a nominal phase offset of 90◦ .
Similar to the Lissajous plot shown with the spatial signals of a decoder
with four sensors, two temporal signals with 90◦ phase difference enable
signal interpolation and directional information. The name quadrature
detection for this method comes from radio transmission practice and is
related to the four quadrant vectorial representation defined by the phase
difference of the signals.
The two outputs with 90◦ phase difference from the phase-locked loop, Rc
and Rs , are multiplied with the measurement signal to generate two signals,
the in-phase I and the quadrature Q signals. These signals relate in the
following way to a phase shift ϕm ( x) related to the difference in optical path
lengths between the reference and measurement beams:
1.6 6
4
Displacement [μm]
1.2
2
Error [nm]
0.8 0
>^
0.4
>
No PNL
0 With PNL >`
0 1 2 3 4 5 0 1 2 3 4 5
Fringe Fringe
Figure 8.80: Comparison between a linear signal and a signal with periodic nonlin-
earity, highly exaggerated for illustrative purposes. The second graph
shows a linear displacement with 5 nm of periodic errors with the
nominal slope removed. The periodic errors typically have a first-order
component with one cycle per fringe and a second-order component
with two cycles per fringe. Higher-orders may appear from additional
ghost reflections or stray signals.
(Courtesy of Jonathan Ellis)
Periodic errors
Cube corner
Right angle
prism
Photo detector 1
f Photo detector 2
2
f1
\J¬
Figure 8.81 shows an example of such a system with one degree of freedom.
Two separate laser beams with a fixed split frequency, originating from an
acousto-optic modulated laser source are inserted in a 50 % beam splitter,
similar as the one used in a homodyne interferometer. One half of both
beams is reflected as two separate reference beams by a right angle prism,
back towards the beam splitter and the other half is transmitted as two sep-
arate measurement beams towards the movable cube-corner retro-reflector.
The right angle prism has line symmetry which means that the rays are
reflected in the same plane orthogonal to the edge line of the prism as where
they entered. As a consequence the lower reference beam f 1 will stay at the
lower side of the interferometer and the higher reference beam f 2 will stay at
the high side. The point symmetry of the cube-corner causes both beams to
change position and the measurement beam f 1 arrives at the beam splitter
at the same location as the reference beam f 2 while the measurement beam
f 2 arrives at the same place as the reference beam f 1 .
These two beams both interfere at the beam splitter and their interference
signals will show an opposite phase shift as function of the measured dis-
8.8. Optical long-range incremental position sensors 675
Frequency stability
Interference of the reference and measurement beam takes place over the
total overlapping area of both beams. As long as both beams have a flat and
parallel wavefront the interference is identical over the total surface but any
deviation to that ideal flat wavefront will cause a phase shift and reduction
of the total observed interference at the photo detector. With the exception of
the purposely applied tilt in the wavefront of the homodyne interferometer
to create moving fringes, normally all wavefronts in a laser-interferometer
measurement system need to be parallel and without any exception always
the wavefronts need to be flat. This poses extreme requirements to both
the spatial coherence of the laser source, the flatness of the optical surfaces
and perfection in the retro-reflectors. In principle optical flats are very
difficult to manufacture with an exponential cost increase as function of
676 Chapter 8. Measurement in mechatronic systems
size and maximum allowable surface topology deviation. The deviations can
be expressed using the Zernike modes as described in Section 7.5.2.1 and
have to remain at least below λ/20 to get an acceptable interference signal
but when lateral beam displacements are present, the deviations in the
surface need to remain below the required measurement accuracy of the
interferometer system.
Next to non-parallelism, a beam that is not pointing to the right direction
will induce a so called cosine error that is proportional to the cosine of the
angle of the beam with the reflecting surface of the measurement mirror.
This error is minimal when the direction is orthogonal. When initially all
interferometers are mounted such that the measurement beam is as good as
possible aligned, any change in the pointing due to temperature or to other
mechanical instabilities will have an influence on the measurement error.
The wave number σ equals the inverse of the wavelength in vacuum (1/λ).
These equations give the refractive index n T,P for dry air as a function of
the wave number σ in μm−1 , the temperature T in degrees Celsius and the
pressure P in Pascals for a wavelength in the range of 350 – 650 nm.
The refractive index for non-dry air n T,P,Pv is calculated by:
f Referene Retr
2
f f f1
1 2
From Laser
To
f1 ± f f f1 ± f
2
f1 Measurement
Polarising Beam Splier
f
2
moving object needs to be controlled. The air needs to flow faster than the
maximum velocity of the moving object in order to avoid mixing of air with
different temperatures due to warm objects like actuators.
This airflow necessitates fast blowing air showers with temperature sta-
bilisation of better then 1 mK for nanometre range errors. The additional
complication of these measures, on top of other requirements like the maxi-
mum allowable vibration forces exerted by the turbulent air, is one of the
reasons for applying encoders instead of laser interferometers in these ap-
plications.
When working in vacuum, these problems do not occur and under that condi-
tion laser interferometers are well suited for the most critical requirements,
for instance with stages for EUV lithography wafer scanners. Also in space
instrumentation distance measurements with laser interferometers is often
the best solution, especially when long distances between satellites have to
be measured.
location the detector is placed aside of the laser source. To prevent colliding
laser beams it uses two cube-corner retro-reflectors.
The light from the source enters at the upper half of the polarising beam
splitter where one of the frequencies is reflected and one is transmitted
based in their polarisation direction. One cube-corner serves as reference
mirror and shifts the beam to the other half of the polarising beam splitter.
The other cube-corner serves as measurement mirror. It prevents wavefront
errors that would otherwise be caused by angular movements and also shifts
the measurement beam to the other half of the polarising beam splitter. In
this way the return path does not collide with the entry path and a detector
can be placed next to the laser source.
It should be noted that instead of a cube-corner also a right angle prism
can be used in this configuration for the reference beam. The main benefit
of a cube-corner in this configuration is its insensitivity for rotation in any
direction when mounting the element. A right angle prism needs to be
mounted sufficiently well that the reference beam will be pointed at the
same location as the measurement return beam.
The measurement retro-reflector always needs to be a cube-corner to avoid
shifting of the beam and the creation of non-coplanar wavefronts due to
angular motions.
680 Chapter 8. Measurement in mechatronic systems
Reference Retr
f
2
f f
1 2
From Laser
f1 ± f
To photo detector
f1 ±2 f f f1 ±2 f
2
Measurement
Polarising Beam Splier plane mirror
Quarter wave plate
f1
f ! Tilt
2
f1
<"!
Figure 8.83: Basic dual-pass heterodyne interferometer with two cube-corner retro-
reflectors to shift the beams and a plane-mirror as moving element.
One λ/4 plate changes the polarisation of the measurement beam,
causing it to be reflected at the polarising beam splitter and follow a
second trajectory to the moving mirror. The lower image shows the
effect of tilting of the measurement mirror on the trajectory of the
measurement beam. Also here the polariser before the detector is not
shown.
#<
|
100
90
!
80
70
60
50
40
30
20
10
0
0
0.45
1.35
2.25
3.15
4.05
4.95
5.85
6.75
7.65
8.55
0.9
1.8
2.7
3.6
4.5
5.4
6.3
7.2
8.1
9
#<
will be reflected by the polarising beam splitter and the second cube-corner
and is directed a second time towards the moving mirror. In this second
trajectory, its polarisation will be converted via circular to the original
linear polarisation direction enabling it to pass through the polarising beam
splitter and interfere with the reference beam.
The dual-passing means that a movement of the mirror gives a double
phase shift and double Doppler effect on the measurement beam. The
related interferometer constant equals four, which is confusing with the
general wording of this interferometer as a dual-pass type. The “dual-pass”
refers to the fact that always at least the distance between the object and
interferometer has to be passed twice in both directions. Because of the
interferometer constant N = 4, the measurement signal will hence be double
sensitive to the movement and as mentioned in the previous section this
also requires a higher split frequency to be able to perform a high velocity
measurement. Next to this increased higher sensitivity for movements in
the measurement direction and insensitivity for large movement in the
orthogonal directions, also the sensitivity for tilt of the measurement mirror
682 Chapter 8. Measurement in mechatronic systems
is zero for small rotations. This can be seen by following the dashed rays in
the lower drawing of Figure 8.83. The ray, returning after the first reflection
will be tilted with double the tilt angle of the measuring mirror.
The four reflections on the polarising beam splitter and the cube-corner will
return the beam for the second trajectory with the same double tilt angle as
after the first reflection and the second reflection will add that same amount
of tilt but now in the opposite direction. The result is an unchanged optical
path length of the measurement beam.
Although the optical path length is not changed, the measurement beam
to the photo detector shows a lateral displacement of the rays, called beam
walkoff (BWO).
With the tilt angle ϑt in radians [rad] and the measurement mirror at
a distance dm from the interferometer the beam walk off is equal to the
following expression:
BWO = 4 · d m ϑt , (8.90)
Because of this beam walkoff, tilting is only allowed as long as the rays are
all kept inside the interferometer and as long as the reference beam and
the measurement beam will sufficiently overlap to induce interference. Fig-
ure 8.84 shows the effect of the beam walkoff on the interference efficiency.
The part of the beams that do not overlap will give a DC irradiance signal
to the sensor and with increased beam walkoff the AC interference part will
become relatively smaller.
By increasing the diameter of the beams a larger tolerance on tilt is allowed,
forcing the application of large optical parts. Because of the optical require-
ments on flatness and wavefront errors these parts are however extremely
expensive.
x1-interferometer
To signal processing x - y - Rz
stage
x2 -interferometer
\J¬
y-interferometer
To signal processing
Receiver
z-interferometer y-interferometer
Beam bender
Plane mirror
50 %
Plane mirror 33 %
Receiver Beam bender
A
Receiver
B Wavelength
tracker
is no longer possible.
With this configuration, the metrology frame is not a separate frame but
is consists of the base and the complete gantry. The relative position mea-
surement from the moving part A to the solid table B is done in a very
indirect way, described by means of the metrology loop. The metrology loop
is defined by the shortest path that carries information about the relative
position of two or more measurement locations and consists of a series of
solid objects measurably connected by position measurement sensors or a
calibrated sliding mechanism. In the example from the figure, the metrology
loop runs from Position B via the table to the x-interferometer that connects
its position information to the gantry. The metrology loop runs further
through the gantry to the z-interferometer that connects the position of the
gantry to the moving part A. All the parts in the loop contribute to the total
measurement and as long as all properties of all parts in the entire metrol-
ogy loop are exactly known, no errors will happen. Unfortunately reality is
far from ideal and for instance imperfections in the sliding mechanism and
thermal effects on the solid material influence the measurement, often in
an unknown and undesired way.
Based on this general statement it was concluded that either the displace-
ment measuring system should be in line with the object of which the
displacement is to be measured or the angular motions should be measured
separately in order to compensate the effects. An example of such a solution
is shown in Figure 8.87. In this case the position of a point A on top of the
measurement mirror must be measured but it is not possible to direct the
laser beams coaxial with this position as the mirror needs to be moved also
in the z direction. In that case additional measurement beams can provide
information on the rotation angle around the x-axis.
It is extremely important to be aware that any error in this angular mea-
surement will be converted into a proportional error in the z measurement
686 Chapter 8. Measurement in mechatronic systems
Measurement
Input beam beams, three axes
z
~!
Quarter y ya
wave plates A
x
e 3
1 2
Precision positioning in
wafer scanners
In Chapter 1 the invention of the first wafer stepper, the Silicon Repeater
of Philips Electronics was memorised. This machine was designed for the
purpose of realising the complicated structural lay-out of an integrated
circuit and its successors at ASML1 became an important factor in the
worldwide proliferation of electronics, because of its capability to expose an
image of a reticle on a wafer with very small details.
9.1 Introduction
The smallest details in an integrated circuit are called features and their
minimum size is mainly determined by the resolution of the optical exposure
system as indicated by the Critical Dimension (CD). In Chapter 7 it was
explained that the resolution of an optical system is determined by the
wavelength λ of the light, the Numerical Aperture (NA) of the lens and a
k 1 -factor that mainly is determined by the illumination system. With these
variables the critical dimension was shown to be equal to:
λ
CD = k 1 (9.1)
NA
This resolution is a theoretical value that is influenced by the surrounding
pattern. The wave character of the light creates an airy-disk field profile
1 Several of the figures in this chapter and the background information is gracefully provided
by ASML, market leader in lithographic exposure systems for the semiconductor industry.
687
688 Chapter 9. Precision positioning in wafer scanners
of the diffraction limited feature and the “ringing” around the peak will
add vectorial to the airy-disk field profile of neighboring features. As a
result the CD of isolated features is different from the CD of densely packed
features. Several definitions for the CD are used for this reason of which
one is chosen to be used further in this chapter, the half-pitch CD. Other
than with isolated features, the half-pitch CD refers to a regular grating of
lines and spaces of which the width of a line equals half the period (pitch).
Based on the basic formula for the CD, research and development on these
machines has always focused on the reduction of the wavelength λ and the
k-factor and on maximising the NA. The need for a short wavelength has
resulted in a continuous development of light sources with ever smaller
wavelengths, starting with Mercury arc discharge lamps with 436, 405 and
365 nm wavelength, followed by excimer lasers that produce light with a
wavelength of 248 and 193 nm. The next step will be an extreme ultraviolet
(EUV) light source with 13 nm wavelength. With EUV light only catoptric
optics with mirrors can be used, because no transparent material exists
for this short wavelength. The improvements to the k-factor have resulted
in the development of several optical techniques to direct the illumination
in such a way that as many as possible diffractive orders of the image are
captured in the aperture of the lens as was explained in Section 7.4.4.
It was the required high numerical aperture that originally forced the indus-
try to cover the wafer in parts and not as a whole. Exposing a large surface
with a high numerical aperture would only be feasible with extremely expen-
sive lenses with a very large diameter. For that reason it was assumed more
affordable to use a smaller lens and expose the wafer in steps. Each separate
exposed area is called a die and one die can contain a multiple of integrated
circuits. In practice the size of a die is determined by the projection lens and
with a wafer stepper its size was approximately 18 × 18 mm square. Still for
practical reasons the numerical aperture in air is maximised to a value of
about 0.93 and because of that limitation, most modern exposure systems
use water as intermediate medium with a refractive index of ≈ 1.44 at 193
nm. This has increased the practical value of the numerical aperture to
≈ 1.35.
Wafer steppers have remained the main method for the most critical layers
with the finest details until circa 1998 when the wafer scanner entered the
arena.
The principle of the wafer scanner was introduced much earlier by the
American company Perkin Elmer that applied a scanning principle in an
exposure system with a magnification of one from reticle to wafer. Although
it required a reticle with the same size as the wafer it was a good solution
9.1. Introduction 689
for the contamination and damage problems of the original exposure princi-
ple at that time that used a reticle in direct contact with the wafer. Like
with a document scanner, a light stripe exposes the reticle in a scanning
motion simultaneous with the wafer. Only in this case both wafer and reticle
are moving relative to the stationary light stripe. The wafer scanners of
ASML were based on that same scanning principle while they applied the
demagnifying lens of the wafer steppers to achieve a better resolution.
Figure 9.1 shows in a very schematic way how the exposure of a silicon wafer
by a wafer scanner takes place. The illuminator defines a light stripe, called
the slit with a certain width that is stationary with respect to the projection
lens. Instead of using a stationary reticle, in a wafer scanner the reticle
is placed on a reticle stage that performs a scanning motion, where the
illumination system illuminates only one part of the reticle simultaneously.
The projection lens images the reticle on the silicon wafer that also is placed
on a scanning positioning system, the wafer stage that moves synchronously
with the reticle stage. The wafer is firmly clamped to a wafer table on the
wafer stage by means of vacuum. The wafer table is also often called the
wafer chuck or just “the chuck” because it clamps the wafer tightly to the
moving stage. Smaller details can be defined by a relatively large reticle
when compared with the scanners of Perkin Elmer, because of the four times
demagnification of the projection lens and the exposure of only one die simul-
taneously. The disadvantage of this demagnification is that the reticle has
to move four times as fast with corresponding high acceleration levels and
reaction forces. In spite of these dynamic drawbacks this demagnification is
necessary because of the complexity to create reticles with sufficient quality.
The four times larger details enable the electron beam pattern generators
to realise the pattern including the assist features2 , small details that are
added to the IC pattern to tweak the imaging and enhance the resolution
around corners and in area’s where a neighbouring detail is very close by.
The scanning exposure of the reticle has several advantages over the expo-
sure of a stationary reticle as is done in a wafer stepper.
The first advantage of scanning versus stepping is the averaging of small
image position errors. Every point on the reticle is exposed over the full
2 It is beyond the scope of this book to enter deeper in the physics behind these assist
features that work on the wave character of the exposed light. The theory on these resolution
enhancement methods is a real field of experts working with highly advanced modelling
software.
690 Chapter 9. Precision positioning in wafer scanners
Scan
~
} Stepper
Scanner
?
height 32
~6
Slit
^`
Lens error
averaging
Slit movement
Die
Figure 9.1: Basic principle of a wafer scanner exposure system. A small “slit” of
light exposes a pattern on a transparent reticle and the pattern is
imaged on the wafer by a four times demagnifying projection lens. A
simultaneous movement of reticle and wafer results in a full image
of the pattern on the wafer. To cover the entire wafer, the scanning
motion is repeated with an intermediate stepping motion to the next
die position. The size of the die is increased by utilising the full width
of the circular image field of the lens and by maximising the scanning
stroke.
width of the exposure slit by the scanning motion and small image position
errors by the projection lens, present in one part of the slit, will compensate
an opposite error in on another part of the slit. If the errors are random
they average out.
The second advantage is related to the maximum size of the image field.
A lens made of round lens elements will have a circular image field where
all parts can be exposed with the same numerical aperture. Exposing a
wafer in circular regions is not very practical because in that case parts of
the wafer will not be exposed. Although hexagonal dies could fill the entire
9.1. Introduction 691
surface this method is highly unpractical when such a die has to be filled
with rectangular integrated circuits. For these reasons the die always has
a rectangular shape. With a wafer stepper the rectangle has to fit in the
circular field and a square die will be the maximum surface that fits.
A wafer scanner that uses a slit with a limited height can utilise a larger part
of the width of the image field to approximately 26 mm, without increasing
the size and cost of the lens. It even allows optimising the optical image
quality in the slit area. Further, in theory, the size of the image in the
scanning direction could be made infinitely long but the required large
reticle would become extremely expensive. For that reason the die size in
the scanning direction is limited to approximately 32 mm.
Wire
Wire
Via
Via
Metal 1
Metal 1 Pitch
Figure 9.2: Cross section of a modern integrated circuit illustrating the multitude
of interconnecting layers that have to be positioned with strict
requirements on overlay.
(courtesy of Semiconductor Industry Association. The International
Technology Roadmap for Semiconductors, 2009 Edition. International
SEMATECH:Austin, TX, 2009.)
The right image is a CMOS logic microprocessor IC of IBM.
(courtesy of IBM Research)
30
NAND
20 Flash
Logic
2
100 80 70 60 50 40 30 20
Figure 9.3: The requirements on overlay follow a trend that outpaces Moore’s law.
Soon all layers in an integrated circuit need to be positioned well within
a few nanometres.
Still for practical reasons the throughput is most frequently used during
the design of the machine.
In 2010 a throughput of more than 180 wafers per hour was the state of the
art. This value corresponds to 20 seconds per wafer, including loading and
unloading. The related extreme velocities and accelerations of the stages
and other robotic motion systems cause strong reaction forces with vibration
levels that easily could impair the critical dimensions by fading, lack of
contrast by vibrations during exposure.
All these factors result in a large set of requirements for the wafer scan-
ner. The list in Table 9.1 only gives an overview of a selection of the most
important requirements that directly determine the precision positioning-
systems, the wafer stage and the reticle stage. The values are approximated
as they are only meant to give an idea of the order of magnitude. Note
that the wafer stage position error is divided in a low-frequency part, the
moving average (MA), causing overlay errors and the high-frequency part,
the moving standard deviation (MSD) that causes image fading.
694 Chapter 9. Precision positioning in wafer scanners
Requirement Approximate
value
3000 N
Recle stage Balance
Reacon mass
forces
?
Metrology frame
Base
frame Balance
Air mounts Wafer stage mass
Granite stone
Machine pedestal
Figure 9.4: The dynamic architecture of a wafer scanner is based on two principles.
The first is to create a vibration free environment for the projection
lens and metrology frame and the second is to reduce the vibrations to
this “sacred reference” by balance masses and a well-tuned vibration
isolation system. In reality the balance mass of the wafer stage includes
the granite stone that supports the wafer stage.
The horizontal directions need to move with very high accelerations while
the vertical direction with the rotations have less stringent requirements
on acceleration. These high horizontal acceleration levels of the heavy
stages create reaction forces in the order of several kilo Newton. In the first
generations of wafer scanners, these reaction forces were directed towards
a heavy force frame that was mounted on the base frame separately from
the other more sensitive parts of the machine.
Although in that way it was expected that the mass of the force frame would
absorb these forces, the coupling of the resulting movements to the base
frame had caused many problems in practice. The remaining vibrations
sometimes excited the dynamics of the fab floor and could disturb any other
equipment that was located nearby and they still reached the sensitive
parts inside the machine by transmission via the base frame through the
air mounts.
For that reason, it was decided in more recent generations of wafer scanners
9.2. Dynamic architecture 697
that the reaction forces of the high acceleration x and y movements should
be absorbed by very heavy balance masses that are not rigidly connected
to the base frame but horizontally guided on air bearings or compliant
mechanisms.
The balance masses consist of a seismic mass that is directly connected to
the stator of the linear actuator that drives the stage. The actuation force
from the actuator drives both the stage and the balance mass in opposite
directions and causes the relative movement between the mover and the
stator of the actuator to become larger than would be the case when the
stator was connected to the force frame.
The first consequence of this increased relative movement is an increase of
the induced EMF over the motor coils, requiring a higher maximum voltage
of the power amplifiers. The second consequence is the need for a longer
movement range of the linear actuator with the associated cost of larger
coils or magnet assembly.
By increasing the mass of the balance mass its maximum displacement
decreases proportionally and for that reason the mass is chosen much larger
than the mass of the stage. With a stage of 80 kg the balance mass can
have a mass of 500 kg or more. A large mass is also large in volume and
measures should be taken to guarantee that the balance mass will not touch
any other part of the machine during its movements. To guarantee the
consistent free movement of the balance mass measures are taken to keep
its average position in the middle of the moving range, compensating the
effects of gravity and other non alternating forces working on the balance
mass. These measures either consist of a separate actuator with a low
stiffness kp feedback control or of a passive spring with a sufficiently low
compliance to prevent excess transmission of vibrations to the base frame.
The method to absorbing reaction forces in a balance mass requires the forces
to act as good as possible in a horizontal plane through the centre of mass.
Any deviation from this ideal situation will result in a proportional torque
around the horizontal axes through the centre of mass with corresponding
vibration forces in the vertical direction to the base frame.
A well designed structure with balance masses has helped keeping the trans-
mission of internal vibrations to such a moderate level that the vibration
isolation system is able to further reduce these vibrations to the level that
is required to achieve the maximum imaging quality of the wafer scanner.
698 Chapter 9. Precision positioning in wafer scanners
?
with internal
<!
Metrology frame
Lens
Air mount Air mount
Granite stone
Base frame
<
Pedestal
"!!
Figure 9.5: The frames and springs that determine the sensitivity for external
vibrations. The transmission of vibrations from the fab floor takes
place via three mass-spring systems in series. The stiffness of the air
mounts and the mass of the metrology frame and lens determine the
low-frequency limitation of the main vibration isolation element. The
stiffness of the connection between the lens and the metrology frame
determines a resonating eigenmode that will increase the sensitivity
for vibrations at the corresponding eigenfrequency.
Figure 9.5 shows a rigid body diagram of the parts of the wafer scanner that
determine the sensitivity for external and internal vibrations.
The main line of defence against disturbing vibrations is determined by the
transmissibility transfer function of the three air mounts with the combined
mass of the metrology frame and the lens. In principle the air mounts consist
of air cylinders with a large volume of which the pistons are guided by means
of air bearings. These bearings have a sufficiently small air gap that it is not
necessary to apply the usual rubber bellows that would otherwise introduce
an increased transmissibility at higher frequencies by the intrinsic damping
of rubber. The stiffness value of the three air mounts together can be in
the order of 105 N/m. With a combined mass of the metrology frame and
the projection lens of approximately 2500 kg, the natural frequency with
these air mounts is in the order of 1 Hz. The relatively low stiffness causes
a practical problem in relation to changes in the mass of the isolated body.
The total static sag of the spring by the mass of 2500 kg would be equal
to a quarter of a metre and a small variation of this mass could lead to an
unacceptable change in the position of the projection lens. For that reason
9.2. Dynamic architecture 699
@! .
'
xm
Velocity
'
'
to Force
with mass m with mass m xm
control loop
c <
k ! F k
xf
Floor Floor
?
'
@'®®¯¯
Figure 9.6: Active “skyhook” damping in a vibration isolation system avoids the
transmission of external vibrations through the damper. An absolute
velocity detector measures the velocity of the sensitive body relative to
an inert seismic mass and its output is used to exert a force opposite to
the velocity.
^
10
Low passive damping
1
Transmissibility [abs]
10 High passive damping
@'
0
10
>
10
>^
10
>Z
10
>
10
> 0 1 ^ Z
10 10 10 10 10
Frequency [Hz]
Figure 9.7: The transfer function of the active skyhook damping shows an increased
attenuation of the vibrations in a frequency band starting at the eigen-
frequency of the mass-spring system.
d2 xm d
m 2
= − c s ( xm ) + k( xf − xm ) (9.2)
dt dt
where c s equals the control gain of the velocity loop.
With the Laplace transform of the differentiation, the transfer function of xf
to xm becomes:
xm k 1
= = m cs (9.3)
xf ms2 + cs + k 2
s + +1
k k
With the known terms for the eigenfrequency ω0 and damping ratio ζ the
equation becomes:
xm 1
= 2 (9.4)
xf s s
+ 2ζ +1
ω20 ω0
This transfer function does not show the differentiating term in the numera-
tor that represented the increased transmissibility in the transfer function
of a passive damper and as a consequence the Bode-plot of Figure 9.7 shows
9.2. Dynamic architecture 701
A large mechanical structure, like the metrology frame with the lens, in-
evitably shows several dynamic eigenmodes with their related eigenfre-
quencies. Figure 9.5 shows springs between the lens and the metrology
frame that represent the stiffness of the mutual connection. Even though
this connection is very stiff, it still results in a significant resonance with
the heavy lens. Also the optical elements inside the lens with their com-
pliant mechanical mounting determine eigenmodes that might be excited
by external vibrations. The related eigenfrequencies are often in a rather
low-frequency range around 50 – 100 Hz because of the large masses and
for that reason the vibration isolation system should work especially well in
that range.
The low amount of damping that generally is present in mechanical mounts
normally creates resonances with high Q levels that increasing the vibra-
702 Chapter 9. Precision positioning in wafer scanners
Compliant parallel
guiding mechanism
Figure 9.8: An piezoelectric active lens mount with “smart-disks” that consist of a
combined piezoelectric actuator and sensor. By measuring the defor-
mation a damping force can be generated in an active feedback loop.
(Courtesy of Jan Holterman UT)
tions with a factor thirty or more. Even with a well controlled vibration
isolation system it is necessary to create additional damping to reduce the
amplitude at these eigenfrequencies. Damping in a mechanical mounting
structure is difficult to achieve with passive means because of the high
stiffness of the connection. With the related small movements it is almost
impossible to dissipate much energy in viscosity.
Smart disk
The force by the movement of the lens will generate a proportional velocity
of the actuator which is the behaviour of a damper with damping coefficient
c = 1/k i in series with the controlled smart-disk.
where ka and m a are equal to the stiffness and mass of the anti-resonator.
The anti-resonance frequency f a is chosen equal to the undamped eigen-
frequency of the lens with its mounting stiffness. The combined transfer
function of the anti-resonator and the regular transmissibility for vibration
forces from the floor to the movement of the lens via the air mounts is shown
in Figure 9.9. The Bode-plot has been derived using the standard equations
of motion from Chapter 3 and it shows that the original resonance at 100 Hz
of this example is replaced by two resonances with a smaller magnitude
depending on the damping of the tuned mass.
The observed shifting of the resonance frequencies is especially useful when
the undamped system resonates in a frequency area where external distur-
bance peaks occur like for instance the 100 Hz hum from high power mains
supply units.
704 Chapter 9. Precision positioning in wafer scanners
c1 c^
F m1 m^
k1 x k^
>^
10
>Z
Compliance [m/N]
10
>
10
In those cases where the resonance frequency is not fixed it is also possible
to create an actively controlled anti-resonator. By changing the stiffness
with proportional feedback, the tuned-mass damper system can be adapted
to the right resonance frequency.
9.3. Zero stiffness stage actuation 705
From all possible actuator types, only the mentioned Lorentz actuator almost
ideally complies with the zero-stiffness criterion. Piezoelectric actuators are
stiff by definition and reluctance based actuators have a negative stiffness
that induces a comparable, however inverted, transmission of vibrations as
a positive stiffness.
Unfortunately a simple Lorentz actuator offers only a small linear range.
In order to solve this limitation, electronic commutation was introduced in
Section 5.2.5 of Chapter 5. The shown example of Figure 5.23 had a long
set of three moving coils with a large magnet assembly and in spite of the
electronic commutation the range was still rather limited because the coil
set and magnet assembly both needed to have the same length in the motion
direction.
With the design of the first wafer stepper an alternative two phase elec-
tronic commutated Lorentz actuator was designed by one of the authors of
this book at Philips Research laboratories. This extended-range Lorentz
actuator was able to work without the above mentioned limitation and was
applied in the wafer stages of all wafer steppers until the introduction of
the wafer scanner that required a better solution.
Figure 9.10 shows the thinking steps that have led to this design. It starts
with a standard loudspeaker type Lorentz actuator with the permanent
magnet system as moving part.
The first transformation step (1) is to divide the moving ferromagnetic yoke
in a moving outer part that slides over the cylindrical inner part that holds
the coil. This step reduces the moving mass and is allowed as still the
permanent magnet field is moving in respect to the coil, creating a dΦ/d x.
The sliding could be an air bearing but in the next steps the slider will be
avoided.
Step (2) is the addition of an equal permanent magnet-coil set around the
common central yoke. This step increases the force at a certain current level
and becomes even more useful when the direction of both the permanent
magnetic flux and the current in the coil of one of the actuators is reversed
in step (3).
The sliders are deleted in the step (4) because after step three the flux of
both permanent magnets can be shared. This step is essential as now there
is a symmetrical set of two permanent magnets connected by one simple
9.3. Zero stiffness stage actuation 707
Slider
1: Coil connected with a sliding centre yoke. 2: Take two to create symmetry.
1 2 1 2 1 2 1 1 2 1 2 1 2 1
1 2 1 2 1 2 1 1 2 1 2 1 2 1
1 2 1 2 1 2 1 1 2 1 2 1 2 1
1+ 2+ 1- 2- 1+ 2+ 1- 1 2 1 2 1 2 1
Current directed
away from reader
Current directed
towards reader
Figure 9.10: Design of a zero-stiffness Lorentz type actuator with long linear-motion
range by electronic commutation. The direction of the current in the
coils corresponds with a force on the moving part to the right. The
curved arrows indicate the direction of the permanent magnetic flux.
708 Chapter 9. Precision positioning in wafer scanners
Rz
y
Figure 9.11: The usual way of creating a three-axis positioning system is done by
stacking three single-axis stages on top of each other. The driving
forces are not directed to the centre of mass, resulting in unwanted
torques and rotations. For that reason this configuration is not suitable
for fast stages with a high precision.
moving yoke that moves above two coils that are enlarged in the step (5) to
give a larger range.
Unfortunately the large coil will induce a reluctance force acting on the
moving part of the yoke (6), driving it in the outer positions where the reluc-
tance for the magnetic field of the coil is the least. This problem is largely
reduced by choosing shorter coils and applying electronic commutation as
shown in step (7).
The windings around the ferromagnetic yoke consist of two coils that are
interlaced with an alternating winding direction. This means that the first
section of coil one is wound clockwise, followed by a clockwise wound first
section of coil two, again followed by the counterclockwise wound second
section of coil one, and so on.
With commutation , reversing the current direction at any coil, the sections
of that coil that are wound clockwise give a counter clockwise current and
the other way around. This is shown in steps (8 – 10) for a movement to
the right. In principle the stator with the coils and the inner ferromagnetic
yoke can be made infinitely long by adding more sections to each coil and in
step eleven also the moving part is shown to be multiplied to increase the
force.
The main problem of this configuration is the efficiency, as the current has
to run through the entire coil, including those coil sections that are outside
the permanent magnetic field. This disadvantage could be solved by feeding
each coil section with a separate power amplifier but that was not done in
the real system in order to avoid disturbing forces due to switching the coils
on and off. It also would require a lot of separate wires while otherwise one
actuator only needs four wires with two amplifiers to control the current in
a smooth way without disturbing switching moments.
9.3. Zero stiffness stage actuation 709
Y1
X
Roller
bearings Y2
Figure 9.12: Original 3-D sketch and photographic image of the first wafer stage
with electric linear actuators for a wafer stepper in a H-configuration.
The magnetic mover is supported by roller bearings that run on rails
alongside of the coil sections while an air-bearing is used to support
the wafer chuck and mirror block, floating over a flat granite stone.
y1 Wafer y2
chuck
x
Long-stroke actuators
For this reason a different configuration was chosen, named after the letter
“H” of the alphabet with one linear actuator for the linear movement x and
two linear actuators y1 and y2 for the linear movement in the y direction,
allowing a limited rotation Rz by the difference of y1 and y2 . Figure 9.12
shows this configuration where the bearings consisted both of roller bearings
to support the permanent magnetic movers on the coil stator and the air foot,
an air bearing floating on a flat granite stone surface to support the wafer
stage itself. With later versions of the wafer stepper many improvements
were added to this basic system with for instance three separate actuators
to position the wafer in the three remaining directions, z − Rx − Ry that were
needed in order to guarantee the vertical position and flatness of the wafer
relative to the focal plane of the lens.
R S T R S T
Figure 9.14: The wafer stage of a wafer scanner. The three-phase coil section with
ferromagnetic yoke of the powerful long-stroke actuator runs over a
permanent magnet stator. The mirror block is used to measure the
position of the wafer stage with a laser interferometer.
herent drawbacks of the linear actuator that was used in the wafer stepper,
like the remaining parasitic forces by the commutation and the friction
variations of the applied roller bearings.
The solution was to use a non-commutated short-stroke Lorentz actuator
in cooperation with a second actuator, the long-stroke actuator, as shown
schematically in Figure 9.13.
This long-stroke actuator is connected to the stationary part of the short-
stroke actuator and only needs to position this part in such a way that
the permanent magnet system of the short-stroke actuator remains in the
±0.1 mm range around the centre of its range with the least stiffness value.
This configuration allows the long-stroke actuator to operate at a reduced
precision of only this same level of ±0.1 mm, even though the precision of
the short-stroke actuator needs to be on sub-nanometre levels.
The movements and vibrations of the long-stroke actuator are hardly trans-
ferred to the sensitive wafer position because of the low stiffness of the
short-stroke actuator. For that reason it is allowed to use strong electroni-
cally commutated three phase actuators with ferromagnetic yokes in the
long-stroke actuation system as shown in Figure 9.14.
This actuator is optimised to force rather than to precision positioning and
zero-stiffness with a ferromagnetic yoke around the coils. This yoke will
cause cogging forces towards those positions where the reluctance for the
permanent magnet field is smallest. Cogging forces are perceived as bumps
when moving the mover by hand over the total stroke.
712 Chapter 9. Precision positioning in wafer scanners
*'
*
z,Rx ,Ry-axis
short-stroke
actuators
x-axis long-stroke actuator
x,y,Rz-axis
short-stroke
z
actuators x
Rz Rx
Ry
Top-view wafer stage with
y
*
Figure 9.15: The short-stroke actuators are integrated within the mirror block.
The actuation in the horizontal plane is done by flat Lorentz actua-
tors while the out-of-plane movements are provided by loudspeaker
type Lorentz actuators without a ferromagnetic yoke to prevent non-
linearity by reluctance forces.
The advantage of the ferromagnetic yoke around the coils is the low reluc-
tance path for the magnetic field that increases the force to current ratio of
the actuator. This actuator can easily transport the required cables, wires
and water cooling tubes for both the short-stroke coils and its own coil sec-
tion.
Fortunately only the in plane motions are large in a wafer scanner, so only
those motions require a corresponding long-stroke actuator. The out-of-
plane motions, needed to keep the surface of the wafer in the focal plane
of the lens are realised by separate short-stroke actuators only, as shown
in Figure 9.15. These loudspeaker type Lorentz actuators are designed
without a ferromagnetic yoke in order to prevent reluctance forces as was
explained in Section 5.3 of Chapter 5.
9.3. Zero stiffness stage actuation 713
N S N S N S N
N S N S N S
S N S N
F S N S
S N S
B N S N
N F S F N S N I S N
S N S I N S N S
S N S N S N
N S N S N S N
Figure 9.16: The forces acting on a coil, positioned in the magnetic field above an
array of alternating permanent magnets. Depending on the position
in the field, the forces can be in-plane, perpendicular or a combination
of both.
@
'«
2
600
500
400
300
200
100
0
3
@ 2.5 800
! 2
600
« 1.5
«
@ 1 400
"
!
0.5 200
@!
Figure 9.17: All electric actuators have a limitation in the amount of force per unit
of moving mass of the actuator itself, the specific peak force. With a
given mass that has to be actuated, an increase of the acceleration
would require an increase of the mass of the actuator. The relative
added acceleration flattens out at a high ratio between the mass of the
actuator and the actuated mass with an asymptote at the maximum
specific peak force.
body.
A direct consequence of this relation is that the total moving mass of the
stage becomes noticeable larger than only the mass of the actuated body
that carries the wafer. For the required maximum acceleration of 120 m/s2
as mentioned in Table 9.1, the mass of the actuator is still below 50 %
of the mass of the actuated body and this can as yet be provided without
much problems, but when more acceleration is needed in the future, the
total mass will rapidly increase. For this reason, present developments on
the stages focus on the reduction of the moving mass by the application of
hollow structures with thin plates and reinforced plastics. Unfortunately
this mass reduction does not only have benefits as the sensitivity for external
disturbing forces by transmission is increased. The counterbalancing effects
of the mass with on the one hand the need to decrease the mass in order
to increase the maximum acceleration and on the other hand the need to
increase the mass in order to reduce the sensitivity for external disturbing
forces is called the mass dilemma of precision positioning systems. In other
words: ”A reduced mass requires improved system dynamics that enable
a higher control bandwidth to compensate for the increased sensitivity for
external vibrations“.
716 Chapter 9. Precision positioning in wafer scanners
• The alignment system that measures the position of the wafer relative
to the wafer stage with sub-nanometre accuracy.
• The level sensors that measures the surface profile of the wafer relative
to the wafer stage with nanometre accuracy.
2. A robot brings the wafer from the pre-alignment chuck to the wafer
stage. The position of the vacuum gripper of the robot is alternately
referenced to the pre-alignment chuck and the waferstage by a kine-
matically determined mechanical connection. The total resulting po-
sitioning error of the wafer on the waferstage is approximately ten
micrometres.
layer. The number of these reference marks can range from two till
the total number of dies. Similar alignment marks, called fiducials,
are also stably attached to the wafer stage, close to the wafer. The
alignment system measures the position of the alignment marks on
the wafer relative to the position of the fiducials.
4. The waferstage with the aligned wafer moves to the projection lens
under real-time measurement relative to the metrology frame. The
position and focal plane of the image is measured relative to the wafer
stage with a special image sensor. It is assumed that this information
remains stable during one or several exposure cycles.
The principle of alignment with the alignment marks as reference for the
pattern on the wafer is shown in Figure 9.18. The alignment marks are
phase gratings and the measurement is comparable with the interferometric
encoder of Section 8.8.1.3.
The grating is illuminated with a laser and the phase steps with a height of
0.25λ create diffraction orders mainly in the two first orders. Both orders are
recombined at the photo detector after interfering with a second reference
grating. The interference irradiance depends on the phase relationship of
these orders. As was explained with the interferometric encoder, the phase
shift by the movement in the +1st order is opposite and equal in magnitude
to the phase shift of the −1st order, resulting in a doubling of the combined
differential phase shift, thereby increasing the measurement resolution
with a factor two.
The alignment is done by a scanning movement of the wafer stage through
the range where optimal alignment should take place and the zero crossings
of the irradiance signal are measured. These zero crossings repeat with
every half grating period due to the doubling effect of the two diffraction
orders which limits the range of this alignment system to half a period of
9.4. Position measurement 719
16 Pm Alignment
Wafer laser beam
O/4
-1 +1
O/4 Wafer
Markers
Fiducial on waferstage
Figure 9.18: The alignment marks are reflective phase gratings with different
periodicity, to increase the capture range. One way to measure the
markers is shown at the right, where both first orders are imaged on a
reference transparent phase grating and their disturbance is detected
by a sensor. This configuration increases the position sensitivity with
a factor two. Note the alignment mark in the ASML logo.
250
Figure 9.19: The required depth of focus (DOF) as function of the critical dimen-
sion. After the introduction of immersion, the requirements were less
stringent.
The high numerical aperture that is necessary for a better resolution au-
tomatically implies a decreased depth of focus (DOF) as was explained in
Chapter 7. The three dimensional structures on the surface of a wafer, that
are created during the many manufacturing process steps, have made it
increasingly difficult to guarantee a sufficiently flat surface to accommodate
this continuously decreasing depth of focus of the projection lens. Figure 9.19
shows the relation between the depth of focus and the critical dimension. A
focal error of less than 100 nm over a die of 25 mm is a challenging require-
ment. To achieve such extreme specifications, this problem is approached
from two sides. The first step is to guarantee a certain flatness of the wafer.
As previously explained the deposited metal layers for the wiring are often
treated by a CMP-process to keep the surface non-planarity locally within
acceptable levels to less than 50 nm over a die.
The second step to achieve a sufficiently small focus error is to actively
position the wafer within the focal area at the region of the exposed die.
This process is also called the levelling of the wafer and with the first gen-
erations of the wafer scanner the levelling was done by means of real-time
9.4. Position measurement 721
measurement of the wafer surface ahead of the lens during scanning. This
measurement was done by means of an optical triangulation method like ex-
plained in Section 8.6.1.2 of the previous chapter on measurement. A servo
tracking feedback control system kept the surface approximately parallel
to the image in the focal plane. Unfortunately feedback control can never
prevent some remaining focal error due to the positioning system dynamics
and the limited control gain. A better solution was found to determine the
height profile of the wafer prior to exposure and use that information in
feedforward control at exposure.
With a single wafer stage a separate measurement of the surface topology of
the wafer would require a decrease in the productivity due to the additional
time for the measurement. The dual wafer stage with parallel measurement
and expose cycles was designed to solve that problem.
722 Chapter 9. Precision positioning in wafer scanners
Map
Align wafer
wafer
Load Stage
Stage
wafer Measurement Exposure
cycle Swap
Swap cycle
Expose
Unload Wafer
Wafer
Align
Recle
Figure 9.20: The process steps in a dual-stage wafer scanner between loading and
unloading consist of two cycles. In the measurement cycle the wafer is
aligned to the wafer stage and the surface is mapped for focus. In the
exposure cycle the wafer stage is aligned to the reticle and the wafer
is exposed.
Productivity has been the main driver for the decision to design a wafer
scanner with parallel processing of the measurement and the exposure cycle.
Next to the possibility to precisely measure all dimensional details of a
specific wafer without sacrificing throughput, the increased utilisation of
the expensive lens is another important reason for this duplication of the
wafer stage. With a dual wafer stage the lens can expose one wafer during
the time that the other wafer is measured and as a consequence the lens is
continuously utilised. Figure 9.20 shows the process steps of a dual-stage
wafer stage. The measurement cycle of one wafer is executed synchronous
with the exposure cycle of another wafer. As a result of the measurement,
the exposure starts with a wafer that is fully known in respect to the position
of the alignment markers to the wafer stage and the height profile of the
surface. This enables a maximum use of feedforward control.
9.4. Position measurement 723
z1-axis
Rz Ry y
x-Rz-axis reference mirror
Rx
x
z2-axis
reference mirror
x-axis
z-axis
o
45 mirror
y-Rz-Rx-axis
Figure 9.21: A mirror block for 6-axis position measurement by means of laser
interferometry. The z-axis is realised with a mirror under 45◦ that
reflects the light upwards to a reference mirror. All axes are dual
path plane mirror interferometers and Ry is determined from two z
interferometers.
The long-range measurement systems in the wafer- and reticle stage have to
be able to track the position with the specified accuracy and velocity levels
without adding latency which would reduce the accuracy of the synchroni-
sation between the wafer stage and reticle stage.
Until the end of the last century only laser interferometer measurement
systems were capable of measuring large distances with nanometre level
resolution and a latency timing that neither limited the bandwidth of the
feedback control nor interfered with the synchronisation of the stages. Sam-
ple rates far above 20 kHz are presently state of the art and with suitable
synchronised phase detection electronics a large number of measurement
axes could be realised that all worked in parallel.
Figure 9.21 shows the mirror block of a wafer stage, showing the laser beams
that are used to realise a six-axis real-time measurement system.
A special configuration is used to derive the z-axis measurement signal with
a 45◦ mirror to direct the light upwards from the wafer stage. The light
is reflected back by a stationary reference mirror that is connected to the
metrology frame and returns via the 45◦ mirror to the interferometer.
724 Chapter 9. Precision positioning in wafer scanners
With this configuration, the total optical path length of this beam is de-
termined both by the x and the z distance. By subtracting the directly
measured incremental x movement from another laser beam, the z position
is obtained.
The accuracy of laser interferometer measurement systems is strongly in-
fluenced by the refractive index of air according to the Edlén equation
(8.86) that was presented in Section 8.8.2.3 of the previous chapter on mea-
surement. The main problem with interferometer distance measurement
systems is that they measure over a long distance through air. At large
distances, small variations in the light velocity in air, that are induced by
the temperature and/or the pressure, can have a significant influence on
the measurement. To keep the accuracy of the measurement on accept-
able levels, air showers are used to condition the air over the full optical
measurement path. The temperature of the air is required to be controlled
within about 1 mK stability over an exposure cycle. Furthermore the air
has to flow at a higher speed than the maximum speed of the stages in
order to prevent mixing of the conditioned air from the air showers with
air coming from elsewhere at a different temperature. These requirements
have resulted in significant developments that always have just met the
required specifications. Still it is assumed that on the long run this method
will no longer be adequate and that also the high speed of the conditioned
air might excite dynamic resonances in the sensitive parts.
Fortunately another solution is found in a planar reflective-encoder system
with a phase grating that measures in all six directions with a sufficient
resolution to be applied as the long-range measurement system of a wafer
scanner. This alternative has two advantages over the laser interferometer.
First of all the thermally induced refractive index changes have far less
effect because of the short path through air of less than 15 mm. The thermal
sensitivity is reduced with a factor 100 to ΔL/L = 2 · 10−8 per degree Kelvin.
The second advantage has to do with the stability of the real-time metrology
loop as is explained with the help of Figure 9.22.
The real-time metrology loop is a subset of the full metrology loop that was
defined at the start of this section and contained also measurements that
were not performed in real-time. The real-time metrology loop assumes that
all alignment steps are done well and remain stable during the exposure.
This means that the real-time metrology loop connects the lens via the
metrology frame to the wafer on the waferstage.
9.4. Position measurement 725
? ?
100-500 mm
Figure 9.22: Two methods for long-range incremental position measurement. The
laser interferometer, shown at the left has long been the standard but
is partly replaced by the plane encoder system as shown at the right.
The difference is seen in the length and stability of the metrology loop
and a reduction of thermally induced instability by refractive index
changes of air.
Figure 9.23: The combination of the H-configuration with the laser interfero-meter
measurement system on the top and the plane encoder with the planar
actuation system at the bottom clearly illustrate the differences.
726 Chapter 9. Precision positioning in wafer scanners
Figure 9.24 shows the basic principle of the motion control of the wafer stage.
A modern control expert will miss elements like Multiple Input Multiple
Output (MIMO) controllers, observers and other more recently developed
control approaches.
It is true that the total controller itself consists only of six separate Single
Input Single Output (SISO) controllers for all directions with feedforward
and feedback paths. The feedback controller uses a well tuned classical
Proportional, Differentiating and Integrating PID-control algorithm.
The six-axis machine coordinate system for the position measurement and
the controller has its orientation at the centre of the image position. This is
not the place where the centre of mass of the wafer stage, nor the actuators
of the wafer stage are located. For that reason the six control forces Fc are
first transformed by a Gain Scheduling matrix operation into the forces Fcom
that should be exerted at the centre of mass of the positioning stages. The
gain scheduling matrix is constantly adapted to the actual position of the
stage. A second transformation is applied to determine the corresponding
forces of the actuators Fa by means of a Gain Balancing matrix operation.
The gain balancing matrix has to be well tuned to the physical hardware of
the wafer stage in order to compensate for performance deviations due to
tolerances in the different functional parts.
Feedforward
control
<
Posion Stage
setpoint +
+ Fc Fcom F
+ GS GB
-
°
Figure 9.24: The position control of the wafer stage is a SISO 6-axis PID-control
system. The Gain Scheduling matrix (GS) transforms the control
forces (Fc ) in six degrees of freedom, according to the machine co-
ordinate system into the forces (Fcom ) of the wafer stage coordinate
system around the centre of mass. The Gain Balancing matrix (GB)
transforms these forces into the actuator forces (Fa ).
728 Chapter 9. Precision positioning in wafer scanners
16 40
12 30
8 20
Accprole [m/s2]
4 10
0 0
-4 -10
-8 -20
-12 -30
-16 -40
0 0.05 0.1 0.15 0.2
Time [s]
Figure 9.25: The wafer stage error as function of time induced by a mismatch of
0.1% of the feedforward force when a well-tuned feedback is applied.
The reason for this rather simple and straightforward approach in control
is twofold. First of all this methodology allows a more direct investigation of
observed errors in the system. This shortens the time for trouble shooting
at the customer site, when the system has failed, or when parts have been
exchanged and the system should be tuned again. The second reason is
that the feedback part only plays a role in the robustness of the control to
deviations of the system parameters from the ideally modelled situation as
will be further explained in the following section.
A property of feedback control is the fact that it needs an error to act anyway,
so it is allowed to say that Feedback is always (too) late. The related
delay manifests itself in the settle time that is the time, needed to reduce
the effects of an disturbing event to the specified maximum error.
Feedback should only be applied to correct errors that could not be avoided
by feedforward control due to random disturbances or uncertainties in the
properties of different elements of the system. In the design of a wafer scan-
ner the focus is on creating a dynamic system that behaves as deterministic
as possible, mainly because of the impact of settle time on throughput.
A small example can place this requirement in a realistic perspective.
Imagine a wafer stage with the specifications as mentioned in the list of
Table 9.1, a mass of 20 kg, accelerating with 30 m/s2 to a maximum speed of
0.5 m/s. With these numbers the required force during acceleration becomes
9.5. Motion control 729
The above example did not include errors caused by external disturbances.
With all measures taken to its maximum, in practice still the last errors
have to be corrected in less than a just allowable 10 ms settle time and there
the mass dilemma returns in its full magnitude!
A reduced moving mass is preferred in order to reduce the electrical power
that is necessary for the required high levels of acceleration. A reduced
mass will also limit the reaction forces and will indirectly lead to reduced
errors by nonlinearities and dynamic disturbances in the actuators.
Unfortunately a reduced mass will proportionally increase the sensitivity
for external disturbance forces through elastic and damping elements. This
means that an increased feedback control gain would be needed to correct for
these errors with a corresponding higher bandwidth frequency. One might
think that a low mass construction will show higher eigenfrequencies that
enable to achieve this increased bandwidth. Unfortunately this is not a fully
correct assumption as a reduced mass without reduced overall dimensions
can only be realised by a combination of thin structures. Those structures
are not always sufficiently stiff in all directions to guarantee the required
high eigenfrequencies.
Research is done at several places to compensate these potentially harmful
eigenmodes by means of overactuation. Overactuation implies the use of
more actuators than the number of coordinate directions that have to be
controlled. By optimising the placement of the actuators in respect to the
mode shapes of interest, the excitation of the lower-frequency eigenmodes
can be avoided. As soon as these techniques are really applied, the simple
control configuration of Figure 9.24 is no longer possible as the actuation
axes are dynamically well-coupled giving a space matrix with many numbers
beside the diagonal.
9.6. Main design rules for precision 731
The following books are recommended for further reading, when more in
depth knowledge of the different disciplines within mechatronics is required.
The first two books are a useful addition to the theory on mechanics of
mechatronic systems, because they present mechanisms, transmissions,
bearings and several other aspects of mechanical engineering that are not
presented in this book. The first book is based on the practical experience
within the Engineering departments of Philips Electronics on the dynamics
of mechanisms. Earlier versions were written in Dutch, the original by
professor Wim van der Hoek and refined by professor Rien Koster, both from
the Eindhoven University of Technology. Recently a fully re-written version
is published in English by professor Herman Soemers from the University
of Twente:
Design principles for precision mechanisms
H.M.J.R.Soemers
Publisher: T-Point Print VoF (2010)
ISBN: 978-9036531030
The second book on mechanical design is written by Anton van Beek from
Delft University of technology with an emphasis on reliability, friction
phenomena and bearings:
Advanced Engineering design – lifetime performance and reliability
Anton van Beek
Publisher: Tribos (2009)
ISBN-13: 978-9081040617
733
734 Recommended other books
Regarding motion control, the following four books provide ample infor-
mation that directly connects with the described methodology in Chapter 4,
including the relations with the physical aspects of the controlled plant:
Feedback Systems: An Introduction for Scientists and Engineers
Karl Johan Åström, Richard M. Murray
Publisher: Princeton Univ Press (2008)
ISBN-13: 978-0691135762
Digital Control of Dynamic Systems
Gene Franklin, J. David Powell, Michael L. Workman
Publisher: Addison Wesley (1998)
ISBN-13: 978-0201820546
Computer-Controlled Systems: Theory and Design
Karl Johan Åström, Björn Wittenmark
Publisher: Prentice Hall (1996)
ISBN-13: 978-0133148992
Advanced PID Control
Karl Johan Åström, Tore Hägglund
Publisher: Isa (2005)
ISBN-13: 978-1556179426
For optics the following two books are recognised as being the prime source
of reference on this subject:
Principles of Optics:
Electromagnetic Theory of Propagation, Interference and Diffraction of light
Max Born, Emil Wolf
Cambridge University Press (1999)
ISBN-13: 978-0521639217
Optics
Eugene Hecht
Publisher: Addison Wesley (2002)
ISBN-13: 978-0805385663
Recommended other books 735
On physics in general, including light and waves, the many books from
Richard Feynman are all worthwhile reading. The following two examples
give a nice overview of his ability to teach the difficult material in clear
manner, not only in the classroom where his style of lecturing became famous,
but also by the examples and understandable structure of the material.
The first and most comprehensive work is based on the “Feynman lectures
on physics”, that were given to undergraduate students at the California
Institute of Technology. The textbook was originally written by the staff of
Caltech in 1964 based on recordings of the lectures and is now available in
a newly edited full set:
The Feynman Lectures on Physics, boxed set: The New Millenium Edi-
tion
Richard P. Feynman, Robert B. Leighton, Matthew Sands
Publisher: Basic Books (2011)
ISBN-13: 978-0465023820
The second example is a much smaller book that was written to explain the
principles of the theory on quantum-electrodynamics to people from outside
the particle-physics community. This very readable and even sometimes
amusing booklet is referred to, when explaining photons:
QED: The Strange Theory of Light and Matter
Richard P. Feynman, A. Zee (intro)
Publisher: Princeton University Press (2006)
ISBN-13: 978-0691125756
For measurement, the following book has often been used in our lectures
at the university:
Principles of Measurement Systems
John P. Bentley
Publisher: Prentice Hall (2005)
ISBN-13: 978-0130430281
736 Recommended other books
Nomenclature
a [m/s2 ] Acceleration.
A [-] Amplitude.
A [m2 ] Surface of Cross Section (SCS).
Ac [m2 ] SCS of a coil.
A c,w [m2 ] SCS of the windings in a coil.
Ag [m2 ] SCS of an air gap.
Ai [m2 ] SCS of the flux path inside a coil.
Am [m2 ] SCS of a permanent magnet.
Ao [m2 ] SCS of the flux path outside a coil.
Ay [m2 ] SCS of the flux path inside a yoke.
b [-] Wien’s constant.
B [T] Magnetic field.
B [T] Magnetic flux density (MFD).
Bg [T] MFD in the air gap.
Bg,c [T] MFD in the air gap induced by current.
Bg,m [T] MFD in the air gap induced by magnet.
Br [T] Remnant MFD.
Bs [T] Saturation MFD.
Bw [T] MFD inside a coil winding.
By [T] MFD inside a ferromagnetic yoke.
c [-] Dimensionless constant or factor.
c [m/s] Velocity of light.
c [Ns/m] Damping coefficient.
cd [V/m] Deformation constant piezo accelerometer.
2
ci [Is /m] Current sensitivity piezo accelerometer.
cs [-] Sensitivity factor in Wheatstone bridge.
2
cv [Vs /m] Voltage sensitivity piezo accelerometer.
C [F] Capacitor value.
C, C [m/N] Compliance, Transfer Function (TF).
C fb ( s) [-] TF of a feedback controller.
C pd ( s) [-] TF of a PD-type feedback controller.
C pid ( s) [-] TF of a PID-type feedback controller.
C ff ( s) [-] TF of a feedforward controller.
Cm [m/N] Mass compliance.
Cs [m/N] Spring compliance.
738 Nomenclature
Abbreviations
Index
RC -time, 336 bandwidth, 91, 93, 114, 171
“Hooke – Newton” law, 87 barrel, 486
0 dB level, 83 base frame, 695
batteries, 44
Abbe error, 685
BBN criteria, 552
absolute standards, 540
beam splitter, 533
AC, 49
beam walkoff, 682
acceleration, 95
bias, 402
achromatic doublet, 488
biasing, 402
adaptive feedforward control, 155, 208
bimorph, 630
adaptive optics, 525
birefringence, 499
aging, 303
bistable, 579
air foot, 695
bits, 580
air mounts, 695
bitstream, 594
air showers, 678, 724 black body, 452
air-gap, 235 black-box system identification, 205
amplitude grating, 506 Bode Sensitivity Integral, 181
amplitude modulation, 575 Bode-plot, 79
analogue-to-digital converters, 587 body, 95
angular frequency, 49 branches, 560
annular illumination, 521 Brewster’s angle, 499
anti-resonance, 121 bridge, 428
anti-resonator, 703 bridge-rectifier, 356
anti-windup control, 212 brightness, 461
aperiodic, 110 Butterworth, 394
aperture plane, 485
carrier, 575
aperture stop, 492
catadioptric, 473
asphericity, 482
catoptric, 473, 688
assist features, 689
cats-eye, 660
Astigmatism, 481
cavity, 504
asymptotically stable, 175
centre-frequency, 663
at equal amplitude, 76
channel, 360
attenuation-band, 335
characteristic impedance, 325, 597
autotransformer, 291
characteristic polynomial, 199
azimuthal order, 529
charge carriers, 353
balance mass, 695 charge control, 311
balanced bridge, 560 charge-pumping, 432, 434
Index 747