Principles of Multimedia 2e 2012 PDF
Principles of Multimedia 2e 2012 PDF
Principles of Multimedia 2e 2012 PDF
5 inch
Principles of
MULTIMEDIA 2e
Principles of Multimedia, 2e essentially discusses about characteristics, representation,
compression, storage (file format) and processing tools for understanding and utilization of Principles of Principles of
MULTIMEDIA
various multimedia components, like text, image, graphics, audio, video, animation. This
edition additionally focuses on programming concepts using which practical tasks and
tutorials related to media processing and presentations, can be carried out by the reader to
gain a deeper understanding of the underlying theoretical concepts.
Key Features :
Ranjan Parekh
MULTIMEDIA
Introduction to topics on image processing, audio processing, and video processing along
v
with numerous MATLAB coding examples for performing related tasks
Studies on concepts related to 2D and 3D graphics and animation including splines,
v
polynomials, transformations, projection, modeling, surface mapping, light, camera and
rendering
Overview of lossless and lossy compression techniques with emphasis on JPEG and
v
MPEG standards
Discussions on hardware and OS supports for multimedia including optical storage
v
technologies, transmission standards, real-time protocols and playback architectures
Focus on research issues in the fields of multimedia database, content-based storage and
v
retrieval, pattern recognition and computer vision
Information on text formats and standards, multimedia document architecture,
v
interchange formats, IPR, digital copyrights, digital library and multimedia archives
Exploration of Web-enabled multimedia content creation, hypermedia design and Web
v
programming languages like HTML, Dynamic HTML, CSS, JavaScript, XML, SMIL, SVG and
VRML
Guidelines and case studies for multimedia application development and authoring
v
2e
http://www.mhhe.com/parekh/multimedia2 Parekh
Visit us at: www.tatamcgrawhill.com
www.tmhshop.com
ISBN-13: 978-1-25-900650-0
ISBN-10: 1-25-900650-6
2e
Principles
of
Multimedia
Second Edition
About the Author
Ranjan Parekh is Assistant Professor at the School of Education Technology, Jadavpur University, Kolkata,
India. Dr. Parekh is involved with teaching subjects related to Multimedia Technologies at the post-graduate
level. His research interests include multimedia databases, image/audio/video processing, pattern recognition
and computer vision.
Principles
of
Multimedia
Second Edition
RANJAN PAREKH
Information contained in this work has been obtained by Tata McGraw-Hill, from sources believed to be
reliable. However, neither Tata McGraw-Hill nor its authors guarantee the accuracy or completeness of
any information published herein, and neither Tata McGraw-Hill nor its authors shall be responsible for
any errors, omissions, or damages arising out of use of this information. This work is published with the
understanding that Tata McGraw-Hill and its authors are supplying information but are not attempting
to render engineering or other professional services. If such services are required, the assistance of an
appropriate professional should be sought.
Typeset at Script Makers, 19, A1-B, DDA Market, Paschim Vihar, New Delhi 110 063 and printed at Avon Printers,
Plot No, 16, Main Loni Road, Jawahar Nagar, Industrial Area, Shahdara, Delhi - 110 094.
Cover Designer:
Cover printed at: SDR Printers
RACCRRYHDCZXY
Dedicated
to
my mother:
Uma Parekh
Contents
Preface xiii
Abbreviations xx
1. Multimedia—An Overview 1
1.1 Introduction 1
1.2 Multimedia Presentation and Production 1
1.3 Characteristics of a Multimedia Presentation 2
1.4 Hardware and Software Requirements 5
1.5 Uses of Multimedia 6
1.6 Analog and Digital Representations 8
1.7 Digitization 12
1.8 Nyquist’s Sampling Theorem 14
1.9 Quantization Error 16
1.10 Visual Display Systems 17
Review Questions 36
2. Text 39
2.1 Introduction 39
2.2 Types of Text 39
2.3 Unicode Standard 43
2.4 Font 45
2.5 Insertion of Text 47
2.6 Text Compression 48
2.7 Text File Formats 49
Review Questions 53
3. Image 55
3.1 Introduction 55
3.2 Image Data Representation 55
3.3 Image Acquisition 57
3.4 Image Processing 64
3.5 Binary Image Processing 77
3.6 Grayscale Image Processing 82
3.7 Color Image Processing 99
! " # ! " $
5. Audio 248
5.1 Introduction 248
5.2 Acoustics 248
5.3 Sound Waves 249
5.4 Types and Properties of Sounds 251
5.5 Psycho-Acoustics 258
5.6 Components of an Audio Systems 261
5.7 Digital Audio 267
5.8 Synthesizers 269
5.9 Musical Instrument Digital Interface (MIDI) 270
5.10 Digital Audio Processing 276
5.11 Speech 301
5.12 Sound Card 311
5.13 Audio Transmission 314
5.14 Audio File Formats 317
5.15 Surround Sound Systems 324
5.16 Digital Audio Broadcasting 329
5.17 Audio-Processing Software 332
Review Questions 338
6. Video 340
6.1 Introduction 340
6.2 Motion Video 340
6.3 Analog Video Camera 341
6.4 Analog Video Signal Representation 344
6.5 Television Systems 353
6.6 Video Color Spaces 355
6.7 Digital Video 358
6.8 Digital Video Processing 375
6.9 Video Recording and Storage Formats 380
6.10 Video File Formats 384
6.11 Video Editing Concepts 389
6.12 Video-Processing Software 393
Review Questions 398
7. Animation 399
7.1 Introduction 399
7.2 Historical Background 399
7.3 Uses of Animation 400
7.4 Traditional Animation 400
% ! " # ! " $
8. Compression 428
8.1 Introduction 428
8.2 Basic Concepts 428
8.3 Lossless Compression Techniques 432
8.4 Lossy Compression Techniques 441
8.5 Image Compression 454
8.6 Audio Compression 464
8.7 Video Compression 475
8.8 MPEG Standards Overview 487
8.9 Fractal Compression 499
Review Questions 510
Bibliography 755
B.1 Books 755
B.2 Research Articles 759
Coding 770
C.1 MATLAB Coding 770
C.2 VRML Coding 805
Index 816
Preface
The term ‘multimedia’ essentially implies ‘means of communication through multiple media’. From the earliest
times, humans have attempted to communicate their ideas through various means like gestures, sounds,
paintings, writings, etc. The multiple sense organs of humans also enable them to comprehend and support
such kind of multi-modal communications. In the real world, non-textual form of communication, like cave
paintings, were used much before the written form originated. In the computing world, however, the trend
has been just the reverse; text was the predominant form of communication over most of the earlier part
of computing history. Only in the last 20 years did non-textual forms gain signiÞcance. Dealing with text
through computers was much easier than dealing with non-textual media like pictures or sound. In recent
times, many technical barriers have been overcome and practical problems have been solved. Principles of
Multimedia gives an insight into these solutions and highlights the landmarks achieved, a theoretical treatise
of humankind’s endeavor for communicating through multiple means in the digital domain. By multiple means,
we now imply the following media: text, image, graphics, audio, video and animation, all in the digital form.
Added to the above is an important aspect of multimedia that distinguishes it from television and movies—that
of interactivity. At the basic level, this lets the viewer interact with a presentation and decide how he or she
wants to view it, e.g. jump from the opening scene to a speciÞc section instead of passively watching it from
beginning to end. At a more advanced level, interactivity lets the user change the normal course of action
within the presentation by performing certain pre-deÞned events. Good examples of such interactivity are
computer games and simulation packages like aircraft-training simulation. A ‘multimedia presentation’ is a
‘digital show’ using most or all of these components in an integrated way to communicate some idea to the
target audience. The advantage of such a presentation lies in the fact that one is allowed to present his/her
ideas in a variety of forms and ways that increase the comprehension level and retentivity of the subject matter
in the viewer’s mind. This makes it an important tool in education. Other possible uses of multimedia include
home entertainment, industrial training, information kiosks, corporate presentations, computer-aided design
applications, video conferencing and telemedicine only to name a few.
Objectives
With increase in the processing powers of today’s computers and decrease in the costs of hardware and
software, there has been a deÞnite trend in utilizing the advantages of digital multimedia in industries,
educational institutions and the research community. This is evident by the large number of digital repositories
of images, audio and video growing up all over the world. We have all heard the phrase “A picture is worth
a thousand words”. Going along those lines, a video should be worth a million. Added to that is sound and
interactivity. In order to use multimedia to its fullest extents, a thorough knowledge of its internals is essential,
especially for people who are willing to take up multimedia production as a career option. The Þrst objective
of this book is to meet this requirement, i.e. to explain as clearly as possible the fundamental concepts behind
how different components of multimedia function. The technical level of the book is set at the graduate/
post-graduate levels of science/engineering streams. It is expected that after going through the book, the
reader would comprehend the issues and standards related to the digitization, processing, compression and
playback of various media components, as well as be competent enough to design multimedia presentations
for a variety of applications.
% & ' # ( ) * #
Due to the widespread use of multimedia applications worldwide, multimedia as a subject has been
introduced in various forms in the curricula of schools and colleges all over the world. However, it being a
relatively new subject, the number of related books in the market is still limited. The situation is aggravated
by the fact that multimedia deals with a large number of different concepts put together. In fact, multimedia
is often seen as a meeting point of three different work areas: computer, communication, and entertainment.
Such being the case, a majority of the earlier books on multimedia deal with only speciÞc areas of the subject
instead of presenting the entire gamut. The second objective of this book is to provide a full and comprehensive
view of the most important and relevant aspects of the subject to cater to curricula of educational institutions
written in a way that it might be used as a textbook or reference book at the graduate and post-graduate
levels. While writing the book, syllabi of major universities have been kept in mind including Anna University,
Pune University, Jawaharlal Nehru Technological University (JNTU), West Bengal University of Technology
(WBUT), Mumbai University (MU), Rajasthan Technological University (RTU), Biju Patnaik University of
Technology (BPUT), Gujarat Technological University (GTU), Uttar Pradesh Technological University (UPTU),
Visvesvaraya Technological University (VTU), Rajiv Gandhi Proudyogiki Vishwavidyalaya (RGPV) and
Rajasthan Technological University (RTU).
It is well known that the capabilities of our computers are increasing in leaps and bounds. Keeping pace
with this, the functionalities of multimedia hardware and software are also improving day by day. In such
a scenario, students and professionals working in this Þeld need to keep their knowledge updated. In the
commercial arena too, the markets are ßooded with gadgets having newer functionalities and improved
features. People often come across a variety of terms in their day-to-day lives without fully understanding
what they stand for. A few examples are: 5.1 channel, surround sound, 3D monitor, ATRAC audio, digital
audio broadcasting (DAB), digital Dolby (DD), digital theatre systems (DTS), laserdisc, MP4, super audio CD,
torrent, virtual reality, video conferencing and so on. A third objective of this book is to provide the reader with
an up-to-date knowledge of the recent advances in this Þeld, which is more often seen in magazines and on
Websites than in textbooks. Attempts have been made to bring to light details about the state of the art in each
Þeld in an easy-to-understand manner.
A fourth objective is to give the reader an idea about the major research advances in the Þeld of multimedia
which includes image processing, audio processing, video processing, computer vision, pattern recognition,
multimedia databases, content based retrieval and non-textual data classiÞcation. The user is introduced to
tools and techniques like ArtiÞcial Neural Networks (ANN), Principal Component Analysis (PCA), Wavelet
decomposition and Eigen-space decomposition employed for handling such problems and their implementations
using MATLAB coding. Discussions on case-studies from research papers published in conferences and
journals also help highlight research issues pertaining to these areas.
(audio processing), Chapter 6 (video processing), Chapter 11 (computer vision, pattern matching, medical
imaging, content-based retrieval and data classiÞcation). A list of 212 research articles included in the
Bibliography and numerous website references in each chapter help the researcher in further readings.
Salient Features
∑ Introduction to topics on image processing, audio processing, and video processing along with numerous
MATLAB coding examples for performing related tasks
∑ Studies on concepts related to 2D and 3D graphics and animation including splines, polynomials,
transformations, projection, modeling, surface mapping, light, camera and rendering
∑ Overview of lossless and lossy compression techniques with emphasis on JPEG and MPEG
standards
∑ Discussions on hardware and OS supports for multimedia including optical storage technologies,
transmission standards, real-time protocols and playback architectures
∑ Focus on research issues in the Þelds of multimedia database, content-based storage and retrieval,
pattern recognition and computer vision
∑ Information on text formats and standards, multimedia document architecture, interchange formats,
IPR, digital copyrights, digital library and multimedia archives
∑ Exploration of Web-enabled multimedia content creation, hypermedia design and Web programming
languages like HTML, Dynamic HTML, CSS, JavaScript, XML, SMIL, SVG and VRML
∑ Guidelines and case studies for multimedia application development and authoring
% & ' # ( ) * #
∑ Pedagogy
o About 50 Solved examples including 115 coding samples included in chapters.
o Small code snippets have been included in many chapters, especially chapter 12, to illustrate
theoretical and practical examples.
o Introduction provides a background of the concepts to be covered in a chapter and the conclusion
lists the important topics discussed in the chapter as bulleted points.
o A set of about 470 Þgures are included in the book to illustrate key points.
o A list of 260 abbreviations and their full forms are included in the beginning of the book for ready
reference.
o A list of 105 book references and 212 references of research articles are included in the bibliography
for further readings. In addition, numerous online references and websites are included in each
chapter for further readings.
o Review questions have been included in each chapter to test and verify the knowledge of readers.
o In addition, problems will be included in the book website (OLC) to check problem-solving skills of
readers.
cepstral and wavelet analysis, pulse code modulation (PCM), linear predictive coding (LPC), code
excited linear prediction (CELP), components of sound cards, audio transmission, audio connectors,
surround sound systems, digital audio broadcasting (DAB), audio Þle formats, major features of audio
editing softwares.
Chapter 6 is related to Video and discusses concepts related to analog video camera, analog video transmis-
sion, generation of YC signals, chroma sub-sampling, television systems, digital video processing, video
color spaces, video recording and storage formats, video editing concepts, analog and digital video con-
nectors, video Þle formats, major features of video editing softwares.
Chapter 7 is related to Animation and discusses concepts related to keyframes and tweening, cel animation,
computer based animations, interpolations, parameter curve editing, motion paths, hierarchical
animation, inverse kinematics (IK), camera locations and movements, particle systems and space
warps, rendering algorithms, Web based animation techniques, animation Þle formats, major features of
2D and 3D animation softwares.
Chapter 8 is related to Compression and discusses concepts related to types of compressions, types of
redundancies, compression performance measurements, lossless compression techniques like run
length encoding (RLE) differential pulse code modulation (DPCM) Lempel-Ziv-Welsh (LZW) coding
Shannon-Fano coding Huffman coding and Arithmetic coding, lossy compression techniques involving
various transforms like discrete Fourier transform (DFT) discrete cosine transform (DCT) discrete
wavelet transform (DWT), image compression, audio compression, video compression, overview of
MPEG standards including MPEG-1 MPEG-2 MPEG-4 MPEG-7 MPEG-21, fractal compression.
Chapter 9 is related to CD and DVD storage technology and discusses concepts related to working principles
of optical storage, constant linear velocity (CLV) vs. constant angular velocity (CAV), data transfer
rate and X-rated speeds, various CD formats like CD digital audio CD-ROM photo-CD video-CD CD
recordable CD rewritable magneto optical discs laserdiscs, various DVD formats, CD vs. DVD, blu-ray
disc, multi-layered DVDs.
Chapter 10 is related to Multimedia Architecture and discusses concepts related to graphical user interfaces,
support for multimedia in operating systems, multimedia extension features in chipset like MMX,
hardware transmission cables like USB SCSI and FireWire, distributed multimedia systems, real-time
applications and protocols, multimedia playback architectures, synchronization types and techniques.
Chapter 11 is related to Multimedia Databases and discusses concepts related to content based storage and
retrieval, design and components of a multimedia database, low-level and high-level features, similarity
measurements, image features based on color texture and shape, audio features, video features,
classiÞcation of data, clustering, indexing, Eigen vectors and values, principal component analysis
(PCA), artiÞcial neural networks (ANN), character recognition, single-layer multi-layer perceptrons
(MLP), semantics in multimedia data, prototype implementations of multimedia databases.
Chapter 12 is related to Multimedia Documents and discusses concepts related to document architectures,
multimedia interchange formats, open media frameworks (OMF), hypertext and hypermedia, hypermedia
design models, Web programming languages like HTML CSS JavaScript XML SGML Dynamic-HTML
SMIL SVG, Digital copyrights and digital rights management, digital library standards and initiatives,
multimedia archives.
Chapter 13 is related to Multimedia Application Development and discusses concepts related to multimedia
production steps involving conceptualization, story, script, ßowline, storyboard, implementation, testing
and feedback, documentation, case studies, major features of multimedia authoring softwares, computer
game design and development.
Chapter 14 is related to Virtual Reality and discusses concepts related to forms of virtual reality, hardware and
software components, virtual reality application, virtual reality modeling language (VRML).
% & ' # ( ) * #
In addition to this the Appendix provides details of relevant speciÞcations and summaries of commands of
various programming languages used in this book. The Bibliography provides details of over 100 books and
over 200 research articles in this Þeld. Lastly, the Coding section lists out about 90 MATLAB program codes
and 25 VRML program codes for various tasks and examples discussed throughout the book.
Acknowledgements
Reviewers had asked for practical examples to be included in the book which has led to the inclusion of about
100 MATLAB examples related to various tasks related to image-, audio-, and video-processing. I am thankful
to all of them. Some names are given below.
Anirban Bhar Narula Institute of Technology, Kolkata
Rajendra Kumar Vidya College of Engineering, Meerut, Uttar Pradesh
Susan Elias School of Computer and Information, BSA Abdur Rehman University, Chennai
N J Uke Singhad College of Engineering, Pune
Sunith Varma GS Institute of Technology and Sciences, Rajiv Gandhi Technological University
(RGTU), Indore, Madhya Pradesh
I would also like to thank the editorial and production team members of Tata McGraw-Hill Education, India,
for their strong support to this project. Particularly, I am very much thankful to Ms Shalini Jha, Ms Smruti
Snigdha, Mr Sourabh Maheshwari, Ms Sohini Mukherjee, Mr Satinder Baveja, Mr P L Pandita and others
for their painstaking and meticulous effort during the developmental, editing and proofreading stages of the
manuscript.
Feedback
In conclusion it must be emphasized that unlike the basic sciences, multimedia is a laboratory-oriented subject.
We are learning a set of principles and procedures for utilizing them in the creation of better and improved
applications. Hence, practical utilization of the knowledge acquired is at the heart of multimedia. In other
words, theoretical aspects alone do not provide a complete understanding of the subject; learners must also
know how to build presentations or implement research ideas using necessary software editing, authoring and
programming tools.
In general, all readers are encouraged to provide feedback about the content matter of book as well as any
omissions or typing errors. The author can be contacted at [email protected]
— RANJAN PAREKH