L1 Introduction
L1 Introduction
INTRODUCTION
Course Info
Lectures
– Sat. 5:30pm – 8:30pm
TA
– TBA
Course Assessment
– Final Exam 90 marks
– Midterm, Projects, Assignments, Quizzes … etc. 35 marks
Textbooks and References
Textbook
1. Z. Li and M. S. Drew, Fundamentals of Multimedia, Prentice Hall, 2 nd
Edition, Springer, 2014. (or 3rd Edition, 2021).
References
1. M. Mandal, Multimedia Signals and Systems, Kluwer Academic
Publishers, 2003.
2. A. Puri, Multimedia Systems, Standards and Networks, Taylor and
Francis, 2007.
3. K. R. Rao, Z. S. Bojkovic, and D. A. Milovanovic, Multimedia
Communication Systems, Prentice Hall, 2002.
4. F. Halsall, Multimedia Communications, Addison Wesley, 2001.
Tentative Weekly Plan
Week Topic
1 Introduction to Multimedia Systems
2 Video representation and coding
3 Voice Fundamentals and coding
4 Multimedia standards
5 Audio-Video Analysis and Retrieval
6 Content-based Image Retrieval
7 Midterm (Tentative)
8 Content-based Video Retrieval
9 Multimedia Information Security and Digital Rights Management (DRM)
10 Packet Telephony Networks
11 VoIP Systems
12 H.323 Concepts and Configuration – SIP Concepts and Configuration
13 H.264
14 H.265
15 Internet Multimedia Content Distribution
5
What is Multimedia?
• Multimedia is a combination of two or more
categories of information having different
transport signal characteristics
– Text
– Audio and speech
– Images
– Graphics
– Video
What is a Multimedia System?
of multimedia information
7
Multimedia is Multidisciplinary
Computer
networks,
operating system
Multimedia Systems
Multimedia systems involve some basic enabling
techniques:
– Multimedia data representation and compression.
– Multimedia data processing and analysis.
– Transmitting multimedia data through communication
networks.
– Multimedia database, indexing, and retrieval.
1
0
Media Processing and Analysis
• In applications such as digital library,
automatic data analysis has to be done to
extract semantic meanings from audios,
images, and videos.
• Based on media processing methods such
as
– object tracking (face, eyes),
– object recognition,
– gesture recognition, etc.,
Toys Squash
Lab
Double Squash
1
3
Object Recognition
1
4
Media Delivery
• Transmitting multimedia data across the network is another
topic in multimedia computing.
• There are different issues when we transmit video and audio
through packet network.
– Quality of service
– Synchronization
– Error and congestion control
– Session setup
1
5
Multimedia Database and Indexing
• Multimedia database has to deal with large
media files.
• Multimedia data need new data structures,
indexing and searching methods.
• Content-based multimedia retrieval is
normally used in such environments.
What is a Digital Image?
A digital image is a representation of a two-
dimensional image as a finite set of digital
values, called picture elements or pixels
What is a Digital Image? (cont…)
• Pixel values typically represent gray levels
or colors.
• Digitization implies that a digital image is
an approximation of a real scene
1 pixel
What is a Digital Image? (cont…)
Common image formats include:
– 1 sample per point (Grayscale)
– 3 samples per point (Red, Green, and Blue)
– 4 samples per point (Red, Green, Blue, and “Alpha”)
Light and Electromagnetic Spectrum
The retina
Color Perception and Specification
R G B
x ; y ; z ;
RG B R G B RG B
and x y z 1.
Color Video
Principles of Color Video
• A video (a sequence of moving images) records the emitted
and/or reflected light intensity from the objects in a scene that is
observed by a video camera.
• The image function (x ,y ,t) )captured by the camera at any time t
is the projection of the light distribution in the 3D scene onto a 2D
image plane.
• In general, a video signal has spatial and temporal dimensions.
The spatial dimension (x ,y) )depends on the viewing area; and the
temporal dimension, t )depends on the duration for which the
scene is captured).
Color Video
Principles of Color Video
• If the camera has only one luminance sensor, (x ,y ,t) )is a
scalar function representing the luminance of the projected
light. Luminance image is also known as grey scale image.
• On the other hand, if the camera has three separate sensors,
each tuned to a chosen primary color, the signal is a vector
function that contains three color values at every point.
• A black and white image strictly has two colors: black and
white.
• A monochrome image is one which consists of colors of a
narrow band of wavelengths.
Color Video
Composite and Component Video
• A component video is specified by a tristimulus color
representation or a luminance-chrominance representation.
• The three color components can be multiplexed into a single
signal to form a composite video.
• The composite format is used in analog TV systems where the
chrominance signals are modulated to a higher frequency than the
luminance and adding the resulting modulated chrominance
signals to the luminance signal.
• A filter is used to separate the luminance and chrominance signals
for display in a color monitor.
• With a grey-scale monitor, only the luminance signal is extracted
for display.
Analog Video Raster
Progressive and Interlaced Scan
• In a raster scan, a camera captures a video sequence by
sampling it in both temporal and spatial directions. The
resulting signal is stored in a continuous 1-D waveform.
• The video signal consists of a series of frames separated by a
regular frame interval. Each frame consists of a consecutive
set of horizontal scan lines, separated by a regular vertical
spacing. The format is known as progressive scan.
• In the interlaced scan, each frame is scanned in two fields,
separated by the field interval. Each field thus contains half
the number of lines in a frame.
• The field containing the first line is called the top field; and that
containing the second line is the bottom field.
Analog Video Raster
Progressive and Interlaced Scan
Field 1 Field 2