PAM Finals
PAM Finals
What is Multimedia?
“Multimedia” has no strict definition.
Multimedia can have a many definitions these include:
Computer professional: uses computer to present and combine text, image, audio, video, and
interactive features in several ways.
Consumer entertainment vendor: interactive cable TV with hundreds of digital channels
available, or a cable TV-like service delivered over a high-speed Internet connection.
A Multimedia Application is an application which uses a collection of multiple media sources e.g. text,
graphics, images, sound/audio, animation and/or video.
History of Multimedia
2D and 3D graphics
2D/ 3D graphic elements are represented by 2D/3D vector coordinates.
Have properties such as a fill color, boundary thickness, and so on.
More bits Larger number of levels better quality and large storage requirements and vise
verse.
how many bits should be used to represent each sample?
Is this number the same for all signals?
It actually dependson the type of signal and what its intended use is.
For example, Audio signals which represent music, must be quantized on 16 bits, whereas
speech only requires 8 bits.
Bit Rate
Bit rate is the number of bits produced per second.It is very important for storage and distribution.
Examples
Signal Sampling rate Quantization Bit rate
Speech 8 KHZ 8 bits 64 kbps
AM Radio 11 KHZ 8 bits 88 kbps
HD TV(1080) 1920*1080 12 bits 24.88 Mbits/frame
Sampling theorem and Aliasing
What is the rate at which sampling should occur?
To determine the correct number of samples, you have to calculate
what is called Nyquist number and it is twice the maximum frequency occurring in the signal.
If a signal has maximum frequency of 10KHZ, it should be sampled at a frequency of 20KHZ.
FILTERING
From general point of view It is a methodology to keep some frequencies and remove all other
frequencies.
Analog filter: uses analog electronic circuits made up from components such as resistors,
capacitors, and operational amplifiers to produce the required filtering effect.
Digital filter: uses digital numerical computations on sampledrr, quantized values of the signal.
ADC Processor DAC
Filtering
Filters are classified into three categories according to their responses:
Low-pass filters: remove high frequency content from the input signal and keeping other
content.
High-pass filters: remove low frequency content from the input signal and keeping other content.
Band-pass filters: output signals containing the frequencies belonging to a defined band.
Digital Images
Still or static images.
They can be combined to create interesting applications, like:
Panoramic photography (Panoramas)
Segmented panoramas, also called stitched panoramas, are made by joining multiple
photographs with slightly overlapping fields of view to create a panoramic image.
Types of Image
Bitmap Image(Graphics)
Vector Image(Graphics)
Bitmap Images
The most common and comprehensive form of storage for images on computers is bitmap image.
Bitmap use combination blocks of different colors (known as pixels) to represent an image. Each
pixel is assigned a specific location and color value.
There are also called pixelized or raster images
Software to edit bitmapped graphics are :
Adobe Photoshop
Paint Shop Pro
Consists of a set of pixels has height and width (dimensions) the pixels have a bit depth
Advantage
Can have different textures on the drawings; detailed and comprehensive.
Disadvantage
Large file size.
Not easy to make modification to objects/drawings.
Graphics become "blocky" when the size is increased.
Digital Images
Bit depth: refers to the number of bits used to represent each pixel and divided into channels.
1 bit (1 channel) binary image (black or white colored)
8 bit (1 channel) gray scale image
24 bit (3 channels; R,G,B)
Vector Image
Vector images are based on drawing elements/objects to create an image.
The elements and objects are stored as a series of command that define the individual objects.
Packages that allow to create vector graphics include :
Macromedia Freehand MX
Macromedia Flash MX
Adobe Illustrator
Advantage
Small file size.
Maintain quality as the size of the graphics is increased.
Easy to edit the drawings as each object is independent of the other.
Disadvantage
Objects/drawings cannot have texture; it can only have plain colors or gradients ; limited level of
detail that can be presented in an image.
Resolution
Image resolution is a measure of how finely a device approximate continuous images using finite pixels
Different concepts of resolution
Resolution of scanners and printers is represented by their pixel density (dots per inch or dpi).
Resolution of video frames and monitors is represented by their pixel dimension (width*hieght).
Monitors also have pixel density measured by dpi.
Resolution of digital camera still is represented by the total number of pixels in the largest image it
can be recorded
Bitmapped(or raster) image has pixel dimensions, but no pixel density, it’s physical resolution depends
on the pixel density of the device it is to be displayed on.
𝑃𝑖𝑥𝑒𝑙 𝑑𝑖𝑚𝑒𝑛𝑠𝑖𝑜𝑛
𝑃ℎ𝑦𝑠𝑖𝑐𝑎𝑙 𝑑𝑖𝑚𝑒𝑛𝑠𝑖𝑜𝑛 =
𝐷𝑒𝑣𝑖𝑐𝑒 𝑟𝑒𝑠𝑜𝑙𝑢𝑡𝑖𝑜𝑛
Image File Sizes
For a 512 X 512 binary image
The number of bits used in this image
512 X 512 X 1 = 262,144 bit
= 32768 bytes
= 32,768 Kb
≈ 0.033 Mb
For a 512 X 512 Greyscale image
The number of bits used in this image
512 X 512 X 8 = 2,097,152 bits
= 262,144 bytes
= 262.14 Kb
≈ 0.262 Mb
For a 512 X 512 RGB image
The number of bits used in this image
512 X 512 X 8 X 3 = 6,291,456 bit
= 786,432 bytes
= 786.432 Kb
≈ 0.786 Mb
Image Dithering
Dithering is often used for displaying monochrome images
Dithering is used to calculate patterns of dots such that values from 0 to 255 correspond to
patterns that are more and more filled at darker pixel values, for printing on a 1-bit printer.
To get color to black and white, first turn into gray scale.
I=0.299R+0.587G+0.114B
This formula reflects the fact that green is more representative of perceived brightness than blue.
Image Dithering
Reducing Effects of Quantization by Dithering
Threshold dithering
Error diffusion dither (Floyd-Steinberg)
Ordered dithering
Pattern dithering
Threshold Dithering
For every pixel: If the intensity < 128, replace with black, else replace with white
- 128 is the threshold
- This is the naïve version of the algorithm
To keep the overall image brightness the same, you should:
- Compute the average intensity over the image
- Use a threshold that gives that average
- For example, if the average intensity is 150, use a threshold that is higher than 150 replace
with white, else replace with black
Ordered Dithering
Break the image into small blocks (n x n)
Define a threshold matrix (n x n):
Use a different threshold for each pixel of the block
Compare each pixel to its own threshold (if pixelo >=Thr pixeln=255 else pixeln=0)
20 75 80 90 150 113 50 160 0 128 32 160
150 90 180 84 155 80 220 100 192 64 224 96
50 176 16 200 220 180 15 130 48 176 16 144
235 128 190 70 220 110 220 85 240 112 208 80
Sudocode:
An algorithm for ordered dither, with n*n dither matrix, is as follows:
Pattern Dithering
Compute the intensity of each sub-block and index a pattern.
NOT the same as before
Here, each sub-block has one of a fixed number of patterns – pixel is determined only by
average intensity of sub-block
In ordered dithering, each pixel is checked against the dithering matrix before being turned
on Pattern Dithering
Video Concept
Video is an excellent tool for delivering multimedia.
Video places the highest performance demand on computer and its memory and storage.
Digital video has replaced analog video as the method of choice for making and delivering video
for multimedia.
Video
Since video is created from a variety of sources, we begin with the signals themselves
Analog video is represented as a continuous (time-varying) signal
Digital video is represented as a sequence of digital images.
Analogue Video
Video information is stored using television video signals, film, videotape or other non-computer
media
Each frame is represented by a fluctuating voltage signal known as an analogue wave for
Digitizing Video
Digital video combines features of graphics and audio to create dynamic content for multimedia
products.
Video is simply moving pictures.
Digitized video can be edited more easily.
Digitized video files can be extremely large.
Digitizing Video
A video source (video camera ,VCR, TV or videodisc) is connected to a video capture card in a
computer
As the video source is played, the analog signal is sent to the video card and converted into a
digital file (including sound from the video).
Digital Video
Video is represented as a sequence discrete images (frames) shown in quick succession
Why in the sending process the RGB converts to YUV? and in the receiver process the YUV
coverts again to RGB?
1. Better compression efficiency
2. Video standard compatibility
3. Supports both color and grayscal
YUV is turned back to RGB because display devices operate in the RGB color model, since RGB is
necessary for screen rendering, as it directly controls how pixels emit light.
Conversion to YUV
Decouple the intensity information (Y or luminance) from the color information (UV or
chrominance
The separation was intended to reduce the transmission bandwidth and is based on experiments
with the human visual system, which suggests that humans are more tolerant to color distortions.
In other words, reducing the color resolution does not affect our perception.
Display Port
Display Port is a digital display interface. It is the first display interface that uses packetized data
transmission, like the Internet or Ethernet
Display Port can achieve a higher resolution with fewer pins than the previous technologies.
The use of data packets also allows Display Port to be extensible, i.e., new features can be
added over time without significant changes to the physical interface itself.
Display Port can be used to transmit audio and video or either of them.
Compared with HDMI, Display Port has slightly more bandwidth.
Sound Facts
Wave Characteristics
Frequency: Represents the number of periods in a second and is measured in hertz (Hz) or
cycles per second.
Human hearing frequency range: 20Hz to 20kHz (audio)
Amplitude: The measure of displacement of the air pressure wave from its mean.
Characteristic of Sound
The common characteristic used to describe audio signals is the number of channels
(Dimensionality), which may be one (mono), two (stereo), or multichannel (surround sound).
Mono and stereo sound technology are the most commonly used.
Surround Sound
Surround sound aims to create a Multi-dimensional sound experience.
It uses multiple audio tracks to engulf the audience in many sources of sound, making them feel
as if they are in the middle of the action.
Surround sound is mostly used in movie theaters. It allow the audience to hear sounds coming
from all around them.
It makes the audience is completely captivated by the movie and is no longer aware of their real-
world.
Surround sound formats rely on multiple dedicated speakers that physically surround the audience.
Movie theaters today use the THX 10.2 surround sound system.
One center speaker carries most of the dialog.
The left and right front speakers carry most of the soundtrack (music and sound effects).
A pair of surround sound speakers are placed to the side of (and slightly above) the audience to
provide the surround sound and ambient effects.
Finally, a subwoofer can be used to reproduce the low and very low frequency effects.
Multimedia Authoring
What is the meaning of authoring multimedia tools?
Authoring tools provide an integrated environment for binding together the different elements of a
Multimedia production.
Intermedia Processing
Responsible for assembling different media types in one product. An example of this software is
Adobe director.
Common requirements of intermedia authoring tools
Spatial placement control
Temporal control
Interactivity setup
Multimedia Authoring Paradigms
An authoring paradigm or an authoring metaphor can be referred to as the methodology by which an
authoring tool allows an author to accomplish creation of content. There are several metaphors.
Timeline
Timelines are a useful way of representing multimedia data during the course of a presentation or
application.
o Time is represented along the x-axis
o Tracks are represented along the y-axis
The developer can move objects left and right to change the order of the information and can
lengthen or shorten the bars to change their duration
Scripting
Scripting languages are cut-down versions of complete programming languages
- They tend to have less features and are therefore easier to learn
Scripting models allow the developer to write small scripts (programs) which can be associated
with a multimedia object
- e.g. you may write a script to make a graphic image move across the screen or to make a
window pop up when an item is clicked.
Flow Control
It is looks like the flowchart.
Each part is represented an icon (symbolic picture)
Each icon does a specific task, e.g. plays a sound
Icons are then linked together to form complete applications
Can easily visualise the structure and navigation of the final application
Cards
In these, authoring systems elements are organized as pages of a book or a stack of cards.
The authoring system lets you link these pages or cards into organized sequences.
You can jump, on command, to any page you wish in a structured navigation pattern.
A page may contain hyperlinks to other pages to provide navigation or pages may be sequentially
viewed
There maybe global parameters that can be set to affect the entire application
- e.g. background colour, default font, etc
Color Spaces
There are different color spaces for image and video likes,
o RGB
o CMYK
o YUV
o HSV
RGB Color Spaces
The RGB color space is a linear color space that formally uses single wavelength primaries
(645.16 nm for R, 526.32 nm for G, and 444.44 nm for B).
The RGB color space is common in display devices and it is device dependent