MM 2

The document discusses capturing graphics and images, including image formats, graphics formats, and sampling of audio signals. Graphics are generated interactively while images can originate from real-world photos or digital files. Common image file formats include GIF, JPEG, and TIFF. Audio waveforms are represented digitally through sampling and quantization.

Uploaded by

Kumar Kumar

Q1. What is Capturing Graphics and Images?

The process of capturing digital images depends initially upon the image's origin, that is, real-world
pictures or digital images. A real-world picture is captured with an image capturing device, such as a
CCD scanner or CCD camera for still images, or a frame grabber for moving images. Graphics are generated
with interactive graphics systems. Digital images are normally very large; the ways to store them are
given below.

Image Formats
Image formats are basically of two kinds:

(1) Captured Image Format

This is the format that comes out of an image frame grabber, such as the VideoPix card, Parallax,
etc. It is specified mainly by two parameters:

• Spatial resolution (specified in pixels × pixels)

• Color encoding (specified in bits per pixel)

The values of both parameters depend on the hardware and software used for the input/output of images.

For example, for image capturing on a SPARCstation, the VideoPix card and its software are used. The
spatial resolution is 320 × 240 pixels, and the color can be encoded with 1 bit (a binary image format),
8 bits (color or grayscale) or 24 bits (color RGB).
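
To make the effect of the two parameters concrete, the following minimal sketch computes the uncompressed size of a captured image from its spatial resolution and color encoding. The function name is an illustrative assumption, and the calculation ignores any file header or padding a real format would add.

```python
def raw_image_bytes(width, height, bits_per_pixel):
    """Uncompressed size in bytes of a width x height image (no header, no padding)."""
    total_bits = width * height * bits_per_pixel
    return (total_bits + 7) // 8  # round up to whole bytes


# The VideoPix example from the text: 320 x 240 pixels at 1, 8 and 24 bits per pixel.
for bpp in (1, 8, 24):
    size = raw_image_bytes(320, 240, bpp)
    print(f"320 x 240 at {bpp:>2} bits per pixel: {size:,} bytes")
```

Going from a binary image to 24-bit RGB multiplies the storage requirement by 24, which is why the choice of color encoding matters as much as the resolution.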

(2) Stored Image Format

When storing an image, we store a two-dimensional array of values, in which each value
represents the data associated with a pixel in the image. For a bitmap, this value is a binary digit. For a
color image (pixmap), the value may be one of the following:

• Three numbers representing the intensities of the red, green and blue components of the color at that
pixel.

• Three numbers that are indices to tables of the red, green and blue intensities.

• A single number that is an index to a table of color triples.

• An index to any number of other data structures that can represent a color.

• Four or five spectral samples for each color.
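
As an illustration of the third option above (a single number indexing a table of color triples), here is a small sketch. The 4-entry table and the 2 × 3 image are made-up values chosen only for the example.

```python
# Hypothetical color table (palette): each entry is an (R, G, B) triple.
palette = [
    (0, 0, 0),        # index 0: black
    (255, 0, 0),      # index 1: red
    (0, 255, 0),      # index 2: green
    (255, 255, 255),  # index 3: white
]

# Stored image: a two-dimensional array holding one palette index per pixel.
indexed_image = [
    [0, 1, 1],
    [2, 3, 0],
]


def to_rgb(indexed, table):
    """Expand an indexed image into a pixmap of (R, G, B) triples."""
    return [[table[index] for index in row] for row in indexed]


rgb_image = to_rgb(indexed_image, palette)
print(rgb_image[0][1])  # the pixel in row 0, column 1
```

With a 4-entry table each pixel needs only 2 bits instead of 24, at the cost of one table lookup when the image is displayed.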

The image may be compressed before storage for saving storage space. Some current image file
formats for storing images include GIF, X11 Bitmap, Sun Rasterfile, PostScript, IRIS, JPEG, TIFF, etc.

Graphics Format
Graphics image formats are specified through graphics primitives and their attributes.

• Graphics primitives include lines, rectangles, etc. specifying 2D objects, or polyhedra, etc. specifying
3D objects. A graphics package determines which primitives are supported.

• Attributes of the graphics primitives include line style, line width, color effect, etc., which affect the
outcome of the graphical image.

Graphics primitives and their attributes represent a higher level of image representation,
in which the graphical image is not stored as a pixel matrix. To display the image, this representation
is converted into a bitmap or pixmap.

A bitmap is an array of pixel values with one bit for each pixel. A pixmap is an array of pixel
values with multiple bits (e.g., 8 bits for 256 colors) for each pixel.
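
The conversion from a graphics primitive to a bitmap can be sketched as follows. This is a minimal illustration with a hypothetical rectangle-outline primitive; a real graphics package supports many more primitives and attributes.

```python
def rasterize_rectangle(width, height, x0, y0, x1, y1):
    """Draw the outline of a rectangle primitive into a bitmap (one bit per pixel)."""
    bitmap = [[0] * width for _ in range(height)]
    for x in range(x0, x1 + 1):      # top and bottom edges
        bitmap[y0][x] = 1
        bitmap[y1][x] = 1
    for y in range(y0, y1 + 1):      # left and right edges
        bitmap[y][x0] = 1
        bitmap[y][x1] = 1
    return bitmap


# The primitive is described by four numbers; the bitmap needs one value per pixel.
bitmap = rasterize_rectangle(8, 6, 1, 1, 6, 4)
for row in bitmap:
    print("".join("#" if bit else "." for bit in row))
```

The primitive itself is just its corner coordinates, while the bitmap stores all 48 pixel values, which shows why the higher-level description is more compact.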
Q2. Audio representation in computer?


The smooth, continuous curve of a sound waveform isn't directly represented in a computer. A computer
measures the amplitude of the waveform at regular time intervals to produce a series of numbers. Each of
these measurements is called a sample.

The figure illustrates one period of a digitally sampled waveform.

Fig: Sampled Waveform

Each vertical bar in the figure represents a single sample. The height of a bar indicates the value of that
sample. The mechanism that converts an audio signal into digital samples is called an analog-to-digital
converter, or ADC. To convert a digital signal back to analog, you need a digital-to-analog converter, or
DAC.

• A transducer converts pressure to voltage levels.
• The analog signal is converted into a digital stream by discrete sampling.
• Discretization takes place both in time (sampling) and in amplitude (quantization).
• In a computer, we sample these values at intervals to get a vector of values.
• A computer measures the amplitude of the waveform at regular time intervals to produce a
series of numbers (samples).

Sampling: The smooth, continuous sound waveform is not directly represented in the computer. The
computer measures the amplitude of the waveform at regular time intervals to produce a series
of numbers. Each of these measurements is called a sample. This process is called sampling.

Sampling rate: The rate at which a continuous waveform is sampled is called the sampling rate. Like
frequency, the sampling rate is measured in Hz. To digitize a signal without losing information, the
sampling rate should be at least twice the highest frequency present in the signal.
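
The following sketch illustrates what goes wrong when the sampling rate is too low. The 10 Hz rate and the 3 Hz and 7 Hz tones are arbitrary values chosen for the example: a 7 Hz tone is above half the sampling rate, so its samples turn out identical to those of a 3 Hz tone (an effect known as aliasing).

```python
import math


def sample_cosine(freq_hz, rate_hz, count):
    """Sample a cosine of the given frequency at regular time intervals."""
    return [math.cos(2 * math.pi * freq_hz * n / rate_hz) for n in range(count)]


rate = 10                           # samples per second (too low for a 7 Hz tone)
high = sample_cosine(7, rate, 20)   # 7 Hz is above rate / 2 = 5 Hz
low = sample_cosine(3, rate, 20)    # 3 Hz is below rate / 2

# The two sample series are the same up to floating-point rounding, so after
# sampling the 7 Hz tone can no longer be told apart from the 3 Hz tone.
max_diff = max(abs(a - b) for a, b in zip(high, low))
print(f"largest difference between the two sample series: {max_diff:.2e}")
```

Sampling the 7 Hz tone at 14 Hz or more would avoid this ambiguity, in line with the rule stated above.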

Quantization: Just as a waveform is sampled at discrete times, the value of each sample is also discrete. The
quantization of the sample value depends on the number of bits used to measure the height of
the waveform. Coarser quantization (fewer bits) gives lower sound quality; finer quantization (more bits)
gives higher sound quality.
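
A minimal sketch of quantization, assuming samples in the range -1 to 1 and a uniform quantizer (both are assumptions made for illustration, as are the 8000 Hz rate and the 440 Hz test tone). It shows that the rounding error shrinks as more bits are used per sample.

```python
import math


def quantize(sample, bits):
    """Round a sample in [-1, 1] to the nearest of 2**bits evenly spaced levels."""
    levels = 2 ** bits
    step = 2.0 / (levels - 1)              # distance between neighbouring levels
    index = round((sample + 1.0) / step)   # integer code that would be stored
    return index * step - 1.0              # value reconstructed from that code


rate = 8000   # samples per second (assumed)
tone = 440    # test tone in Hz (assumed)
samples = [math.sin(2 * math.pi * tone * n / rate) for n in range(80)]


def max_error(bits):
    """Largest difference between the original and quantized samples."""
    return max(abs(s - quantize(s, bits)) for s in samples)


for bits in (3, 8, 16):
    print(f"{bits:>2} bits: {2 ** bits:>5} levels, max error {max_error(bits):.6f}")
```

Each extra bit doubles the number of levels and roughly halves the worst-case error, which is the sense in which more bits give higher sound quality.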

Q3. Explain speech signals.

Speech is the dominant form of communication among human beings, and it can be processed by both
humans and machines. The field of study concerned with handling digitized speech is called digital
speech processing.

Human Speech

Speech is based on spoken languages, which means that it has a semantic content. Human beings use
their speech organs without the need to knowingly control the generation of sounds. (Other species
such as bats also use acoustic signals to transmit information, but we will not discuss this here.) Speech
understanding means the efficient adaptation to speakers and their speaking habits. Despite the large
number of different dialects and emotional pronunciations, we can understand each other's language.
The brain is capable of achieving a very good separation between speech and interference, using the
signals received by both ears. It is much more difficult for humans to filter signals
received in one ear only. The brain corrects speech recognition errors because it understands the
content, the grammar rules, and the phonetic and lexical word forms.

Speech signals have two important characteristics that can be used by speech processing applications:

• Voiced speech signals (in contrast to unvoiced sounds) have an almost periodic structure over a
certain time interval, so that these signals remain quasi-stationary for about 30 ms.

• The spectrum of some sounds has characteristic maxima that normally involve up to five
frequencies. These frequency maxima, generated when speaking, are called formants. By definition, a
formant is a characteristic component of the quality of an utterance.
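
The near-periodic structure of voiced speech is what allows an application to estimate the pitch of a short frame. The sketch below is one common approach (normalized autocorrelation), not a method prescribed by these notes. The 8000 Hz sampling rate, the synthetic 125 Hz "voiced" frame and the 80 to 400 Hz search range are all assumptions made for illustration.

```python
import math

RATE = 8000                      # samples per second (assumed)
FRAME_LEN = int(0.030 * RATE)    # one 30 ms frame = 240 samples


def synthetic_voiced_frame(pitch_hz):
    """Stand-in for voiced speech: a fundamental plus two weaker harmonics."""
    return [
        math.sin(2 * math.pi * pitch_hz * n / RATE)
        + 0.5 * math.sin(2 * math.pi * 2 * pitch_hz * n / RATE)
        + 0.25 * math.sin(2 * math.pi * 3 * pitch_hz * n / RATE)
        for n in range(FRAME_LEN)
    ]


def normalized_autocorrelation(frame, lag):
    """Similarity (at most 1.0) between the frame and a copy shifted by `lag` samples."""
    a = frame[: len(frame) - lag]
    b = frame[lag:]
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return num / den if den else 0.0


frame = synthetic_voiced_frame(125)
lags = range(RATE // 400, RATE // 80 + 1)   # candidate periods for 400 Hz down to 80 Hz
best_lag = max(lags, key=lambda lag: normalized_autocorrelation(frame, lag))
pitch = RATE / best_lag
print(f"estimated period: {best_lag} samples, estimated pitch: {pitch:.1f} Hz")
```

The shift at which the frame best matches itself is the period of the signal. Real speech is only quasi-periodic, so the match is weaker than in this synthetic case and the estimate is only valid for the short interval in which the signal stays quasi-stationary.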

Speech Synthesis

Computers can translate an encoded description of a message into speech. This scheme is
called speech synthesis. A particular type of synthesis is text-to-speech conversion. Fair-quality
text-to-speech software has been commercially available for various computers and workstations, although
the speech produced by some of it lacks naturalness.

Speech Recognition

Speech recognition is normally achieved by drawing various comparisons. With the current
technology, a speaker-dependent recognition of approximately 25,000 words is possible. The problems
in speech recognition affecting the recognition quality include dialects, emotional pronunciations, and
environmental noise. It will probably take some time before the considerable performance gap
between the human brain and a powerful computer is bridged and speech recognition and speech
generation improve accordingly.

Q4. Explain reconstructing image

Reconstructing an image is the process of recovering or enhancing a distorted or degraded
image to obtain its original or improved version. This task is crucial in various fields, including medical
imaging, photography, and computer vision, where obtaining clear and accurate visual information is
essential. Two methods used for image reconstruction are the Radon Transform and Stereoscopy.

Radon Transform
The Radon Transform is a mathematical technique with important applications in medical imaging,
particularly in computed tomography (CT) scans. The underlying principle is to capture a series of
X-ray projections of an object from multiple angles. Each projection represents the
integrated X-ray attenuation along a specific line through the object; the Radon Transform describes this
mapping from the object to its projections. The projections are then combined by inverting the Radon
Transform to reconstruct a detailed cross-sectional image, commonly referred to as a "tomogram."

In a CT scan, an X-ray source emits X-rays through the object, and a detector measures the intensity of
X-rays that have passed through the object. By rotating the X-ray source and detector around the object,
multiple projections are acquired from various angles. The reconstruction step mathematically processes
these projections, essentially "back-projecting" the intensity values to their corresponding positions in
the reconstructed image. The result is a cross-sectional image that reveals internal structures of the
object without the need for physical dissection.
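
The idea of projecting and back-projecting can be sketched on a tiny grid. This is a deliberately simplified illustration, assuming only two projection angles (along rows and along columns) and plain, unfiltered back-projection; a real CT reconstruction uses many angles and a filtering step.

```python
def project(image):
    """Two projections of a 2D grid: sums along each row and along each column."""
    row_sums = [sum(row) for row in image]
    col_sums = [sum(col) for col in zip(*image)]
    return row_sums, col_sums


def back_project(row_sums, col_sums):
    """Smear each projection back across the grid and add the contributions."""
    return [[r + c for c in col_sums] for r in row_sums]


# Hypothetical object: one dense spot (value 5) in an otherwise empty 4 x 4 slice.
image = [
    [0, 0, 0, 0],
    [0, 0, 5, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]

rows, cols = project(image)
recon = back_project(rows, cols)
for row in recon:
    print(row)
```

The reconstructed grid is brightest where the two smeared projections cross, which is the position of the dense spot. The streaks left along that row and column are the blur that additional angles and filtering are meant to reduce.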

Stereoscopy
Stereoscopy, on the other hand, is a visual technique that aims to mimic the natural perception
of depth by the human visual system. This technique capitalizes on the fact that our eyes are positioned
slightly apart, giving each eye a slightly different view of the same scene. Our brain processes these
distinct views to perceive depth and spatial relationships in the environment.

To replicate this effect artificially, stereoscopy involves capturing or generating two separate
images of a scene, with a slight offset to simulate the viewpoint difference between our eyes. These
images are presented to each eye separately using specialized glasses or devices. When the brain
receives these distinct images, it fuses them to create a perception of depth. Objects appear to be at
different distances from the viewer, and the resulting experience is often referred to as a "3D effect."
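
The size of the offset between the two images depends on how far away an object is. The sketch below uses the standard relation for two parallel pinhole cameras, disparity = focal length × baseline / depth, which is an assumption brought in for illustration and is not part of these notes. The focal length (in pixels) and the 6.5 cm baseline are hypothetical values.

```python
def disparity_pixels(focal_px, baseline_m, depth_m):
    """Horizontal offset of a point between the left and right images, in pixels."""
    return focal_px * baseline_m / depth_m


FOCAL_PX = 1000.0    # hypothetical focal length expressed in pixels
BASELINE_M = 0.065   # hypothetical spacing between the two viewpoints, in metres

for depth in (0.5, 1.0, 10.0):
    offset = disparity_pixels(FOCAL_PX, BASELINE_M, depth)
    print(f"object at {depth:>4} m: offset of {offset:.1f} pixels")
```

Near objects shift a lot between the two views and distant objects shift very little. This difference in offset is the cue from which depth is perceived.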

Stereoscopy has applications in various domains, including photography, entertainment (3D
movies and virtual reality), and even scientific visualization. It enhances the viewer's immersion and
engagement by providing a more realistic sense of depth and dimensionality in images and videos.

Q5. Explain Television.

Multimedia System (CMP 366.3)

By: Er. Aruna Chhatkuli, Nepal College of Information Technology

Television is the most important application that has driven the development of motion video.
Television is a telecommunication medium for transmitting and receiving moving images that can be
monochrome (black and white) or colored, with or without accompanying sound. Television may
also refer specifically to a television set, television programming or television transmission.

1] Conventional Systems:
Conventional systems are used in black-and-white and color television. Conventional television systems
employ the following standards:

NTSC (National Television Systems Committee)

• NTSC, developed in the U.S., is the oldest and most widely used television standard.
• The color carrier is used at approximately 4.429 MHz or at approximately
3.57 MHz.
• NTSC uses quadrature amplitude modulation with a suppressed color carrier and works with a motion
frequency of approximately 30 Hz.
• 4×3 aspect ratio.
• 525 lines.
• 30 frames per second.
• Scanned in fields.

PAL (Phase Alternating Line)
• PAL is an analogue television color encoding system used in broadcast television systems in many
countries.
• 4×3 Aspect ratio.
• 625 lines
• 25 frames per second.
• Scanned in fields.
• There are slight variations: PAL-B, PAL-G, PAL-H and PAL-N.
• Used in continental Europe and parts of Africa, Middle East and South America.
• More Lines = Better Resolution
• Fewer Frame/fields = More Flicker

SECAM (Sequential Color with Memory)

• SECAM is a standard used in France and Eastern Europe.

• In contrast to NTSC and PAL, it is based on frequency modulation.

• It uses a motion frequency of 25 Hz and each picture has 625 lines.

• SECAM is an analog color television system first used in France.
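
The line counts and frame rates of the conventional systems can be combined into a few derived figures. The sketch below assumes interlaced scanning with two fields per frame and uses the nominal values from the lists above (for example 30 rather than the slightly lower exact NTSC color frame rate).

```python
# Nominal (lines per frame, frames per second) as listed for each system.
systems = {
    "NTSC": (525, 30),
    "PAL": (625, 25),
    "SECAM": (625, 25),
}


def line_rate_hz(lines, fps):
    """Number of scan lines drawn per second."""
    return lines * fps


def field_rate_hz(fps):
    """Fields per second, assuming two interlaced fields per frame."""
    return 2 * fps


for name, (lines, fps) in systems.items():
    print(f"{name:<5} {lines} lines x {fps} frames/s = "
          f"{line_rate_hz(lines, fps)} lines/s, {field_rate_hz(fps)} fields/s")
```

The line rates come out almost equal, so the systems trade vertical resolution against picture rate: PAL and SECAM draw more lines per picture, NTSC draws more pictures per second.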

2] Enhanced Systems
Enhanced Definition Television (EDTV) systems are conventional systems modified to offer
improved vertical and/or horizontal resolution. EDTV systems are an intermediate step toward digital
interactive television systems and their upcoming standards.

HDTV (High-Definition Television)

• The next generation of TV is known as HDTV.

• HDTV is a digital system.

• 16:9 aspect ratio.

• Permits several levels of picture resolution similar to that of high-quality computer monitors,
with 720 or 1080 lines (1280×720 pixels or 1920×1080 pixels).

• Frame rates range from 24 to 60 frames per second, with progressive or interlaced scan.

• Uses MPEG-2 compression to squeeze the picture into a 19 megabit per second data flow so that it
can be accommodated by a standard broadcast TV channel of 6 MHz bandwidth.
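
A rough calculation shows how much compression the 19 megabit per second figure implies. The sketch assumes 24 bits per pixel and the frame rates shown; both are illustrative assumptions, since actual HDTV signals use other pixel formats and a range of frame rates.

```python
CHANNEL_BITS_PER_SECOND = 19_000_000   # the 19 Mbit/s data flow from the text


def raw_bits_per_second(width, height, fps, bits_per_pixel=24):
    """Uncompressed video bit rate for the given picture size and frame rate."""
    return width * height * bits_per_pixel * fps


for width, height, fps in ((1280, 720, 60), (1920, 1080, 30)):
    raw = raw_bits_per_second(width, height, fps)
    ratio = raw / CHANNEL_BITS_PER_SECOND
    print(f"{width}x{height} at {fps} frames/s: {raw / 1e6:.0f} Mbit/s raw, "
          f"about {ratio:.0f}:1 compression needed")
```

Under these assumptions the uncompressed signal is well over a gigabit per second, so the compression has to reduce the data by a factor of roughly 70 to 80 to fit the channel.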
