Computer Graphics and Multimedia Notes
Computer Graphics and Multimedia Notes
Multimedia Basics:
Multimedia is a combination of text, graphic art, and sound, animation and video elements.
The IBM dictionary of computing describes multimedia as "comprehensive material, presented
in a combination of text, graphics, video, animation and sound. Any system that is capable of
presenting multimedia, is called a multimedia system"
A multimedia application accepts input from the user by means of a keyboard, voice or pointing
device.
Multimedia applications involve using multimedia technology for business, education and
entertainment.
Multimedia is now available on standard computer platforms. It is the best way to gain attention
of users and is widely used in many fields as follows:
Business - In any business enterprise, multimedia exists in the form of advertisements,
presentations, video conferencing, voice mail, etc.
Schools - Multimedia tools for learning are widely used these days. People of all ages
learn easily and quickly when they presented information with the visual treat.
Home PCs equipped with CD-ROMs and game machines hooked up with TV screens
have brought home entertainment to new levels. These multimedia titles viewed at
home would probably be available on the multimedia highway soon.
Public places - Interactive maps at public places like alone terminal.
Virtual Reality (V R) - This technology helps us feel a virtual reality effect is very
popular.
Multimedia Elements:
High-impact multimedia applications, such as presentations, training and messaging, require
the use of moving images such as video and image animation, as well as sound (from the video
images as well as overlaid sound by a narrator) intermixed with document images and graphical
text displays. Multimedia applications require dynamic handling of data consisting of a mix of
text, voice, audio components, Video components and image animation. Integrated multimedia
applications allow the user to cut sections of all or any of these components and paste them in
a new document or in another application such as an animated sequence of events, a desktop
publishing system, or a spreadsheet.
The components that fall under our definition of multimedia are:
Data elements for Multimedia Systems
Facsimile: Facsimile transmissions were the first practical means of transmitting document
images over telephone lines. The basic technology, now widely used, has evolved to allow
higher scanning density for better-quality fax.
Document images: Document images are used for storing business documents that must be
retained for long periods of time or may need to be accessed by a large number of people.
Providing multimedia access to such documents removes the need for making several copies
of the original for storage or distribution.
Photographic images: Photographic images are used for a wide range of applications such as
employee records for instant identification at a security desk, real estate systems with
photographs of houses in the database containing the description of houses, medical case
histories, and so on.
Geographic information systems map (GIS): Map created in a GIS system are being used
wildly for natural resources and wild life management as well as urban planning. These systems
store the geographical information of the map along with a database containing information
relating highlighted map elements with statistical or item information such as wild life statistics
or details of the floors and rooms and workers in an office building.
Voice commands and voice synthesis: Voice commands and voice synthesis are used for
hands-free operations of a computer program. Voice synthesis is used for presenting the results
of an action to the user in a synthesized voice. Applications such as a patient monitoring system
in a surgical theatre will be prime beneficiaries of these capabilities. Voice commands allow
the user to direct computer operation by spoken commands
Audio message: Annotated voice mail already uses audio or voice message as attachments to
memos and documents such as maintenance manuals.
Video messages: Video messages are being used in a manner similar to voice mail.
Full-motion stored and live video: Full-motion video started out as a very useful idea for on-
line training and maintenance manuals. The capability to use full-motion stored video for
electronic mail or live video for presentations and video conferencing are important
evolutionary steps. Three-dimensional video techniques are being adapted to create the concept
of virtual reality.
Holographic images: All of the technologies so for essentially present a flat view of
information. Holographic images extend the concept of virtual reality by allowing the user to
get “inside” a part, such as, an engine and view its operation from the inside.
Fractals: Fractals started as a technology in the 1980s but have received serious attention only
recently. This technology is based on synthesizing and storing algorithms that describes the
information.
4.2 MULTIMEDIA APPLICATIONS
The first widely used application of multimedia is document image management. It is primarily
intended for scanning documents and retaining their images.
Another application is image processing. It is also known as Image recognition. It is intended
for recognizing objects by analyzing their raster images. Applications that present a view of
generic multimedia applications are:
1. Document Imaging
The fundamental concepts of storage, compression and decompression, and display
technologies used for multimedia systems were developed for document image management.
Organizations such as insurance agencies, law offices, country and state governments, and the
federal government manage large volumes of documents.
Document image technology is adopted by Department of Defence for applications ranging
from military personnel records to maintenance manuals and high-speed printing systems.
Almost all document image system use workflows that are customized for the purpose for
which they are being used. The workflow defines the sequence for scanning images,
performing data entry based on the contents of the Images, indexing them and storing them on
optical media.
Document Image Hardware requirements:
Real-time image decompression and display place an important role on image processing
hardware. Image decompression and display supports 4 to 8 planes. 4 planes provide 16 colors
and 8 planes provide 256 colors. The image planes are also called bit planes, because, they are
addressed by a bit in a byte. Images must be processed at the rate of tens to hundreds of pixels
per nano-second. For high-resolution images, processing of the order of 10 pixels/ ns is enough
for monochrome still images. Gray-scale images consist of pixels that have shades of gray
ranging from 16 to 256. Color images feature color hues instead of shades of gray. Most high-
resolution monitors support 16 to 256 colors display capability. The number of colors that can
be depicted depends on the number of bits used to define the palette.
2.Image processing and Image Recognition
Image processing involves image recognition, Image enhancement, image synthesis, and image
reconstruction. An image processing system may actually alter the contents of the image itself.
Image processing systems employ the compression and decompression techniques, a wide
range of algorithm for object recognition, comparing images of objects with pre-defined
objects, extrapolating finer details to view edges more clearly, gray-scale balancing and gray-
scale and color adjustments.
Let us briefly overview the various aspects of image processing and recognition.
Image enhancement:
Most image display systems feature some level of image adjustment. Increasing the sensitivity
and contrast makes the picture darker by making borderline pixels black or increasing the gray-
scale level of pixels.
Capabilities built in the compression boards might include the following:
Image calibration: The overall image density is calibrated, and the image pixels are
adjusted to a predefined level.
Real time alignment: The image is aligned in real-time for skewing caused by improper
feeding of paper.
Gray-Scale normalization: The overall gray level of an image or picture is evaluated to
determine if it is skewed in one direction and if it needs correction.
RGB hue intensity adjustment: Too much color makes a picture garish and fuzzy.
Automatic hue intensity adjustment brings the hue intensity within pre-defined ranges.
Color Separation: A Picture with very little color contrast can be dull and may not bring
out the details. The hardware used can detect and adjust the range of color separation.
Frame averaging: The intensity level of the frame is averaged to overcome the effects
of very dark or very light areas by adjusting the middle tones
Image Animation
Computers-created or scanned images can be displayed sequentially at controlled display
speeds to provide image animation that simulates real processes. The basic concept of
displaying successive images at short intervals to give the perception of motion is being used
successfully in designing moving parts such as automobile engines.
Image annotation
Image annotation can be performed in one of two ways: as a text file stored along with the
image or as a small image stored with the original image. The annotation is overlayed over the
original image for display purposes. It requires tracking multiple image components associated
with a single page, decompressing all of them, and ensuring correct spatial alignment they are
overlayed.
Optical Character Recognition
Data entry is the most expensive component of data processing, because it requires extensive
clerical staff work to enter data. Automating data entry, both typed and handwritten, is a
significant application that can provide high returns. Optical Character Recognition (OCR)
technology is used for data entry by scanning typed or printed words in a form.
Initially, people used dedicated OCR scanners. NOW, OCR Technology is available in
software. OCR technology, used as a means of data entry, may be used for capturing entire
paragraphs of text. The capturing text is almost always entered as a field in a database or in an
editable document
Handwriting recognition
Research for Handwriting recognition was performed for CAD / CAM systems for command
recognition. Pen-based systems are designed to allow the user to write commands on an
electronic tablet.
Handwriting recognition engines use complex algorithms designed to capture data in real time
as it is being input or from an image displayed in a window, depending on the application. Two
factors are important for handwriting recognition. They are the strokes or shapes being entered,
and the velocity of input or the vectoring that is taking place. The strokes are parsed and
processed by a shape recognizer that tries to determine the geometry and topology of the
strokes. It attempts to compare it to existing shapes, such as predefined characters. The stroke
is compare with the prototype character set until a match is found or all pre-defined prototypes
have been checked without a match. Multimedia system will use handwriting recognition as
another means of user input.
Non-Textual Image Recognition
Image recognition is a major technology component in designing, medical and manufacturing
fields. Let us review the basic concepts of image recognition architecture.
For example, a general Image recognition system, - the Image Understanding Architecture has
the design which calls for three processing layers.
(i) 512 x 512 array of custom pixel processors that extract basic features such as lines
and object boundaries.
(ii) The features of an object extracted by the first layer are tracked by the DSP array,
and that information is fed into 512-M byte RAM.
(iii) At the highest level, sophisticated AI algorithms perform the difficult task of object
and scene recognition.
3.Full motion Digital video Applications
Full motion video has applications in the games industry and training, as well as the business
world. Full motion video is the most complex and most demanding component of multimedia
applications.
For business applications, some core requirements are needed.
(i) Full-motion video clips should be sharable but should have only one sharable copy.
(ii) It should be possible to attach full-motion video clips to other documents such as
memos, chapter text, presentation, and so on.
Movie Toolbox
Image Compression
Manager
Component Manager
Figure Shows the components in the Quick Time Architecture. Quick Time adjust
automatically to the hardware being used by the user. MPEG is another competing standard
which is comparatively higher-end, hardware-assisted standard. It can produce better
resolutions at faster rates.
Microsoft’s AVI
AVI is an acronym for Audio Video Interleave Standard. It is similar to Apple's Quick Time.
It offers low-cost, low-resolution video processing for the average desktop user. It is a layered
product. AVI is scalable. It allows users to set parameter such as window size, frame rate,
quality and compression algorithm through a number of dialog boxes. AVI-compatible
hardware allows enhancing performance through hardware-accelerated compression
algorithms such as DVI and MPEG. AVI supports several compression algorithms.