Moditroduction Multimedia Database
Moditroduction Multimedia Database
INTRODUCTION TO
MULTIMEDIA DATABASES
Prof. Dr. Naomie Salim
Faculty of Computer Science & Information
Systems
Universiti Teknologi Malaysia
Give examples of
multimedia applications
that deals with storing,
retrieving, processing and
sharing of multimedia
data
Eg 1. Journalism
Journalist to write article about
influence of alcohol on driving
Investigation involved:
Collect news articles about accidents,
scientific reports, television
commercials, police interviews, medical
experts interviews
Illustration:
Search photo archives, stock footage
companies for good photos
shocking, funny, etc.
Other examples
Searching movies
Based on taste of movies already seen
Based on movies a friend favor
Searching on web
Eg. searching Australian Open website (
http://www.ausopen.org)
Integrate conceptual terms + interesting
events
give info about video segments
showing female American tennis players
going to the net
Retrieval problems
EMPLOYEE (Name: char(20), City:
Char(20), Photo: Image)
How do you select employees in
Skudai?
How do you select employees that wear
tudung, wear glasses, fair and have a
mole under the lips?
Characteristics of Media
Data
Medium a type of Information
representation
Alphanumeric
Audio, video and image traditionally in
analog representation;
Static vs dynamic
Static: do not have time dimensions
(alphanumeric data, images, graphics)
Dynamic: have time dimensions (video,
animation, audio)
Multimedia
Collection of media types used together
Digital representation of
text
OCR (Optical character recognition) techniques
convert analog text to digital text
Eg. of digital representation: ASCII
Use 8 bits
Chinese char requires more space
Storage requirements depend on number of characters
Compression of text
Huffman, arithmetic coding
Since storage requirements not too high, less important
Digital representation of
audio
Audio
Digitizing pressure
waveforms
Transform into
Audio storage
requirements
Example of a CD audio
Compression (examples)
Masking: Discard soft sound because not
audible by louder sound
Speech: coding of lower frequency sounds only
MPEG: audio compression standards
Digital representation of
image
Scan analog photos & pictures using
scanner
Analog image approximated by rectangle of
small dots
In digital camera, ADC is built-in
Image compression
Exploit redundancy in image &
properties of human perception
Spatial redundancy: pixels in certain
area often appear similar (golden sand,
blue sky)
Human tolerance: error still allows
effective communication
Digital representation of
video
Compression of video
Metadata of Multimedia
Objects
Descriptive data
Give format or factual info about
multimedia object
Eg.: author name, creation date, length
of multimedia object, representation
technique
Eg. standard for descriptive data: Dublin
core
Can use SQL (metadata condition in
WHERE clause)
Metadata of Multimedia
Objects (cont.)
Annotations
Metadata of Multimedia
Objects (cont.)
Features
Derive characteristics from MM object
itself
Need language to describe features, eg.
MPEG-7
Process to capture features from MM
object is called feature extraction
Performed automatically, sometimes with
human support
Low-level Features
Low-level features
(cont.)
Eg. for images
Color histograms: # pixels having color of
certain range
Spatial relationships: eg. blue patterns appears
above yellow (beach photo),
Contrast: # dark spots neighboring light spots
High-level features
Features which are meaningful to end
user, such as golf course, forest
How can we bridge semantic gap between
low level and high level features
High level feature extraction from low level
features
Eg. text containing words football, referee
football match text
Eg. Speech to text translators (low level audio
features to text)
Eg. Video-Domain specific: loud sound from
crowd, round object passing white line,
Multimedia Information
Retrieval System (MIRS)
Component of MIRS
Feature Extraction
(Indexing)
Extraction of metadata (annotations, descriptions,
features) from incoming multimedia object
Algorithms have to consider extraction
dependencies. Eg.:
Video object segmented, choose key frame for each
segment
Extract low-level features from key frame
Based on low-level features, classify into shots of
audience, fields, close-ups
For field shots, detect positions of players
Extract body related features of players
Determine where net playing begins and ends
Incremental Maintenance
in ACOI Feature
Extraction Architecture
Possible Querying
Scenarios
Possible Querying
Scenarios (cont.)
Queries based on Profile
Users expose preferences in one way or
another
Preferences stored in user profile in
MIRS
Can use profile of a friend if not sure &
trusted
Possible Querying
Scenarios (cont.)
Queries based on Annotations
Text-based: keywords or natural language
Eg. Show me video in which Barack Obama shakes
hand with Mahathir Mohamad
Set of keywords derived from query & compared with
keywords in annotations of movies
Possible Querying
Scenarios (cont.)
Query by example
Give example MM object
MIRS extract all kinds of features from the MM
object
Resulting query based on these features
Similarity
Degree to which query & MM object of MIRS
are similar
Similarity calculated by MIRS based on
metadata of MM object & query
Try to estimate value of relevance of MM object
to user
Relevance Feedback
Starting point
query that describe info need or system provide starting
point
User can ask for another starting point if not satisfied
Can classify object based on topics & subtopics
Component of MIRS
Output Presentation
(Play)
When MIRS returns list of objects, system
has to decide whether user has right to
see them
User interface should be able to show all
kinds of MM data
What if objects are huge and result set
large?
Give user perception of content of object
Extract & present essential info for user to
browse & select objects
Text: title, summary, places where keywords occur
Audio: tune, start of song
Component of MIRS
Output Presentation
(cont.)
Streaming
Content sent to client at specific rate and
except for buffering, played directly
Audio & video is delivered as continuous
stream of packets
When resource become scarce
Quality of MIRS
Recall
r/R
Precision
r/n
Exercise
Discuss the role of DBMS in storing
MM objects
Discuss the role of Information
Retrieval systems in storing MM
objects
End of Module 1