0% found this document useful (0 votes)

76 views35 pages

Ch-7-Multimedia Data Retrieval and Management

The document discusses content-based multimedia data retrieval and management. It describes using low-level features to represent multimedia content, such as color histograms to represent images. Similarity measures are used to compare feature values between objects to find the most similar matches during retrieval. Key aspects that influence the retrieval process include the features used, similarity criteria, how objects are ranked, and whether whole objects or parts are searched.

Uploaded by

akililu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

76 views35 pages

Ch-7-Multimedia Data Retrieval and Management

Uploaded by

akililu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 35

Chapter 7

Multimedia Data Retrieval and

Management
Outline

 Description models for MM data retrieval

 Low-level features for MM data content
representation
 Similarity measures for MM data content
comparison

Multimedia Systems
MM data retrieval
 Features are a smarter way to represent MM data
content than their original format
 e.g., color and texture for an image
 Today we focus on which are the most suitable models
for
 representing, interpreting, describing and comparing
such features
E.g., color histograms for images by using the Euclidean distance as
similarity measure

…with the final goal to be able to retrieve from

MM collections those objects which are
most interesting for us!! 

Multimedia Systems
Content-based search
 First approach to search for MM objects relies on standard text-based
techniques, provided objects come with a precise textual description
of what they represent/describe, i.e., of their semantics
 However, the “annotation” of MM objects is a subjective, time
consuming, and tedious process (completely manual!!)
 A more convenient approach, suitable to manage large DBs,
is to
automatically extract from MM objects
a set of (low-level) relevant numerical features
that, at least partially, convey some of the semantics of the
objects
 Clearly, which are the “best” features to extract depend on the specific
medium and on the application at hand (i.e., what we are looking for)

Look for cheetahs? This is fine; but, how to find it?

Multimedia Systems
Content-based similarity search
 Once we have feature values, we can search objects by using them
 Assume a database (DB) with N MM objects (e.g., images) and, for
each of the N objects, we have extracted the “relevant features”
 E.g., we could extract some color information from images
 We can now search for objects whose feature values are “similar”
(in some sense to be defined) to the feature values of our query
 In general, this approach, much alike as it happens in text-retrieval,
cannot guarantee that all and only relevant results are returned as
result of a query

Look for cheetahs? Oops! Not really a cheetah ;-)

Multimedia Systems
The general scenario
 In general, we have a 2-levels scenario:

The objects level

Ultimately, we want to
find relevant objects
Needed to support
automatic
retrieval
The features level
We extract features from the
objects, and use them for
Color Shape Textur
e
querying the DB

Multimedia Systems
The reference architecture
 For the text-based approach, the image querying problem can
be simply transformed into a traditional information retrieval
problem
(IR)
query image
 as we saw for textual feature
documents retrieval and extraction
as we will see when image
speaking about segmentation
“MM data annotation”… GUI (optional)

 For content-based
information retrieval (CBIR) results
query
visualize
more sophisticated processor
query evaluation techniques query engine
are required
index

image feature
DB DB
Multimedia Systems
Variables of the CBIR problem
 How the set of relevant results is determined depends
 on which low-level features are used to characterize the MM
data content
 on the similarity criterion (distance function) used to
compare
such features
 on how DB objects are ranked with respect to the query
 on whether the user is interested in the whole MM data query or
only in a part of it

 ! All above aspects strongly influence the

query evaluation process

Multimedia Systems
CBIR problem
 Simplest case:
each MM data object (i.e., image) is characterized using
global low-level features and the result of a query
consists in the set of DB objects that “better match” the
visual characteristics of the target object, according to a
predefined similarity criterion, which is in turn based on
such features
 This is also defined Nearest Neighbors (NN)
search problem

Multimedia Systems
Representing color
 In a digital image, the color space that encodes the color
content of each pixel of the image is necessarily discretized
 This depends on how many bits per pixel (bpp) are used
Example:
 if one represents images in the RGB space by using 8  3 = 24 bpp,
the number of possible distinct colors is 224 = 16,777,216
 With 8 bits per channel, we have 256 possible values on each
channel
 Although discrete, the possible color values are still too many if
one wants to compactly represent the color content of an image
 This also aims at achieving some robustness in the matching process
(e.g., the two RGB values (123,078,226) and (121,080,230) are almost
indistinguishable)

 In practice, a common approach to represent color is to make

use of histograms…
Multimedia Systems
Color histograms
 A color histogram h is a D-dimensional vector, which is obtained by
quantizing the color space into D distinct colors
 Typical values of D are 32, 64, 256, 1024, …
Example: the HSV color space can be quantized into D=32 colors:
H is divided into 8 intervals, and S into 4
V = 0 guarantees invariance to light intensity
 The i-th component (also called bin) of h stores the percentage (number)
of pixels in the image whose color is mapped to the i-th color
 Although conceptually simple, color histograms are widely used since
they are relatively invariant to translation, rotation, scale changes and
partial occlusions

D = 64

Multimedia Systems
Further examples
 Two D=64 color histograms

Multimedia Systems
Comparing color histograms
 Since histograms are vectors, we can use any Lp-norm to measure the
distance (dissimilarity) of two color histograms
 However, doing so we are not taking into account colors’ correlation
 Depending on the query and the dataset, we might therefore obtain low-quality
results
 Weighted Lp-norms and relevance feedback can partially alleviate the
problem…

 The problem is that Lp-norms

just consider the difference
of corresponding bins, i.e.,
they perform a 1-1
comparison
 With color histograms, our
“coordinates” are not
unrelated (“cross-talk” effect)

Multimedia Systems
Sample queries based on color (1)
32-D HSV histograms
QueryImage Euclidean distance

Weighted Euclidean distance

Multimedia Systems
Sample queries based on color (2)
QueryImage Euclidean distance 32-D HSV histograms

Weighted Euclidean distance

Multimedia Systems
Quadratic distance
 Consider two histograms h and q, both with D bins
 Their quadratic distance is defined as:
LA
D D
L A (h, q; A)  a h  q h q
i,j i i j j


i1 j1

 h  q  T
 A  h 
q
where A = {ai,j} is called the (color-)similarity matrix
 The value of ai,j is the “similarity” of the i-th and the j-th colors (ai,i = 1)
 Note that:
 when A is a diagonal matrix we are back to the weighted Euclidean distance,
 when A = I (the identity matrix) we obtain the L2 distance
 In order to guarantee that LA is indeed a distance (LA(h,q;A)  0 h,q), it
is sufficient that A is a symmetric positive definite matrix

Multimedia Systems
Quadratic distance vs. Euclidean distance
 As a simple example, let D = 3, with colors red, orange, and blue
 Consider 3 pure-color images and the corresponding
histograms:

h1=(1,0,0) h2=(0,1,0)
h3=(0,0,1)

 Using L2, the distance between two different images is always 2

 On the other hand, let the color-similarity
A matrix be defined as:
1 0.8 0
0.8 1 0
0 0 1

 Now we have LA(h1,h2) = 0.4, whereas LA(h1,h3) = LA(h2,h3) = 2

Multimedia Systems
Representing texture (1)
 Texture provides information about the uniformity, granularity
and regularity of the image surface

 It is usually computed just considering the gray-scale

values of pixels
 V channel in HSV

Multimedia Systems
Representing texture (2)
 Tamura features correspond to properties of a texture which
are readily perceived, that is coarseness, contrast and
directionality
(3-D feature vector)
 Coarseness - coarse vs. fine
 Contrast - high vs. low contrast
 Directionality - directional vs. non-directional

 L1 or L2 distance functions are usually applied

for comparison

Multimedia Systems
Texture extraction with Gabor filters
 A Gabor filter is a Gaussian modulated by a sinusoid, which
can reveal the presence of a pattern along a certain direction
and at a certain scale (frequency)

Scale: 3 at 72° Scale: 4 at 108° Scale: 5 at 144°

 To extract texture information, one chooses a number of
directions/orientations (e.g.,6) and scales (e.g., 5) according
to which the image has to be analyzed
 For each orientation and scale, the average and the variance
(standard deviation) of the filter output are computed
 This leads to, say, 265 = 60-D feature vectors, which are
usually compared using the Manhattan (L1) distance
Multimedia Systems
Gabor filter
 Let I be an image, with I(x,y) being the gray-scale value of the pixel in
position (x,y)

 A Gabor function is written as 1 1 x y 2 cos2πx
G(x, y)  2πσ2 σ exp  
 2
x y  2  σ x σ y2 
and is completely determined by its frequency () and bandwidth (x,y)
 The Gabor filter Gm,n(x,y) for scale m and orientation n is then defined as
Gm,n (x, y)  am G(x' , y'
)x'  a  m (x cosθ  y sinθ ), y'  a  m (x sinθ  y 
n
cosθ ), θ
n n
n
nπ/K
n

where K is the total number of orientations

 Finally, the image is analyzed by convolution with the filter:
w m,n (x, y)  
i 
j G m,n (x - i,y - j)I(i,

Multimedia Systems
Representing shape
 Once one has succeeded in extracting an object’s contour,
the next step is how to represent/encode it
 A common approach is to navigate the contour, which leads
to an ordering of the pixels in the contour:
{ (x(t),y(t)) : t = 1…,M }

 A 2nd step is to represent the resulting

curve in a parametric form
 For instance, a possibility is to resort
to complex values, by setting z(t) = x(t)
+ j y(t)
 Thus, now we have vectors of
complex values…
 The problem is that each vector has a different length
Multimedia Systems
Representative points
 The idea is to keep only the D most “interesting” points
 Some methods are:
 Equally-spaced sampling (a)
 Grid-based sampling (b)
 Maximum curvature points (c) (a)
 Fourier-based methods, which first
compute the DFT of the contour, and then
keep only the first D coefficients
 Working in the frequency domain has
several advantages:
 It can be proved that by properly modifying Fourier
coefficients one can achieve invariance to scale, (b)
translation and rotation
 Further, by viewing shape as a “signal”, one can adopt
distance measures that have been developed for the
comparison of time series and that are somewhat insensitive
to signals’ modifications (c)

Multimedia Systems
Comparing shapes
 The commonest way to measure the (dis-)similarity of two shape
vectors of equal length D is based on Euclidean distance (L2)
 However, with Euclidean distance we have to face a basic problem
 Sensitivity to “alignment of values”

 Intuitively, we would need a distance measure that is able to “match” a

point of time series s even with “surrounding” points of time series q
 Alternatively, we may view the time axis as a “stretchable” one

 A distance like this exists, and is called “Dynamic Time Warping” (DTW)
 In order to understand how it works, we first need to introduce some important
concepts related to time series…
Multimedia Systems
How to measure similarity between time series
 Given two time series of equal length D, the commonest way
to measure their (dis-)similarity is based on Euclidean
distance
 However, with Euclidean distance we have to face two basic
problems
1. High-dimensionality: (very) large D values
2. Sensitivity to “alignment of values”
s

 For problem 1. we need to define q

effective dimensionality reduction
techniques that work in a (much)
lower dimensional space
 For problem 2. we will introduce D-1
L 2 s,q  st  qt
2
a new similarity criterion t 0
 
Multimedia Systems
Dimensionality reduction: DFT (1)
 The first approach to reducing the dimensionality of time series,
proposed in [AFS93], was based on Discrete Fourier Transform (DFT)
 Remind: given a time series s, the Fourier coefficients are complex
numbers (amplitude,phase), defined as:
1 D1
Sf   s exp j2πf
t f  0,..., D 
D 1
t/D
t0

 From Parseval theorem we know that DFT preserves the energy of

signal:
the D1 D1
Es  2
s 2t  ES 

f
t0

 Since DFT
 is a linearS transformation we have:
f 0
D1 D1
2 2
 L 2 (S,Q)2
L (s,q)2  t0 s  q
2 t t
  Es  q  ES  Q   S
f f

f 0 Q

thus, DFT preserves the Euclidean distance

 And? What can we gain from such transformation??
Multimedia Systems
Dimensionality reduction: DFT (2)
 The key observation is that, by keeping only a small set of Fourier
coefficients, we can obtain a good approximation of the original signal
 Why: because most of the energy of many real-world
signals concentrates in the low frequencies ([AFS93]):
 More precisely, the energy spectrum (|Sf|2 vs. f) behaves as
O(f-b), b > 0:
 b = 2 (random walk or brown noise): used to model the behavior of stock
movements and currency exchange rates
 b > 2 (black noise): suitable to model slowly varying natural phenomena (e.g.,
water levels of rivers)
 b = 1 (pink noise): according to Birkhoff’s theory, musical scores follow this
energy pattern
 Thus, if we only keep the first few coefficients (D’ << D) we can achieve
an effective dimensionality reduction
 Note: this is the basic idea used by well-known compression standards, such
as JPEG (which is based on Discrete Cosine Transform)

Multimedia Systems
An example: EEG data
 Sampling rate: 128 Hz

Time series (4 secs, 512 points) Energy spectrum

Multimedia Systems
Another example Fourier
First 4
Fourier
data values coefficients coefficients
 128 0.4995 1.5698 1.5698
points 0.5264
0.5523
1.0485
0.7160
1.0485
0.7160
0.5761 0.8406 0.8406
s 0.5973 0.3709 0.3709
0.6153 0.4670 0.4670
0.6301 0.2667 0.2667
0.6420 0.1928 0.1928
s’ 0.6515 0.1635
0.6596 0.1602
0 20 40 60 80 100 120 140
0.6672 0.0992
0.6751 0.1282
0.6843 0.1438
0.6954 0.1416
0.7086 0.1400
0.7240 0.1412
0.7412 0.1530
0.7595 0.0795
0.7780 0.1013
0.7956 0.1150
0.8115 0.1801
0.8247 0.1082
s’ = approximation of s with 0.8345 0.0812
0.8407 0.0347
4 Fourier coefficients 0.8431 0.0052
0.8423 0.0017
0.8387 0.0002
… ...
Multimedia Systems
The “alignment” problem
 Euclidean distance, as well as other Lp-norms, are not robust w.r.t., even
small, contractions/expansions of the signal along the time axis
 E.g., speech signals
 Intuitively, we would need a distance measure that is able to “match” a
point of time series s even with “surrounding” points of time series q
 Alternatively, we may view the time axis as a “stretchable” one
 A distance like this exists, and is called “Dynamic Time Warping”
(DTW)!

Fixed Time Axis “Warped” Time Axis

Sequences are aligned “one to one” Non-linear alignments are possible

Multimedia Systems
How to compute the DTW (1)
 Assume that the two time series s and q have the same length D
 Note that with DTW this is not necessary anymore!
 Construct a DD matrix d, whose element di,j is the distance
between si and qj
 We take di,j = (si - qj)2, but other possibilities exist (e.g., |si – qj|)

D= 0 1 2 3 4 5
7 25 16 25 36 16 9
6
3 1 0 1 4 0 1
s 1 2 5 4 3 7
4 4 1 4 9 1 0
q 2 3 2 1 3 4 s
L2(s,q) = 29 5 9 4 9 16 4 1
 The “rules of the game”: 2 0 1 0 1 1 4
 Start from (0,0) and end in (D-1,D-1) 1 1 4 1 0 4 9
 Take one step at a time
d 2 3 2 1 3 4
 At each step, move only by increasing i, j, or both
 I.e., never go back! q
 “Jumps” are not allowed!
 Sum all distances you have found in the “warping path”

Multimedia Systems
How to compute the DTW (2)
 The figure shows a possible warping path w, whose “cost” is 21
 The “Euclidean path” moves only along the main diagonal, and costs 29

7 25 16 25 36 16 9
3 1 0 1 4 0 1
4 4 1 4 9 1 0
s
5 9 4 9 16 4 1
2 0 1 0 1 1 4
1 1 4 1 0 4 9
2 3 2 1 3 4
warping path w

The DTW is the minimum cost among all the warping paths

 But the number of path is exponential in D 

 Ok, but we can use dynamic programming, with complexity O(D2) 

Multimedia Systems
How to compute the DTW (3)
 From the d matrix, incrementally build a new matrix WP, whose elements
wpi,j are recursively defined as:

wpi,j  di,j  min{wp i-1, j, wpi,j-1, wpi-1, j-1}

7 25 16 25 36 16 9 7 40 22 31 43 24 15
3 1 0 1 4 0 1 3 15 6 7 11 8 6
4 4 1 4 9 1 0 4 14 6 9 18 8 5
s
5 9 4 9 16 4 1 s 5 10 5 11 18 7 5
2 0 1 0 1 1 4 2 1 2 2 3 4 8
1 1 4 1 0 4 9 1 1 5 6 6 10 19
d 2 3 2 1 3 4 WP 2 3 2 1 3 4
q
q
 Then set dDTW(s,q) = wpD-1,D-1

Multimedia Systems
Back to the shape context: sample
queries [BCP05]
R = relevant 1100 objects’ contours
(same type of fish)
QueryImage

Multimedia Systems
This is not the whole story…
 Of course, many other features models (and correspondent
distance functions) have been defined in literature for MM
data
 Please, refer [SWS+00, LZL+07, DJL+08] for detailed pointers
 This was just a way to provide some concrete examples of
MM features and distance functions for comparing them!
 Note that, besides “generic” features, any specific image
domain/application needs to extract and manage specific features,
which in general require much more sophisticated tools than the one we
have seen
 E.g., face/fingerprints recognition
 Nonetheless, what is important to stress is that the
problem of how to search in large image DB’s
remains (almost) the same! 
Multimedia Systems

Docu87490 - Data Domain DD3300 Field Replacement and Upgrade Guide PDF
0% (2)
Docu87490 - Data Domain DD3300 Field Replacement and Upgrade Guide PDF
204 pages
Med Plus
No ratings yet
Med Plus
20 pages
Chapter 6
100% (1)
Chapter 6
40 pages
Role of Digital Marketing and Impact On Youth Buying Behaviour
No ratings yet
Role of Digital Marketing and Impact On Youth Buying Behaviour
14 pages
Unit 1 - PPT
No ratings yet
Unit 1 - PPT
67 pages
CBIR BasicLec15
No ratings yet
CBIR BasicLec15
95 pages
Fast Multiresolution Image Querying: CS474/674 - Prof. Bebis
0% (1)
Fast Multiresolution Image Querying: CS474/674 - Prof. Bebis
39 pages
Seminar Report
50% (2)
Seminar Report
14 pages
Chapter 2
No ratings yet
Chapter 2
55 pages
Ccs 352 Multimedia and Animation Question Bank Unitwise
No ratings yet
Ccs 352 Multimedia and Animation Question Bank Unitwise
14 pages
Khabouris Acts
No ratings yet
Khabouris Acts
84 pages
EDHRM - HR Metrics 2023 Course Outline - Revised
No ratings yet
EDHRM - HR Metrics 2023 Course Outline - Revised
4 pages
Image Search Engines
100% (1)
Image Search Engines
54 pages
GEEL1 - 02 Web and The Internet
No ratings yet
GEEL1 - 02 Web and The Internet
54 pages
Micro CC 20 Plus Communication Protocol
No ratings yet
Micro CC 20 Plus Communication Protocol
9 pages
SAC Assessment v2 2025 05 07
No ratings yet
SAC Assessment v2 2025 05 07
3 pages
Mining Multimedia Data Bases
No ratings yet
Mining Multimedia Data Bases
17 pages
Unit 1 - Merged - Lecture Slides
No ratings yet
Unit 1 - Merged - Lecture Slides
84 pages
Salut D'amour: E.Elgar
No ratings yet
Salut D'amour: E.Elgar
9 pages
CV Lecture 11
No ratings yet
CV Lecture 11
147 pages
Top 10 SSD Hard Disk in 2019
No ratings yet
Top 10 SSD Hard Disk in 2019
7 pages
Visual Image Search: Feature Signatures Or/and Global Descriptors
No ratings yet
Visual Image Search: Feature Signatures Or/and Global Descriptors
15 pages
Bell ADT D-Series General Info
100% (1)
Bell ADT D-Series General Info
32 pages
Unit 1
No ratings yet
Unit 1
8 pages
Unit 5
No ratings yet
Unit 5
32 pages
L3v3-Image Enhancement and Preprocessing
No ratings yet
L3v3-Image Enhancement and Preprocessing
75 pages
Goodall Anne M 2021 Thesis
No ratings yet
Goodall Anne M 2021 Thesis
204 pages
New Cbir System
No ratings yet
New Cbir System
10 pages
Business Studies Notes PDF Class 12 Chapter 13
No ratings yet
Business Studies Notes PDF Class 12 Chapter 13
6 pages
Chapter 1
No ratings yet
Chapter 1
58 pages
EXERSISES
No ratings yet
EXERSISES
21 pages
6 Multimediaa-Abdullah
No ratings yet
6 Multimediaa-Abdullah
32 pages
Content-Based Image Retrieval (CBIR) : Match
No ratings yet
Content-Based Image Retrieval (CBIR) : Match
71 pages
CN Module 1 Prelim
No ratings yet
CN Module 1 Prelim
42 pages
Bharat B Hwan
50% (2)
Bharat B Hwan
16 pages
Manual Quick
No ratings yet
Manual Quick
181 pages
Homophones 3
No ratings yet
Homophones 3
2 pages
Multimedia Systems: Multimedia Databases - Image Processing Basics
No ratings yet
Multimedia Systems: Multimedia Databases - Image Processing Basics
58 pages
Chit Kara 2001
No ratings yet
Chit Kara 2001
54 pages
A Parents Guide To Grammar 1
No ratings yet
A Parents Guide To Grammar 1
8 pages
Gemini Algorithm
No ratings yet
Gemini Algorithm
28 pages
Ppa CV
No ratings yet
Ppa CV
3 pages
Multi-Feature and Genetic Algorithm
No ratings yet
Multi-Feature and Genetic Algorithm
41 pages
The Course: Image Representation Image Statistics Histograms Entropy Filters Books
No ratings yet
The Course: Image Representation Image Statistics Histograms Entropy Filters Books
77 pages
A Business Letter Is A Letter Written in Formal Language
100% (1)
A Business Letter Is A Letter Written in Formal Language
5 pages
Content-Based Image Retrieval (CBIR) : Searching A Large Database For Images That A Query
No ratings yet
Content-Based Image Retrieval (CBIR) : Searching A Large Database For Images That A Query
55 pages
Design Theories: - Kruti Desai
No ratings yet
Design Theories: - Kruti Desai
22 pages
The MPEG-7: Visual Standard For Content Description
No ratings yet
The MPEG-7: Visual Standard For Content Description
57 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
The MPEG-7 Multimedia Content Description Interface
No ratings yet
The MPEG-7 Multimedia Content Description Interface
73 pages
Image Retrieval: Current Techniques, Promising Directions, and Open Issues
No ratings yet
Image Retrieval: Current Techniques, Promising Directions, and Open Issues
41 pages
PART ONE: Reading: Plagiarism
No ratings yet
PART ONE: Reading: Plagiarism
2 pages
Opensap: Big Data With Sap Hana Vora: Course Week 03 - Exercises
No ratings yet
Opensap: Big Data With Sap Hana Vora: Course Week 03 - Exercises
18 pages
Paper 8
No ratings yet
Paper 8
57 pages
PROJECT Presentation Medical Multimodal Image Retrieval
No ratings yet
PROJECT Presentation Medical Multimodal Image Retrieval
57 pages
Evaluation of Similarity Measurement For Image Retrieval
No ratings yet
Evaluation of Similarity Measurement For Image Retrieval
4 pages
Uniform Plane Wave Solution To The Wave Equation
No ratings yet
Uniform Plane Wave Solution To The Wave Equation
5 pages
Moditroduction Multimedia Database
No ratings yet
Moditroduction Multimedia Database
39 pages
AA278A Lecture Notes 8. Optimal Control and Dynamic Games: Claire J. Tomlin May 11, 2005
No ratings yet
AA278A Lecture Notes 8. Optimal Control and Dynamic Games: Claire J. Tomlin May 11, 2005
12 pages
Haier India Survey (Pune, Maharashtra Region)
100% (4)
Haier India Survey (Pune, Maharashtra Region)
10 pages
Lookup Functions Practice Sheet
No ratings yet
Lookup Functions Practice Sheet
11 pages
Assignment No.: 5: Aim: Theory
No ratings yet
Assignment No.: 5: Aim: Theory
3 pages
Content-Based Retrieval in Digital Libraries
No ratings yet
Content-Based Retrieval in Digital Libraries
33 pages
Module 1 - Introduction To Multimedia Databases
No ratings yet
Module 1 - Introduction To Multimedia Databases
39 pages
Content-Based Image Retrieval - Some Basics
No ratings yet
Content-Based Image Retrieval - Some Basics
8 pages
Approaches On The Implementation of The Multimedia Databases
No ratings yet
Approaches On The Implementation of The Multimedia Databases
10 pages
Walmart Lighter Fluid
No ratings yet
Walmart Lighter Fluid
9 pages
Domain Specific Cbir For Highly Textured Images
No ratings yet
Domain Specific Cbir For Highly Textured Images
7 pages
Region Based Image Retrieval Using MPEG-7 Visual Descriptor
No ratings yet
Region Based Image Retrieval Using MPEG-7 Visual Descriptor
5 pages
MPEG7 Algorithms Implemented For A Multimedia Management System
No ratings yet
MPEG7 Algorithms Implemented For A Multimedia Management System
6 pages
Sample Paper PDF
No ratings yet
Sample Paper PDF
3 pages
A FFT Based Technique For Image Signature Generation: Augusto Celentano and Vincenzo Di Lecce
No ratings yet
A FFT Based Technique For Image Signature Generation: Augusto Celentano and Vincenzo Di Lecce
10 pages
Color Histogram For Content-Based Image Retrieval
No ratings yet
Color Histogram For Content-Based Image Retrieval
7 pages
Multimedia Retrieval CH 5 Image Processing
No ratings yet
Multimedia Retrieval CH 5 Image Processing
27 pages
Mid Semester Project Seminar On: Content Based Image Retrieval
No ratings yet
Mid Semester Project Seminar On: Content Based Image Retrieval
21 pages
An Integrated Approach For Image Retrieval Based On Content
No ratings yet
An Integrated Approach For Image Retrieval Based On Content
6 pages
Prototyping Color-Based Image Retrieval With MATLAB: Petteri Kerminen, Moncef Gabbouj
No ratings yet
Prototyping Color-Based Image Retrieval With MATLAB: Petteri Kerminen, Moncef Gabbouj
5 pages
Content-Based Image Retrieval: Michele - Saad@mail - Utexas.edu
No ratings yet
Content-Based Image Retrieval: Michele - Saad@mail - Utexas.edu
12 pages
Color Image Techniques For Image Retrieval in Large Image Set of Database
No ratings yet
Color Image Techniques For Image Retrieval in Large Image Set of Database
10 pages
SUCCESeR Simple and Useful Multi Color Concepts For Effective Search and Retrieval
No ratings yet
SUCCESeR Simple and Useful Multi Color Concepts For Effective Search and Retrieval
7 pages
Feature Extraction Approach For Content Based Image Retrieval
No ratings yet
Feature Extraction Approach For Content Based Image Retrieval
5 pages
Image Retrieval Using Both Color and Texture Features
No ratings yet
Image Retrieval Using Both Color and Texture Features
6 pages
Content Based Image Retrieval Using Color, Multi Dimensional Texture and Edge Orientation
No ratings yet
Content Based Image Retrieval Using Color, Multi Dimensional Texture and Edge Orientation
6 pages
Content-Based Image Retrieval
No ratings yet
Content-Based Image Retrieval
4 pages
CBIR Report
No ratings yet
CBIR Report
10 pages
A Universal Model For Content-Based Image Retrieval: S. Nandagopalan, Dr. B. S. Adiga, and N. Deepak
No ratings yet
A Universal Model For Content-Based Image Retrieval: S. Nandagopalan, Dr. B. S. Adiga, and N. Deepak
4 pages
Master Hunter CX
No ratings yet
Master Hunter CX
13 pages
Multi CD800a Mje061 User
No ratings yet
Multi CD800a Mje061 User
1 page
Data Mining-Multimedia Datamining
No ratings yet
Data Mining-Multimedia Datamining
8 pages

Ch-7-Multimedia Data Retrieval and Management

Uploaded by

Ch-7-Multimedia Data Retrieval and Management

Uploaded by

Chapter 7

Multimedia Data Retrieval and

 Description models for MM data retrieval

…with the final goal to be able to retrieve from

Look for cheetahs? This is fine; but, how to find it?

Look for cheetahs? Oops! Not really a cheetah ;-)

The objects level

 ! All above aspects strongly influence the

 In practice, a common approach to represent color is to make

 The problem is that Lp-norms

Weighted Euclidean distance

Weighted Euclidean distance

 Using L2, the distance between two different images is always 2

 Now we have LA(h1,h2) = 0.4, whereas LA(h1,h3) = LA(h2,h3) = 2

 It is usually computed just considering the gray-scale

 L1 or L2 distance functions are usually applied

Scale: 3 at 72° Scale: 4 at 108° Scale: 5 at 144°

where K is the total number of orientations

 A 2nd step is to represent the resulting

 Intuitively, we would need a distance measure that is able to “match” a

 For problem 1. we need to define q

 From Parseval theorem we know that DFT preserves the energy of

thus, DFT preserves the Euclidean distance

Time series (4 secs, 512 points) Energy spectrum

Fixed Time Axis “Warped” Time Axis

 But the number of path is exponential in D 

wpi,j  di,j  min{wp i-1, j, wpi,j-1, wpi-1, j-1}

You might also like