Graph Spectral Image Processing Gene Cheunginstant Download
Graph Spectral Image Processing Gene Cheunginstant Download
download
https://ebookmass.com/product/graph-spectral-image-processing-
gene-cheung/
https://ebookmass.com/product/digital-image-processing-using-matlab-
rafael-c-gonzalez/
https://ebookmass.com/product/feature-extraction-and-image-processing-
for-computer-vision-fourth-edition-aguado/
https://ebookmass.com/product/methods-and-techniques-for-fire-
detection-signal-image-and-video-processing-perspectives-1st-edition-
cetin/
https://ebookmass.com/product/digital-image-processing-3rd-edition-
rafael-c-gonzalez-and-richard-e-woods/
Design for Embedded Image Processing on FPGAs, 2nd Edition
Donald G. Bailey
https://ebookmass.com/product/design-for-embedded-image-processing-on-
fpgas-2nd-edition-donald-g-bailey/
https://ebookmass.com/product/machine-learning-algorithms-for-signal-
and-image-processing-suman-lata-tripathi/
https://ebookmass.com/product/image-processing-for-automated-
diagnosis-of-cardiac-diseases-rajeev-kumar-chauhan-editor/
https://ebookmass.com/product/machine-intelligence-big-data-analytics-
and-iot-in-image-processing-ashok-kumar/
https://ebookmass.com/product/machine-vision-inspection-systems-
vol-1-image-processing-concepts-malarvel-m-ed/
Graph Spectral Image Processing
SCIENCES
Image, Field Director – Laure Blanc-Feraud
Compression, Coding and Protection of Images and Videos,
Subject Head – Christine Guillemot
Coordinated by
Gene Cheung
Enrico Magli
First published 2021 in Great Britain and the United States by ISTE Ltd and John Wiley & Sons, Inc.
Apart from any fair dealing for the purposes of research or private study, or criticism or review, as
permitted under the Copyright, Designs and Patents Act 1988, this publication may only be reproduced,
stored or transmitted, in any form or by any means, with the prior permission in writing of the publishers,
or in the case of reprographic reproduction in accordance with the terms and licenses issued by the
CLA. Enquiries concerning reproduction outside these terms should be sent to the publishers at the
undermentioned address:
www.iste.co.uk www.wiley.com
ERC code:
PE7 Systems and Communication Engineering
PE7_7 Signal processing
Contents
1.7. Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
1.8. References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 301
Introduction to Graph Spectral
Image Processing
I.1. Introduction
2) Graph Signal Processing: In the last eight years, we have also witnessed the
birth of a new signal processing topic – called graph signal processing (GSP) – that
generalizes traditional mathematical tools like transforms and wavelets, to process
signals residing on irregular data kernels described by graphs (Shuman et al. 2013).
Central to GSP is the notion of graph frequencies: orthogonal components, computed
from a graph variation operator like the graph Laplacian matrix, that generalize the
notion of Fourier modes to the graph domain, spanning a graph signal space. Because
of its inherent powerful generality, one can easily adopt or design GSP tools for
different imaging applications, where a node in a graph represents a pixel, and the
graph connectivity is chosen to reflect inter-pixel similarities or correlations. For an
example of the GSP tool being used for image restoration, see Figure I.1 for an
illustration of a graph spectral method called left eigenvectors of the random walk
graph Laplacian (LeRAG) for JPEG image dequantization (Liu et al. 2017). GSP tools
can also easily adapt to the aforementioned modern imaging modalities, such as light
field images and 3D point clouds, that do not reside on regular 2D grids.
3) Deep Neural Networks: Without a doubt, the singular seismic paradigm shift
in data science in the last decade is deep learning. Using layers of convolutional
filters, pointwise nonlinearities and pooling functions, deep neural network (DNN)
architectures like convolutional neural networks (CNN) have demonstrated superior
performance in a wide range of imaging tasks from denoising to classification, when
a large volume of labeled data is available for training (Vemulapalli et al. 2016;
Zhang et al. 2017). When labeled training data is scarce, or when the underlying
data kernel is irregular (thus complicating the training of convolutional filters and the
selection of pooling operators), how to best design and construct DNN for a targeted
image application is a challenging problem. Moreover, a CNN purely trained from
labeled data often remains a “black box”, i.e. the learned operators like filtering remain
unexplainable.
Introduction xiii
Motivated by these technological trends, we have focused this book on the theory
and applications of GSP tools for image processing, covering conventional images
and videos, new modalities like light fields and 3D point clouds, and hybrid
GSP/deep learning approaches. Different from other graph-based image processing
books (Lezoray and Grady 2012), we concentrate on spectral processing techniques
with frequency interpretations such as graph Fourier transforms (GFT) and graph
wavelets, drawing inspiration from the long history of frequency analysis tools in
traditional signal processing. Graph frequency analysis enables the definition of
familiar signal processing notions, such as graph Fourier modes, bandlimitedness,
and signal smoothness, using graph spectral methods that can be designed.
2) The second part of the book reviews different imaging applications of GSP.
Chapters 4 and 5, titled “Graph Specral Image and Video Compression” by H.E.
Egilmez, Y.-H. Chao and A. Ortega and “Graph Spectral 3D Image Compression”
by T. Maugey, M. Rizkallah, N. M. Bidgoli, A. Roumy and C. Guillemot, focus on the
design and applications of GSP tools for the compression of traditional images/videos
and 3D images, respectively. Chapter 6, titled “Graph Spectral Image Restoration” by
J. Pang and J. Zeng, focuses on the general recovery of corrupted images, e.g. image
denoising and deblurring. As a new imaging modality, Chapter 7, titled “Graph
Spectral Point Cloud Processing” by W. Hu, S. Chen and D. Tian, focuses on the
processing of 3D point clouds for applications, such as low-level restoration and
high-level unsupervised feature learning. Chapters 8 and 9, titled “Graph Spectral
Image Segmentation” by M. Ng and “Graph Spectral Image Classification” by M. Ye,
V. Stankovic, L. Stankovic and G. Cheung, narrow the discussion specifically to
segmentation and classification, respectively, two popular research topics in the
computer vision community. Finally, Chapter 10, titled “Graph Neural Networks for
Image Processing” by G. Fracastoro and D. Valsesia, reviews the growing efforts to
employ recent GNN architectures for conventional imaging tasks such as denoising.
Before we jump into the various chapters, we begin with the basic definitions in
GSP that will be used throughout the book. Specifically, we formally define a graph,
graph spectrum, variation operators and graph signal smoothness priors in the
following sections.
xiv Graph Spectral Image Processing
There are many ways to compute appropriate edge weights. Especially common
for images, edge weight wi,j can be computed using a Gaussian kernel, as done in the
bilateral filter (Tomasi and Manduchi 1998):
li − lj 22 xi − xj 22
wi,j = exp − exp − [I.1]
σl2 σx2
L = UΛU [I.2]
where Λ is a diagonal matrix containing real eigenvalues λk along the diagonal, and
U is an eigen-matrix composed of orthogonal eigenvectors ui as columns. If all edge
1 If a graph node represents a pixel in an image, each pixel would typically have three color
components: red, green and blue. For simplicity, one can treat each color component separately
as a different graph signal.
Introduction xv
weights wi,j are restricted to be positive, then graph Laplacian L can be proven to be
positive semi-definite (PSD) (Chung 1997)2, meaning that λk ≥ 0, ∀k and
x Lx ≥ 0, ∀x. Non-negative eigenvalues λk can be interpreted as graph
frequencies, and eigenvectors U can be interpreted as corresponding graph Fourier
modes. Together they define the graph spectrum for graph G.
The set of eigenvectors U for L collectively form the GFT (Shuman et al. 2013),
which can be used to decompose a graph signal x into its frequency components via
α = U x. In fact, one can interpret GFT as a generalization of known discrete
transforms like the Discrete Cosine Transform (DCT) (see Shuman et al. 2013 for
details).
Note that if the multiplicity mk of an eigenvalue λk is larger than 1, then the
set of eigenvectors that span the corresponding eigen-subspace of dimension mk is
non-unique. In this case, it is necessary to specify the graph spectrum as the collection
of eigenvectors U themselves.
If we also consider negative edge weights wi,j that reflect inter-pixel
dissimilarity/anti-correlation, then graph Laplacian L can be indefinite. We will
discuss a few recent works (Su et al. 2017; Cheung et al. 2018) that employ negative
edges in later chapters.
2 One can prove that a graph G with positive edge weights has PSD graph Laplacian L via the
Gershgorin circle theorem: each Gershgorin disc corresponding to a row in L is located in the
non-negative half-space, and since all eigenvalues reside inside the union of all discs, they are
non-negative.
xvi Graph Spectral Image Processing
where λmax is the eigenvalue of W with the largest magnitude (also called the
spectral radius), and p is a chosen integer. As a variant to equation [I.4], a quadratic
smoothness prior is defined in Romano et al. (2017), using a row-stochastic version
Wn = D−1 W of the adjacency matrix W:
1
S2 (x) = x − Wn x22 [I.5]
2
To avoid confusion, we will call equation [I.5] the graph shift variation (GSV)
prior. GSV is easier to use in practice than GTV, since the computation of λmax is
required for GTV. Note that GSV, as defined in equation [I.5], can also be used for
signals on directed graphs.
Introduction xvii
I.6. References
Biyikoglu, T., Leydold, J., Stadler, P.F. (2005). Nodal domain theorems and bipartite subgraphs.
Electronic Journal of Linear Algebra, 13, 344–351.
Chen, S., Sandryhaila, A., Moura, J., Kovacevic, J. (2015). Signal recovery on graphs: Variation
minimization. IEEE Transactions on Signal Processing, 63(17), 4609–4624.
Cheung, G., Su, W.-T., Mao, Y., Lin, C.-W. (2018). Robust semisupervised graph classifier
learning with negative edge weights. IEEE Transactions on Signal and Information
Processing over Networks, 4(4), 712–726.
Chung, F. (1997). Spectral graph theory. CBMS Regional Conference Series in Mathematics,
92.
Dörfler, F. and Bullo, F. (2013). Kron reduction of graphs with applications to electrical
networks. IEEE Transactions on Circuits and Systems I: Regular Papers, 60(1), 150–163.
Lezoray, O. and Grady, L. (2012). Image Processing and Analysis with Graphs: Theory and
Practice, CRC Press, Boca Raton, Florida.
Liu, X., Cheung, G., Wu, X., Zhao, D. (2017). Random walk graph Laplacian based smoothness
prior for soft decoding of JPEG images. IEEE Transactions on Image Processing, 26(2),
509–524.
Milanfar, P. (2013a). Symmetrizing smoothing filters. SIAM Journal on Imaging Sciences,
6(1), 263–284.
Milanfar, P. (2013b). A tour of modern image filtering. IEEE Signal Processing Magazine,
30(1), 106–128.
Pang, J. and Cheung, G. (2017). Graph Laplacian regularization for image denoising: Analysis
in the continuous domain. IEEE Transactions on Image Processing, 26(4), 1770–1785.
Romano, Y., Elad, M., Milanfar, P. (2017). The little engine that could: Regularization by
denoising (RED). SIAM Journal on Imaging Sciences, 10(4), 1804–1844.
Shuman, D.I., Narang, S.K., Frossard, P., Ortega, A., Vandergheynst, P. (2013), The emerging
field of signal processing on graphs: Extending high-dimensional data analysis to networks
and other irregular domains. IEEE Signal Processing Magazine, 30(3), 83–98.
Su, W.-T., Cheung, G., Lin, C.-W. (2017). Graph Fourier transform with negative edges for
depth image coding. IEEE International Conference on Image Processing, Beijing.
Tomasi, C. and Manduchi, R. (1998), Bilateral filtering for gray and color images. IEEE
International Conference on Computer Vision, 839–846.
Vemulapalli, R., Tuzel, O., Liu, M.-Y. (2016). Deep Gaussian conditional random field
network: A model-based deep network for discriminative denoising. Proceedings of the
IEEE Conference on Computer Vision and Pattern Recognition, 4801–4809.
Zhang, K., Zuo, W., Chen, Y., Meng, D., Zhang, L. (2017). Beyond a Gaussian denoiser:
Residual learning of deep CNN for image denoising. IEEE Transactions on Image
Processing, 26(7), 3142–3155.
PART 1
Fundamentals of Graph
Signal Processing
Yuichi TANAKA
Tokyo University of Agriculture and Technology, Japan
1.1. Introduction
The filtering of time- and spatial-domain signals is one of the fundamental
techniques for image processing and has been studied extensively to date. GSP can
treat signals with irregular structures that are mathematically represented as graphs.
Theories and methodologies for the filtering of graph signals are studied using
spectral graph theory. In image processing, graphs are strong tools for representing
structures formed by pixels, like edges and textures.
The filtering of graph signals is not only an extension of that for standard time- and
spatial-domain signals, but it also has its own interesting properties. For example, GSP
can represent traditional pixel-dependent image filtering methods as graph spectral
domain filters. Furthermore, theory and design methods for wavelets and filter banks,
which are studied extensively in signal and image processing, are also updated to treat
graph signals.
In this chapter, the spectral-domain filtering of graph signals is introduced. In
section 1.2, the filtering of time-domain signals is briefly described as a starting
point. The filtering of graph signals, both in the vertex and spectral domains, is
detailed in section 1.3, in addition to its relationship with classical filtering.
Edge-preserving image smoothing is represented as a graph filter in section 1.4.
Furthermore, a framework of filtering by multiple graph filters, i.e. graph wavelets
and filter banks, is presented in section 1.5. Eventually, section 1.6 introduces several
fast computation methods of graph filtering. Finally, the concluding remarks of this
chapter are discussed in section 1.7.
This equation is based on the shift of the signal or impulse response. In LTI
systems, we (implicitly) assume that the shift of a discrete-time signal is well
defined, i.e. xn−k is unique and time invariant. Therefore, equation [1.1] is
equivalently represented as
⎡ ⎤
..
⎢ . ⎥
⎢. . . h−1 h0 h1 . . . ⎥
⎢
y=⎢ ⎥ x, [1.2]
⎣ . . . h−1 h0 h1 . . .⎥⎦
..
.
In equation [1.2], the impulse response hk is invariant for n, i.e. the same filter is
used for different values of n. Instead, we can use different filters for different values
of n to yield yn , whose impulse response hk [n] is often defined in a signal-dependent
manner, i.e. hk [n] = hk [m] for m = n. It is formulated as
∞
yn := hn−k [n]xk [1.3]
k=−∞
1 Here, we assume both x and y are finite length signals and their boundaries are extended or
filtered by a boundary filter to ensure that the equation is valid.
Graph Spectral Filtering 5
Famous image processing filters in this category include the bilateral filter
(Tomasi and Manduchi 1998; Barash 2002; Durand and Dorsey 2002; Fleishman
et al. 2003), anisotropic diffusion (Weickert 1998; Desbrun et al. 1999), adaptive
directional wavelets (Chang and Girod 2007; Ding et al. 2007; Tanaka et al. 2010)
and their variants.
It is well known that convolution in the time domain equation [1.1] has an
equivalent expression in the frequency (i.e. Fourier) domain as follows:
where
∞
x̂(ω) := xn e−jωn . [1.6]
n=−∞
In this chapter, we consider linear graph filters. Readers can find nonlinear graph
filters, like one used in deep learning, in the following chapters, specifically
Chapter 10.
Let us denote a graph filter as H ∈ RN ×N , where its elements are typically derived
from G and x. As in the LTI system, the filtered signal is represented as
y = Hx. [1.7]
Visit https://ebookmass.com today to explore
a vast collection of ebooks across various
genres, available in popular formats like
PDF, EPUB, and MOBI, fully compatible with
all devices. Enjoy a seamless reading
experience and effortlessly download high-
quality materials in just a few simple steps.
Plus, don’t miss out on exciting offers that
let you access a wealth of knowledge at the
best prices!
Exploring the Variety of Random
Documents with Different Content
back
back
back
back
back
back
back
back
back
back
back
back
back
back
Welcome to our website – the perfect destination for book lovers and
knowledge seekers. We believe that every book holds a new world,
offering opportunities for learning, discovery, and personal growth.
That’s why we are dedicated to bringing you a diverse collection of
books, ranging from classic literature and specialized publications to
self-development guides and children's books.
ebookmasss.com