ERD: Exponential Retinex decomposition based on weak space and hybrid nonconvex regularization and its denoising application
Abstract
The Retinex theory models the image as a product of illumination and reflection components, which has received extensive attention and is widely used in image enhancement, segmentation and color restoration. However, it has been rarely used in additive noise removal due to the inclusion of both multiplication and addition operations in the Retinex noisy image modeling. In this paper, we propose an exponential Retinex decomposition model based on hybrid non-convex regularization and weak space oscillation-modeling for image denoising. The proposed model utilizes non-convex first-order total variation (TV) and non-convex second-order TV to regularize the reflection component and the illumination component, respectively, and employs weak norm to measure the residual component. By utilizing different regularizers, the proposed model effectively decomposes the image into reflection, illumination, and noise components. An alternating direction multipliers method (ADMM) combined with the Majorize-Minimization (MM) algorithm is developed to solve the proposed model. Furthermore, we provide a detailed proof of the convergence property of the algorithm. Numerical experiments validate both the proposed model and algorithm. Compared with several state-of-the-art denoising models, the proposed model exhibits superior performance in terms of peak signal-to-noise ratio (PSNR) and mean structural similarity (MSSIM).
keywords:
image denoising, Retinex decomposition, weak space, nonconvex, regularization1 Introduction
Image decomposition aims to decompose a given image into several unrelated components with intrinsic features, and has been widely used in image denoising, image enhancement and pattern recognition. Various decomposition techniques have been proposed, among which the variational image model based on regularization and functional minimization is an effective method for solving this ill-posed inverse problems. This mathematical model can be expressed as follows [1, 2, 3]:
where is the input image, and are the regularization terms, which are used to promote the prior information in image components and , respectively. By solving the problem (1.1), we can obtain an effective decomposition of the image into its desired components. This variational approach has shown promising results in various applications, such as image denoising, which decomposes the image into clear image and noise component [4, 5, 6]; image structure-texture decomposition, which decomposes the image into cartoon component and oscillation component [7, 8, 9]; Retinex decomposition decomposes the image into illumination component and reflection component [10, 11, 12].
The Retinex theory, originally proposed by Land and McCann in 1971 [13], has a valuable property [14, 15]: the reflection component represents the inherent color information of the object itself, which may contain high contrast, while the illumination component represents the influence of external light sources in the environment, which is usually a smooth field. Based on this property, Kimmel et al. [16] utilized the norm to impose constraints on reflection and illumination in bounded variation (BV) space and space, respectively, and proposed an image decomposition model based on variational Retinex. In order to more effectively separate images with rich illumination, Ng and Wang [17] proposed a new Retinex image decomposition model (also called TVH1), which simultaneously minimizes illumination and reflection by combining more prior constraints. And then, Liang and Zhang [18] modified some regularization constraints in TVH1 model and proposed a high-order total variation decomposition model that uses a second-order total variation regularizer to characterize the illumination for the first time, and this model can flexibly select the constraint domain of the components according to different application scenarios. Recently, Xu et al. designed an image structure and texture estimator in [19] and proposed a new structure and texture aware Retinex (STAR) model for image decomposition.
However, the aforementioned Retinex models may not be well-suited for noisy images, as the decomposition quality tends to decrease significantly with increasing noise levels. To address this issue and improve the decomposition quality of noisy images, Liu et al. [20] proposed an exponential Retinex total variation (ETV) model, which has demonstrated satisfactory results in both Retinex decomposition and denoising applications. Building upon the ETV model, Wang et al. [21] introduced a nonconvex exponential total variation (NETV) model that utilizes a nonconvex norm to measure both the illumination and reflection components.
Traditionally, the norm is commonly employed as the fidelity term to quantify oscillation information. However, Meyer pointed out in [22] that the norm may not be the best choice for measuring oscillation information since oscillation functions tend to have large norms. Consequently, it becomes challenging to effectively separate the oscillation component from high-frequency components [23, 24, 25]. With this fact, Meyer proposed three weak function spaces, , , and , to accurately measure oscillation functions. Subsequently, numerous researchers delved into these function spaces and developed optimization algorithms for computing norms defined within these spaces. For example, Osher et al. [26] leveraged the dual space of as an approximation for the space and presented an efficient approach to simplify the norm. This advancement significantly improved algorithmic accuracy when working with these function spaces.
In this paper, we propose an exponential Retinex image decomposition model for image denoising, based on the Retinex decomposition of noisy images. This new model combines oscillatory component weaker-norm modeling with the generalized nonconvex hybrid regularization technique. The proposed decomposition model separates the noise image into three components: oscillation, reflection, and illumination. The oscillation information is measured using the weak norm, while the reflection component is measured using the generalized non-convex first-order TV regularizer, which captures piecewise constant features. The illumination component, on the other hand, is measured using the generalized non-convex second-order TV regularizer, which captures spatial smoothness. By incorporating these regularizers, our model effectively separates the oscillating noise component from the denoised image. The denoised image is then reconstructed using the reflection and illumination components. To solve this proposed model, we employ the ADMM algorithm [27, 28]. Specifically, we utilize the Majorization-Minimization (MM) algorithm [29, 30] to solve two generalized nonconvex subproblems. Furthermore, under reasonable assumptions, we rigorously prove the convergence of our proposed algorithm. In summary, the main contributions of this paper are as follows:
-
An exponential Retinex image decomposition model based on weak space and hybrid nonconvex regularization is proposed, which effectively decomposes the noisy image into three components: oscillation, reflection, and illumination.
-
An ADMM combined with the MM algorithm is introduced to solve the proposed model, and the sufficient conditions for the convergence of the proposed algorithm are provided.
-
We apply the proposed decomposition model to image denoising, where the restored image is reconstructed by reflection and illumination components. The effectiveness and superiority of the proposed model and algorithm are validated through numerical experimental results.
The rest of this paper is organized as follows. In Section 2, we introduce the Retinex theory and show the MM algorithm for solving non-convex minimization problems in detail. In Section 3, we propose a image exponential Retinex decomposition model based on weak space and hybrid nonconvex regularization, and introduce a numerical algorithm for the proposed model. In addition, the convergence conditions of the proposed algorithm are provided in this section. The numerical experimental results showing the performance of the proposed model are given in Section 4. Finally, we conclude this paper in Section 5.
2 Background knowledge
In this section, we provide a concise overview of the Retinex theory and the MM algorithm, which are relevant to our current research.
2.1 Retinex theory
Retinex theory was proposed by Land and McCann [13] in 1971, which is based on color perception modeling of human visual system. The Retinex theory holds that the color of the object itself is independent of the illumination distribution and intensity on the surface of the object, but is related to the reflection ability of different wavelengths of light. Based on this, the imaging process of an observed image can be represented by the following mathematical model,
where denotes entrywise multiplication, , and represent the perceived image, illumination and reflection, respectively. In general, we assume that is a piecewise constant region and is a spatial smoothing function.
In order to effectively separate and from the above ill-posed problem, a commonly used approach is to eliminate the coupling relationship between and in problem (2.1) by logarithmic transformation.
where , and .
Most Retinex variational models are based on Eq.(2.2). The logarithmic operation converts the multiplicative model into an ordinary additive model, effectively reducing the complexity of the algorithm and saving time costs. Additionally, the additive model facilitates discussions on algorithm convergence under certain conditions.
2.2 MM algorithm for solving a nonconvex problem
When facing a challenging optimization problem that is difficult to solve, it is often necessary to find an appropriate alternative function. Subsequently, the MM algorithm [29, 30] is utilized to convert the original optimization problem into an alternative function optimization problem.
To provide a clearer illustration of the MM algorithmic process, let’s consider a general nonconvex optimization problem [31, 32] in image processing as
where is an image domain, is a data fidelity term coefficient, and is a concave potential function which usually has three choices as follows, (a) , (b) , and (c) .
In order to find an alternative function that meets the constraints of the MM algorithm, the following inequality is obtained by the properties of the concave function,
Since the non-differentiability of the indicator function at zero, we define the following formula to replace the differential in the numerical algorithm, i.e.,
where , and is a very small positive value, which is set at in this paper. And then, we mark
and can easily prove that the following two conditional relations hold, i.e., (i), (ii), . Obviously, is the substitution function that we have found. According to the MM algorithm, we let , and the original minimization problem can be approximately replaced by
where is a constant that does not depend on , which can be omitted in the minimization problem. The minimization problem (2.6) is a classical smooth problem, and we can readily obtain its analytic solution by the necessary condition of functional extremum,
Similarly, the differential operator is defined as
So far, we use the MM algorithm to obtain the numerical solution of the original minimization problem (2.3), i.e., Eq.(2.7). In addition, the sequence generated by Eq.(2.7) clearly converges to the solution of the convex minimization problem (2.6). Since the original problem (2.3) is a nonconvex minimization problem, which may have multiple unequal solutions. Therefore, we will discuss the sequence converges to a extremal solution of the original minimization problem (2.3) in the following.
Lemma 2.1.
In [33], the author has summarized the convergence of MM algorithm for solving various optimization problems. Among them, in the case that the substitution function is constructed by the properties of the concave function, the solution sequence generated by the MM algorithm converges to a stationary point (also called stable point) of the original nonconvex minimization problem. Specifically, the sequence generated by Eq(2.7) converges to a stationary point of the minimization problem (2.3) as .
3 The proposed model
In this section, we first present an image exponential Retinex decomposition model based on weak space and hybrid nonconvex regularization. And then, this new model is numerically solved in the framework of ADMM combined with MM algorithm. Finally, we discuss the convergence conditions of proposed numerical algorithm.
3.1 The proposed model
An observed image can be considered as a combination of multiple disjoint components. We can accurately model these different components by utilizing the Retinex theory, which decomposes an image into illumination and reflection. Building upon this concept, we propose the following image degradation model:
where is a degraded image with noise, and represent the original clear image and gaussian additive noise, respectively.
In order to maintain the probability density of the Gaussian noise , we apply a logarithmic transformation only to the original image as shown in Eq.(2.2). Additionally, we perform an exponential transformation on the components and in the logarithmic domain. By doing so, the model (3.1) can be expressed as follows:
With the image modeling (3.2), we propose an exponential Retinex image decomposition model based on weak space and hybrid nonconvex regularization, which is defined as,
where , , and are positive regularization parameters, and is a nonconvex potential function. The term is modeled based on the dual space of the Hilbert space , which aims to depict the oscillatory component. The two nonconvex regularization terms and characterize the prior features of illumination and reflection , respectively. The last term is to ensure the stability of this model, where the parameter is a very small value.
The model (3.3) for image exponential Retinex decomposition aims to characterize different components individually. By utilizing a weaker -norm, it effectively extracts the oscillation component from the degraded image . The remaining components are further decomposed into a spatially smooth illumination and a piecewise constant reflection , which are measured by two generalized nonconvex regularizers. This decomposition model allows for better extraction of oscillation information from an image, resulting in improved denoising effects. Figure 1 illustrates the block diagram of the proposed technique.
Fig.1 The block diagram of the proposed technique
Theorem 3.1.
If , then is a convex functional.
Proof.
We need to first simplify the norm in order to prove the convexity of functional . Osher et al. have provided a popular method to simplify the norm in [26]. We assume with and can use a unique Hodge decomposition of g as
where is a scalar function and Q is a divergence-free vector field. And then, we can obtain that , i.e., . Finally, we ignore Q from the expression g and can obtain the following relationship as
We define to simplify the proof process. Therefore, we only need to prove that the functional satisfies the first-order condition of the convex functional, i.e.,
where are any two test functions. According to the fact that and , we can rewriter the formula (3.5) as
The inequality is clearly established, which implies the functional satisfies condition (3.5) for any . In summary, we prove that the functional is strictly convex for .
Remark. In fact, Tang et al. [34] used a concrete example to verify that the space can accurately model oscillation information. In this example, a one-dimensional oscillation function is used to validate this point. This example indicates that as . Through this example, we can know that a periodic function has a smaller norm in space. Therefore, it is appropriate to employ space to measure the oscillation components in the minimization model since their -norm is very small.
3.2 Numerical algorithm
In this subsection, a numerical algorithm combined with ADMM is proposed to solve the non-convex optimization problem. Next, we present the specific solution process of porposed model (3.3) in the following.
We introduce and constrain the auxiliary variables , , and in order to simplify the complex coupling relationship between variables and in the optimization problem (3.3). At this point, the augmented Lagrangian function of the original minimization problem can be rewritten as
where , and are the Lagrangian multipliers, and , are the parameters of penalty terms. It should be pointed out that can be omitted in the minimization process. Then, we set the initial value of the variable and use the alternating direction iterative algorithm to optimize the augmented Lagrangian function . In this way, we convert the solution of the original minimization problem into the optimization of several subproblems.
In fact, the difficulty of solving the subproblems is much lower compared to directly solving the original minimization model (3.3). Moreover, due to the non-convex or nonlinear nature of these subproblems, we can only resort to the MM algorithm to find numerical solutions instead of analytical solutions. In the following, we provide a detailed description of the solving process for all subproblems.
(1) -subproblem
Firstly, the -subproblem can be described as
It should be noted that is a nonlinear functional, and it is difficult to obtain the analytic solution of the variable directly by using the necessary condition of functional extremum. We perform a first-order linear Taylor expansion of the nonlinear term at point as [35]
where is the remainder of a Taylor expansion that can be approximated by a quadratic term. Therefore, we rewrite the minimization problem (3.8) to
where is a slight number, and the last term is to approximate the Taylor remainder. According to the Euler-Lagrange equation, we can easily find the necessary conditions for the solution of the above optimization problem (3.9) as
Furthermore, we can obtain the analytical solution of the -subproblem as
(2) -subproblem
According to Theorem 3.1, we transform the -norm into the ordinary -norm. The -subproblem is a convex smooth minimization problem, which is described as
where and are the gradient operator and the Laplace inverse operator, respectively. The solution of this minimization problem satisfies Euler-Lagrange equation, i.e.,
where operator is the conjugate operator of operator , and they also satisfy the relation . Both sides of Eq.(3.12) can be rewritten by the operator as
where is an identity matrix of the same size as . It should be noted that the operator in Eq.(3.13) is a convolution operation, so we cannot directly separate the variable from it. A general approach is to use Fourier transform to transform the convolution operation in the spatial domain into the ordinary multiplication in the frequency domain, thus obtaining an analytical solution of the -subproblem as
here and later, and are defined as Fourier transform and inverse Fourier transform operations, respectively.
(3) and -subproblem
Firstly, the -subproblem is described as
Obviously, this minimization problem is easy to be solved because it is convex and smooth. By using the variational method, we can directly obtain the necessary conditions for the solution of the minimization problem (3.15) as
where is a conjugate operator of . Similarly, Eq.(3.16) also contains two-dimensional convolution operation. Assuming that satisfies the periodic boundary condition, it is not difficult to obtain from the above linear equation by Fourier transform and its inverse transform as
Next, we use similar steps to solve the -subproblem
Similarly, according to the Euler-Lagrangian equation, the necessary condition for the solution of the minimization problem (3.18) is
Further, we solve the iterative form of by the Fourier domain, which is described as
(4) and -subproblem
Now, we solve and nonconvex subproblems. In fact, we only need to solve one of the minimization problems, because the forms of these two non-convex optimization problems are similar. Here we solve the -subproblem in detail, which is written as
There is not difficult to perceive that the minimization problem (3.21) is a classical nonconvex optimization problem. In Section 2.2, we introduce in detail the solution of such problems by MM algorithm, whcih is updated in the form of Eq.(2.7). Therefore, we directly obtain the solution of problem (3.21) as
Next, we solve the -subproblem similarly,
Likewise, we obtain the iterative form of variable in the framework of MM algorithm as
(5) Updating Lagrange multipliers
Finally, the Lagrange multipliers , and are updated according to the ADMM algorithm as follows
To tackle the challenges posed by the proposed model (3.3), we have successfully combined multiple numerical methods within the framework of ADMM. This integration allows us to achieve efficient and accurate solutions for image decomposition. In Algorithm 1, we present a step-by-step procedure that outlines our algorithm in detail.
Remark. Before starting the iteration process, several variables and parameters need to be initialized in Algorithm 1. These include , , , , , , , and . Additionally, parameters such as , , , , , and the penalty parameter must be set initially. Once these initializations are complete, the algorithm proceeds to iterate starting from . The iteration continues until the relative error of the variable is greater than a preset accuracy threshold denoted as (which is defined in Section 4). The algorithm terminates when the relative error falls below this threshold. It is worth noting that Algorithm 1 consists of only one loop. This indicates that its complexity is determined by the size of the input image () and the number of iterations () performed by the algorithm. Therefore, we can conclude that the complexity of Algorithm 1 is on the order of .
Algorithm 1. For solving the proposed model (2.3) |
Initialize: , , , , , , , and ; |
Set parameters: , , , , , and ; |
While relative-error , do |
Compute by Eq.(3.10); |
Compute by Eq.(3.14); |
Compute by Eq.(3.17); |
Compute by Eq.(3.20); |
Compute by Eq.(3.22); |
Compute by Eq.(3.24); |
Updata Lagrange multipliers , and by Eqs.(3.25); |
Set ; |
End while |
Output: , and . |
3.3 Convergence analysis
In this subsection, we will discuss the convergence of the proposed algorithm, which is solved using the ADMM framework. The algorithm involves solving two non-convex minimization subproblems, namely and , using the MM algorithm. It is important to note that while the convergence analysis of ADMM for convex optimization problems is well-established [27, 28], studying the convergence of non-convex optimization problems presents a significant challenge
Now, we need to make several crucial assumptions that are essential for proving the convergence of the algorithm.
Assumption 3.2.
The non-convex potential function is closed, proper and lower semi-continuity. In addition, its gradient function satisfies the Lipschitz continuity condition, i.e., there exists a positive constant such that
Assumption 3.3.
The penalty coefficients and are large enough to satisfy and . In this case, the equivalent extreme value problem (3.9) of subproblem, subproblem (3.21) and subproblem (3.23) are strictly convex.
Assumption 3.4.
The energy functional is bounded below, i.e., .
In the following work, we prove the convergence of Algorithm 1 with three panels in turn. Firstly, Theorem 3.5-3.8 prove that the augmented Lagrangian function decreases monotonically as . Secondly, Theorem 3.9 prove that the augmented Lagrangian function is bounded below and convergent. Finally, Theorems 3.10-3.11 prove that the sequence generated by Algorithm 1 converges to a limit point as , where is a stationary point of the augmented Lagrangian function .
Theorem 3.5.
Let the sequence generated by Algorithm 1, and we can infer that there must be three positive constants , , and such that
(3.26a) | |||||
(3.26b) | |||||
(3.26c) |
Proof.
We first prove that the augmented Lagrangian function is strictly convex with respect to . Obviously, the terms and in the function are quadratic smooth terms with respect to , so they are strictly convex with respect to . The seventh term in the function can be expanded into
and the second term in Eq(3.27)
is a linear term with respect to , so it is convex. In addition, we can infer that the term in Eq(3.27) is convex with respect to , which can be obtained by Eq(3.6).
By the above discussion, we conclude that the function is strictly convex with respect to . According to the definition of convex function, we can infer that there must be a positive constant such that
In the subproblem, since is a minimum point of the function , according to the necessary condition of the extreme point, we have
And combining (3.28) and (3.29), we conclude that
Similarly, we can also infer that there must be a positive constant such that
Finally, we prove that inequality (3.26c) holds. It is worth noting that we can not directly determine the convexity of the nonlinear term with respect to . In fact, we use the first-order linear Taylor expansion to solve the subproblem well. Therefore, according to the equivalent extreme value problem (3.9) of the subproblem (3.8), we only need to judge the convexity of the following formula with respect to ,
where is equivalent to a constant. Obviously, the first and third terms in the above formula are the quadratic term and the first term about respectively, so they are both convex with respect to . The parameter in the Taylor remainder cannot determine the sign, because we cannot determine the convexity of the nonlinear term with respect to . Fortunately, we can deduce that the expression is strictly convex with respect to , which is attributed to the fact that is satisfied in Assumption 3.3.
By the above discussion, we conclude that the function is strictly convex with respect to . Similarly, we can infer that there must be a positive constant such that
The desired result is obtained.
Theorem 3.6.
Let the sequence generated by Algorithm 1, and we can infer that there must be three positive constants , and such that
(3.30a) | |||||
(3.30b) | |||||
(3.30c) |
Proof.
According to Theorem 3.1, the function is strictly convex with respect to . Therefore, we can infer that there must be a positive constant such that
Next, we prove that the inequality (3.30b) holds. There is a fact , because is a minimum point of the minimization problem (3.21). We have
In addition, since the potential function is nonconvex, there must be a positive constant such that
Now, combining (3.31) and (3.32), we conclude that
which implies that
Similarly, we can infer that there must be a positive constant such that
The desired result is obtained.
Theorem 3.7.
Let the sequence generated by Algorithm 1, and we can infer that
Proof.
In the subproblem, since is a minimum point of the augmented Lagrangian function , according to the necessary condition of the extreme point, we have
Furthermore, according to the updated form of Lagrange multiplier Eqs(3.25), we obtain
Therefore, we can easily derive the inequality
where is an operator norm. Similarly, since and are solutions of the minimization problems (3.21) and (3.23), respectively, we have
and
Combining the updated form of Lagrange multiplier Eqs(3.25), we obtain
And combining the Assumption 3.2, we easily derive the inequality system
Finally, combining (3.25), (3.35) and (3.37), we obtain
The desired result is obtained.
Theorem 3.8.
Let the sequence generated by Algorithm 1. If the parameter is large enough such that , and we can infer that
Proof.
During the iteration of the proposed algorithm, the difference of the augmented Lagrangian function can be rewritten as
According to Theorem 3.5-3.7, we derive
Obviously, if , we obtain
The desired result is obtained.
Theorem 3.9.
The augmented Lagrangian function is bounded below, and it is convergent as .
Proof.
The augmented Lagrangian function can be rewritten as
Furthermore, combining (3.34) and (3.36), we derive the inequality system as follows
Since is a nonconvex function, we have
By above discussion, we obtain
Finally, the augmented Lagrangian function is monotone decreasing according to Theorem 3.8, so the function is convergent as .
The desired result is obtained.
Theorem 3.10.
Let the sequence generated by Algorithm 1. If the parameter is large enough such that , and we can infer that
Proof.
Firstly, combining Theorem 3.9 and taking the limit of Eq(3.38), we obtain
Next, combining (3.35) and (3.37), we have
Finally, combining the updated format of Lagrange multiplier and in Eqs(3.25), we can easily deduce
The desired result is obtained.
Theorem 3.11.
Let . If are the compact sets, the sequence generated by Algorithm 1 converges to a limit point , which is a stationary point of the augmented Lagrangian function .
Proof.
Since are compact sets, according to Theorem 3.10, then we can infer that there must be a subsequence of such that as .
Next, we prove that the point is a stationary point of the augmented Lagrangian function . The sequence must satisfy the sufficient condition of the minimization problem, so we obtain
Finally, combining Theorem 3.10, passing the limit in the above six equations along the subsequence , we have
which implies that the point is a stationary point of the function .
The desired result is obtained.
4 Numerical experiment
In this section, we conduct a series of numerical experiments to further illustrate the superiority and effectiveness of the proposed model from four different perspectives. Firstly, we verify the robustness of the selection of the initial value in Algorithm 1 through experimental results. Secondly, we perform numerical experiments to test two commonly used generalized nonconvex functions mentioned in Subsection 2.2. Throughout these experiments, unless stated otherwise, the generalized nonconvex function in Algorithm 1 is set as . We then proceed to experiment and demonstrate the ternary decomposition results of two test images under both noise-free and noisy experimental environments. Finally, we compare the proposed model with several advanced denoising models on specific test image sets to clearly illustrate its effectiveness in denoising applications
In the numerical experiments, the iteration stop condition of the proposed algorithm is that the maximum number of iterations or the relative error of is less than . Here the relative error is defined as
In addition, the PSNR [36] and MSSIM [37] are selected as the quantitative evaluation indices for assessing the quality of the restored images. It is well known that higher values of PSNR and MSSIM indicate better image quality. It should be noted that Fig.2 displays some of the test images used in the numerical experiments, with a size of . All numerical experiments in this paper were implemented in MATLAB and executed on a PC equipped with Windows 10, an Intel Core i5-8300H CPU running at 2.3GHz, and 16GB of RAM.
Fig.2 Some test images for numerical experiments.
4.1 Sensitivity analysis of initial value
This example aims to demonstrate the robustness of the selection of initial values of in the algorithm based on the experimental results. The proposed algorithm in this paper requires initializing certain parameters and variables, with particular interest in three variables: , and . However, considering the numerous combinations for selecting initial values for these three variables, it significantly increases the experimental cost. In fact, we find that the three variables satisfy the relation by the numerical algorithm in this paper, so we only choose different initial values to verify their robustness to the experimental results. Base on this, we set four initial value images (all gray values are ), (all gray values are ), (gray values are some random numbers between to ) and (observed image) as experimental objects.
Figure 3 illustrates the experimental results of the “Cameraman” image obtained by the proposed model using different initial values of . The first row shows the difference image between the noisy input image and the restored image, which mainly consists of noise components and oscillation characteristics. The second and third rows display the restored image and a locally zoomed-in region of that image, respectively. Upon examining these results, it is evident that the visual effects of the restored images are almost identical regardless of the different initial values .
In addition, we also select two other test images, namely “Butterfly” and “Peppers1”, to serve as subjects for supplementary experiments, and test them at different noise levels. Table 1 shows the PSNR values of the restored image obtained by the proposed model with different initial values . It should be noted that the last column in Table 1 represents the standard deviation (SD) of the four PSNR values in each row, which measures the dispersion of PSNR values. From the results, it is evident that these standard deviation (SD) values are consistently around 0.02, which indicates that the different initial values of have a negligible quantitative impact on the experimental results.
Fig.3 The experimental results of “Cameraman” image under different initial values with .
4.2 Test of different potential functions
This example aims to examine the impact of different non-convex potential functions on the image restoration outcomes of the proposed model. To accomplish this, we have chosen two specific non-convex potential functions, namely and , for conducting comparative experiments on the test image “Peppers1” contaminated with Gaussian noise having a variance of . The functional graphs corresponding to the exponential function and the logarithmic function are depicted in Fig.4.
It is important to note that the purpose of this experiment is primarily to assess the impact of different non-convex potential functions on the denoising results of the model rather than achieving optimal image restoration effects. Therefore, apart from varying the non-convex parameters and in the potential functions, all other parameters remain unchanged across all experiments. Consequently, it is evident that the image restoration outcomes may not be optimal in this particular scenario.
It is widely recognized that non-convex functions can effectively preserve structural information such as image edges and contours, exhibiting strong denoising capabilities. However, they may yield staircase effects in smooth areas. From the function graph of the non-convex potential function shown in Fig.4, it is evident that the non-convex function becomes weaker as the parameter increases. In other words, a smaller value of (e.g., ) can better retain structural information in the image, while increasing the value of weakens the denoising effect but restores smoother areas. On the contrary, the change trend of the parameter in the function is opposite to that of , i.e., larger values of result in stronger non-convexity. Fig.5 illustrates the schematic diagram of the proposed model after denoising the “Peppers1” image using different values of and for the non-convex potential functions and respectively. By comparing Fig.5(i) and (l), it is visually evident that compared to when , a smaller value of leads to more pronounced staircase effects, which further confirms our previous observations based on visual effects. Similarly, a similar conclusion can be drawn by observing Fig.5(q) and (t). Fig.6 presents local cross-sections of the denoised image, where we also observe that stronger non-convexity results in more severe staircase effects.
Furthermore, Table 2 presents the PSNR and MSSIM values for all denoised images in Fig.5, with the optimal value highlighted in bold. It is observed that the image quality is highest when in the non-convex potential function . This suggests that the reflection component and illumination component of the “Peppers1” image have a sparse representation in the TV transform domain under this condition. In fact, when in , the non-convexity becomes stronger compared with the case of , resulting in higher PSNR and MSSIM values for the restored image. However, since the “Peppers1” image is not strictly sparse in the TV transform domain, as continues to increase, the quality of the restored image decreases.
Fig.4 Plots of different non-convex potential functions.
Fig.5 Restoration results of “Peppers1” image using different potential functions with .
Fig.6 The local cross-section of restored images in Fig.5.
4.3 Three component decomposition by the proposed model
In this section, we select two typical test images “Logvi”, and “Checkboard” to verify the effectiveness of the proposed model at different noise levels. In fact, the proposed model (3.2) in this paper is based on the retinex theory background, which decomposes a noisy image into three different components according to the retinex imaging theory (3.1), that is, the reflection component with piecewise constant characteristics, the illumination component with spatial smoothing characteristics, and the noise with high frequency oscillation characteristics. It is worth noting that the main purpose of this experiment is to verify that the proposed model can accurately decompose the above three components and can use and to reconstruct the restored image .
Fig.7 shows the decomposed and restored image results for “Logvi” at and , with the last row representing the local cross sections , and from left to right. We find that the noisy image “Logvi” is well decomposed into piecewise constant component , spatially smooth component and oscillatory component from Fig.7. In addition, the restored image reconstructed according to the reflectivity and the illumination also retains the edge and texture information, which indicates that the proposed decomposition model can be effectively applied to image denoising. For example, Fig.7(b)-(d) show a schematic diagram of the three components , and when , respectively, and (e) is the reconstructed image from and . In order to illustrate the validity of the model more clearly, we may as well observe the Fig.7(g)-(j), which is the locally zoom-in region of above four subimages. Obviously, we can clearly find from the image that the reflection component has an obvious piecewise constant region, the illumination component has a spatial smoothing feature, and the noise component is chaotic. It can be seen from the Fig.7(j) that the restored image remains the edge and texture information better, and compared with the reflected image , it has a slighter step effect and is more suitable the characteristics of a natural image.
We also test the decomposition effect of “Checkboard” image at , and the decomposition results and cross-section diagram are shown in Fig.8. Obviously, we find from the Fig.8 that the proposed model divides the noise image into three components , and with different characteristics, and successfully reconstructs the restored image . The third row in Fig.8 shows the local cross-section diagrams of , and , respectively. By observing the gray value curve Fig.8(f)-(h), we notice that the reflection image contains a wide range of piecewise constant regions, and the illumination has the characteristics of piecewise linear smoothing. This is because the proposed model uses a generalized nonconvex first-order TV regularizer to measure the reflection component , a generalized nonconvex second-order TV regularizer to measure the illumination component , and the weaker-norm to extract the oscillation component .
From the above two decomposition experiments, we conclude that the proposed model can accurately extract the characteristics of different components in the degraded image, i.e., the reflectance , illumination and noise can be well separated.
Fig.7 Decomposition results of “Logvi” image at different noise levels.
Fig.8 Decomposition results of “Checkboard” image with .
4.4 The proposed decomposition model for image denoising application
The proposed variational decomposition model in this paper is based weak space and hybrid nonconvex regularization, so the expression of the restored image is described as . In order to evaluate the performance of the proposed model in the field of image denoising, we compare the proposed model with some classical or popular variational models on different types of test images in this section. And the main purpose of the experiment is to verify the effectiveness of the denoising performance of the proposed model on various types of images. Specifically, we test the restoration effect of the proposed model on four image sets (gray images, real brain MR images, color images and three data sets), and compare some other variational models, including NTV [31], TGV [38], NNTV- [34] and NETV [21] models.
As we know, NTV model is the most classical nonconvex variational model with excellent structural recovery efficiency. The TGV model is a high-order variational hybrid regularization model, which has excellent denoising effect but insufficient retention of structural information such as edges and contours. And we set the parameters and according to the suggestion of [38] in the experiment. NNTV- model is a variational decomposition model proposed by Tang et al., which uses weak space and nonconvex regularizers to decompose images into oscillatory and structural components. Since the model directly regards the structural component as the restored image, the restored image lacks some texture oscillation information. NETV model is based on retinex theory and can better restore the image structure by using non-convex regularization term to model the source reflectance and illumination respectively. Finally, it should be noted that the above four variational models used for comparison are solved under the framework of ADMM algorithm, and the non-convex potential functions in the models are selected as .
Example 1. Tests on gray images
In this example, we test the denoising performance of the proposed model on three gray images “Cameraman”, “Butterfly”, and “Peppers1”. In order to make the proposed model more convincing in denoising effectiveness, we let these gray images used in the experiment be contaminated by Gaussian noise with different noise levels ( and 15). Fig.9 and Fig.10 show the denoising results of gray images contaminated by Gaussian noise with standard deviation and , respectively. In which, the data in the second and fourth rows is the zoom-in region of the images, and the last row represents the colorbars of the difference image between the restored image and the real clean image. In addition, all the denoising data of the above gray test images are presented in Table 3, and the optimal values are bolded.
From Fig.9, we find that the NTV model and NNTV- model are similar in the visual perception of the restored image, that is, they can better restore the sharp edge and contour information in the images. However, the smooth region of the image causes serious staircase effect, which destroys texture and detail information of the images. It is worth noting that the NNTV- model suffers from a slighter staircase phenomenon than the NTV model, because the NNTV- model uses weak space to separate the oscillating components in the images. TGV model has strong denoising efficiency and can restore smooth areas in images, but it also over-smooths the structural information such as edges and contours in images. For example, from the local zoom-in region of the “Peppers1” image in Fig.9, it is easy to find that the NTV model exists more serious piecewise constant region, while the TGV model blurs the edges of the image. Compared with the above models, NETV model has better edge and texture recovery effect, but its denoising performance is insufficient. Finally, we draw the point from the difference image shown in figures that the proposed model has the singlest colorbars, that is, the proposed model has better results in both denoising performance and texture and edge retention.
In addition, the PSNR and MSSIM values shown in Table 4 can also intuitively find that the proposed model has the best PSNR value compared with the above variational models. It is undeniable that the MSSIM values of restored “Cameraman” image by the TGV model is slightly higher than those of the proposed model. Therefore, we further illustrate the superiority of the proposed model from a quantitative perspective based on the numerical results obtained from the experiment.
Fig.9 The denoising results of different models for the gray test images().
Fig.10 The denoising results of different models for the gray test images().
Example 2. Tests on real brain MR images
As we know, medical images are produced by complex imaging systems. During the imaging process, the images will be seriously degraded, which is caused by the interference of unstable signals such as electromagnetic or current. Therefore, in this example, we select three real cleaner brain MR images as test objects and add high intensity Gaussian noise with standard deviation to them in order to evaluate the denoising effect of the proposed model on complex medical images. Fig.11 and Fig.12 show the denoising results of “Brain1”, “Brain2” and “Brain3” images respectively and contain the zoom-in region of the test images and the colorbars of the difference images. The PSNR and MSSIM values are listed in Table 4. In addition, we also show the gray value function curve of the local cross-section of the restored images in Fig.13.
From the denoising results, we find that the NTV model exists serious piecewise constant regions, but this influence is smaller for the NNTV- model. TGV model removes most of the noise in images. Such as the zoom-in region of “Brain1” in Fig.11, it is obvious that the noise in smooth regions almost non-existent, but the sharp contour may be mistakenly regarded as noise being smoothed. NETV model can balance the recovery of edge and smooth region, but it is difficult to separate high-frequency oscillation information, so its denoising effect is less effective. It is clear that the proposed model overcomes the above shortcomings, and can preserve the edge and texture information while denoising. In addition, we draw a similar conclusion by observing the PSNR and MSSIM values in Table 4, i.e., the proposed model has the largest PSNR and MSSIM values on real brain MR images.
Finally, we further verify the above viewpoint from the gray value curve of the local cross-section of the restored image. From Fig.13(a), we find that the TGV model smoothes the region where the gray value changes rapidly, and NNTV- model exists staircase effect in low-frequency oscillation regions. Similarly, from Fig.13(b), we also observe that the NTV model presents the more serious piecewise constant phenomenon, and the NETV model retains some high-frequency oscillation components. It can be concluded that the proposed model is superior and generalizable and shows excellent results in recovering complex medical images.
Fig.11 The denoising results of noisy “Brain1” image with
Fig.12 The denoising results of noisy “Brain2” and “Brain3” images with .
Fig.13 The local cross-section of restored images in Fig.12.
Example 3. Tests on color images
In this example, we test the denoising performance of the proposed model on some color images. In fact, a color image contains three RGB color channels, then it can also be considered as a three-dimensional array of . And the noisy color images are generated by the RGB channels that are all contaminated by Gaussian noise. In this experiment, it should be noted that all the denoising models denoise the three color channels of the image, and the PSNR and MSSIM values are the average values of the three restored RGB channels. The PSNR and MSSIM values of the denoised image are listed in Table 5, and Fig.14 shows the recovery results of the color images at , where the third and fifth rows represent the local zoom-in region of the images.
Similarly, from Fig.14, we clearly observe that the NTV model and NNTV- model recover the structural information of the image well. For example, the overall structure of the house in “House” image is well preserved, but the smooth information of the roof and wall has a serious staircase effect. It is worth noting that the TGV model can restore these smooth regions mentioned above, but its strong denoising performance will cause the loss of edge information. NETV model can not only retain the contour of the image, but also restore part of the texture and detail information, but its denoising performance is insufficient to completely eliminate the noise in images. The proposed model combines the advantages of the above models, i.e., this model effectively reduces the staircase phenomenon, and can remove the noise while restoring the image edge and texture information.
In addition, the experimental data in Table 5 also reflect the advantages of the proposed model in color image restoration. We find that the MSSIM values of NNTV- model are only slightly higher than those of the proposed model for “Boats” image at . In all the remaining cases, the PSNR and MSSIM values of the proposed model are ahead of other variational models. In summary, we can conclude that the proposed model still has superior recovery results on color images.
Fig.14 The denoising results of the color test images with .
Example 4. Tests on three DataSets
In the last example, we further validate the effectiveness and superiority of the proposed model by conducting experiments on three image datasets: Set14 [39], IVC [40], and Tid2008 [41]. The proposed model is compared with several classical or state-of-the-art variational models in PSNR values, including TV [42], NTV [31], TGV [38], HOTV [43], NLTV [44], OGS-TV [45], NNTV- [34] and NETV [21]. It should be noted that we do not impose any mandatory resizing of test images in these datasets.
Table 6 lists the average PSNR values of the restored images obtained by the denoising models mentioned above on three datasets with different noise levels. Based on the numerical results, it is evident that the proposed model achieves the highest average PSNR values, indicating superior recovery performance. Thus far, in this subsection, we have effectively demonstrated the generalization and superiority of our proposed model in image restoration through four illustrative examples.
5 Conclusions
In this paper, we propose an exponential Retinex decomposition model for image denoising based on weak space and hybrid nonconvex regularization. This model decomposes the noisy image into three incoherent parts, and reconstructs the denoised image by using the reflection component and the illumination component. Specifically, the oscillation component, reflection component and illumination component are measured by weak space, nonconvex first order TV regularizer and nonconvex second order TV regularizer, respectively. In addition, we also propose an ADMM combined with MM algorithm to solve the proposed model, and provide sufficient conditions for the convergence of the proposed algorithm. Numerical experiments were conducted to compare the proposed model with several state-of-the-art denoising models. The experimental results demonstrate that the proposed model outperforms these models in terms of PSNR and MSSIM values. This further confirms its effectiveness and superiority.
It is important to acknowledge that there are still some limitations in the proposed model, which also serve as potential directions for our future research: (1) There is a need to explore more efficient optimization algorithms that can guarantee the uniqueness and global convergence of the numerical solution. This will enhance the reliability and stability of the model; (2) The proposed model takes too much time by using the parameter values determined by trial and error method. In the future work, we will focus on the adaptive methods of these parameters.
Acknowledgement
This work was supported in part by the Natural Science Foundation of China under Grant No. 62061016, 61561019, the Doctoral Scientific Fund Project of Hubei Minzu University under Grant No. MY2015B001, the Innovative Project of the School of Mathematics and Statistics, Hubei Minzu University under Grant No. STK2023002.
Declarations
Conflict of Interest: The authors declare that they have no conflict of interest.
Data Availability Statement: Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.
References
- [1] C. Maria, T. Maciej, Z. Chao, P. Krzysztof, Enhancing single-shot fringe pattern phase demodulation using advanced variational image decomposition, Journal of Optics 21 (4) (2019) 045702.
- [2] S. Wang, K. Xia, L. Wang, J. Zhang, H. Yang, Improved rpca method via weighted non-convex regularization for image denoising, IET Signal Processing 14 (5) (2020) 269–277.
- [3] L. Fan, H. Li, M. Shi, Z. Hua, C. Zhang, Two-stage image denoising via an enhanced low-rank prior, Journal of Scientific Computing 90 (2022) 57.
- [4] L. M. Tang, Y. J. Ren, Z. Fang, C. J. He, A generalized hybrid nonconvex variational regularization model for staircase reduction in image restoration, Neurocomputing 359 (24) (2019) 15–31.
- [5] X. G. Lv, Y. Z. Song, S. X. Wang, J. Le, Image restoration with a high-order total variation minimization method, Applied Mathematical Modelling 37 (16-17) (2013) 8210–8224.
- [6] J. Bai, X. C. Feng, Fractional-order anisotropic diffusion for image denoising, IEEE Transactions on Image Processing 16 (10) (2007) 2492–2502.
- [7] J. F. Aujol, G. Gilboa, T. Chan, S. Osher, Structure-texture image decomposition-modeling, algorithms, and parameter selection, International Journal of Computer Vision 67 (1) (2006) 111–136.
- [8] L. M. Tang, Z. Fang, C. C. Xiang, S. Q. Chen, Image selective restoration using multi-scale variational decomposition, Journal of Visual Communication and Image Representation 40 (pt.B) (2016) 638–655.
- [9] T. F. Chan, S. Esedoglu, F. E. Park, Image decomposition combining staircase reduction and texture extraction, Journal of Visual Communication and Image Representation 18 (6) (2007) 464–486.
- [10] J. M. Morel, A. B. Petro, C. Sbert, A pde formalization of retinex theory, IEEE Transactions on Image Processing 19 (11) (2010) 2825–2837.
- [11] X. Fu, Y. Liao, D. Zeng, Y. Huang, X. Zhang, X. Ding, A probabilistic method for image enhancement with simultaneous illumination and reflectance estimation, IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society 24 (12) (2015) 4965.
- [12] W. Ma, S. Osher, A tv bregman iterative model of retinex theory, Inverse Problems and Imaging 6 (4) (2007) 697–708.
- [13] E. H. Land, J. J. McCann, Lightness and retinex theory, Journal of the Optical Society of America 61 (1) (1971) 1–11.
- [14] E. H. Land, H. Edwin, The retinex theory of color vision, Scientific American 237 (6) (1978) 108–128.
- [15] Q. Zhao, P. Tan, Q. Dai, L. Shen, E. Wu, S. Lin, A closedform solution to retinex with nonlocal texture constraints, IEEE Transactions on Pattern Analysis and Machine Intelligence 34 (7) (2012) 1437–1444.
- [16] R. Kimmel, M. Elad, D. Shaked, R. Keshet, I. Sobel, A variational framework for retinex, International Journal of Computer Vision 52 (1) (2003) 7–23.
- [17] M. K. Ng, W. Wang, A total variation model for retinex, SIAM Journal on Imaging Sciences 4 (1) (2011) 345–365.
- [18] J. Liang, X. Zhang, Retinex by higher order total variation decomposition, Journal of Mathematical Imaging and Vision 52 (3) (2015) 345–355.
- [19] J. Xu, Y. Hou, D. Ren, L. Liu, F. Zhu, M. Yu, H. Wang, L. Shao, Star: A structure and texture aware retinex model, IEEE Transactions on Image Processing 29 (2020) 5022–5037.
- [20] L. Liu, Z. F. Pang, Y. Duan, Retinex based on exponent-type total variation scheme, Inverse Problems and Imaging 12 (5) (2018) 1199–1217.
- [21] Y. Wang, Z. F. Pang, Y. Duan, K. Chen, Image retinex based on the nonconvex tv-type regularization, Inverse Problems and Imaging 15 (6) (2021) 1381–1407.
- [22] Y. Meyer, Oscillating Patterns in Image Processing and Nonlinear Evolution Equations: The Fifteenth Dean Jacqueline B. Lewis Memorial Lectures, American Mathematical Society, 2001.
- [23] L. A. Vese, S. J. Osher, Image denoising and decomposition with total variation minimization and oscillatory functions, Journal of Mathematical Imaging and Vision 20 (1-2) (2004) 7–18.
- [24] L. M. Tang, L. Wu, Z. Fang, C. Y. Li, A non‐convex ternary variational decomposition and its application for image denoising, IET signal processing 16 (3) (2022) 248–266.
- [25] L. M. Tang, C. J. He, Multiscale texture extraction with hierarchical (bv,gp,l2) decomposition, Journal of Mathematical Imaging and Vision 45 (2) (2013) 148–163.
- [26] S. Osher, A. Sole, L. Vese, Image decomposition and restoration using total variation minimization and the norm, SIAM Multiscale Modeling and Simulation 1 (3) (2003) 349–370.
- [27] W. Deng, W. Yin, On the global and linear convergence of the generalized alternating direction method of multipliers, Journal of Scientific Computing 66 (3) (2016) 889–916.
- [28] M. V. Afonso, J. M. Bioucas-Dias, M. A. T. Figueiredo, An augmented lagrangian approach to the constrained optimization formulation of imaging inverse problems, IEEE Transactions on Image Processing 20 (3) (2011) 681–695.
- [29] D. R. Hunter, K. Lange, A tutorial on mm algorithms, American Statistician 58 (1) (2004) 30–37.
- [30] D. R. Hunter, R. Li, Variable selection using mm algorithms, Annals of Statistics 33 (4) (2005) 1617–1642.
- [31] M. Nikolova, M. K. Ng, C. P. Tam, Fast nonconvex nonsmooth minimization methods for image restoration and reconstruction, IEEE Transactions on Image Processing 19 (12) (2010) 3073–3088.
- [32] M. Nikolova, M. K. Ng, S. Zhang, W. K. Ching, Efficient reconstruction of piecewise constant images using nonsmooth nonconvex minimization, SIAM Journal on Imaging Sciences 1 (1) (2008) 2–25.
- [33] Y. Sun, P. Babu, D. P. Palomar, Majorization-minimization algorithms in signal processing, communications, and machine learning, IEEE Transactions on Signal Processing 65 (3) (2017) 794–816.
- [34] L. M. Tang, H. L. Zhang, C. J. He, Z. Fang, Non-convex and non-smooth variational decomposition for image restoration, Applied Mathematical Modelling 69 (2019) 355–377.
- [35] M. Benning, F. Knoll, C.-B. Schönlieb, T. Valkonen, Preconditioned admm with nonlinear operator constraint, in: IFIP Conference on System Modeling and Optimization, 2015, pp. 117–126.
- [36] A. Horé, D. Ziou, Image quality metrics: Psnr vs. ssim, in: 2010 20th International Conference on Pattern Recognition, 2010, pp. 2366–2369.
- [37] W. Zhou, A. C. Bovik, H. R. Sheikh, E. P. Simoncelli, Image quality assessment: from error visibility to structural similarity, IEEE Transactions on Image Processing 13 (4) (2004) 600–612.
- [38] K. Bredies, K. Kunisch, T. Pock, Total generalized variation, SIAM Journal on Imaging Sciences 3 (3) (2010) 492–526.
- [39] L. Wu, L. M. Tang, L. C. Yan, Hybrid regularization model combining overlapping group sparse second-order total variation and nonconvex total variation, Journal of Electronic Imaging 31 (4) (2022) 043012.
- [40] P. L. Callet, F. Autrusseau, Subjective quality assessment irccyn/ivc database, Informatique Traitement Signal Image (2005).
- [41] N. Ponomarenko, V. Lukin, A. Zelensky, K. Egiazarian, M. Carli, F. Battisti, Tid2008-a database for evaluation of full-reference visual quality assessment metrics, Advances of Modern Radioelectronics 10 (4) (2009) 30–45.
- [42] L. I. Rudin, S. Osher, E. Fatemi, Nonlinear total variation based noise removal algorithms, Physica D Nonlinear Phenomena 60 (1-4) (1992) 259–268.
- [43] X. G. Lv, Y. Z. Song, S. X. Wang, L. Jiang, Image restoration with a high-order total variation minimization method, Applied Mathematical Modelling 37 (16-17) (2013) 8210–8224.
- [44] D. Lv, Q. Zhou, J. K. Choi, J. Li, X. Zhang, Nonlocal tv-gaussian prior for bayesian inverse problems with applications to limited ct reconstruction, Inverse Problems and Imaging 14 (1) (2020) 117.
- [45] J. Liu, T. Z. Huang, I. W. Selesnick, X. G. Lv, P. Y. Chen, Image restoration using total variation with overlapping group sparsity, Information Sciences 295 (2015) 232–246.