Fuzzy Regression Models
Fuzzy Regression Models
net/publication/228526466
CITATIONS READS
58 3,879
1 author:
Arnold Shapiro
Pennsylvania State University
35 PUBLICATIONS 1,153 CITATIONS
SEE PROFILE
All content following this page was uploaded by Arnold Shapiro on 20 February 2021.
Abstract
Recent articles, such as McCauley-Bell et al. (1999) and Sánchez and Gómez (2003a,
2003b, 2004), used fuzzy regression (FR) in their analysis. Following Tanaka et. al.
(1982), their regression models included a fuzzy output, fuzzy coefficients and an non-
fuzzy input vector. The fuzzy components were assumed to be triangular fuzzy numbers
(TFNs). The basic idea was to minimize the fuzziness of the model by minimizing the
total support of the fuzzy coefficients, subject to including all the given data.
The purpose of this article is to revisit the fuzzy regression portions of the foregoing
studies and to discuss issues related to the Tanaka approach, including a consideration of
fuzzy least-squares regression models.
Acknowledgments:
This work was supported in part by the Robert G. Schwartz Faculty Fellowship and the
Smeal Research Grants Program at the Penn State University. The assistance of Michelle
L. Fultz is gratefully acknowledged.
ARC2005_Shapiro_06.pdf 1
1 Introduction
Recent articles, such as McCauley-Bell et al. (1999) and Sánchez and Gómez (2003a,
2003b, 2004), used fuzzy regression (FR) in their analysis. The former use it to predict
the relationship of known risk factors to the onset of occupational injury, while the latter
used it to investigate the term structure of interest rates (TSIR). Following Tanaka et. al.
(1982), their models took the general form:
~ ~ ~ ~ (1)
Y = A0 + A1 x1 + L + An xn
~
where Y is the fuzzy output, Ãi, j=1,2,..., n, is a fuzzy coefficient, and x = (x1, ..., xn) is an
n-dimensional non-fuzzy input vector. The fuzzy components were assumed to be
triangular fuzzy numbers (TFNs). Consequently, the coefficients, for example, can be
characterized by a membership function (MF), µA(a), a representation of which is shown
in Figure 1.
As indicated, the salient features of the TFN are its mode, its left and right spread, and its
support. When the two spreads are equal, the TFN is known as a symmetrical TFN
(STFN).
The basic idea of the Tanaka approach, often referred to as possibilistic regression, was to
minimize the fuzziness of the model by minimizing the total spread of the fuzzy
coefficients, subject to including all the given data.
The purpose of this article is to revisit the fuzzy regression portions of the foregoing
studies, and to discuss issues related to the Tanaka (possibilistic) regression model. This
ARC2005_Shapiro_06.pdf 2
discussion is not meant to be exhaustive but, rather, is intended to point out some of the
major considerations. The outline of the paper is as follows. We first define and
conceptualize the general components of fuzzy regression. Next, the essence of the
Tanaka model is explored, including a commentary on some of its potential limitations.
Then, fuzzy least-squares regression models are discussed as an alternative to the Tanaka
model. Throughout the paper, the same simple data set is used to show how the ideas are
implemented. The paper ends with a summary of the conclusions of the study.
2.1 Motivation
where the dependent (response) variable, yi , the independent (explanatory) variables, xij,
and the coefficients (parameters), βj, are crisp values, and εi is a crisp random error term
with E(εi)=0, variance σ2(εi )=σ2, and covariance σ(εi , εj) = 0, ∀i,j, i≠ j.
Although statistical regression has many applications, problems can occur in the
following situations:
Thus, statistical regression is problematic if the data set is too small, or there is difficulty
verifying that the error is normally distributed, or if there is vagueness in the relationship
between the independent and dependent variables, or if there is ambiguity associated with
the event or if the linearity assumption is inappropriate. These are the very situations
fuzzy regression was meant to address.
There are two general ways (not necessarily mutually exclusive) to develop a fuzzy
regression model: (1) models where the relationship of the variables is fuzzy; and (2)
ARC2005_Shapiro_06.pdf 3
models where the variables themselves are fuzzy. Both of these models are explored in
the rest of this article, but, for this conceptualization, we focus on models where the data
is crisp and the relationship of the variables is fuzzy.
It is a simple matter to conceptualize fuzzy regression. Consider for this, and subsequent,
examples the following simple Ishibuchi (1992) data:
Table 1: Data Pairs
i 1 2 3 4 5 6 7 8
xi 2 4 6 8 10 12 14 16
yi 14 16 14 18 18 22 18 22
Starting with this data, we fit a straight line through two or more data points in such a
way that it bounds the data points from above. Here, these points are determined
heuristically and OLS is used to compute the parameters of the line labeled YH , which
takes the values yˆ = 13 + .75 x , as shown in Figure 2(a).
Similarly, we fit a second straight line through two or more data points in such a way that
it bounds the data points from below. As shown in Figure 2(b), the fitted line in
this case is labeled YL and takes the values yˆ = 11 + .5 x .
Assuming, for the purpose of this example, that STFN are used for the MFs, the modes of
the MFs fall midway between the boundary lines.1
1
This approach to choosing the mode was discussed by Wang and Tsaur (2000) p. 357.
ARC2005_Shapiro_06.pdf 4
For any given data pair, (xi, yi), the foregoing conceptualizations can be summarized by
the fuzzy regression interval [YiL , YiU ] shown in Figure 3.2
Yih =1 is the mode of the MF and if a SFTN is assumed, Yih =1 = Yi = (YiU + YiL )/2 . Given
the parameters, (YU,YL, Yh=1), which characterize the fuzzy regression model, the i-th
data pair (xi,yi), is associated with the model parameters (YiU , YiL , Yih =1 ) . From a
regression perspective, we can view YiU - yi and yi - YiL as components of the SST, yi -
Yih =1 as a component of SSE, and YiU - Yih =1 and Yih =1 - YiL as components of the SSR, as
discussed by Wang and Tsaur (2000).
In possibilistic regression based on STFN, only the data points involved in determining
the upper and lower bounds determine the structure of the model, as depicted in Figure 2.
The rest of the data points have no impact on the structure. This problem is resolved by
using asymmetric TFNs.
Combining Equation (1) and Figure 1, and, for the present, restricting the discussion to
STFNs, the MF of the j-th coefficient, may be defined as:
| a −aj |
µ A (a) = max 1 − , 0 (3)
j
cj
where aj is the mode and cj is the spread, and represented as shown in Figure 4.
2
Adapted from Wang and Tsaur (2000), Figure 1.
ARC2005_Shapiro_06.pdf 5
Figure 4: Symmetrical fuzzy parameters
Defining
~
{ ~ ~
}
A j = {a j , c j }L = A j : a j − c j ≤ A j ≤ a j + c j L , j = 0,1, L , n (4)
and restricting consideration to the case where only the coefficients are fuzzy, we can
write
~ ~ n
~ (5)
Yi = A0 + ∑Ax
j =1
1 ij
n
= (a0 , c0 ) L + ∑ (a j , c j ) L xij
j =1
This is a useful formulation because it explicitly portrays the mode and spreads of the
fuzzy parameters. In a subsequent section, we explore fuzzy independent variables.
If, as in Figure 3, the supports3 are just sufficient to include all the data points of the
sample, there would be only limited confidence in out-of-sample projection using the
estimated FR model. This is resolved for FR, just as it is with statistical regression, by
extending the supports.
Consider the MF associated with the j-th fuzzy coefficient, a representation of which is
shown in Figure 5.
3
Support functions are discussed in Diamond (1988: 143) and Wünsche and Näther (2002: 47).
ARC2005_Shapiro_06.pdf 6
Figure 5: Estimating Aj using an "h-certain" factor
For illustrative purposes, a non-symmetric TFN is shown, where c Lj and c Rj represent the
left and right spread respectively. Beyond that, what makes this MF materially different
from the one shown in Figure 4, is that it contains a point "h" on the y-axis, called an "h-
certain factor," which, by controlling the size of the feasible data interval (the base of the
shaded area), extends the support of the MF.4 In particular, as the h-factor increases for a
given data set, so increases the spreads, c Lj and c Rj .
An h-certain factor also can be applied to the observed output. Thus, the i-th output data
~
might be represented by the STFN, Yi = (y i , e i ) , where yi is the mode and ei is the spread,
as shown in Figure 6. Here, the actual data points fall within the interval yi ± (1-h) ei, the
base of the shaded portion of the graph.
4
Note that the h-factor has the opposite purpose of an α-cut, in that the former is used to extend the
support, while the latter is used to reduce the support.
ARC2005_Shapiro_06.pdf 7
Figure 6: Observed Fuzzy Output
Given the foregoing, two general approaches are used to fit the fuzzy regression model:
The possibilistic model. Minimize the fuzziness of the model by minimizing the total
spreads of its fuzzy coefficients (see Figure 1), subject to including the data points of
each sample within a specified feasible data interval.
The least-squares model. Minimize the distance between the output of the model and
the observed output, based on their modes and spreads.
The details of these approaches are addressed in the next two sections of this paper.
n
min c0 + ∑ c j | x ij |, c j ≥ 0 (6)
j=1
Figure 7 shows the first step in the containment requirement, by showing how Figure 5
can be easily extended to portray the fuzzy output of the model.
ARC2005_Shapiro_06.pdf 8
Figure 7: Fuzzy output of the model
Putting this together with the observed fuzzy output, Figure 6, results in Figure 8, which
shows a representation of how the estimated fuzzy output may be fitted to the observed
fuzzy data.
The key is that the observed fuzzy data, adjusted for the h-certain factor, is contained
within the estimated fuzzy output, adjusted for the h-certain factor. Formally,
n n
a0 + ∑ a j xij + (1 − h) c0 + ∑c j | xij | > yi + (1 − h) ei (7)
j =1 j =1
ARC2005_Shapiro_06.pdf 9
n n
a0 + ∑ a j xij − (1 − h) c0 + ∑ c j | xij | < yi − (1 − h) ei
j =1 j =1
Figure 95 shows the impact of the h-factor on the sample data, given h=0 and h=.7.
The result is what one would expect. Increasing the h-factor expands the confidence
interval and, thus, increases the probability that out-of-sample values will fall within the
model. This is comparable to increasing the confidence in statistical regression by
increasing the confidence interval.
The possibilistic linear regression model, as depicted by equations (6) and (7), is
essentially the fuzzy regression model used by Sánchez and Gómez (2003a, 2003b, 2004)
to investigate the TSIR.6
5
Adapted from Chang and Ayyub (2001), Figure 4.
6
Key components of the Sánchez and Gómez methodology included constructing a discount function from
a linear combination of quadratic or cubic splines, the coefficients of which were assumed to be TFNs or
STFNs, and using the minimum and maximum negotiated price of fixed income assets to obtain the spreads
of the dependent variable observations. Given the fuzzy discount functions, the authors provided TFN
approximations for the corresponding spot rates and forward rates. It was necessary to approximate the
spot rates and forward rates since they are nonlinear functions of the discount function, and hence are not
TFNs even though the discount function is a TFN.
ARC2005_Shapiro_06.pdf 10
3.1 Criticisms of the Possibilistic Regression Model
There are a number of criticisms of the possibilistic regression model. Some of the major
ones are the following:
• The original Tanaka model was extremely sensitive to the outliers. [Peters (1994)].
• There is no proper interpretation about the fuzzy regression interval [Wang and
Tsaur (2000)]
• The fuzzy linear regression may tend to become multicollinear as more independent
variables are collected [Kim et al (1996)].
y i = β 0 + β1 x i + ε i , i = 1,2,..., m (8)
Conceptually, the relationship between the fuzzy i-th response and explanatory variables
in (9) can be represented as shown in Figure 10.
ARC2005_Shapiro_06.pdf 11
Figure 10: Fuzzy i-th response and explanatory variables
n
~ ~
min ∑ (Y i − b0 − b1 X i ) 2 (11)
i =1
There are a number of ways to implement FLSR, but the two basic approaches are FLSR
using distance measures and FLSR using compatibility measures. A description of these
methods follows.
Diamond (1988) was the first to implement the FLSR using distance measures and his
methodology is the most commonly used. Essentially, he defined an L2- metric d(.,.)2
between two TFNs by [Diamond (1988: 143) equation (2)]
+ ((m1 + r1 ) − (m2 + r2 ) ) 2
Given TFNs, it provides a measure of the distance between two fuzzy numbers based on
their modes, left spread and right spread.7
7
The methods of Diamond's paper are rigorously justified by a projection-type theorem for cones on a
Banach space containing the cone of triangular fuzzy numbers, where a Banach space is a normed vector
space that is complete as a metric space under the metric d(x, y) = ||x-y|| induced by the norm.
ARC2005_Shapiro_06.pdf 12
The case most similar to the Sánchez and Gómez model takes the form
~ ~ ~
Yi = β 0 + β1 xi + ε~i , i = 1,2,..., m (13)
In the case most reminiscent of statistical regression, the coefficients are crisp and the
task becomes the least-squares optimization problem
~ ~
min ∑ d (a + bX i , Yi ) 2 (16)
a,b
Once again, the solution is given by (12), adjusted to take into account the sign of b.
One approach to this problem (Hong et al (2001)) is to replace the t-norm min(a,b) with
the t-norm Tw(a,b) = a, if b=1; b, if a=1; 0, otherwise. Since Tw(a,b) is a shape preserving
operation under multiplication, it resolves the problem. This approach is used in Koissi
and Shapiro (2005).
Another approach is to use approximate TFNs. This was done by Sánchez and Gómez
(2003a), albeit in another context.
ARC2005_Shapiro_06.pdf 13
4.2 FLSR using compatibility measures
As indicated, γ ranges from 0, when the MFs are mutually exclusive, to 1, when the
modes of the MFs coincide.
Celmiņš compatibility model, which involved maximizing the compatibility between the
data and the fitted model, follows from this measure. The objective function is
m
∑ (1 − γ
i =1
i )2 (19)
Thus, for example, when there is a single crisp explanatory variable, [Chang and Ayyub
(2001: 190)]
~ ~
Yˆ = A0 + A1 x (20)
= m0 + m1 x ± c0 + 2c01 x + c12 x 2
where m0 and m1 are determined using weighted LS regression, and c0, c1, and c01 are
determined using iteration and the desired compatibility measure.
8
Adapted from Chang and Ayyub (2001), Figure 2.
ARC2005_Shapiro_06.pdf 14
An example of the use of the Celmiņš compatibility model applied to our sample data is
shown in Figure 12.9
The essential characteristics of the model in this case are the parabolic curves for the
upper and lower bounds and that the higher the compatibility level, the broader the width
of the bounds.
5 Comment
The studies of McCauley-Bell et al. (1999) and Sánchez and Gómez (2003a, 2003b,
2004) provide some interesting insights into the use of fuzzy regression. However, their
methodology relies on possibilitic regression, which has the potential limitations
mentioned in section 3.1. Since some of these limitations can be circumvented by using
FLSR techniques, it is important that researchers are familiar with these techniques as
well. If this article helps in this regard, it will have served its purpose.
References
Bardossy, A. (1990) "Note on fuzzy regression," Fuzzy Sets and Systems 37, 65-75.
9
Adapted from Chang and Ayyub (2001), Figure 5.
ARC2005_Shapiro_06.pdf 15
Celmiņš, A. (1987) "Least squares model fitting to fuzzy vector data," Fuzzy Sets and
Systems, 22(3), 245-269
Hojati, M., C. R. Bector and K. Smimou. (2004) "A simple method for computation of
fuzzy linear regression," European Journal of Operational Research (forthcoming)
Hong, D. H., J-K. Song and H.Y. Do. (2001) "Fuzzy least-squares linear regression
analysis using shape preserving operations," Information Sciences 138 185-193
Ishibuchi, H. (1992) "Fuzzy regression analysis," Fuzzy Theory and Systems, 4, 137-148
Kao, C. and C-L Chyu. (2003) "Least-squares estimates in fuzzy regression analysis,"
European Journal of Operational Research 148, 426-435
Kim, K. J., H. Moskowitz and M. Koksalan. (1996) "Fuzzy versus statistical linear
regression," European Journal of Operational Research, 92(2) 417-434
McCauley-Bell, P. and H. Wang. (1997) "Fuzzy linear regression models for assessing
risks of cumulative trauma disorders," Fuzzy Sets and Systems, 92(3), 317-340
Peters, G. (1994) "Fuzzy linear regression with fuzzy intervals," Fuzzy Sets and Systems,
63(1), 45-55
Tanaka, H., Uejima, S. and Asai, K. (1982) "Linear regression analysis with fuzzy
model," IEEE Transactions on Systems, Man and Cybernetics, 12(6), 903-907.
ARC2005_Shapiro_06.pdf 16
Wang, H.-F. and R.-C. Tsaur. (2000) "Insight of a fuzzy regression model," Fuzzy Sets
and Systems, 112(3), 355-369
Wünsche, A. and W. Näther. (2002) "Least-squares fuzzy regression with fuzzy random
variables," Fuzzy Sets and Systems, 130(1), 43-50
ARC2005_Shapiro_06.pdf 17