key: cord-0044861-hjxwgo66
authors: Pérez-Cañedo, Boris; Rosete, Alejandro; Verdegay, José Luis; Concepción-Morales, Eduardo René
title: A Fuzzy Goal Programming Approach to Fully Fuzzy Linear Regression
date: 2020-05-15
journal: Information Processing and Management of Uncertainty in Knowledge-Based Systems
DOI: 10.1007/978-3-030-50143-3_53
sha: 4bfcf516e29baa8032154a479ab9ab4bfc41a5e7
doc_id: 44861
cord_uid: hjxwgo66

Traditional linear regression analysis aims at finding a linear functional relationship between predictor and response variables based on available data of a given system, and, when this relationship is found, it is used to predict the future behaviour of the system. The difference between the observed and predicted data is supposed to be due to measurement errors. In fuzzy linear regression, on the other hand, this difference is supposed to be mainly due to the indefiniteness of the system. In this paper, we assume that predictor and response variables are LR-type fuzzy numbers, and so are all regression coefficients; this is known as fully fuzzy linear regression (FFLR) problem. We transform the FFLR problem into a fully fuzzy multiobjective linear programming (FFMOLP) problem. Two fuzzy goal programming methods based on linear and Chebyshev scalarisations are proposed to solve the FFMOLP problem. The proposed methods are compared with a recently published method and show promising results.

Traditional linear regression is one of the most frequently applied technique for finding functional relationships between predictor and response variables, and for making predictions. However, decision problems arising in ever-changing environments are difficult to describe or formulate with precise terms. Expert knowledge then gains a special value, and the need for its introduction into classical decision-making techniques has motivated the appearance and development of several mathematical theories dealing with uncertainty and vagueness. Among those theories, Fuzzy Sets Theory [31] has succeeded in numerous practical situations and is now an established research field. Fuzzy linear regression is a natural extension of the classical regression analysis and allows to predict the future behaviour of systems whose structure is not well defined and/or is influenced by subjectivity. It is particularly useful to forecast, e.g., future demands, resource availability and prices that could then be used to set up fuzzy optimisation problems in areas such as production, transportation, project management and so forth. Fuzzy linear regression has been used to forecast airport demand [21] , oil consumption [1] , house prices [32] , sales [5] and short-term load in power distribution systems [24] . Several other applications are reported in [6] .

Numerous fuzzy linear regression models and methods have been developed since the 1980s. Tanaka et al. [25] introduced fuzzy linear regression analysis and formulated a regression problem with crisp predictor variables, fuzzy response variable and fuzzy coefficients as a conventional linear programming problem. A modified version of Tanaka et al.'s [25] fuzzy linear regression method allowing negative spreads in the parameters was proposed in [3] . Chang and Lee [2] proposed fuzzy least square deviation and least absolute deviation models based on ranking functions. A multiobjective approach was proposed by Sakawa and Yano [23] by simultaneous consideration of the model fit and fuzziness. Recent methods for fuzzy linear regression have been presented in [14, 18, 22] . A comprehensive review until year 2019 is provided by Chukhrova and Johannssen [6] .

So far, fuzzy linear regression methods mainly resort to the minimisation of crisp-valued distance functions between fuzzy numbers, either by direct generalisations of known crisp distance functions or by the use of linear ranking functions to defuzzify response observations and model predicted values, and then taking the absolute value of the difference as the distance between the two fuzzy numbers. A simulation study, considering distance functions from both approaches, was conducted in [14] to determine the best distance function in fuzzy linear regression using Monte Carlo methods. Notably, Voxman [26] has argued that the distance between two fuzzy numbers should also be a fuzzy number, and proposed a fuzzy-valued distance function. However, to the best of our knowledge, fuzzy-valued distance functions have not been used in fuzzy regression analysis.

In this paper, we seek to evaluate other models and methods for fuzzy linear regression analysis, which do not rely on crisp-valued distance functions. We propose two methods based on FFMOLP for fuzzy linear regression analysis, in which the predictor variables, response variable and regression coefficients are LR-type fuzzy numbers. The proposed methods rely on the lexicographic approach to fully fuzzy linear programming (FFLP) with inequality constraints recently proposed in [19] . The rest of the paper is organised as follows. Section 1.1 presents some fundamental definitions on LR-type fuzzy numbers. Section 1.2 outlines the lexicographic method [19] for solving FFLP problems. In Sect. 2, we formulate the FFLR problem as a FFMOLP problem, and propose two fuzzy scalarisation methods based on classical goal programming to solve it. Section 3 discusses a numerical example. Lastly, Sect. 4 presents the conclusions and remarks for future work.

Dubois and Prade [8] defined the concept of LR-type fuzzy number and proposed simple formulae for arithmetic operations. In this section, we present some definitions concerning LR-type fuzzy numbers taken from reference [8] .

A fuzzy numberã = (m, α, β) LR is said to be an LR-type fuzzy number if its membership function is given by: 

1 = (m 1 , α 1 , β 1 ) LR andã 2 = (m 2 , α 2 , β 2 ) LR be any LR-type fuzzy numbers, thenã 1 =ã 2 if and only if m 1 = m 2 , α 1 = α 2 and β 1 = β 2 . Definition 3. An LR-type fuzzy numberã = (m, α, β) LR is said to be non- negative (resp. non-positive) if m − α ≥ 0 (resp. m + β ≤ 0). This is denoted bỹ a ≥ 0 (resp.ã ≤ 0).

Definition 5. Letã 1 = (m 1 , α 1 , β 1 ) LR andã 2 = (m 2 , α 2 , β 2 ) LR be two LR-type fuzzy numbers, then fuzzy addition is given byã 1 

The reader is referred to [15] for the definition of the product of unrestricted LR-type fuzzy numbers.

Due to the vast number of practical situations where fuzzy quantities must be compared, ranking fuzzy numbers is still recognised as a fundamental research problem in Fuzzy Sets Theory. Many ranking methodologies have been proposed in the literature [29, 30] . However, several researchers have noticed that most existing ranking methodologies cannot yield a total order of fuzzy numbers in a strict sense. To resolve this issue, lexicographic ranking criteria have been proposed as an alternative [11, 27, 28] .

The integration of lexicographic ranking criteria into FFLP methods started with [12, 13] and has been recently investigated in [7, 9, 10, 16, 19, 20] . In particular, the use of lexicographic ranking criteria for handling fuzzy inequality constraints has been proposed in [12, 19, 20] . In this section, we present the lexicographic method [19] for solving FFLP problems with inequality constraints. This method constitutes the basis of the results presented in the following sections.

Firstly, we need to introduce an order relation on F( ). Letã = (m, α, β) LR be an arbitrary LR-type fuzzy number, and suppose we have three linear functions of the parameters ofã,

Based on the above idea, we may consider the following criterion for ranking LR-type fuzzy numbers.

It can be shown that satisfies the total order properties. That is, for allã,b andc in F( ):

-ã ã (reflexivity); -ã b orb ã (comparability); -ifã b andb c, thenã c (transitivity); -ifã b andb ã, thenã =b (anti-symmetry).

Next, we present the lexicographic method proposed in [19] for solving FFLP problems with equality and inequality constraints.

The FFLP problem can be formulated as follows, wherec j ,ã ij andb i are LR-type fuzzy parameters,x j denote the LR-type fuzzy decision variables, and is an order relation on F( ); here, we assume that is given by Definition 7. 

By using Definitions 2 and 7, FFLP problem (1) is transformed into problem (2), which is then transformed into problem (3). To carry out these transformations,

In addition, I e , I le and I ge denote the index sets of the fuzzy equality, less-than-or-equal-to and greater-than-orequal-to constraints of FFLP problem (1), respectively; and M are positive real numbers sufficiently small and large, respectively.

Proof. See [19] .

In order to solve FFLP problem (1), we must choose a lexicographic criterion for ranking LR-type fuzzy numbers. There are several such criteria in the literature (see, e.g., [11, 27] ). Notably, the solution method outlined here is general enough so as to allow a decision-maker to use the criterion that best fits the decision-making problem at hand.

Letx j ,Ã j (j = 0, 2, . . . , n) andỹ be LR-type fuzzy numbers. Then the FFLR model is formulated as in Eq. (4).

In Eq. (4), eachx j is termed fuzzy predictor variable,ỹ fuzzy response variable andÃ j fuzzy regression coefficient. Now, let us consider a sample of LRtype fuzzy numbers X |Ỹ , whereX = (x ij ) i=1,2,...,m j=1,2,...,n contains the observations corresponding to each fuzzy predictor variablex j , and the column vector

..,m contains the observations of the fuzzy response variableỹ. We wish to determine the estimates ofÃ j so as to obtain the best fitting model given the available data.

In what follows, we formulate the FFLR problem as a FFMOLP problem. To this aim, we introduce two non-negative fuzzy deviation variablesSp i and Sn i for each sample. Thus, the following set of fuzzy equalities is obtained.

Therefore, we may consider the following FFMOLP problem:

In order to solve (P1), we resort to two known classical scalarisation methods based on goal programming, which are extended to the fuzzy case: linear scalarisation method and Chebyshev (minimax) scalarisation method.

In this method, each objective function is multiplied by a positive weighting factor and the resulting expressions are added together. Thus, we have,

Hereafter, w i = 1 for i = 1, 2, . . . , m since no particular preference for the objective functions shall be considered.

In thus, the objective is to minimise the maximal deviation.

The whole procedure can be summarised in the following six steps.

1. Input: Sample dataX andỸ ; 2. choose a lexicographic criterion for ranking LR-type fuzzy numbers; 3. set up FFMOLP problem (P1); 4. choose either of the proposed scalarisation methods, and set up FFLP problem (l-P1) or FFLP problem (ch-P1); 5. solve the FFLP problem chosen in Step 4 by using the lexicographic method outlined in Sect. 1.2; 6. output:Ã j for j = 0, 1, . . . , n as the estimated regression parameters.

The example in this section is taken from references [4, 17, 18] . The dataset contains 30 samples, each having four predictor variables and one response variable (see Table 1 ). It is a real-life dataset comprising triangular fuzzy numbers used to subjectively evaluate employee's performance according to work quality, inability to endure job stress, frequency of delays, and communication and coordination ability. As part of the solution procedure from Sect. 2.3, the functions f 1 (ã) := 3m + β − α, f 2 (ã) := m + β and f 3 (ã) := α + β were used to define a lexicographic order relation on F( ).

We applied the proposed methods and obtained the following two models. The estimated responses of both models are shown in Table 2 . (30, 11, 9) LR (29, 8, 8) The obtained models' predicted values were compared with the ones reported by Li et al. [18] , according to the overall absolute distance from the observed responses, using Eqs. (5) and (6). 

In order to compare the predicted values, first Li et al.'s [18] solution is converted to LR representation of fuzzy numbers, since the authors used a different representation. From the last row of Table 2 , it can be seen that the model obtained by the Linear Scalarisation Method (FFLP problem (l-P1)) has the smallest overall distance value, followed by Li et al.'s [18] model and the model obtained by using the proposed Chebyshev Scalarisation Method.

In this paper, we proposed two methods for FFLR analysis. Contrary to existing methodologies that use crisp-valued distance functions, we formulated the FFLR problem as a FFMOLP problem. Fuzzy linear and Chebyshev scalarisations were proposed to solve the FFMOLP problem using a lexicographic method for FFLP. The proposed methods were compared with a recently published method and showed promising results. In a future work, we plan to conduct an extensive simulation study and consider real-world applications to gain more insights into the performance of the proposed methods. In addition, the use of fuzzy-valued distance functions for FFLR analysis will be investigated.

557) LR (38.684, 11.941, 12.009) LR (37, 12, 12) LR (36.090, 11.285, 12.074) LR (35.668, 13.370, 12.557) LR (36.890, 11.890, 12.009) LR (60, 11, 12) LR (59.973, 12.457, 9.485) LR (59.387, 13.753, 10.754) LR (58.658, 11.549, 10.187) LR (59

A flexible fuzzy regression algorithm for forecasting oil consumption estimation

Fuzzy least absolute deviations regression based on the ranking of fuzzy numbers

Fuzzy linear regression with spreads unrestricted in sign

Fuzzy regression models using the least-squares method based on the concept of distance

Forecasting methods using fuzzy concepts

Fuzzy regression analysis: systematic review and bibliography

A mathematical model for solving fully fuzzy linear programming problem with trapezoidal fuzzy numbers

Operations on fuzzy numbers

An effective computational attempt for solving fully fuzzy linear programming using MOLP problem

A new algorithm to solve fully fuzzy linear programming problems using the MOLP problem

Ranking fuzzy numbers based on lexicographical ordering

Fully fuzzified linear programming, solution and duality

Solving a full fuzzy linear programming using lexicography method and fuzzy approximate solution

Different distance measures for fuzzy linear regression with Monte Carlo methods

Mehar's method for solving fully fuzzy linear programming problems with L-R fuzzy parameters

A new method to find the unique fuzzy optimal value of fuzzy linear programming problems

Fuzzy least-absolutes regression using shape preserving operations

A new fuzzy regression model based on least absolute deviation

A method to find the unique optimal fuzzy value of fully fuzzy linear programming problems with inequality constraints having unrestricted L-R fuzzy parameters and decision variables

An epsilon-constraint method for fully fuzzy multiobjective linear programming

Econometric and fuzzy models for the forecast of demand in the airport of Rhodes

Least-squares approach to regression modeling in full interval-valued fuzzy environment. Soft Comput

Multiobjective fuzzy linear regression analysis and its application

Short-term load forecasting for the holidays using fuzzy linear regression method

Linear regression analysis with fuzzy model

Some remarks on distances between fuzzy numbers

Ranking fuzzy number based on lexicographic screening procedure

Total orderings defined on the set of all fuzzy numbers

Reasonable properties for the ordering of fuzzy quantities (I)

Reasonable properties for the ordering of fuzzy quantities (II)

Fuzzy sets

Affordable levels of house prices using fuzzy linear regression analysis: the case of Shanghai

Acknowledgements. The research of José Luis Verdegay is supported in part by project TIN2017-86647-P (Spanish Ministry of Economy and Competitiveness and FEDER funds from the European Union).