key: cord-0046459-0t6fse01 authors: Ushakov, Anton V.; Vasilyev, Igor title: Locating Facilities Under Deliberate Disruptive Attacks date: 2020-05-20 journal: Mathematical Optimization Theory and Operations Research DOI: 10.1007/978-3-030-49988-4_25 sha: 4a18d01aad775e35c8adb3aa6733dae86aaaca34 doc_id: 46459 cord_uid: 0t6fse01 Facility disruptions or failures may occur due to natural disasters or a deliberate man-made attack. Such an attack is known as interdiction. Recently, facility location problems, addressing intentional strikes against operating facilities and strategies to reduce their impact, have received particular attention. In this paper, we present a new location-interdiction median problem aimed at designing a distribution network which is robust to the worst-case, long-term facility losses. We suppose that there are two players: defender (system designer) and attacker. The defender decides where to locate facilities to minimize the overall cost of supplying the demands of customers. The attacker determines which r facilities to interdict to maximize the cost of serving the customers from the remaining operational facilities. Note that we suppose that the facilities are attacked simultaneously and interdicted facilities become unavailable. We propose bilevel and single-level integer formulations of this problem. For a particular case when the attacker hits a single facility, we develop a fast local search procedure based on implicit enumeration of interdiction strategies. We test our approaches in a series of computational experiments on well-known test problems. Facility location problems are ones of the most widely studied problems in combinatorial optimization and integer programming. Facilities and the corresponding connecting infrastructure are the main ingredients in any production, distribution, or service system and supply chains [29] . A traditional approach to the design of such systems assumes that all their components always remain operable. However, some accidents, like natural disasters or man-made attacks, can substantially reduce the system efficiency and even make it incapacitated. For example, the current coronavirus outbreak is disrupting the global supply chains from tech industry (production of cell phones and electronics) to fashion and car manufacturing. Hyundai and Nissan announced that they would stop manufacturing in South Korea, Europe, and the US because of a lack of parts from China [5, 8] . Meanwhile, Tesla considers to delay the production in Shanghai for at least one week. The disruptions can also be man-made. For example, the current French pension reform strike is the longest French strike over the last 50 years, which results in billion losses. Given a service system, one may expect two main types of disruptions: facility disruptions (loss of production capabilities) and failures of links connecting facilities and customers. In the field of facility location, reliability and sustainability issues of service systems are extensively studied in the scope of the so-called reliable facility location. Such kind of models usually assume that the system components have some inherent probability of failure. Thus, the aim is to maximize the expected system efficiency, e.g. minimize the expected service cost in case of possible failures. The stochastic approach may be quite useful when modeling sudden losses of components due to natural disasters or some inherent system failures. However, disruptions may be caused not only by natural disasters (like earthquakes or fire) but also by disruptive man-made actions (sabotage, deliberate attacks). Intentional attack on a supply network is also called interdiction [11] . Interdiction models are aimed at identifying critical infrastructure elements such that the loss of them due to an attack causes the maximum possible harm and makes a system much less efficient. Originally, such models were mostly studied in military planning applications. Indeed, it is natural to identify the places for interdicting enemies supply lines in order to inflict the maximal damage. The first interdiction problem was considered in [22] where the aim of interdiction was to disrupt enemy supply flows with a limited budget available [35] . Note that most of the military-induced applications are focused on interdicting links of transportation networks (e.g. shortest paths) [18] . Recently, the interest has revived for interdiction problems due to fast growing threat of terrorist attacks (especially, after the September 11 attacks). Nowadays, we are witnessing how vulnerable may be facilities to such threats. International counter-terrorism efforts are forcing the terrorist groups, like al Qaeda, transfer their attention from so-called hard targets (embassies and military assets) to soft targets (civilian facilities). The main goal of such attacks is to engender the largest possible harm. For a service system, the harm is viewed as the loss of service coverage or efficiency (increase in the overall service costs). The first facility interdiction models were addressed in [11] where the r-interdiction median and covering problems were proposed. For example, the r-interdiction median problem is to find which r existing facilities to remove in order to decrease the efficiency of the existing supply system the most. The next research in this field was focused on developing strategies of hedging against the worst-case attacks. Thus, in [10] , the authors extend the rinterdiction problem by the possibility of fortifying (protecting) some existing facilities so that they become immune to attacks. The problem is then modeled as a bilevel integer program involving two players: attacker and defender [27] . This problem has received great attention and a number of effective solution algorithms has been developed. Among most effective are the implicit enumeration algorithm [27] , an exact approach based on reformulating the problem as the maximal covering problem [28, 36] , and a branch-and-cut algorithm from [26] . Numerous papers addressing various bilevel facility location protection-interdiction problems have been published since then, e.g. hub interdiction median problems [15, 24] , the stochastic r-interdiction median problem with fortification [19] , a r-interdiction median problem involving a defender's budget for fortification and penalties for serving the demands of customers from interdicted facilities [4] , a partial interdiction model where some demand after interdiction can be outsourced at some cost [1] , an interdiction problem with hierarchical system of facilities [7] , a more general median protection/interdiction problem that involves outsourcing, capacitated nested hierarchical facilities, and the budget of interdiction [14] , etc. The aforementioned fortification-interdiction models assume that there is an existing service system and the goal is to thwart intentional attacks by protecting some facilities. Another approach consists in taking into account possible worst-case facility losses in the initial location of facilities. Such problems, known as location-interdiction models [29] , are not widely addressed in the literature. For example, in [23] , the authors extend the r-interdiction covering problem from [11] and introduce a maximal covering location-interdiction problem aimed at maximizing the coverage of customers if the most critical facilities are failed. Note that location-interdiction problems are more difficult to solve than the protection-interdiction ones and most of effective algorithms developed for protection problems are not applicable to them [29] . Other closely related problems are aimed at combining both location and protection decisions. An example is the model proposed in [2] , where capacitated facilities may be located either protected or unprotected. In [3] the problem is to first locate a given number of facility followed by protecting some of them with a limited protection budget. In this paper we propose a new facility location-interdiction model by extending the r-interdiction median problem in the way closely related to [23] . We formulate this problem as a bilevel integer linear program where the upper-level player (defender) finds p sites for locating facilities in order to minimize the overall cost of serving customers, taking into account that the lower-level player (attacker) hits some r facilities in order to inflict the maximal possible damage. We demonstrate that this problem can be reformulated as a single-level integer program so that small-and medium size problem instances can be solved with a general purpose commercial solver. For a particular case when the attacker hits a single open facility, we develop a fast local search heuristic. Our model relies upon the uncapacitated p-median problem. It is one of the basic and widely studied facility location problems. It has been applied in designing service and distribution systems in private and public sectors [20] . It is also a powerful modeling tool with applications to production [9, 21] and machine learning [16, 30] . We suppose that there is a set I (|I| = m) of potential facility location sites and a set J (|J| = n) of customers. We denote as d ij the shortest distance (service cost) between a customer j and the facility located at cite i. As for the p-median problem, each customer is served by the closest open facility. If for a customer, the closest open facility is interdicted, she/he is reassigned to the closest open non-interdicted facility. The interdicted facilities become completely unavailable. As the attacker has some limited resources for interdiction, we assume that only r out of p open facilities can be attacked. Note that we suppose that the attacker has perfect information about where the facilities are located, hence he/she always hits only open facilities. We also suppose that all r facilities chosen by the interdictor are attacked simultaneously, thus the defender has no way to thwart any losses. Our problem can be viewed as a static Stackelberg game where two players make sequential decisions. The first player (leader or defender) first decides where to locate p facilities in order to minimize the overall sum of distances between customers and their closest open facilities, taking into account that some facilities may be interdicted; the second player (follower or attacker), knowing where the first player has located the facilities, tries to inflict the maximal possible harm to the service system by attacking r of them. Thus, some customers have to be reassigned to more distant facilities, which reduces the system efficiency. Let us introduce the following decision variables: if a facility is located at site i; 0, otherwise. x ij = 1, if a customer j is assigned to the facility located at site i; 0, otherwise, , if a facility located at site i is interdicted; 0, otherwise, Furthermore, to ensure the assignment of customers to the closest facilities, we define the set W ij = {k ∈ I : d kj > d ij }, i.e. the set of potential sites which are farther from customer j than site i. The bilevel interdiction-location median problem can be written as follows: where x * ij (y) is a solution of the attacker's problem: Note that the defender and the attacker has the opposite objectives. The system planner attempts to locate p facilities to minimize the overall sum of distances between customers and the facilities remained after eliminating r of them. The interdictor determines which open facilities to hit in order to maximize the cost of serving customers. Constraints (2) set the number of facilities opened by the defender. Constraints (5) and (6) are the standard facility location constraints guaranteeing that each customer is served by exactly one open facility. Constraints (7) and (8) impose that the attacker can hit only r out of p open facilities. Finally, constraints (9) ensure that each customer is served from the closest open facility after attack occurs. Indeed, if a facility is located at site i and not interdicted, a customer j cannot be served by a more distant facility than i. As was noted in [27] , these constraints can also be adapted to settings where customers are not supposed to be served from the closest open facilities. Instead, they may have some facilities preference ordering known for both the defender and the attacker (e.g. see [31] [32] [33] ). Bilevel programming problems are usually very hard to solve, especially when the problems on both levels are mixed-integer programs. In facility location, bilevel problems usually arise in competitive location when there are two players that compete for serving the demand of customers from a market [17] . A good example of competitive facility location problems is the well-known (r|p)-centroid problem [6, 12] . This problem is known to be Σ P 2 hard [13] . As was noted above, the location-interdiction problems are much more challenging than widely studied fortification-interdiction problems. Their structure hinders from application some very efficient exact approaches based on implicit enumeration and decomposition techniques. In this section we demonstrate that the introduced location-interdiction median problem can be formulated as a single-level integer program. Thus, it can be solved by a commercial branch-and-bound solver. For our formulation, we denote by H the set of all interdiction patterns, i.e. it consists of all possible strategies of interdicting r out of m potential location sites. We also denote by h the index corresponding to a specific interdiction strategy and by I h the set of interdicted sites in pattern h. Our formulation uses the variables y i that have the same definition as those in formulation (1)- (10) . We also introduce the following binary variables: 1, if a customer j is assigned to the facility located at site i when interdiction pattern h occurs; 0, otherwise, Thus, we can formulate our problem as follows: i∈I The objective function (11) is to minimize the overall cost z of serving customers from operational facilities remained after the worst-case attack on r potential location sites. Constraints (13)- (14) have the same definition as those in formulation (1)-(10), i.e. they ensure that each customer j must be assigned only to one open facility when interdiction pattern h occurs. Constraints (12) define the cost of serving customers from a given set of facilities if r sites corresponding to pattern h are lost. Note that given a set of facilities, if none of them are located at the sites from interdiction pattern h, then the cost z is equal to the p-median objective value. Finally, constraints (16) ensure that customers cannot be served by facilities located at the sites interdicted in pattern h. Note that the model actually involves the exponential number of variables and constraints. However, the common assumption is that terrorist groups usually have limited resources and are able to simultaneously hit only a small number of facilities [27] . In this case, one can determine all possible m r interdiction patterns in advance. In particular, if only one facility is supposed to be attacked, the model contains O(m 2 n) decision variables and constraints. Bilevel integer problems are quite challenging for exact methods, especially in case of real-life instances of large size. Thus, the development of heuristics to find quality feasible solutions in reasonable time is quite natural. The main idea is to use a local search procedure which attempts to improve an initial solution by searching its neighbors. The location-interdiction median problem is based on the well-known p-median problem, hence the simplest and natural strategy is to develop a local search procedure over the SWAP neighborhood. It consists of all defender's solutions which are obtained from a current incumbent by closing one facility and opening another one in a different place. In order to compute the defender's objective value for a given set of open facilities (the set of variables y i ), we actually have to find an optimal solution of the attacker's problem: i: yi=1 k∈Tij (y) i: yi=1 where T ij (y) is the set of open defender's facilities which are farther from customer j than facility i. This problem can be solved by a commercial branch-andbound solver. On the other hand, in some applications where the numbers of both open and interdicted facilities are small, the attacker's problem can effectively be solved by enumerating all p r possible facility losses. In this section we focus on a special case of the attacker's problem where r = 1. Our main goal is to develop a local search procedure which is faster than the naive implementation based on explicit enumeration. Throughout this section we use the following notations. We denote as d(i, j) the distance (service cost) between a customer j and a location i. We suppose that S is a solution of the defender's problem, i.e. S is any subset of I consisting of p elements. The closest open facility in S for customer j is denoted as c 1 (j). To perform a fast interchange, our algorithm needs the second and the third open facilities to j that we denote as c 2 (j) and c 3 (j), respectively. Searching the swap neighborhood, the algorithm picks one candidate facility to add to the current solution and one to drop from it that we refer to as f i and f r , respectively. Following [25] , we assume that the distance between any facility site and any customer can be computed in O(1) time. For example, this is the case when the distance matrix is already calculated. Note that for the p-median problem, there are several fast implementations of the swap-based local search, including prominent Whitaker's implementation [34] and a fast implementation from [25] . Given a solution S, we need to find a neighborŜ providing a better value of the defender's objective or to check that such an improving neighbor does not exist. Thus, we have p sites for dropping and m − p sites for insertion. Given a pair (f i , f r ), we have to compute the gain of swapping f r with f i . A straightforward implementation assumes that, for each possible swap (f i , f r ), we have to solve the attacker's problem, i.e. to enumerate all possible strategies of interdicting facilities in the solution obtained after swapping. If r = 1, the number of interdiction patterns is equal to p. However, the computation time can be further reduced by avoiding the explicit enumeration needed to find an optimal solution of the attacker's problem. Indeed, the value of the defender's objective function z lead for a solution S can be written as follows: where z med is the value of the p-median objective. In other words, the cost that the defender will get after a disruptive strike can be computed as the sum of costs for serving customers from the closest open facilities plus the cost induced by the customers whose closest facility was lost due to attack. Thus, for each pair (f i , f r ), we must compute the gain from swapping f i and f r according to both the p-median objective and the value induced by attack: where medcost(f i , f r ) is computed as follows: In other words, if the closest facility of a customer j is not replaced (it is not f r ), the customer either reassigns to f i , if it is closer, or stays assigned to the current facility. If customer j is served from f r , then he/she assigns to the closest open facility: f i or c 2 (j). If medcost(f i , f r ) < 0, then we can gain from such a swap with respect to the p-median objective. The value of losscost(f i , f r ) can be computed in a similar way (see Algorithm 1). Finding an optimal solution of the attacker's problem when assessing the candidate pair (f i , f r ) 1: for all (i ∈ S) closed(i) := 0 2: for all j ∈ J do 3: if c1(j) = fr then 4: (c1(j) , j) and d(fi, j) < d(c2(j), j) then 10: The main idea is to implicitly calculate the costs of all interdiction patterns, considering each customer j independently. Note that the attacker can cause a harm to customer j if he hits the closest open facility to j. If the closest facility to j is not dropped, f i may be either the closest to j or the second closest. In the first case, the attacker may inflict damage only by hitting f i , hence j remains assigned to the same facility (she/he was assigned before swapping). In the second case, the attacker striking the closest facility c 1 (j) forces the customer be served from the next facility: f i . There is also another option. The candidate f i may also be farther than the second closest facility c 2 (j). In this case, if c 2 (j) is dropped, the attacker engenders a harm by hitting the closest facility c 1 (j) and forcing the customer assign to the third closest facility c 3 (j). Otherwise, if c 2 (j) is not removed, the customer assigns to it in case of attack. If the closest facility c 1 (j) is dropped, there are two options. The new facility f i is closer than c 2 (j), thus an attack on f i results in assigning the customer to the second closest facility c 2 (j). If the new facility is farther than c 2 (j), the customer in case of attack on c 2 (j) would be served from the third closest facility: c 3 (j) or f i . In all cases described above, for each customer j, only an attack on the closest facility (after swapping) can leads to reassigning the customer to the next closest facility. Thus, closed in Algorithm 1 accumulates the contribution of each particular customer to the cost of all possible interdiction patterns. The values of medcost(f i , f r ) and losscost(f i , f r ) can obviously be computed in O(n) time for each candidate pair. As the total number of swaps is p(m − p), each iteration of local search takes O(pmn) time. Note that the standard implementation of swap-based local search for the p-median problem requires the same time per iteration. After a swap is accepted, updating the closest facilities for each customer requires O(pn) time in the worst case. In this section we report some computational experiments to test the proposed integer linear program as well as the local search procedure devised. We utilized several widely used test problem instances from the well-known TSPLIB 1 . Our integer linear model was set up using IBM ILOG Concert technology and C++ programming language. It was then solved with IBM ILOG CPLEX 12.8.0 2 , freely available for non-commercial research. We implemented the local search procedure using C++ programming language as well. Note that our local search is not parallel, while CPLEX was allowed to use four parallel threads. We suppose that the number of interdictions r is equal to 1. The initial solutions for the local search were chosen at random. We performed only one run of the local search on each particular test problem. The results are presented in Table 1 , where the first column shows the problem name, column p contains the number of defender's facilities to be located. In Objective values, we report three following values. Column P med shows the optimal value of the p-median objective, i.e. the service cost without the threat of possible attacks. Columns IntrCP LX and LS demonstrate the values of the defender's objective function found by CPLEX and the local search, respectively. Columns T ime cplx and T ime LS represent the running times of CPLEX and the local search. We can observe that CPLEX requires very much time to find an optimal solution of the single-level location-interdiction median problem. For example, CPLEX spent more than almost two hours to solve berlin52 instance involving 52 customers and 5 facilities. Recall that the single-level problem contains very large number of variables and constraints, which makes it much harder than the underlying p-median problem. Observe that the found objective values of the location-interdiction problem are in general much larger than those of the p-median problem, especially when the number of facilities is small. For example, for the problem berlin52, incorporating risks of a possible deliberate attack when locating 3 facilities results in increasing the service cost for over 27% with respect to the cost of the location strategy that does not assume facility losses. Thus, if the system planner is riskaverse, she may attempt to mitigate the impact of an attack but it may be quite costly. Note that in case when 8 facilities are located, the increase in service cost is about 20%. However, for the problem fl1400, the increase is only 9% if 200 facilities are located. Thus, when the number of open facilities is small, the system planner may prefer a risky strategy (locating facilities according to the p-median protocol) rather than the robust design of a service system resulting in increase in the service cost. On the other hand, when the number of facilities is relatively large, a robust system can be designed with small increase in the service cost. We see that the local search procedure found the same solutions as CPLEX in most cases (especially for small problem instances). Recall that we ran the local search only once. For berlin52, p = 8, it found a solution that even a little better. Note that this may happen due to a relatively large duality gap inherent in the single-level integer program. Observe that the proposed heuristic is very fast, especially in comparison with the commercial solver. It spent less than one second to find very close to optimal solutions. In this paper we proposed a new location-interdiction median problem based on the well-known p-median problem. This problem can naturally be formulated as a bilevel integer linear program where the system designer makes decision about where to locate facilities; and the attacker interdicts some facilities to engender the maximal harm to the system efficiency. We demonstrated that the problem can be formulated as a single-level integer linear problem that can be solved with a commercial solver. Since it can be utilized to solving only small problem instances, even assuming that only one facility is interdicted, we developed a fast implementation of local search over the basic SWAP-neighborhood. Our further research may be focused on developing alternative formulations of the problem as well as exact solution approaches (branch-and-cut methods). Our research will also aim at extending the proposed technique for larger values of r and developing new heuristics. Finally, the developed local search procedures may be employed to devise various metaheuristics. A bilevel partial interdiction problem with capacitated facilities and demand outsourcing A bilevel fixed charge location model for facilities under imminent attack A bilevel p-median model for the planning and protection of critical facilities The budget constrained r-interdiction median problem with capacity expansion. Cent Weakest link: Global supply chains disrupted by coronavirus An exact method for the discrete (r|p)-centroid problem A bi-level programming model for protection of hierarchical facilities under imminent attacks Automative News Europe: Nissan faces global parts shortage due to coronavirus A decomposition approach for a very large scale optimal diversity management problem Protecting critical assets: the r-interdiction median problem with fortification Identifying critical infrastructure: the median and covering facility interdiction problems A local search heuristic for the (r|p)-centroid problem in the plane On the complexity of the (r -p)-centroid problem in the plane A bi-level model and solution methods for partial interdiction problem on capacitated hierarchical facilities An implicit enumeration algorithm for the hub interdiction median problem with fortification Cluster analysis and mathematical programming Competitive location in cognitive radio networks Shortest-path network interdiction Analysis of facility protection strategies against an uncertain number of attacks: the stochastic r-interdiction median problem with fortification Median problems in networks A three-stage p-median based exact method for the optimal diversity management problem Optimal interdiction of a supply network Designing robust coverage networks to hedge against worst-case facility losses A branch-and-cut algorithm for the multiple allocation r-hub interdiction median problem with fortification A fast swap-based local search procedure for location problems An exact approach for the rinterdiction median problem with fortification A bilevel mixed-integer program for critical infrastructure protection planning An exact solution approach for the interdiction median problem with fortification Location problems under disaster events Automatic defects classification with p-median clustering technique New lower bounds for the facility location problem with clients' preferences Polyhedral study of simple plant location problem with order The branch and cut method for the facility location problem with client's preferences A fast algorithm for the greedy interchange for large-scale clustering and median location problems Analysis of budget for interdiction on multicommodity network flows An exact algorithm for solving the bilevel facility interdiction and fortification problem The reported study was funded by RFBR according to the research project No. 18-07-01037.