key: cord-155475-is3su3ga authors: Kalogeratos, Argyris; Mannelli, Stefano Sarao title: Winning the competition: enhancing counter-contagion in SIS-like epidemic processes date: 2020-06-24 journal: nan DOI: nan sha: doc_id: 155475 cord_uid: is3su3ga In this paper we consider the epidemic competition between two generic diffusion processes, where each competing side is represented by a different state of a stochastic process. For this setting, we present the Generalized Largest Reduction in Infectious Edges (gLRIE) dynamic resource allocation strategy to advantage the preferred state against the other. Motivated by social epidemics, we apply this method to a generic continuous-time SIS-like diffusion model where we allow for: i) arbitrary node transition rate functions that describe the dynamics of propagation depending on the network state, and ii) competition between the healthy (positive) and infected (negative) states, which are both diffusive at the same time, yet mutually exclusive on each node. Finally we use simulations to compare empirically the proposed gLRIE against competitive approaches from literature. In recent years, the growing amount of available data on networks led to a revolution in the application of diffusion processes. The enrichment of analysis by means of detailed information regarding specific populations yielded to a plethora of realistic and accurate models. Through diffusion models, it is possible to study disparate branches of knowledge: in economy (competition among products [1] , viral marketing campaigns [2] ), in epidemiology (disease spreading, vaccination and immunization problems), in computer science (computer viruses, information flow), in social sciences (social behavior [3] ) and medicine (obesity diffusion [4] , smoking cessation [5] , alcohol consumption [6] ) are just some instances. A large number of social behaviors can be modeled as states propagating over networks [3, 4, 5, 7] . Consequently to the availability of diffusion models, many intervention strategies were developed aiming to answer questions like: What are the most dangerous computers in a network? How to maximize the customer awareness for a product? On which individuals is better to focus to win a poll? Few studies proposed strategies to advantage a state compare to another (like in marketing campaigns) or to mitigate the diffusion of an undesirable state (like in epidemiology). Most of them are static strategies based on the network structure (e.g. [8] ), while others are dynamic strategies that use the whole information about the current state of the system to suggest the best elements to treat. Among them, the Largest Reduction in Infectious Edges (LRIE) [9] results to be the optimal greedy algorithm for resource allocation under limitations in the resource budget, in the N -intertwined Susceptible-Infected-Susceptible (SIS) epidemic model. This model is a two-state continuous-time Markov process over a network, in which a node can change state according to a transition rate that is linear in its neighbors' states. However SIS models have been deemed too simple to describe the complexity of real-world phenomena such as the contemporary presence of two distinct viruses which spread on the same network. In particular, there can be considered two possible cases; in the first an individual can be infected simultaneously by both diseases (e.g. as in the SI 1 I 2 S model [1] ) and, in the second, mutual exclusivity only one infection is allowed for each individual at a given time (e.g. SI 1|2 S model [10] ). Other attempts tried to change the dynamical equations (like in the SISa [3, 7] ). In this study, we propose the Generalized Largest Reduction in Infectious Edges (gLRIE) strategy, which is adapted for the diffusion competition of recurrent epidemics, as well as nonlinearity and saturation of the functions of node transition rates. This strategy includes the LRIE strategy [9] and, as such, provides an optimal greedy approach for this more sophisticated network diffusion setting. gLRIE computes a node score using only local information about the state of close-by nodes. Although in the present formulation the method can be applied to any two-state recurrent Markov process and is easily generalizable to more states, in this work we focus on social behaviors that can be 'healthy' or 'unhealthy' with negative effects in the social environment. Given a limited amount of resources we would like to target the few key-individuals so as to minimize the negative effects. Apart from the mentioned habits affecting one's personal health (e.g. unhealthy diet, smoking, etc.), the recent COVID-19 pandemic highlighted yet another interesting 'unhealthy' misbehavior: the disrespect of confinement under a city lock-down, or of social distantiation guidelines in general. Indeed, this kind of misbehavior is a determinant factor for the reproduction rate (the infamous R t ) of an epidemic over time, and can be readily enforced by making more controls in key areas, or using mobility and contact information at individual level. A graph G = (V, E) is a set of nodes V, let N = |V|, endowed with a set of edges E ⊂ V × V. It can be intuitively represented by its adjacency matrix A = {0, 1} N , where each A ij element is 1 if (i, j) ∈ E, and 0 otherwise. Without loss of generality, we refer to undirected graphs without self-loops, i.e. A = A T and A ii = 0, ∀i = 1, ..., N . The neighborhood of node i is the set of all nodes connected to it with a direct edge, and is denoted by N i = {i k , ∀k ∈ {1, ..., d i } : (i k , i) ∈ E}. The size of N i equals to the node degree, i.e. |N i | = d i = j A ji . We also denote the indicator function by 1{·}. The standard continuous-time homogeneous SIS model describes the spread of a disease over a graph, where each node i represents an individual that can be in either the susceptible or the infected state: X i (t) = 0 or 1, respectively). The system at time t is hence globally represented by the node state vector X(t) ∈ {0, 1} N . The state of a specific node i evolves according to the following stochastic transition rates: where the parameters β, δ are the transition rates encoding respectively the infection aggressiveness and self-recovery capability of nodes. The epidemic control is realized by the resource allocation vector R(t) ∈ {0, 1} N , whose coordinate R i (t) = 1 iff we heal node i at time t, and 0 otherwise. Finally, ρ is the increase in recovery rate when a node receives a resource unit (thought as treatment). A generic two-state recurrent model. In this paper we study the dynamic epidemic suppression problem by first introducing the following generic two-states Markovian process: (II.2) I i and H i are two node-specific memoryless functions; respectively the infection rate function and recovery rate function for node i. The rate functions depend on the current overall network state X(t) and implicitly on the network structure (we omit this dependency in our notation). A Markovian Poisson process can be recovered using the rate functions of Eq. II.2 as follows: In the Dynamic Resource Allocation (DRA) problem [9, 11] , the objective is to administer a budget of b treatment resources, each of them of strength ρ, in order to suppress an undesired states diffusion. The treatments can not be stored and their efficiency is limited to a certain value. In [9] , a greedy dynamic score-based strategy is developed, called Largest Reduction of Infectious Edges (LRIE), in order to address the DRA problem. Specifically, each node is associated with a score quantifying how critical it is for further spreading the infection of the standard SIS model, Eq. (II.1). Other score-based solutions have been proposed, e.g. based on fixed priority planning [11] , or static ones based on spectral analysis [8] (see details in Sec. IV). The proposed Generalized Largest Reduction in Infectious Edges (gLRIE) strategy, each time identifies and targets the most critical nodes in order reduce the disease in as quickly as possible. The idea generalizes the one introduced in [9] as it to a wider range of models. Let N I (t) . = i X i (t) be the number of infected nodes at time t. In a Markovian setting, given the state of the population X at time t, the best intervention with respect to the resource allocation vector R would minimize the following cost function: where γ can be chosen so as to give emphasis on short-term effects [12, 13] . Expanding in series with respect to u, yields: The detailed evaluation of the three terms can be found in the Supplementary Material 1 . Here, though, we present the final results. To simplify our notation, we denote the updated transition rate of node j if node i is considered healthy, respectively: for the positive diffusion by We define accordingly the differences in these rates: Then, the final forms of the derivatives are: In the third equation, and since our purpose is to minimize Eq. III.1 with respect to R i , we let the terms that are independent to any R i to get absorbed in the function Ξ(t). The terms of the expansion provide information about the way in which healing a node affects the cost function: the first order does not provide any new information, the second order suggests something as trivial as to heal only infected nodes, while the third order quantifies the contribution of healing a specific node in reducing the cost function. Based on Eq. III.4 we derive the following score for each infected node i: The score has the following interpretation. We can identify two main parts: the quantification of the transition rate H i + I i of the node, and the effect of its recovery on the neighbors On the one hand, if a node could get easily reinfected (high I i value) or is going to be healed rapidly by either the self-recovery or the positive diffusion (high H i value), then it is not a good candidate to invest resources on. On the other hand, if a possible node recovery would largely increase the healing rate of its infected neighbors (low ∆H −i j value), then the node is attributed with a higher score. Finally, if a possible node recovery would largely decrease the infectious rate of its infected neighbors (low ∆I −i j value), then the node gets also higher score. Algorithm. At time t, the gLRIE strategy would take as input the network state X(t)and the budget of resources b. It would independently compute the criticality score of Eq. (III.5) for each node, rank them and finally note with 1's in the resource allocation vector R(t) which nodes to target while respecting the budget, i.e. i R(t) = min(b, i X i (t))). The computational cost of the algorithm is O(N 2 + N log N ). In this section we select competitors from the literature, define specific diffusion functions for the comparison, and present simulations on random and real networks. Other strategies. As a naive baseline, we use the Random Allocation (RAND) that targets infected nodes at random. The second competitor is the Largest Reduction in Spectral Radius (LRSR) [8] , which is based on spectral graph analysis generalized to arbitrary healing effects (ρ = ∞). LRSR selects nodes that maximize the eigen-drop of the largest eigenvalue of the adjacency matrix, known as spectral radius. The next competitor is the Maxcut Minimization (MCM) [11] , which introduces the priority planning approach. The strategy proceeds according to a precomputed node priorityorder, that is a linear arrangement of the network with minimal maxcut, i.e. maximum number of edges need to be cut in order to split the ordering in two parts. The last but most direct competitor is the greedy dynamic LRIE [9] that we generalize in this work. Diffusion function. Generally, the state transition rate for a node can be assumed to be a function either of the absolute number of neighbors in the opposing state (standard for SIS), or of the fraction of those nodes our of all neighbors. Here we take as an example the former option, as the strategies presented in the literature consider that type and it is therefore a fair comparison. Future work could include additional experiments with the latter type. Social behaviors have complex properties that are not covered by the standard SIS models, such as non-linearity and saturation in the node transition rates [7] . We employ sigmoid functions to model these model properties: where n and d − n are the number of infected and healthy neighbors. Also, s I (resp. s H ) parameter controls the saturation level and I (resp. H ) the slope at the origin. Next, we present comparative experiments in Erdös-Rényi (ER), Preferential Attachment (PR), and Small-World (SW) random networks of size 300 nodes each. First we gradually introduce non-linearity in the diffusion, and then we show the effects of introducing also competition diffusion. From linear to non-linear spreading. We first consider only the negative diffusion (i.e. H = 0) and we gradually increase I in an ER random graph, moving gradually from linear (as in the standard SIS model) to non-linear functions. Fig. 1 shows the average over 1,000 simulations of the percentage of infected nodes over time and the 95% confidence interval under the hypothesis of Gaussian distribution. The results show that in the presence of non-linearity our strategy becomes much more efficient than the competitors. Introducing competition. Next, in Fig. 2 we present the effects of the positive diffusion, embedded in the function H, on ER, PR, and SW random networks. The last plot each row shows the shape of the diffusion functions used in the simulations. The simulations show that, unlike gLRIE, the methods of the literature lack modeling power to deal with this complex setting involving non-linearity and competition, and suppress the infection. We performed simulations on the Gnutella 2 peer-to-peer network containing 8,846 nodes and 31,839 edges. Two scenarios were used for the simulations, with and without positive diffusion, using a wide range of parameters. Out of the many possible evaluation metrics for the quality of a strategy, e.g. expected extinction time (EET), final percentage of infection (FIS), area under the curve (AUC), we choose the AUC. This has many advantages: it provides useful measurements even if the strategy did not removed the infection, which is a limitation of the EET metric; it accounts for the total amount of infected nodes in the process, which in a socioeconomic context is more interesting than the FIS metric. The empirical comparison between gLRIE and competitors such as LRIE and MCM is summarized in Fig. 3 In each heatmap, we fix the shape of the transition function H of the positive diffusion and only play with the parameters of the function I of the negative diffusion: its saturation level increases along the x-axis and its slope increases along the y-axis. On the top-left side of a heatmap, the epidemic parameters define a weak infection and any strategy would perform well, while on the bottom-right side the infection becomes hard to completely remove for all strategies (given the amount of resources). Moreover, in the left border, the low saturation level causes I to already saturate with just one neighbor of the opposing state. In the regime where I is almost linear and H = 0, gLRIE and LRIE are equivalent and perform almost the same. The general remark on the results is that gLRIE appears to be the most versatile and best performing strategy in this setting of competitive spreading. In this paper we discussed a general form of recurrent two-states continuous-time Markov process that allows both non-linear node transition functions and competition among the two states. We then proposed the Generalized LRIE (gLRIE) strategy to suppress the diffusion of the undesired state. Experiments showed that gLRIE is well-adapted to the considered setting of competitive spreading, and makes better use of the resources available by targeting the most critical infected nodes compared to competitors from literature. Future work could generalize to more competing epidemic states and the incorporation of factors related to the network structure in the node scores. Winner takes all: competing viruses or ideas on fair-play networks The dynamics of viral marketing Emotions as infectious diseases in a large social network: the SISa model The spread of obesity in a large social network over 32 years The collective dynamics of smoking in a large social network The spread of alcohol consumption behavior in a large social network Infectious disease modeling of social contagion in networks On the vulnerability of large graphs A greedy approach for dynamic control of diffusion processes in networks Interacting viruses in networks: can both survive? Suppressing epidemics in networks using priority planning Optimal control of epidemics in metapopulations Optimizing the control of disease infestations at the landscape scale