key: cord-0818405-xfnqxsat
authors: Frihi, Zahrate El Oula; Barreiro-Gomez, Julian; Choutri, Salah Eddine; Tembine, Hamidou
title: Toolbox to simulate and mitigate COVID-19 propagation
date: 2021-02-18
journal: SoftwareX
DOI: 10.1016/j.softx.2021.100673
sha: a7a3b24d60304fae1acaaba08cc76944b06a6c9f
doc_id: 818405
cord_uid: xfnqxsat

In this paper we present an age-and-migration-dependent Markov-chain-based model for the propagation of the COVID-19, together with a user-friendly MatLab-based toolbox. We present several examples to estimate the propagation of the virus worldwide and in a particular country.

Current code version 1 Motivation and significance COVID-19 is a new disease that was discovered in December 2019 in humans. It is believed that the virus causing the disease jumped from a wild animal to the human body in a local wildlife meat market in Wuhan in China. Since then, the disease started to spread rapidly, from human to human, in China and neighbouring countries such as South Korea. The epidemic was declared as pandemic by World Health Organization (WHO) on 11 March 2020, after reaching tens of countries and leaving hundreds of deaths. As of writing the world counts thousands of deaths and hundreds of thousands of infections in 180 countries and territories. The continents with the biggest number of infections are Aisa (mainly China, Iran and South Korea), Europe (mainly western Europe) and the American continents (mainly USA and Brazil). Since no vaccine has been found yet, countries try to control and slow down the spread of the pandemic by reducing the international mobility and imposing quarantine for infected people or newly arriving persons. Some countries imposed a total lockdown such as Italy and Spain. The rapid international spread of the pandemic and the policies applied by countries to combat it, inspired us to suggest, in this paper, a model for the propagation of COVID-19 via Markov chains over a graph (network) representing the map of the world. Mathematical modeling of infectious diseases has been an important tool in studying the propagation of a number of diseases. Epidemic models aim to answer questions such as: is there a possibility to have a big outbreak of a certain infectious disease? how many infections will we have? how many people should be vaccinated to prevent an outbreak ? The deterministic SIR model is one of the simplest epidemic models where the propagation dynamics of the disease can be defined by 2 differential equations (see e.g. [1] ), one for the number of susceptible people and the other for the infected people. The population is divided into three classes: susceptible, infected and removed (recovered or dead). In this type of model individuals can gain a permanent immunity. There exist other variants of SIR model such as SIS (susceptibleinfected-susceptible) and SEIR (susceptible-exposedinfected-removed). All previous mentioned models assume that people mix homogeneously and the commu- nity is closed (no births, immigration, emigration and deaths). The closure assumption means that the total size of the population is constant, and it is equal to the sum of the number of susceptible, infected and removed people, which makes the derivation of an equation for the number of removed people trivial, given the equations for susceptible and infected people.

The output of the model relies on the assumption that people mix uniformly with each other. However, that might not be sufficient to obtain good results. For example, if we consider an epidemic outbreak in a school (small community), it seems reasonable to assume some uncertainty in the final number of infected people. Moreover, even if the basic reproduction number is larger than one and the community is large but the outbreak is initiated by few infected people, it can be possible, by chance, that the epidemic never takes off [2] . For extensions to more realistic stochastic models, see e.g. [3] . Many stochastic models of infectious diseases are based on continuous-or discrete-time Markov chains. In these models, the states of the Markov chains are the number of people that are susceptible, infected,. . . etc. The major issues with these approaches are the big size of the state space and the large number of the associated Kolmogorov's differential equations describing the spread of the disease. For continuous-time Markov chain models, see e.g. [4] [5] [6] [7] . For discrete-time models, see e.g. [8] [9] [10] .

Another type of epidemic models is the network epidemic model [11] . This model is based on an undirected graph that describes the social structure in the community where the epidemic spreads. The nodes represent individuals and the edges represent the connection between individuals. Most of network-based models consider the propagation as a stochastic process or a mean-field approximation of it. The network epidemic models have other applications such as a malware propagation over computer networks, see e.g. [12, 13] . 

We model the spread of COVID-19 in the whole world via Markov chains over an undirected graph G = (N, E), where N = {1, 2, . . . , n} is the set of nodes that are connected to each other by a set of edges E. The n × n adjacency matrix of the graph is given, for any i, j ∈ N, by: [A] ij = 1 if i, j ∈ E, 0 otherwise. The graph represents the map of the world and each node represents an individual, the state of each individual is described by a Markov chain. Each Markov chain has a discrete state space E = {S, I, R, D} where S is susceptible, I is infected, R is recovered and D is dead. The chain jumps from a state to another following certain transition probabilities. The infection occurs, with a probability α ∈ [0, 1], when a susceptible individual interacts with an infected one from his neighbour N . Any infected individual (or node) can recover with probability γ. The recovered persons do not gain a permanent immunity as they can be reinfected with probability β.

Our Markov chain model takes in consideration the age distribution in each continent. The transition probability from the state I to the state D (the probability δ) is affected by the age. Old infected people have higher probability of death. Therefore, we set 3 different transition probabilities δ 1 , δ 2 and δ 3 for the different age categories (0,40), [40,70) and 70+, respectively. The sum of the three previous transition probabilities equals δ. The 2   1  2  3  4  5  6  7  8  9  10  11  12  13  14  15  16  17  18  19  20  21  22  23  24  25  26  27  28  29  30  31  32  33  34  35  36  37  38  39  40  41  42  43  44  45  46  47  48  49  50  51  52  53  54  55  56  57  58  59  60  61  62  63  64  65 J o u r n a l P r e -p r o o f age distributions for each continent is generated (by default) via normal distributions with suitable parameters. It is possible to change to other distributions depending on the continent. Furthermore, our model incorporates long distance interactions that can happen by travelling inside the same country/continent or to a different country/continent. We do that by considering a wider set of neighbours N ′ . Each person i ∈ N has her own set of local neighbourhood N i and a set of global neighbourhood N ′ i , for simplicity we dropped the subscripts. Next, we introduce the proposed toolbox in order to simulate the propagation of the virus depending on the age and migration constraints between regions.

The toolbox we propose is based on MatLab. It is mainly composed of six functions, which are explained briefly below. For details, refer to the associated toolbox documentation.

The function construct continents(I,resolution grid) is in charge of generating the required data for each (populated) continent/region on the map of the world. This function takes as input the image of the world (or a country) I and the chosen resolution resolution grid of its discretization (the number of nodes associated). As an output, the function returns six clusters (corresponding to each continent) composed of three elements: the number of nodes corresponding to the specific continent, a matrix of x-y coordinates of each node specific to the continent and the corresponding filtered continent from the input image.

The function show map(S,I,R,D,C,color) plots the discretized map by using the nodes constructed by the function construct continents. This functions take as inputs: the set of susceptible people S, the set of infected people I, the set of recovered people R, the set of dead people D, the set of coordinates of the nodes corresponding to the map of the world C and the color of the nodes corresponding to susceptible people. As an output, this function generates the plot of the map with the characteristics selected in the inputs.

The function neighbor(agent,radius,xpos,ypos) allows computing a set around a certain node. This function is fundamental in order to compute the set of susceptible nodes around a sick node to propagate the virus.

It takes the following inputs: a given node/case called agent, a scalar value called radius which determines the size of the neighbourhood of a node (high values for international travels), xpos corresponding to all x coordinates of all the nodes on the map of the world, and ypos corresponding to all y coordinates of all the nodes on the map of the world. This function returns a unique output which is a set of nodes corresponding to the neighborhood set of agent.

The function show age(C,age1,age2,age3) plots the map of the world and showing the distribution of three age ranges defined by age1, age2, and age3. After inserting the distribution of ages for each continent (there are three ranges of age) , this function plots the map of the world and assigning each node an age accordingly.

The function flight connections(A,S NorthAmerica, ...,S Oceania) computes the set of nodes belonging to the regions to which a node can travel.

This functions takes as input the set of nodes (susceptible people) corresponding to each continent and an adjacency matrix A ∈ {0, 1} 6×6 describing the possible intercontinental travel. As outputs, the function returns set of nodes to which an agent belonging to a certain continent can move to another. Each output set corresponds to a continent.

The function update(S,I,R,D,xpos,ypos, flight connections,p,Age) updates the situation of the propagation of the virus after each iteration. This is is done by updating all the input sets for each continents. The input flight connections is the set of agents that can be infected from other continent because of flights and connectivity of continents, whereas p is the set of transition probabilities p = [p1,p2,p31,p32,p33,p4,p5] of the underlying Markov chain. As outputs, the sets of susceptible, infected, recovered and dead are updated.

We present two illustrative examples, the first is a simulation of the propagation of the virus on the map of the world, which is divided into six different continents, the simulation is done with and without intercontinental connections, see 2 and 3. The second example considers a unique country divided into several regions. Let the country be Colombia, which is divided into 32 departments. Therefore, we consider not only the different transition rates for the departments, but also the migration constraints among them. Figure 4 shows the propagation of the virus during a year without quarantine whereas Figure 5 shows the propagation of the virus during a year with quarantine and no allowed domestic flights.

The MatLab toolbox can help in raising research questions about the efficiency of the lockdown and the closure of aerial spaces. Using the toolbox one can simulate the spatial propagation of the disease and compare different lockdown strategies. Furthermore, our algorithm can be easily extended to take into account finer division of the map of the world. One can divide each continent into countries, each country into regions and each region into zones and then find out best lockdown policies by managing the interconnection between all of them. This toolbox simplifies the simulation process of the propagation of COVID-19 (or any other viruses) and made the simulation more realistic by taking into account the age distribution in each continent as well as long distance travels. Our research group at Learning & Game Theory Laboratory uses the toolbox to generate simulations and study the propagation of COVID-19. Moreover, the lab is working on extending it to including different divisions and adding more features such as hospital capacity in each region, economic situation,. . . etc.

We have presented a Markov chain-based model for the propagation of COVID-19 with migration constraints given by a connected graph representing the allowed air traffic. Furthermore, the model takes into account an age-dependent mortality since it has been observed that the elderly people are more prone to death by the virus than younger people. The model is simple and can be easily tuned by means of the transition probabilities of the Markov chain. Besides, we have presented a userfriendly toolbox for the simulation of the propagation throughout multiple regions. The regions can be different continents, countries and/or states in a country. We have presented several examples explaining the use of the toolbox and have also provided the open source codes, which are accessible to any researcher. The goal is that the contribution presented in this article can be used, not only to motivate the research of the propagation of the COVID-19, but also as a model that can be used as a reference in the design of optimal controllers.

No conflict of interest exists: We wish to confirm that there are no known conflicts of interest associated with this publication and there has been no significant financial support for this work that could have influenced its outcome .  1  2  3  4  5  6  7  8  9  10  11  12  13  14  15  16  17  18  19  20  21  22  23  24  25  26  27  28  29  30  31  32  33  34  35  36  37  38  39  40  41  42  43  44  45  46  47  48  49  50  51  52  53  54  55  56  57  58  59  60  61  62  63  64 5   1  2  3  4  5  6  7  8  9  10  11  12  13  14  15  16  17  18  19  20  21  22  23  24  25  26  27  28  29  30  31  32  33  34  35  36  37  38  39  40  41  42  43  44  45  46  47  48  49  50  51  52  53  54  55  56  57  58  59  60  61  62  63  64 3  4  5  6  7  8  9  10  11  12  13  14  15  16  17  18  19  20  21  22  23  24  25  26  27  28  29  30  31  32  33  34  35  36  37  38  39  40  41  42  43  44  45  46  47  48  49  50  51 

Containing papers of a mathematical and physical character

Stochastic epidemic models: a survey

An introduction to stochastic epidemic models

Reproduction numbers and thresholds in stochastic epidemic models i. homogeneous populations

The stochastic si model with recruitment and deaths i. comparison with the closed sis model

Stochastic models of some endemic infections

On methods for studying stochastic disease dynamics

An examination of the reed-frost theory of epidemics

Discrete-time sis models with complex dynamics

Generalized markov models of infectious disease spread: A novel framework for developing dynamic health policies

The structure and function of complex networks

Mean-field-type games in engineering

Modeling malware spreading dynamics

We gratefully acknowledge support from U.S. Air Force Office of Scientific Research under grant number FA9550-17-1-0259