key: cord-0165123-fxqjywdq
authors: Hurd, T. R.
title: COVID-19: Analytics Of Contagion On Inhomogeneous Random Social Networks
date: 2020-04-06
journal: nan
DOI: nan
sha: 5bbb857e2d1c399eccdcb348d16a7e5659713480
doc_id: 165123
cord_uid: fxqjywdq

Motivated by the need for novel robust approaches to modelling the Covid-19 epidemic, this paper treats a population of $N$ individuals as an inhomogeneous random social network (IRSN). The nodes of the network represent different types of individuals and the edges represent significant social relationships. An epidemic is pictured as a contagion process that changes daily, triggered on day $0$ by a seed infection introduced into the population. Individuals' social behaviour and health status are assumed to be random, with probability distributions that vary with their type. First a formulation and analysis is given for the basic SI ("susceptible-infective") network contagion model, which focusses on the cumulative number of people that have been infected. The main result is an analytical formula valid in the large $N$ limit for the state of the system on day $t$ in terms of the initial conditions. The formula involves only one-dimensional integration. Next, more realistic SIR and SEIR network models, including"removed"(R) and"exposed"(E) classes, are formulated. These models also lead to analytical formulas that generalize the results for the SI network model. The framework can be easily adapted for analysis of different kinds of public health interventions, including vaccination, social distancing and quarantine. The formulas can be implemented numerically by an algorithm that efficiently incorporates the fast Fourier transform. Finally a number of open questions and avenues of investigation are suggested, such as the framework's relation to ordinary differential equation SIR models and agent based contagion models that are more commonly used in real world epidemic modelling.

1. Introduction of the inhomogeneous random social network (IRSN) framework that provides a flexible and scalable architecture for modelling complex network characteristics. In particular, we will develop infection cascade models for networks of individuals classified by arbitrary types.

2. The large N asymptotics for infection cascades in IRSN models is developed, yielding explicit and efficiently computable recursive probabilistic formulas for the daily update of the state of the system, in particular, the day-by-day changes in the fraction of the individuals of each type that are susceptible, infected or recovered.

3. We provide details of how the contagion analytics can in principle be used to provide large scale investigations into potential policy interventions that one might invoke to mitigate or suppress the progress of the contagion.

4. Overall, the intent of this new framework is to provide a purely analytical toolkit for networks, capable of handling thousands of different types of individuals, that can run on a laptop. The network framework is capable of providing much faster results, with a similar degree of accuracy, than is possible with large-scale agent-based epidemic models normally used for informing health policy.

Studying the spread of infectious diseases using the tools of network science has a substantial literature, reviewed for example in Keeling and Eames (2005) . The book Newman (2010) provides a broad overview of networks in all areas of science, including applications to epidemic modelling, while Pellis et al. (2015) explores current challenges in network epidemic models. The framework developed here is a variation of the network cascade model of Watts (2002) , generalized to allow for random edge weights as in Hurd and Gleeson (2013) .

IRSN models for any value of N can always be explored by pure simulation, as in agentbased modelling. Alternatively, sequences of IRGs parametrized by increasing N have an important property called locally tree-like (LT). As described by Aldous and Steele (2004) and others, this property means that the random graph sequence is "locally weakly convergent" as N → ∞ to a connected Galton-Watson random tree, leading to a host of simplifications that are described by percolation theory on random graphs. The LT property of IRGs implies for example that for any k > 1, the density of cycles of length k in the graph goes to zero as N goes to infinity. The heuristic large N cascade arguments at the core of this paper are related to rigorous results in the literature surveyed by van der Hofstad (2016) that connect percolation properties on random graphs to properties of Galton-Watson processes.

Section 2 of the paper introduces inhomogeneous random social networks (IRSNs) and defines the basic SI infection cascade mechanism on such networks. Section 3 explores the large N analytical properties of the model, including the large scale "locally tree-like" structure of the skeleton. It then focusses on the recursive characterization of the stochastic infection cascade mapping in the N = ∞ limit. Section 4 discusses how the network approach to epidemiology fits in with two other approaches to infectious disease modelling: SI compartment models and agent based models. Section 5 explores how the basic SI network model can easily be extended to analogues of the SIR and SEIR compartment models. Section 6 provides a numerical implementation of the SI model, showing the flop count for computing a daily update to be O(M 2 × Nfft) where M is the number of types and Nfft is the number of lattice points in each one-dimensional integration. Section 7 addresses the issue of calibrating ISRN models to real health and social data. In Section 8, we explore a simple illustration of how the method can be used to understand potential policy interventions to protect the residents of a seniors' centre while a pandemic rages in the community outside. Finally, a concluding section discusses some possible next steps for having a better understanding of contagion risk in inhomogeneous random social networks.

1. For a positive integer N , [N ] denotes the set {1, 2, . . . , N }.

2. For a random variable X, its cumulative distribution function (CDF), probability density function (PDF) and characteristic function (CF) will be denoted F X , ρ X = F X , andf X respectively. Note thatf X = F(ρ X ) where F denotes the Fourier transform.

3. For any event A, 1(A) denotes the indicator random variable, taking values in {0, 1}.

4. Any collection of random variables X = (X 1 , X 2 , . . . ) generates a sigma-algebra (or informally "information set") denoted by σ(X).

6. The L 2 Hermitian inner product of two complex valued functions f (x), g(x) on a domain D is defined to be f, g

This section provides the core modelling assumptions of the framework, in a simplified susceptible-infected (SI) setting in which infected individuals never recover from the disease, and continue to infect susceptible individuals indefinitely. More realistic infection mechanisms will be considered in later sections. A social network represents a population of individuals as nodes of a graph, whose undirected edges represent the existence of a substantial social connection at a moment in time. Our network setting for the spread of an infectious disease begins with a number of preliminary assumptions:

1. The population is classified into a large but finite disjoint collection of "types" that represent people's important attributes, such as age, gender, living arrangement, profession, country and location.

2. In this paper, the network of social contacts, initially random, is taken to be constant during the epidemic; the mathematical methods adapt to time-varying networks.

3. At the start of the outbreak on day 0, most of the population is susceptible ("S"), but a small "seed" of infected individuals is placed in the network.

4. Infected individuals pass on a random viral load to each of their social contacts.

5. An individual's state of health is represented by a random "immunity buffer". Over time they experience an accumulation of viral load; if the total viral load exceeds their buffer, they become infected "I".

The social system at any moment in time will be represented as an inhomogeneous random social network, or IRSN. This is the specification of a multidimensional random variable that captures two levels of structure. The primary level of the IRSN, called the skeleton graph, is an undirected random graph with N nodes labelled by v ∈ [N ], representing people of a wide variety of types, and where an un directed edge labelled by (vw) 

represents the existence of a significant social interconnection between w and v, such as a family relationship. The secondary layer specifies the health and mutual exposures of people, conditioned on knowledge of the skeleton graph.

Inhomogeneity in the IRSN model arises through classifying people by a finite (possibly very large) number of types that can include a wide range of attributes. The collection of random types T := {T v } v∈[N ] will be assumed to completely determine the dependence structure of the remaining random variables. In other words, the remaining random variables will exhibit conditional independence with respect to the sigma-algebra or "information set" σ(T ) := σ(T v , v ∈ [N ]) .

The skeleton graph is modelled as an undirected inhomogeneous random graph (IRG), generalizing Erdös-Renyi random graphs, in which edges are drawn independently between unordered pairs of nodes, not with equal likelihood but with likelihood that depends on their types. This class has its origins in Chung and Lu (2002) and has been studied in generality in Bollobás et al. (2007) and the textbook van der Hofstad (2016). The IRG structure arises by the assumption that edge indicators are Bernoulli random variables I vw = I wv defined for unordered pairs of individuals (v, w) , that are independent conditioned on the assignment of node types.

Assumption 1 (Skeleton Graph). The primary layer of an IRSN, namely the skeleton graph IRG(P, κ, N ), is an inhomogeneous random graph with N nodes labelled by v ∈ [N ]. It can be defined by two collections T , I of random variables T v , v ∈ [N ] and I vw , v, w ∈ [N ], with sigma-algebras ("information sets") σ(T ), σ(I).

1. Each node v ∈ [N ], representing a person, has type T v drawn independently with probability P(T ) from a finite list of types [M ] of cardinality M ≥ 1.

corresponds to a non-zero entry of the symmetric incidence matrix I. For each pair (v, w), I vw = I wv is the indicator for w to be (significantly) socially connected to v. Conditioned on the vector of all types T , the collection of edge indicators I := {I vw } is an independent family of Bernoulli random variables with probabilities

Here κ : [M ] 2 → [0, ∞), the probability mapping kernel, determines the likelihood that two people v, w of the given types have a social connection, or edge. The assumed N dependence assures sparseness of the graph for large N , and for consistency we require that N − 1 ≥ max T,T κ(T, T ).

The additional fundamental assumption of the IRSN modeling framework is that the relevant health attributes of all people are summarized by an independent collection of multivariate random variables, conditioned on the skeleton. Essentially, we will assume: (i) that each individual has a random "immunity buffer", and (ii) in case they are infected, a random viral load will be transmitted to each of their social contacts. Finally, we assume that as soon as a person's cumulative viral exposure exceeds their buffer, then they become infected and infective.

Definition 1.

1. The initial immunity buffer∆ v of node v prior to the crisis is a nonnegative value that represents the resistance of that person to the virus.

2. The nominal exposure pair between w and v is a pair denoted by (Ω vw , Ω wv ) of positive values: Ω vw represents the total viral load transmitted from v to w should v, w be connected (i.e. if I vw = 1), and if v is infected.

3. The health state of the network before the onset of the outbreak is determined by the collection of conditionally independent random variables Ω vw and∆ v .

4. Each person becomes infected at the first time t that ∆ v = 0.

Initial values are considered just prior to the outbreak. The epidemic trigger on day 0 introduces a number of infected individuals in the population:

Definition 2. An epidemic trigger at a moment in time, which we label by day t = 0, occurs when for each T , a specified seed fraction Π (0) (T ) ∈ [0, 1] of all type T individuals are infected.

Now we make some pragmatic probabilistic assumptions about the initial buffer and exposure random variables, conditioned on the vector of individual types T = T vv∈ [N ] .

Assumption 2 (Immunity Buffers and Exposures). The secondary layer of an IRSN, the collection of initial immunity buffers and potential exposures∆ v , Ω vw are continuous nonnegative random variables that are mutually independent, and independent of

2. Immediately after the trigger with initial seed probabilities Π (0) (T ),

3. The initially infected individuals are those with ∆

4. For each edge vw, Ω vw and Ω wv are a pair of random variables. Conditioned on

In summary, a finite IRSN representing the system after a crisis trigger amounts to a collection of random variables {T, I, ∆ (0) , Ω} satisfying Assumptions 1 and 2.

We now consider how such IRSNs will evolve on a day-to-day basis when a trigger infection occurs at time t = 0. Recall that each person becomes infected at the first time t that ∆ v = 0.

The infection state of each individual at day t will be identified by the infection indicator random variable defined by

that takes values either 0 ("susceptible") and 1 ("infected"). The infection state of individual w at day t now influences the infection shock transmitted to another individual v:

The aggregated infection shock transmitted to v is:

and the impacted immunity buffer of v on day t + 1 is:

Putting (4, 5, 6, 7) together gives the complete infection mapping at day t ≥ 0.

The IRSN framework just introduced specifies the joint distributions of the random variables {T, I, ∆ (0) , Ω (0) }, thereby providing a compact stochastic representation of the state of a given real world network of N individuals at the moment an outbreak is triggered. The same distributional data defines a sequence of random networks with varying N . As we will see in this section, the so-called locally tree-like property of the IRG skeleton has very important analytical implications in the limit N → ∞.

The distribution of the number of social contacts of nodes in IRGs, in other words their degree distributions, has a natural Poisson mixture structure in the large N limit. By permutation symmetry, one only needs to consider individual 1 with arbitrary type T 1 = T , whose degree is defined as d 1 = N w=2 I w1 , a sum of conditionally IID random variables. Since e ikI w1 = 1 + I w1 (e ik − 1), each term has the identical conditional characteristic function (CF)

The conditional CF of d 1 is the N − 1 power of this function, and can be written

to display its asymptotic structure as N → ∞.

Proposition 1. The characteristic function of the degree d v of an individual v, conditioned on its type T ∈ [M ], is 2π-periodic on R and has the N → ∞ limiting behaviour:

where λ(T ) = T P(T )κ(T, T ). Here, convergence of the logarithm of (10) is in L 2 ([0, 2π]).

This type of limit can be handled by Lemma 2, stated and proved in the Appendix. Proposition 1 tells us that for different values of T , the conditional degree distribution is asymptotic to a Poisson distribution with mean parameter λ(T ) = T λ(T, T ) where λ(T, T ) = P(T )κ(T , T ). Now, recall that a finite mixture of a collection of probability distribution functions is the probability distribution formed by a convex combination. Thus the asymptotic unconditional degree distribution of any individual is a finite mixture with characteristic function:

Each mixture component has a Poisson distribution with Poisson parameters λ(T ) and the mixing variable is the individual-type T with mixing weight P(T ).

This section provides the most important formula of the paper, namely a characterization given in Section 3.2.2 of the stochastic dynamics of the tth day of the infection cascade defined by equations (4, 5, 6, 7). The formula remains conjectural in the sense that it depends on the asymptotic independence of shocks hitting a given node, an unproven property that nevertheless we expect should result in any cascade setting such as ours where a "tree independent" transmission mechanism acts on a locally tree-like random social network.

Consider for t = 0 the single shock transmitted from 2 to 1 for two typical individuals 1, 2, that is, S

2 as defined by (5). Since e ikI 21 Ω 21 D (0)

2 (e ikΩ 21 − 1), the characteristic function of S (0) 21 conditioned on the type T 1 = T is given for finite N by a sum over the possible types of node 2, T 2 = T :

Next consider the asymptotic distribution of the total infection shock S

transmitted to individual 1 in day 0. For any N , its characteristic function conditioned on the type T 1 = T , is

One can prove that any finite collection of shocks {S

w1 } w =1 are identical and asymptotically independent, conditioned on the type T 1 = T . However, this fact cannot prove the following stronger statement:

where ∼ represents an unproven step. If this unproven step is accepted as a conjecture, then from (13) and the argument proving Proposition 1, the characteristic function of the total infection shock S

w1 transmitted to individual 1 in day 0, conditioned on the type T 1 = T must be:

Finally, the impacted immunity buffer ∆

(1)

1 at the end of day 0 is given by (7). We can see directly that S (0) 1 and ∆ (0) 1 share no common health random variables, and are therefore independent conditionally on the type T of individual 1. From the multiplicative property of characteristic functions of sums of independent random variables, the impacted immunity buffer ∆

(1) 1 has the product conditional characteristic function

By the Fourier Inversion Theorem, one can compute the CDF by taking an L 2 -inner product of the kernel Z(k, x) := e ikx 2πik with the CF of ∆

The conditional infection probability is got from (19) by taking x = 0:

Remark 1. To handle the singularity of Z at k = 0, one can show that under analyticity assumptions, it is sufficient to shift the k-integration slightly into the upper half complex plane. We assume enough regularity that we can do this throughout the paper.

In summary, day 0 of the infection cascade mapping has been broken down into three substeps that capture the probabilistic implications of equations (4, 5, 6, 7). Each of these substeps depends on the initial conditional distributional data for the collection {T v , I vw , Ω vw , ∆ 

In its most reduced form, the proposed infection cascade dynamics is assumed to be given by iterates t = 0, 1, 2, . . . of the mapping from Π (0) to Π (1) defined above. This dynamics also leads to formulas mapping the probability distributions for the collection {∆ (17) and (14) with Π (0) replaced by Π (t) :

2. Compute the univariate distribution of the impacted immunity buffer ∆

1 using the formula (18):

3. Compute the conditional infection probability using formula (20):

The previous derivations lead to recursive formulas (21)-(23) for the cumulative infection probabilities Π (t+1) (T ), with no need to actually computef (t+1) ∆ using (22). This formulation is however too restrictive in general. In particular, it will be necessary to determine the fraction of new type T infectives on day t + 1, that is π (t+1) (T ) :

Note that we can recursively compute the quantitiesf

∆ , π (t) . The importance of the incremental formulation is that it makes it clear how to introduce flexibility to adjust the dynamics in different ways, as we shall explore in subsequent sections.

In a nutshell, the network approach to modelling infectious disease is intermediate in complexity between compartment models, the most popular framework and reviewed in Brauer (2008) , and the more complex agent-based models (ABMs) that underpin much public policy as reported in Ferguson and Ghani (2020) .

Drawing a random sample of the underlying IRSN for a fixed size N , following assumptions 1 and 2, can be thought of as setting the initial conditions for an agent-based SI contagion model (ABM). Equations (4, 5, 6, 7) give the behavioural rules these agents follow to up-date their immunity buffers and decide to become infected. Our large N cascade mapping formulas provide an approximation to day-by-day rates of infection realized on the finite N sample. It is important to understand the key simplifications that underlie this approximation.

The most important simplification is the washing out of correlations between different parts of the network, as exemplified by the terms in the sum (6). A heuristic argument relates the information not accounted for in the approximation to the information lost if we "homogenize" the ABM as follows. Given any realization of node types T v , v ∈ [N ], one has a group of permutations τ of the labels v that preserve the node types, i.e. T τ (v) = T v . Such a τ effectively "rewires" the finite IRG, by mapping any sample of indicators I vw , v, w ∈ [N ] toĨ vw = I τ (v)τ (w) . This rewiring preserves the statistical properties of the skeleton, but breaks all social connections, for example mother-child relations. We can "homogenize" the agent-based model by applying a randomly chosen τ to the skeleton each day of the contagion.

Clearly homogenization leads to an "exchangability" symmetry amongst the nodes within each type T ∈ [M ] that is likely not present in the original random sample. We expect that the original ABM will be well approximated by the homogenized ABM if N is large, and this in turn will be well approximated by the large N asymptotic formulas. Furthermore, this line of thinking suggests that more fine-grained type decompositions reduce the effect of homogenization, and hence lead to more accurate approximations.

We should also consider the relation between the M type IRSN model of contagion with a more conventional compartment SI model with M types. In this setting, the population is modelled by an infinite collection of agents falling into disjoint compartments S T , I T , T ∈ [M ] representing susceptibles and infectives of type T . The standard multi-type SI model follows the ODEs (ordinary differential equations) for the fractional amounts s(t|T ), i(t|T ) subject to the constraints s(t|T ) + i(t|T ) = P(T ) for all t ≥ 0:

Each constant transmission coefficient k(T, T ) of the compartment model represents some average rate that type T infectives infect type T susceptibles. This will be some approximation of the detailed network type-to-type transmission mechanism on an IRSN that intertwines the quantities P(T ), κ(T, T ), ρ Ω (·|T, T ) and the time and type dependent buffer PDF ρ (t) ∆ (·|T ).

The simple SI contagion formulation above assumes that infected individuals never recover, and continue indefinitely to infect other susceptibles. This may be reasonable during the early phase of a contagion, but it is not reasonable over longer periods.

An SIR (susceptible-infected-removed) model arises when we assume that each day a constant fraction β(T ) ∈ [0, 1) of infected type T individuals recover or die. We now define I (t) (T ) to represent the total fraction of infectious individuals at the end of day t, while R (t) (T ) represents the removed (recovered or dead) fraction. If on day t the fraction of new infectives is π (t) (T ), then on day t + 1 we have

The remaining fractions satisfy recursions

When infected by COVID-19, as for other infectious diseases, there is a short period averaging T e ∼ 5.1 days, called the exposed period during which the infected person is not contagious. As in compartment models, it is straightforward to model this additional effect by supposing that all new infections are in the exposed class (E), and each day a fraction γ(T ) of the type T exposed class becomes contagious, moving into the infectious class (I). Of individuals in the I-class, a fraction β(T ) recovers into the R-class. Let E (t) (T ), I (t) (T ), R (t) (T ) denote the fraction of type T individuals in each class at the end of day t. Note that γ(T )E (t−1) (T ) is the fraction of new type T infectives on day t ≥ 1, and thus the type T newly exposed fraction on day t + 1 is

(33) The remaining fractions satisfy recursions:

6 Numerical Implementation

The core of the numerical implementation of the stochastic cascade mapping will be to approximate integrals such as (23) using the Fast Fourier Transform (FFT). The FFT works most effectively on a grid of nonnegative integers we denote by [Nfft] := {0, 1, 2, . . . , Nfft − 1} whose log-size log 2 (Nfft) is a small integer, chosen to compromise between precision and computational efficiency. All immunity buffers and exposures will be taken to have integer values on a smaller grid {0, 1, 2, . . . , deltamax − 1} that represent multiples of a unit of viral dose. That is, we assume that every PDF ρ X can be replaced by a dimension Nfft probability vector with components ρ X (x), x ∈ [Nfft], such that ρ X (x) = 0 for x ≥ deltamax. Here deltamax Nfft is a practical upper bound on immunity: anyone with ∆ ≥ deltamax will be assumed likely to resist infection even when all their social contacts get infected.

The characteristic functionf X is now replaced by the FFTf X := F(ρ X ) of ρ X , defined for each k ∈ [Nfft] byf

Then the inverse FFT ρ X = F −1 (f X ) is given by

With the grid [Nfft] set this way, we can implement the incremental SI infection mapping of Section 3.3 with the following steps:

1. Initialize arrays P, κ, ρ Ω , ρ 4. For each day t = 0, 1, 2, . . . compute recursively the updated arrays bŷ

One sees immediately that for day t the computational complexity is dominated by (37) which amounts to O(Nfft × M 2 ) flops for the complex matrix-vector multiplication, followed by Nfft × M complex exponentiations. Memory usage is dominated by storing the constant matrix R with Nfft × M 2 components. Since Nfft = 2 10 is a typical value, there is clearly no difficulty in computing the general model with several thousand types on an ordinary laptop.

This section addresses some of the issues in implementing the infection cascade model on IRSNs, and its generalizations, for a real world network ofN individuals. The central issue is to construct a sequence of IRSNs of size N increasing to infinity, that is statistically consistent with the real world network when N =N . Then the statistical model for N = ∞ can be subjected to epidemic triggers with any initial infection probabilities Π (0) (T ), and the resultant infection cascade analytics developed in Section 3 will yield measures of the resilience of the real world network.

The type of network data available to policy makers varies widely from one health jurisdiction to another. Here we imagine a minimal dataset forN = T ∈[M ]N T individuals classified into M types labelled by T ∈ [M ], whereN T denotes the number of individuals of type T . Individual type will be assumed not to change over the past N m months. As a first estimation step, we choose the empirical type distribution:

Now suppose for illustration that the interconnectivity, exposures and health statistics of the network have been observed monthly for the past N m months. For any of the monthly observations of the network, edges are drawn between any ordered pair (v, w) of individuals if the exposure of individual w to individual v exceeds a specified threshold (a "significant exposure"). LetÊ = T,T Ê T,T be the total number of significant exposures in the network identified in the N m month historical database, decomposed into a sum over the individual types involved. This data then leads to the empirical connection kernel

Recall from the previous section that buffers and exposures are assumed to take values on the integer grid {0, 1, 2, . . . , deltamax} for some moderately large integer deltamax. For each T → T edge e ∈ [Ê T,T ] we observe the value Ω e , while for each v ∈ [N m ×N T ] we also observe samples ∆ v of the type T immunity levels. Then, in view of the intrinsic uncertainties involved, it is reasonable to infer empirical distributions ρ Ω (·, T, T ) and ρ ∆ (·, T ) from a parametric family of discrete distributions on {0, 1, 2, . . . , deltamax} that match the sample means and variancesμ Ω (T, T ),σ 2 Ω (T, T ),μ ∆ (T ),σ 2 ∆ (T ). The data described above leads to a natural calibration of the pre-trigger IRSN model for any value of N ≥N (including N = ∞) at any time in the near future. The increasing sequence of random IRSN models based on these empirical probability distributions is hoped to capture essential aspects of systemic risk in our specific real world network of sizeN . This hope can be realized if it turns out that the N = ∞ infection cascade analytics provide a reasonably accurate approximation to simulation results for finiteN .

The purpose of this example is to provide an easy-to-visualize context for the IRSN framework, namely the setting of a seniors residence with 100 residents (type T = 1), 50 trained staff workers (type T = 2) within a town of total population N 0 = 10000. We also consider the same IRSN specification scaled up by a multiplier N = kN 0 . In anticipation of an oncoming contagion, the workers have been trained to high standards of hygiene and care and the residents (who are elderly but healthy) have been instructed in social-distancing and hygiene. The townspeople ("outsiders", with type T = 3) on the other hand have only average ability to social distance, and so the contagion hits the town before the centre. The goal of this example is to investigate the vulnerability of the centre to internal contagion starting in the outside town. The benchmark network parameters are given in Table 1 , together with numerical implementation parameters deltamax = 30, Nfft = 256.

The upper left plot of Figure 1 shows the daily infective and removed fractions for the three types, in the benchmark SIR model without further policy interventions. We see that the contagion starts in the outside community, but rapidly invades the centre, resulting in similar infection rates, with a time delay of about 2 days. One can see that the strategy failed for two reasons: first, the contagion was allowed to gain a foothold in the centre and infect a resident; second, the hygiene within the centre was not adequate to contain the resulting seed infection.

What further policy improvements implemented by the management might lead to a better result? The remaining plots in Figure 1 show the results for several combinations of policy interventions. Strategy A is to improve internal hygiene by quarantining all residents and dramatically reducing contacts between workers: λ(1, 1) changes from 4 to 0.5 and λ(2, 2) changes from 5 to 1. Strategy B is to dramatically reduce the connectivity between the centre and the outside: λ(2, 3) changes from 4 to 0.5. We observe that neither A nor B succeeds. Strategy A manages to reduce the contagion to about 37% of the residents, but fails because there is a continual reintroduction of infection from outside. Strategy B also fails: reducing the connections to outsiders simply delays the onset of contagion within the centre by about 10 days. However, the combination of both strategies A and B led to a success in keeping 97% of the residents healthy.

These policy interventions target the social connectivity in the network through social distancing and quarantine. Another important channel would be to reduce the mean viral exposures entering in the exposure PDFs, by measures such as encouraging more cleanliness and the use of masks. Yet another channel is to improve individual immunity buffers by vaccination or other health improvements.

Large N networks typically exhibit "resilient" states that are intrinsically resistant to contagion and "susceptible" states that amplify any introduced infection. Moreover they can be made to transition discontinuously from a resilient state to a susceptible state by varying a key parameter. Figure 2 shows the long-time values of the removed fractions, as functions of a multiplier z that rescales the benchmark probability mapping kernel κ → zκ. One sees the remarkable transition from resilient to susceptible at a critical value z * ∼ 0.70. This single graph shows clearly the general principle that any contagion can be prevented at the outset by sufficiently strong restrictions on social interactions. 

This paper is intended to provide a road map for future research using IRSNs as a tool in understanding aspects of epidemic risk. We end with a brief discussion of three interesting areas of exploration that have so far been left unaddressed. One line of inquiry asks about the accuracy of the large N approximation to real world models of this type. A first step in this direction is to investigate "synthetic models" to compare the large N asymptotic formulas to simulation studies for finite N . An optimistic hope is that N = ∞ formulas will prove to be an effective tool for explaining the systemic resilience of moderately large networks.

A second line of inquiry focusses on calibrating IRSN models of this type to real world social systems. Here the critical issue is the availability of data along the lines discussed in Section 7. Where a suitable representation of a real world network can be found, it will then be of interest to investigate the multiple dimensions of vulnerability exhibited by the calibrated cascade model.

A third avenue of investigation is how to design network models that can be used as a tool to explore and understand further social risk effects. Examples of interesting effects include: the impact of exceptional "superspreader" nodes; overlapping contagions such as influenza and coronavirus; more diverse types of nodes; country wide networks and the global network.

Lemma 2. Let I be any hyperinterval in R d andȳ > 0. Suppose g(x, y) : I × [0,ȳ] → C is a bivariate function such that g(·, y), ∂ y g(·, y), ∂ 2 y g(·, y) are pointwise bounded and in L 2 (I) for each value y ∈ [0,ȳ]. Then 

and apply the Lemma to the logarithm of (9).

Proof of Lemma 2. Under the assumptions, one can show directly that f (x, y) := log(1 + yg(x, y))] − yg(x, 0) satisfies lim y→0 f (x, y) = lim y→0 ∂ y f (x, y) = 0 and hence by Taylor's remainder theorem

One can also show that ∂ 2 y f (x, v) is in L 2 (I) for each value v ∈ [0,ȳ] providedȳ > 0 is small enough. Then, by Fubini's Theorem, for y ∈ [0,ȳ] log(1 + yg(x, y))] − g(x, 0) 2 ≤ ( 

The objective method: probabilistic combinatorial optimization and local weak convergence

The phase transition in inhomogeneous random graphs

Compartmental models in epidemiology

Connected components in random graphs with given expected degree sequences

The global impact of covid-19 and strategies for mitigation and suppression

On Watts cascade model with random link weights

Networks and epidemic models

Networks: An Introduction

Eight challenges for network epidemic models

Random Graphs and Complex Networks: Volumes I and II. Book, to be published

A simple model of global cascades on random networks