key: cord-0823247-5ixr3b8g
authors: Pechlivanoglou, Tilemachos; Li, Jing; Sun, Jialin; Heidari, Farzaneh; Papagelis, Manos
title: Epidemic Spreading in Trajectory Networks
date: 2021-10-29
journal: Big Data Research
DOI: 10.1016/j.bdr.2021.100275
sha: feb51784e0d9e661919dd85da46eb52c3dbe2d4b
doc_id: 823247
cord_uid: 5ixr3b8g

Epidemics of infectious diseases, such as the one caused by the rapid spread of the coronavirus disease 2019 (COVID-19), have tested the world's more advanced health systems and have caused an enormous societal and economic damage. The mechanism of contagion is well understood. As people move around, over time, they regularly engage in social interactions. The spatiotemporal network representing these interactions constitutes the backbone on which an epidemic spreads, causing outbreaks. At the same time, advanced technological responses have claimed some success in controlling the epidemic based on digital contact tracing technologies. Motivated by these observations, we design, develop and evaluate a stochastic agent-based SEIR model of epidemic spreading in spatiotemporal networks informed by mobility data of individuals (trajectories). The model focuses on individual variation in mobility patterns that affects the degree of exposure to the disease. Understanding the role that individual nodes play in the process of disease spreading through network effects is fundamental as it allows to (i) assess the risk of infection of individuals, (ii) assess the size of a disease outbreak due to specific individuals, and (iii) assess targeted intervention strategies that aim to control the epidemic spreading. We perform a comprehensive analysis of the model employing COVID-19 as a use case. The results indicate that simple individual-based intervention strategies that exhibit significant network effects can effectively control the spread of an epidemic. We have also demonstrated that targeted interventions can outperform generic intervention strategies. Overall, our work provides an evidence-based data-driven model to support decision making and inform public policy regarding intervention strategies for containing or mitigating the epidemic spread.

Epidemics of infectious diseases, such as the one caused by the rapid spread of the coronavirus disease 2019 (COVID- 19) , have tested the world's more advanced health systems and have caused an enormous societal and economic damage. The mechanism of contagion is well understood. As people move around, over time, they regularly engage in social interactions. The spatiotemporal network representing these interactions constitutes the backbone on which an epidemic spreads, causing outbreaks. At the same time, advanced technological responses have claimed some success in controlling the epidemic based on digital contact tracing technologies. Motivated by these observations, we design, develop and evaluate a stochastic agent-based SEIR model of epidemic spreading in spatiotemporal networks informed by mobility data of individuals (trajectories). The model focuses on individual variation in mobility patterns that affects the degree of exposure to the disease. Understanding the role that individual nodes play in the process of disease spreading through network effects is fundamental as it allows to (i) assess the risk of infection of individuals, (ii) assess the size of a disease outbreak due to specific individuals, and (iii) assess targeted intervention strategies that aim to control the epidemic spreading. We perform a comprehensive analysis of the model employing COVID-19 as a use case. The results indicate that simple individualbased intervention strategies that exhibit significant network effects can effectively control the spread of an epidemic. We have also demonstrated that targeted interventions can outperform generic intervention strategies. Overall, our work provides an evidence-based data-driven model to support decision making and inform public policy regarding intervention strategies for containing or mitigating the epidemic spread.

From the Plague of Athens (430 to 426 BC) [1, 2] to the Spanish Flu (1918) [3, 4] , pandemics have had a significant impact on human society [5] . In the last 20 years alone, the world has seen many infectious disease outbreaks. Notorious examples include the pandemics caused by the Severe Acute Respiratory Syndrome coronavirus (SARS-CoV) [6] , the influenza A virus subtype H1N1 (swine flu) [7] , the Middle East Respiratory Syndrome coronavirus (MERS-CoV) [8] , the Ebola virus (EVD) [9] , the Zika virus (ZIKV) [10] , and most recently the Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2) [11] . These pandemics have tested the world's more advanced health systems and have caused an enormous societal and economic damage. Conventional methods to address the rapid spread of an infectious disease include physical distancing, confinement measures and human-based contact tracing of infected individuals. These describe some of the common policies imposed by governing authorities and jurisdictions that aim to contain or slow down the spread of the virus to levels that can be managed by healthcare units and socio-political institutions. While these easily understood policies can be effective in controlling the spread of the disease and saving lives [12, 13] , they have well-known drawbacks: (i) they are imposing extreme restrictions or limitations on individuals' activities or freedom, leading to a slowdown of social and economic activities of the community and to socioeconomic side-effects for the individuals themselves; (ii) they depend on human-based contact tracing of infected individuals that are cumbersome, expensive, slow and inaccurate; and (iii) they do not provide the means of a controlled transition to an immune community through well-defined intervention strategies that can easily translate to health policy and potentially ameliorate the socioeconomic impact.

More recently, advanced technological responses to the problem based on digital contact tracing have claimed some success in controlling the epidemic [14] . Digital contact tracing or proximity tracing, enabled by GPS-enabled devices, mobile apps [15] and beyond [16] , represents the ability to track and reconstruct the close contacts that an individual had with other people within a time period. The way that proximity tracing can have an impact in containing or slowing down the disease spread is straightforward.

Individuals that are known to be infected, can inform (via cloud services) recent close contacts that have been digitally traced, who can then take precautions and avoid further contacts with other people by isolating themselves or seeking expert advice. The process can involve third parties, such as governing authorities and/or health experts responsible for the containment of the disease.

The focus of the current research is on utilization of GPSenabled digital contact traces of individuals (i.e., mobility data or trajectories) to inform a more comprehensive analysis and modeling of disease spreading through methods of graph mining [17] and trajectory data mining [18] . In particular, we present a datadriven model for the spread of the disease in a community that take into account the mobility patterns of individuals. As people move in cities, they engage in different types of interaction with other people, resulting in different mobility patterns. As such, the relative risk of them being infected or infecting others can be substantially different. We systematically study the effect of the individual variability of mobility behavior to the risk of infection of an individual. This observation can have significant consequences to a model's accuracy of how the disease propagates in a community, as well as to the intervention strategies that can be designed to control the epidemic.

Motivated by the feasibility of digital contact tracing technologies [19] and the inherent limitations of traditional epidemiological models (see Section 2), this paper presents datadriven models of infectious disease spreading that incorporate individual variability due to individuals' mobility patterns. Our study aims to clarify how differences in mobility patterns can inform infectious epidemic dynamics and determine the impact of various intervention strategies. In summary, the major contributions of this work are as follows:

• we present novel data-driven models for assessment of the risk of infection of an individual based on mobility patterns and the amount of time they spend in proximity with others ("individual risk assessment");

• we present a stochastic agent-based Susceptible-Exposed-Infected-Removed (SEIR) network model for infectious disease spreading in trajectory networks ("community risk assessment");

• we design and evaluate novel individual-based intervention strategies for containing (or mitigating to an acceptance rate) the spread of an infectious disease in trajectory networks ("containment intervention strategies");

• we design and evaluate novel individual-based immunization strategies for providing a controlled and safe transition to an immune community ("targeted immunization");

• we present a large-scale case study using model parameter values that resemble the recent COVID-19 outbreak and realistic synthetic mobility data in a real urban environment (large University campus and surroundings) that allows for many human-human interactions; the model and algorithms presented generalize to other similar infectious diseases;

• we provide source code and data to encourage reproducibility of results.

The remainder of this paper is organized as follows: Section 2 provides background information, introduces notation and provides definitions of the technical problems of interest in this paper. Section 3 presents our epidemic model, algorithmic details of epidemic spreading in trajectory networks and descriptions of disease containment intervention strategies. Section 4 presents an experimental evaluation of the different models and methods for varying settings. We review the related work in Section 5 and conclude in Section 6.

In this section, we introduce notation and preliminaries of the problem of interest, as well as formal problem definitions. The background mostly relates to the definition of a contact network or a trajectory network as defined in [20] . We also provide background information related to the basic reproductive number R 0 and its limitations, as well as information about the SEIR epidemic model and its variations, as we employ it in our study.

We assume monitoring of mobility data of individuals within a finite observation area A. For the needs of our study, this area typically represents the administrative boundaries of a city or a city neighborhood where daily human contacts occur. Since A is a relatively small region, the Earth surface it represents has a low curvature and is close to flat. We can therefore, for simplicity and without loss of generality, assume that individuals move in a finite 2-dimensional Euclidean space R 2 and not on the surface of the Earth. This assumption allows to approximate geodesic distances on Earth with Euclidean distances in R 2 , a common practice in many real-world algorithms and services.

Consider a set of objects N = {u 1 , u 2 , . . . , u N } moving in an observation area A, defined as a finite 2-dimensional Euclidean space R 2 for a finite observation time interval [0, T ], forming a set of trajectories P. We formally define a trajectory as follows.

and (x, y) ∈ A ⊆ R 2 represent latitude and longitude coordinates in the 2D Cartesian system. We assume that an object might appear and disappear multiple times during the observation time interval [0, T ].

As individuals move in A, they can encounter each other, forming contacts. Following Pechlivanoglou and Papagelis [20] , we define a contact as follows.

A contact c u,v between two moving individuals u, v ∈ N occurs when their physical proximity (spatial distance) d u,v is smaller than or equal to a threshold τ (i.e. d u,v ≤ τ ).

Several approaches exist to estimate the spatial distance of two points in Euclidean plane. We employ its simplest form, the Euclidean distance, given by:

where (x u , y u ) and (x v , y v ) are the spatial coordinates of individuals u and v at a time t, where 0 ≤ t ≤ T respectively. The two individuals u and v are considered to be in contact for as long as their spatial distance remains consistently smaller than a proximity threshold τ . We extend the concept of a contact to include its temporal dimension and formally define an event as follows.

An event e u,v between two moving objects u, v ∈ N , represents a contact c u,v that lasted for a time interval [t s , t e ], where t s represents the time point of the beginning of the contact and t e represents the time point the contact ended. An event is represented by the quadruple e u,v = (u, v, t s , t e ). We also define the duration of the event as δ(e u,v ) = t e − t s . Note that, in our setting, we do not preclude the case that two individuals are in contact multiple times. In this case, the contact information between two moving individuals u and v is represented by a sequence of events

. . , (u, v, t n s , t n e )}. We also define the duration of all events as (E u,v 

Furthermore, in this paper we employ a universal proximity threshold τ , so the contacts will always be reciprocal, meaning that

. Indeed, this is sufficient for the case of human-to-human interactions we examine in this work.

A network that is constructed by connecting pairs of individuals that are close to each other based on physical proximity is called a proximity network. However, a proximity network is static and does not capture well the idea of individuals moving in space. When individuals are moving, the temporal dimension of interactions must be considered, and the resulting network can be thought of as a temporal network, also referred to as a time-varying network. Most characterizations of temporal networks discretize time by grouping together temporal information into a sequence of T network "snapshots" G t (V t , E t ), t ∈ {1, 2, . . . , T }. Each snapshot contains the vertices V t and edges E t , representing the individuals and their contacts, respectively, within a basic time unit t (e.g., second, minute, hour, etc.). The resulting data structure can be thought of as either a single aggregation graph with varying vertices and edges, or a sequence of proximity graphs. In either case, we re-

There are many possible metrics to determine the importance (or influence) of an individual (or node) in a temporal network. Note that the term node centrality refers to node importance that is common in static network analysis, and isn't applicable for trajectory networks. This is because measures of node centrality in the traditional setting of a static network are commonly based on shortest paths (e.g., betweenness centrality [21, 22] ), but shortest paths in temporal networks take a different character [23] . For example, in [24] , the authors define minimum temporal paths to capture the different characterizations of time-constraint shortest paths including cases of earliest-arrival paths, latest-departure paths, or fastest paths. It is possible to evaluate a notion of temporal betweenness [25] , but in our setting, we focus on notions of importance that are critical in the context of epidemic modeling in the trajectory network. Similarly to Pechlivanoglou and Papagelis [20] , we define metrics that relate to the temporal node degree and the duration of events, and use these metrics to construct node profiles that describe the behavior of each individual.

We define the following metrics related to node degree in the trajectory network:

• C u : a set of all contacts of u during the observation time interval [0, T ]. • D deg u (k): a distribution that represents the fraction of the time steps t i ∈ [0, T ] that u has node degree k.

In this paper, we are interested in the assessment and mitigation of the risk of infectious disease spreading in trajectory networks based on mobility data. In particular, we aim to address the following problems: Problem 3. Given a trajectory network G(V , E), and the parameters of an emerging infectious disease, determine the impact of various epidemic containment intervention strategies that can easily translate to health policy. The focus is on comparative analysis of the impact of targeted individual-based interventions against a null model (informed by less sophisticated horizontal measures).

The basic reproductive number R 0 (sometimes called basic reproduction ratio), is the most widely used parameter in epidemiology. It can be thought of as the expected number of new infections caused by a single infected individual. Commonly used epidemiological models suggest that R 0 = 1 is a critical value. When R 0 < 1, each infected person produces less than one new case in expectation, therefore the size of the outbreak is constantly trending downwards, until eventually the disease dies off. On the other hand, when R 0 > 1, each infected person produces more than one new cases in expectation, therefore the size of the outbreak is constantly trending upwards. In principle, the larger the value of R 0 , the more challenging it is to control the epidemic.

Despite its usefulness as an approximate indication of the spreading power of the disease, many studies have stressed the limitations of R 0 . An underlying assumption of R 0 is that the disease is spreading in a perfect mixing network (i.e., a complete graph) or a regular tree network -a special type of a network topology that has no cycles and each internal node has a constant number of children, defined by a branching factor d. However realworld communities do not resemble a complete graph or regular trees, since some people have more contacts than others and it is common for people to have common friends (forming triangles or cycles). It is also easy to see how the basic computation of R 0 breaks down when we consider transmission of infection to be a stochastic process involving discrete individuals [26] .

For the purposes of this work, when we refer to R 0 for individuals, we define it as "the expected number of secondary cases produced, in a completely susceptible population, produced by a typical infected individual" [27] .

Compartmental models of epidemic modeling divide the population into separate divisions (compartments) and people transition between them based on their health status during an epidemic.

For instance, in the classic SIR model [28, 29] , people progress between three compartments: susceptible (S), infectious (I) and removed/recovered (R). For many infectious diseases, there is a significant latent period (incubation) during which susceptible individuals have been infected, but are not yet infectious themselves. During this period an individual is considered to be in a compartment labeled as exposed (E ), and the model is known as SEIR. The current research employs the SEIR model for modeling the spread of a virus in the community. Depending on assumptions of population structure and transmission progression, there are two main classes of the SEIR model studied in the literature.

Homogeneous population. The first class, assumes a large, homogeneously mixing population where individuals move between compartments at certain transition rates described by ordinary differential equations [30, 31] :

where β is the transmission rate, σ is the incubation rate, and γ is the recovery rate, respectively. This is a deterministic model, so for a fixed set of parameter values and SEIR model initialization (t = 0), it produces the same outcome at each simulation.

This model can inform about the state of the epidemic spread in the community and provide insights about future trends as well as inform health policy at large [32] . However, there are certain limitations of this model. Its results and usefulness are limited by the inherent assumption that all individuals share the same characteristics.

Heterogeneous population. The second class, assumes heterogeneity of population and is based on an agent-based SEIR model, where each agent is representing an individual [33, 34] . This approach allows to model individual characteristics and behavior towards the epidemic. In our research we focus on heterogeneity that is attributed to different mobility and contact patterns of individuals, over time. Different mobility patterns, lead to complex spatio-temporal social interactions between people in the community [20, 35] . These models are more challenging to analyze and interpret as they depend on a stochastic (probabilistic) process of epidemic spreading that increases the complexity [36] . However, they are more realistic and can help to better understand the emergence of a disease due to different individual behaviors. In addition, since they operate (simulate) on individual-level behavior, they provide an opportunity to design targeted intervention strategies that can more easily translate to health policy. In section 3 we present the details of the agent-based SEIR model.

Recent studies on epidemic modeling highlight the importance of individual variability in modeling the spread of an infectious disease in a community and predicting its relevant outcome. For example, Gomes et al. [37] studied the effect of the biological variation in susceptibility of individuals and their physical exposure to infection. And, Britton et al. [38] studied how population heterogeneity affects herd immunity. In our research, we study the effect of individual variability in epidemic modeling that is due to mobility patterns. We first present a method for computing the risk of infection of any individual in the community, as a result of their spatiotemporal interactions. Then, we present a stochastic agentbased epidemic model (and optimizations) that can better capture the dynamic disease spreading in a community.

We integrate individual variation by modeling the risk of infection of an individual in relation to its mobility patterns and contacts over a time period. Intuitively, we would like to model that the more contacts an individual has and the more time they spent with each other, the higher the risk of infection. Formally, given a trajectory network G(V , E), an individual u ∈ N and its contacts C u during [0, T ], we model the risk of infection risk u of an individual by the following three methods, each offering a different level of analysis. risk (1) 

Out of the three definitions, risk (1) u is the simplest one as it is based on the node degree in the aggregation network (i.e., the network defined by aggregating the edges of a temporal network over [0, T ]); risk (2) u takes into consideration both the number of contacts of u and the total duration of these contacts (due to potentially multiple events); risk (3) u models the risk of infection as a probability of getting infected by any of its contacts factoring the total duration of these contacts (due to potentially multiple events), where β is the transmission probability of the disease.

In particular, we use a geometric function to represent the risk attributed to each distinct contact. The outcome is a regularized metric for risk (capped at 1), so that specific contacts with a very long duration do not dominate the overall risk of an individual.

While the actual value of an individual's risk of infection does not hold any natural interpretation, it is important for our analysis to represent the relative risk rrisk u of u to other individuals in the network. We therefore normalize each risk metric by the aggregated risk of all N individuals in the network to get the relative risk of u ∈ N , as follows:

We utilize the relative risk in our experimental analysis.

We present a stochastic agent-based SEIR network model for epidemic spreading in a trajectory network, where nodes represent individuals and edges represent contacts of nodes. According to the epidemic model, each node can be at one of the following infection states, at any discrete time t:

• Susceptible (S). This is the initial state of all nodes; a node can get exposed to the infection by any of its infected neighbors with probability β, per time step. • Exposed (E ). A node is in this state if it has been infected by one of its neighbors, but it is not yet infectious itself. A node stays in this state for as long as the incubation period of the disease lasts, which for simplicity we model as a constant that lasts I f time steps. After that period, the node becomes infectious and switches to state I with certainty. Depending, on the disease we aim to model, the certainty can be relaxed by incorporating a parameter to control the probability of a node switching to I (or to S).

• Infected (I). A node is in this state if it is infectious, therefore can transmit the disease to any of its neighbors with probability β.

• Removed (R). A node is in this state if it has been removed, meaning either has passed away or has recovered. Nodes that are in I will be removed after I r time steps with a recovery probability γ . The recovered nodes are neither infectious anymore nor susceptible to the infection. is given by S(t), E(t), I(t), and R(t), respectively. We also define two special sets of infected nodes: (i) the initial seed set of infected nodes I 0 = I(0), and (ii) the set of the infected nodes at the end of the process I T = I(T ), which represents the size of the epidemic spread.

We describe here algorithmic details of the stochastic model.

Recall that individuals move between infection states S, E, I and R based on a stochastic process. At each discrete time step t, each S(usceptible) node has a chance to switch to E(xposed), E(xposed) nodes might switch to I(nfected), and I(nfected) nodes might be R(emoved). Formally, let u ∈ S and let N u be the set of neighbors of u at time t. Each neighbor v ∈ N u such that v ∈ I, flips a biased coin with a bias equal to the transmission probability β to determine whether it will infect u. If u is infected, then it switches its infection state to E(xposed), otherwise its infection state remains S(usceptible). Similarly, an E(xposed) node will switch to I(infected) after I f steps and an I(nfected) node will switch to R(ecovered) after I r steps, with a probability γ . The pseudocode of the stochastic model of epidemic spreading is given in Algorithm 1. In our analysis, each time step in the discrete time simulation corresponds to a minute (60 secs), so negligible contacts (interactions of less than a minute) are not considered. Studies on infectious diseases have showed that prolonged exposure of a susceptible node to an infected node increases the likelihood of infection [39] . It is easy to see that in the epidemic model presented in Algorithm 1, an infected node u has multiple chances to propagate the disease. Formally, given a trajectory network G(N, V ), the probability p u,v of a susceptible node u ∈ V being infected by a neighboring infected node v ∈ V after k independent trials is given by the cumulative distribution function of the geometric distribution:

where β is the transmission probability of the disease. Eq. (5) represents the complementary probability of u not being infected after k independent trials. It is easy to see that k depends on the duration of the contact between an infected node v and u (i.e., one chance per time unit) and that 0 ≤ p u,v ≤ 1.

The epidemic spreading model we described in Algorithm 1, is a stochastic process that possesses some inherent randomness. Starting with the same initial conditions (i.e., the same sets of Susceptible and Infected nodes) and parameter values, multiple independent simulations of the epidemic spreading process can produce outputs that vary a lot, in terms of the total number of nodes infected at the end of the process. This is because the final outcome depends on flipping a biased coin at every time step to decide whether the disease will diffuse from one node to another in the network.

Interestingly, there is an equivalent deterministic model that offers a static view of the network and is more practical, as it allows for faster simulations than the stochastic model. We describe here a method that given a stochastic model of epidemic spreading in the trajectory network, converts it to a deterministic model based on percolation theory [40, 41] . In mathematics and physics, percolation theory is used to explain the flow of fluids through certain types of porous material. Similarly, in network science, it is used Algorithm 1: Epidemic spreading in trajectory networks.

u recovers with probability γ ;

if u recovers then

break; to describe the behavior of a network when nodes or links are removed.

To utilize this idea in the epidemic spreading model, recall that each infected node in the network has a probability β to infect each of its neighboring nodes at every time step t, by flipping a biased coin with a probability β. At the end of the interaction, the infected node has either infected the neighboring node, in which case we consider the edge to be "active", or not, in which case we consider the edge to be "removed or blocked". The idea of percolation is that instead of deferring the decision of whether an edge will be "active" or "removed" at runtime, we can make a decision for each edge of the trajectory network G(V , E) at the very beginning of the whole process. In practice, for each edge in the network, we just need to flip a biased coin with probability β as many times as the duration of the contact (expressed in time units), and decide whether to keep it or remove it from the network. At the end of the process a smaller network G (V , E ) is constructed, such that E ⊆ E.

In terms of the correctness of the epidemic spreading process itself, it does not matter if the decision to keep or remove an edge is made at runtime or early in the process. In terms of runtime cost, percolation allows to work on a smaller network (since many edges are already removed) and allows simulations to fin- ish faster. We therefore employ percolation in the relevant set of experiments.

In this section, we explore various network-based intervention strategies that aim to contain an epidemic [42] . These interventions change the structure of the trajectory network -the backbone on which an epidemic spreads over time -and eventually affect the size of the set I T of the infected nodes at the end of the process. The strategies relate to node immunization (network node removal) or breaking of social ties (network edge removal) and are actuated either at a network-level (governing authority decision) or a node-level (individual decision). Our goal is to design targeted models of intervention and evaluate them against sensible null models. Details of these strategies and models are presented below, along with discussion on their feasibility and their implications to health policy. Strategy 1: node immunization. Based on this strategy, we remove a fraction α n of all nodes in the network. Formally, given a set S ⊆ V of nodes to be removed, where |S| = α n |V |, the infectious disease now spreads in the induced subgraph G (V , E ) of G whose vertex set is V = V \ S and whose edge set E consists of all of the edges in E that have both endpoints in V . The real-world interpretation of this strategy is that some individuals are quarantined (i.e., they are in a state of isolation where no contacts occur) or develop immunization because of a vaccine. The network effect is that a contagious disease cannot spread through their contacts anymore.

Null model: A fraction α n of nodes is removed uniformly at random. It is important to note that this is a network-level intervention strategy, where a national authority determines a set of individuals to immune (or request to quarantine) based on an estimate of their relative risk rrisk u . Such an intervention, is resource-intensive, but also might infringe the privacy of individuals. It also carries a risk of discriminating against individuals with specific mobility patterns (i.e., super-spreaders). As a result, the feasibility of this intervention strategy is rather weak for large communities.

Strategy 2: breaking of social ties. Based on this strategy, we remove a fraction α e of edges adjacent to each node (contacts).

Formally, given a node u ∈ V and its set of neighbors (u), we

The real-world interpretation of this strategy is that individuals have some understanding of the mobility patterns of their contacts and they can make decisions about who to avoid. The network effect is that a contagious disease cannot spread through some specific contacts anymore.

Null model: For each node u, a fraction α e of its contacts to neighboring nodes (u) are removed, uniformly at random. It is important to note that this is an individual-level intervention strategy, where each individual makes a local decision about who to avoid, based on some understanding of the relative risk rrisk u associated with each of its contacts. The model assumes that individuals are in position to understand that they should avoid contacts that are frequently and regularly interacting with many others (e.g., due to their occupation or mobility habits). Such an intervention is easier and not resource-intensive to implement, due to its distributed nature and does not infringe on the privacy of individuals. As a result, the feasibility of this intervention strategy is rather high for large communities. This model resembles a "social bubble" policy practiced by many, where an individual maintains contact with only family members and a few close friends. This way, potential "network bridges" between different well-knit communities in the network are eliminated and the infectious disease finds it hard to cross between them. This is an individual-level intervention strategy, where each individual makes a local decision about who to keep in its social bubble, based on some understanding of how many friends they have in common. This is a relatively easier assumption to make (than the one made by the targeted model A). Such an intervention 10] is once again easier and not resource-intensive to implement, due to its distributed nature and does not infringe on the privacy of individuals. As a result, the feasibility of this intervention strategy is rather high for large communities.

In this section, we provide details of our experimental evaluation. We first present our synthetic data generator and describe the characteristics of the contact networks produced. Then, we present a COVID-19 use case, by specifying the parameters and refining the research questions we aim to explore. For each research question, we outline the experimental scenario and process followed to effectively address it. Finally, we discuss the results and any implications.

In order to evaluate our stochastic agent-based SEIR epidemic model, we had to rely on large-size data representing trajectories of individuals or their spatiotemporal contacts. Moreover, for simulations to be reliable, the data needs to be (almost) complete; if significant amount of information about people's mobility or contacts is missing, then any underlying analysis related to community structure and individual behavior could be significantly affected. At the same time, mobility data is highly sensitive; many contact tracing applications rely on privacy-preserving proximity data, making the collection of real-world data impossible. With these factors in mind, we opted to use synthetically generated data. On the other hand, the benefit of generating synthetic data is that all parameters could be tuned and therefore analysis can be more comprehensive.

We generated synthetic data that simulates the activity of people living and working within a specified urban area over the course of a month. We defined an observation area A of approximately 1 km 2 including the York University Keele campus and surrounding neighborhoods in Toronto, Canada. Each individual in the simulation is randomly assigned a home location and frequents a number of favorite places (out of a predefined set of places), following a normal distribution. Moreover, each person is assigned an activity level parameter that determines how "active" they are by controlling the number of hours they may spend outside their home every day and the number of places they are likely to visit. Based on existing research on daily activity [43] , each individual was assigned between 0 and 12 active daily hours, determined by their activity level. Table 1 presents the parameters of the data generator and Fig. 4 presents descriptive analytics of the generated individual mobility data, including the distribution of activity levels, the distribution of places visited and the hourly activity over the course of a month by individuals of different activity level.

We combine all previous parameters to generate a set of destinations and daily schedules for a specified number of individuals. Afterwards, the exact movement and trajectory traces of these people are simulated using Eclipse Simulation of Urban MObility (SUMO) [44] , an open source, highly portable, microscopic and continuous multi-modal traffic simulation package. SUMO is capable of modeling accurate and highly realistic movement of vehicles but also pedestrians, including movement through pedestrian crossings and crowded sidewalks. The end result is synthetic but reliable, complete datasets representing the daily movement of individuals in the observation area A, over a period of a month.

We used the synthetic data generator to model a population of 2,000, 3,000, 5,000 and 10,000 individuals moving in the same campus area. Of course, the resulting datasets correspond to different population densities. This allows us to examine the progression of an epidemic in urban areas with different population densities while controlling the rest of the parameters in the problem. Focusing on the COVID-19 epidemic use case, we can transform the trajectory traces we obtained into trajectory networks to model the spread of infection. To do this, we need to determine a specific distance threshold where two individuals are considered in contact. Prior research on SARS-CoV-2 transmission through air droplets has shown that individuals at a close physical distance of ≤1-2 m have a high probability of transmission, while there still exists a lower probability when within 2-9 m [39, 45] . As we require a single cutoff value, we selected the conservative threshold value τ = 2m. We used the tools developed by [20] to construct the corresponding trajectory networks. The properties of the generated network datasets can be seen in Table 2 .

A factor that is somewhat uncertain in related research is the duration required for two individuals to be considered in contact and, subsequently, the transmission probability per time unit β. Studies that examine definitions of contact duration typically consider the case of 1-2 m for 15 minutes or more [19] . With the 12.8% transmission probability from [39] , this would result in β ≈ 0.85% per minute. Furthermore, there are studies of transmission times in different environments such as airplanes [46] or ventilated spaces [47] . These provide values of 1.8% per minute (quadrupled for conservative results) when within 1 m and 1% per minute when in a well-ventilated space without masks, respectively. In our work, we use β = 1% per minute in most experiments, but we also explore the progress of an epidemic with different values of β.

Regarding the infection's progress, we follow the example of well-established prior research on COVID-19 [32] and utilize the SEIR model with exposure period of 3 days, infectious period of 6 days and recovery period of 10 days. The recovery probability γ helps to understand the severity of a disease in long term, since together with transmission rate β, it determines R 0 . However, analysis of varying values of parameter γ is out of the scope of our model that focuses on heterogeneity due to mobility patterns and targeted intervention strategies. In the experiments, we therefore fix the recovery probability to γ = 1 (i.e., 100%).

With these parameters selected, we aim to answer the following questions: 

We utilize each of the three proposed methods to estimate the relative risk of infection for the population sample in all datasets. The resulting distribution of risks can be seen in Fig. 5 . As can be seen, the duration-based risk (2) u gives a higher risk to a smaller number of individuals than the degree-based risk (1) u . The geometric interaction-based risk (3) u produces an estimate that is balanced between the two other metrics, and we use this in all remaining experiments. The reason why we employ risk (3) u is that it naturally captures the dynamics of the infection transmission process. In particular, the risk model needs to capture the following characteristics:

• the more contacts an individual has the higher the risk; • the longer the duration of an interaction, the higher the risk; and • the risk of infection due to a singular contact should not increase infinitely but it should plateau once it reaches a probability close to 1 (i.e., certain infection). (3) u uses a geometric function to naturally represent the risk due to these characteristics. Note that the probability of u infecting v after n attempts is increasing for every time unit (i.e., minute), demonstrates diminishing returns and it is eventually plateauing out as it approaches to 1 (i.e., 100%). If we were not considering a geometric function (or similar diminishing returns function), then the risk of certain individuals would grow continuously as a factor of the duration of the contact and would lead to disproportional large risk to certain individuals (due to certain lengthy interactions). of the model (Susceptible, Exposed, Infected and Recovered, respectively).

Furthermore, Fig. 6b shows the basic reproductive number R 0 over time. It can be seen that the R 0 fluctuates over time with values ranging from 0.0 to 2.5, while its 30-day moving average is equal to 0.939 (dashed line). Note that the R 0 is assuming a perfect mixing network (i.e., a complete graph). However, real-world communities do not resemble a complete graph. Our micro-scale analysis of infections allows to monitor the direct and secondary infections attributed to an individual u, and therefore allows to report the reproductive number R u 0 of u ∈ N . In contrast to the basic reproduction number R 0 that represents the expected number of infections directly generated by an infectious individual, the R u 0 represents the exact number of people infected by the specific individual u. Now, instead of relying on the R 0 we are in position to provide the distribution of the individual R u 0 values in the population. In Fig. 6c , we show the distribution of R u 0 for the 10k dataset, along with the mean R 0 = N u R u 0 . It is evident that there is significant variation between individuals, something that the mean R 0 fails to capture.

In Fig. 7a we can see the infected count I over the period of a month for an initial "seed" risk (3) u , all of them with high, medium, low, or random risks risk (3) u . In the case of low-risk individuals, the infection never spreads to other people as the initial ones have very limited or no contact with anyone else. In all other cases however, we can see that after a month the vast majority of the population has been infected, with that conclusion arriving faster or slower depending on the initial seed risk. Similarly, in Fig. 7b we can see the final infected counts I T for initial seeds of random risk but different size I O , for all 3 datasets. While there is some reduction in final infections when I O = 1, all other seed sizes lead to the same result, determined by the population density.

An explanation for this behavior can be found when examining the contribution of each individual in the spread of the infection, and the role of super-spreaders. As mentioned above, we use R u 0 to define the set of individuals that were directly infected by their contact with individual u. We define as R u 1 those infected by any

person v ∈ R u 0 , R u 2 those infected by v ∈ R u 1 , and so on. Furthermore, we define R u = {R u 0 , R u 1 , . . .}, i.e. all individuals that were infected directly or indirectly because of u. In Fig. 8a we can see the distribution of R u 0 for the entire population, along with each person's relative risk rrisk (2) u . As expected, high-risk individuals are responsible for the vast majority of direct disease transmissions. However, in Fig. 8b we display the equivalent distribution for R u . There, we can see that many medium or low-risk individuals are actually responsible for a lot of the secondary, indirect infections. This means that, even when a person with few contacts is infected, a single contact with a high-risk super-spreader is enough for the disease to quickly propagate across the community.

The rest of the experiments use I O = 10 individuals of random risk risk (3) u .

In Fig. 9a we can see the progress of an epidemic in a population of 10,000 when the probability of transmission β has different values. Furthermore, in Fig. 9b we can see the final infected counts for those same values of β for the different datasets. Any value above 1-2% all but guarantees the rapid infection of the entire population, and values below 1% result in significantly reduced counts when the population density isn't too high. It is evident that the transmission probability has a significant impact on the spread of an epidemic; face masks and any other means of reducing it can have critical benefits, as the vast majority of existing research also indicates [45] . 

Quarantine policies, which include social distancing, home confinement and centralized quarantine, have been widely used to break the transmission chain of epidemic spread [48] . Even though some human-rights and socioeconomic issues have been raised in the process, this public health measure has proved effective in controlling disease spread [49, 50] . Ideally, only those who are infected should be quarantined, while others can travel as they wish. However, this imposes another challenge as recent studies have shown that many infected individuals are asymptomatic or only have mild symptoms [51, 52] . These individuals are most likely not aware they are infected, still able to transmit the virus to others, and therefore in need of quarantine.

To evaluate the effect of quarantine, we incorporate a quarantine parameter q in the experiments that represents the proportion of infected individuals that will have no contact with any others until they fully recover; we assume that the rest 1 − q proportion of the infected nodes will not quarantine and will continue interacting with others (e.g., due to no symptoms). Given that the quarantine of every infectious individual corresponds with the removal of many probable-transmission contacts, the expectation for this case is that the effect of the quarantine parameter q on the epidemic will be significant. Indeed, as can be seen in Fig. 10 , even small values can result in greatly reduced infection numbers, which greatly highlights the importance of quarantine measures. For the remainder of this work we wish to examine other properties in isolation, and we therefore set the parameter q = 0.

To answer this research question, we examine the progress of the epidemic after applying the proposed intervention strategies on the 3k dataset. For each strategy, we report intervention and null-model results of I for an intervention proportion α = 0.2, and the final infected I T for different α values. Fig. 11a shows the results of removing 20% of the network nodes, i.e. individuals based on their risk. As mentioned earlier, this corresponds with targeted immunization or isolation of selected individuals. We can see that this has a substantial effect, greatly reducing the spread of the epidemic. The result is much less pronounced when removing medium or random risk individuals, and removing low-risk individuals has almost no effect. In Fig. 11b we can see the outcome for different proportions α. In this case, simply removing 30% of the highest-risk individuals practically eliminates the spread of infection completely.

Figs. 12a and 12b show the same results for the high-risk contact removal intervention. As mentioned in Section 3, this is equivalent to every person avoiding high-risk individuals among their contacts. We can see that this time the targeted strategy performs only slightly better than the null model, although that difference grows for higher intervention proportions. The result is not surprising. This is because the null model employed is not necessarily Fig. 11 . Results after high risk node immunization intervention. representing a bad strategy; by randomly selecting nodes and removing edges, one may remove high-risk, medium-risk or low-risk contacts. Furthermore, the aggregated network of the 3k dataset represents a small-world graph. It is well-known that when only a small proportion of edges are removed from that network (i.e. alpha is small), the impact on the connectivity of the network might not be significant due to the high clustering property of this type of graph.

Finally, Figs. 13a and 13b show the α = 20% progress and different-proportion final counts I T for the non-community contact removal intervention strategy. As mentioned earlier this is equivalent to the "social bubble" concept, where each person only maintains contact with their close friends and family, avoiding people from other groups. We can see that this strategy is notably more effective than the null-model one for α < 40%. Above that value, it is more beneficial to simply reduce each person's contacts overall, rather than targeting the more sporadic and (brief) contacts with people outside their community.

In order to compare the effectiveness of the proposed intervention strategies we have to take into account their relative impact on the population. Specifically, in high-risk node immunization with α n = 0.1 the percentage of removed edges α e is much higher.

In order to present a fair comparison of the intervention strategies, we report their results based on the number of removed contacts α e . The results for this experiment can be seen on Fig. 14a .

In order to compare the effectiveness of the proposed intervention strategies, we have to take into account their relative impact on the population. Specifically, in high-risk node immunization with α n = 0.1, the percentage of removed edges α e is much higher. Even with the same strategies (e.g., breaking of social ties), the number of edges to be removed can be different. As such, in order to present a fair comparison of the intervention strategies, we normalized the number of edges being removed in the experiments and report their results based on the number of removed contacts α e . The results for this experiment can be seen on Fig. 14a .

We can see that the "social bubble" intervention yields slightly better results than the high-risk individual removal one, when 30% or fewer contacts are removed. Afterwards, targeted immunization/isolation of individuals is significantly more effective. However, as mentioned in Section 3, we also need to take into consideration the feasibility of each approach. In that case, the uncommon edge intervention isn't only the most easily implemented one, but also performs very well up until a full 50-60% of all contacts have been removed. Any intervention above such magnitudes would significantly impact the movement and interactions of the population, and may be unrealistic for larger communities. Fig. 14b provides the R u 0 distributions after applying each of the interventions. This allows for a more in-depth examination of each intervention's effects and better interpretation of the performance results. In this plot, the total area under each curve corresponds to the sum of all transmissions of the infection for that case. Therefore, it is expected that all interventions will result in curves covering smaller areas than the non-intervention case. However, each intervention accomplishes this in a different way. The node immunization and high-risk edge removal interventions simply remove most of the highest-risk contacts, corresponding to many of the highest R u 0 values in the distribution, resulting in "shifting" the entire distribution to the left. On the other hand, the non-community edge removal intervention removes values throughout the distribution, resulting in a steeper curve.

Our research is related to (i) trajectory data mining, (ii) dynamic network analysis, and (iii) epidemic spreading in complex networks. These topics have been active research directions for a long time, so there is a broad spectrum of related literature. However, it is only recently that due to the technological advancement in geolocation tracking devices (e.g., GPS-enabled mobile devices), mobility data is becoming more accessible. Mining patterns in large amounts of human digital traces provides an opportunity for designing more accurate epidemic spreading models and more effective network intervention strategies than before. We cover below some of the most significant efforts relevant to that goal. Note, as well that some of the related work has already been cited throughout the manuscript to keep the discussion focused, so we omit it here.

Computational methods for mining spatiotemporal data, including trajectory/mobility data, have been extensively studied by the data mining and database communities. Two comprehensive surveys are provided by Zheng [18] and Atluri et al. [53] . Closely related to the problem of interest in this paper are problems that focus on mining the interactions among moving objects, over time, such as detecting pedestrian groups in trajectories [54] [55] [56] or determining the node centrality of moving objects in trajectory networks [20] . More recently, deep learning approaches for learning from spatiotemporal data and spatiotemporal networks have gained increasing attention [57, 58] .

Digital contact tracing: In our research, we assume that human mobile traces are available. This can be enabled by existing digital contact tracing technologies [59] [60] [61] . For instance, Aleta et al. [62] synthesized contact networks and modeled SARS-CoV-2 transmission in the Boston metropolitan area. They found that effective testing and contact tracing plays an important role in preventing second-wave spreading when complete isolation is relaxed. Other technologies of contact tracing have also been proposed [63] .

Privacy concerns of digital contact tracing: Digital contact tracing enables an easy and rapid implementation of infectious disease tracing, as it requires to gather and process simple information. However, gathering sensitive information might infringe the privacy of individuals [64] . We believe that controlling an infectious disease should not lead to a weakening of the privacy of individuals. We therefore advocate for protocols and technologies for "privacypreserving proximity tracing" that protect the privacy of individuals [65, 66] . Connectivity technology available in mobile devices and newly developed connectivity protocols can make a decisive contribution to efficiently and widely support proximity tracing enabled by bluetooth connection and/or GPS location. Both approaches protect the privacy of the user through dynamic pseudo-IDs [67] . For instance, Apple (iOS) and Google (Android) are providing privacypreserving cross-platform contact tracing via an open API and an opt-in Bluetooth-based proximity tracking. 1 In addition, the Pan-European Privacy-Preserving Proximity Tracing (PEPP-PT) protocol 2 and the Decentralized Privacy-Preserving Proximity Tracing (DP-3T) [68] protocol 3 have recently been proposed for providing a secure and decentralized privacy-preserving proximity tracing system.

The problem of objects being dispersed in space and interacting with each other if they are in close vicinity has been intensely explored in graph theory. Graph theory concepts, such as proximity graphs [69] and geometric intersection graphs [70] are characteristic examples. For instance, relative neighbor graphs [71] and Gabriel graphs [69] connect nearest neighbors if no other vertexes are nearby, while Delaunay triangulations [72] maximize the minimum angles of all triangles formed. These graphs, however, mostly deal with static data, while in our problem we are studying cases of multiple proximity graphs, one for each time unit. There has also been significant research on dynamic networks, such as time-varying networks or otherwise temporal networks. With the addition of temporal information, several concepts of static networks are not valid anymore, so they need to be adapted/studied in the context of dynamic networks. Due to its importance, dynamic network analysis is therefore an emergent discipline of network science focusing on network dynamics. Example problems include the computation of temporal node centrality [25] or computing metrics of network reachability [73] , shortest paths [24] , motifs [74] and other [75] . The dynamic nature of these systems introduces additional complexity and computational challenges. Dynamic networks have also been studied using machine learning models in the context of gradually evolving networks, where nodes/edges are added/removed over time [76, 77] . A comprehensive survey of this line of research can be found in [78] . Related to the current research, Leitch et al. [79] presented a review towards epidemic thresholds on temporal networks; they pointed out that temporal networks engage dynamics of real-world contacts, so their study is of great importance for understanding disease spreading processes. Statistical approaches or computer simulations are often necessary to explore the evolution of these external processes over evolving networks.

Mathematical modeling of epidemic spreading in networks can help to study and control the emergence of infectious diseases in a population. Based on the traditional SIR model, Weitz et al. [80] designed epidemiological interventions that can exploit the idea of 'shield immunity'. The main idea of the model is to deploy recovered individuals as focal points for sustaining safer interactions via interaction substitution. This method, however, cannot easily translate to health policy and/or individual level recommendations. In our study, we employ an agent-based SEIR model. The agent-based SEIR model has previously been employed to study the spread of epidemics in dynamic networks. For instance, Perez and Dragicevic [33] proposed a spatially explicit epidemiological model of infectious disease for understanding of the diffusion of a disease in a network of human contacts. In their model, human interactions are not fully dynamic, but are determined by the geographic area they are found at the same time. Yang et al. [81] proposed a flow-based edge betweenness method that detects important "bottleneck" edges in contact networks. They show that targeting those edges can contain the epidemic spread more than state-of-the-art edge betweenness methods. Their model does not easily translate to individual level policy, but offers high-level guidance for the network containment problem. Agent based epidemic spreading models are also very similar to the study of the epidemic spreading driven by random walks and one can translate to the other. Similar to the SEIR models, in the random walk based epidemic models, the infection is spread to the neighbors with a probability. This probability is determined by the transition matrix of the random walker. Pu et al. [82] proposed a biased random walk based spreading model. They show that the average node degree and homogeneity of the node degree plays an important role in the number of the infected nodes. More recently, Bestehorn et al. [83] derive an upper bound for the reproduction number based on a discrete-time Markovian random walk model of the infection spreading. New advances in deep learning and network represen-tation learning have also been used to model the epidemics [84] . Change et al. [85] propose an epidemic model based on the embedding of the mobility network and show the role of the social and economical disparities in the spread of the epidemics and the network structure. The main limitation of these models is that they model the epidemics on a static network and evolving aspect of the network is not analyzed. Heidari and Papagelis [76, 77] have proposed evolving network representation learning methods that lie in the intersection of these two topics and can be used to model the epidemic spreading in evolving networks.

Population heterogeneity in epidemics: Britton et al. [38] studied the effect of population heterogeneity on achieving the epidemiological objective of 'herd immunity' to the COVID-19 disease. In their study, they found that herd immunity can be achieved with less per cent of the population being infected than it was thought to be required (i.e., 60%), if one considers the age and social activity into the model. Similar ideas of evaluating the effect of individual variability in the disease spreading process have been considered by others. Our research is individual variability based on the structural information of a network; information that is related to the differences of nodes in the network based on basic graph/network characteristics (e.g., based on node centrality in the dynamic network, etc.). Related to our main thesis, Hébert-Dufresne et al. [86] argued that using R 0 alone to predict epidemic size is not enough in real-world outbreaks. They pointed out the necessity of considering heterogeneity in secondary infections to predict outbreak size. In practice, this can be done automatically, fairly cheaply, and highly accurately, by a wide-spread deployment and adoption of digital contact tracing technologies. Moreover, Lloyd-Smith et al. [87] pointed out that the basic reproduction number R 0 in the traditional epidemic analyses is a population-level estimate. Through a theoretical and statistical analysis, they showed that individual variation greatly affects epidemic growth rates, and therefore targeted control interventions would be more effective than population-wide ones. On a similar basis, Changruenngam et al. [88] studied the effect of human mobility on disease transmission dynamics in two contrasted countries. In the study, they incorporated individual human mobility in the SEIR model, which helped better describe infection spreading dynamics. In particular, based on population data of two areas, human mobility was modeled by obtaining the probability for an individual exploring new locations, from which the human mobility landscape could also be captured. Rocha and Masuda [89] augmented the SIR model by incorporating an individual-based approximation that captures the evolution of the probability that an individual is infected by another individual in the network. These studies highlighted the importance of incorporating individual variation in the epidemic model.

Instead of a complete or nearcomplete lockdown, P. Block et al. [42] proposed more moderate distancing strategies inspired by network science, including limiting contacts to similar, community-based or repetitive contacts. This study is based on a static network, while in a real-world situation, the network is dynamic informed by how individuals move around and interact with each other. Incorporating individual variation in the model has also the advantage that network interventions can be designed at an individual's level. For instance, Zhou et al. [90] reviewed several studies related to network immunization and concluded that vaccination of targeted nodes (e.g., nodes with large node degree), outperforms random immunization. Similarly, Torres et al. [91] evaluated different immunization strategies and found that node degree is a very strong measurement in determining node importance when considering the targeted nodes. In [92] , authors study the spread of epidemics on static and temporal networks. Epidemics on temporal networks is closely related to our work as it follows the same assumption in the network construction: the network evolves faster than the spread of the pathogen. However, that work assumes same degree for all the nodes in the temporal network. In our case, node degrees are determined by the mobility patterns of individuals.

Effective modeling of an emerging infectious disease has the potential to improve or save human lives. It can also minimize the societal and economic damage caused by physical distancing and confinement measures imposed by governing authorities to control an epidemic. Towards that end, we have presented a datadriven model of infectious epidemic spreading in spatiotemporal networks informed by mobility data of individuals. We designed and evaluated simple individual-based intervention strategies that exhibit network effects and can significantly control the spread of an infectious disease. We have also demonstrated that these targeted interventions can outperform generic intervention strategies. These strategies are easy to understand and translate to public health policy. While COVID-19 serves as a use case in this research, the same methodology can be used to model and mitigate any emerging infectious disease. An inherent limitation of our model is that it assumes availability of trajectories of individuals through digital tracing technologies. While these data are typically available to telecom and other third-parties through privacy-preserving techniques, our research relied only on realistic synthetic data sets.

We make source code and data sets used in the experiments publicly available 4 to encourage reproducibility of results.

Thucydides: History of the Peloponnesian War

Dna examination of ancient dental pulp incriminates typhoid fever as a probable cause of the plague of Athens

influenza: the mother of all pandemics

The 1918 influenza pandemic: insights for the 21st century

Epidemics and Pandemics: Their Impacts on Human History

The severe acute respiratory syndrome

Swine flu goes global: new influenza virus tests pandemic emergency preparedness

Middle East respiratory syndrome

Emergence of Zaire Ebola virus disease in Guinea

Zika virus outbreak

Coronavirus disease 2019 (covid-19): situation report

Contact tracing and disease control

Implementation and management of contact tracing for Ebola virus disease: emergency guideline, Tech. rep., World Health Organization

Quantifying sars-cov-2 transmission suggests epidemic control with digital contact tracing

Covid-19 digital rights tracker

Contact tracing: beyond the apps, preprint

Graph mining: laws, generators, and algorithms

Trajectory data mining: an overview

The efficacy of contact tracing for the containment of the

Fast and accurate mining of node importance in trajectory networks

A measure of betweenness centrality based on random walks

A faster algorithm for betweenness centrality

Connectivity and inference problems for temporal networks

Path problems in temporal graphs

Temporal node centrality in complex networks

Individual-based perspectives on r0

On the definition and the computation of the basic reproduction ratio r 0 in models for infectious diseases in heterogeneous populations

Infectious Diseases of Humans: Dynamics and Control

The mathematics of infectious diseases

A simple model for complex dynamical transitions in epidemics

Mathematical modelling of covid-19 transmission and mitigation strategies in the population of Ontario

An agent-based approach for modeling dynamics of contagious disease spread

Facing the covid-19 epidemic in nyc: a stochastic agent-based model of various intervention strategies

Epidemics on evolving graphs, preprint

A standard protocol for describing individual-based and agent-based models

A mathematical model reveals the influence of population heterogeneity on herd immunity to sars-cov-2

Two metres or one: what is the evidence for physical distancing in covid-19?

Stochastic Processes in Epidemic Theory

Percolation processes and related topics

Social network-based distancing strategies to flatten the covid-19 curve in a post-lockdown world

A comprehensive daily activity-travel generation model system for workers

Microscopic traffic simulation using sumo

Physical distancing, face masks, and eye protection to prevent person-to-person transmission of sars-cov-2 and covid-19: a systematic review and meta-analysis

Behaviors, movements, and transmission of droplet-mediated respiratory diseases during transcontinental airline flights

Assessment and mitigation of aerosol airborne sars-cov-2 transmission in laboratory and office environments

The concept of quarantine in history: from plague to sars

Lessons from the history of quarantine, from plague to influenza a

The impact of quarantine on mental health status among general population in China during the covid-19 pandemic

A systematic review of asymptomatic infections with covid-19

Clinical characteristics of asymptomatic and symptomatic patients with mild covid-19

Spatio-temporal data mining: a survey of problems and methods

Tensor methods for group pattern discovery of pedestrian trajectories

Trajectolizer: interactive analysis and exploration of trajectory group dynamics

A versatile computational framework for group pattern mining of pedestrian trajectories

Deep learning for spatio-temporal data mining: a survey

Learning semantic relationships of geographical areas based on trajectories

Epidemic contact tracing via communication traces

Epidemic spread in human networks

Modeling the impact of social distancing, testing, contact tracing and household quarantine on second-wave scenarios of the covid-19 epidemic

Acoustic-turf: acoustic-based privacy-preserving covid-19 contact tracing

Contact tracing mobile apps for covid-19: privacy considerations and related trade-offs

Preserving privacy in gps traces via uncertainty-aware path cloaking

Privacy-preserving contact tracing of covid-19 patients

Pact: Privacy Sensitive Protocols and Mechanisms for Mobile Contact Tracing

Decentralized Privacy-Preserving Proximity Tracing

A new statistical approach to geographic variation analysis

Polynomial-time approximation schemes for geometric intersection graphs

The relative neighbourhood graph of a finite planar set

Sur la sphère vide. a la mémoire de Georges Voronoï

Network reachability of real-world contact sequences

Temporal motifs in timedependent networks

Graph metrics for temporal networks

Evonrl: evolving network representation learning based on random walks

Evolving network representation learning based on random walks

IEEE Transactions on Knowledge and Data Engineering

Toward epidemic thresholds on temporal networks: a review and open questions

Modeling shield immunity to reduce covid-19 epidemic spread

Targeted pandemic containment through identifying local contact network bottlenecks, preprint

Epidemic spreading driven by biased random walks

A markovian random walk model of epidemic spreading

Representation learning on graphs: methods and applications

Mobility network models of covid-19 explain inequities and inform reopening

Beyond r0: heterogeneity in secondary infections and probabilistic epidemic forecasting, medRxiv

Superspreading and the effect of individual variation on disease emergence

How the individual human mobility spatio-temporally shapes the disease transmission dynamics

Individual-based approach to epidemic processes on arbitrary dynamic contact networks

Epidemic dynamics on complex networks

Node immunization with nonbacktracking eigenvalues, preprint

Epidemic processes in complex networks

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.