key: cord-0207150-hkt38czb authors: Barak, Boaz; Nitzan, Mor; Tannenbaum, Neta Ravid; Yuval, Janni title: Optimizing testing policies for detecting COVID-19 outbreaks date: 2020-07-09 journal: nan DOI: nan sha: be28b16b75c86af14398fdb5e20ba67cb6d8c849 doc_id: 207150 cord_uid: hkt38czb The COVID-19 pandemic poses challenges for continuing economic activity while reducing health risk to individuals and preventing uncontrolled outbreaks. These challenges can be mitigated by extensive testing. We study testing policies that optimize a fixed budget of tests within a single institution (e.g., business, school, nursing home, etc.) by varying the number of batches that the tests are split into. We prove that, in an exponential spread model and for reasonable parameter values, the expected size of an outbreak at initial detection is smaller when random subgroups of the population are tested frequently, as opposed to periodic testing of the entire population. We also simulate the effect of different policies in a network SEIR model taking into account factors such as variable connectivity between individuals, incubation period, and inaccurate testing results. We show that under a broad set of early-outbreak scenarios, given a certain budget of tests, increasing testing frequency of random samples of the population will reduce the societal risk, defined as the number of infection opportunities until first detection. For example, testing a quarter of the institution members every week is generally better than testing the entire institution every month. In fact, in many settings, sufficiently frequent testing (combined with mitigation once an outbreak is detected) can decrease the risk even compared to the baseline when the institution is closed and testing is not conducted. The bottom-line is a simple policy prescription for institutions: distribute the total tests over several batches instead of using them all at once. As many countries begin the process of easing restrictions during the COVID-19 epidemic, the role of testing and tracing is crucial in monitoring and controlling the viral spread, * Harvard University. † Technion ‡ MIT § All authors contributed equally to this manuscript. and a key factor in policy adjustments [1] . Here we focus on a specific aspect of this challenge: how should institutions, including businesses, schools, hospitals, prisons, and nursing homes, use a limited budget of tests for outbreak detection and mitigation. As a running example, consider a company of a 1000 employees that is allocated 1000 tests per month. A straightforward strategy is to test all employees at the first day of each month. Here we argue that under a broad set of scenarios, this is sub-optimal, and that testing subgroups of the employees more frequently (e.g. ≈ 250 employees per week) will yield better outcomes. While detailed knowledge of institution members roles, connections within the institution, preexisting conditions, and other characteristics could be leveraged to optimize a testing protocol, in this work we aim to offer a simple and general strategy, and therefore consider testing schedules that are independent of individual characteristics. Such randomized testing policies, which are oblivious to symptoms, can also be crucial in the discovery of asymptomatic or pre-symptomatic individuals. We demonstrate the benefit of frequent testing of subsamples of the population in a simple model of exponential infection spread, as well as a network SEIR model. The SIR epidemic model [2] and its variations such as SEIR (e.g. [3] ) were recently shown to be useful as a modeling framework for COVID-19 analysis [4] . We also find that, perhaps surprisingly, in many scenarios, randomized testing can completely offset the excess risk to society from opening a workplace. That is, the total number of infection opportunities can be lower in the setting when a workplace is open with (sufficiently frequent) randomized testing as opposed to the baseline where the workplace is closed and hence workers have no chance to infect one another, but undergo no routine testing. For example, in several realistic settings of the overall prevalence of COVID-19 and the internal connectivity of the workplace, keeping the business open but testing a quarter of employees every week will reduce the number of infectious days compared to the baseline scenario where the business is closed but no randomized testing takes place ( Figure 1a ). Our main result is that it is generally beneficial to distribute tests in batches by randomly sub-sampling the population, relative to administering all tests at once. We focus on analyzing the period until the first detection of an outbreak within an institution. Once an outbreak is detected, a variety of mitigation techniques can be implemented (e.g., quarantine, contact tracing, temporary distancing, etc.) that are outside the scope of our model. We analyze the cost in two models: a simple (and analytically tractable) exponential spread model, and a more realistic network-based SEIR model. We consider an exponential spread model [5, 6] , where the infection grows by a factor ofR per time unit. We show that given a budget enabling testing of a p fraction of individuals per time period, if the tests are split into large number of batches then the expected number of infected individuals at the time of the first detection scales logarithmically withR and equals 1 + lnR p (methods). In contrast, if the same budget is used to periodically test the entire population, the number of infected individuals at the time of the first detection scales nearly linearly withR. While there can be some cases where the expected cost for frequent sub-population testing is higher than periodic population-wide testing, this difference is bounded, and for many setting ofR frequent testing performs much better. See the Methods section for the derivation and more details. We examine the effect of increasing testing frequency in a stochastic network-based model of SEIR (susceptible-exposed-infected-recovered) dynamics with a non-homogenous population, based on [7, 8] (Methods). We show that in most parameter settings, frequent testing of random sub-samples is superior to infrequent testing of the entire population ( Figure 1 , Figure 2b ). We also show that under several (though not all) realistic parameters, testing a quarter of the population per week is not just superior to testing all the population every 28 days, but even improves on the baseline of keeping a business closed where workers can get infected only from an external source (Figures 1a, 2a) . That is, the total number of infection opportunities will be lower in the setting that an institution is open (with increased intra-institution infections) but its members are tested periodically (with isolation and widespread testing once an outbreak is detected), compared to the baseline where the institution is closed but no random testing takes place. Specifically, we assume an institution of N members is given a budget of N tests per time T (and hence larger T corresponds to lower budget). We compare different testing strategies, where in each strategy the total number of tests is divided into k batches that are equally allocated over time. For example, in an institution of 1000 members, if we get 1000 tests every 28 days, then the "28/1" strategy correspond to testing all members once per 28 days, while the "28/4" strategy corresponds to randomly dividing the members to four groups of 250 people each, and testing a different group every week. Here we do not assume any additional prior knowledge that distinguishes between different individuals (e.g. their role in the company, relationships to other people in the community, risk factors, etc.). To couple between the institution dynamics and the disease spread in the society, we assume that there is a constant external infection rate P ex (e.g., value of 0.001 day −1 means that every day there is 1/1000 probability for each community member to be infected by an external source). We define the societal risk as the mean number of infectious community members per day up to the point of outbreak detection (Methods). We find that in general, consistent with our results for the exponential model, more frequent testing of subsamples of the population lowers the societal cost (see Figure 1 ). This holds broadly (though not always) across variations in the overall testing budget (Figure 1a) , different values for the external compares it to testing all population once every 28 days, and panel (c) compares it to opening the business with no testing at all. Values less than 100% correspond to cases where the risk in the 28/4 policy is better than the comparators. The "business closed" baseline risk is calculated as N · P ex /γ where P ex is the external infection probability, N is the total number of individuals, and 1/γ is the mean length of the infectious period. The "no testing" risk is obtained by running a simulation for 56 days with no testing, and measuring the average number of infected individuals. infection rate (Figure 1b) , and internal reproductive rate (Figure 1c , where the internal reproductive rate refers to the expected number of cases directly generated by one case within the institution where all individuals are susceptible to infection). The relative improvement of frequent testing increases when external infection rate is higher or internal reproductive rate is higher (Figure 2 ). In most regimes, testing in batches improves even over the baseline where the business is closed (and no testing policy is used), with relative improvement increasing when the external infection probability increases or internal reproduction rate decreases (Figure 2a ). Our work suggests frequent randomized testing as a way to enable safe re-opening of businesses and institutions. Significantly, increasing the frequency of testing can yield much better outcomes for the same overall budget. We show this both via simulations of realistic scenarios, and analytically for the exponential spread model. Specifically, our analytic formula for the exponential-spread cost shows that frequent sub-population testing enables the cost to grow at most logarithmically in the growth-factor of the infection, as opposed to nearly linearly in periodic full-population testing. Our simulation results for the network SEIR model demonstrate that in many settings, frequent subsample testing can be used to control outbreaks and the cost of keeping a business, school, or institution open. The parameters we use, including the length of incubation period, the probabilities of infection, and the false negative rates, are taken from the literature, but could be refined with further knowledge. However, these should not change any of the qualitative conclusions. We remark that a complete re-opening strategy will involve many factors beyond testing, including re-organization, screening for symptoms, and changing testing frequencies based on business role or risk factors. We do not model such strategies in this paper, beyond their effect on the internal reproductive number. Our exponential infection dynamics model assumes there is an initial infection at time t 0 and the number of infected people at time t isR t−t 0 , whereR is the growth rate per time unit. For this model, we consider the cost of detecting the outbreak at time t as the number of infected individuals at this time. 1 If we measure each individual with probability p/k per 1/k time periods, and assume no false negative, then the probability that we fail to detect an outbreak at time t 0 +t is (1−p/k)R t . The probability that we detect an outbreak at time t 0 + k is equal to the probability we failed to detect up until this point (which is i/k ) multiplied by the probability we detect at time n. Since the cost at time n isR n we get that the expected cost of the k batch strategy is , while k(R 1/k − 1) → ln R and 1 − (1 − p/k) R n/k ≈ pR t dt (where dt = 1/k and t = n/k). Hence the total expected cost converges to the integral ∞ t=0 exp( −p(R t −1) lnR )p·R 2t dt which can be verified to be − exp p·(1−R t ) lnR lnR p + R t t=∞ t=0 = 1 + lnR p . In the particular case of p = 1, this cost is 1 + lnR. In contrast, if we test the entire population once per time unit, if the infection begins at a random time then at the time of detection, the expected number of infected individuals is 1 0R t dt = (R − 1)/ lnR. For largē R, this cost grows nearly linearly withR while the cost for the frequent testing grows only logarithmically. While for smallR, the cost (i.e., expected number of infected individuals at detection) for the one batch strategy can be lower than the frequent testing cost, this difference can never be more than max(1 + ln x − (x − 1)/ ln x) = 3 − e ≈ 0.28 (which occurs whenR = e). However, using frequent testing could lead to a significantly better result compared to a single batch. For example, consider a budget which allows testing everyone once per month, and an exponential increase is by a factor of 20 per month, then the cost of a single batch strategy is ≈ 6.3, while testing with large number of batches would result in cost of ≈ 3.9, which is significantly lower. Moreover, frequent testing also reduces the tail probabilities of missing a large outbreak. All our code for the SEIR dynamics is available on the GitHub repository https://github. com/boazbk/seirsplus. We use the following parameters: σ = 1/5.2 (probability of exposed to become infected per day [9] ), γ = 1/14 (probability of infected to recover per day [10] ), ψ E = 0.33 (67% false negative rate for exposed [11] ), ψ I = 0.8 (20% false negative rate for infected [11] ). We inherit all other parameters from the package [8] on which our code is based. The internal reproductive number R I 0 is estimated as R I 0 = β γ , where β is the mean probability to be infected per day and 1 γ is the mean infection period in days. We ran 500 realization for each setting of parameters, varying testing policies, external infection probability P ex , and probability of internal infection β. The societal risk over a period of time is the mean number of infected and undetected individuals per day during this period. This definition captures the number of opportunities that institution members have on average to infect either other members or people in the external society. If the probability of a member to be infected each day from an external source is P ex then the baseline societal risk is approximately N · P ex /γ where N is the number of institution members and 1/γ is the mean infection period. The baseline societal risk captures the mean number of infection opportunities that are expected to occur per day among institution members if the business is closed. In our simulations we measured the societal risk from the beginning of the simulation until the point of first detection of an outbreak. Modelling the covid-19 epidemic and implementation of population-wide interventions in italy A contribution to the mathematical theory of epidemics An introduction to stochastic epidemic models Projecting the transmission dynamics of sars-cov-2 through the postpandemic period Infectious diseases of humans: dynamics and control Early dynamics of transmission and control of covid-19: a mathematical modelling study. The lancet infectious diseases Sir model on a dynamical network and the endemic state of an infectious disease Package for simulating SEIR model with testing policies Early transmission dynamics in wuhan, china, of novel coronavirusinfected pneumonia Report of the who-china joint mission on coronavirus disease Variation in false-negative rate of reverse transcriptase polymerase chain reaction-based sars-cov-2 tests by time since exposure