key: cord-327681-c2kmog0g authors: Feng, Zhilan; Yang, Yiding; Xu, Dashun; Zhang, Pei; McCauley, Mary Mason; Glasser, John W. title: Timely identification of optimal control strategies for emerging infectious diseases() date: 2009-07-07 journal: J Theor Biol DOI: 10.1016/j.jtbi.2009.03.006 sha: doc_id: 327681 cord_uid: c2kmog0g BACKGROUND: Health authorities must rely on quarantine, isolation, and other non-pharmaceutical interventions to contain outbreaks of newly emerging human diseases. METHODS: We modeled a generic disease caused by a pathogen apparently transmitted by close interpersonal contact, but about which little else is known. In our model, people may be infectious while incubating or during their prodrome or acute illness. We derived an expression for [Formula: see text] , the reproduction number, took its partial derivatives with respect to control parameters, and encoded these analytical results in a user-friendly Mathematica™ notebook. With biological parameters for SARS estimated from the initial case series in Hong Kong and infection rates from hospitalizations in Singapore, we determined [Formula: see text] 's sensitivity to control parameters. RESULTS: Stage-specific infection rate estimates from cases hospitalized before quarantine began exceed those from the entire outbreak, but are qualitatively similar: infectiousness was negligible until symptom onset, and increased 10-fold from prodrome to acute illness. Given such information, authorities might instead have emphasized a strategy whose efficiency more than compensates for any possible reduction in efficacy. CONCLUSIONS: In future outbreaks of new human diseases transmitted via close interpersonal contact, it should be possible to identify the optimal intervention early enough to facilitate effective decision-making. Early in 2003, a physician infected while treating patients with atypical pneumonia in Guangdong Province, China, infected other travelers in their Hong Kong hotel. On returning home to Singapore, Taiwan, Toronto, and Vietnam, they transmitted the pathogen causing the disease, later named severe acute respiratory syndrome (SARS), to local residents. Global spread of this hitherto unknown pathogen led WHO to issue travel advisories and some national health authorities to quarantine travelers from affected areas. As subsequent infections were largely nosocomial, hospital infection-control procedures were increasingly enforced. Absent knowledge of the onset of infectiousness, local authorities also quarantined community contacts. Faced with an infectious disease of unknown etiology, policymakers acted quickly (Gerberding, 2003) . Case-series were among the earliest sources of information (e.g., presenting symptoms, duration of distinguishable clinical stages, and outcomes). Upon isolation of the etiologic agent, experience with illnesses caused by related pathogens became germane. While these actions were thoughtful, in retrospect some were unnecessary. We have since developed a model with which policymakers could use similar information-which should be available in future outbreaks, especially of new diseases causing serious morbidity-to assess the likely impact of available interventions. Because identifying infected people before they become ill is difficult, quarantine is inefficient; e.g., only 11 of 238 probable cases were identified before symptom onset in Singapore (Tan, 2005) and 24 of 480 in Taiwan (Hsieh et al., 2005) . Isolating people with symptoms that may herald disease, especially if they might have been exposed to someone since diagnosed, would be more efficient. Impact is proportional to the product of efficiency and efficacy, which must exceed 1 À 1=< 0 for control, where < 0 is the average number of sufficiently intimate contacts for transmission while infectious. Consequently, infected people must be highly infectious or infectious long before becoming ill for quarantine to be a better strategy than simply encouraging those with early symptoms to seek care and ensuring their proper disposition. To weigh possible interventions to control infectious disease outbreaks, modelers must faithfully represent transmission. Our model allows infected people to become infectious before or after becoming ill. It permits quarantine while asymptomatic or isolation on seeking medical care and being diagnosed, at rates and with efficiencies that depend on clinical stage. Because health communications could influence these social phenomena, our model also allows hospitalization rates during distinguishable clinical stages to evolve. How effectively cases are isolated, when authorities begin searching for their contacts, if ever, and the proportion found, also may vary temporally. Policymaking is difficult enough with accurate and complete information, neither of which is available during outbreaks of new diseases. Using only the Hong Kong case series (Lee et al., 2003; Tsang et al., 2003) and admissions to Tan Tock Seng Hospital (TTSH) through 24 March, when quarantine began, we estimated that infected people were not particularly infectious until acutely ill, a consensus reached months later (WHO, 2003) . As the distribution of infectiousness largely determines the intervention of choice (Fraser et al., 2004; Day et al., 2006) , this information is critical. Armed with such results, we might have dissuaded policymakers from quarantine, suggesting they instead encourage people with symptoms that might herald disease to seek care, especially if exposed to someone since diagnosed, and-if indicated-ensure their proper isolation. This would have minimized the number of non-infected individuals detained without substantially compromising efficacy. Besides recommending the most promising interventions in future outbreaks, we could revise early estimates of parameter values as additional or better information became available, monitor intervention impacts, and recommend changing course if necessary. This would ensure the best possible use of available public health resources and minimize social disruption. The belief that quarantine contributed substantially to the control of SARS persists despite the modest number of secondary infections that could have been prevented (e.g., the product of roughly 0.9 secondary infections per probable case in Singapore 3 and 11 probable cases quarantined-assuming perfect isolation and no competing risks-is roughly 10 cases). But modeling enables one to calculate final outbreak sizes with and without interventions conditional on others (i.e., to relax both assumptions). Moreover, 116 possible contacts per probable case were restricted to their homes in Taiwan, 316 including travelers (Hsieh et al., 2005) . In Singapore and Beijing, 38 and 13 contacts per probable case were restricted, respectively (Tan, 2005; Pang et al., 2003) . As others cared for these well, but unproductive people, the economic cost was enormous. To assist in controlling other newly emerging diseases, we modeled a generic respiratory illness caused by pathogens transmitted by close interpersonal contact. To evaluate actual interventions for SARS, we estimated some biological parameters from the case series in Hong Kong, adjusted others to fit clinically diagnosed probable cases in Singapore, and calculated outbreak sizes under alternative scenarios. Motivated by the social cost of quarantine relative to its contribution to control in Singapore, we also derived analytical expressions policymakers could use with limited information to determine the most effective strategy for controlling future outbreaks of new human diseases. Because models are hypotheses, their predictions must be evaluated and the causes of any disparities identified and remedied (Quine and Ullian, 1970) , especially before use to inform public policy. Because complex models are difficult to evaluate, Einstein advocated modeling as simply as possible (May, 2004) . To simplify, modelers commonly assume individuals move among disease stages at constant specific rates, whereupon sojourns are exponentially distributed. A property of exponential distributions-that time spent in any stage is independent of time elapsed before entry-is inconsistent with the natural history of infectious diseases. Depending on one's purpose, this discrepancy between biology and mathematics may be inconsequential. But it affects conclusions about the impact of quarantine and isolation (Feng et al., 2007) , public health interventions differing only in timing. The gamma distribution does not have the memory-less property, so is more realistic biologically (Wearing et al., 2005) . Models with gamma distributed sojourns are equivalent to ones in which stages of duration T are composed of n sub-stages each with duration T/n. To illustrate the difference, Feng et al. (2007) use the expected remaining sojourn at time-in-stage s, M n (s). For the gamma distribution with nX2, As M 0 n ðsÞo0 and lim s!1 M n ðsÞ ! T=n; here T ¼ 1=y; evidently M n (s) strictly decreases with stage time s. When s is large, moreover, the expected remaining sojourn can be as small as T/n. Hence, time remaining depends on time already spent. The gamma distribution [p n (s) for nX2] thus provides a more realistic description of disease natural history than the exponential [p 1 (s), for which M 1 (s) ¼ T for all s]. Feng et al. (2007) derived a system of ordinary differential equations from a general integral equation model suitable for evaluating quarantine and isolation by replacing the exponential with the gamma distribution. Our model resembles theirs (Fig. 1 ), but we distinguish the prodrome and acute respiratory phase and allow infected people to become infectious before or after becoming ill, at rates-products of contact rates and probabilities of transmission on contact-that may differ among stages. Thirty-two suspect SARS patients were reclassified as probable post-discharge by virtue of laboratory test results, but we do not know how many secondary infections they caused. Our model's biological parameters derive mostly from observations of early SARS patients in Hong Kong (Lee et al., 2003; Tsang et al., 2003) . We estimated time-varying control parameters by minimizing sums of squared differences between cumulative model hospitalizations, wE þ f P I P þ f R I R -where wE are those hospitalized from quarantine and fI from successive symptomatic stages, denoted by subscripts-and probable cases admitted to Tan Tock Seng Hospital (TTSH). We estimated these parameters twice, once from the first 30 days of hospital admissions (Table 1) , before quarantine began, and once from admissions during the entire outbreak (Table 2) . As we were uncertain of many parameter values, we chose either gamma or triangular distributions for Latin hypercube sampling (Table 3) . From the literature just mentioned, we obtained mean values for 1/a, 1/d P , and 1/d R of 6, 4, and 8 days, respectively. Because the gamma's mean is the product of its parameters, here called A and B, we know that B ¼ mean/A. Note that A ¼ 1 corresponds to the exponential distribution, and that the gamma's variance decreases with increasing A. We chose A ¼ 3 for all three distributions, and thus, B ¼ 6 3 ¼ 2, 4 3 , and 8 3 , respectively. (A ¼ 3 also corresponds to three sub-stages in these respective states.) As for their upper and lower values, we chose ones for which the area under the probability density function (pdf) in between was 95-98%. Similarly, for the gamma distributions of b E , b P , and b R , we used our best estimates as means (Table 2) . We chose A values of 2, 2, and 3 (i.e., assumed the variance of b E and b P exceed that of b R ), so the B values are 0:024 2 , 0:152 2 , and 0:582 3 . Each range was again chosen so that the area under the pdf in between was 95-98%. Results are similar if the A values are neither too large nor close to 1. Next we stratified the area under the corresponding density function (pdf) into M equally probable intervals (i.e., each having area 1/M, where M is at least 3 4 the number of uncertain variables, K). Letting x 0 , x 1 , y, x N be sub-interval end points, the parameter's pdf to be f(x), and its cumulative distribution function (cdf) to be F(x): Because x 0 is the lower limit of f(x), the remaining x i 's can be obtained from . . . ; M: Finally, we selected points at random from each interval. The M samples drawn for one input parameter were paired randomly with the M for another, those M pairs were paired randomly again with the M for a third, and so on. Random paring continued until samples of all K input parameters had been paired to form an M by K table. We estimated the infectious stage-specific contributions to the realized reproduction number, < i , using these M sets of input parameters. We compared final outbreak sizes with our best estimates of time-varying control parameters (i.e., quarantine and hospitalization rates and isolation efficiencies) in Singapore with hypothetical alternatives. Differences with and without interventions, or with advanced or delayed timing, are estimates of cases (and hence deaths) averted, conditional on others. Such assessments tacitly assume independence (e.g., quarantine of possible contacts did not affect the timeliness with which actual ones who developed compatible symptoms sought medical care). Social distancing may have increased with quarantine in Hong Kong (Lo et al., 2005) , but synergistic population responses in Singapore were more likely due to very effective health communications. Our analytical results include an expression for the realized reproduction number <, its partial derivatives with respect to control parameters and relationship to < 0 , the intrinsic reproduction number (Appendix A). A Mathematica TM notebook that evaluates these expressions for user-supplied parameter values, comparing the impact of non-pharmaceutical interventions on any disease transmitted by close contact, is available from the Table 1 Parameter estimates (b represents infection, r isolation efficiency, f hospitalization; subscripts E, P and R refer to pre-symptomatic, prodrome, and acute respiratory stage) from fitting predicted to observed cumulative admissions to TTSH during the first 30 days of the outbreak (cf. Fig. 2 ). Times correspond to 14 and 24 March, and 8 April, when control efforts changed (e.g., began or intensified). Chi represents the per capita rate at which exposed individuals are quarantined. Assumed distributions before and after introduction of control measures on 3/24-including not only quarantine and isolation, but exemplary health communications-are either gamma (G) or triangular (T). corresponding author. Version 6 or higher of this software or its player, available from http://www.wolfram.com/products/player/, also is required. Empirical results for SARS include parameter estimates from hospital admissions during the first 30 days and entire outbreak in Singapore by epoch; assessment of the reproduction number <, and impact of control measures via sensitivity analyses and final outbreak sizes under hypothetical scenarios. Trial periods in Table 1 correspond to before 14 March, when the second generation of cases began being reported, and between 14 and 24 March, when home quarantine began. In addition to these, Table 2 's trial periods correspond to between 24 March and 8 April, when cumulative cases abruptly increased by about 10%, and after 8 April. Fig. 2 illustrates model fits to hospital admissions with these two parameter sets. The infection rates, b, differ quantitatively (e.g., those during the acute phase are two to three times those during the prodrome and 21-25 times those during the incubation period, respectively subscripted R, P, and E) when estimated from admissions during the first 30 days or entire outbreak (Tables 2 and 3, respectively), but not qualitatively (i.e., infectiousness is negligible until patients are acutely ill). Isolation efficiency r among those hospitalized during the prodrome increased from 0.16 during the first epoch to 0.76 during the last. Among those hospitalized while acutely ill, efficiency increased from 0.16 to almost 1. Similarly, the hospitalization rate f increased from 0.16 to 0.7 during the prodrome, while that during acute illness increased from 0.25 to 0.88. Both isolation efficiencies and hospitalization rates are respectively subscripted P and R. Estimates for intervening periods are intermediate (Table 2) . Our estimates of < and contributions of each infectious state before and after control measures were introduced, given the assumed distributions and ranges described in Table 3 , are shown with their associated uncertainties in Table 4 and Fig. 3 . Our estimates of <, 3.16 and 0.86 before and after 24 March are comparable to Wallinga's and Teunis' (2004) 3.1 and 0.7 before and after 12 March, respectively. Sensitivity to control parameters is shown in Table 5 . Isolation efficiency during the incubation period hardly affects <, as infectiousness is negligible then. Because people did not seek care until acutely ill prior to 24 March, < also is relatively insensitive to isolation efficiency during the prodrome or, by virtue of nosocomial transmission during that epoch (Leo et al., 2003) , hospitalization while acutely ill. Final outbreak sizes (when birth and death are ignored, i.e., m ¼ 0) would have been larger had interventions been delayed Fig. 2 . Cumulative probable cases by date of symptom onset in Singapore (symbols) and fits to the first 30 days (dashed line) and entire outbreak (solid line). Parameter estimates differ quantitatively (i.e., they are higher initially), but are similarly qualitatively (Tables 1 and 3) , particularly the ratio of infection rates during successive clinical phases. Table 4 were calculated. ( Fig. 4a) , or had isolation efficiency declined, especially of acutely ill patients (Fig. 4b) . Final size would have been smaller had greater proportions been quarantined (Fig. 4c) or hospitalized, particularly during the prodrome (Fig. 4d) . Motivated by the desire to contribute more to the control of newly emerging diseases in future than mathematical epidemiologists contributed to past control efforts, we modeled a generic disease caused by a pathogen transmitted by close interpersonal contact, (1) derived some general analytical results and (2) analyzed one SARS outbreak to determine how we might have helped had (a) this model been available and (b) authorities used it to inform their decision-making. Our model is suitable for any directly transmitted pathogen (Fig. 1) , though inevitably less so than pathogen-specific models. Its biological states are clinical, so sojourns are estimable from early case series. To model SARS, we set the incubation period at 6 days (Lee et al., 2003) and prodrome and acute respiratory stages at 4 and 8 days (Tsang et al., 2003) ; these references were available at http://www.nejm.org on 4 7 and 3 31 , respectively. We assumed quarantined people spent half of their incubation periods at large, and varied only the proportion quarantined, but when infected people are quarantined must be a function of the stage at which those who infected them are diagnosed. Similarly, we combined the rate at which people sought medical care and their probability of diagnosis, but as these phenomena involve different social systems, they may evolve independently. Social responses are exceedingly important, but this model must be useful with insufficient information to fully characterize them. Infection is a more familiar composite parameter than quarantine or hospitalization, being decomposed into constituent contact rates and probabilities of transmission on contact only as needed (e.g., in sexually transmitted diseases, asymmetric transmission between genders or other partners may affect the impact of available interventions). Transmission of pathogens causing respiratory diseases also may be asymmetric (e.g., between children and adults), but the impact on interventions is less clear, so the composite is used. Parameter estimates based on the first 30 days of hospital admissions indicate that-consistent with numbers of secondary infections among patients isolated on successive days after symptom onset (Lipsitch et al., 2003) and observed viral loads (Peiris et al., 2003) -infectiousness of patients with SARS was negligible until symptom onset and only one third during their prodrome as while acutely ill (Tables 2 and 3) . Estimates based on the entire outbreak indicate temporal variation in the rates at which ill people sought care, probabilities of diagnosis and efficacy of isolation (Table 2) . Identifying infected people before they become ill is so difficult that quarantine is exceedingly inefficient. Consequently, unless infected people could infect many susceptible ones before becoming symptomatic-by virtue of being very infectious or infectious long before symptomatic-public health authorities should focus on identifying people with symptoms that may herald disease, encouraging them to seek care, especially if they might have been exposed to someone since diagnosed. Similarly, they should assist clinicians in diagnosing, and hospital infection control personnel in isolating effectively, those people before they become acutely ill. To evaluate actual or hypothetical control measures, one must compare otherwise identical scenarios. Community intervention trials approach this ideal, but only modeling attains it. Not using all available measures that could be effective as expeditiously as possible during actual outbreaks of potentially lethal diseases would be unthinkable, but model outbreaks can be compared with and without interventions, alone or in various combinations, or with interventions at different times, to assess their impacts. Our final size calculations with and without quarantine, but all else equal (i.e., experiments), indicate its impact would have been comparable to hospitalization during the prodrome (cf. Figs. 4c and d). But this assumes authorities could identify infected people during their incubation period. Reassuringly, our independent estimate of the rate at which exposed people were quarantined, w (Table 2 ) resembles the proportion of probable SARS patients actually quarantined, 11 238 (Tan, 2005) . In Taiwan, where this proportion was strikingly similar, 24 480 (Hsieh et al., 2005) , quarantined individuals were suspected of having SARS earlier than others (within 1.2 versus 2.89 days, respectively), but not reclassified any sooner (Hsieh et al., 2005) . Our final size calculations illustrate the importance of timely interventions, but as only probable cases had priority for limited isolation facilities, evidently these shorter onset-diagnosis intervals contributed little. In contrast, the rate at which probable cases were hospitalized during the prodrome, f P , increased from about 0.16 to 0.7 during the outbreak in Singapore (Table 2) , contributing much more to control. The potential impact of quarantine has been so regularly confused with its actual contribution to SARS' control that one wonders if this longtime staple of public health (McNeill, 1977; Rosner, 1995) has been rigorously evaluated before. Day et al. (2006 ) refine Fraser et al.'s (2004 conditions for quarantine to be effective: the number of asymptomatic infections and fraction preventable-essentially the number of pre-symptomatic infections-must be large and the probability of infected individuals being quarantined high. Short of everyone staying home after a household member becomes ill, the last condition will be difficult to meet for respiratory diseases. As the temporal distribution of infectiousness largely determines the optimal public health response (Fraser et al., 2004; Day et al., 2006) , this is among the most important epidemiological unknowns early in outbreaks of new human diseases. If patients could be infectious before developing symptoms, quarantine must be considered. But identifying people whose contacts were sufficiently intimate for infection is extremely difficult, especially given uncertainty about the mode of transmission. The gain in efficiency-by ensuring instead that people with early signs and symptoms seek medical care, clinicians diagnose, and infectioncontrol personnel isolate them effectively-may more than compensate for any loss of efficacy due to infections during the prodrome. Modeling can help to determine this distribution and, conditional on that essential information, to evaluate the relative impact of various possible public health interventions. Mark Chen, Annelies Wilder-Smith, Heng Bee Hoon, and Leo Yee-Sin shared their observations from Tan Tock Seng Hospital, and many colleagues reviewed earlier drafts of this manuscript. We are grateful to Klaus Dietz for the opportunity to participate in the workshop, Design and Analysis of Infectious Disease Studies, at the Mathematisches Forschungsinstitut Oberwolfach, and for subsequent correspondence about sojourns. The reviewers' helpful comments and suggestions further improved the manuscript. Z. Feng's research was partially supported by the CDC via contract 200-2007-M-23524 and the National Science Foundation via grant DMS-0719697. Here we present the system of equations corresponding to Fig. A1 ; an expression for <, the realized reproduction number, derived via the approach of Feng et al. (2007) ; take its partial derivatives with respect to control parameters, and describe the intrinsic number, < 0 , as a special case. A Mathematica TM notebook available from the corresponding author evaluates these expressions for user-supplied parameter values. A.1. Mathematical model and R ¼ R(t) denote numbers susceptible, exposed (infected, but not yet symptomatic), quarantined, symptomatic, hospitalized (isolated), and recovered (or immune) individuals at time t, respectively. Primes denote derivatives, lettered subscripts the prodrome and acute respiratory stages and numbered ones substages, of which there are m, n and l, respectively (Fig. A1) : Pj ¼ nd P I PðjÀ1Þ À ðf P þ nd P þ mÞI Pj ; j ¼ 2; :::; n, H 0 Pj ¼ f P I Pj þ nd P H PðjÀ1Þ À ðnd P þ mÞH Pj ; j ¼ 2; :::; n, Rj ¼ ld R I RðjÀ1Þ À ðf R þ ld R þ mÞI Rj ; j ¼ 2; :::; l, H 0 Rj ¼ f R I Rj þ ld R H RðjÀ1Þ À ðld R þ mÞH Rj ; j ¼ 2; :::; l, R 0 ¼ ld R I Rn þ ld R H Rl À mR, and N ¼ S+E+Q+I P +I R +H P +H R +R is the total population, whose size is constant by virtue of equal per capita rates of birth and death, m. Other rates, reciprocals of mean sojourns, are: w, quarantine; f, hospitalization; and progression during the incubation, a and successive stages of acute illness denoted by subscripts, d: In l(t), the force of infection, the b E,P,R are transmission coefficients and r E,P,R fractional reductions due to quarantine at home or other suitable facilities and isolation in hospitals. Given the transition probabilities, T i and T i and sojourns, D i and D i defined in Table A1 , < is given by Fig. A1 . Model diagram with sub-stages denoted by lettered and numbered subscripts, e.g., the incubation period is partitioned into E 1 , E 1 oE j oE m , and E m ; the prodrome into I P1 , I P1 oI Pj oI Pn , and I Pn , and the acute respiratory phase into I R1 , I R1 oI Rj oI Rl , and I Rl . Definitions of transition probabilities, T i and T i , and sojourns, D i , and D i . Definition Probability of surviving exposure to enter the prodrome Quarantine adjusted probability of surviving exposure Probability of surviving the prodrome to become acutely ill Quarantine adjusted probability of surviving the prodrome Mean death-adjusted sojourn in the exposed stage Mean quarantine-and death-adjusted sojourn in the exposed stage The derivatives of < with respect to quarantine, hospitalization during the prodrome and acute illness and the efficacy of isolation during these stages are @< @w nd P þ f P þ m n 8 > > > > > < > > > > > : Their relationship is given by Clearly, < ¼ < 0 when all control parameter values are zero, as D i ¼ D i and T i ¼ T i , making the three terms in brackets all equal to zero. Also,