key: cord-256610-0njpw3zz authors: Ferguson, Neil M title: Mathematical prediction in infection date: 2005-03-01 journal: Medicine DOI: 10.1383/medc.33.3.1.61124 sha: doc_id: 256610 cord_uid: 0njpw3zz Abstract It is now increasingly common for infectious disease epidemics to be analysed with mathematical models. Modelling is possible because epidemics involve relatively simple processes occurring within large populations of individuals. Modelling aims to explain and predict trends in disease incidence, prevalence, morbidity or mortality. Models give important insight into the development of epidemics. Following disease establishment, epidemic growth is approximately exponential. The rate of growth in this phase is primarily determined by the basic reproduction number (R0), the number of secondary cases per primary case when the population is susceptible. R0 also determines the ease with which control policies can control epidemics. Once a significant proportion of the population has been infected, not all contacts of an infected individual will be with susceptible people. Infection can now continue only because new births replenish the susceptible population. Eventually, an endemic equilibrium is reached whereby every infected person infects one other individual on average. Heterogeneity in host susceptibility, infectiousness, human contact patterns and the genetic composition of pathogen populations introduces substantial additional complexity into the models required to model real diseases realistically. The contribution concludes with a brief review of the recent application of mathematical models to emerging human and animal epidemics, notably the spread of HIV in Africa, the variant Creutzfeldt-Jakob disease epidemic in the UK and its relationship to bovine spongiform encephalitis in cattle, the 2001 foot and mouth epidemic in UK livestock, bioterrorism threats such as smallpox, and the SARS epidemics in 2003. An epidemic is a chain reaction of disease spread within a population. Epidemics can be described and sometimes predicted by mathematical models because they involve relatively simple processes occurring within large populations of individuals. 'Simple' means that infection and disease progression can be characterized by the transition of an individual from one state to another; for example, from a state of being uninfected and susceptible to infection, or to an infected state following contact with an infectious individual, or from the infected to the recovered state following recovery from infection and the acquisition of immunity. Defining a model therefore involves classifying the possible infection states of an individual and the processes causing movement between those states. The aim then is to predict changes through time in the proportions of the population in different infection states, and the incidence of disease-related events. This contribution discusses the modelling of epidemics, but similar techniques can be used to study the pathogenesis of viral or bacterial infections within an infected individual. An understanding of basic concepts is informed by consideration of the spread of a new infectious disease within a large population with no prior exposure. This process can crudely be divided into three phases (Figure 1) , Establishment -the epidemic grows from the first infected t individual to a point at which sufficient individuals are infected that random extinction of the disease is unlikely. Many epidemics can die out after infection of only one or two individuals, if the last infected individual recovers (or dies) before infecting anyone else. This randomness means that the establishment phase can be relatively long and is variable in duration -a possible explanation for the several decades that passed between HIV entering the human population before 1950 and the recognition of a large-scale epidemic in the early 1980s. Exponential growth -growth of epidemics is approximately exponential following disease establishment. How fast an epidemic grows is largely determined by two factors -the number of secondary cases generated by one primary case when the epidemic starts (the basic reproduction number, R 0 ) and the average time taken for secondary cases to be infected by a primary case (generation time, T G T T ). R 0 characterizes the intrinsic transmissibility of infections and depends on both disease biology (determining Neil M Ferguson is Professor of Mathematical Biology in the Department of Infectious Disease Epidemiology at Imperial College London, UK. infectiousness) and host population structure (determining contact rates). T G T T is largely determined by the incubation period of the disease and the duration of infectiousness. R 0 > 1 is essential to epidemic growth because, for an infection to be self-sustaining, every infected individual must infect at least one other. The target of vaccination campaigns is to reduce R 0 below 1, thereby driving the disease extinct. Endemicity -once a significant proportion of the population is immune, dead or chronically infected, not all contacts of an infected individual are with susceptible people. Hence, the number infected by one individual is now reduced by the proportion of the population that is no longer susceptible. This causes the incidence of new infections to decline, and when the susceptible population is exhausted, the primary epidemic is over. Disease extinction is again possible by chance, particularly in small populations. Otherwise, infection continues, with new births replenishing the susceptible population, and eventually an endemic equilibrium is reached in which every infected person infects one other individual on average. From the definition of R 0 , this occurs when 1/R 0 of an individual's contacts are susceptible at any point in time, and thus R 0 not only determines the epidemic growth rate, but also the proportion of the population that are infected by the disease, and the steady-state incidence of infection. In theory, knowledge of R 0 (and the duration of infectiousness) enables simple prediction of many key aspects of epidemic and endemic disease transmission. In reality, estimation of multifactorial quantities such as R 0 is problematic, except in some childhood diseases (e.g. measles) for which serological data can be used. Disease epidemiology typically has many additional complexities: Establishment Exponential growth Endemicity Epidemics spread through contact (e.g. person to person) The 'chain reaction' nature of the process results in exponential growth (once infected numbers are great enough to make random effects small), until there are no further susceptible contacts MEDICINE 33:3 • the nature of the transmission process (e.g. insect-borne, foodborne) • heterogeneities in the pathogen population (e.g. multiple strains with different virulence and antigenic properties) • patterns of host contact (e.g. stronger mixing within age-defined or geographically defined peer groups than without, variability in contact rates because of differences in, for example, sexual behaviour) • long and variable incubation periods (e.g. HIV) • seasonally varying contact rates (e.g. measles). Incorporation of this complexity is often essential to generate a model that it is not just qualitatively reasonable (in the sense of reproducing the basic pattern of observed trends), but capable of quantitative predictions. Sufficiently realistic models thus tend to incorporate multiple parameters that must be estimated from available epidemiological, clinical and behavioural data. Biostatistical techniques for such estimations are developing rapidly, but many problems remain, largely caused by the non-linear nature of transmission models and the extremely complex dynamics that they sometimes produce; this can include chaotic behaviour and stochastically induced intermittency in epidemic timing. Dynamic complexity can also place fundamental limits on the predictability of trends in the incidence of infectious disease. In some cases, short-term prediction of the course of an epidemic may be possible. In others, models may be predictive only in a probabilistic sense; for example, it might be possible to predict the number of epidemics of a certain size likely to occur in the next 10 years, but impossible to precisely predict weekly disease incidence over the next few months. Distinguishing these situations is a key challenge in epidemic modelling. Despite these challenges, most infectious diseases have now been modelled, and modelling has achieved notable successes in recent years ( Figure 2 ). • Trends in the incidence of HIV in southern Africa in the last 5 years have validated predictions made more than 10 years ago (and then often dismissed) that AIDS would reverse population growth in Sub-Saharan Africa in the 21st century. 1 • Models have given insights into the potential scale and duration of the epidemic of variant Creutzfeldt-Jakob disease caused by the bovine spongiform encephalitis epidemic in cattle. • In the UK in 2001, mathematical models had a pivotal role in shaping disease-control policy during the foot and mouth epidemic in livestock -the first example of real-time application of such techniques. 2 • During the severe acute respiratory syndrome epidemic in 2003, epidemiological analysis and modelling quantified key and then unknown disease parameters such as incubation period distribution, case mortality rate and transmissibility (as quantified by R 0 ). 3 • Greater computer power and improved statistical techniques are enabling construction of increasingly realistic simulation models, which are having a key role in contingency planning for bioterrorism 4 and influenza pandemic planning. The spread of HIV-1 in Africa -sexual contact patterns and the predicted demographic impact of AIDS The role of mathematical modelling in the control of the 2001 FMD epidemic in the UK In 2001, mathematical models were used to predict the course of the foot and mouth disease epidemic in UK livestoc K k, and to evaluate the potential effect of control measures. 2 The predictions above were made by a team at Imperial College London and released by the UK government in early April 2001. K They indicated that additional culling or vaccination at farms f f neighbouring infected farms was nec f f essary to control the epidemic; simply accelerating the slaughter of animals at infected farms was insufficient.In 2003, models were used to analyse the SARS epidemic. 3 The graph below shows the fit of a model to the epidemic in Hong Kong. This analysis showed that R 0 for SARS was about 3.