key: cord-0204957-lccd5s6z
authors: Bai, Fan
title: An age-of-infection model with both symptomatic and asymptomatic infections
date: 2021-11-03
journal: nan
DOI: nan
sha: e1083855c0d80a534462e7a54071b0a271f711ad
doc_id: 204957
cord_uid: lccd5s6z

We formulate a general age-of-infection epidemic model with two pathways: the symptomatic infections and the asymptomatic infections. We then calculate the basic reproduction number $mathcal{R}_0$ and establish the final size relation. It is shown that the ratio of accumulated counts of symptomatic patients and asymptomatic patients is determined by the symptomatic ratio $f$ which is defined as the probability of eventually becoming symptomatic after being infected. We also formulate and study a general age-of-infection model with disease deaths and with two infection pathways. The final size relation is investigated, and the upper and lower bounds for final epidemic size are given. Several numerical simulations are performed to verify the analytical results.

Asymptomatic infections are recorded for COVID-19 (Coronavirus SARS-CoV-2) pandemic and have been studied from many different aspects. Evidence suggests that asymptomaticity has significant impacts on the development of 20] ). Furthermore, because of its "silently transmissible" nature, the pathway of asymptomatic infection is considered as a key component for understanding the mechanisms of transmission and evaluating the efficiency of intervention measures. However, quantification of asymptomatic infections and further inference of value of symptomatic ratio f is tricky due to the unreliable recorded numbers of symptomatic patients and asymptomatic patients at any specific time post epidemic outbreak. For COVID-19, it is suggested that the asymptomatic ratio is around 35.1% ( [21] ) and it is highly age dependent. Other studies based on daily incidence data from certain geographic locations and using various parameter estimation techniques indicate very different values of symptomatic/asymptomatic ratio (e.g., [17] [18] [19] ). The purpose of this paper is to formulate the epidemic models with two infection pathways, by considering different formats of the symptomatic ratio f (either a constant or a piece-wise function), and obtain the relation between the ratio, the basic reproduction number and the final epidemic size. Another focus is to compare the final sizes of symptomatic infections and asymptomatic infections.

The Age-of-infection model was originally derived by Kermack and McKendrick in their well-known paper [16] , it is the general form of epidemic models ( [4-6, 9, 15, 25] ). This type of epidemic model is especially useful to study the epidemic models in which periods are far from exponential distributed, e.g., the current COVID-19 pandemic. The formulation of the basic reproduction number for the age-of-infection model was well studied in [25] and the final size relation has been established in [6] . The age-of-infection model with disease-induced mortality was also studied in [6] , an inequality of final size relation was developed. The general model was also extended for the heterogeneous mixing population in [12] . In this paper, we formulate and study an age-of-infection model for epidemics with both symptomatic infections and asymptomatic infections. The derivation and comparison of the basic reproduction number R 0 and the formulation of final size relations are the main focus of the study. We also consider the age-of-infection model with two infection pathways and with disease-induced mortality.

This paper consists of five parts. In section 3, we firstly review the basic general age-of-infection model in a homogeneously mixing population. In section 4, we formulate the age-of-infection model with two pathways, the symptomatic infection and the asymptomatic infection. We then calculate the basic reproduction number and obtain the corresponding final size relation. We will focus on the ratio of total symptomatic cases and total asymptomatic cases in this study, which entirely depends on the symptomatic ratio f . As an example, we study a special case of Susceptible-Exposed-Infectious-Asymptomatic-Removed (SEIAR) model, by assuming exponentially distributed disease stages. In section 5, we further consider the disease-induced deaths for the age-of-infection model with two infection pathways and investigate the final size relations. In section 6, we perform several sets of numerical simulations to verify our theoretical results. In section 7, we summarize the investigation and also propose some interesting future work in the direction of age-of-infection modeling.

We have S(t) denote the number of susceptibles at time t and have φ(t) be the total infectivity at time t. φ(t) is defined as the sum of products of the number of infected members with each infection age and the mean infectivity for that infection age ( [25] ). We assume that on average each member of the population make a contacts per unit time. The population size is N and there are no demographic effects (births, deaths, migration) in the population. We further define B(τ ) the fraction of infected members remaining infected at infection age τ and π(τ ) the mean infectivity at infection age τ . B(τ ) is a non-increasing function and satisfies that

The general age-of-infection model is

where φ 0 (t) is the total infectivity of members of the population who were infected at the initial time, at time t. The following Figure 1 demonstrates the dynamics of the general model. It is worth mentioning that the number of infectives is described as

It is straightforward to derive the equation about the change of number of infectives ( [25] )

where B ′ reflects the recovery of infected individuals. It has been well studied in [6, 25] , that the basic reproduction number for model (1) is

The final size relation for model (1) is

The uniqueness of S ∞ has been confirmed in [6] . If the initial infectivity is small and neglectible, the final size relation can be simplified as

4 The age-of-infection model with both symptomatic and asymptomatic infections

We extend the general age-of-infection model (1) by considering that the infected individuals are possibly asymptomatic or symptomatic. It is assumed that once an individual is infected, there is a probability 0 < f < 1 that this individual eventually becomes symptomatic, and the probability of becoming asymptomatic is (1 − f ). We denote φ i (t) the sum of products of the symptomatic individuals with each infection age and the mean infectivity for that infection age, φ a (t) the sum of products of the asymptomatic individuals with each infection age and the mean infectivity for that infection age. Thus, the total infectivity is

B i (τ ) and B a (τ ) represent the fraction of symptomatic and asymptomatic individuals remaining infected at infection age τ , respectively; π i (τ ) and π a (τ ) represent the mean infectivity of symptomatic and asymptomatic individuals at infection age τ , respectively. A i (τ ) := B i (τ )π i (τ ) is the mean infectivity of a symptomatic individual at infection age τ , while A a (τ ) := B a (τ )π a (τ ) is the mean infectivity of an asymptomatic individual at infection age τ . The dynamics is depicted in Figure 2 . It is noticed that the recovered symptomatic/asymptomatic patients are theoretically counted as one stage of the infection process, but with no infectivities. The age-of-infection model is

It can be proved that model (6) is an extension of the general age-of-infection model (1). We first calculate the basic reproduction number for model (6) ,

This form is consistent with Theorem 3.1 in [25] that the basic reproduction number R 0 depends only on the mean period in each infective stage, regardless of its distribution. We now generate the final size relation for model (6) . We first write

Integration with respect to t from 0 to ∞ yields

For each infection case, the probability of proceeding to the symptomatic state is f and the probability of proceeding to the asymptomatic state is (1 − f ). Thus, it is intuitive to conclude that the total number of symptomatic patients is f (S 0 − S ∞ ) and the total number of asymptomatic patients is (1 − f )(S 0 − S ∞ ). We now perform some calculations to verity this conjecture. Firstly, the numbers of symptomatic patients I(t) and asymptomatic patients A(t) are

We now focus on I(t) and calculate the derivative of I(t),

In Equation (9), the first term indicates the rate of new symptomatic infections and the second term represents the transition from infected stage to recovery stage for symptomatic patients. Thus, we have

We then integrate both sides of the equation with respect to t from 0 to ∞,

We state the following theorem to summarize all calculations.

Theorem 4.1. If the population size N is large and the number of initial infection cases is small, the final size of an age-of-infection model with both symptomatic infections and asymptomatic infections is

with R 0 being expressed in Equation (7). Moreover, the total estimated cases of symptomatic infections are f (S 0 −S ∞ ), while the total estimated cases of asymptomatic infections are (1−f )(S 0 −S ∞ ).

If the initial infection cases are not neglectible, we have the following inequality,

It is possible that the ratio f is not a constant, but a function about t. This may be caused by the mutations of responsible viruses. The model then becomes

The basic reproduction number R 0 for model (13) depends on initial ratio f (0) and is given by,

f (t) may take the form of a Heaviside function (with one or multiple switch times) or its smooth approximations (generalized logistic functions). Explicit final size relation for model (13) is impossible to obtain, but can be approximated by Equation (3) 

is small or the switch time(s) is late or close to the end of the epidemic. Some numerical simulations will be performed to show that the final sizes can not be accurately predicted, but be appropriately approximated.

In practice, the counts of symptomatic patients are relatively accurate. While the counts of asymptomatic patients basically rely on serology reports and surveys. Thus, it is generally a challenging task to correctly infer the value of f based on epidemiological data. This leads to an interesting problem of the comparison of outcomes by using the actual f and the inaccurately inferred f ′ .

First we have a similar formula for R ′ 0 based on f ′ :

We denote

to represent the average secondary infection cases caused by a symptomatic patient or an asymptomatic patient, when being introduced into a wholly susceptible population.The basic reproduction number is the linear combination of R i and R a . The difference between R ′ 0 and R 0 depends on the relation between ratios f , f ′ , R i and R a ,

Therefore, the relation between f and f ′ is insufficient to determine how incorrect fraction will affect the assumed R 0 . It is critical to compare the values of R i and R a . In practice, it is reasonable to assume that the inferred ratio f ′ is larger than the real ratio f , due to the fact that the asymptomatic cases are more easier to be missed and the asymptomatic infections are underestimated. The other situation is that the asymptomatic infections are overestimated, if the presymptomatic cases are incorrectly categorized as asymptomatic infections. This case indicates a smaller inferred f ′ . However, the comparison of R i and R a is trivial and difficult to obtain in real life. We next introduce the established result (see Lemma 

A special case of the general age-of-infection model (6) is the epidemic model with exposed stages, for both infection pathways. It is noticed that the exposed stages can be significantly different for two different types of patients. Therefore, we have the following SEIAR type of model,

Now we show that model (18) can be written in the form of the general age-of-infection model (6) . We denote µ i (τ ) and µ a (τ ) the fractions of symptomatic and asymptomatic patients with infection age τ who are not yet infectious, respectively, and also define ν i (τ ) and ν i (τ ) the fractions of symptomatic and asymptomatic patients with infection age τ who are infectious, respectively ( [6, 9] ). The basic dynamics of SEIAR model (18) is described in Figure 3 . We first focus on the upper symptomatic infection path (S → E i → I → R) in Figure 3 , µ i (τ ) and ν i (τ ) satisfy the ordinary differential equation

with initial condition µ i (0) = 1 and ν i (0) = 0. We are able to obtain the solution of the ODE system (19) 

The change of fractions µ i (τ ) and ν i (τ ) with a chosen set of parameter values (κ i = 0.07 and α i = 0.2) is visualized in Figure 4 . The function B i (τ ) is implicitly expressed by two functions µ i and ν i in Equation (20) . It is straightforward to have the averaged infectivities for both exposed patients and symptomatic patients for π E = 0 and π I = 1. Based on the definition of A i (τ ), we have

Similarly, the lower asymptomatic infection path (S → E a → A → R) can be analyzed in the same manner. It implies

From the formula for the basic reproduction number R 0 in Equation (7), we have

This result can also be verified by using the next generation matrix approach ( [24] ). If we consider the averaged infectivity for asymptomatic patients is ǫ < 1, it can be calculated that A a (τ ) = ǫe −κ i τ + κa κa−αa [e −αaτ − e −κaτ ] and further we have

The final size relation for model (18) can be generated by Theorem (4.1). We can also use the alternative method ( [7] ) to verify this result. Firstly, we denote

if function f is a non-negative integrable function defined on 0 ≤ t < ∞. It follows the lemma (see [7] ), Lemma 4.1. If f (t) is a non-negative monotone nonincreasing continuously differentiable function,

this leads to I(∞) = 0 and A(∞) = 0. We further obtain the number of all infection cases

We then add the second equation and the fourth equation, the third equation and the fifth equation in (18) , respectively,

To integrate the above equations with respect to t from 0 to ∞ implies

It implies the linear relation betweenÎ andÂ,

From the first equation in model (18), we have

Using equations (22) and (23), the final size relation for model (18) can be obtained,

It is noticed that, in Equation (23),Î andÂ can be interpreted as the sum of lengths of infection periods for all symptomatic and asymptomatic patients, respectively. If we define the length of infection period for symptomatic patient j (j ∈ [1, 2, · · · , final size of symptomatic patients]) is 1 α i | j , the law of large numbers indicates that

Same argument can be made for asymptomatic infections. Apparently the second part of the above equation holds when the final size of symptomatic patients is relatively large, and 1 α i is the predefined mean length of infection period. Therefore, we have the final size of symptomatic patients isÎα i and the final size of asymptomatic patients is isÂα a . This also implies that the ratio of sizes of symptomatic and asymptomatic patients is f 1−f . In stochastic epidemic modeling with only symptomatic path (the Sellke stochastic model), a similar term a NÎ is defined to measure the total accumulative infection pressure on a given susceptible individual during the course of the epidemic ( [14, 22] ). (functions B i and B a ) can be varied significantly. The stage of pre-symptomatic in the pathway of symptomatic infection has been frequently included in modeling COVID-19 pandemic. For an individual who is pre-symptomatic, he or she can infect other susceptible individuals. The infectivity is likely to be even higher than the patients with symptoms ([1]). Thus, for upper infection pathway in Figure 3 , it is reasonable to add one more compartment "pre-symptomatic" to describe the intermediate infectious stage. The corresponding function B i can be constructed straightforwardly. It is also possible to consider that the stays in the exposed periods (or in other compartments) follow the Gamma distributions and assume multiple exposed sub-stages ( [3, 10, 23] ).

If we consider there are disease deaths, the total population size does not remain constant. We denote N (t) is the population size over time and N 0 is the initial population size. We further assume that a fraction µ 1 (µ 2 ) of symptomatic (asymptomatic) infectives dies of disease, and µ 1 > µ 2 . The contact rate a(N ) is a density dependent saturating function ( [6, 9] ), where a(N ) is a non-decreasing function about N and a(N ) N is a non-decreasing function about N . The age-of-infection epidemic model with disease deaths is

Because of the disease deaths, the exact final epidemic size for model (25) is impossible to obtain. As the basic reproduction number R 0 only depends on the initial states of the epidemic and the population, it can be explicitly defined as ( [5, 6, 11] )

We now establish the final size relation for model (25) with respect to the value of R 0 in (26). Based on the properties of functions a(N ) and a(N ) N ( [6, 7] ), ∀t ∈ [0, ∞), we have:

where N ∞ is the final size of individuals who survive after the epidemic ends. We now integrate the first equation in model (25) for S ′ S and obtain

Because a(N (t)) N (t) is bounded and continuous for t ∈ [0, ∞), we are able to find the constant N ⋆ such that N 0 ≥ N ⋆ ≥ N ∞ , and satisfy

Similarly, if we consider both the initial symptomatic cases and initial asymptomatic cases are zero or neglectible, the integral term ∞ 0 (φ i,0 (t) + φ a,0 (t))dt can be omitted in equation (29). The estimate of final epidemic size is

It is impossible to explicitly calculate the values of N ⋆ and a(N ⋆ ), because N ⋆ depends on many factors, such as the shapes of functions A i (t) and A a (t). However, we can obtain a relatively accurate estimates by employing the inequalities in (27),

It has been investigated in [2] that, S ∞ is uniquely determined by the value of R 0 . Indicated by Theorem 4.2 (Lemma 2.1 and Lemma 2.2 in [2] ), we have the lower and upper bounds for S ∞ . The first part of inequality in equation (31) implies that S ∞ is less or equal to the final susceptible population size, when the basic reproduction number is taken as R 0 and disease-induced deaths are not considered; the second part of inequality in equation (31) shows that S ∞ is larger or equal to the final susceptible population size, when the basic reproduction number is taken as 1 1−max{µ 1 ,µ 2 } R 0 and there are no disease deaths. It is clear to see that, if max{µ 1 , µ 2 } → 0, the elementary squeeze theorem indicates that log

We still argue that, if the ratio f is a constant, the number of total symptomatic cases is f (S 0 − S ∞ ) and the number of total asymptomatic cases is (1 − f )(S 0 − S ∞ ). We have the following theorem to summarize the analysis.

Theorem 5.1. Assume the epidemic model has both symptomatic infections and asymptomatic infections, the disease induced death rates are µ 1 and µ 2 , respectively. If the basic reproduction number is R 0 , then the final susceptible population size S ∞ has the lower bound of limiting susceptible population size with basic reproduction number 1 1−max{µ 1 ,µ 2 } R 0 and with no disease death; the upper bound is the limiting susceptible population size with basic reproduction number R 0 and without disease deaths. The fraction of total symptomatic cases and total asymptomatic cases is

This theorem is the complement of Theorem 4.1 in [6] , it provides the upper bound for the estimate of limiting final susceptible population size with disease deaths considered. It also indicates that when the disease induced death rates are larger, it is more challenging to accurately predict the outcome of the epidemic.

In this section, we perform two sets of numerical simulations to verify the analytical results in previous sections. The size of population is N = 10 5 and the assumed symptomatic ratio f = 0.6 which indicates that the probability for an infected individual becoming symptomatic is 60%. We further assume the contact rate a = 0.5 (/day). Regarding the infected stages, we have κ i = 0.07 (/day) and α i = 0.2 (/day) for symptomatic infection path; and have κ a = 0.03 (/day) and α a = 0.1 (/day) for parallel asymptomatic infection path. It is therefore calculated that the basic reproduction number R 0 = 3.5, and R i = 2.5 and R a = 5. Clearly, if the symptomatic ratio f is overestimated, the estimated R 0 is less than the actual basic reproduction number. Assumptions of standard incidence and exponentially distributed periods are made to simplify the numerical simulations.

We first consider there are no disease-induced deaths for both symptomatic infections and asymptomatic infections. If the ratio f is a constant, we are able to obtain the exact final epidemic size. The total numbers of symptomatic patients and asymptomatic patients during the course of the epidemic are 5796 and 3863, respectively. It can be calculated that the ratio of two numbers is f 1−f = 1.5. The dynamics is shown in the following Figure 5 . 

The simulations are presented in Figures 6,7,8,9 and the simulated outcomes are summarized in Table 1 . If we consider the simulation results for epidemic model with constant ratio as the baseline. When the switch of ratio occurs near the end of the epidemic, it is observed in Figure 9 that the final outcome can be approximated by the baseline. However, if the switch occurs earlier, the final epidemic can be significantly different and it is entirely unpredictable. 

We now consider the epidemic models with identical parameters but with a larger population size N = 10 6 and with disease induced deaths. Firstly, we assume the death rates are µ 1 = 0.2% and µ 2 = 0.1% and the dynamics of the model is simulated in Figure 10 . We are also able to obtain the number of total symptomatic cases R i (∞) + D i (∞) = 57966 and the number of total asymptomatic cases R a (∞) + D a (∞) = 38643. Therefore the final size of epidemic is S(0) − S(∞) = 96609. The ratio is 1.50. The lower bound for final size estimation can be obtained by simulating epidemic models without disease deaths, and we have S(0) − S(∞) = 96597. If we consider another epidemic model without disease death and with the basic reproduction number be R 0 /(1 − 0.2%), the final epidemic size is 96623 and this number is the upper bound for the estimation of final size for the original model with disease deaths. It can be observed that, since the death rates are very low, two bounds are both reasonable approximations. Figure 10 : The progression of the epidemic with disease death rates µ 1,2 = 0.2%, 0.1% and the accumulation of death cases.

As a comparison, we next consider the epidemic model with larger death rates µ 1 = 2% and µ 2 = 1%. The final epidemic size is 96708, while the upper bounds and lower bounds are 96724 and 96597. If the population size is larger, it becomes more difficult to accurately predict the final epidemic size.

We have formulated an age-of-infection model to study an epidemic with symptomatic infections and asymptomatic infections. Two separate pathways are initially determined by the symptomatic ratio f , which is the probability that an individual being infected and eventually becoming symptomatic. The basic reproduction number R 0 is calculated and its value depends on the symptomatic ratio f , the values R i and R a . The incorrect inferred value of f leads to the wrongly predicted final epidemic size. It is however a challenging task to infer these three values in practice. We then proved that, for the age-of-infection model with two infection pathways, the final epidemic size is uniquely determined by R 0 . The ratio of total numbers of symptomatic patients and asymptomatic patients is f 1−f . For the similar age-of-infection epidemic model with disease deaths, the calculation of basic reproduction number R 0 is not affected. And the final epidemic size can not be explicitly generated, but can be approximated by the given lower bounds and upper bounds. If the death rates for both symptomatic and asymptomatic patients are smaller, the final epidemic size can be more accurately predicted. Two sets of numerical simulations were performed for two types of models.

The age-of-infection epidemic model is the general structure for different types of compartmental epidemic models ( [13] ) and it can be extended for other scenarios, such as the disease spread in an age-structured population with different activity levels, the impact of vaccination in the population and the behavioral changes in the population. With respect to the fundamental analysis, the complete analysis of corresponding characteristic equation for such models is still very challenging ( [4, 8, 13] ).

Declaration of Competing Interest: None.

Evaluating different epidemiological models with the identical basic reproduction number R 0

The effect of delay in viral production in within-host models during early infection

Age of infection in epidemiology models

The kermack-McKendrick epidemic model revisited

Age-of-infection and the final size relation

Epidemic models with heterogeneous mixing and treatment

Mathematical epidemiology: Past, present, and future

Mathematical Models in Population Biology and Epidemiology

Mathematical Models in Epidemiology

Mathematical Epidemiology

Age of infection epidemic models with heterogeneous mixing

Age of infection epidemic models

Stochastic epidemic models: A survey

Epidemiological models with non-exponentially distributed disease stages and applications to disease control

A contribution to the mathematical theory of epidemics

Estimating the prevalence of asymptomatic COVID-19 cases and their contribution in transmission -using henan province, china, as an example

Inferring true COVID-19 infection rates from deaths

Estimation of the asymptomatic ratio of novel coronavirus infections (COVID-19)

Transmission of 2019-nCoV infection from an asymptomatic contact in germany

Asymptomatic SARS-CoV-2 infection: A systematic review and meta-analysis

On the asymptotic distribution of the size of a stochastic epidemic

Quantifying asymptomatic infection and transmission of COVID-19 in new york city using observed cases, serology, and testing capacity

Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission

Calculation of R0 for age-of-infection models

Acknowledgements: The work is dedicated to the author's mentor Dr. Fred Brauer (1932Brauer ( -2021. The author is grateful for Fred's guidance. This work is based on numerous discussions between the author and Fred. The author also acknowledges the Post-doctoral fellowship offered by Hausdorff Center for Mathematics and the University of Bonn.