key: cord-0801964-ba2dxgk4 authors: Sharma, A.; Sapkal, S.; Verma, M. K. title: Universal epidemic curve for COVID-19 and its usage for forecasting date: 2020-11-10 journal: nan DOI: 10.1101/2020.11.07.20220392 sha: 805c18a815b2815fd47b775aa00c91a9d6cd7c76 doc_id: 801964 cord_uid: ba2dxgk4 We construct a universal epidemic curve for COVID-19 using the epidemic curves of eight nations that have reached saturation for the first phase, and then fit an eight-degree polynomial that passes through the universal curve. We take India's epidemic curve up to September 22, 2020 and overlap it with the universal curve by minimizing square-root error. The constructed curve is used to forecast epidemic evolution up to January 1, 2021. The predictions of our model and those of supermodel for India are reasonably close to each other considering the uncertainties in data fitting. epidemic evolution. Towards this aim, in this paper, we construct a universal epidemic curve for COVID-19 and use this curve to forecast the epidemic evolution in India. The SIR model. constructed by Kermack and McKendrick [21] , is one of the first models for epidemic evolution. In this model, the variables S and I describe respectively the numbers of susceptible and infected individuals, while the variable R represents the removed individuals who have either recovered or died. SEIR model, which is a generalization of SIR mode, includes exposed individuals, E, who are infected but not yet infectious [6, 14] . Researchers have constructed derivatives of the above models to include lockdowns and travel restrictions, asymptomatic infections, etc. For example, Peng et al. [34] constructed a seven-variable model that includes quarantined and death variables and predicted that the daily count of exposed and infectious individuals in China will be negligible by March 30, 2020. Chinazzi et al. [12] and Hellewell et al. [19] studied the effects of travel restrictions and isolation on epidemic evolution. Mandal et al. [29] constructed a India-specific model that includes intercity connectivity. Shayak et al. [41] modelled epidemic evolution using delayed-differential equations. In addition, Rahmandad et al. [35] has also used a model to predict Indian epidemic growth. Schüttler et al. [39] showed that I(t) or total death count could be modelled using the error function. Asymptomatic carriers play a major role in the spread of COVID-19 epidemic, hence there have been many attempts to model this effect. In particular, Ansumali et al. [2] and Robinson et al. [38] have created SAIR model that takes into this important factor. Recently, Vidyasagar at al. [49] and Agrawal et al. [1] have adopted SAIR model to construct an epidemic evolution for In-. CC-BY-NC-ND 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 10, 2020. ; https://doi.org/10.1101/2020.11.07.20220392 doi: medRxiv preprint dia; this model, termed as supermodel, has many predictions. For example, it predicts 10.6 million cases by the end of this year. Data-driven models are also used for epidemic forecast. Recent analysis of COVID-19 data reveals that the epidemic curve begins with an exponential growth, after which we it follows a sequence of power laws [53, 22, 28, 7, 11, 10, 48, 31, 42, 3] . The epidemic curve flattens after square-root growth. Motivated by this observation, in this paper we construct a universal epidemic curve for COVID-19 by appropriate normalization. We use the epidemic data of the first phases of epidemic growth. The above universal behaviour [28, 32] can be utilized for the predictions of epidemic in various countries. In this paper we overlap India's epidemic curve on the universal curve by appropriate normalization. We observe that the model predictions describe the past data quite well. In particular, the model forecast for last five weeks are in good agreement with the observed data within 12.3%. In the next section we construct the universal epidemic curve using the epidemic data of several countries. To construct the universal curve for the COVID-19 epidemic, we take the epidemic evolution curves of eight countries: France, Spain, Italy, Switzerland, Turkey, Netherlands, Belgium and Germany. We chose these countries because they have reached saturation for the first phase. We obtained the data from 'EU Open Data Portal' [13] and WorldOMeter [51] websites. The starting dates of the data collection for these countries are given in Table 1. We take the same end date, 30 June 2020, for all the nations. We consider the curves for cumulative infection counts (I(t)) versus number of days (t). To construct the universal curve, we normalize the curves for the selected countries by dividing I(t) and t with I max and t max respectively. I max and t max for each country are defined as the value of I(t) and t as on 30th June 2020 (see Table 1 ). The normalized I(t) curves indeed exhibit a universal behaviour, as shown in Fig. 1 . The dashed lines represent individual countries, whereas the solid black curve represents the average of all the eight countries. Note that the universal curve starts with an exponential part and then it contains various power-law regimes before reaching a saturation. Refer to Fig. 2 and references [10, 48, 7, 31, 42, 3] for more details on various power-law regimes of the epidemic curves. Rather than fitting with various power laws at different stages of the epidemic, we fit a large-degree polynomial that passes through the universal curve after the exponential regime. This polynomial is listed in Table 2 . 3 Modelling Indian epidemic curve using the universal curve After the construction of an unversal epidemic curve for COVID-19, we attempt to overlap the Indian epidemic is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 10, 2020. We determine these quantities approximately using the procedure outlined in Algorithm 1. We estimate t max and I max by minimizing the following function: where P (t/t max ) is the polynomial fit for the universal function, shown in Table 2 . We choose t max and I max for which the error is minimum (see Algorithm 1). Fortunately, the process converges towards the unique minimum. The numerical procedure yields I max = 26.85 million and t max = 850 days for which the value of error in equation 1 is 0.1023. These values yield a maximum overlap for India's normalised curve on the Universal curve (Fig. 4) . We expect to get a better fit with more data (after later date). Algorithm 1: Optimizing I max and t max . Input: I(t) and t as 1D arrays 1 Function P (x): is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 10, 2020. After this exercise, we construct India's epidemic curve I(t) using the universal curve. By multiplying t/t max by t max and I(t)/I max by I max , we obtain the predictions for I(t) for any date, as shown in Fig. 5 . We construct a curve forİ(t), which is the daily cases, by taking a numerical derivative of I(t) curve. Table 2 . Once India's epidemic curve I(t) has been constructed, we can forecast infection count at any date, as shown in Fig. 4 and Fig. 5 . Note that the India's I(t) curve overlaps with the universal curve quite well. Regarding the forecast, the universal curves indicate that the linear regime starts at around t/t max =0.25. For India, this time translates to the last week of September. Note that the daily cases are approximately constant in the linear regime, but they start to decrease after the linear regime. Table 3 Predictions of new COVID-19 cases on weekly basis using the universal curve or the best-fit polynomial. Week Actual weekly new cases (in millions) Predicted cases with percentage errors (in millions) Week In Table 3 , we list weekly new cases, along with the model predictions, for India. We also illustrate these numbers in Fig. 6 . The model predictions are close to the actual data, with the maximum error if 12.3%. A closer look however reveals that the model's peak for the daily cases is delayed compared to the actual cases. This is somewhat expected due to uncertainties in determination of the universal curve. On the whole, our model predictions appear to be reasonably robust. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 10, 2020. ; https://doi.org/10.1101/2020.11.07.20220392 doi: medRxiv preprint Fig. 6 Bar chart of the weekly new cases of India's epidemic (also see Table 3 ). As described in the introduction, there are interesting low-dimension models of epidemic evolution. These models are refinement of SEIR model. In this section, we compare our model predictions with some of the leading epidemic models for India. In one such model, Rahmandad et al. [35] forecasted that in early 2021, the daily infections count in India will reach 0.287 million (2.87 lacs). Also, refer to Song et al. [43] . Our model predicts that the daily count will much lower than the above numbers, unless India faces a devastating second wave. India's supermodel [49, 1] , which is based on SAIR model [2, 38] , has gained major prominence recently. This model predicts that India may have reached herd immunity with around 38 crores of the population either infected or having antibodies. One of the predictions of the supermodel is that the infection counts at the end 2021 would be 10.6 million. For January 1, 2021, our prediction for the total infection count is approximately 14.57 million, and that for daily cont is 81 thousands. These are over-predictions compared to supermodel, but the predictions are reasonably close. We believe that our present model could be improved significantly by employing machine learning algorithms. We construct a universal epidemic curve for COVID-19 using the first-phase epidemic curves for the eight countries: France, Spain, Italy, Switzerland, Turkey, Netherlands, Belgium and Germany. The curves for the individual countries collapse to a single curve within a stan-dard deviation of 0.089 indicating a generic behaviour of the epidemic. We also construct a eight-degree polynomial that fits with the universal curve. Universality of the epidemic curve is an important landmark considering that many major physical phenomena exhibit universality (e.g., law of gravitation, phase transition, etc.). Note however that the universality in epidemic is somewhat surprising considering major differences in demography, government actions, lockdown conditions, etc. The discovery of the universal epidemic curve gives us an interesting handle for forecasting the epidemic evolution. An advantage of this approach over others is that it is purely data-driven. Hence, we do not need to model the differential equation. A disadvantage of this method is that we do not have any control parameter. For example, SAIR model can be tuned by changing the coefficient of some terms of the differential equations, but we cannot do so in our model because we do not have any control over the data. We compared India's reported epidemic curve with the universal curve with appropriate scaling. We observed that India's present epidemic curve fits with the part of the universal curve. This discovery enables us to forecast the epidemic evolution. We observe that our forecasts for 5 weeks match with the observed data within 12.3%, which is quite encouraging considering so many uncertainties. Note however that our predictions tend to be systematically larger than the actual data, which could be due to errors in the construction of the universal curve. Our model predicts that the daily cases for India's COVID-19 epidemic are falling, which is consistent with the observations. This result indicates the cumulative I(t) has reached a linear regime for India's epidemic. The predictions of our models and those of the supermodel are reasonably close to each other. The universal curve could be further refined using more advanced algorithm, such as machine learning and deep neural networks. Also, we believe that this model is robust for modelling the second and third waves of COVID-19 as well. In addition, it will be interesting to work out the universal curves for the daily cases, as well as for the active cases. We are in the process of such extensions. Modelling the spread of SARS-CoV-2 pandemic -Impact of lockdowns & interventions Modelling a Pandemic with Asymptomatic Patients, Impact of Lockdown and Herd Immunity, With Applications to SARS-CoV-2. Annual Reviews in Control Evolution of COVID-19 Pandemic in India. Transactions of the Indian National Academy of Engineering Sentiment analysis of nationwide lockdown due to covid 19 outbreak: Evidence from india A predictive model for the evolution of covid-19 Epidemics: Models and Data using R Power-law distribution in the number of confirmed COVID-19 cases Healthcare impact of covid-19 epidemic in india: A stochastic mathematical model The 2019 novel coronavirus disease (covid-19) pandemic: A review of the current evidence Evolution of covid-19 pandemic: Power-law growth and saturation Momentum managing epidemic spread and Bessel functions The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak Covid-19 india cases tracker Epidemic Modelling: An Introduction A systematic review of SARS-CoV-2 vaccine candidates Modelling the covid-19 epidemic and implementation of populationwide interventions in italy Variation in government responses to covid-19. Blavatnik school of government working paper Oxford covid-19 government response tracker Feasibility of controlling COVID-19 outbreaks by isolation of cases and contacts. The Lancet Global Health The mathematics of infectious diseases Containing papers of a mathematical and physical character Patterns of the COVID19 epidemic spread around the world: exponential vs power laws. medRxiv preprint Herd immunity-estimating the level required to halt the covid-19 epidemics in affected countries Transmission Dynamics of 2019-nCoV in Malaysia. medrxiv preprint India under covid-19 lockdown The covid-19 vaccine development landscape A modified SEIR model to predict the COVID-19 outbreak in Spain and Italy: simulating control scenarios and multi-scale epidemics. medarxiv preprint Strong correlations between powerlaw growth of COVID-19 in four continents and the inefficiency of soft quarantine strategies Prudent public health intervention strategies to control the coronavirus disease 2019 transmission in India: A mathematical model-based approach Computational epidemiology Data-driven modeling reveals a universal dynamic underlying the COVID-19 pandemic under social distancing Modelling the downhill of the Sars-Cov-2 in Italy and a universal forecast of the epidemic in the world Epidemic analysis of COVID-19 in China by dynamical modeling. medarxiv preprint Estimating COVID-19 under-reporting across 86 nations: implications for projections and control Temporal Dynamics of COVID-19 Outbreak and Future Projections: A Data-Driven Approach. Transactions of the Indian National Academy of Engineering Predictions, Role of Interventions and Effects of a Historic National Lockdown in India's Response to the the COVID-19 Pandemic: Data Science Call to Arms A model for the emergence of drug resistance in the presence of asymptomatic infections Covid-19 predictions using a Gauss model Modeling and forecasting for covid-19 growth curve in india. medRxiv preprint Transmission dynamics of COVID-19 and impact on public health policy. medRxiv preprint Short-term predictions of country-specific Covid-19 infection rates based on power law scaling exponents An epidemiological forecast model and software assessing interventions on COVID-19 epidemic in China. medRxiv preprint Coronavirus Disease 2019 Vaccine Development: An Overview. Viral Immunology The trinity of covid-19: immunity, inflammation and intervention Modelling and analysis of covid-19 epidemic in india. medRxiv preprint Prediction for the spread of covid-19 in india and effectiveness of preventive measures. Science of The Total Environment p Covid-19 pandemic: Power law spread and flattening of the curve Indian Supermodel for COVID-19 Pandemic The global impact of covid-19 and strategies for mitigation and suppression Fractal kinetics of COVID-19 pandemic. medRxiv preprint Acknowledgements We thank Soumyadeep Chatterjee, Shashwat Bhattacharya, and Asad Ali for useful discussions. This project is supported by a SERB MATRICS project SERB/F/847/2020-2021.