key: cord-0728232-f0ahn4iu authors: Coelho, Marco Tulio Pacheco; Rodrigues, Joao Fabricio Mota; Medina, Anderson Matos; Scalco, Paulo; Terribile, Levi Carina; Vilela, Bruno; Diniz-Filho, Jose Alexandre Felizola; Dobrovolski, Ricardo title: Exponential phase of covid19 expansion is not driven by climate at global scale date: 2020-04-06 journal: nan DOI: 10.1101/2020.04.02.20050773 sha: 6c9306850b5da6412d800d19643c80d3dc3af2dc doc_id: 728232 cord_uid: f0ahn4iu The pandemic state of COVID-19 caused by the SARS CoV-2 put the world in quarantine and is causing an unprecedented economic crisis. However, COVID-19 is spreading in different rates at different countries. Here, we tested the effect of three classes of predictors, i.e., socioeconomic, climatic and transport, on the rate of daily increase of COVID-19. We found that global connections, represented by countries importance in the global air transportation network, is the main explanation for the growth rate of COVID-19 in different countries. Climate, geographic distance and socioeconomics did not affect this big picture analysis. Geographic distance and climate were significant barriers in the past but were surpassed by the human engine that allowed us to colonize almost every corner on Earth. Based on our global analysis, the global network of air transportation could lead to a worst-case scenario of synchronous global pandemic if board control measures in international airports were not taken and are not sustained during this pandemic. Despite all limitations of a global analysis, our results indicate that the current claims that the growth rate of COVID-19 may be lower in tropical countries should be taken very carefully, at risk to disturb well-established and effective policy of social isolation that may help to avoid higher mortality rates due to collapse of national health systems. This is the case of Brazil, a well-connected tropical country that presents the second highest increase rate of COVID-19 and might experience a serious case of human-induced disasters if decision makers take into consideration unsupported claims of the growth rate of COVID-19 might be lower in tropical countries. With the worldwide spread of the novel Coronavirus Disease 2019 , caused by the SARS-CoV-2 virus, we are experiencing a declared pandemic. One of the largest preoccupations about this new virus regards its notable ability to spread given the absence of any effective treatment, vaccine and immunity in human populations. Epidemiologists quantify the ability of infectious agents to spread by estimating the basic reproduction number (R0) statistic (Delamater et al. 2019) , which measures the average number of people each contagious person infects. According to the World Health Organization (2020), the new coronavirus is transmitting at an R0 around 1.4-2.5, which is greater than seasonal influenza viruses that spread every year around the planet (median R0 of 1. 28, Biggerstaff et al. 2014) . To anticipate the timing and magnitude of public interventions and mitigate the adverse consequences on public health and economy, understanding the factors associated with the survival and transmission of SARS-CoV-2 is urgent. Because previous experimental (Lowen et al. 2007) , epidemiological (Shaman et al. 2010 , Barreca & Shimshack 2012 and modeling (Zuk et al. 2009 ) studies show the critical role of temperature and humidity on the survival and transmission of viruses, recent studies are testing the effect of environmental variables on SARS-CoV-2 (Wang et al. 2020 , Sajadi et al. 2020 ) and forecasting monthly scenarios of the spread of the new virus based on climate suitability (Araújo & Naimi 2020, but see Chipperfield et al. 2020) . Although temperature and humidity are known to affect the spread and survival of other coronaviruses (i.e., SARS-CoV and MERS-CoV, Tan et al. 2005 , Chan et al. 2011 , Doremalen et al. 2013 , Gaunt et al. 2010 ), using the current occurrences of SARS-CoV-2 cases to build correlative climatic suitability models without taking into consideration connectivity among different locations, geographical distance and socioeconomic conditions might be inadequate. Many factors might influence the distribution of diseases at different spatial scales. Climate might affect the spread of viruses because it affects many biogeographical patterns, including the distribution of diseases and human behavior (e.g., Murray et al. 2018) . Geographic distance represents the geographical space where the disease spread following the distribution of hosts and has also been found to explain biogeographic patterns (Pulin 2003 , Nekola & White 2004 , Warren 2014 . Socioeconomic characteristics of countries could be viewed as a proxy for the ability to identify and treat infected people and for the governability necessary to make fast political decision and avoid the spread of new diseases. Finally, the global transportation network might surpass other factors as it can reduce the relative importance of geographic distance and facilitate the spread of viruses and their vectors (Brockmann & Helbing, 2013; Pybus et al. 2015) . According to the International Air Transport Association (2019) more than 4 million passengers traveled abroad in 2018. This amount of travelers reaching every corner of the world represents the magnitude of how a human niche construction (i.e. global transportation network, Kendal et al., 2011; Boivin et al., 2016) could facilitate the global spread of viruses and vectors (Brockmann and Helbing, 2013; Pybus et al. 2015) in the same way it facilitated the spread of invasive species and domesticated animals over modern human history (Boivin et al., 2016) . The spread of SARS-CoV-2 from central China to other locations might be strongly associated with inter-country connections, which might largely surpass the effect of climate suitability. Thus, at this point of the pandemic, there is still a distributional disequilibrium that can generate very biased predictions based on climatic correlative modeling (De Marco et al. 2008) . Thus, here we used an alternative macroecological approach (e.g., Burnside et al. 2012) to investigate variations on the growth rates of SARS-CoV-2, based on the geographical patterns of exponential growth rates of the disease at country level. We studied the effect of environment, socioeconomic and global transportation controlling for spatial autocorrelation that could bias model significance. By analyzing these factors, we show that the exponential growth of COVID-19 is not driven by climatic and socioeconomical variables at global scale and is explained mainly by country's importance in global transportation network (i.e., air transportation). We collected the number of people infected by the COVID-19 per day from the John Hopkins (Dong et al., 2020) and European Centre for Disease Prevention and Control (ECDC, 2020) . This data is available for 173 countries, for which only 44 had more than 100 cases recorded and for which time series had at least ten days after the 100 th case. We also performed the analysis considering countries with more than 50 cases, but it did not qualitatively change our results. Thus, we only show the results for countries with more than 100 cases. In our analysis, we only used the exponential portion of the time series data (i.e. number of people infected per day) and excluded days after stabilization or decrease in total number of cases. We empirically modelled each time series using an exponential growth model for each country and calculated both the intrinsic growth rate (r) and the regression coefficient of the log growth series to be used as the response variable in our models. Because both were highly correlated (Person's r = 0.97), we used only the regression coefficient to represent the growth rate of COVID-19 in our study. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint . https://doi.org/10. 1101 To investigate potential correlates of the virus growth rate, we downloaded climatic and socioeconomic data of each country. We used climatic data represented by monthly average minimum and maximum temperature (C) and total precipitation (mm) retrieved from the WorldClim database (https://www.worldclim.org) (Harris et al. 2014, Fick and Hijmans 2017) . We used monthly available data for the most recent year available in WorldClim. We extracted climatic data from the months of January, February, March, and December to represent the climatic conditions of the winter season in the Northern Hemisphere and the summer season in the Southern Hemisphere. From these data, we computed the mean value of climatic variables across each country. Finally, minimum and maximum temperatures were combined to estimate monthly mean temperature for December, January, February, and March, which was used in the model along with total precipitation for the same months. However, using different combinations of these variables (i.e., using means of minimum or maximum temperatures, as well as minimum or maximum for each month) did not qualitatively affect our results. We extracted socioeconomic data for each country. Human Development Index (HDI) rank, mean number of school years in 2015, gross national income (GNI) per capita in 2011, population size in 2015 and average annual population growth rate between 2010-2015 were used in our study and downloaded from the United Nations database (http://hdr.undp.org/en/data). We also obtained a mean value of health investment in each country by averaging the annual health investments between 2005-2015 obtained from the World Health Organization database (http://apps.who.int/gho/data/node.home). Due to the strong collinearity among some of these predictors, HDI rank and mean number of school years were removed from our final model. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint . https://doi.org/10. 1101 Finally, we also downloaded air transportation data from the OpenFlights (OpenFlights, 2014) database regarding the airports of the world, which contained information about where each airport is located including country location (7,834 airports), and whether there is a direct flight connecting the airports (67,663 connections). We checked the Openfligths database to make the airports and connections compatible by including missing or fixing airport codes and removing six unidentified airport connections resulting in a total of 7,834 airports and 67,657 connections. We used this information to build an air transportation network that reflects the existence of a direct flight between the airports while taking into account the direction of the flight. Thus, the airport network is a unipartite, binary, and directed graph where airports are nodes and flights are links (Fig 1, Fig S1) . In the following step, we collapsed the airports' network into a country-level network using the country information to merge all the airports located in a country in a single node (e.g., United States had 613 airports that were merged in a single vertex representing the country). The country-level network (Fig 1) is a directed weighted graph where the links are the number of connections between 226 countries which is collapsed for the 44 countries that had more than 100 cases and for which time series data had at least 10 days after the 100 th case . Afterward, we measured the countries centrality in the network using the Eigenvector Centrality (Bonacich, 1987) , hereafter centrality, that weights the We evaluated the relationship between the predictors (climatic, socioeconomic and transport data) and our growth rate parameter using a standard multiple regression (OLS) after taking into consideration the distribution of the original predictors as well . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint . https://doi.org/10. 1101 as the normality of model residuals. Moreover, OLS residuals were inspected to evaluate the existence of spatial autocorrelation that could upward bias the significance of predictor variables on the model using Moran's I correlograms (Legendre and Legendre 2013). Prior to the analysis, we applied logarithmic (mean precipitation, total population size, and network centrality) and square root (mean health investments) transformations to the data to approximate a normal distribution. The models used to estimate COVID-19 growth rate on different countries showed an average R 2 of 0.96 (SD = 0.03), varying from 0.83 to 0.99, indicating an overall excellent performance on estimating growth rates. Only one out of the 44 countries (i.e., exponential growth phase for at least ten days after country had 100 confirmed cases) did not show an R 2 > 0.8 for model fitting, and, therefore, we removed this country from the following analysis. The geographical patterns in the growth rates of COVID-19 cases do not show a clear trend, at least in terms of latitudinal variation, that would suggest a climatic effect at macroecological scale ( Fig. 2A) . We build one model including only climate and socioeconomic variables, which explained only 19% of the variation on growth rates with a significant (p < 0.025) and negative coefficient for annual population growth rate. This model did not have spatial autocorrelation in the residuals. When we added country centrality (i.e. country importance in global transportation network) as a predictor, the R 2 increased to 34.5%. However, socioeconomic and climatic variables had no significant effect (see Table 1 ). In this model, the exponential growth rates increased in response to countries importance in the global transportation networks (Fig 2B, Fig 2C) which is the only significant effect of the model (p < 0.0019). Mean precipitation had a marginal . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint . https: //doi.org/10.1101 //doi.org/10. /2020 significant effect in this model (p = 0.054), with a positive coefficient (i.e. drier countries have lower growth rates), although effect size is at least two times lower than the effect of countries importance in global transportation (Table 1) . Statistical coefficients were not upward biased by spatial autocorrelation. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint . https://doi.org/10. 1101 The pandemic state of SARS CoV-2 put the world in quarantine and is causing an unprecedented economic crisis. The rates of increase of new cases of COVID-19 is faster in some countries than others. To understand why growth rates are different among countries we investigate the effect of climatic, socioeconomical and human transportation variables that could have important roles on the exponential phase of COVID-19. At global scale, temperature, precipitation, mean number of school years, Gross National Income and health investments had no significant effect on the exponential phase of COVID-19, suggesting that the fast initial spreading of the disease might behave similarly in different countries, despite differences in climatic and socioeconomic conditions. Countries' importance in the global transportation network is the only variable with a significant association with COVID-19 growth rates (Fig 2) . The centrality measure is widely used to discover distinguished nodes on many networks, including epidemiological networks (e.g., Madotto and Liu, 2016) . Our findings reinforce the importance of propagule pressure on disease dissemination (Tian et al 2017 , Chinazzi et al. 2020 . It is quite likely that further phases of COVID-19 spread, in terms of peak of infections and decrease in mortality rates, are better related to socioeconomics characteristics of each county and their political decisions when secondary transmissions were identified. We can already clearly identify the effects of adopting strong social isolation policies in China (see Kraemer et al. 2020 ) and, on the opposite side of this spectrum, in European countries like Italy, Spain and England (Enserink and Kupferschmidt 2020). Our analyses call attention to the case of Brazil, a well-connected tropical country that presents the second highest increase rates of COVID-19 in its exponential phase (Fig 1A) . If decision makers take into consideration . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint . https: //doi.org/10.1101 //doi.org/10. /2020 yet unsupported claims that growth rates of COVID-19 in its exponential phase might be lower in tropical countries because of climate, we might observe terrible scenarios unrolling in tropical countries, especially in those with limited health care structure, such as Brazil. When discussing and modelling the effect of climate on SARS CoV-2 it is important to remember that modern human society is a complex system composed of strongly connected societies that are all susceptible to rare events. It is also critical to consider the negative correlations between climate and local or regional socioeconomic conditions (i.e., inadequate sanitary conditions and poor nutritional conditions) that could easily counteract any potential climatic effect at local scales, such as lower survival rates of viruses exposed to high temperatures and high UV irradiation (Duan et al. 2003 , Wang et al. 2020 . Tropical regions will experience mild climate conditions in a couple of months. Thus, regardless of the influence of local environmental conditions, tropical countries could still expect high contagious rates. Finally, climatic suitability models might be ephemeral for very mathematized modelling fields of science such as epidemiology and virology that developed over time very realistic models that enables the possibility of learning with parameters of similar viruses (i.e. SARS) that can definitely help and instruct decision makers to take actions before it is too late. Here we showed that countries' importance in the global transportation network is the only variable with a significant association with COVID-19 growth rates in its exponential phase. Our results reinforce board control measures in international airports (see Bitar et al. 2009 , Nishiura & Kamiya 2011 ) during very early stages of pandemics to prevent secondary transmissions that could lead to undesired scenarios of rapid synchronically spread of infectious diseases in different countries. The rapid international spread of the severe acute respiratory syndrome (SARS) from 2002 to . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10. 1101 2003 led to extensively assessing entry screening measures at international borders of some countries (Bell et al. 2003 , John et al. 2005 . The 2019-2020 world spread of COVID-19 highlights that improvements and testing of board control measures (i.e. screening associated with fast testing and quarantine of infected travellers) might be a cheap solution for humanity in comparison to health systems breakdowns and unprecedented global economic crises that the spread of infectious disease can cause. However, it is important to note that board control of potentially infected travellers and how to effectively identify them is still a hotly debated topic in epidemiology and there is still no consensus on accurate methodologies for its application ). We do not expect that our results using a macroecological approach at a global scale would have a definitive effect on decision-making in terms of public health in any particular country, province, or city. However, we expect that our analyses show that current claims that growth of COVID-19 pandemics may be lower in developing tropical countries should be taken very carefully, at risk to disturb well-established and effective policy of social isolation that may help to avoid higher mortality rates due to collapse in national health systems. Annual Review Border screening for SARS The effect of human mobility and control measures on the COVID-19 epidemic in China Influenza virus transmission is dependent on relative humidity and temperature Spatial analysis improves species distribution modelling during range expansion Super-Spreader Identification Using Meta-Centrality Pathogeography: leveraging the biogeography of human infectious diseases for global health management The distance decay of similarity in biogeography and ecology Fever screening during the influenza (H1N1-2009) pandemic at Narita International Airport Virus evolution and transmission in an ever more connected world The decay of similarity with geographical distance in parasite communities of vertebrate hosts Temperature and Latitude Analysis to Predict Potential Spread and Seasonality for COVID-19 Absolute humidity and the seasonal onset of influenza in the continental United States Applications of Infrared Thermography for Noncontact and Noninvasive Mass Screening of Febrile International Travelers at Airport Quarantine Stations An initial investigation of the association between the SARS outbreak and weather: With the view of the environmental temperature and its variation Increasing airline travel may facilitate cocirculation of multiple dengue virus serotypes in Asia Stability of middle east respiratory syndrome coronavirus (MERS-CoV) under different environmental conditions High Temperature and High Humidity Reduce the Transmission of COVID-19 This paper was developed in the context of the human macroecology project on the National Institute of Science and Technology (INCT) in Ecology, Evolution and Biodiversity Conservation, supported by CNPq and FAPEG. JAFDF, RD, LCT are also supported by CNPq productivity scholarships. We thank Thiago F. Rangel for his constructive comments on early version of the manuscript.