key: cord-291837-qz4g4v1u authors: Livadiotis, George title: Statistical analysis of the impact of environmental temperature on the exponential growth rate of cases infected by COVID-19 date: 2020-04-24 journal: nan DOI: 10.1101/2020.04.21.20072405 sha: doc_id: 291837 cord_uid: qz4g4v1u We perform a statistical analysis for understanding the effect of the environmental temperature on the exponential growth rate of the cases infected by COVID-19 for US and Italian regions. In particular, we analyze the datasets of regional infected cases, derive the growth rates for regions characterized by readable exponential growth phase in their evolution spread curve and plot them against the environmental temperatures averaged within the same regions, derive the relationship between temperature and growth rate, and evaluate its statistical confidence. The results clearly support the first reported statistically significant relationship of negative correlation between the average environmental temperature and exponential growth rates of the infected cases. The critical temperature, which eliminates the exponential growth, and thus the COVID-19 spread in US regions, is estimated to be Tc = 86.1 ± 4.3 F. The daily number of new cases infected by COVID-19 is currently exponentially growing for most countries affected by the virus. However, this exponential growth rate varies significantly for different regions over the globe. It is urgent and timely to understand the reasons behind this regional variation of the exponential growth rates. Little information is known about this matter, while there are indications that the environmental temperature may be a factor; for instance, northern and colder US and Italian regions experienced much more incidents than others. Typically, the evolution curve of the spread of the coronavirus initiates with a pre-exponential phase characterized by a mild logarithmic growth, followed by the outburst phase of the exponential growth. Social-distancing measures against the spread may affect the evolution curve in a way that the exponential growth slows down and starts to decay, depending though on the effectiveness and applicability of these measures. However, after the decay of the spread at some place, new infected cases may outburst in other places, marked with insignificant number of cases until that moment. Then, a newly growth phase may appear. For example, Fig.1 (left) shows the evolution curve of spread for the infected cases in mainland China; clearly, we observe the whole growth−decay cycle, as well as, a new re-growth phase. Super-strict measures, such as complete shut down and quarantines, can successfully lead to the deceleration of the exponential growth of infected cases (Giugliano, 2020) . Unfortunately, they cannot be successfully applied and followed within vast regions, and especially, for a long and indefinite period of time. Inevitably, measures may be loosened during the decay phase, leading to the birth of an equally disastrous re-growth phase. US & Italy (right); phases (color-coded): pre-exponential (pre-exp), exponential (exp) growth, decelerated growth, decay, and possibly, a re-growth. Day t=1 corresponds to 1/15/2020 for China, 2/20/2020 for Italy, 2/27/2020 for US. Evolution in China cases follows the whole growth-decay cycle, and a new re-growth phase. Italian cases are characterized by a milder exponential rate, entered the phase of decelerated growth on March 12. US suffers with a larger exponential rate, and it is not clear whether has entered the decelerated growth phase. The exponential growth rate for China rose as high as λ=0.169±0.017, while for Italy and US the rates were λ=0.090±0.004 and 0.121±0.003, respectively (with correlation coefficient > 0.99). The exponential growth is the most effective phase for the evolution curve of infected cases; and the most important question regarding this evolution is still open (Black et al. 2020) : What can influence the exponential growth rate, and thus, "flatten the curve"? Measures, strict or not, may affect the evolution of new infected cases, by shifting the spread curve from the exponential to the decelerated growth. It should be noted though that measures do not affect the exponential growth rate itself, but only the period of time that this exponential phase applies. Then, what factors do affect the exponential growth rate? The age distribution in the place where the outburst occurs is unlikely to be a factor; indeed, the number of new cases is known to be positively correlated with age, however, the exponential growth rate (China: 0.169; US: 0.121; Italy: 0.090 -decreasing rate) appears to be negatively correlated to the age median of these countries (China: 37.4; US: 38.1; Italy: 45.5 -increasing age); hence, the age is likely irrelevant to the rate variations. In addition, culture in social activities may be a factor; for example, this might be contributing in the observed differences among the exponential rates in the cases of China, Italy, and US ( Fig.1) . However, what is causing the major variation of exponential rates among different regions of the same culture? It is apparent that culture does not constitute the main factor influencing the exponential rate. Figure 2 shows the regional variation of infected cases (left) and average winter temperature (right) in Italy. The possible negative correlation, observed between regional number of infected cases and winter temperature in Italy, is an indication of the influence of temperature on the exponential growth, but it certainly does not constitute a necessary condition. The reason is that the map plots the total number of the infected cases N t , which does not depend only on the exponential rate λ, but also on the initial number of cases N 0 . It is generally accepted that the initial infected cases in Italy were travelled directly from China; since some destinations are more favorable than others, then, the initial number of cases N 0 , as well as the current number of cases N t (which is proportional to N 0 ), should be subject of regional variation. Therefore, there is a non-negligible possibility, the observed regional variation of the number of infected cases N t to be caused by the regional distribution of the initial cases N 0 . In such a case, main airport cities would have incredibly high number of infected cases outplaying a possible negative correlation of daily infected cases with regional average temperature T; the latter may be one of the reasons of the high numbers of cases observed in New York City and Rome. On the other hand, in their letter to the White House, members of a National Academy of Sciences committee said that "There is some evidence to suggest that [coronavirus] may transmit less efficiently in environments with higher ambient temperature and humidity; however, given the lack of host immunity globally, this reduction in transmission efficiency may not lead to a significant reduction in disease spread without the concomitant adoption of major public health interventions" (Relman, 2020) . Nevertheless, it has to be stressed out that there were no statistical analyses focused on the exponential growth rates of the infected cases in regions with different temperatures. For instance, several authors (e.g., Pawar et al., 2020; Yao et al., 2020) found insignificant correlations between temperatures and confirmed cases. However, their analysis was performed on the number of the infected cases N t , which is subject to the randomness of the initial cases N 0 as explained above, and not to the exponential growth rate λ, which is dependent on physical characteristics of the coronavirus, binding protein, and environment. Analysis of regional cases can show whether the speculated negative correlation between temperature and number of infected cases is true, meaning a negative correlation between temperature and exponential growth rate. If the environmental temperature plays indeed a substantial role on the virus spread, then, this can provide promising results, such as, the estimation of the critical temperature that may eliminate the number of daily new cases in heavily infected regions. The purpose of this paper is to improve understanding of the effect of environmental temperature on the spread of COVID-19 and its exponential growth rate. Then, we calculate the exponential growth rates of infected cases for US and Italian regions, derive the relationship of these rates with the environmental temperature, evaluate its statistical confidence, and determine the critical temperature that eliminates this rate. A standard model for describing the evolution of the infected cases by viruses is given by ( ) ( ) ; N t is the number of total infected cases evolved from the initial N 0 cases, N max is the maximum possible number of infected cases; λ is the exponential growth rate, and becomes clear for x t <<1, where I is negligible, leading to: The function of negative feedback I models factors that flattens the curve, such as, the measures taken against spreading. While these factors are not affecting the exponential growth rate λ, they become more effective as the number of cases increases, getting closer to N max ; exponent b controls the effectiveness of these factors; strict {loose} measures correspond to smaller {larger} values of b. Figure 3 shows the evolution curve of the number of new (ΔN t = N t+1 −N t ) and total infected cases (N t ) and how this curve flattens for stricter measures (smaller values of b). As observed in Fig.3(b) , stricter measures, nicely modeled by decreasing b, do not affect the exponential rate λ but they successfully flatten the curve. However, the same can be achieved by downgrading the exponential growth rate. It is apparent, then, how much useful would it be to know the factors that can flatten the curve by decreasing directly the exponential rate. Applied measures could be loosen and shorter! Model (1) originates from the logistic map family (e.g., Livadiotis, 2005; and references therein; Wu et al. 2020) ; other complicate versions, such as, the Susceptible-Infectious-Recovered models (e.g., Ciarochi, 2020) may be expressed by multi-dimensional differential or difference equations (e.g., Elaydi, 2005, and references therein; Livadiotis et al. 2016 ), but still, the curve flattening is governed by the same features. The two composites, the exponential growth E and the negative feedback I, are just the main and necessary conditions for reproducing the growth-decay phases of the spread curve. Their interplay shows how the spread curve can be flattened as a result of stricter measures, independently of the existent exponential rate. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 24, 2020 . . https://doi.org/10.1101 What are the main factors that can affect the exponential growth rate λ of COVID-19 spread? The rate λ is expected to have positive correlation with the reproduction number R 0 (e.g., proportional to its logarithm), and negative correlation with the incubation period τ (e.g., inverse proportional) (Milligan and Barrett, 2015) . The number R 0 is a measure of how contagious a disease is; it provides the average number of people in a susceptible population that a single infected person will spread the disease to over the course of their infection (Ciarochi, 2020) , and depends on the physical characteristics of coronavirus (Hao 2020) . The incubation period τ is the time elapsed between exposure to coronavirus and first symptoms; during this period, an infected individual cannot infect others; other characteristic periods and time intervals are the latent period between exposure and infection, and the generation time, mostly concerned with transmission process (Nishiura, 2010) . Characteristic values for COVID-19 are τ~5-6 days and R 0~2 -4 (Chen, 2020) . The rate expression can be written as , and involves all the physical characteristics of the mechanisms of infection and the environmental interactions; this can be easily derived, considering difference equations (that is, iterated discrete maps) (e.g., see: Livadiotis and Elaydi, 2012; Kwessi et al., 2018; Dayeh et al, 2018) . Setting the time to be given in discrete τ-steps, then, which can be written in terms of Eq. (2), where the exponential rate is given by: The main factors that can affect the exponential rate λ are: (a) culture in social activities, and (b) environmental temperature and/or other thermodynamic parameters. Intense cultural and social activities have reasonably a positive correlation with R 0 . As previously mentioned, measures against the virus spread do not effectively influence the exponential growth rate; e.g., they do not change the culture in social activities, which are characteristics of the particular population, but they can just cease these activities for some period of time. On the other hand, the environmental temperature T can affect all the parameters influencing exponential rate. We approach this dependence by (i) a linear approximation of the phenomenological relationship between exponential rate and temperature, and (ii) the connection of reproduction number with Arrhenius behavior (with negative activation energy): (i) The temperature can affect the physical properties of coronavirus, such as, the incubation time τ, as well as, the reproduction number R 0 that depends on these physical properties (Hao, 2020) . A linear approximation absorbs the (weak) temperature dependence of any parameters involved in the exponential rate; then, Eq.(5) gives: where we set the intercept to be given in normal conditions of atmospheric temperature and pressure (NTP) (that is, T=20 C 0 , P= 1atm). Then, we rewrite the exponential rate as: (ii) Coronavirus uses their major surface spike protein to bind on a receptor -another protein that acts like a doorway into a human cell (Wrapp et al., 2020) . The whole process is a slow chemical reaction, where . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 24, 2020. . https://doi.org/10.1101/2020.04.21.20072405 doi: medRxiv preprint the mechanism behind can lead to rates negatively correlated with temperature, i.e., increasing rate with decreasing temperature. This is consistent to reaction rate expressed by the Arrhenius exponential with and Benson, 1984) . Then, the effective reproduction number 0 ( ) R T is expressed as a product combining the reproduction number in the absence of temperature effect, 0 R  , and the Arrhenius exponential rate, namely, Then, Eq. (5) gives We rewrite this expression as: Reactions of negative activation energy are barrier-less, relying on the capture of the molecules in a potential well. Increasing {decreasing} the temperature leads to a reduced {gained} probability of the colliding molecules capturing one another. Exponential spread is mainly related to outdoors activities while the decelerated growth caused by effective measures is related to indoors activities. Therefore, as long as the exponential growth takes place, the environmental temperature has an effective role on the chemical reaction between virus and spike protein. Due to the negative activation energy, decreasing the environmental temperature reduces the probability of virus-protein reaction, thus the virus may stay inactive on air or surfaces and eventually die. It should be noted that both the models (7b) and (10b) consider that the exponential rate λ, or the reproduction number R 0 , are subjects to a component influenced by the culture in social activities (intercept λ 0 ) and a component mostly influenced by the temperature (slope p 2 ). In this way, the slope may indicate to some universal quantity involved, such as, the (negative) activation energy. Next, we employ the above two expressions of exponential rate λ and temperature T, Eqs.(7a,10a), in order to set the two types of statistical models for fitting (Τ, λ) measurements for US and Italian regions. We use publicly available datasets of: (1) average enviromental temperature of US and Italian regions (e.g., see: www.ncdc.noaa.gov/data-access/land-based-station-data/land-based-datasets/climate-normals; it.climate-data.org; www.weather-atlas.com); (2) time series of the number of daily infected cases of US and Italian regions (e.g., see: www.thelancet.com; www.protezionecivile.gov.it). We analyze the datasets of regional infected cases in US and Italy, derive the relationship of the exponential growth rate of the number of cases with temperature, and evaluate its statistical confidence. First, we derive the exponential growth rates of the infected cases characterizing each examined region of Italy and US; then, we plot these values against the environmental temperatures of each region, and perform the corresponding statistical analysis. We proceed according to the following steps: (i) Collect the time series of the current infected cases N t for all US and Italian regions. For each of the US and Italian regions, we plot log(N t ) and log(ΔN t ) with time t, detect the time intervals of linear relationship corresponding to the phase of exponential growth, fit the datapoints within this region, and derive the slope (on linear-log scale), that is, the exponential growth rate λ. The total N t and new cases ΔN t should be characterized by the same exponential . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 24, 2020. . https://doi.org/10. 1101 rate, λ, thus the slopes resulted from the linear fits of log(N t ) and log(ΔN t ) with time are (weighted) averaged (Fig.4) . (iii) Collect environmental temperature data and calculate the temperature averaged over the whole examined region. The incubation period τ is longer than the time scale of a single day or night, thus the temperature is averaged over the daily and nightly measurements. (iv) Co-plot all the derived sample values (Τ±δΤ, λ±δλ), where each pair corresponds to each examined region; then, apply a linear fitting in order to derive the linear relationship between T and λ, as well as evaluate the statistical confidence of this relationship; repeat the same for all US and Italian regions. Determine the critical temperature T C for which the rate becomes negligible; to eliminate the uncertainties of T C as a fitting parameter, we perform the linear fitting with the statistical model Repeat (iv) and (v) with pairs of (T -1 ±δT -1 , λ±δλ); we estimate again T C by performing the linear fitting with the statistical model λ=λ 0 (-1+Τ C /T) instead of λ=p 1 +p 2 •T -1 . Linear fitting of the number of the total N t and new ΔN t infected cases with time (on linear-log scale) for the states of California and Illinois, where the slope reads the exponential rate λ. The resulted rates from the linear fitting of log(N t ) (black) and log(ΔN t ) (red) are (weighted) averaged (blue). The phases are color-coded as in Figure 1 . The hypothesis to be tested is that the exponential growth rate λ varies linearly with temperature; (x is set to be the temperature or its inverse). This is tested by examining the chi-square corresponding to the fitting of the two-parameter linear statistical model H is the α-th diagonal element of its . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 24, 2020. , α=1,2, where the derivatives are numerically derived. We will use two linear statistical models, (a) , as given by Eqs.(7a,10a) ; both can be written with the linear expression: The statistical confidence of the dependence of the exponential growth rate on the environmental average temperature may be sufficiently high for the acceptance of any of the two statistical models. The goodness of the fitting of each model is evaluated using two types of statistical tests, the "reduced chi-square", the "p-value of the extremes", and their combination (e.g., Frisch et al., 2013; Schwadron et al., 2013 , Fuselier et al., 2014 , while Student's t-test is also used for evaluating the statistical confidence of the derived slopes: -Reduced Chi-Square: The goodness of a fit is estimated by the reduced chi-square value, 2 -Student's t-test: This is another test for evaluating the statistical confidence of the slope derived from the linear fitting of the temperature-rate sample points (T i ±δT i , λ i ±δλ i ) and (T i -1 ±δT i -1 , λ i ±δλ i ). We examine, whether the slope p 2 ±δp 2 has significant difference from the zero slope (null hypothesis: slope is zero), by performing the Student's t-test with t m =p 2 /δp 2 , where the corresponding p-value is derived 9 from the integration of t-distribution . The Student's t-test is not passed for the null hypothesis that they examined slope equals zero, when the corresponding p t -value is less than the acceptable confidence limit of 0.05; then, the null hypothesis is rejected, meaning the slope has statistically significant difference for zero. In addition, we examine the slopes estimated for US with those estimated for Italian regions, by deriving The linear fitting of log(N t ) or log(ΔN t ) with time t within the region of exponential growth phase, resulted to the respective rates (given by the fitted slope); their weighted averages are shown in Table 1 for US and in Table 2 for Italian regions, while plotted against the average regional temperature in Figs. 5 and 6, respectively. The method of weighted fitting for double uncertainties (x i ±δx i , λ i ±δλ i ), as described by Fasano and Vio, 1988) , is used for estimating the fitting parameters λ 0 , T C , together with their statistical, propagation, and total errors. The fits of the linear statistical model with temperature, x i =T i , (left panels in Figs. 5, 6), as well as of the alternative statistical model with inverse temperature, (right panels in Figs. 5, 6) , are both characterized with high statistical confidence, attaining high p-values (>0.05) and reduced chi-squares 2 red  values (close to 1); also, both fits provide similar estimations of T C . The fitting results are shown in Table 3 . We also examine whether the sample points (T i ±δT i , λ i ±δλ i ) are subject to statistically significant concentrations or rarefactions, namely, whether possible heterogeneities within the distribution of sample points plays significant role in the fitted relationship. For this, we derive the temperature-rate relationship and its statistical confidence by fitting the homogenized set of sample points, instead of the raw sample points; then, we examine whether the fitting parameters differ from those derived from fitting the raw sample points. We homogenize the sample points by grouping them in temperature binning of ΔT ~ 1 C 0 (e.g., see: Livadiotis & Desai 2016). We estimate the weighted mean and error of the rates included in each bin. In the case of US regions we also performed a homogenization of rates, by grouping the temperaturebinned means in rate binning of Δλ ~ 0.01 d -1 . In the case of sample points with inverse temperatures, (T i -1 ±δT i -1 , λ i ±δλ i ), the procedure is exactly the same. Homogenized datasets result in a smooth relationship between the values of binned temperature and rate, as it can be observed in the plots of rate against temperature or inverse temperature (left or right lower panels, respectively), and for both US and Italy regions (Figs. 5 and 6, respectively) . The results are highly supportive of the negative correlation between rate and temperature. The results are shown in Table 4 . We observe that the linear relationships of the growth rate with temperature or inverse temperature are characterized by high statistical confidence for the homogenized datasets (p-values much higher than significant limit of 0.05; 2 red  far from the significant limits of 0.1 and 10). Therefore, the arrangement of sample points do not affect significantly the fitting results. In addition, as shown in Tables 3 and 4 , the linear fits of sample points (T i ±δT i , λ i ±δλ i ) and (T i -1 ±δT i -1 , λ i ±δλ i ) do not pass the Student's t-test for the null hypothesis that their slopes equals zero, i.e., the corresponding p t -values are less than the acceptable confidence limit of 0.05; therefore, the negative correlation of environmental temperature with the exponential rate is statistically significant (accepted with confidence 95%). . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 24, 2020 . . https://doi.org/10.1101 (1) The exponential growth rate and its uncertainty is the weighted averaging of the rates derived from total and new infected cases; (2) the environmental temperature is averaged over the time period, from τ~5 days before the appearance of the 1 st case, to 1 st April; (3) the standard deviation of temperature is given by the half difference between highest and lowest values within the examined time period, divided by √2 (similar to the standard deviation for a sinusoidal function); (4) NY: The temperature refers to the New York City, instead of the whole state, which suffers from the vast majority of the state infected cases. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 24, 2020. . https://doi.org/10.1101/2020.04.21.20072405 doi: medRxiv preprint Figure 5 . Linear fitting of rates with (left) temperatures and (right) inverse temperatures for US regions. The fitting is weighted with double uncertainties (on both the temperature and rate values). The analysis is first completed for the raw measurements (upper) and then repeated for the binned averages (lower). . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 24, 2020. . https://doi.org/10.1101/2020.04.21.20072405 doi: medRxiv preprint Figure 6 . As in Figure 5 , but for Italian regions. Table 3 . . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 24, 2020. . https://doi.org/10. 1101 In order to improve the statistics of the estimated critical temperature, we combine the sample points (T i ±δT i , λ i ±δλ i ) of US and Italian regions. First, we perform the Student's t-test to compare the slopes corresponding to US and Italian regions; we find high p t -values (>0.05) for both fits of x=T and x=T -1 , thus, the two populations are likely characterized by the same slope. Tthe respective intercept λ 0 does not pass the same test, i.e., the intercepts corresponding to US and Italian regions are likely different. A universality may characterize the slopes of fits x=T or x=T -1 , i.e., Next, we perform the linear fits of the sample points (T i ±δT i , λ i ±δλ i ) and (T i -1 ±δT i -1 , λ i ±δλ i ) for the mixed set of US and Italian data, once the rates of the Italian regions are shifted by Δλ; the optimal fitting is obtained for that shift Δλ, for which the reduced chi-square is ~1, the p-value of the extremes is ~0.5, and the combined measure ~0 (see previous section). Figure 7 shows how the combined datasets of temperaturerates from US and Italian regions lead to the optimal fitting. (Note that the optimization is not performed for the binned datasets, since they are characterized by smaller p-values -see, Figs. 5 and 6). The results are shown in Table 5 ; we observe that the optimization is reached for two values of the shift Δλ; we estimate the weighted average of the results corresponding toe the two shifts. The weighted mean is performed separately for the fitting cases of x=T and x=T -1 ; however, the weighted mean of the critical temperature is performed for all four results. Fitting of datasets combined for US and Italian regions, with the latter's rates shifted by Δλ. The optimal fitting corresponds to shifts Δλ~ 0.031 and ~0.058, for which the reduced chi-square is ~1, the pvalue of the extremes is ~0.5, and the combined measure ~0. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 24, 2020 . . https://doi.org/10.1101 Figure 8 . Relationship of the reproduction number R 0 and its uncertainty with environmental temperature T. According to this, new affected cases cease (R 0 =1) when temperature climbs to T C~3 0 C 0 or (~86 F 0 ). Up-to-date there is no systematic statistical analysis of the effect of the environmental temperature T (and possibly other weather parameters) on the exponential growth rate of the cases infected by COVID-19, while a statistically confident relationship between temperature and growth rate (either with positive or negative correlation) is still unknown. The presented analysis led to a statistically confident relationship of negative correlation between the exponential growth rate and the average environmental temperature, derived for US and Italian regions. In particular, we analyzed datasets of regional infected cases in US and Italy, derived the exponential growth rates for each of these regions and plotted them against environmental temperatures averaged within the same regions, derived the relationship of temperature -growth rate, and evaluated its statistical confidence. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted April 24, 2020. . https://doi.org/10. 1101 The performed statistical analysis involved fitting of linear statistical models with the datasets of environmental temperature (or its inverse) and exponential growth rate, finding their relationship, and evaluating its statistical confidence. The two linear models developed and used for the statistical analysis are (a) The statistical confidence of fitting was evaluated using the reduced chi-square values, the p-value of extremes, and a testing measure that combines both of these values. Also, the statistical confidence was also evaluated using the Student's t-test, where the derived slopes compared to a hypothetical zero slope. The sample points of temperature (or inverse temperature) and exponential growth rate were also tested for statistically significant concentrations or rarefactions; namely, for possible heterogeneities within the distribution of sample points that could have significant role in the results. The statistical analysis of the homogenized temperature-rate data points concluded that the negative correlation between temperature and exponential rate is stable, having no statistically significant variability due to concentrations or rarefactions, and it is characterized by a high statistical confidence. We also performed a Student's t-test and ensured that the difference between two sample means of US and Italian regions is not statistically significant. A possible universality may characterize the slope of the temperature-rate relationship. This verifies the modeling developed and used by this analysis, where the exponential rate λ, or the reproduction number R 0 , are subjects to a component influenced by the culture in social activities (intercept λ 0 ) and a component influenced by the temperature (slope p 2 ). In this way, the slope may indicate to a universal quantity involved, such as, the (negative) activation energy. Since the derived slopes for US and Italian regions are characterized by no statistically confident difference, we improved the statistics of the estimated fitting parameters by combined the sample points of US and Italian regions. From the derived relationship, among others, we were able to derive the values of the (negative) activation energy E a , and the reproduction number R 0 at normal conditions and how this depends on temperature. Therefore, the results clearly showed that there is indeed statistically significant negative correlation of temperature on the exponential growth rate of the cases infected by COVID-19. Figure 9 shows the anticorrelation between the mapped exponential rates and average environmental temperature of the USA regions examined by this analysis, which they are characterized by readable exponential growth phase in their evolution spread curve. Figure 9 . Anti-correlation between the spatial distributions of the exponential growth rates of the infected cases (left) and the average environmental temperature (right). . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted April 24, 2020 . . https://doi.org/10.1101 Given the negative correlation of the environmental temperature with the exponential growth rate, it was reasonable to ask for the critical temperature that eliminates the exponential rate, and thus the number of daily new cases in infected regions. This was found to be T C~8 6.1 ± 4.3 F 0 for US regions. It is straightforward to ask when the environmental temperature will climb above this critical value. As an example, Figure 10 plots the daily average temperatures in San Antonio, Texas, shown that will be clearly above the estimated T C threshold in the second half of May. Figure 10 . According to the statistically confident relationship between exponential growth rate of infected cases shown in Fig.8 , the critical temperature, which eliminates the exponential growth, and thus the COVID-19 spread, is T C = 86.1 ± 4.3 F 0 . The plot shows also the May-June daily, nightly, and 24h-averaged environmental temperatures in San Antonio, Texas, averaged over the last three years. The daily average temperatures will be clearly above the estimated T C threshold in the second half of May; thus, the plot suggests a possible date for loosening the strict measures in San Antonio, that is, May 24. The resulted high statistical confidence of the negative correlation of the environmental temperature on the exponential growth rate of the cases infected by COVID-19 is certainly encouraging for loosening super-strict social-distancing measures, at least, during the summery high temperatures. However, we are, by no-means, recommending a return-to-work date based only on this study. But we do think that this should be part of the decision, as well as an inspiration for repeating the same analysis in other heavily infected regions. The steps of these analyses may be followed as: (i) Identify different outbreaks in regions with the same culture in social activities and different environmental temperature; (ii) Estimate the exponential growth rates for these regions from the time series of infected cases; (iii) Plot the derived rates against the environmental temperature averaged for these regions, and repeat the analysis of this study to determine the temperature-rate relationship and its statistical confidence. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted April 24, 2020 . . https://doi.org/10.1101 How to flatten the curve of coronavirus, a mathematician explains A mathematical model for simulating the phase-based transmissibility of a novel coronavirus How COVID-19 and other infectious diseases spread: Mathematical modeling A Discrete Mathematical Model for the Aggregation of β-Amyloid An Introduction to Difference Equations Fitting a straight line with errors on both coordinates Decades-long changes of the interstellar wind through our solar system Low energy neutral atoms from the heliosheath Negative activation energies and curved Arrhenius plots. 1. Theory of reactions over potential wells Time variations in the generation time of an infectious disease: implications for sampling to appropriately quantify transmission potential Effects of temperature on COVID-19 transmission Rapid Expert Consultation on SARS-CoV-2 Survival in Relation to Temperature and Humidity and Potential for Seasonality for the COVID-19 Pandemic Solar radiation pressure and local interstellar medium flow parameters from IBEX low energy hydrogen measurements Generalized logistic growth modeling of the COVID-19 outbreak in 29 provinces in China and in the rest of the world No Association of COVID-19 transmission with temperature or UV radiation in Chinese cities