key: cord-0923849-ikrmwb3u authors: Oulhaj, A.; Ahmed, L. A.; Prattes, J.; Suliman, A.; Al Suwaidi, A.; Al-Rifai, R. H.; Sourij, H.; Van Keilegom, I. title: The competing risk between in-hospital mortality and recovery: A pitfall in COVID-19 survival analysis research date: 2020-07-14 journal: nan DOI: 10.1101/2020.07.11.20151472 sha: 2bd4ed96ddc2db7c631d3331c6b31cf3cd27b9ce doc_id: 923849 cord_uid: ikrmwb3u Background Many studies investigating mortality and recovery in COVID-19 have been published recently. The majority of these studies used the standard Cox Proportional Hazards (Cox PH) model without taking into account the presence of competing risks. This study investigates, through extensive simulations, the bias in estimating the hazard ratio (HR) of death due to COVID-19 and the absolute risk reduction (ARR) when competing risks are not taken into consideration, and suggests an alternative method. Methods We simulated data for a fictive clinical trial in COVID-19 patients, to mimic recent trials involving the use of Hydroxychloroquine, Remdesivir, and convalescent plasma therapy for example. The primary outcome is the time from randomization until death due to COVID-19. Six scenarios representing different situations of the effect of treatment on death and its competing event recovery were considered. The HR of death and the 28-day ARR were estimated using the Cox PH model and the Fine and Gray (FG) model which takes competing risks into account. The estimates were then compared to their corresponding true values and the magnitude of misestimation quantified. Results The Cox PH model misestimated the true HR of death in the majority of the scenarios. The magnitude of this misestimation increased when the process of recovery was faster and/or the chance of recovery was higher. In some cases, this model has also incorrectly shown a harmful effect of treatment when it was in fact beneficial. The true 28-day ARR of death was also misestimated, and this misestimation increased in magnitude when the process of recovery was faster. The results obtained from the FG competing risks model are all consistent and show no misestimation or changes in direction for both the HR and the 28-day ARR of death. Conclusion There is a substantial risk of misleading results in COVID-19 research if recovery and death are not considered as competing risk events. We strongly suggest the use of a competing risk approach to re-analyze relevant published data that have used the Cox PH model. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the cause of the pandemic Coronavirus Disease 2019 (COVID-19), has affected nearly 12.0 million people in over 190 countries and caused more than 546,000 deaths, as of July 07, 2020 1 . Numerous retrospective or prospective cohort studies and randomized clinical trials (RCTs) were conducted to evaluate different therapeutic interventions [2] [3] [4] [5] [6] [7] . The results of these studies have not only impacted treatment strategies in COVID-19 patients, but have also influenced health-policy decision making including continuation, modification, or termination of the use of some of the studied drugs. The recent RCTs involving the use of Hydroxychloroquine, Remdesivir and convalescent plasma therapy are some examples to cite 3 5 7 The statistical methods commonly used in these studies to investigate primary outcomes such as in-hospital mortality or recovery are based on the standard Cox proportional hazards model (Cox PH) and the Kaplan-Meier estimator 8 9 . When in-hospital mortality due to COVID-19 is the outcome of interest, these two methods implicitly treat recovered patients as right-censored. Similarly, when recovery from COVID-19 is the outcome of interest, these two methods consider patients who died as right-censored. This way of proceeding is not appropriate since it implies that patients who recovered (respectively died) have similar risk of death (respectively recovery) compared to those still at risk (i.e. still hospitalised). In fact, death and recovery, investigated in several of the previously reported studies on COVID-19 are mutually exclusive competing events, and therefore, recovery (respectively death) should be considered as a competing risk for death (respectively recovery) rather than right-censored. A competing risk is an event whose occurrence precludes the occurrence of the event of interest 10 . Ignoring the competing risk will usually lead All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint to estimates of hazard ratios (HR) and absolute risks that are largely biased 11 12 (mis-estimation of the true effect estimate and true risk), and eventually misleading conclusions. Note that if we would be interested in overall death from any disease (as opposed to death due to COVID-19), then a competing risk approach would not be necessary since recovery is in that case not a competing event but is rather censoring the event of interest. However, in clinical trials on COVID-19 or other diseases, the important question for patients and regulators is about which treatment regimens help in preventing death due to that particular disease of interest. Hence, death due to COVID-19 is the outcome of interest in COVID-19 studies rather than overall death. Despite the abundance of statistical papers recommending the use of survival models that account for competing risks 13 , these models are still underused in many medical studies especially in COVID-19 related research. The main objective of this study is to investigate, through extensive statistical simulations, the bias that occurs when estimating the HR and other quantities of interest such as the absolute risk reduction (ARR) using the Cox PH model in the presence of competing risks. The Fine and Gray (FG) modeln 14 which takes competing risks into account is also used as an alternative to the Cox PH model. In this simulation, in-hospital mortality due to COVID-19 was considered as the outcome of interest and recovery as its competing risk. The magnitude of the bias is also investigated as a function of the chance of recovery and time to recovery. We simulated data for a fictive clinical trial in COVID-19 patients where a given treatment is compared to placebo. In this simulation, 10,000 patients were randomly assigned to receive either the treatment or a placebo in a 1:1 ratio. We have chosen such a big sample size in order to discard any justification related to small sample sizes. In this simulated data, patients are entered in the All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint study at the date of randomization and are followed-up for a maximum length of stay in the hospital up to 50 days. During the 50-days follow-up, patients can either die, recover from COVID-19, withdraw from the study, are lost to follow-up or reach the end of study with no event. The primary outcome is the time from randomization until death due to COVID-19. Recovery during the follow up period is considered as a competing event (since eventually a patient will either die or recover from COVID-19), while time to loss to follow-up, withdrawal or reaching the end of the study with no event are considered as right censoring. Note that in doing so, we ignore cases where a patient dies from COVID-19 after he has been recovered. This, if it happens, is in fact very rare and would not affect the current results. The main quantities of interest to be estimated and investigated are a) the hazard ratio (HR) of the primary outcome defined as the hazard of death due to COVID-19 in treated patients divided by the hazard of death in the placebo group, and b) the 28-day ARR of in-hospital mortality defined as the risk of death due to COVID-19 within 28 days in the treated group minus the risk of death within 28 days in the placebo group. We simulated the data according to six scenarios as described in table 1. The scenarios represent different situations of the treatment effect on the primary outcome (Death) and its competing event (Recovery). Scenario 6, for instance, represents the situation where the treatment has an effect on both death and recovery but the effect on recovery is higher than the one on death. For each scenario, 1,000 samples each of size 10,000 patients (5,000 treated and 5,000 placebo) were generated from different data generating processes (DGP). More specifically, the times to death in each sample were generated from a proportional hazards model with baseline hazard coming from All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint a mixture of a point mass at infinity and an exponential variable truncated at 50 days, and the finite recovery times were generated from another proportional hazards model, again truncated at 50 days. Time to right-censoring (i.e. time to loss to follow-up, withdrawal or reaching the end of the study with no event) were generated from a mixture of a uniform distribution and a point mass distribution at 50 days. This distribution for right-censoring was chosen in a way that fewer patients withdraw during the first days of the trial and others to remain alive by the end of the study at day 50. True effect on death (True hazard ratio) True effect on recovery (True hazard ratio) 1 No effect of treatment on both death and recovery HR_d = 1 HR_r = 1 2 The treatment has moderate effect on recovery but no effect on death. The treatment has high effect on recovery but no effect on death. The treatment has moderate effect on death but no effect on recovery We also allowed the chance of recovery and the median time to recovery within each scenario to vary leading to different DGPs. More specifically, the chance of recovery in the placebo group was set to be 5%, 80%, 90% and 95%. The first value of 5%, even unrealistic, was chosen to mimic the situation where the competing event (i.e. recovery) is rare. Other values chosen for the chance of recovery match the current incidence of recovery observed in real (clinical trials and cohort studies) research around the globe. Values reflecting how fast patients recover from COVID-19 expressed in terms of median time to recovery were chosen to vary between 5 to 20 days. Finally, the hazard parameter for death was chosen to be 0.05. All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint Within each scenario and for each of the 1,000 generated samples, we estimated the HR and the 28-days ARR of in-hospital mortality using two models: the standard Cox PH and the FG competing risks model. The Cox PH model does not take recovery as a competing risk into account, while the FG model considers recovery as competing risk. The estimated quantities (HR and 28-days ARR) were then averaged across the 1,000 generated samples and compared to their corresponding true values. Patients who withdrew, who were lost to follow-up or who were still hospitalised at the end of the study were considered as right-censored in both Cox PH and FG models. All the simulations were carried out using the R software version 3.6.1 15 . the estimated HR obtained from the Cox PH model were mis-estimated (i.e. usually different from the true HR). For instance, in scenario 3 where the treatment has no effect on death but high effect on recovery, the HR estimated from the Cox PH model when the median time to recovery is around 10 days is 1.6 compared to 1 (the true HR) showing an over-estimation of 60% (Figure 1) . Furthermore, the magnitude of this over-estimation increases when the recovery process is quicker (i.e. shorter median time to recovery). As the time to recovery increases, the over-estimation in the All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint HR from the Cox PH model decreases. This over-estimation disappears when the median time to recovery is very long, i.e. at the end of follow up when recovery is no longer a competing event. The results obtained from the FG model are, however, consistent in all scenarios and show no misestimation (Figure 1) . increases. For instance, in scenario 3 where the drug has no effect on death but high effect on All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint recovery, and when the chance of recovery is around 90%, the HR estimated from the Cox PH model was around 2.1 compared to 1 (the true HR) showing a harmful effect of treatment with an over-estimation of 110% (Figure 2 ). More interestingly, the estimated HR from the Cox PH model are not only over-estimating the true HR, but also incorrectly showing that the treatment is harmful when in fact it is beneficial. This is demonstrated in scenario 6 where treatment has a beneficial effect on both death and recovery but the effect on recovery is higher than the effect on death. In this scenario, the true HR for death is 0.85 indicating a beneficial effect of treatment (a reduction in the hazard of death of 15%). However, the estimated HRs from Cox PH model were 2.2, 1.83 and 1.55 when the chance of recovery were assumed to be 95%, 90% and 80%, respectively, showing incorrectly an increased risk of death in treated compared to placebo patients ( Figure 2 ). The results obtained from the FG model are however consistent in all scenarios and show no misestimation or incorrect direction of the treatment effects ( Figure 2 ). (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint when the process of recovery is faster). For instance, in scenario 3 where the drug has no effect on death but high effect on recovery, and when the median time to recovery is around 13 days, the 28-day ARR of in-hospital mortality estimated from the Cox PH model was around 12% compared to 0% (the true 28-day ARR). Interestingly, in scenario 6 where treatment has a beneficial effect on both death and recovery but the effect on recovery is higher than the effect on death, the Cox PH models not only over-estimated but also reversed the direction of effect of the 28-day ARR of in-hospital mortality. In this scenario, the true 28-day ARR for death is -2.5% showing a reduction in mortality of 2.5% in treated compared to placebo. However, the 28-day ARR estimated from the Cox PH model were +15%, +9% and +2.5% when the median time to recovery was 5 days, 13 days and 18 days, respectively, showing incorrectly an increased risk of death in treated compared to placebo patients. The results obtained from the FG model are however consistent across all scenarios and show no mis-estimation or incorrect direction of the treatment effects ( Figure 3 ). All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint In scenario 1 (Figure 3 ), the 28-day ARR of in-hospital mortality estimated from the Cox PH model are similar to the true ones. This, however, does not exclude the possibility of a hidden misestimation of the 28-day risk of in-hospital mortality in both treated and placebo patients. As this mis-estimation is similar in both treated and placebo groups, the estimated 28-days ARR defined as the difference between the two quantities will be zero and therefore hides this mis-estimation. This is illustrated in figure 4 where the 28-day risk of death was estimated in both treated and placebo to 47% (when the median time to recovery was assumed to be 10 days) whereas the true risk is around 17% in both groups. Using statistical modelling and simulation of six scenarios mimicking real randomized clinical trials on COVID-19 similar to those involving the use of Hydroxychloroquine, Remdesivir and convalescent plasma therapy, this study discusses the impact of not taking the competing risk between death and recovery into consideration in survival analysis. The study provides clear evidence on how time to recovery and chance of recovery both affect the quality of the HR and the 28-day ARR of death estimated from standard Cox PH model. It also shows that the FG model, that takes competing risks into consideration, performs largely better than the Cox PH model and provides estimates with no bias. Our simulation strongly suggests that ignoring competing risk in survival analysis would affect the quality of the estimated HR and ARR leading to biased estimates. This effect is in particular pronounced if the chance of recovery is high and the time to recovery is short. This is of significant All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint relevance, as in some cases the HR estimated by the standard Cox PH model could lead to the opposite direction as compared to the true HR. As demonstrated e.g. in figure 1, scenario 6 , in studies, where the chance of recovery is high, Cox PH models will display a falsely increased hazard ratio of death (in particular if time to recovery is short), while the true hazard ratio for death is reduced. The use of a standard Cox PH model to investigate the association between a potential treatment and the risk of in-hospital mortality or recovery implies that all other events, apart from the primary outcome, are considered as right-censored observations. More specifically, when hospital discharge or recovery is the primary outcome, considering patients who died as right-censored assumes that they are still at risk of being discharged during the rest of follow up period. Similarly, when in-hospital mortality is the primary outcome, considering patients who recovered at a given time point as right-censored, implicitly assumes that they have similar risk, of dying from COVID-19, as those who are still at risk (i.e. hospitalized) at that time point. Since a considerable proportion of COVID-19 patients are discharged alive (i.e. recovered), the concern of competing risk is inevitably of significant relevance. Looking at currently published randomized controlled trials investigating potential treatment for COVID-19, most of the studies do use standard Cox PH models. Our simulations suggest that it would be helpful to confirm the findings of these trials with a competing risk analysis approach as the true effect of the studied drug, for example on death, may be mis-estimated or even reversed. In the recently published preliminary results from the RECOVERY trial for example, the primary endpoint (28-day mortality rate) in the hydroxychloroquine arm was met in 25.7% and in the placebo arm in 23.5% and lead to an estimated HR of death of 1.11 16 . Considering that the data on time to recovery are missing in this preliminary publication, the true HR for death may be mis-All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint estimated based on our findings. It would therefore be of interest, to take competing risk into consideration for final analysis. Cao et al reported in a trial investigating the effect of liponavir/ritonavir versus standard care in adults hospitalized for severe COVID-19, in which no benefit in terms of 28-day survival and time to clinical improvement of the antiviral treatment was observed 2 . However, the median time from randomisation to discharge was 13 days (IQR 10-16) and the statistical method chosen was a standard Cox PH model. Although there is no guarantee that using a competing risk analysis would have changed the results, the median time to discharge is within a time frame that could have impacted the neutral mortality effect observed in the trial based on our simulations. Horby et al. randomised patients hospitalised for COVID-19 to either dexamethasone or usual care and report in a preprint a reduction in 28-day mortality, a benefit that was mainly observed among those receiving invasive mechanical ventilation or oxygen at randomization 17 . As the discharge rate from hospital within 28 days was 64.6% in the dexamethasone and 61.1% in the usual care group, respectively, again a rate that could have impacted the results. Although the simulation approach in our study was helpful to investigate the interplay of various hazards on death and recovery as well as time to recovery and to suggest a potential impact on currently ongoing COVID-19 research, we appreciate that we did not analyse actual trial data of patients with COVID-19. Hence, it would be critical to apply Cox PH and competing risk analysis approaches to data from already available randomised controlled trials. Our study demonstrates that there is a substantial risk of misleading results in COVID-19 research if recovery and death due to COVID-19 are not considered as competing risk events. Therefore, we strongly suggest the use of competing risk approach (e.g. Fine and Gray regression model) in All rights reserved. No reuse allowed without permission. (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted July 14, 2020. . https://doi.org/10.1101/2020.07.11.20151472 doi: medRxiv preprint A Trial of Lopinavir-Ritonavir in Adults Hospitalized with Severe Covid-19 Remdesivir in adults with severe COVID-19: a randomised, double-blind, placebo-controlled, multicentre trial Observational Study of Hydroxychloroquine in Hospitalized Patients with Covid-19 Effect of Convalescent Plasma Therapy on Time to Clinical Improvement in Patients With Severe and Life-threatening COVID-19: A Randomized Clinical Trial Clinical Characteristics and Risk Factors for Mortality of COVID-19 Patients With Diabetes in Wuhan, China: A Two-Center, Retrospective Study Hydroxychloroquine in patients with mainly mild to moderate coronavirus disease 2019: open label, randomised controlled trial Nonparametric estimation from incomplete observations Regression Models and Life-Tables Tutorial in biostatistics: competing risks and multi-state models Chapter 5 -Competing risks: Aims and methods Importance of Considering Competing Risks in Timeto-Event Analyses: Application to Stroke Risk in a Retrospective Cohort Study of Elderly Patients With Atrial Fibrillation Competing risks: Aims and methods A proportional hazards model for the subdistribution of a competing risk R: A language and environment for statistical computing. R Foundation for Statistical Computing Randomised Evaluation of COVID-19 Therapy Effect of Dexamethasone in Hospitalized Patients with COVID-19: Preliminary Report The authors would like to thank Prof. Geert Molenberghs for interesting discussions and suggestions that improved the paper Funding