key: cord-0876839-w6i72ivp
authors: Churpek, Matthew M.; Gupta, Shruti; Spicer, Alexandra B.; Parker, William F.; Fahrenbach, John; Brenner, Samantha K.
title: Hospital-Level Variation in Death for Critically Ill Patients with COVID-19
date: 2021-03-30
journal: American journal of respiratory and critical care medicine
DOI: 10.1164/rccm.202012-4547oc
sha: 7cc24e1e25d3b89c00c83977e10c9e91e0fec92f
doc_id: 876839
cord_uid: w6i72ivp

Rationale: Variation in hospital mortality has been described for coronavirus disease (COVID-19), but the factors that explain these differences remain unclear. Objective: Our objective was to use a large, nationally representative data set of critically ill adults with COVID-19 to determine which factors explain mortality variability. Methods: In this multicenter cohort study, we examined adults hospitalized in ICUs with COVID-19 at 70 U.S. hospitals between March and June 2020. The primary outcome was 28-day mortality. We examined patient-level and hospital-level variables. Mixed-effect logistic regression was used to identify factors associated with interhospital variation. The median odds ratio was calculated to compare outcomes in higher- versus lower-mortality hospitals. A gradient-boosted machine algorithm was developed for individual-level mortality models. Measurements and Main Results: A total of 4,019 patients were included, 1,537 (38%) of whom died by 28 days. Mortality varied considerably across hospitals (0–82%). After adjustment for patient- and hospital-level domains, interhospital variation was attenuated (odds ratio decline from 2.06 [95% confidence interval (CI), 1.73–2.37] to 1.22 [95% CI, 1.00–1.38]), with the greatest changes occurring with adjustment for acute physiology, socioeconomic status, and strain. For individual patients, the relative contribution of each domain to mortality risk was as follows: acute physiology (49%), demographics and comorbidities (20%), socioeconomic status (12%), strain (9%), hospital quality (8%), and treatments (3%). Conclusions: There is considerable interhospital variation in mortality for critically ill patients with COVID-19, which is mostly explained by hospital-level socioeconomic status, strain, and acute physiologic differences. Individual mortality is driven mostly by patient-level factors.

As of April 2021, coronavirus disease(COVID- 19) has killed more than 500,000 people in the United States (1). When patients develop severe disease, they are typically transferred to the ICU, which provides more intensive monitoring together with potentially lifesaving critical care therapies such as mechanical ventilation, vasoactive agents, and extracorporeal membrane oxygenation (2, 3) . Studies conducted before the pandemic demonstrated that outcomes of critically ill patients vary across hospitals, which may relate to differences in patient characteristics and the quality of care provided at different hospitals (4) . Emerging data suggest similar variability in outcomes across hospitals for critically ill patients admitted with COVID-19 (5) (6) (7) (8) . The causes of this variability are unclear and could include differences in demographics, comorbidities, the physiologic severity of illness, socioeconomic status, resource strain, hospital quality, and treatments provided. It is also unknown how each of these domains impacts mortality risk for individual patients. A better understanding of the patient-and hospital-level factors impacting death could lead to insights into the reasons for the wide variation in reported outcomes, the determinants of individual patient outcomes, and improved healthcare delivery.

Our objective was to use a large, nationally representative data set of critically ill adults with COVID-19 to determine which factors explain the variability in mortality at both the hospital and the patient level. To do this, we linked detailed patient information with hospital-level data and then explored how different domains explained variations in 28-day mortality.

We used the database of the multicenter STOP-COVID (Study of the Treatment and Outcomes in Critically Ill Patients with COVID-19), a cohort study of 5,154 patients with COVID-19 admitted to ICUs across the United States (see Table E1 in the online supplement for the sites included in this study) (5) . We included consecutive adults (age > 18 yr) admitted to the ICU with laboratoryconfirmed COVID-19 admitted between March 4 and June 29, 2020. Patients were followed until the first of hospital discharge, death, or at least 28 days after ICU admission. Patients transferred to the ICU from other hospitals, admitted to a hospital not linked to the Medicare Hospital Compare ratings, or admitted to a hospital with fewer than 10 COVID-19 ICU admissions in the data set were excluded. A sensitivity analysis was performed by including patients transferred from outside hospitals. The study was approved by institutional review boards at each site, with a waiver of informed consent being given.

Manual chart review was performed at each site by using a standardized case report form, as previously described (5) . Patient-level data collected included admission day, demographic information, comorbidities, vital signs at ICU admission, laboratory values, medications, nonmedication treatments, organ support in the first 2 weeks of ICU admission, and outcomes, including in-hospital mortality. The STOP-COVID data set also included what type of ICU bed the patient was admitted to (e.g., medical-surgical), whether the patient was admitted to a COVID-19-specific ICU or surge unit, and the number of ICU beds at each hospital before the COVID-19 pandemic.

Additional hospital-level variables were collected by linking each study hospital to data from the following sources: the American Hospital Association Annual Survey 2020 database for hospital strain and capacity variables, the 2017 Medicare Hospital Compare ratings for hospital quality ratings, the Healthcare Cost Report Information System, and the 2015 American Community Survey socioeconomic status data, which incorporates information from communities surrounding each hospital by using a previously described methodology (Table E2 ) (9) (10) (11) . Furthermore, time-varying variables describing hospital-level strain were collected from the STOP-COVID data set (i.e., number of other patients with COVID-19 currently in the ICU at a given hospital when a patient was admitted) and from publicly available data on the number of new COVID-19 cases from the past 30 days for the county where each hospital was located (1).

The primary outcome of the study was in-hospital death within 28 days of ICU admission. If a patient was discharged alive before Day 28, they were assumed to be alive at Day 28. This assumption was confirmed in a sample of patients in a previous study (5) . A sensitivity analysis was performed using in-hospital mortality as the outcome.

Explanatory variables were categorized into six domains, including three patient-level domains and three hospital-level domains. The individual variables and domains were chosen a priori on the basis of prior literature and availability. Patient-level domains included acute physiology and severity of illness in the first 48 hours of ICU admission (e.g., vital signs, laboratory values, ventilatory support, number of vasopressors, and renal replacement therapy); demographics and comorbidities (e.g., age, sex, race, body mass index, smoking status, and preexisting conditions); and treatments provided in the first 48 hours of ICU admission (e.g., corticosteroids, remdesivir, tocilizumab, prone position ventilation). Hospital-level treatment intensity was also included as a variable in the treatment domain by calculating the percentage of mechanically ventilated patients with a Pa O In this study of 4,019 patients in 70 hospitals, we found significant interhospital variation in mortality for critically ill patients with COVID-19. This hospital-level variation was mostly explained by hospital-level socioeconomic status, strain, and physiologic differences, although individual mortality was driven mostly by patient-level factors.

ICU beds filled with patients with COVID-19, whether the patient was admitted to a COVID-19-specific ICU or surge unit, total number of medical-surgical beds, prepandemic ICU occupancy rate, number of hospital beds in the county, number of COVID-19 cases in the county from the prior 30 d), and hospital quality scores (mortality, readmission, safety, timeliness, patient experience, and effectiveness). The ICU admission day was used to create a variable that denotes the "days since study start" that a patient was admitted to the ICU, which was assigned to each patient to account for possible longitudinal changes in hospital quality (12) . The full variable list for each domain, together with additional descriptions, is provided in Table E2 . Missing values were imputed by using bagged forests from the caret package in R, which builds ensembles of decision trees, with each tree being fit to a randomly selected, bootstrapped sample of the data set by using nonmissing variables to impute missing variables (see Table 1 for the amount of missing data for each variable). This approach has the advantage of automatically modeling nonlinearities and interactions that may be important for accurate variable imputation (13) . Comparisons between patients who survived and those who died within 28 days were made for all study variables by using Wilcoxon rank sum tests and chi-square tests.

Next, mixed-effect logistic regression models were fit, first with an empty model with a random effect for each hospital and then by sequentially adjusting for variables from each domain in the order described above, which moves from patient-level to hospital-level factors. This ordering allowed for the separation of patient-and hospital-level variables to determine their contributions to interhospital variation in mortality. The change in the adjusted variation of 28-day mortality was calculated, moving from one model to the next, by examining the median odds ratio for each model. The median odds ratio can be interpreted as the difference in odds between a randomly selected lower-risk hospital and a randomly selected higher-risk hospital. It can be conceptualized as the increased risk that a subject would have if he or she were admitted to a higher-risk hospital (14, 15) . Pseudo-R 2 values were also calculated for each individual domain by using Efron's R 2 , which is calculated by taking the sum of the squared model residuals divided by the total variability in the dependent variable.

Finally, to calculate the contribution of the domains to an individual's risk of mortality, a gradient-boosted tree machine learning model was fit by using all of the variables from each domain (16) . Tenfold cross-validation was used to optimize the model's area under the receiver operating characteristic curve. Shapley values, which estimate the contribution of each variable for that individual patient's risk of 28-day mortality (17) , were then calculated for each individual patient. The individual Shapley values were then combined across all patients in the data set by using the mean of their absolute value to determine the percent mortality risk explained by each domain. All analyses were performed by using Stata version 16.1 (StataCorp) and R version 4.2 (R Foundation for Statistical Computing) with the caret, XGBoost, and iml packages. A two-sided P value of ,0.05 denoted statistical significance.

A total of 4,019 patients (median age [interquartile range (IQR)], 63 [53-72]; 63% male [n = 2,532]) from 70 hospitals were included in the analysis after exclusion criteria were applied ( Figure E1 and Table E1 ), and 1,537 patients (38%) died by 28 days. The median number of patients at a given hospital was 34 (IQR, 20-79; Figure E2 ). Patients who died were older (median [IQR], 68 [59-76] yr vs. 60 [49-68] yr), more likely to be male (66% vs. 61%), and more likely to be current or former smokers (30% vs. 23%) and had higher frequencies of most comorbidities than those who survived at 28 days (Table 1) . Most vital signs and laboratory results were significantly different during the first 48 hours of ICU admission between those who died and those who survived (Table 1) . Patients who died were also more likely to have received invasive mechanical ventilation (80% vs. 58%) and renal replacement therapy (9% vs. 5%) during the first 48 hours of ICU admission. Finally, certain medications were more often provided to those who died, such as neuromuscular blocking agents (25% vs. 17%), hydroxychloroquine (63% vs. 59%), and corticosteroids (35% vs. 21%) ( Table 1) .

Compared with patients who survived, patients who died were admitted to hospitals with a higher percentage of ICU beds occupied by patients with COVID-19 (48% vs. 31%), a higher percentage of the population traveling .45 minutes to work (23% vs. 18%), a lower prepandemic ICU occupancy rate (69% vs. Table 2) .

Twenty-eight-day mortality varied widely across hospitals, from 0% at the lowestrisk hospital to 82% at the highest-risk hospital. In the mixed-effect regression model, the median odds ratio decreased from 2.06 (95% confidence interval [CI], 1.73-2.37) in the unadjusted model to 1.22 (95% CI, 1.00-1.38) in the fully adjusted model ( Figure  1 ). This was associated with a change in the range of mortality across hospitals from 12-91% (random effects only) to 32-44% (fully adjusted model). Model adjustment with variables from the physiology, socioeconomic status, and strain domains were associated with the greatest change in the median odds ratio (all with a .0.20 change in the point estimate). The fully adjusted model explained nearly all the variability across hospitals (P value for random effect term = 0.73; see Table  E3 for model coefficients). Pseudo-R 2 values for each individual domain demonstrated similar results, with physiology (0.2), demographics (0.11), socioeconomic status (0.10), and strain (0.09) having the highest values, followed by quality (0.06) and treatments (0.04).

The Shapley values calculated from the XGBoost model using variables from all the domains found that physiology (49%), demographics and comorbidities (20%), hospital socioeconomic status (12%), strain (9%), hospital quality (8%), and treatments (3%) all contributed to mortality risk ( Figure  2 ). The mean contributions of the individual variables in each domain are shown in Figures  E3-E8 . Thus, for patients in the data set, on average, their presenting physiology explained half of their quantifiable individual risk of mortality, whereas external factors such as hospital socioeconomic status, hospital capacity and strain, hospital quality, and the treatments clinicians provided explained over one-quarter (31%) of their mortality risk. Among patient demographics, age had the highest contribution, explaining 12% of the mortality risk, whereas comorbidities explained 4% of a patient's mortality risk. Temporal trends captured by the days since study start variable only explained a small percentage of a patient's mortality risk (1%). in the fully adjusted model, and adjustment with variables from the physiology, socioeconomic status, and strain domains were associated with the greatest change in the median odds ratio. The ordering and magnitude of the domains regarding their contribution to individual risk were also similar. Adding outside hospital transfers back into the cohort also demonstrated results similar to those of the primary analysis ( Figures E11 and E12 ).

In this multicenter cohort study of 4,019 critically ill adults with COVID-19 admitted to ICUs at 70 geographically diverse hospitals across the United States, we found wide variation in 28-day mortality across hospitals. This hospital-level variability was mostly explained by differences in socioeconomic status of the hospital population, hospital capacity and strain, and presenting ICU physiology. Furthermore, the mortality risk for individual patients was largely explained by demographic characteristics and comorbidities as well as acute physiology. To our knowledge,this is the firstmanuscriptofits kind to investigate both hospital-and individual-level contributors to variation in mortality from a large, nationally representative cohort of critically ill patients with COVID-19. Our results help explain the wide variation in published mortality rates for critically ill patients with COVID-19 and quantify how different factors contribute to an individual patient's mortality.

Published reports on the outcomes of critically ill patients with COVID-19 have shown wide variations in mortality. For example, an early report by Arentz and colleagues (18) reported an in-hospital mortality rate of 67% for patients admitted to the ICU at one hospital in Washington State. In contrast, a study by Cummings and colleagues (19) reported a mortality of 39% in a study from two hospitals in New York City. This variability was summarized in a recent systematic review by Serafim and colleagues (8) , which reported an in-hospital mortality range of 1-62%. The cause of this variation has been hypothesized to be related to various factors, such as hospital strain, patient characteristics, and variability in treatment practices (5, (20) (21) (22) (23) .

Our findings provide important insights into the reasons for this wide variation in hospital-level mortality. We found that hospital socioeconomic status, physiology, and hospital strain were the most important factors explaining this variability, whereas treatments provided to patients contributed least. To our knowledge, we are the first to show that the socioeconomic status of the community surrounding a hospital is an important contributor to hospital-level variability in outcomes in a geographically representative sample of critically ill patients with COVID-19. This finding could be due to factors related to either the impact of socioeconomic status on the health status of individual patients in the study or the unobserved variability in the quality of care that hospitals provide for a population with a lower socioeconomic status (22, 24) . Interestingly, the most important individual variable from the socioeconomic status domain was the percentage of patients at the hospital who traveled .45 minutes to work. This variable has been previously used to capture the spatial mismatch hypothesis theory (25, 26) , which relates to discrepancies between the location of lowincome neighborhoods and the locations of employment opportunities. This variable was also found to be one of the most Definition of abbreviations: BMI = body mass index; IQR = interquartile range; PEEP = positive end-expiratory pressure; P/F = Pa O 2 /FI O 2 ; WBC = white blood cell. Data regarding troponin were missing for 2,544 (63%), data regarding P/F were missing for 1,576 (39%), data regarding PEEP Day 1 were missing for 1,513 (38%), data regarding procalcitonin were missing for 1,455 (36%), data regarding D-dimer were missing for 1,233 (31%), data regarding urine output were missing for 1,210 (30%), data regarding lactate were missing for 1,198 (30%), data regarding ferritin were missing for 1,054 (26%), data regarding CRP were missing for 926 (23%), data regarding arterial pH were missing for 902 (22%), data regarding smoking status were missing for 745 (19%), data regarding lymphocytes were missing for 36 (11%), data regarding aspartate aminotransferase were missing for 353 (9%), data regarding mental status were missing for 220 (5%), data regarding PEEP Day 2 were missing for 163 (4%), data for BMI were missing for 152 (4%), data regarding WBC counts were missing for 77 (2%), data regarding creatinine were missing for 70 (2%), and data regarding sodium were missing for 24 (,1%). Missing data were imputed by using bagImpute and are included in the Troponin T or I value greater than the 99th percentile upper reference limit of normal for that laboratory test. jj Refers to the P/F ratio and was only recorded in patients receiving invasive mechanical ventilation. Other values were imputed. ¶ Received renal replacement therapy for acute or chronic renal failure. **Included phenylephrine hydrochloride, epinephrine, norepinephrine bitartrate, vasopressin, dopamine hydrochloride, dobutamine, and milrinone.

important metrics of social risk in a study investigating hospital ratings and neighborhood disadvantage (9) . Our findings of increased mortality related to hospital population socioeconomic status suggest that COVID-19 may be exacerbating existing healthcare disparities in the United States (27) .

The majority of an individual's risk of mortality was related to the presenting physiology, demographics, and preexisting conditions. Only one-quarter of a patient's quantifiable mortality risk was related to other factors such as hospital capacity and strain, hospital socioeconomic status, hospital quality, and treatments. Prior work suggested that the number of preexisting ICU beds is an important predictor of mortality among critically ill patients with COVID-19 (5), suggesting a correlation between ICU capacity and outcomes.

However, additional factors such as the baseline occupancy rate before the pandemic and the number of patients with COVID-19 currently admitted to the ICU are important to consider when determining the strain on critical care resources. By including these variables and other related factors in one domain, we were able to show that strain and capacity contribute to both hospital-level variability and individual mortality. This contribution to mortality risk may be related to rationing, more aggressive goal-of-care discussions, and treatment of critically ill patients outside the normal ICU or by less experienced providers. Hospital quality scores also had some explanatory power, albeit they had less than hospital socioeconomic status or 

Fully Adjusted Unadjusted Figure 1 . Case mix-adjusted probabilities of 28-day mortality. The graphs illustrate the change in interhospital variation in death as each domain is added to the unadjusted mixed-effect model (leftmost panel) and end with the fully adjusted model (rightmost panel), which shows that most of the variation in mortality across hospitals can be explained by the domains included. The x-axis is hospital ranked by increasing probability of death in 28 days, and the y-axis shows the case mix-adjusted probability of death in the mixed-effect regression model, with the red dots denoting the point estimates and the whiskers denoting the 95% confidence intervals. The median OR and range in mortality are presented for each model. Demo = demographics; OR = odds ratio; SES = socioeconomic status. strain. This suggests that the quality of the hospital a patient with COVID-19 goes to has a small but measurable effect on their outcome, which is consistent with prior work in all hospitalized patients (28) . Of all the domains studied, the treatments provided to patients had the least impact on hospital-level variability and individual-level mortality risk. This may be explained by the fact that few treatments have shown a mortality benefit for critically ill patients with COVID-19 (29) (30) (31) (32) . Notably, the three treatments that contributed the most to improved mortalityneuromuscular blockade, aspirin, and tocilizumab-are all therapies that have previously been shown to improve outcomes for patients with COVID-19 or acute respiratory distress syndrome (33) (34) (35) .

This study has several strengths. Our cohort consisted of a geographically diverse sample of critically ill adults with COVID-19. We had access to detailed patient characteristics, physiology, interventions, and medications during their ICU stay. In addition, we were able to link the hospitals where these patients were admitted to quality scores and hospital-level socioeconomic status. Furthermore, by linking patients to the American Hospital Annual Survey data and time-varying county-level COVID-19 data, we were able to better quantify capacity and strain. Finally, in addition to standard mixed-effect regression models, we also used a state-of-theart machine learning approach to determine the contribution of individual variables to patient mortality (17) . This study also has several limitations. First, although we were able to identify variables associated with mortality, our study design does not lend itself to inferring causality. In addition, our findings only apply to patients admitted to the ICU, as we did not have data on patients who were critically ill but were not admitted to an ICU (e.g., because of bed rationing or goals of care). Furthermore, there may be additional variables that contribute to mortality risk that we did not account for in our study. For example, best practices and supportive care interventions, such as low-VT ventilation for patients with acute respiratory distress syndrome, were not collected, nor were other hospital-level factors (e.g., teaching status, intensivist coverage, and nurse-to-patient ratios) or the duration of treatments. Similarly, the Shapley values measure only quantifiable mortality that is explained by the variables in the model. It is also possible that some patients were discharged alive before Day 28 only to die at home soon thereafter (e.g., patients discharged to home hospice). Although we verified in 50 patients at six participating hospitals that all patients discharged alive before 28 days were still alive at Day 28, this might not be true at all centers. In addition, the hospital quality data were collected in 2017, which may not reflect quality of care during the present-day pandemic. Finally, we only had hospital-level socioeconomic status available as opposed to individual socioeconomic status, so we could not determine whether the impact of this domain was related to the socioeconomic status of individual patients or the resources and quality that might be associated with hospitals that provide care for patients with varying socioeconomic status characteristics.

In conclusion, we found considerable interhospital variation in death among critically ill patients with COVID-19. This variability is explained by several domains, including hospital socioeconomic status, presenting physiology, and hospital capacity and strain. Similar factors contribute to an individual patient's risk of mortality, with patient-level factors (e.g., physiology, demographics, and comorbidities) explaining most of their mortality risk.

Author disclosures are available with the text of this article at www.atsjournals.org.

of pop, median (IQR)

753 to 96,175) Metro area, n (%) 3,534 (87.9) 2,124 (85.6) 1,410 (91.7) Hospital strain, median (IQR) Hospital ICU beds w/ STOP-COVID patients

Definition of abbreviations: COVID-19 = coronavirus disease

O 2 /FI O 2 ; pop = population; STOP-COVID = Study of the Treatment and Outcomes in Critically Ill Patients w/ COVID-19

Missing data were imputed by using bagImpute and are included in the table

05 for difference between survivors and nonsurvivors (Wilcoxon rank sum test for continuous variables and chi-square test for categorical variables). † Time varying based on the date of patient admission. References 1. COVID data tracker

Severe outcomes among patients with coronavirus disease 2019 (COVID-19): United States

Severe COVID-19

Hospital volume and the outcomes of mechanical ventilation

STOP-COVID Investigators. Factors associated with death in critically ill patients with coronavirus disease 2019 in the US

STOP-COVID Investigators. AKI treated with renal replacement therapy in critically ill patients with COVID-19

STOP-COVID Investigators. In-hospital cardiac arrest in critically ill patients with COVID-19: multicenter cohort study

Clinical course and outcomes of critically ill patients with COVID-19 infection: a systematic review

Neighborhood disadvantage and hospital quality ratings in the Medicare hospital compare program

Overall hospital quality star ratings overview. Baltimore, MD: Centers for Medicare and Medicaid Services

American Community Survey socioeconomic status data

Trends in intensive care for patients with COVID-19 in England, Wales, and Northern Ireland

Random forest missing data algorithms

Appropriate assessment of neighborhood effects on individual health: integrating random and fixed effects in multilevel logistic regression

Hospital-level associations with 30-day patient mortality after cardiac surgery: a tutorial on the application and interpretation of marginal and multilevel logistic regression

The elements of statistical learning: data mining, inference, and prediction

Interpretable machine learning

Characteristics and Outcomes of 21 critically ill patients with COVID-19 in Washington State

Epidemiology, clinical course, and outcomes of critically ill adults with COVID-19 in New York City: a prospective cohort study

Outcomes of COVID-19: disparities in obesity and by ethnicity/race

Is ethnicity linked to incidence or outcomes of COVID-19?

Racial disparities in incidence and outcomes among patients with COVID-19

Case fatality rates for patients with COVID-19 requiring invasive mechanical ventilation: a meta-analysis

Healthcare disparities in critical illness

Race, spatial mismatch, and job accessibility: evidence from a plant relocation

Neighborhoods, employment, and welfare use: assessing the influence of neighborhood socioeconomic composition

Celed on JC. The structural and social determinants of the racial/ethnic disparities in the U.S. COVID-19 pandemic: what's our role?

Relationship between Medicare's Hospital Compare performance measures and mortality rates

A trial of lopinavirritonavir in adults hospitalized with severe COVID-19

ACTT-1 Study Group. Remdesivir for the treatment of COVID-19: final report

Coalition COVID-19 Brazil I Investigators. Hydroxychloroquine with or without Azithromycin in Mild-to-Moderate COVID-19

BACC Bay Tocilizumab Trial Investigators. Efficacy of tocilizumab in patients hospitalized with COVID-19

Neuromuscular blockers in early acute respiratory distress syndrome

RECOVERY Collaborative Group. Dexamethasone in hospitalized patients with COVID-19

Aspirin use is associated with decreased mechanical ventilation, intensive care unit admission, and in-hospital mortality in hospitalized patients with coronavirus disease 2019

The authors thank the clinical and research staff from the participating sites.