key: cord-348772-xcf0jmqp authors: Rickards, C. G.; Kilpatrick, A. M. title: Age-Stratified SARS-CoV-2 Infection Fatality Rates in New York City estimated from serological data date: 2020-10-20 journal: nan DOI: 10.1101/2020.10.16.20214023 sha: doc_id: 348772 cord_uid: xcf0jmqp Importance: COVID-19 has killed hundreds of thousands of people in the US and >1 million globally. Estimating the age-specific infection fatality rate (IFR) of SARS-CoV-2 for different populations is crucial for assessing the fatality of COVID-19 and for appropriately allocating limited vaccine supplies to minimize mortality. Objective: To estimate IFRs for COVID-19 in New York City and compare them to IFRs from other countries. Design, Setting, Participants: We used data from a published serosurvey of 5946 individuals 18 years or older conducted April 19-28, 2020 with time series of COVID-19 confirmed cases and deaths for five age-classes from the New York City Department of Health and Mental Hygiene. We inferred age-specific IFRs using a Bayesian framework that accounted for the distribution of delay between infection and seroconversion and infection and death. Main Outcome and Measure: Infection fatality rate. Results: We found that IFRs increased approximately 77-fold with age, with a nearly linear increase on a log scale, from 0.07% (0.055%-0.086%) in 18-44 year olds to 5.4% (4.3%-6.3%) in individuals 75 and older. New York City IFRs were higher for 18-44 year olds and 45-64 year olds (0.58%; 0.45%-0.75%) than Spanish, English, and Swiss populations, but IFRs for 75+ year olds were lower than for English populations and similar to Spanish and Swiss populations. Conclusions and Relevance: These results suggest that the age-specific fatality of COVID-19 differs among developed countries and raises questions about factors underlying these differences. COVID-19 has killed hundreds of thousands of people in the US and >1 million globally. 1 The infection fatality rate (IFR)or the chance of dying after becoming infectedof SARS-CoV-2 is a crucial metric for understanding disease severity. It has been the subject of substantial controversy, in part due to differences between the symptomatic or case fatality rate (CFR) and the infection fatality rate, and also due to differences between naïve estimates of CFR calculated from the ratio of deaths divided by cases and estimates of CFR that account for the delays between infection, case detection and death 2, 3 . Accurately estimating the IFR requires estimates of the number of infections at a point of in time and deaths that result from these infections which occur after a substantial, and highly variable, delay (mean 20.2 d; 95% CI 8.0, 50.0; 4 ). This makes simplistic calculations of IFR problematic when the incidence of infection is changing over time. Previous analyses have shown enormous variation in IFR with age 3,5-8 which indicates that accurate age-specific IFRs are needed to allocate limited supplies of vaccines to meet a goal of minimizing mortality from COVID-19 9 . The most accurate approach for estimating age-specific IFRs is a large-scale serosurvey in a population where substantial mortality due to COVID-19 has occurred 7, 8 , because deaths in younger age groups are relatively rare 4 . Substantial differences in age-specific COVID-19 IFRs are apparent in estimates from Spain 8 and England 7 , indicating the need for additional IFR estimates from other hard-hit populations. We used data from a seroprevalence study 10 conducted in New York City in the waning period of the spring epidemic and publicly available death records to estimate age-specific IFRs for the New York city population. We estimated infection fatality rates (IFRs) using data from New York City from a serological survey 10 and confirmed COVID-19 cases and deaths recorded by the New York City Department of Health and Mental Hygiene (https://www1.nyc.gov/site/doh/covid/covid-19-dataarchive.page). The serosurvey occurred on 6 days over a 10-day period (April 19-28) two-weeks after the peak in COVID-19 cases (Fig 1) . Individuals for the serological survey were recruited at grocery stores without prior advertisement to reduce bias and increase randomization 10 . Daily case counts and number of deaths were obtained from the New York City Department of Health and Mental Hygiene archive webpage for the dates 3/23/20 to 5/17/20 (https://www1.nyc.gov/site/doh/covid/covid-19-data-archive.page). We estimated IFRs using a previously established Bayesian statistical framework 4 which combines seroprevalence estimates (including uncertainty) with time series of cases and deaths to estimate the number infected and the fraction dying over time using estimates (log-normal distributions) of the delays between infection, symptom onset, becoming a case, seroconversion, and death 4 . This approach estimates the cumulative number of infected people at the time of the seroprevalence study and the fraction of those infected that go on to die. We calculated three age-specific estimates of IFRs to address mismatches in age-classes between deaths and seroprevalences and to consider both confirmed and probably COVID-19 deaths (see Supplemental Materials: Methods for details). We compared the IFR estimates for New York City to two other age-specific estimates . We show IFRs for England and Geneva, Switzerland that include care home resident deaths because the New York City deaths also include care home deaths (IFRs from Spain excluded care home deaths and did not indicate their distribution among age categories). Finally, we show IFRs for both confirmed COVID-19 deaths and excess all-cause deaths for the two studies that included these estimates (Ward et al. 2020 and Pastor-Barriuso et a. 2020). The serosurvey in New York City took place in late April, in the latter third of the initial epidemic, when new cases per day had fallen to approximately half of the peak (Fig 1) . At this point 22.7% of the population was seropositive, and an estimated 1.5 million infections had occurred 10 . Due to the delay between infections and deaths, deaths from infections detected in the serosurvey extended into late May (Fig 1) . By May 17 there were nearly 16,000 confirmed COVID-19 deaths, and nearly 4,500 probable COVID-19 deaths. The age-specific IFR for SARS-CoV-2 in New York City increased with age approximately 77-fold from 0.07% in 18-44 to 5.3% in 75+ year-olds, in the raw analysis where we assumed equal seroprevalence for all subgroup age-classes (55-64, 65-74, 75+) within the 55+ age-class (Table 1 ). In the adjusted analysis where used fine-scale age-specific seroprevalence data from Spain to estimate seroprevalence of 0-17 year-olds and variation within the 55+ age class, the IFR increased from 0.0018% for 0-17 year-olds, or roughly 2 fatalities per 100,000 infected individuals to 5.4% in 75+ year-olds (Table 1) . Including probable COVID-19 deaths increased IFRs 20-31% across the five age classes (Table S1 ). The age-specific IFRs from New York City for the 18-44 and 45-64 age classes were higher (with non-overlapping 95% confidence intervals) than corresponding IFRs for England, Geneva, France and Spain, but had overlapping confidence intervals and were similar to estimates for China (Figs. 2, S2). In contrast, the IFRs for the oldest two age classes, 65-74 and 75+ were lower (with non-overlapping 95% CIs) than IFRs from England, but had overlapping confidence intervals and were similar to corresponding IFRs from Geneva, France, Spain and China (Figs 2, S2). The IFRs in New York City were higher for the two younger age classes (18-44, 45-64), which accounted for 26% of the 15,885 deaths (Table 1) , than all three other studies based on large-scale serosurveys (Geneva, Spain and England). This may have been due to a higher prevalence of pre-existing conditions in these age-groups than in other populations; 79% and 85% of COVID-19 deaths in these two age groups had pre-existing conditions (Diabetes, Lung Disease, Cancer, Immunodeficiency, Heart Disease, Hypertension, Asthma, Kidney Disease, GI/Liver Disease, or Obesity). In contrast, IFRs for the two older age classes were very similar to estimates from most other countries despite 79% and 76% of these deaths also having preexisting conditions. Two other factors that influenced both IFRs or exposure to SARS-CoV-2 in previous studies are sex and race/ethnicity 7, 8, 10 . Although neither seroprevalence nor daily age-specific COVID-19 deaths by sex and race/ethnicity were publicly available, crude calculations can be made assuming that differences in exposure and death between ages were consistent among . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted October 20, 2020. . sexes and races/ethnicities. There were 55% more deaths in New York City in men than women, whereas seroprevalence was only 6.3% higher in men than women 10 , suggesting that IFRs for men were, on average across ages, 46% lower. This is a smaller sex difference than estimated for Spanish and French populations 6, 8 . Seroprevalence in New York City was 1.98-fold higher for hispanic/latinos than white people 10 whereas death rates were 5.9, 3.1, 2.5, and 1.6-fold higher for 18-44, 45-64, 65-74, and 75+ age-classes, respectively, suggesting that either age-specific exposure or age-specific IFRs differ among these races/ethnicities. Two other studies have estimated IFRs for New York City using other approaches. Our age-specific estimates for IFRs in New York City were uniformly lower (by 1.7 to 2.6-fold) than IFRs estimated from model-based inference of infections 11 , and crudely similar to, but difficult to compare to, a crude ratio of deaths four weeks after a serosurvey divided by estimated infections from that serosurvey, because that study 5 estimated IFR for age categories than that didn't match the published seroprevalence study 10 or health department death data. One shortcoming of our study is that the oldest age class for the seroprevalence study was broad (55+) and combined ages (60-69, 70-79, 80+) in which IFRs differed 10-fold in other studies 8 , whereas deaths were available in finer age classes. The two analyses we performed to quantify IFRs within this 55+ group produced similar results because the fine resolution seroprevalence data from Spain suggested only a slightly lower seroprevalence for 75+ compared to 45-54 (~4.4%; Table S2 ). In contrast, seroprevalence in England for 65-74 and 75+ age classes was 1.8 times lower than for 55-64 year-olds ( Fig S3) ; a non-representative serosurvey of residual sera from commercial labs 12 also showed lower seroprevalence in older age classes. If we assume the relative seroprevalence in New York City in these three age classes was similar to England, IFRs for the latter two groups would be 1.8-fold higher: 3.6%, and 9.5%. This IFR for the oldest group is still substantially lower than the corresponding IFR for England (Fig 2, S2) . The variation in seroprevalence within older age classes in some studies 7,13 emphasizes the importance of reporting both seroprevalence and deaths for SARS-CoV-2 for finer age classes to enable a more accurate understanding of the fatality of COVID-19 for these older age groups which are most vulnerable to death from COVID-19. A key unanswered question concerning the fatality of COVID-19 is how much age-specific IFRs have changed over time due to better case management and treatments. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted October 20, 2020. . https://doi.org/10.1101/2020.10.16.20214023 doi: medRxiv preprint . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted October 20, 2020. Supplemental Tables and Figures Table S1 . Seroprevalence and infection fatality rate (IFR) estimates for five age classes using both confirmed and probable COVID-19 deaths and either the raw seroprevalence values in 10 * There was no seroprevalence estimate for the 0-17 age class so no IFR is available for the raw analysis. . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted October 20, 2020. . https://doi.org/10.1101/2020.10.16.20214023 doi: medRxiv preprint Table S2 . Detailed calculations for estimating the subgroup seroprevalence for the 55+ age class into seroprevalence estimates for the 45-65, 65-75, and 75+ age classes using the relative seroprevalence values shown in Figure S1 . . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted October 20, 2020. . Figure S2 . Age-specific infection fatality ratio (IFR) of COVID-19 (mean  95% CI) on an untransformed scale with points plotted at the midpoint of the age-class on the x-axis (points are slightly jittered along the x-axis facilitate presentation). Lines show age-specific IFRs for different populations, either including just confirmed COVID-19 deaths, or estimates from "All cause" mortality. For New York City, "NYC (This study)" the figure shows the "adjusted" estimates from Tables 1 and S1 for both confirmed COVID-19 deaths and confirmed + probable COVID-19 deaths. Datasets with an asterisk (*) estimate the number of infections based on seroprevalence studies; others estimate infections using either testing of passengers on the Princess Cruise ship (Chia (Russell), France (Salje)) or repatriated travelers from Wuhan (China(Verity)). . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted October 20, 2020. . . CC-BY-NC-ND 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint this version posted October 20, 2020. . The age classes used for the seroprevalence study 10 (18-34, 35-44, 45-54, and 55+) did not match the age classes of the cases and deaths provided by the New York City Department of Health (0-17, 18-44, 45-64, 65-74, and 75+). Most importantly, the seroprevalence study grouped all individuals older than 55 and did not include individuals younger than 18. Previous estimates of IFR differ 30-fold between 50-59 (0.38%; 95% CI: 0.32-0.45), and 80+ (11.62%; 95% CI: 8.06-16.47) 8 , so we estimated IFRs for subgroups within the 55+ group by using the finer age classes reported by the New York City Department of Health for cases and deaths (55-64, 65-74, and 75+). As a result, we performed two separate analyses using different estimates of seroprevalence for these finer age classes. For the first ("raw") analysis, we used the same single seroprevalence estimate from the 55+ group for all three age classes that we split this group into: 55-64, 65-74, and 75+. For this raw analysis we did not estimate the IFR for the 0-17 age class. We estimated the seroprevalence in each of the age-classes for the death data using population-weighted averages from the seroprevalence study. We used 2017 estimates of 2010 census data for population estimates (https://www.baruch.cuny.edu/nycdata/population-geography/age_distribution.htm). For example, for the IFR for the 45-64 age class, we averaged the seroprevalence estimates for the 45-54, and 55+ age classes, weighted using the fraction of the New York City population that are in the 45-54 and 55-64 age classes. For the second ("adjusted") analysis, we used fine-resolution (5-year increments, from infancy to 85+) seroprevalence data from a study in Spain 13 (Fig. S3) , to estimate a seroprevalence for the 0-18 group in which 10 deaths due to SARS-CoV-2 occurred in New York City in this age group and to estimate variation in seroprevalence with the 55+ age group for the three oldest age classes 55-64, 65-74, and 75+. We fit a simple quadratic function to the data, which captured the decrease with age in seroprevalence after age 55 (Fig S1) . We calculated population-weighted estimates of prevalence for the subgroups within the 55+ age class, which resulted in elevated seroprevalence in 55-64 age class and lower seroprevalence in the 65-74 and 75+ age classes relative to the 55+ age class (Table S1 ). We also used data for individuals 0-17 from Spain to estimate seroprevalence for this age group in New York City. The ratio of seroprevalence in 0-17 year olds to seroprevalence in 18-44 year olds in Spain 13 was 0.72. We multiplied the seroprevalence in the 18-44 age class from the study in New York City by this relative ratio to estimate seroprevalence of 0-17 year olds (Table 1) . An interactive web-based dashboard to track COVID-19 in real time Estimating clinical severity of COVID-19 from the transmission dynamics in Wuhan, China Estimates of the severity of coronavirus disease 2019: a model-based analysis Serology-informed estimates of SARS-CoV-2 infection fatality risk in Assessing the Age Specificity of Infection Fatality Rates for COVID-19: Systematic Review, Meta-Analysis, and Public Policy Implications. medRxiv Estimating the burden of SARS-CoV-2 in France Antibody prevalence for SARS-CoV-2 in England following first peak of the pandemic: REACT2 study in 100,000 adults National Academies of Sciences E, and Medicine. Framework for equitable allocation of COVID-19 vaccine Cumulative incidence and diagnosis of SARS-CoV-2 infection in New York Estimating the infection fatality risk of COVID-19 in New York City during the spring 2020 pandemic wave. medRxiv Seroprevalence of Antibodies to SARS-CoV-2 in 10 Sites in the United States Prevalence of SARS-CoV-2 in Spain (ENE-COVID): a nationwide, population-based seroepidemiological study Funding was provided by NSF grant DEB-1911853. We thank Javier Perez-Saez and co-authors for making their R code available (https://github.com/HopkinsIDD/sarscov2-ifr-gva) which facilitated the analyses presented here.