key: cord-0974206-eh4egeo1 authors: Nanda, Sanjeev; Toussaint, Loren; Vincent, Ann; Fischer, Karen M.; Hurt, Ryan; Schroeder, Darrell R.; Chacin Suarez, Audry S.; Medina Inojosa, Jose R.; O’Horo, John C.; DeJesus, Ramona S.; Abu Lebdeh, Haitham S.; Mundi, Manpreet S.; Iftikhar, Salma; Croghan, Ivana T. title: A Midwest COVID-19 Cohort for the Evaluation of Multimorbidity and Adverse Outcomes from COVID-19 date: 2021-04-15 journal: J Prim Care Community Health DOI: 10.1177/21501327211010991 sha: 8866be0c491b4cfb3292ef329c5c564d799d624d doc_id: 974206 cord_uid: eh4egeo1 OBJECTIVE: To describe the process and outcome of creating a patient cohort in the early stages of the COVID-19 pandemic in order to better understand the process of and predict the outcomes of COVID-19. PATIENTS AND METHODS: A total of 1169 adults aged 18 years of age or older who tested positive in Mayo Clinic Rochester or the Mayo Clinic Midwest Health System between January 1 and May 23 of 2020. RESULTS: Patients were on average 43.9 years of age and 50.7% were female. Most patients were white (69.0%), and Blacks (23.4%) and Asians (5.8%) were also represented in larger numbers. Hispanics represented 16.3% of the sample. Just under half of patients were married (48.4%). Common comorbid conditions included: cardiovascular diseases (25.1%), dyslipidemia (16.0%), diabetes mellitus (11.2%), chronic obstructive pulmonary disease (6.6%), asthma (7.5%), and cancer (5.1%). All other comorbid conditions were less the 5% in prevalence. Data on 3 comorbidity indices are also available including the: DHHS multi-morbidity score, Charlson Comorbidity Index, and Mayo Clinic COVID-19 Risk Factor Score. CONCLUSION: In addition to managing the ever raging pandemic and growing death rates, it is equally important that we develop adequate resources for the investigation and understanding of COVID-19-related predictors and outcomes. Patients databases are an important part of healthcare. [1] [2] [3] Professionals and researchers in healthcare rely on wellcharacterized patient samples for the development of new research studies and the growth of understanding of illness and disease, especially in terms of specific or rare diseases and conditions. [1] [2] [3] For instance, the National Institutes of Health (NIH) maintains a noncomprehensive list of almost 75 active patient registries that facilitate research and discovery in areas such as Alzheimer's, heart, and kidney disease, as well as, cancer, infertility, and genetic disorders, to name just a few. 1 The COVID-19 pandemic has shined a light on just how important patient databases can be to the development and knowledge of novel diseases. With major knowledge gaps regarding COVID-19, it is paramount to medical research that adequately characterized patient cohorts are available for COVID-19 studies of predictors, outcomes, and treatments to proceed. 4 In the 9 months since the onset of COVID-19, numerous patient databases have been developed and utilized to address important research questions. For instance, some databases have focused on acquisition of patient data on cardiovascular comorbidities, 5 while others have emphasized comorbid conditions such as diabetes 6 and cancer. 7 These are important databases, but they lack the data necessary to offer medical researchers the ability to The retrospective data collection design for this database was reviewed by The Mayo Clinic Institutional Review Board (IRB) and determined to be exempt under section 45 CFR 46.101, item 2. During the study, all significant changes to study design and procedures were appropriately filed, reviewed, and approved by the IRB. We used the EPIC platform to extract data for this database. The EPIC platform provides a robust real-time platform for Research at Mayo Clinic that combines diverse data across multiple databases and presents that data to applications as if it were single-sourced. This platform enables reproducible and accessible research across the enterprise, and it is maintained by institutional resources. Any research conducted with this database must be under the oversight of our Institutional Review Board and as such can only be accessed by investigators within any site or satellite associated with our Institution. Accessibility to this specific Mayo Clinic sub-cohort will be considered by the corresponding author at a reasonable request and following institutional guidelines. This COVID-19 cohort database includes 1169 adults aged 18 years of age or older who tested positive in Mayo Clinic Rochester or the Mayo Clinic Midwest Health System. In this paper we describe clinical characteristics of the sample of consecutive patients who tested positive between January 1 and May 23 of 2020. All patients had a COVID-19 positive result on reverse-transcriptase-polymerase-chain-reaction (RT-PCR) assays from nasopharyngeal swab specimens. Data abstracted from the electronic medical record (EMR) for the above population included demographic data, clinical data and comorbidity data. Patient demographic data included: age, gender, ethnicity, race, and marital status; Patient clinical data included: dates of admission and discharge, ICU admission, hospital mortality, COVID-19 mediations, height, weight, and body mass index, pregnancy status, and smoking history; Three separate assessments of comorbidity were included in the patient data: (1) Charlson Index Score; 8, 9 (2) The U.S Department of Health and Human Services (DHHS) multi-morbidity score 8,10 ; and (3) the Mayo Clinic COVID-19 Risk Factors Score. The Charlson Comorbidity Index is a severity weighted measure of total comorbidity burden which predicts mortality risk and includes 17 comorbidities. A score of 0 on the Charlson Comorbidity Index translates to a 98% chance of 10-year survival, while a score of 7 or greater yields a 0% chance of 10-year survival. Second, the U.S Department of Health and Human Services (DHHS) multi-morbidity score is a composite score representing the cumulative increased burden of 20 conditions (see Table 1 ). International Classification of Diseases, 9th Revision (ICD-9) and 10th Revision (ICD-10) codes were used to define the 20 chronic conditions. 11 Data for patients in this databases were retrieved electronically from the Rochester Epidemiology Project (REP) 12 within a 5-year capture frame from COVID-19 diagnosis. The list of the 20 conditions is provided in Table 1 . Third, The COVID-19 risk score was designed to be a Mayo specific modification to an Epic generated COVID score. It was created by Infectious diseases as a clinical tool to assist clinicians with the timely triage and care of patients who tested positive for COVID-19 in the Mayo Clinic and Mayo Clinic Health System. The goal was to identity patients with a positive test who may require a higher level of care and triage them appropriately to prevent or mitigate an adverse outcome. The score took into consideration pertinent comorbidities (hypertension, heart disease, chronic lung conditions including COPD and asthma, chronic liver or kidney disease, diabetes, immune-compromising states, and nursing home residency) in addition to age, gender, and pregnancy. Score of 0 to 2 was considered low risk, score of 3 to 5 was considered moderate risk and score over 6 was considered high risk. The validation of this COVID-19 Risk Index Score is currently in submission. 13 Patient baseline clinical characteristics (including those that comprise reported risk scores), vital signs and laboratory values were retrieved electronically using the EPIC platform, within 30 days of the COVID-19 diagnosis. Clinical diagnosis and the Charlson comorbidity index were determined based on administrative billing data using the International Classification of Diseases 9th to 10th revision. In order to decrease false positive results, 2 occurrences of a code (either the same code or 2 different codes within the code set for a given disease) separated by more than 30 days and occurring within 5 years before the index date were required for diagnosis, an approach that has been extensively validated in the past. 14, 15 This information was reviewed for internal validation in duplicate by 2 independent investigators (AC and JMI) who were blinded to the patients other characteristics. Baseline patient characteristics are summarized using mean ± standard deviation, median and range for continuous variables and frequencies counts and percentages for categorical variables. COVID-19 patients included in this sample totaled 1169. Descriptive characteristics of this patient databases are included in Table 2 . Patients were 18 to 99 years of age (M = 43.9, SD = 17.5). The sex distribution was 50.7% female and 49.2% male. The largest racial group was Whites (69.0%) followed by Blacks (23.4%) and Asian Americans (5.8%), and Native Americans, Native Hawaiians, and nondisclosing each represented less than 1% of the patient sample. Most patients were non-Hispanic (77.0%), with the balance being Hispanic (16.3%) and non-disclosing patients (6.7%). Just under half (48.4%) were married, less than 2% were pregnant, and the majority had never smoked (72.3%). At the time of this database creation, few Americans were able to access therapeutic medications for Covid-19 and this is reflected in that with less than 1% of patients actively using Hydroxychloroquine, Remdesivir, or Tocilizumab. Among the 41.5% patients classified as obese (body mass index (BMI) equal or greater than 30 kg/m 2 ) 48.9% were classified as severely obese (BMI equal to or greater than 40 kg/m 2 ). In terms of comorbid conditions, a total of 14 conditions are present in the database. Just over a 4th of the sample (25.1%) had a cardiovascular condition, and dyslipidemia (16.0%) and diabetes mellitus (11.2%) were also common as were chronic obstructive pulmonary disease (6.6%), asthma (7.5%), and cancer (5.1%). All other comorbid conditions were less the 5% in prevalence. Comorbidity indices showed the full range of observed scores. On the DHHS Multi-Morbidity Score 19.1% of the samples had 3 or more comorbidities, on the Charlson Comorbidity Index the average was 0 (observed range = 0-13) (The age-adjusted Charlson comorbidity index observed range was 0-16), and on the Mayo Clinic COVID-19 Risk Factors Score the average was 1 (observed range = 0-9). The purpose of the present work was to create a COVID-19 patient cohort database that could be used to investigate questions pertaining to risk factors for adverse COVID-19 outcomes, especially as risk pertains to comorbid conditions. COVID-19 patient cohorts remain highly relevant to understanding the development, progression, and outcomes of COVID-19, and importantly, to identifying those at highest risk of adverse outcomes. [5] [6] [7] As the COVID-19 pandemic rages on and it once again becomes the leading cause of death in older Americans, 16 and even adds considerably to excess death in younger Americans 17 the need to identify those most vulnerable so we can do everything in our power to protect them from adverse outcomes is ever more important. While there are a few studies reporting community COVID-19 cohorts in early stages of the COVID-19 pandemic, there are no reported cohorts in the mid to late stages of the pandemic or comparison studies of cohorts before and after the COVID-19 vaccine. In a retrospective multi center cohort of hospitalized patients with COVID-19 from March 1 to April 17, 2020, Imam et al report that advanced age and increasing number of comorbidities were independent predictors of inpatient modality for COVID-19 patient's. 18 Of the 1305 patient's in this study, mean age was 61 years (compared to 43.9 years in our sample), 53.8% of patients were male (compare to 49.2% in our sample), 66.1% of patients were African American (compared to 23.4% in our sample). The top 3 comorbidities were hypertension (56.2% in their sample, 17.5% in our sample), followed by diabetes mellitus (30.1% in their sample compared 11.2% in our sample), chronic kidney disease (17.5% in their sample compared to 3.8% in our sample). Although some of these differences are likely attribute ability to older males and African American race, it highlights the significance of multi morbidity in COVID-19 patients. In another study describing the 1st 100 cases of COVID-19 in a hospital from Madrid with a 2 month follow-up, Munoz et al report a high mortality (23%) at 2 month follow up mostly attributable to advanced age and presence of multi morbidity. 19 The top 3 comorbidities in their sample were hypertension, heart disease and diabetes mellitus. Their adjusted charlson comorbidity index was 2 compared to out score of 0 indicating a sicker population that likely contributed to the higher mortality. Although no longitudinal assessments utilizing this cohort have been completed to date, we speculate that this database could have several uses in the future. For example, there is currently a deficit of information on post covid syndrome, a condition characterized by fatigue, breathlessness, arthralgias, and neuropsychological symptoms. 20 One unique aspect of this dataset is its ability to be linked to the electronic medical record. This would allow us to measure real time clinical encounters related to post covid syndrome and long term outcomes including associated morbidity and disability longitudinally in this patient cohort. This is important information as medical institutions globally prepare to provide clinical care for this little known condition. In addition, this database which represents patients with COVID-19 earlier on in the pandemic could be expanded to include patients in the middle and latter part of the pandemic. This would allow for comparative studies that provides datas on how therapeutics and vaccines altered mortablity and impacted post covid sequelae. Additionally the exact mechanism for the association of COVID-19 with cardiovascular comorbidities such as hypertension, coronary artery disease and diabetes is currently unknown. A well characterized cohort would allow future mechanistic studies related to COVID-19. Our cohort compilation has several limitations. First, our cohort reflects the demographic characteristics of the region in which this medical center is located; therefore our sample is predominantly Caucasian and our findings therefore have limited generalizability to cohorts that are more racially and ethnically diverse. This limitation could be easily addressed by the inclusion of COVID -19 patients from 2 other states (pending IRB approval) where this medical center has a presence (AZ and FL) where the population is more racially and ethnically diverse. Second, this this database is also limited in that it consists of positive Covid-19 cases in the first phase of the Covid-19 pandemic (March 2020-May 2020) and does not contain further follow up past May 2020. This limitation too can be addressed, since the database can be added to and long term follow up through the patient electronic medical record can be obtained. Additionally the database can easily adapted to include long term follow up and it can be compared with positive cases mid to late pandemic (May-August 2020 and September-December 2020, as well as January-June 2021) to detect and trace changes in pandemic patient profiles in order to improve our knowledge on treatment and prevention. This paper describes the construction of a well-characterized patient cohort established in the earliest phases of the pandemic and developed to aid in the investigation of risk factors for adverse outcomes of COVID-19. The primary advantages of this patient cohort are its early development and considerable assessment and prevalence of comorbid conditions that will aid in investigating questions of risk and vulnerability to COVID-19-related adverse outcomes. The cohort is large, well-constructed, and offers the opportunity to examine important and urgent questions regarding outcomes of COVID-19. In the future, this cohort would potentially allow for comparisons between patients afflicted by COVID-19 in earlier versus middle and later stages of the pandemic. The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: All authors declare no support from any organization for the submitted work; no financial relationships with any organizations that might have an interest in the submitted work in the previous 3 years; and no other relationships or activities that could appear to have influenced the submitted work. RTH reports consulting fees from Nestlé, and research funding from an IIIR grant with Zealand Pharmaceuticals. The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported in part by Mayo Clinic Department of Medicine and CCaTS COVID-19 Grant Award. In accordance with the Declaration of Helsinki, this study was reviewed and approved (ID 20-002783) by the Mayo Clinic Institutional Review Board (IRB). Mayo Clinic IRB approved informed consent waiver. This study was determined to be EXEMPT under 45 CFR 46.101, item 2 by the Mayo Clinic Institutional Review Board which had ethical oversight for this study. In addition, the authors assert that all procedures contributing to this work comply with the ethical standards of the Mayo Clinic Institutional Review Board guidelines on human experimentation in accordance with the Declaration of Helsinki of 1975, as revised in 2008. Registries for Evaluating Patient Outcomes: a User's Guide. Government Printing Office Engaging patients in information sharing and data collection: the role of patient-powered registries and research networks Agency for Healthcare Research and Quality (US) Role of data registries in the time of COVID-19 American Heart Association COVID-19 CVD registry powered by get with the guidelines Type 1 Diabetes and COVID-19: Preliminary Findings From a Multicenter Surveillance Study in the U.S. Diabetes Care Efficacy of low-level laser therapy for body contouring and spot fat reduction Defining and measuring chronic conditions: imperatives for research, policy, program, and practice Adapting a clinical comorbidity index for use with ICD-9-CM administrative databases Multiple chronic conditions-a strategic framework: optimum health and quality of life for individuals with multiple chronic conditions Rochester epidemiology project data exploration portal Comparative Analysis of Covid-19 Risk Models in a Large Health System Cohort. Personal Communication Prevalence of multimorbidity in a geographically defined American population: patterns by age, sex, and race/ethnicity Multimorbidity in heart failure: a community perspective COVID-19 as the leading cause of death in the United States All-cause excess mortality and COVID-19-related mortality among US adults aged 25-44 years Older age and comorbidity are independent mortality predictors in a large cohort of 1305 COVID-19 patients in Michigan, United States The first 100 cases of COVID-19 in a hospital in Madrid with a 2-month follow-up Postdischarge symptoms and rehabilitation needs in survivors of COVID-19 infection: a cross-sectional evaluation Loren Toussaint https://orcid.org/0000-0001-8876-1848 Ivana T. Croghan https://orcid.org/0000-0003-3464-3525 All data supporting the study findings are contained within this manuscript