key: cord-0554346-x5wzzt1w authors: Swartzbaugh, Richard; Khanzada, Amil; Govindan, Praveen; Pilanci, Mert; Owoyemi, Ayomide; Atlas, Les; Estrada, Hugo; Nall, Richard; Lotito, Michael; Falcone, Rich; JenniferRanjani, J title: Using a Novel COVID-19 Calculator to Measure Positive U.S. Socio-Economic Impact of a COVID-19 Pre-Screening Solution (AI/ML) date: 2022-01-21 journal: nan DOI: nan sha: 8ac7c5434c4d931cd5cced462e6460a4762fb4e8 doc_id: 554346 cord_uid: x5wzzt1w The COVID-19 pandemic has been a scourge upon humanity, claiming the lives of more than 5.1 million people worldwide; the global economy contracted by 3.5% in 2020. This paper presents a COVID-19 calculator, synthesizing existing published calculators and data points, to measure the positive U.S. socio-economic impact of a COVID-19 AI/ML pre-screening solution (algorithm&application). In recent history, the singular event that most profoundly impacted the entire globe has been the COVID-19 pandemic. To date, COVID-19 has claimed the lives of over 800,000 individuals in the United States and 5 million worldwide [1] . The highly infectious nature of COVID-19 has filled up hospital beds in record numbers, surpassing hospital capacity and causing immense strain on healthcare systems worldwide [2] . While vaccination efforts globally are underway, distribution efforts have been impeded in low-and middle-income countries. Additionally, the emergence of new viral variants like the Omicron variant has decreased the effectiveness of current vaccines in the prevention of COVID-19 spread [3] . The US economy has shown significant recovery so far; however, it has not recovered back to pre-pandemic levels, the certainty of improvement in this recovery is affected by these newly emerging COVID-19 variants [9] . A recent surge in COVID, partially due to emerging variants, also affected job creation as fewer jobs than expected were added in December 2021 [10] . Considering these socio-economic impacts of disease outbreaks like COVID-19, it is important to assess and predict socioeconomic impacts and effects of mitigation and management approaches to help reduce outbreaks. Preliminary data and analysis supports positive U.S. socio-economic impact of a COVID-19 pre-screening solution utilizing Artificial Intelligence (AI). Sound has been used as one of the health indicators by clinicians and researchers, which often require skilled clinicians to interpret. Studies reveal that respiratory syndromes such as pneumonia, pulmonary diseases, and asthma [11] [12] [13] can be diagnosed effectively using cough samples. Recently researchers have explored the possibility of using sound instead of imaging techniques like MRI, sonography, as they offer a cheap alternative. In [14] , a crowd-sourced dataset comprising sounds from 7000 users was collected using both Android and web applications. Out of the 7000, only 200 users were tested positive for COVID-19. They aimed to use simple machine learning models like support vector machines (SVM) to discriminate the cough from users with COVID-19 from healthy users and users with asthma. In [15] , a non-invasive, real-time framework adapted from the brain model by the MIT Center for Brain Minds and Machine designed for diagnosing Alzheimer's is used to pre-screen cough to identify COVID-19 by leveraging orthogonal audio biomarkers. In this study, 96 cough samples from users with bronchitis and 136 with pertussis are collected in addition to 2660 COVID positive and 2660 healthy cough samples. Recurrent Neural networks are gaining popularity in speech recognition and audio analysis. Studies using the long short-term memory (LSTM) network models reveal that comparing cough and breath, voice is inefficient [16] . 240 recordings from healthy individuals and 60 samples from COVID-19 patients collected from several hospitals in UAE are used in this study. Mouawad et al. exploited the non-linear phenomenon in voice and speech signals to capture the repetition pattern using time-series recurrence plots [17] . They have used the data collected in partnership with Carnegie Mellon University and Voca.ai to train the machine learning models. The DiCOVA challenge was initiated to accelerate researchers in COVID-19 diagnosis, and it uses respiratory acoustics [18] . Two datasets derived from the crowd-sourced Coswara data [19] are used in the Di-COVA challenge. The first dataset focuses only on cough sounds, and the second dataset contains a collection of audio recordings, including counting numbers, phonation of sustained vowels, and breath. AI models like logistic regression (LR), multi-layer perceptron (MLP), and random forest (RF) are analyzed on the cough, breath, vowel sounds, and speech samples. A group of researchers from the University of Cambridge combined the 11 most common symptoms with 384 features obtained from 16 frame-level descriptors using linear kernel SVM [20] . Pahar et al. from Stellenbosch University, South Africa, evaluated the popular machine learning approaches like SVM, k-nearest neighbor (kNN), LR, MLP, ResNet50, and LSTM, on the Coswara dataset and the SARS COVID-19 South Africa (Sarcos) dataset [21] collected from all the six continents. Erdogan et al. [22] used 595 positive and 592 negative COVID-19 cough samples from the Virufy dataset [23] . They concluded that the performance of features based on deep learning is inferior, compared to traditional methods, due to limited data availability. In [24] , experiments are conducted to identify the most informative acoustic features as a baseline instead of the handcrafted features or Mel frequency Cepstral Coefficients. It is also demonstrated that robust features can be learned using the wavelet scattering transform amidst the noise in the data. 92.38%, among the 1103 participants, were declared negative, with the remaining 7.62% declared COVID-19 positive. In [25] , four classes of cough samples are used in this study, they are users tested COVID-19 (346), COVID-19 negative (346), healthy users with cough (101), and users without COVID but has pertussis cough (20) . A collection of 813 samples from the University of Lleida, University of Cambridge, Virufy, and Pertussis datasets, are used. Compared to the latest machine learning methods, random forest performed better on the timefrequency features. From this study, we can conclude that researchers worldwide are actively working towards a cost-effective AI/ML solution to pre-screen the infection from the cough samples collected from the user's smartphone. However, the efficiency of these ML solutions is highly dependent on the balanced nature of the datasets and often in popular datasets healthy samples often outnumber the COVID-19 positive samples. In this paper, we have utilized the costs associated with Hospital Acquired Infections (HAI) and the savings realized by preventing spread as a relevant comparable, as COVID-19 is also an Airborne Transmissible Disease. Our novel COVID-19 calculator measures the U.S. socio-economic impact of a COVID-19 AI pre-screening algorithm, and potential savings through early detection, facilitating the return to pre-COVID normalcy. Calculation assumptions are based on a 1% or 0.1% degree of positive change. By using one's own data and assumptions, and this model calculator, each user can customize the calculator based on values, weights, variables, and assumptions the user deems relevant for their local contexts or countries. The rest of the paper is organized as follows: we have discussed the various similar calculators developed for estimating the impact of COVID-19 in section 2, the proposed calculator is presented in section 3 and section 4 concludes the article. We have also summarized the sample calculations from other popular calculators in Appendix A. In considering this Article, a number of existing published articles, research, and calculators were reviewed: Control APIC is the leading professional association for infection preventionists (IPs) with more than 15,000 members. APIC's mission is to advance the science and practice of infection prevention and control. The APIC Cost of Healthcare-Associated Infections Model is designed to demonstrate the costs associated with infections and the savings realized by preventing them. It also provides tables and graphs that describe the financial impact of infections at your healthcare institution. By using your own data, you can customize this report for your respective facility. If you don't have your own data, APIC has provided data from national studies to estimate economic endpoints. Use of your healthcare organization's data will reflect the financial impact of infections to your institution. The TMIT-APIC Healthcare-Associated Infections Cost Calculator was developed in collaboration with the Texas Medical Institute of Technology (TMIT) and APIC. It provides an alternate method to determine the cost of healthcare-associated infections from the APIC cost calculator. The input to the TMIT-APIC calculator includes: hospital size category and the hospitals' infection data. The hospital size category is based on the number of beds, region and teaching status. And the infection data comprise of number of infections/year, excess costs/infection and excess length of stay (LOS)/infection. The calculator generates a report on infections such as surgical-site (SSI), ventilatorassociated Pneumonia (VAP), catheter-associated urinary tract (UTI), central line-associated bloodstram (CLABSI), Methcillin-resistant Staphylococcus aureus (MRSA) and Clostridium difficile (C. diff.). Education Week together with the Learning Policy Institute developed an interactive tool to calculate the percentage of revenue cuts for the 2019 -2020 and 2020 -2021 school year. They also accounted in the increased costs due the required high-speed internet for online education and increased number of days free or reduced price food are provided. The Bureau of Labor Statistics (BLS) uses surveys to gather data on the prices of goods and services purchased across the U.S., weights these prices by how much they contribute to the typical basket of expenditures, and then aggregates to form the consumer price index (CPI). Inflation is then measured as the rate of growth of the CPI over a specific period. Alberto Cavallo, an economics professor at Harvard Business School, constructed expenditure weights for each month of 2020 using data from the Opportunity Insights Economic Tracker at Harvard University and Brown University as compiled by Raj Chetty and other economists [7] . Cavallo's weights are, no doubt, imprecise but seem reasonable. For example, we know that consumer expenditures at restaurants fell sharply in 2020. It therefore seems likely that expenditures on Food at Home (Food Away from Home) likely increased (decreased) in 2020 and hence the true expenditure weights should be higher (lower) than normal for these categories. In fact, these weights are estimated to have increased (decreased) by 3.76 (3.07) percentage points during the month of April 2020, when stay-at-home orders were most severe for much of the country. Figure 1 portrays the 12-month changes on the US COVID inflation for the years January 2019 to October 2021. Having the estimated expenditure weights for 2020, we can then compare the official CPI-based inflation with an unofficial COVID CPI-based inflation that is easily accessible at Cavallo's website. The COVID-19 Prognostic Tool estimates mortality rates in patients with COVID-19 and is adapted from Centers for Disease Control and Prevention (CDC) materials. At present, we are not aware of any validated COVID-19 calculator that aggregates COVID value creation and have thus constructed this working model. The strength of our working model is that it leverages a number of existing models, and existing data points, and allows the user to customize the calculator based on values, weights, variables, and assumptions they deem relevant. Our novel model COVID-19 calculator measures U.S. socio-economic impact of COVID-19 AI/ML prescreening algorithm, and potential savings through early detection, facilitating the return to pre-COVID normalcy. Users may utilize criteria & assumptions they are comfortable with. For purposes of this illustration, assumptions are based on 1% or 0.1% degree of positive change. By using your own data and assumptions, and this model calculator, each user can customize the calculator based on values, weights, variables, and assumptions the user deems relevant. The following factors were considered in designing the calculator: In table 1 , we have summarized the steps involved for a sample calculation using the proposed calculator. The users can replace the sample calculations with actual data to determine the actual impact. We have provided the link to the calculator with sample calculations for better understanding. The COVIDCalc can be downloaded from: https: //github.com/virufy/covid_calc The pace at which each country recovers from COVID shock varies from one another due to several factors such as increased domestic spending power, decreased consumption of contact-intensive services, labor shortages, etc. The surge due to variants from time to time adds up to other factors causing inflation. Hence, it is important to evaluate or predict the socio-economic impact of disease outbreaks like COVID-19. Several organizations and researchers have designed calculators to determine the cost incurred with the diseases and the associated savings by preventing them. Mostly, these calculators are industry-specific like the APIC cost calculator determines the impact on healthcare institutions and the Edweek calculator on the schools. Recently, few AI/ML researchers have started exploring the usage of coughs collected from smartphones to pre-screen associated diseases like COVID-19, Asthma, Pertussis, etc. In this paper, we have designed a COVID-19 calculator that measures the socio-economic impact involved in the development of an AI/ML prescreening solution and the associated savings through early detection. We have considered 1% or 0.1% degree of positive change in the calculations. We have enabled the users of this calculator to use their own data and assumptions to customize by assigning values, variables, weights, and assumptions the user deems fit. Subject to general limitations, some copyrighted material(s), such as the included figures, are deemed fair use(s) based on this paper's intentions to stimulate creativity for the enrichment of the general public, comment, criticism, research, news reporting, teaching, and scholarship. Cited work(s) are the property of the author(s) / copyright holders. -(b) additional future potential $ through a public / private partnership for the U.S. market [e.g., $ TBD] or other value from the user TBD -(c) Additional criteria 1 from the user TBD -(d) Additional criteria 2 from the user TBD + (e) Reduction in COVID-related healthcare expenses for the U.S. market [e.g., assume a solution can contribute to a .1% reduction at COVID-related healthcare expenses -$50B loss per month, assume 12 months = $600B, which means $600M; assume conversion of loss = ten cents per dollar = $60,000,000.00] or other value from the user $ 60,000,000.00 Appendix A.4. COVID-19 Prognostic Tool CDC's interim clinical guidance for managements of confirmed COVID-19 patients' data was used by this tool created by QxMD. It demonstrates that age is one of the strong risk factors in addition to pre-existing conditions like cancer, diabetes, hypertension, cardiovascular and respiratory diseases. However, researchers from the Johns Hopkins School of Medicine states that the case fatality rate is likely overestimated and so is the inflation based on mortality estimates [26] . WHO Coronavirus (COVID-19) Dashboard Healthcare System Stress Due to COVID-19: Evading an Evolving Crisis COVID-19: Do vaccines work against omicronand other questions answered Inflation with COVID Consumption Baskets A Most Unusual Recovery: How the US rebound from COVID differs from rest of G7 State Employment and Unemployment Summary Acoustic Analysis of Cough Diagnosis of Pneumonia from Sounds Collected Using Low Cost Cell Phones Use of Cough Sounds for Diagnosis and Screening of Pulmonary Disease Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data COVID-19 Artificial Intelligence Diagnosis Using Only Cough Recordings COVID-19 Detection System Using Recurrent Neural Networks Robust Detection of COVID-19 in Cough Sounds: Using Recurrence Dynamics and Variable Markov Model DiCOVA Challenge: Dataset, Task, and Baseline System for COVID-19 Diagnosis Using Acoustics Coswara -A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis Exploring Automatic COVID-19 Diagnosis via Voice and Symptoms from Crowdsourced Data COVID-19 Cough Classification Using Machine Learning and Global Smartphone Recordings COVID-19 Detection with Traditional and Deep Features on Cough Acoustic Signals Detection of COVID-19 from Voice, Cough and Breathing Patterns: Dataset and Preliminary Results Automated Detection of COVID-19 Cough COVID-19 fatality is Likely Overestimated We are very grateful to: Appendix A. 1 The APIC calculator uses the following input to determine the potential savings: HAI incidence per 1000 patient days, mean attributable costs of HAIs, estimated cost to facility per 1000 patient days and potential decrease in HAI cases. In fig. A.2 , the estimated cost for a medium size hospital in the Northeast region with non-teaching status is portrayed. The total annual medical and surgical admissions are assumed as 20,000 and 9000 respectively. The annual patient volume with ventilator, urinary catheter and central line are assumed as 1000, 8000 and 1000 respectively. Since the actual number of infections, excess cost per HAI and excess LOS per HAI are unknown, we have left it blank and are set as 0. Fig. A.3 -A.4 , show total increased cost and revenue loss calculated by the Education week calculator using the input like state, percent of students without highspeed internet at home, increase in the number of days to provide free or reduced price food, increase number of school days added to 2020 school year, percent of student that will be impacted, percentage of revenue cuts in 2019 -2020 and 2020 -2021 school year.