key: cord-203191-7ftg6bfx authors: Guo, Kai; Wang, Zhihan; Gao, Pan; Pu, Qinqin; Wu, Min; Huang, Canhua; Hur, Junguk title: Identification of Repurposal Drugs and Adverse Drug Reactions for Various Courses of Coronavirus Disease 2019 (COVID-19) Based on Single-cell RNA Sequencing Data date: 2020-05-16 journal: nan DOI: nan sha: doc_id: 203191 cord_uid: 7ftg6bfx With more than 3.8 million people infected Coronavirus Disease 2019 (COVID-19), caused by novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), poses a critical threat to human health. There is no proven vaccine or specific drug to date, which highlights the urgent need for rapid development of therapeutics for COVID-19. To identify potentially repurposable drugs, we employed a systematic approach to mine candidates from U.S. FDA approved drugs and pre-clinical small-molecule compounds by integrating the gene expression perturbation data by chemicals from the Library of Integrated Network-Based Cellular Signatures (LINCS) project with publically available single-cell RNA sequencing dataset from mild and severe COVID-19 patients. We identified 281 FDA approved drugs that have the potential to be effective against SARS-CoV-2 infection, 10 of which are currently undergoing clinical trials to evaluate their efficacy against COVID-19. In conclusion, we have identified a list of repurposable anti-SARS- CoV-2 drugs using a systems biology approach. Coronavirus Disease 2019 (COVID-19) is a highly contagious respiratory disease resulting from a life-threatening novel coronavirus, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). It has spread rapidly throughout the globe, causing 3.8 million infections and 260 thousand of deaths by early May 2020 [1, 2] . SARS-CoV-2 is an enveloped RNA virus that belongs to the genus Betacoronavirus of the family Coronaviridae, which includes well-known severe acute respiratory syndrome coronavirus (SARS-CoV) as well as middle east respiratory syndrome coronavirus (MERS-CoV) [3] . The advancement in the management of these coronaviruses and other viruses like influenza virus H1N1 and Ebola infections have provided insight into treating COVID-19. More than 300 active clinical trials for COVID-19 are being performed [4, 5] . Chloroquine [6, 7] and its hydroxyl analogue Hydroxychloroquine [8] , Lopinavir/Ritonavir [9] [10] [11] , and Remdesivir [7, 12] , developed for treating Malaria, human immunodeficiency virus (HIV) and Ebola virus, respectively, have provided some benefits to treat COVID-19 and are being tested in ongoing trials. Although COVID-19 is less fatal than the SARS and MERS, older patients with comorbidities tend to experience more severe symptoms, making them more vulnerable. The majority of SARS-CoV-2 infected patients displayed mild symptoms and generally have a good prognosis, classified as mild COVID-19 [9, 13] . However, a large proportion of patients, especially among older men with underlying chronic diseases, have rapidly progressed to severe COVID-19 and suffered from respiratory distress requiring emergent medical interventions [14] . Unfortunately, there is no evidence from randomized clinical trials supporting vaccines or efficient treatment for COVID-19 [4, 5] . Additionally, recent studies have shown the important roles of host immune responses in protection and the pathogenesis of respiratory viral infections, for instance, SARS-CoV, MERS-CoV, and influenza A viruses [15, 16] . Liao et al. [17] reported that increased immune cell recruitment in COVID-19 patients suggested a crucial role of CD8 + T cells in successful viral control, and proposed support therapeutic strategies that target the myeloid cell compartment to treat COVID-19-associated inflammation. However, little is known about drug screens of disease-relevant cell types. Drug repurposing is an essential and universal strategy in the development of new drugs [18] . It may facilitate the discovery of new mechanisms of action for existing drugs, which is less time-consuming and cost-effective let alone the existing pharmaceutical supply chains for formulation and distribution [19, 20] . Considering that an RNA virus exhibits a considerable degree of sequence variation, drugs targeting host factors may cause less mutational resistance with more effective and broad anti-virus spectrum potential. Hence, there is an urgent need to identify potential therapeutics with new strategies for emerging infectious diseases, and repurposing clinically assessed drugs represents one of the most practicable strategies for the rapid identification of treatments to combat COVID-19. In this study, we analyzed a publicly available single-cell RNA sequencing (scRNA-seq) dataset of bronchoalveolar lavage fluid (BALF) collected from mild and severe cases as well as bulk RNA-Seq of BALF in COVID-19 patients from different experiments ( Figure 1 ). Data mining was performed by using the Library of Integrated Network-Based Cellular Signatures (LINCS) [21] , a drug perturbation database, to identify potential therapies for COVID-19 disease. A total of 281 candidates of different courses of COVID-19 independent of cell subtypes were identified, 10 of which were in clinical trials of COVID-19, including lopinavir/ritonavir [22] , dexamethasone, niclosamide, lenalidomide, hydrocortisone, metformin, atorvastatin, sildenafil, and verapamil. Subsequently, we utilized the side effect prediction based on L1000 (SEP-L1000) project to predict adverse drug reactions (ADRs) and constructed drug-ADR association [23, 24] . Our findings may aid in the rapid preclinical and clinical evaluation of these therapeutics and can provide an important drug discovery pipeline to accelerate and facilitate the development of potential treatments for COVID-19. The gene-barcode matrix files of all 6 donors containing 3 mild cases and 3 severe cases (lung BALF) and 3 healthy control (lung tissues) were downloaded from the NCBI Gene Expression Omnibus database (Accession ID: GSE145926) [17] . All expression matrices were loaded in R statistical analysis platform using Seurat v3 [25] and keeping cells with gene number between 200 and 6000, unique molecular identifier (UMI) count above 1000 and mitochondrial gene percentage below 0.1. A total of 43,914 cells collected from three healthy subjects, three mild COVID-19 patients, and three severe COVID-19 patients, were used for the analyses. We also collected a list of differentially expressed genes (DEGs) in SARS-CoV-2-infected lung BALF using a bulk RNA-Seq analysis to compare against the single-cell-based data. This DEG list was obtained from the Chinese National Genomics Data Center (https://bigd.big.ac.cn/; Accession ID: CRA002390) [26] . The LogNormalize method in Seurat was used for normalizing filtered gene-barcode matrix. Principal component analysis (PCA) was done by using the top 2,000 most variable genes. Then Uniform Manifold Approximation and Projection (UMAP) was performed on the top 50 principal components for visualizing the cells, and graph-based clustering was performed on the PCAreduced data with Seurat v3 [27] . MAST in Seurat v3 was used to perform differential analysis. DEGs were identified by comparing each cluster between all of the three groups. Genes with average log2FC > 0.25 and adjusted p-value < 0.05 were deemed as DEGs. DEGs were first sorted by the log2FC values and then the upregulated and downregulated genes were chosen to identify drugs and compounds against the LINCS database using the Connectivity Map Linked User Environment (CLUE) platform [21] . The drug connectivity score (CS) with a negative value smaller than -90 was used to determine candidate drugs and compounds. COVID-19 database from the International Clinical Trials Registry Platform (ICTRP) (https://www.who.int/ictrp/en/, updated on May 5 th , 2020) was searched for clinical trials information associated with these drugs. Both on-label and off-label adverse drug reactions (ADRs) of the candidate drugs were collected from the SEP-L1000 database (https://maayanlab.net/SEP-L1000/). The SEP-L1000 data include on-label ADRs of FDA-approved drugs collected from SIDER [28] and off-label ADRs from the PharmGKB database [29] based on the post-marketing ADR reports in the FDA Adverse Event Report System (FAERS). Our study highlighted the identification of different therapeutic effects in the varied disease course. With the high variability of the cellular compartments underlying disease progression, our drug repurposing profiles from major cell subtypes included T, B, and NK cells, macrophages, and epithelial cells. A total of 9 scRNA-seq BALF samples, including 3 healthy cases, 3 mild cases, and 3 severe cases, were collected from publicly available scRNA-seq data (Supplemental Table S1 ). After quality filtering, approximately 250,000 gene expression values from 44,000 cells were collected. The clustering analysis identified six major clusters of macrophage, NK Figure S1 ), which was determined based on the unique signature genes CD68 (macrophage cell), IL7R and CD4 (CD4+ T cell), CD8A (CD8+ T cell), MS4A1 (B cells), TPPP3 (epithelial cells), respectively (Supplemental Figure S2 ). We then compared these six major clusters across the healthy, mild, and severe COVID-19 cases and identified differentially expressed genes between any of the two courses (Supplemental Tables S2-S4 ). Connecting to the LINCS database of small-molecule perturbations on gene expression, we identified candidate drugs and compounds that can reverse these upregulated and downregulated genes via the CLUE platform. The closer the CS is to -100, a score indicating a complete reversal, the higher chance identification of drug-adverse effect associations with upregulated or downregulated DEGs, in other words, drugs may show a better response to reverse expression of DEGs upregulated or downregulated in major cell subtypes in the BALF. There were a total of 281 candidates selected out by CLUE with CS lower than -90 based on DEGs among all three comparisons between two courses (Supplementary Table S5 ). To enable prioritization of known drugs for preclinical and clinical evaluation for the therapy efficiency of SARS-CoV-2, a summary of the most among major cell subtypes during the patients' disease course and the publicly disclosed clinical trial phases are annotated in Tables 1-3. Supplementary Table S5 provides the complete list of potential anti-coronavirus agents from the current analysis, focusing on the FDA approved drugs and experimental agents that have been already tested in clinical trials. To select candidates for mild cases, drugs and compounds were ranked according to their CSs (Supplementary Table S6 ), 133 candidate drugs were identified compared to controls (mild vs healthy group), and 53 of them involved in more than one cell subtype ( (Table 1) , including HIV protease inhibitors lopinavir/ritonavir [22] combination (phase 4), glucocorticoid receptor agonist dexamethasone (Phase 3/4) [30] , DNA replication inhibitor niclosamide (Phase 2/3) and antineoplastic lenalidomide (Phase 4). The tubulin inhibitor flubendazole, widely used in treating intestinal parasites, is a potent inducer of autophagy initiation and can decrease infection of dendritic cells with the HIV [31] . Azacytidine could partially reverse the aberrant DNA methylation, a phase I clinical trial in combination with chemotherapy has been conducted to assess its therapeutic effects in children with leukemia, and in combination with APR-246 for myelodysplastic syndrome is in phase 3 clinical trial [32] . The BCL inhibitor ABT-737 exhibits potential pro-apoptotic and antineoplastic activities [33, 34] . Lopinavir is widely used for the treatment of HIV, formulated in combination with ritonavir that can increase the half-life of lopinavir [9] [10] [11] . Repurposing analysis in severe COVID-19 patients 60 potent drugs were also selected in severe cases compared to controls (severe vs healthy group) according to their average CS between the replicates, and 25 of them involved in more than one cell subtype ( Figure 2B , Supplementary Tables S8 & S9) . As listed in Table 2 Protein synthesis inhibitor brefeldin-a has used inhibit entry of some viruses, like human papillomavirus and polyomavirus [35] , and egress of others, such as herpesviruses and paramyxoviruses [36] . Indirubin, an active ingredient of traditional Chinese medicine (TCM) "Danggui Longui Wan", has potent activity against myelocytic leukemia [37] and therapeutic potential on IAV-infection [38] . A total of 111 candidate drugs were identified in severe cases compared to mild ones (severe vs mild group), 39 of them involved in more than one cell subtype ( Figure 2C , Supplementary Tables S10 & S11). As listed in Table 3 , nine drugs (those for which drugs selected out in three separate cell types or more), including fostamatinib (SYK inhibitor), VER-155008 (HSP inhibitor), KU-0063794 (MTOR inhibitor), PIK-90 (PI3K inhibitor), linsitinib (IGF-1 inhibitor), TAK-715 (p38 MAPK inhibitor), Y-27632 (Rho-associated kinase inhibitor), AZ-628 (RAF inhibitor) and lestaurtinib (FLT3 inhibitor). In this group, except lopinavir, we also following listed 4 drugs in clinical trials for the treatment of COVID-19 in Table 3 SYK inhibitor fostamatinib produced clinically-meaningful responses for adult persistent and chronic immune thrombocytopenia in two parallel, phase 3 randomized trials [39] . HSP inhibitor VER-155008 regulates Kaposi's sarcoma-associated herpesvirus lytic replication and highlights the potential to be a novel antiviral agent [40] . FLT3 inhibitor lestaurtinib obtained orphan drug approval from the FDA for acute myeloid leukemia [41] and in a phase II trial of advanced multiple myeloma and phase I trials of prostate cancer. As shown in Figures 3A & 3B and Supplemental Table S5 , lopinavir was the only one identified in all three comparisons, and interestingly, ritonavir was common in two analyses. There were 23 additional common drugs, such as SB-216763, ABT-737, JTE-907, brefeldin-a, PKCbetainhibitor, indirubin, GW-441756, flubendazole, tyrphostin-AG-1478, memantine, calyculin, kinetin-riboside, ascorbyl-palmitate, ON-01910, mirin, verrucarin-a, emetine, TPCA-1, RHOkinase-inhibitor-III[rockout], PD-158780 and NVP-AUY922. For example, the glycogen synthase kinase inhibitor SB-216763 acts as neuroprotectant [42] and prevents cardiac ischemia [43] . JTE-907 is a cannabinoid receptor inverse agonist producing anti-inflammatory effects [44] . To further demonstrate the usefulness of this strategy, we have accomplished the identification of therapeutic drugs by transcriptional changes in BALF of COVID-19 patients with a bulk RNA-seq data [26] . Ten efficient candidates were identified using the same analysis pipeline, two of which, including glycogen synthase kinase inhibitor SB-216763 and PPAR receptor antagonist GW-6471, were also included in the single-cell-based candidate lists (Supplemental Table S12 ). Further investigations are necessary to characterize the ADRs which are a central consideration during drug development [45] . Therefore, we conducted a computational approach using the SEP-L1000 database to predictive relationships between drugs and the emergence of ADRs (Supplemental Tables S13 S14). Figure 4 shows a heatmap of the top 50 drug-ADR association for On-label ( Figure 4A ) and Off-label ( Figure 4B ) ADRs. These findings highlighted drug-ADR associations and may lead to inform clinical decisions regarding treatments for COVID-19. COVID-19 has spread rapidly, and no proven vaccine or drug has yet been identified to treat it. Generally speaking, there are several ways to control or prevent emerging coronavirus disease, including antivirals, small-molecule drugs, biologics, and vaccines [4, 5] . Due to the lack of effective therapeutic agents and long development cycles of vaccines, it is, therefore, reasonable to consider repurposing existing drugs and compounds for COVID-19. Drug repurposing is a potentially important strategy for the discovery of existing medicines to tackle COVID-19 [18] . Gordon et al. [46] identified 332 high-confidence SARS-CoV-2human protein-protein interactions for drug repurposing. An additional study [47] tested the antiviral activity of 20 FDA approved drugs against SARS-CoV-2 that previously shown to inhibit SARS-CoV and MERS-CoV. Another research team [48] conducted a high-throughput analysis of the ReFRAME library to identify 30 candidates existing drugs that prevent the COVID-19 virus from replicating in mammalian cells. In a study [49] based on public data of patients with pulmonary fibrosis and the database LINCS, several drugs were identified on COVD-19 targets ACE2. Host immune responses are particularly important in the protection and pathogenesis of the respiratory viral infections like SARS-CoV, MERS-CoV, and influenza A viruses [15, 16] . Liao et al. [17] observed that T and NK cells accumulated, epithelial cells decreased in COVID-19 patients compared to controls, meanwhile, macrophages dysregulated and the cell compartments differed in mild and severe disease courses, more T and NK cells decreased in severe cases but CD8 + T cells increased in mild cases. However, drug screening of disease-relevant cell types is still unclear. Here, we analyzed publicly available data of COVID-19 patients and performed data mining by using the LINCS L1000 database to identify potential therapies for COVID-19 disease and SEP-L1000 database to predict side effects. Our approach is different from previous methods for drug repurposing for coronavirus, since it does not merely rapidly identify likely effective therapeutic agents in preventing or treating COVID-19, but tries to filter out specific medications during the patients' disease courses. Furthermore, the data and transcriptome are derived from human samples from real patients, two independent sets of experiments. Lastly, we explore the underlying risk factors associated with some side effects of the candidates. Overall, our data will guide the future development of therapies for the different durations of COVID-19 and other viral respiratory infections. This study has several limitations to note. First, the public scRNA-Seq data had a small number of clinical samples (n=9) without available patient information, which makes comparisons between studies difficult. With the fast pace and expected large number of published literature using other patient samples, our candidate lists may need to be revised. Second, our findings may not apply to children because all the sequencing data were from adults. Future work on large-scale data mining would help us in better identification of antiviral drugs. The pandemic of COVID-19 represents the greatest global public health crisis in this generation. So far, no proven vaccines and therapies have been identified. Based on the study, we thoroughly investigated potential candidates for the treatment in COVID-19 progression and predicted some possible adverse effects. The findings can guide additional repurposing studies, tailored for different stages of disease progression. The authors declare that they have no competing interests. JH, CH, MW and KG designed the project, collected data, performed analysis, and prepared figures. KG and ZW collected data, prepared figures, and wrote the manuscript. PG performed analysis and prepared tables. QP revised the manuscript. The work was partially supported by the National Institutes of Health (P20GM113123) Pilot grant to J.H. All the codes and data are available at https://github.com/guokai8/COVID19/ Table 1 . A list of potential drugs for treating COVID-19 based on LINCS database and DEGs between mild and healthy samples in B, CD4 + T, CD8 + T, Epithelial, NK cells, and Macrophage. Connectivity scores were calculated from the CLUE platform. Asterisk (*) represents the clinical trial for its efficacy in COVID-19 disease. (+) indicates drugs meeting the SC < -90 criteria, while (-) indicates drugs not meeting the criterion. Table 3 . A list of potential drugs for treating COVID-19 based on LINCS database and DEGs between severe and mild samples in B, CD4 + T, CD8 + T, Epithelial, NK cells, and Macrophage. Connectivity scores were calculated from the CLUE platform. Asterisk (*) represents the clinical trial for its efficacy in COVID-19 disease. (+) indicates drugs meeting the SC < -90 criteria, while (-) indicates drugs not meeting the criterion. Input publicly available scRNA-seq data and transcriptomic data of BALF in COVID-19 patients against the LINCS database by using the CLUE platform. Candidates are selected which can World Health Organization. Coronavirus disease (COVID-19) outbreak A Novel Coronavirus from Patients with Pneumonia in China Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding Pharmacologic Treatments for Coronavirus Disease 2019 (COVID-19): A Review Therapeutic options for the 2019 novel coronavirus (2019-nCoV) Breakthrough: Chloroquine phosphate has shown apparent efficacy in treatment of COVID-19 associated pneumonia in clinical studies Remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus (2019-nCoV) in vitro In Vitro Antiviral Activity and Projection of Optimized Dosing Design of Hydroxychloroquine for the Treatment of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) A rapid advice guideline for the diagnosis and treatment of 2019 novel coronavirus (2019-nCoV) infected pneumonia (standard version) Role of lopinavir/ritonavir in the treatment of SARS: initial virological and clinical findings Breadth of concomitant immune responses prior to patient recovery: a case report of non-severe COVID-19 Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72314 Cases From the Chinese Center for Disease Control and Prevention Immune responses in influenza A virus and human coronavirus infections: an ongoing battle between the virus and host Immunopathogenesis of coronavirus infections: implications for SARS The landscape of lung bronchoalveolar immune cells in COVID-19 revealed by single-cell RNA sequencing Renovation as innovation: is repurposing the future of drug discovery research? Drug Repurposing for Viral Infectious Diseases: How Far Are We? Drug repurposing: progress, challenges and recommendations A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles An exploratory randomized controlled study on the efficacy and safety of lopinavir/ritonavir or arbidol treating adult patients hospitalized with mild/moderate COVID-19 (ELACOI) Drug-induced adverse events prediction with the LINCS L1000 data Network-Based Assessment of Adverse Drug Reaction Risk in Polypharmacy Using High-Throughput Screening Data Comprehensive Integration of Single-Cell Data Transcriptomic characteristics of bronchoalveolar lavage fluid and peripheral blood mononuclear cells in COVID-19 patients Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression A side effect resource to capture phenotypic effects of drugs Data-driven prediction of drug effects and interactions Dexamethasone treatment for the acute respiratory distress syndrome: a multicentre, randomised controlled trial Pharmaceutical screen identifies novel target processes for activation of autophagy with a broad translational potential A phase 1 study of azacitidine combined with chemotherapy in childhood leukemia: a report from the TACL consortium An inhibitor of Bcl-2 family proteins induces regression of solid tumours The BH3 mimetic ABT-737 targets selective Bcl-2 proteins and efficiently induces apoptosis via Bak/Bax if Mcl-1 is neutralized Human papillomavirus type 16 infection of human keratinocytes requires clathrin and caveolin-1 and is brefeldin a sensitive Antiviral activity of brefeldin A and verrucarin A Molecular mechanisms of indirubin and its derivatives: novel anticancer molecules with their origin in traditional Chinese phytomedicine Anti-inflammatory effects of indirubin derivatives on influenza A virus-infected human pulmonary microvascular endothelial cells Fostamatinib for the treatment of adult persistent and chronic immune thrombocytopenia: Results of two phase 3, randomized, placebo-controlled trials Hsp70 Isoforms Are Essential for the Formation of Kaposi's Sarcoma-Associated Herpesvirus Replication and Transcription Compartments Results from a randomized trial of salvage chemotherapy followed by lestaurtinib for patients with FLT3 mutant AML in first relapse Selective smallmolecule inhibitors of glycogen synthase kinase-3 activity protect primary neurones from death Delayed cardioprotection afforded by the glycogen synthase kinase 3 inhibitor SB-216763 occurs via a KATP-and MPTP-dependent mechanism at reperfusion The cannabinoid CB2 receptor inverse agonist JTE-907 suppresses spontaneous itch-associated responses of NC mice, a model of atopic dermatitis Clinical and economic burden of adverse drug reactions Broad anticoronaviral activity of FDA approved drugs against SARS-CoV-2 in vitro and SARS-CoV in vivo Repurposing Didanosine as a Potential Treatment for COVID-19 Using Single-Cell RNA Sequencing Data