key: cord-0807348-hlbr0wh8 authors: Schultz, Bruce; Zaliani, Andrea; Ebeling, Christian; Reinshagen, Jeanette; Bojkova, Denisa; Lage-Rupprecht, Vanessa; Karki, Reagon; Lukassen, Sören; Gadiya, Yojana; Ravindra, Neal G.; Das, Sayoni; Baksi, Shounak; Domingo-Fernández, Daniel; Lentzen, Manuel; Strivens, Mark; Raschka, Tamara; Cinatl, Jindrich; DeLong, Lauren Nicole; Gribbon, Phil; Geisslinger, Gerd; Ciesek, Sandra; van Dijk, David; Gardner, Steve; Kodamullil, Alpha Tom; Fröhlich, Holger; Peitsch, Manuel; Jacobs, Marc; Hoeng, Julia; Eils, Roland; Claussen, Carsten; Hofmann-Apitius, Martin title: The COVID-19 PHARMACOME: Rational Selection of Drug Repurposing Candidates from Multimodal Knowledge Harmonization date: 2020-09-23 journal: bioRxiv DOI: 10.1101/2020.09.23.308239 sha: 266b4d3e5774fbbe8ef09a30ac0951e3ec92f40c doc_id: 807348 cord_uid: hlbr0wh8 The SARS-CoV-2 pandemic has challenged researchers at a global scale. The scientific community’s massive response has resulted in a flood of experiments, analyses, hypotheses, and publications, especially in the field drug repurposing. However, many of the proposed therapeutic compounds obtained from SARS-CoV-2 specific assays are not in agreement and thus demonstrate the need for a singular source of COVID-19 related information from which a rational selection of drug repurposing candidates can be made. In this paper, we present the COVID-19 PHARMACOME, a comprehensive drug-target-mechanism graph generated from a compilation of several disease maps and experimental data focused on SARS-CoV-2 / COVID-19 pathophysiology. By applying a systematic approach, we were able to predict the effect of drug pairs on SARS-CoV-2 infection. Experimental validation of our results demonstrate that our graph can be used to not only explore the involved mechanistic pathways, but also to identify novel combinations of drug repurposing candidates. COVID-19 is the term coined for the pandemic caused by SARS-CoV-2. Unprecedented in the history of science, this pandemic has elicited a worldwide, collaborative response from the scientific community. In addition to the strong focus on the epidemiology of the virus 1 2 3 , experiments aimed at understanding mechanisms underlying the pathophysiology of the virus have led to new insights in a comparably short amount of time 4 5 6 . In the field of computational biology, several initiatives have started generating disease maps that represent the current knowledge pertaining to COVID-19 mechanisms 7 8 9 10 . Such disease maps have proven valuable before in diverse areas of research such as cancer 11 , neurodegenerative diseases 12 13 and infectious diseases 14 . When taken together with related work including cause-and-effect modeling 8 , entity relationship graphs 15 , and pathways 16 ; these disease maps represent a considerable amount of highly curated "knowledge graphs" which focus primarily on COVID-19 biology. As of July 2020, a collection consisting of 10 models representing core knowledge about the pathophysiology of SARS-CoV-2 and its primary target, the lung epithelium, was shared with the public. With the rapidly increasing generation of data (e.g. transcriptome 17 , interactome 18 , and proteome 19 data), we are now in the position to challenge and validate these COVID-19 pathophysiology knowledge graphs with experimental data. This is of particular interest as validation of these knowledge graphs bears the potential to identify those disease mechanisms highly relevant for targeting in drug repurposing approaches. The concept of drug repurposing (the secondary use of already developed drugs for therapeutic uses other than those they were designed for) is not new. The major advantage of drug repurposing over conventional drug development is the massive decrease in time required for development as important steps in the drug discovery workflow have already been successfully passed for these compounds 20 21 . Our group and many others have already begun performing assays to screen for experimental compounds and approved drugs to serve as new therapeutics for COVID-19. Dedicated drug repurposing collections, such as the Broad Institute library 22 , and the even more comprehensive ReFRAME library 23 , were used to experimentally screen for either viral proteins as targets for functional inhibition 24 , or for virally infected cells in phenotypic assays 25 . In our own work, compounds were assessed for their inhibition of virus-induced cytotoxicity using the human cell line Caco-2 and a SARS-CoV-2 isolate 26 . A total of 64 compounds with IC50 < 20 µM were identified, from which 90% have not yet been previously reported as being active against SARS-CoV-2. Out of the active compounds, 37 are FDAapproved drugs, 19 are in phases 1-3 and 10 are preclinical candidate molecules. The described mechanisms of action for the inhibitors included kinase signaling, PDE activity modulation, and long chain acyl transferase inhibition (e.g. "azole class antifungals"). The approach presented here integrates experimental and informatics workflows as well as combining proprietary and public data to provide a comprehensive overview on the therapeutic efficacy of candidate compounds, the mechanisms targeted by these candidate compounds, and a rational approach to test the drug-mechanism associations for their potential in combination therapy. Disparate COVID-19 disease maps focus on different aspects of COVID-19 pathophysiology. Based on comparisons of the COVID-19 knowledge graphs, we found that not a single disease map covers all aspects relevant for the understanding of the virus, host interaction and the resulting pathophysiology. Thus, we optimized the representation of essential COVID-19 pathophysiology mechanisms by integrating several public and proprietary COVID-19 knowledge graphs, disease maps, and experimental data (Supplementary Table 1 ) into one unified knowledge graph, the COVID-19 Supergraph. To this end, we converted all knowledge graphs and interactomes into OpenBEL 27 , a modeling language that is both ideally suited to capture and to represent "cause-and-effect" relationships in biomedicine and is fully interoperable with major pathway databases 28 29 . In order to ensure that molecular interactions were correctly normalized, individual pipelines were constructed for each model to convert the raw data to the OpenBEL format. For example, the COVID-19 Disease Map contained 16 separate files, each of which represented a specific biological focus of the virus. Each file was parsed individually and the entities and relationships that did not adhere to the OpenBEL grammar were mapped accordingly. Whilst most of the entities and relationships in the source disease maps could be readily translated into OpenBEL, a small number of triples from different source disease maps required a more in-depth transformation. When classic methods of naming objects in triples failed, the recently generated COVID-19 ontology 30 as well as other available standard ontologies and vocabularies were used to normalize and reference these entities. In addition to combining the listed models, we also performed a dedicated curation of the COVID-19 supergraph in order to annotate the mechanisms pertaining to selected targets and the biology around prioritized repurposing candidates. The resulting BEL graphs were quality controlled and subsequently loaded into a dedicated graph database system underlying the Biomedical Knowledge Miner (BiKMi), which allows for comparison and extension of biomedical knowledge graphs (see http://bikmi.covid19-knowledgespace.de). Once the models were converted to OpenBEL and imported into the database, the resulting nodes from each model were compared (Figure 1 ). Even when separated by data origin type, the COVID-19 knowledge graphs had very little overlap (3 shared nodes between all manually curated models and no shared nodes between all models derived from interaction databases), but by unifying the models, our COVID-19 supergraph improves the coverage of essential virus-and host-physiology mechanisms substantially. Additionally, by enriching the COVID-19 supergraph with drug-target information linked from highly curated drug-target databases (DrugBank, ChEMBL, PubChem), we created an initial version of the COVID-19 PHARMACOME, a comprehensive drug-target-mechanism graph representing COVID-19 pathophysiology mechanisms that includes both drug targets and their ligands ( Figure 2 ). Only drug-target relationships that have been experimentally validated have been included. The entire COVID-19 supergraph was manually inspected and re-curated. The COVID-19 PHARMACOME is openly accessible to the scientific community at http://graphstore.scai.fraunhofer.de. An aggregate of 10 constituent COVID-19 computable models covering a wide spectrum of pathophysiological mechanisms associated with SARS-CoV-2 infection. The supergraph is depicted here in two forms: a) shows the overall graph with 150662 nodes and 573929 relationships; b) represents the distribution of major classes of entities (pathologies, molecular entities such as proteins, drugs) in the COVID-19 supergraph. At the time of the writing of this paper, six phenotypic cellular screening experiments have been shared via archive servers and journal publications (Supplementary Table 2 ). Although only a limited number of these manuscripts have been officially accepted and published, we were able to extract their primary findings from the pre-publication archive servers. A significant number of reports on drug repurposing screenings in the COVID-19 context demonstrate how appealing the concept of drug repurposing is as a quick answer to the challenge of a global pandemic. Drug repurposing screenings were all performed with compounds for which a significant amount of information on safety in humans and primary mechanism of action is available. We generated a list of "hits" from cellular screening experiments while results derived from publications that reported on in-silico screening were ignored. Therefore, we keep a strict focus on well-characterized, well-understood candidate molecules in order to ensure that one of the pivotal advantages of this knowledge base is its use for drug repurposing. The COVID-19 PHARMACOME contains several subgraphs, three of which correspond to major views on the biology of SARS-CoV-2 as well as the clinical impact of COVID-19: -the viral life cycle subgraph focuses on the stages of viral infection, replication, and spreading. -the host response subgraph represents essential mechanisms active in host cells infected by the virus. -the clinical pathophysiology subgraph illustrates major pathophysiological processes of clinical relevance. These subgraphs were annotated by identifying nodes within the supergraph that represent specific biological processes or pathologies associated with each subgraph category and traversing out to their first-degree neighbors. For example, a biological process node representing "viral translation" would be classified as a starting node for the viral life cycle subgraph while a node defined as "defense response to virus" would be categorized as belonging to the host response subgraph. Though the viral life cycle and host response subgraphs contain a wide variety of node types, the pathophysiology subgraph is restricted to pathology nodes associated with either the SARS-CoV-2 virus or the COVID-19 pathology. Two single cell sequencing data sets representing infected and non-infected cells directly derived from human samples 31 We performed phenotypic assays to screen for repurposing drugs that inhibit the replication and the cytopathic effects of virus infection. A derivative of the Broad repurposing library was used to incubate Caco-2 cells before infecting them with an isolate of SARS-CoV-2 (FFM-1 isolate, see 35 ). Survival of cells was assessed using a cell viability assay and measured by highcontent imaging using the Operetta CLS platform (PerkinElmer). Details of the drug repurposing screening are described in the supplemental material. As described in Ellinger et al., 36 Data from six published drug repurposing screenings were downloaded, and extensive mapping and curation was performed in order to harmonize chemical identifiers. The curated list of drug repurposing "hits" together with an annotation of the assay conditions is available under http://chembl.blogspot.com/2020/05/chembl27-sars-cov-2-release.html Initially, we analyzed the overlap between compounds identified in the reported drug repurposing screening experiments. Figure 3A shows no overlap between experiments, which is not surprising, as we are comparing highly specific candidate drug experiments with screenings based on large drug repositioning libraries. However, the overlap is still quite marginal for those screenings where large compound collections (Broad library, ReFRAME library) have been used. In order to identify which proteins are targeted by the repurposing hits, and to investigate the extent to which there are overlaps between repurposing experiments at the target/protein level, we mapped all of the identified compounds from the drug repurposing experiments to their respective targets. As most drugs bind to more than one target, we increase the likelihood of overlaps between the drug repurposing experiments when we compare them at the protein/target space. Indeed, Figure 3B shows an overlap of 112 targets between all the drug repurposing experiments, thereby creating a list of potential proteins for therapeutic intervention when the compound targets are considered rather than the compounds themselves. A non-redundant list of drug repurposing candidate molecules that display activity in phenotypic (cellular) assays was generated and mapped to the COVID-19 PHARMACOME. In the next step, we analyzed which areas of the COVID-19 graph respond to SARS-CoV-2 infection (indicated by significant variance in gene expression) and are targets for repurposing drugs. To this end, we mapped signals from the drug repurposing screenings to the subgraph that showed responsiveness to SARS-CoV-2 infection. In Figure 4C , we demonstrate the resulting subgraph that is characterized by the transcriptional response to SARS-CoV-2 infection and the presence of target proteins of compounds that have been identified in drug repurposing screening experiments. In order to evaluate the potential of the COVID-19 PHARMACOME to guide rational approaches towards combination therapy using repurposing drug candidates, we mapped existing combinatorial therapy data to the COVID-19 PHARMACOME. Combinatorial treatment data obtained from the paper published by Bobrowski et al. 35 were mapped to the COVID-19 supergraph. Analysis of the overlaps between the drug repurposing screening data and the data from this combinatorial treatment publication showed that four of the ten compounds in the synergistic treatment approach by Bobrowski et al. were represented in our initial non-redundant set of candidate repurposing drugs. Table 3 ). Based on the association between repurposing drug candidates and pathophysiology mechanisms in the COVID-19 PHARMACOME, we hypothesized that the number of edges between a pair of drug nodes may be linked to the effectiveness of the drug combination. In order to evaluate whether the determined outcome of a combination of drugs correlated with the distance between said drug nodes, we compared distances for combinations of drugs within the supergraph for which their effect was known. On average, drug combinations that were found to have a synergistic effect in the treatment of SARS-CoV-2 had the shortest path lengths (2.43) while antagonistic combinations were found to be farther apart (4.0). Therefore, we decided to use the path lengths between pairs of drugs as a basis for choosing new combinations of compounds to test. Ultimately, we selected five compounds: Remdesivir (a virus replicase inhibitor), Nelfinavir (a virus protease inhibitor), Raloxifene (a selective estrogen receptor modulator), Thioguanosine (a chemotherapy compound interfering with cell growth), and Anisomycin (a pleiotropic compound with several pharmacological activities, including inhibition of protein synthesis and nucleotide synthesis). These compounds were used in four different combinations (Remdesivir/Thioguanosine, Remdesivir/Raloxifene, Remdesivir/Anisomycin and Nelfinavir/Raloxifene) to test the potency of these drug pairings in phenotypic, cellular assays. Figure 6 shows the results of these combinatorial treatments on the virus-induced cytopathic effect in Caco-2 cells. By combining a significant number of knowledge graphs which represent various aspects of COVID-19 pathophysiology and drug-target information we were able to generate the COVID-19 PHARMACOME, a unique resource that covers a wide spectrum of cause-and-effect knowledge about SARS-CoV-2 and its interactions with the human host. Based on a systematic review of the results derived from published drug repurposing screening experiments, as well as our own drug repurposing screening results, we were able to identify mechanisms targeted by a variety of compounds showing virus inhibition in phenotypic, cellular assays. With the COVID-19 PHARMACOME, we are now able to link repurposing drugs, their targets and the mechanisms modulated by said drugs within one computable model, thereby enabling us to target -in a combinatorial treatment approach -different, independent mechanisms. By challenging the COVID-19 PHARMACOME with gene expression data, we have identified subgraphs that are responsive (at gene expression level) to virus infection. Network analysis along with the overview on previous repurposing experiments provided us with the insights needed to select the optimal repurposing drug candidates for combination therapy. Experimental verification showed that this systematic approach is valid; we were able to identify two drug-target-mechanism combinations that demonstrated synergistic action of the repurposed drugs targeting different mechanisms in combinatorial treatments. We are fully aware of the fact that the COVID-19 PHARMACOME combines experimental results generated in different assay conditions. In the course of our work, we accumulated evidence that assay responses recorded using Vero E6 cells in comparison to Caco-2 cells may only partially overlap. Comparative analysis of the results of both assay systems to virus infection by means of transcriptome-wide gene expression analysis is one of the experiments we plan to perform next. However, for the identification of meaningful combinations of repurposing drugs, the current model-driven information fusion approach was shown to work well despite the putative differences between drug repurposing screening assays. Given the urgent need for treatments that work in an acute infection situation, our approach described here paves the way for systematic and rational approaches towards combination therapy of SARS-CoV-2 infections. We want to encourage all our colleagues to make use of the COVID-19 PHARMACOME, improve it, and add useful information about pharmacological findings (e.g. from candidate repurposing drug combination screenings). In addition to vaccination and antibody therapy, (combination) treatment with small molecules remains one of the key therapeutic options for combatting COVID-19. The COVID-19 PHARMACOME will therefore be continuously improved and expanded to serve integrative approaches in anti-SARS-CoV-2 drug discovery and development. Epidemiological data from the COVID-19 outbreak, real-time case information Defining the epidemiology of Covid-19-studies needed Wrong but Useful-What Covid-19 Epidemiologic Models Can and Cannot Tell Us COVID-19: towards understanding of pathogenesis Single-cell landscape of bronchoalveolar immune cells in patients with COVID-19 The trinity of COVID-19: immunity, inflammation and intervention Disease Map, building a computational repository of SARS-CoV-2 virus-host interaction mechanisms Knowledge Graph: a computable, multi-modal, cause-and-effect knowledge model of COVID-19 pathophysiology Network medicine framework for identifying drug repurposing opportunities for covid-19 COVID-19Base: A knowledgebase to explore biomedical entities related to COVID-19 Atlas of Cancer Signalling Network: a systems biology resource for integrative analysis of cancer data with Google Maps Computable cause-and-effect models of healthy and Alzheimer's disease states and their mechanistic differential analysis Integrating pathways of Parkinson's disease in a molecular interaction map A comprehensive map of the influenza A virus replication cycle COVID-19Base: A knowledgebase to explore biomedical entities related to COVID-19 COVID-19 Disease Map, building a computational repository of SARS-CoV-2 virus-host interaction mechanisms Imbalanced host response to SARS-CoV-2 drives development of COVID-19 A SARS-CoV-2 protein interaction map reveals targets for drug repurposing Proteomics of SARS-CoV-2-infected host cells reveals therapy targets Drug repositioning: identifying and developing new uses for existing drugs Drug repurposing: progress, challenges and recommendations The ReFRAME library as a comprehensive drug repurposing library and its application to the treatment of cryptosporidiosis Recent advances in modeling languages for pathway maps and computable biological networks. Drug discovery today PathMe: Merging and exploring mechanistic pathway knowledge ComPath: an ecosystem for exploring, analyzing, and curating mappings across pathway databases COVID-19 severity correlates with airway epithelium-immune cell interactions identified by single-cell analysis Single-cell longitudinal analysis of SARS-CoV-2 Enrichr: a comprehensive gene set enrichment analysis web server 2016 update Discovering the false discovery rate: False Discovery Rate Evidence of SARS-CoV-2 infection in returning travelers from Identification of inhibitors of SARS-CoV-2 in-vitro cellular toxicity in human (Caco-2) cells using a large scale drug repurposing collection Discovery of Synergistic and Antagonistic Drug Combinations against SARS-CoV-2 In Vitro. BioRxiv HIV protease inhibitor nelfinavir inhibits replication of SARS-associated coronavirus The anti-HIV Drug Nelfinavir Mesylate (Viracept) is a Potent Inhibitor of Cell Fusion Caused by the SARS-CoV-2 Spike (S) Glycoprotein Warranting further Evaluation as an Antiviral against COVID-19 infections