key: cord-0921086-esw7w4id authors: Nag, Anish; Banerjee, Ritesh; Chowdhury, Rajshree Roy; Krishnapura Venkatesh, Chandana title: Phytochemicals as potential drug candidates for targeting SARS CoV 2 proteins, an in silico study date: 2021-04-05 journal: Virusdisease DOI: 10.1007/s13337-021-00654-x sha: 994cb16e059657afb3a7c3d8cd049c849174e5b9 doc_id: 921086 cord_uid: esw7w4id Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) is a member of the family Coronaviridae, and the world is currently witnessing a global pandemic outbreak of this viral disease called COVID-19. With no specific treatment regime, this disease is now a serious threat to humanity and claiming several lives daily. In this work, we selected 24 phytochemicals for an in silico docking study as candidate drugs, targeting four essential proteins of SARS-CoV-2 namely Spike glycoprotein (PDB id 5WRG), Nsp9 RNA binding protein (PDB id 6W4B), Main Protease (PDB id 6Y84), and RNA dependent RNA Polymerase (PDB id 6M71). After statistical validation, the results indicated that a total of 11 phytochemicals divided into two clusters might be used as potential drug candidates against SARS-CoV-2. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s13337-021-00654-x. Coronaviruses are single-stranded RNA viruses belonging to the family Coronaviridae and were known for causing mild respiratory infections in birds and mammals. These viruses were considered as minor pathogens for human until the emergence of two infamous zoonotic members of this family, the Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) [20] and the Middle East Respiratory Syndrome Coronavirus (MERS-CoV) [51] . In the last two decades, it caused severe and life-threatening respiratory infections in humans across the globe. However, the world is recently witnessing the new and deadlier outbreak of acute pneumonia disease called as 'Coronavirus diseases 2019' (COVID-19) caused by the same viral family member Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) [45, 53] . In late 2019, the novel coronavirus SARS-CoV-2 was first reported and identified in Wuhan of Hubei province of China and spread rapidly afterwards throughout the globe, causing severe to fatal respiratory illnesses [3] . SARS-CoV-2 is a highly transmittable pathogenic virus with an estimated reproductive number (Ro) of 2.2. World Health Organization (WHO) declared COVID-19 as a public health emergency of international concern with 23,752,965 confirmed infections and 815,038 death reports worldwide so far (26th August 2020) [47] . Some common symptoms of COVID-19 are dry cough, high fever, shortness of breath, muscle aches, fatigue, etc. that may arise within 3 to 14 days after pathogen exposure. In some severe cases, it may also cause Acute Respiratory Distress Syndrome (ARDS) leading to septic shock, and multi-organ failure due to fluid builds up within and around the lungs, drastically reduced blood pressure and oxygen starvation [29] . SARS-CoV-2 is a Beta Coronavirus containing a lipid membrane with envelope protein, hemagglutinin esterase dimer protein, membrane glycoprotein, spike protein, and positive-sense single-stranded RNA (* 30 kb) with nucleocapsid protein [2] . CoVs invade the pulmonary epithelial cells of the lower respiratory system of the patient, deliver their nucleocapsid in the cell, and replicate in the cytoplasm by hijacking the cellular machinery [37] . Homotrimer transmembrane spike (S) glycoproteins of SARS-CoV-2 promote host attachment with the help of its S1 subunit and help the virus to enter into the host cell by virus-cell membrane fusion through S2 subunit [42, 45] . Cellular proteases cleave the S protein at the S1/S2 and S2' sites to allow the entry of the viral particles, followed by fusion of viral capsid with the cellular membrane [13] . After the entry, the virion releases the viral RNA inside the cell and translates several polyproteins using the machinery of the host cell that are cleaved subsequently into 27 viral proteins by internally encoded proteases. Further processing leads to the production of several non-structural proteins (Nsps) and structural proteins that play an essential role in the synthesis of viral RNA and assembly of the virions, respectively [22, 25] . On the other hand, the virion uses RNA dependent RNA polymerase (RdRp) to replicate its daughter RNA genome [14] . These viral proteins can be the primary targets of effective drugs to suppress viral entry and replication. Currently, there is no vaccine or specific drugs available for COVID-19 except for symptomatic supportive therapy. The treatment of the infected patients is limited to isolation and application of some broad-spectrum antiviral drugs [49] . Some antiviral medications like Remdesivir, Ganciclovir, Lopinavir, and Ritonavir are being tested clinically against COVID-19. Recently, antimalarial drug hydroxychloroquine and chloroquine had been used to treat COVID-19 infected individuals [6, 23] . Among these antiviral drugs, Remdesivir specifically was found to be effective against SARS-CoV as revealed by experimental and computational biology based evidence. Experimental studies indicated that the principal mechanism of this drug to block the viral RNA transcription. CoV is susceptible to the Remdesivir, targeting RNA dependent RNA polymerase and Non Structural proteins (NSps) [1, 19, 38, 46, 50] . Furthermore, Hall et al. [11] showed that Remdesivir, along with other drugs, could inhibit the main protease of SARS CoV in an in silico study. Considering all these studies, Remdesivir was selected as a control drug in this study. However, the search for an effective and specific cure for SARS-CoV-2 is still on. Plant-derived natural products and metabolites have been used as traditional medicines to treat different diseases around the world for ages. These plant metabolites comprise several functional bioactive compounds that gained massive interest in the pharmacological and clinical industries to prevent and cure several diseases and disorders. Common phytochemicals like flavonoids, terpenoids, phenols, xanthophylls, carotenoids, and essential oils are used as potent sources of immunomodulatory, antitumor, antimicrobial, and antioxidative drugs for the treatment of several diseases [8, 15] . Several researchers demonstrated and strongly suggested the antiviral activity of several phytochemicals using various biological systems [24, 28] . The success of the quest for an appropriate antiviral drug entirely depends on the comprehensive pharmacodynamic screening and identification of potential broad-spectrum antiviral Phyto-compounds keeping in mind the bioavailability and stability. The interaction study between target proteins and drug compounds by experimental approaches are time-consuming and costly. The application of the latest biomedical tools and in-silico techniques are inexpensive techniques that help to find the efficacy of phytochemicals as the source of drugs within a short period thereby drastically reducing the time and cost of research and drug development. Hence for effective drug development against COVID-19, preliminary bioinformatics analysis of SARS-CoV-2 proteins and exploration of potential bioactive phytochemicals by in silico prediction of their interaction with the target proteins are of high importance for the best and appropriate use of limited resources. In this study, we performed a computational analysis to identify potent phytochemical compounds against different SARS-CoV-2 proteins. The drug-like properties of the selected phytochemicals were evaluated, followed by structural optimisation of the ligands. The molecular docking experiment was performed to assess the binding affinity of these phytochemicals to the SARS-CoV-2 protein receptors and predict the new potentially active bioactive compounds with antiviral properties. Twenty-Four phytochemicals were downloaded from PubChem (https://pubchem.ncbi.nlm.nih.gov/) and considered as ligands for this study. Three ( The canonical smile formats of the phytochemicals were uploaded in the SwissADME (http://www.swissadme.ch/) site to evaluate its pharmacological and drug-likeness. The proteins (PDB id 5WRG, 6W4B,6Y84, and 6M71) were prepared by retrieving the three-dimensional crystal structure of each from the RCSB protein bank (https:// www.rcsb.org/), structure optimisation was performed by UCSF Chimera software [32] . The three-dimensional structures of the phytochemicals, as well as the control drug Remdesivir (Compound CID: 121,304,016), were downloaded from PubChem in '. SDF' format. Structural optimisation and conversion in the PDB format were done by Avogadro software [12] before conducting the molecular docking analysis. Following receptor and ligand preparation, molecular docking analysis was performed by DockThor web server (https://www.dockthor.lncc.br/v2/) [36] to evaluate the binding affinities. After the minimisation process, the grid box resolution was set along the x, y, and z points (size and center), respectively in a partially blind docking mode (dimension: x, y and z: 188.446, 193.4115 and 169.673 for 5WRG; 40.4825, -12.5045 and 13.711 for 6W4B 11.6405, -0.022 and 6.329 for 6Y84 and 121.009, 121.761 and124.981 for 6M71; Grid Size was set as 40) grid was centred onto the proteins. The control drugs (Remdesivir), as well as all 24 phytochemicals, were docked with all three protein receptors and the resulting interactions were compared with those calculated docking results of the Remdesivir with the same receptors. The visualisation and analysis of the docking sites were done by Discovery Studio 2020 (BIOVIA, San Diego, USA). Multivariate data analysis based on Principal Component Analysis (PCA) tool was performed by Minitab software (Minitab 18). Generation of the single coloured heat map and hierarchical clustering analysis based on Pearson correlation was performed by Molecular Experiment Viewer 4.9.0 (MEV 4.9.0). We used the four docking scores (affinity kcal/mol) to four different targets as input to PCA to extract PC1 and PC2 coordinates as well as to construct the hierarchical clusters. ClassyFire (http://classyfire.wishartlab.com/) relies on a comprehensible, comprehensive, and computable chemical taxonomy. It is a free accessible web-based application for automated structural classification of chemical entities. Based on the PCA results, we performed ClassyFire based structural analysis of compounds to understand their intra and inter-cluster relationships. The process of drug discovery is evolving since its inception. To increase the accessibility and effectiveness of the drug discovery process, researchers have been continuously striving to develop new tools such as SwissADME. SwissADME is an open-source web server, and it predicts ADME (Absorption, Distribution, Metabolism, and Excretion) parameters and computes physicochemical descriptors, pharmacokinetic properties, drug-like nature and medicinal chemistry friendliness of one or multiple small molecules to support drug discovery. In this study, 24 phytochemicals (and Remdesivir) were screened and evaluated for drug-like properties. The result was represented in the BOILED-Egg graphical classification model (Supplementary material). BOILED-Egg graphical interface can predict passive diffusion through passive human gastrointestinal absorption (HIA) and blood-brain barrier (BBB) permeation by position in a WLOGP-versus-TPSA physicochemical space [5] . In our study, 4 (four) phytochemicals were found to show passive human gastrointestinal absorption (HIA), and 17 (seventeen) compounds showed a blood-brain barrier (BBB) permeation property. Four (4) compounds were found to be out of range. A membrane-bound transporter PGP (P-glycoprotein) mediates efflux (active transport) of a wide range of structurally unrelated drugs and other xenobiotics out of the cells. Against a concentration gradient, PGP induces the efflux of various substrates leading to the reduction of their intracellular concentration, thereby affecting the oral bioavailability of drugs [4] . BOILED-Egg graphical presentation further showed that among 25, 8 (eight) phytochemicals could function as PGP substrates (PGP?). Additionally, drug likeliness parameters, as shown in Table 1 , showed that with few parameter exceptions, these phytochemicals could be used as potential drug candidates. The current outbreak of the CoV has caused significant concern in the field of drug and vaccine development. Researchers have started investigating all possible compounds that could work against it. The target proteins used for molecular docking in this study were based on the significant role that they play for the survival of the virus. The use of phytochemicals over the chemically synthesised drugs has increased in demand in the last few decades, mainly due to their effectiveness and lesser-known side effects. A variety of active phytochemicals have been studied regularly for drug development [27] . Molecular docking is an efficient technique to study ligand-protein interaction probability. The results, in general, are expressed with free binding energy (kcal/mol; binding affinity) which is expected to be lower in case of optimal docking poses. DockThor is considered as one of the most useful docking servers. In a study, by Santos et al., 2020 showed that it could dock 40 % of the cases with an overall backbone RMSD below 2.5 Å when the top-scored docking pose was considered in other software. Further, DockThor was capable of assessing the docking poses closest to the crystal structure (i.e., best-RMSD pose), with a success rate of 60% [36] . In our study, DockThor aided molecular docking study revealed that phytochemicals could effectively bind with viral proteins when compared with the commercial drug Remdesivir (CID 121,304,016) ( Table 2 ; Fig. 1 ). SARS-CoV spike glycoprotein (PDB id 5WRG) is the surface protein that is mainly responsible for the initial attachment with the host cells receptor, Angiotensin-converting enzyme (ACE2). It also has a critical role in penetrating the cell by fusing with the membranes. The precursor chain of his protein is initially synthesised as a single polypeptide chain of 300 amino acids and further cleaved by host furin-like proteases into an amino (N)terminal S1 subunit and a carboxyl (C)-terminal S2 subunit [10] . Hence, inhibiting spike protein of SAR CoV shall prevent the viral entry, fusogenicity and propagation. Pandey et al. 2020 [30] in their work targeted the SARS-CoV spike glycoprotein with ten naturally occurring phytochemicals in an in-silico docking study and compounds like Fisetin, Quercetin, and Kaempferol showed significant binding affinities for viral spike glycoprotein. The authors observed the binding energies of the phytochemicals within the range of -6.7 to -8.5 kcal/mol, which was significantly lower than the control drug hydroxychloroquine (-5.6 kcal/mol). Lower docking binding energy reflects better binding affinity of the ligands towards target proteins. The performance of these compounds in terms of prevention of infectivity and virulence of the viral pathogens was correlated with low binding free energy in the docking study. In our research, while docked with SARS-CoV spike glycoprotein (PDB id 5WRG), nine (09) compounds namely Cartilagineol (CID 101,934,341), Flavone (CID 10,680), Stigmastanol (CID 241,572), Campesterol (CID [14] . Therefore, inhibition of RdRp can be an effective strategy to stop the growth of the viral population. In our study, Squalene (CID 638,072) showed better docking score in case of viral RNA dependent RNA polymerase protein (PDB Id 6M71) when compared with Remdesivir. Squalene, interacted with the protein through ALA688 (chain A, conventional H bond, 3.10 Å ) and ASP760 (Chain A, Conventional H bond, 1.66 Å ). Remdesivir, interacted with the protein through the amino acids ILE548 (chain A, Pi alkyl bond, 4.94 Å ), ASP760 (chain A, two conventional H bonds, 2.08 and 2.33 Å ), ASP761 (chain A, three Pi-anion attractive charges, 3.61, 4.04 and 4.24 Å ), SER814 (chain A, conventional H bond, 2.73 Å ) and CYS813 (chain A, Pi-alkyl bond, 4.46 Å ). SARS-CoV-2 main protease (PDB id 6Y84) being reported to be involved in the viral translation process through the processing of the polyproteins [52] was considered as another target for this study. In a study, the drug likeliness of various phytochemicals from Ocimum sanctum was analysed against the CoV-2 main protease using docking protocols, and Tulsinol and Dihydrodieuginol B were identified to have a potent inhibitory effect on the viral protein [44] SARS CoV-2 Nsp9 RNA binding protein (PDB id 6W4B) is a non-structural protein that is presumed to have an essential role in binding with the RNA/DNA during replication; however, its direct involvement is still unclear [25] . Silva et al., 2020 [39] reported the best docking ligands for SARS-CoV Nsp15/NendoU as (E, E)-a-Farnesene,(E)-b-Farnesene, and (E, E)-Farnesol. (E, E)-Farnesol. Flavone (CID 10,680) was found to effectively bind with SARS CoV-2 Nsp9 RNA binding protein (PDB id 6W4B) with affinity -8.405 Kcal/mol. Pi-sulfur bond (5.6 Å ) interaction was observed between MET13 (chain A) of the protein and the phytochemical. Further, Flavone showed specific interactions of Pi-alkyl bonds between ILE66 (chain A) and ARG40 (chain A) and Flavone (5.44 Phytochemicals as potential drug candidates for targeting SARS CoV 2 proteins and 4.68 Å ). MET13 (chain A) showed Pi-sulfur interaction (5.6 Å ). Finally, we identified compounds like Cartilagineol, Flavone, Campesterol, Corynan-17-Ol, Elatol, Ethylbenzene, Demecolcine, Beta cubebene and Squalene were found to be potential drug candidates for their respective targets. The extensive study on these natural phytochemicals also showed great results against other viruses like dengue, HIV, malaria, etc. [18, 43] . Many of these compounds are indicated in the literature to have pharmacological properties [48] . Cartilagineol has mainly derived from red algae Laurencia sp. Studies have been conducted to evaluate their properties, and it is well known for their antimicrobial and anti-inflammatory activity. Flavones are a large group of compounds that are naturally found in various plants such as Artemisia, Gnaphalium and Achyroclines and are medically acclaimed to have antioxidant, antimicrobial and anti-cancerous activity. Flavones are used for docking as a potential drug against viruses like Picorna virus [15] , Dengue virus [26, 41] etc. On the other hand, phytochemicals like Stigastanol and Campesterol plant sterol derivatives found in plants like algae and aerial plants like Caltopis gigantea and Carissa carandas were studied in various molecular docking experiments of cancer [40] and Human Rhinovirus [17] . Squalene is obtained from plants like Alliaria petiolata. Anacardium occidentale, and Carica papaya and showed its efficacy against viral pathogens such as HIV, Dengue and Ebola in silico [7] . Praveena et al. [33] reported Corynan-17-ol and 18, 19-didehydro-10-methoxy from Morinda tinctoria fruit extract as lead molecules against breast cancer protein ErBb2 in an in silico docking study. Elatol a key phytochemical isolated from the marine algae red Seaweed Laurencia dendroidea showed anti-leishmanial activity against Leishmania amazonensis [34, 35] . Beta cubebene, as one of the constituents of essential oil of Ocimum basilicum, showed antioxidant and antiviral activities [34] . Phytochemicals with positive docking results in our study hence can be explored further as a potential SARS CoV 2 candidate drugs in agreement with the literature. Principal Component Analysis (PCA) is a multidimensional data analysis tool which mainly deals with a large dataset and interprets them by reducing their dimensionality thereby making it easy to deduce with minimum loss of statistical information or ''variability'' [16] . It transforms measured variables into uncorrelated variables, i.e. principal components. Each of the principal components covers a separate dimension of variations of the measured dataset. While the first component shows the maximum variations of the dataset, the second component is orthogonal to the first one and covers remaining variations and so on [31] . Its working domain is vast, starting from biology, physiology, chemistry, engineering, physics and meteorology. The application of PCA ranges from data mining, quantitative structure-activity to 'omics' approaches [9] . To discern the overall quantitative relations among various phytochemicals, a PCA using the affinity values (Kcal/mol) of docked results were performed. First principal component (PC1) and the second component (PC2), as shown in Fig. 2 explained approximately 74.9 and 9.8% of the variance (total explained variations 84.7%), respectively. Considering both the components (PC1 and PC2), we observed four distinct groups in PCA analysis. Phytochemicals namely Elatol, Flavone, Rutin, Betacubebene and Campesterol were placed in the same group as Remdesivir indicating statistically similar potentials. Hierarchical cluster analysis is the iterative statistical method which involves multiple steps leading to the formation of small classes based the similar observations. To overview and interpret a large set of data, often those are grouped into smaller categories. By this, researchers can conveniently conclude [21] . Although we observed quite a few variations with PCA analysis, possibly due to different statistical algorithm, however, Remdesivir was placed with Beta-cubebene and Campesterol (Supplementary material). Further, after performing chemotaxonomic analysis in ClassyFire, we observed that median values of the molecular weights for cluster 1, 2, 3 and 4 in PCA, were 185.22, 410.73, 158.24 and 367.19 g/mol respectively. All the PCA clusters were predominantly rich in lipid variants although with variations in their percentage content (* 50, 75, 67 and 50% in Cluster 1, 2, 3 and 4 respectively) and chemical subclasses (fatty acyls-prenols, steroids-prenols, fatty acyls-prenols and steroid-prenols in cluster 1, 2, 3 and 4 respectively). Apart from lipid molecules, we also observed a significant presence of flavonoids in cluster 4. Cluster 1, however, showed the almost equal presence of saturated hydrocarbon-benzofuran along with lipids. It can be concluded from the study that 11 (eleven) phytochemicals, as mentioned in the study, are capable of inhibiting specific target protein of SARS-CoV 2 and can be further explored as potential drug candidates. Funding The authors declare no funding source. Conflict of interest The authors declare that they have no conflict of interest. Coronavirus susceptibility to the antiviral remdesivir (GS-5734) is mediated by the viral polymerase and the proofreading exoribonuclease Predicting commercially available antiviral drugs that may act on the novel coronavirus (SARS-CoV-2) through a drug-target interaction deep learning model Potential for global spread of a novel coronavirus from China Lipid formulation strategies for enhancing intestinal transport and absorption of P-glycoprotein (P-gp) substrate drugs: in vitro/in vivo case studies SwissADME: a free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules New insights on the antiviral effects of chloroquine against coronavirus: what to expect for COVID-19? Handbook of phytochemical constituent grass, herbs and other economic plants Computational prediction of drug-target interactions using chemogenomic approaches: an empirical survey The application of principal component analysis to drug discovery and biomedical data Cryo-electron microscopy structures of the SARS-CoV spike glycoprotein reveal a prerequisite conformational state for receptor binding A search for medications to treat COVID-19 via in silico molecular docking models of the SARS-CoV-2 spike glycoprotein and 3CL protease Avogadro: an advanced semantic chemical editor, visualisation, and analysis platform The novel coronavirus 2019 (2019-nCoV) uses the SARS-coronavirus receptor ACE2 and the cellular protease TMPRSS2 for entry into target cells Pharmacological therapeutics targeting RNA-dependent RNA polymerase, proteinase and spike protein: from mechanistic studies to clinical trials for COVID-19 Antipicornavirus flavone Ro 09-0179 Principal component analysis: a review and recent developments Ghosh M In-silico discovery of natural lead hits from the genus of arisaema against Human Rhino Virus The pharmacokinetic properties of HIV-1 protease inhibitors: a computational perspective on herbal phytochemicals Targeting SARS-CoV-2: a systematic drug repurposing approach to identify promising inhibitors against 3 C-like proteinase and 2 0 -O-ribose methyltransferase A novel coronavirus associated with severe acute respiratory syndrome Statistical significance for hierarchical clustering in genetic association and microarray expression studies Crystal structure of the SARS-CoV-2 non-structural protein 9 Research and development on therapeutic agents and vaccines for COVID-19 and related human coronavirus diseases Phytochemicals as potential drug candidates for targeting SARS CoV 2 proteins Antiviral agents from plants and herbs: a systematic review Severe acute respiratory syndrome coronavirus nsp9 dimerisation is essential for efficient viral growth Identification of bioflavonoid as fusion inhibitor of dengue virus using molecular docking approach Antiviral activity of phytochemicals: a comprehensive review Antiviral activity of phytochemicals: a current perspective Lung recruitability in COVID-19-associated acute respiratory distress syndrome: a single-center observational study Targeting SARS-CoV-2 spike protein of COVID-19 with naturally occurring phytochemicals: an in silico study for drug development Application of principal component and hierarchical cluster analysis to classify fruits and vegetables commonly consumed in Ireland based on in vitro antioxidant activity UCSF Chimera-a visualisation system for exploratory research and analysis In vitro and in silico analysis to identify novel lead compound from Morinda tinctoria fruit against breast cancer Chemical compositions, antiviral and antioxidant activities of seven essential oils Effect of elatol, isolated from red seaweed Laurencia dendroidea, on Leishmania amazonensis Highly flexible ligand docking: benchmarking of the DockThor program on the LEADS-PEP protein-peptide dataset In silico studies on therapeutic agents for COVID-19: drug repurposing approach Discovery and Synthesis of a Phosphoramidate Prodrug of a Pyrrolo [2, 1-f][triazin-4-amino] Adenine C-Nucleoside (GS-5734) for the Treatment of Ebola and Emerging Viruses Essential oils as antiviral agents. Potential of essential oils to treat SARS-CoV-2 infection: an in -silico investigation In-silico docking analysis of Calotropis gigantea (L.) R. Br derived compound against anti-cervical cancer activity Anti-neoplastic activity of two flavone isomers derived from Structural insights into coronavirus entry Computer aided screening of phytochemicals from Garcinia against the dengue NS2B/NS3 protease Molecular modeling of isolated phytochemicals from Ocimum Sanctum towards exploring potential inhibitors of SARS coronavirus main protease and papain-like protease to treat COVID-19 Unexpected receptor functional mimicry elucidates activation of coronavirus fusion Remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus (2019-nCoV) in vitro World Health Organisation Coronavirus diseases (COVID 19) situation report-116 Phytochemicals in medicine and food Broad spectrum antiviral agent niclosamide and its therapeutic potential Structural basis for inhibition of the RNA-dependent RNA polymerase from SARS-CoV-2 by remdesivir Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia Crystal structure of SARS-CoV-2 main protease provides a basis for design of improved a-ketoamide inhibitors A novel coronavirus from patients with pneumonia in China