key: cord-298989-qk0k2lmz
authors: Umesh; Kundu, Debanjan; Selvaraj, Chandrabose; Singh, Sanjeev Kumar; Dubey, Vikash Kumar
title: Identification of new anti-nCoV drug chemical compounds from Indian spices exploiting SARS-CoV-2 main protease as target
date: 2020-05-13
journal: J Biomol Struct Dyn
DOI: 10.1080/07391102.2020.1763202
sha:
doc_id: 298989
cord_uid: qk0k2lmz

The 2019 novel coronavirus (nCoV) has caused a global health crisis through the coronavirus disease 2019 (COVID-19) pandemic. Because no specific vaccine or antiviral drug is available for nCoV, sincere efforts in drug design and discovery for COVID-19 are needed. The novel coronavirus main protease (SARS-CoV-2 Mpro) plays a crucial role in disease propagation and therefore represents a target for drug discovery. Herein, we applied a bioinformatics approach to screen chemical compounds from Indian spices as potential inhibitors of the SARS-CoV-2 main protease (PDB ID: 6Y84). The structure files of the compounds were taken from the PubChem or ZINC database and screened by molecular docking using AutoDock-4.2, MGLTools-1.5.6 and the Raccoon virtual screening tool. The top four hits, ranked by binding affinity, were analyzed. Carnosol exhibited the highest binding affinity (-8.2 kcal/mol) and strong, stable interactions with the amino acid residues in the active site of SARS-CoV-2 Mpro. Arjunglucoside-I (-7.88 kcal/mol) and Rosmanol (-7.99 kcal/mol) also showed strong and stable binding, with favourable ADME properties. In 50 ns MD simulations, these compounds show strong hydrogen-bonding interactions with the protein active site and remain stable inside it. Our virtual screening results suggest that these small molecules can be used as potential inhibitors of SARS-CoV-2 Mpro and may have an antiviral effect on nCoV.
However, further validation and investigation of these inhibitors against the SARS-CoV-2 main protease are needed before they can be considered candidates for clinical trials.

Communicated by Ramaswamy H. Sarma

Since December 2019, an outbreak of COVID-19 with massive global impact, caused by a novel coronavirus, SARS-CoV-2, has spread from Wuhan city in Hubei, China (Bhoopathi et al., 2020; Kumar & Rathi, 2020; Wang et al., 2020). In January 2020, the World Health Organization emergency committee declared a global health emergency based on the rapid spread of the infection and its high fatality rate (Chhikara et al., 2020; Kumar & Rathi, 2020). As of 4 April 2020, Europe and the USA are the new epicentres of the pandemic. In the past, different coronaviruses have also caused human diseases that resulted in global epidemics, such as Middle East respiratory syndrome (MERS), severe acute respiratory syndrome (SARS) and coronavirus disease 2019. Although coronaviruses have a significant impact on human health, the general public has limited awareness of coronavirus pathogenesis and infection. COVID-19 was declared a pandemic in March 2020, as the worldwide human population faces a high risk of contracting the nCoV infection. Many countries, including India, have announced lockdowns and social distancing measures to avoid further spread, as no drug or vaccine is available against SARS-CoV-2.

Coronaviruses have been classified into four genera based on their shape and host: alpha-coronavirus, beta-coronavirus, gamma-coronavirus and delta-coronavirus (Paules et al., 2020). Alpha-coronaviruses and beta-coronaviruses are considered to have originated from bats, while the gamma- and delta-coronaviruses are considered to be derived from birds and pigs (Banerjee et al., 2019). Coronaviruses contain a positive-sense, single-stranded RNA genome coding for the viral polymerase, RNA synthesis materials, and a large nonstructural polypeptide.
The coronavirus genome carries modifications, including a 5' methylated cap and a 3' polyadenylated tail. Coronaviruses have very high error rates in RNA replication due to constant errors by the RNA-dependent RNA polymerase (Banerjee et al., 2019; Lau et al., 2018). Only a few protein crystal structures of SARS-CoV-2 are available in the Protein Data Bank. The crystal structure of the SARS-CoV-2 main protease (PDB ID: 6Y84), a potential drug target, was available and was used for docking simulation and identification of potential drug molecules from Indian spices. The SARS-CoV-2 main protease has a vital role in processing the polyprotein translated from viral RNA, and the protease is considered key for viral survival and growth.

Medicinal plants yielding biologically active compounds have always been of great interest to scientists, as they play an essential role in preventing human diseases. India is recognized worldwide as a country where spices have traditionally been used as a source of medicine. Many active pharmaceutical ingredients with a wide range of physiological and pharmacological properties have been identified and extracted (Sachan et al., 2018). Apart from their regular culinary uses, spices are widely used as indigenous medicines and nutraceuticals, in aromatherapy, and as natural colouring agents, perfumes, cosmetics, etc. In recent years there has been experimental evidence of physiological benefits in the context of various diseases such as diabetes, cardiovascular disease, inflammatory disorders such as arthritis, and cancer. Spices have also been identified as preventive agents in certain conditions. Spices like red pepper, garlic, and fenugreek have been reported to have hypocholesterolemic activity, whereas fenugreek and garlic are also known to reduce and control blood sugar levels (Bhagya & Raveendra, 2017).
Curcumin, the main active ingredient of turmeric, has been widely studied and has a broad spectrum of medicinal value, including anti-cancer, anti-inflammatory and anti-amyloidogenic activity (Bhagya & Raveendra, 2017). Several computational studies related to drug or vaccine development against SARS-CoV-2 have recently been published (Aanouz et al., 2020; Gupta et al., 2020; Joshi et al., 2020; Muralidharan et al., 2020; Sarma et al., 2020). In the current study, we used prior knowledge of the medicinal value and potential applications of Indian spices to explore whether their compounds can be used as novel agents for controlling SARS-CoV-2. The data generated using computational approaches give very encouraging results.

In this study, a list of 45 chemical compounds from Indian spices was prepared, and the compound structure files were downloaded from the PubChem or ZINC database. The structure of the SARS-CoV-2 Mpro protein was downloaded from the RCSB Protein Data Bank (PDB ID: 6Y84). The atomic coordinates of the active site were defined using reports available in the literature that identify the catalytic residues His41, His164 and Cys145 (Khaerunnisa et al., 2020). Energy minimization of the protein was done with Swiss-PdbViewer (Guex & Peitsch, 1997). Before docking, charges, solvation parameters and fragmental volumes were assigned to the protein using AutoDock Tools 4 (ADT). The 6Y84 protein structure was further optimized with ADT for molecular docking using an established procedure.

The structure data files of all chemical compounds were downloaded from the PubChem database and converted into mol2 structures using Open Babel. To simplify the analyses, the ligands were first optimized and then converted from mol2 to PDBQT format using the graphical user interface version of Raccoon. Raccoon, used here with MGLTools-1.5.6, is a Python-based tool for preparing AutoDock virtual screenings (Forli, Scripps Research Institute).
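The SDF-to-mol2 conversion step described above can be sketched as a small batch script. This is a minimal illustration, assuming the Open Babel command-line tool `obabel` is installed; the file names and the `mol2` output directory are hypothetical and do not come from the study, and the later mol2-to-PDBQT step done in the Raccoon GUI is not covered.

```python
from pathlib import PurePosixPath


def obabel_command(sdf_path, out_dir="mol2"):
    """Build the Open Babel command that converts one SDF file to mol2.

    The output directory name is an assumption made for this sketch.
    """
    sdf = PurePosixPath(sdf_path)
    out = PurePosixPath(out_dir) / (sdf.stem + ".mol2")
    # 'obabel <input> -O <output>' is the usual Open Babel CLI form;
    # the output format is inferred from the .mol2 extension.
    return ["obabel", str(sdf), "-O", str(out)]


# Hypothetical ligand files, used only for illustration.
ligands = ["sdf/carnosol.sdf", "sdf/rosmanol.sdf"]
commands = [obabel_command(path) for path in ligands]

for cmd in commands:
    # Print the command instead of running it, so the sketch has no side effects.
    print(" ".join(cmd))
```

Each command list could be passed to `subprocess.run` to perform the conversion once Open Babel is available on the system.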
Compound screening using Raccoon/AutoDock program

Molecular screening of all chemical compound libraries was performed using Raccoon and MGLTools-1.5.6, with AutoDock as the docking engine (Morris et al., 2009). During docking, the ligands were treated as flexible molecules and the protein as a rigid structure. The grid parameter file and the docking parameter file were generated using AutoDock. The AutoDock and AutoGrid tools integrated with AutoDock4 were used to generate grid maps for each atom type of the ligand. The grid boxes were set up to include one site at a time for docking. The grid X, Y and Z coordinates were 9.204, -4.557 and 19.602. Each ligand was docked with default docking parameters except for the number of runs: the Lamarckian genetic algorithm (LGA) was run 100 times per ligand, with an initial population of 150 random starting positions and conformations and 2.5 million energy evaluations. The grid parameter file was also used to predict the amino acid residues in the active site of the protein that interact with the ligands. Poses with positional root-mean-square deviation (RMSD) values below 1.0 Å were considered ideal and were clustered together to find the favourable binding mode. The ligand with the most negative binding energy was considered to have the maximum binding affinity. Visual analysis of the docking site was performed using Pymol-2.3.3, and the results were validated using AutoDock Tools-1.5.6 (Morris et al., 1998). Binding interactions between the identified inhibitors and SARS-CoV-2 Mpro (PDB ID: 6Y84) were analysed using the LigPlot program (Wallace et al., 1995).

Classical molecular dynamics simulations were carried out for the prepared co-crystal structure and the selected ligand-binding complex poses using the Desmond molecular dynamics package incorporated in the Schrödinger suite (Chow et al., 2008).
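As a minimal sketch of the docking settings reported in the docking protocol above, the fragment below writes the corresponding AutoDock4 parameter lines. It assumes that the reported X, Y, Z values are the grid centre, which the text does not state explicitly. The keywords `gridcenter`, `ga_pop_size`, `ga_num_evals` and `ga_run` are standard AutoDock4 parameter-file keywords, but these fragments are not complete GPF or DPF files and are not the authors' actual input files.

```python
# Grid coordinates reported in the text; treating them as the grid centre
# is an assumption of this sketch.
GRID_CENTER = (9.204, -4.557, 19.602)


def gpf_fragment(center):
    """Return the grid-centre line of an AutoGrid parameter file (GPF)."""
    return ["gridcenter {:.3f} {:.3f} {:.3f}".format(*center)]


def dpf_fragment(runs=100, pop_size=150, num_evals=2_500_000):
    """Return the genetic-algorithm lines of a docking parameter file (DPF).

    Defaults follow the values stated in the text: 100 LGA runs,
    a population of 150 and 2.5 million energy evaluations.
    """
    return [
        f"ga_pop_size {pop_size}",
        f"ga_num_evals {num_evals}",
        f"ga_run {runs}",
    ]


for line in gpf_fragment(GRID_CENTER) + dpf_fragment():
    print(line)
```

All other docking parameters were left at their defaults in the study, so they are deliberately omitted here.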
Both the apo protein and the ligand complexes are solvated with the TIP3P water model (a three-site rigid water molecule with charges) in an orthorhombic box with periodic boundary conditions. To neutralize the system, appropriate counter-ions (Na+ or Cl-) are added to the apo and ligand-complex systems, along with a salt concentration of 0.15 mol/L. The prepared system is energy minimized to a convergence threshold of 1.0 kcal/mol/Å using the steepest descent method, and the NPT ensemble is applied for minimization and relaxation of the system (Selvaraj et al., 2015). The temperature is kept constant at 300 K and the pressure at 1.013 bar throughout, and each simulation is run for a total of 50 ns. The RMSD, RMSF and hydrogen bond interactions of the MD trajectories are analyzed using the trajectory analysis tools in Desmond (Selvaraj & Singh, 2014).

The absorption, distribution, metabolism, and excretion (ADME) properties of the top hit compounds were calculated using the online SwissADME program (Guex & Peitsch, 1997). Significant ADME-related parameters such as Lipinski's rule of five, pharmacokinetic properties, solubility and drug-likeness were considered.

We created a database of 45 compounds from Indian spices, with structures downloaded from the PubChem and ZINC databases. Four small molecules were selected based on their binding affinity with SARS-CoV-2 Mpro, as shown in Table 1. The remaining compounds used for molecular docking are listed in the supplementary material (Table 1S). The top three compounds are Carnosol, Rosmanol, and Arjunglucoside-I. Further insights into the binding interactions of these compounds with SARS-CoV-2 Mpro were obtained using LigPlot, as shown in Figures 1 and 2.
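The ranking of hits by binding energy can be illustrated with a short sketch. It uses the three binding energies quoted in the abstract and converts each to an estimated inhibition constant with the standard relation Ki = exp(ΔG / RT) at 298.15 K, which is the form AutoDock uses for its estimated Ki. The resulting numbers are illustrative only and are not the Ki values reported in Table 1.

```python
import math

R_KCAL = 0.0019872   # gas constant in kcal/(mol*K)
TEMPERATURE = 298.15  # K, the temperature assumed for the estimate


def ki_molar(delta_g_kcal):
    """Estimate an inhibition constant (mol/L) from a binding free energy."""
    return math.exp(delta_g_kcal / (R_KCAL * TEMPERATURE))


# Binding energies (kcal/mol) quoted in the abstract.
binding_energy = {
    "Carnosol": -8.2,
    "Rosmanol": -7.99,
    "Arjunglucoside-I": -7.88,
}

# The most negative energy corresponds to the strongest predicted binding.
ranked = sorted(binding_energy, key=binding_energy.get)

for name in ranked:
    dg = binding_energy[name]
    print(f"{name}: {dg} kcal/mol, estimated Ki = {ki_molar(dg) * 1e6:.2f} uM")
```

Because the relation is exponential, a difference of a few tenths of a kcal/mol in docking score already changes the estimated Ki noticeably, which is one reason docking scores are usually treated as a ranking aid and not as measured affinities.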
The SARS-CoV-2 Mpro structure with a defined but unliganded active site (PDB ID: 6Y84) (Owen et al., 2019) was used, and the molecules were docked in the region where α-ketoamide is bound in the SARS-CoV-2 main protease complex (PDB ID: 6Y2F) (Zhang et al., 2020), involving His41, His164 and Cys145 in the active site. α-Ketoamide was therefore used as a positive control for this study, and its interactions were compared with those of the Indian spice compounds. α-Ketoamide showed hydrogen bonds with Cys145 and hydrophobic interactions with His41. It also formed hydrogen bonds with Thr26. Further analysis of the docking results showed that our molecules had binding energies comparable to that of α-ketoamide, suggesting that these compounds could indeed interact with the same site (amino acids) as α-ketoamide.

Table 1. Details of the interactions between each inhibitor and the amino acids near the active site of the SARS-CoV-2 main protease, along with the respective inhibitor constant (Ki), biological source and binding energy. The active site residues are indicated in bold.

Carnosol (-8.2 kcal/mol) formed hydrogen bonds with Leu141, Ser144 and Cys145. Hydrophobic interactions for Carnosol include His41, Thr25, Asn142, Phe140, Glu166 and Met165 (Figure 1). Rosmanol, isolated from rosemary (Rosmarinus officinalis) and previously reported to have antioxidant activity, shows a binding energy of -7.99 kcal/mol and forms hydrogen bonds with Leu141, Gly143, Ser144 and Cys145. Further hydrophobic interactions are seen with Thr25, Thr26, His41, Phe140, His163 and Leu27 (Figure 2a). Arjunglucoside-I shows hydrophobic interactions with His41, His164 and Cys145, in addition to hydrogen bonding with Thr25, Thr26, His163 and Glu166 (Figure 2b).

The MD simulation results for both the apo protein and the ligand complexes were analyzed over the 50 ns time scale to understand their dynamic behavior and stability.
MD simulation was performed for a total of 50 ns, and the trajectories were used for the RMSD plot. The protein secondary structure comprises 3 sheets, 7 beta hairpins, 9 beta bulges, 13 strands, 32 beta turns and 3 gamma turns; residues 182-304 are occupied predominantly by loop regions. These 122 C-terminal residues are active in the MD simulation, and thus a sudden drift occurs at the 45th ns. The RMSD values for the apo protein are compared with those of the ligand complexes in Figure 3; to understand the deviations, we focus on the interval from the 25th to the 50th ns. The 50 ns MD simulations show that the α-ketoamide complex (red) and the Arjunglucoside-I complex (blue) remain stable throughout the simulation. Compared with the apo protein, these two ligand complexes follow the apo protein up to the 45th ns. After the 45th ns the apo protein drifts upwards, but the α-ketoamide and Arjunglucoside-I complexes remain stable. This may be due to the strong interaction patterns of α-ketoamide and Arjunglucoside-I with the protein. Both ligand-bound complexes are stable at around 2 Å, which is close to the apo protein until the 47th ns. Figures 5a and 5b show the hydrogen bond interactions for α-ketoamide and Arjunglucoside-I: at least 2-3 hydrogen bonds are maintained between α-ketoamide and the protein, and at least 4-6 between Arjunglucoside-I and the protein. This active participation of hydrogen bonds keeps the ligand complexes stable over the 50 ns of MD simulation.
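The RMSD values discussed above measure how far a set of atoms has moved from a reference structure. The following is a minimal sketch of that quantity for two coordinate sets, assuming the frames are already superimposed; trajectory analysis tools such as the one in Desmond normally align each frame to the reference first, a step omitted here. The coordinates are made-up toy values, not data from the simulations.

```python
import math


def rmsd(reference, frame):
    """Root-mean-square deviation between two lists of (x, y, z) coordinates.

    Assumes both lists hold the same atoms in the same order and that the
    frame has already been superimposed on the reference.
    """
    if len(reference) != len(frame):
        raise ValueError("coordinate sets must have the same number of atoms")
    squared = sum(
        (a - b) ** 2
        for ref_atom, frame_atom in zip(reference, frame)
        for a, b in zip(ref_atom, frame_atom)
    )
    return math.sqrt(squared / len(reference))


# Toy coordinates in Å (hypothetical): every atom is displaced by 2 Å along z.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
frame = [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0)]

print(rmsd(reference, frame))
```

Computing this value for every frame of a trajectory against the starting structure gives the kind of RMSD-versus-time curve compared between the apo protein and the ligand complexes.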
Carnosol shows stable behaviour in terms of RMSD, with a narrow trace, but compared with the apo protein the Carnosol complex deviates from the 14th ns onwards and stays at around 3.4 Å until the end of the simulation. In the Carnosol complex, 2-3 hydrogen bonds contribute actively and keep the complex stable in the dynamic state. The Rosmanol complex is stable until the 26th ns at around 2.5 Å, but then drifts upwards and settles at a positional RMSD of around 4 Å. This may be due to the loss of hydrogen bonding interactions after the 27th ns, seen in Figure 5d. Figure 5d shows 2-3 hydrogen bonds between the protein and the ligand during the first 27 ns; after the 27th ns the ligand loses part of its hydrogen bonding and shows only 1-2 hydrogen bonds with the protein. Overall, the apo protein shows a narrow range of stability with notable fluctuations, which indicates the participation of loop structures. These fluctuations are arrested by the active interactions of the ligand molecules, which indicates strong binding between the protein and the ligands.

We have also analysed the molecules reported here for violations of Lipinski's rule (Table 2). Rosmanol and Carnosol fit within the defined parameters, with no violations of Lipinski's rule. The molecules have Log P values ranging from 1.15 to 3.27, which implies that they can have suitable cell membrane permeability. The numbers of hydrogen bond donors and acceptors are well within range for Carnosol and Rosmanol, but Arjunglucoside-I shows a high number of hydrogen bond donors and acceptors. Arjunglucoside-I is a large molecule with a high molecular weight and topological polar surface area.
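The Lipinski check discussed above can be sketched as a simple count of rule-of-five violations, using the commonly cited thresholds (molecular weight at most 500, Log P at most 5, at most 5 hydrogen bond donors and at most 10 acceptors). The two descriptor sets below are hypothetical placeholders chosen to show a passing and a failing case; they are not the values in Table 2.

```python
def lipinski_violations(mol_weight, log_p, h_donors, h_acceptors):
    """Count how many of Lipinski's rule-of-five criteria are violated."""
    checks = [
        mol_weight > 500,   # molecular weight above 500 Da
        log_p > 5,          # octanol-water partition coefficient above 5
        h_donors > 5,       # more than 5 hydrogen bond donors
        h_acceptors > 10,   # more than 10 hydrogen bond acceptors
    ]
    return sum(checks)


# Hypothetical descriptor values, for illustration only.
small_molecule = dict(mol_weight=330.4, log_p=3.0, h_donors=2, h_acceptors=4)
large_glycoside = dict(mol_weight=650.0, log_p=1.2, h_donors=8, h_acceptors=11)

print(lipinski_violations(**small_molecule))
print(lipinski_violations(**large_glycoside))
```

A compound with a high molecular weight and many donors and acceptors accumulates several violations at once, which is the pattern described for a large glycoside.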
Despite its high molecular weight and polarity, Arjunglucoside-I might still prove valuable as a potential drug once followed up with advanced studies.

Amid the unforeseen outbreak of COVID-19, there has been a sudden rise in demand for drug development, vaccines and the identification of potential bioactive molecules that could broaden treatment options. In the quest for novel treatment regimens for viral outbreaks of this kind, screening of already known molecules could also prove vital. In this context, we report that some active pharmaceutical ingredients present in spices commonly used in India could prove useful. Preliminary in silico investigations show that molecules such as Carnosol and Rosmanol have properties that can be further exploited and investigated as drug candidates against SARS-CoV-2. However, the development of robust and highly specific treatment options will require further experimental studies.

No potential conflict of interest was reported by the author(s).
Moroccan medicinal plants as inhibitors of COVID-19: Computational investigations
Bats and coronaviruses
Mulibenificial uses of spices: A brief review
Novel 2019 coronavirus structure, mechanism of action, antiviral drug promises and rule out against its treatment
Corona virus SARS-CoV-2 disease COVID-19: Infection, prevention and clinical advances of the prospective chemical drug therapeutics: A review on Corona Virus Disease COVID-19, epidemiology, prevention, and anticipated therapeutic advances
Desmond performance on a cluster of multicore processors
SWISS-MODEL and the Swiss-PdbViewer: An environment for comparative protein modeling
In-silico approaches to detect inhibitors of the human severe acute respiratory syndrome coronavirus envelope protein ion channel
Discovery of potential multi-target-directed ligands by targeting host-specific SARS-CoV-2 structurally conserved main protease
Potential inhibitor of COVID-19 main protease (Mpro) from several medicinal plant compounds by molecular docking study
Coronavirus disease COVID-19: A new threat to public health
Discovery and sequence analysis of four deltacoronaviruses from birds in the Middle East reveal interspecies jumping with recombination as a potential mechanism for avian-to-avian and avian-to-mammalian transmission
Autodock4 and AutoDockTools4: Automated docking with selective receptor flexibility
Computational studies of drug repurposing and synergism of lopinavir, oseltamivir and ritonavir binding with SARS-CoV-2 Protease against COVID-19
SARS-CoV-2 main protease with unliganded active site (2019-nCoV
Coronavirus infections - more than just the common cold
Drug targets for corona virus: A systematic review
Medicinal uses of our spices used in our traditional culture
In-silico homology assisted identification of inhibitor of RNA binding against 2019-nCoV N-protein (N terminal domain)
Mechanistic insights of SrtA-LPXTG blockers targeting the transpeptidase mechanism in Streptococcus mutans
Validation of potential inhibitors for SrtA against Bacillus anthracis by combined approach of ligand-based and molecular dynamics simulation
LIGPLOT: A program to generate schematic diagrams of protein-ligand interactions
Review and prospect of pathological features of Corona virus disease
Crystal structure of SARS-CoV-2 main protease provides a basis for design of improved α-ketoamide inhibitors