key: cord-0872814-2lv8v7gj authors: Holanda, Vanderlan Nogueira; Lima, Elton Marlon de Araújo; da Silva, Welson Vicente; Maia, Rafael Trindade; Medeiros, Rafael de Lima; Ghosh, Arabinda; Lima, Vera Lúcia de Menezes; de Figueiredo, Regina Celia Bressan Queiroz title: Identification of 1,2,3-triazole-phthalimide derivatives as potential drugs against COVID-19: a virtual screening, docking and molecular dynamic study date: 2021-01-18 journal: Journal of biomolecular structure & dynamics DOI: 10.1080/07391102.2020.1871073 sha: 2c2c30372b8f46ba1c3d2e78408296ba2586de11 doc_id: 872814 cord_uid: 2lv8v7gj In this work we aimed to perform an in silico predictive screening, docking and molecular dynamic study to identify 1,2,3-triazole-phthalimide derivatives as drug candidates against SARS-CoV-2. The in silico prediction of pharmacokinetic and toxicological properties of hundred one 1,2,3-triazole-phtalimide derivatives, obtained from SciFinder® library, were investigated. Compounds that did not show good gastrointestinal absorption, violated the Lipinski's rules, proved to be positive for the AMES test, and showed to be hepatotoxic or immunotoxic in our ADMET analysis, were filtered out of our study. The hit compounds were further subjected to molecular docking on SARS-CoV-2 target proteins. The ADMET analysis revealed that 43 derivatives violated the Lipinski’s rules and 51 other compounds showed to be positive for the toxicity test. Seven 1,2,3-triazole-phthalimide derivatives (A7, A8, B05, E35, E38, E39, and E40) were selected for molecular docking and MFCC—ab initio analysis. The results of molecular docking pointed the derivative E40 as a promising compound interacting with multiple target proteins of SARS-CoV-2. The complex E40-M(pro) was found to have minimum binding energy of −10.26 kcal/mol and a general energy balance, calculated by the quantum mechanical analysis, of −8.63 eV. MD simulation and MMGBSA calculations confirmed that the derivatives E38 and E40 have high binding energies of −63.47 ± 3 and −63.31 ± 7 kcal/mol against SARS-CoV-2 main protease. In addition, the derivative E40 exhibited excellent interaction values and inhibitory potential against SAR-Cov-2 main protease and viral nucleocapsid proteins, suggesting this derivative as a potent antiviral for the treatment and/or prophylaxis of COVID-19. Communicated by Ramaswamy H. Sarma Nowadays respiratory diseases caused by virus are among the main causes of morbidity and mortality due to their easy spreading and the high pathogenicity of viral agents (Piñana et al., 2020) . At the end of 2019 a new Coronavirus, called SARS-CoV-2, appeared in Wuhan, China and became one of the most aggressive pandemics in history (Meo et al., 2020) , exceeding 3 million infected individuals and causing thousands of deaths worldwide by the beginning of May 2020. The clinical manifestations of Coronavirus disease 2019 are characterized by fever, fatigue, cough, which can rapidly progress into Acute Respiratory Discomfort Syndrome (ARDS) and death . Although vaccinal candidates against COVID-19 are already under development stage, a safe and effective vaccine for large-scale vaccination is far from becoming a reality. Hydroxychloroquine or chloroquine alone or in combination with a second-generation of macrolide, have been preconized for the treatment of COVID-19 despite no conclusive evidence of their benefits (Mehra et al., 2020) . In this regard, the search for a safe and effective treatment against the severe form of COVID-19 is still necessary . Among the arsenal of bioactive compounds, the 1,2,3-triazole group has been attracting interest of scientific community due to their wide-ranging and versatile medicinal applications such as antivirals (Sun et al., 2020) , antibacterial (Gonzaga et al., 2013) , antifungal (Thanh et al., 2019) and antiparasitic (Leite et al., 2018) agents. Furthermore, 1,2,3-triazoles can be easily synthesized by low-cost copper catalysed reactions between alkynes and azides, namely click chemistry. Besides being used to synthesize triazoles derivatives, click chemistry is also widely applied for the fusion of two or more chemical groups in a single hybrid molecule with improved biological activity (Ouyang et al., 2018) . As triazole derivatives, the phthalimide group has been shown to be useful in the formation of hybrid compounds with antiviral activity against HIV (Al-Masoudi et al., 2016) , cytomegalovirus and varicella-zoster (Mandi c et al., 2020) . Importantly, both triazole and phthalimide derivatives also have relevant anti-inflammatory activity (Assis et al., 2019) . Therefore, 1,2,3-triazole-phthalimide hybrids may not only be able to prevent viral replication but also treat the severe form of COVID-19, characterized by an intense inflammatory process called "cytokine storm" . One of the strategies for the discovery of antiviral candidates against SARS-CoV-2 is the association of in silico screeningbased approaches with the molecular docking of selected molecules against SARS-CoV-2 target proteins, to identify compounds having good drug-likeness properties. Because experimental approaches for drug discovery are highly costly and time-consuming, screening-based methods offer a fast and low-cost way for prospecting new potential drugs (Hall & Ji, 2020) . The SARS-CoV-2 viral genome encodes for 29 proteinsmany of which could serve as potential targets for anti-viral drugs. Among these proteins, spike protein (S), C-like proteases and nucleocapsid proteins have gained growing interest due to their relevant roles in the host cell recognition, viral replication, processing and formation of nucleocapsid structure. Spike glycoproteins, found on the surface of SARS-CoV-2, are a class I fusion proteins which bind to human Angiotensin Converting Enzyme 2 (ACE2) receptor mediating both the fusion of the viral envelope with the host-cell membrane and the virus entry into host cells. The chymotrypsin-like cysteine protease, also known as main protease (M pro ) or 3-C like protease (3CLPro), is also an interesting target for drugs against COVID-19. This protease cleaves the C-terminal sequence of the polyprotein PP1A and PP1AB into 16 functional non-structural proteins (nsps), which play an important role in viral replication . Besides proteases and spike proteins, the nucleocapsid protein of SARS-CoV-2 has been investigated as a target protein for drug development. The nucleocapsid protein comprises two structured domains, the N-terminal domain (NTD) and the C-terminal domain (CTD), interspersed by three intrinsically disordered regions (IDRs) known as the N-terminal arm (N-arm), the central linker region (LKR) , and the C-terminal tail (C-tail) arm (Zhou et al., 2008 . The main function of these proteins is to form complexes with ribonucleoproteins. However, they also regulate the replication, transcription of viral RNA and, in the host cells, inhibit protein translation, leading to disruption of host cell metabolism. Hence, targeting this protein can lead to blockbuster therapeutic agents for COVID-19 treatment (Bhatia, Narang & Rawal, 2020; Zeng et al., 2008) . In this study, we have performed docking study of selected 1,2,3-triazole-phthalimide derivatives against COVID-19 targets. The access to the chemical structures of 1,2,3-triazole-phthalimide hybrid compounds was performed using the SciFinderV R database (https://scifinder.cas.org). For this study, only compounds that were obtained by click chemistry and have biological activity already reported in the literature were chosen for the analysis. The chemical structure of compounds was built using ACD/ChemSketch 2.1 and optimized using the PM3 semiempirical method (Stewart, 1989) implemented in Mercury 4.3.1 (Macrae et al., 2008) . The protonation states of the ligands were verified by MarvinSketch 20.19.0 software assuming a physiological pH (Butcher et al., 2015) . The canonical smiles were obtained for ADMET in silico analysis (Karwath & Raedt, 2006) . The analysis of the pharmacokinetic and druglike properties of 1,2,3-triazole-phthalimide derivatives was carried out using the Web server SwissADME, http://www.swissadme.ch/ (Swiss Institute of Bioinformatics). Only the compounds that have good gastrointestinal adsorption and did not violate the following parameters of Lipinski's Rule of Five: (i) clogP 5; (ii) Molecular weight (MW) 500 g/mol; (iii) Number of hydrogen bond acceptors (HBA) (sum of N and O atoms) 10, (iv) Number of hydrogen bond donors (HBD) (sum of OH and NH groups) 5, were selected (Lipinski, 2004) . The number of rotatable bonds (nRotb) 10, were also considered for the further toxicity and molecular docking analysis. The prediction of toxicological potential of these compounds was performed using the pKCSM on-line platforms (Pires et al., 2015) . The AMES toxicity, hepatotoxicity, and immunotoxicity parameters were also used to filter out compounds. 2.3. Molecular docking of 1,2,3-triazole-phthalimide hybrids on SARS-CoV-2 target proteins The molecular docking of selected compounds on SARS-CoV-2 target proteins was carried out using iGemDock 2.1 and AutoDock 4.2 software. The main protease (M pro ), spike (SP) and viral nucleocapsid (NCP) proteins were chosen according to the stereochemical parameters provided by Ramachandran analysis, using the PROCHECK server (Laskowski et al., 2006) . Molecular docking was performed based on the confrontation between optimized ligands structure and biomacromolecules from SARS-CoV-2. The crystal structure of viral proteins was obtained from the Protein Data Bank (PDB) under the codes: spike: 6VSB, in its open (6VYB) and closed states (6VXX) (Walls et al., 2020; Wrapp et al., 2020) ; nucleocapsid protein, 6VYO (Yadav et al., 2020) and M pro , 5R80 (Razzaghi-Asl et al., 2020) . The validation of target protein-ligand complex structures was performed using the co-crystallized standard ligand of target proteins to ensure the virtual screening process. For this, M pro was redocked with its co-crystalized ligand RGZ: methyl 4-sulfamoylbenzoate. For each ligand-protein complex 10-docking poses were generated using Lamarckian Genetic Algorithm (L opez-Camacho et al., 2015). The pose with the lowest binding energy was selected as the final docking result. The interactions of the target-ligand complex were analysed and rendered using Discovery Studio v.20.1.0.19295. To calculate the interaction energy between receptors and ligands, the complexes obtained by molecular docking were used as an input for the molecular fractionation method with conjugated caps (MFCC). For this purpose, a convergence radius of 15 Å was defined (having its origin in the geometric center of the ligands). All the residues with at least one atom inside the spherical volume were taken into account. All atomic positions were kept fixed, with the exception of hydrogen atoms, which were optimized using consistent valence force field (CVFF). Subsequently, simulations within the Density Functional Theory formalism using the Local Density Approximation (DFT/LDA) were carried out using the DMOL3 code. To obtain the interaction energy between each eligible amino acid and the ligands, a fractionation was applied, respecting Eq. (1): where EðL À R i Þ represents the interaction energy between the ligand L and the amino acid Ri, where cap C ðiÀ1Þ ans C ðiþ1Þ is arranged including the predecessor amino acid and posteriorly Figure 1 . The chemical structures of 1,2,3-triazole-phthalimide bioactive derivatives obtained from SciFinder database. R 1 : 4-methylbenzyl 2-methylbenzyl, 4methoxybenzyl, 2-fluorobenzyl, 3-fluorobenzyl, 4-fluorobenzyl, 2-chlorobenzyl, 3-chlorobenzyl, 2,3-dichlorobenzyl, 3,4-dichlorobenzyl, 2-bromobenzyl, 4-bromobenzyl, 2-nitrobenzyl; R 2 : 2-(methylsulfanyl)-1,3-benzothiazole or phenyl; R 3 : carbohydrate or phthalimide; R 4 : 4-methyl, 4-chloro, H, 4-bromo, 3-nitro, 4-methoxy, 2chloro, 3-chloro, 4-fluoro, 2-methoxy, 3-methoxy, 2-methyl; R 5 : Methyl or H; R 6 : methylbutyl, methyl and 2-methylpropyl; R 7 : 4-fluorophenyl, p-tolyl, H, 1-pentafluorophenyl, 4-(trifluoromethyl)phenyl, 1-benzyl, 1-phenyl, 4-methoxyphenyl, 2,4-difluorophenyl, 2,6-difluorophenyl, 1-perfluorophenyl, 2-fluorophenyl, 4-pyridinyl; R 8 : H, 4-methoxybenzyl, 2-fluorobenzyl, 4-fluorobenzyl, 4-chlorobenzyl, 2,6-dichlorobenzyl, 3-bromobenzyl, 4-bromobenzyl, 2-nitrobenzyl, 3-nitrobenzyl, 4-nitrobenzyl. respectively of R i : The term EðL À C ðiÀ1Þ R i C ðiþ1Þ Þ is the total energy of the system consisting of the ligand and the residue with its caps; the E C iÀ1 is the total energy of the waste and the isolated caps EðL À C ðiÀ1Þ R i C ðiþ1Þ Þ the total energy of the system formed by the ligand and only the caps; and finally, EðC ðiÀ1Þ C ðiþ1Þ Þ the energy of the system composed by the caps. The total energy of the ligand interaction with the entire receptor is obtained by adding the individual energy of each eligible amino acid, as in Eq. (2). where N is the number of eligible amino acids. Molecular dynamics and simulation (MDS) studies were carried out in order to determine the stability and convergence of E38 and E40 molecules complexed with M pro . To set up the simulation, initially, the systems were built for complexes of E38-M pro and E40-M pro , respectively, in a system builder. For this purpose, Desmond 2018-4 was used to set up the initial parameters within an explicit SPC water model and placed in the orthorhombic box 4.0 Â 4.0 Â 4.0 Å. All the protease-ligand complexes were neutralized with NaCl by adding 0.15 M Na þ ions. ASL module was used to select the specific residues of ligand and protein molecule. The prepared systems were relaxed using Desmond default protocol of relaxation (Khan et al., 2020) . MDS run of 20 ns was set up at constant temperature and constant pressure (NPT) for the final production run. The NPT ensemble was set up using Nos e-Hoover chain coupling scheme (Dayer et al., 2017) at temperature 300 K for final production and throughout the dynamics with relaxation time 1 ps. RESPA integrator was used to calculate the binding interactions for a time step 2 fs (Kaiser, 2005) . All the other parameters were associated in the settings as described elsewhere (Chowdhury, 2020) . After the final production run, the simulation trajectories of main protease complexed with E38 and E40 molecules were analyzed for final outcome of RMSD, RMSF and number of hydrogen bonds formed from the simulation. Binding energies of the complexes were calculated using MM-GBSA (Genheden & Ryde, 2015) for every 1 ns trajectory up to 20 ns and the average binding energies with standard deviations were measured for accurate binding approximation and stability described elsewhere (Genheden & Ryde, 2015) . In silico screening of compounds against new coronavirus targets is considered a useful strategy for the detection of molecules with great probability of presenting good pharmacokinetic properties and in vitro and in vivo low toxicity . Thus, we used in silico approaches to predict the pharmacokinetics and the toxicological potential of 101 phthalimide-1,2,3-triazole derivatives, aiming to identify leader compounds against SARS-CoV-2 ( Figure 1 ). These compounds were categorized according to the radicals attached to the 1,2,3-triazole-phthalimide nucleus into groups A-H, as showed in supplementary material (Tables S1-S8) For hit compounds selection, molecules from the abovementioned series, that violated the Lipinski's rule of five (RO5) and therefore, those which have poor druglike properties, were filtered out. The lipophilicity and solubility are the key molecular properties for drug absorption. Lipinski's rule of five is a rule of thumb that describes the druggability of a given molecule. Our pharmacokinetic analysis (supplementary material, Tables S9 and S10) revealed that 44 out 101 compounds violated at least one of the Lipinski's rules. Out of these, 11 derivatives were predicted to have low gastrointestinal absorption (GIA). Although the compounds B03 and B04 did not violate the RO5, they showed a low GIA profile and were also rejected as hit compounds. Thus, 57 compounds were further submitted to ADMET analysis (Table 1) . The high oral bioavailability, preconized by Lipinski's rules, is one of main desirable characteristics of a druggable compound and an important factor for the optimization of bioactive molecules as therapeutic agents. Oral route is one of the preferred pathways of drug administration due to its unique advantages, including sustained and controllable drug delivery, easy administration and high patient compliance (Ekins et al., 2010; Homayun et al., 2019) . Passive intestinal absorption (associated with low MW), reduced molecular flexibility (measured by NRB), low TPSA or total hydrogen bond counts (HBA and HBD) are important predictors of good oral bioavailability (Brito, 2011) . A compound having CLogP ranging 0.5 to 3.5 and molecular mass < 500 kDa is considered a druggable candidate (Tetko et al., 2016) . The presence of more than two heterocycles groups in the same molecule can result in compounds with high molecular weight and LogP values. These characteristics can be observed in most of the compounds belonging to the group E (1,2,3-triazole-benzoimidazole-phthalimide derivatives) and in compounds G07 and G08 (1,2,3-triazol-benzamide-phthalimide), which are therefore excluded from our study. Independently of molecular weight, compounds with high probability of good oral bioavailability have no more than 5 hydrogen bond donors and no more than 10 hydrogen bond acceptors. According to these latter criteria, our predictive analysis showed that the C03 and C02, D05, G09, G10, and G11 have low permeability throughout biological barriers. The blood-brain barrier (BBB) protects the central nervous system (CNS) preventing certain substances (mostly harmful) from entering the brain tissues. The BBB limits the passage of most of the external compounds to maintain CNS at a steady state (Dur an-Iturbide et al., 2020; Miao et al., 2019) . In addition, P-glycoprotein (P-gp) actively transports a wide variety of compounds out of cells. This protein is highly associated with the ADMET properties, playing a major role in the multidrug resistance (MDR) phenomenon (Li et al., 2014) . Our ADMET analysis showed that 15 out 57 compounds are predicted to cross the BBB, and 27 compounds were identified as a potential substrate of P-gp. Total Clearance (TC) is an important parameter associated with both the half-life and bioavailability of a xenobiotic compound. This parameter has a direct impact in determining the dose regimen (how often) and dose size (how much) of a given drug. Thus, its prediction helps us to determine the feasibility of clinical dosing and provides a framework for the starting dose in in vivo studies (Dur an-Iturbide et al., 2020). In this work, the TC values of 1,2,3-triazoles-phtalimide derivatives varied from À0.26 to 0.85 log (mL/min/kg), for H2 and E37 derivatives, respectively. Higher values of TC suggest that the xenobiotic is removed rapidly from the body, whereas a low clearance value indicates slower removal. 3.2. Prediction of toxicological potential of 1,2,3triazole-phtalimide compounds Poor pharmacokinetics and adverse side effects, due to compound toxicity, are pointed as the main causes of late-stage failures in drug development (Li et al., 2019) . In this regard, in silico predicted toxicological potential of the above-mentioned compounds was investigated. Our results showed that 15 out 57 compounds were positive for AMES and 46 compounds presented potential to be hepatotoxic. Of these, 10 compounds were both positive for AMES and hepatotoxic. The analysis of 8 selected compounds, that were either negative for AMES test or presented no hepatotoxic potential, revealed a predicted LD 50 for acute toxicity in rats ranging from 1,888 to 2,980 mol/kg. The minimum value for oral chronic toxicity (ORCT) was 0.160 and the maximum value was 2,046 mg/kg/day. Besides presenting positive score for AMES test and hepatotoxicity the compound A09 also showed potential to cause adverse effects on the skin (Table 2) . In silico predictive toxicological evaluation is easy to perform, cost-effective, high-throughout alternative to in vitro assays and, even replaces time-consuming, expensive in vivo experiments (Myatt et al., 2018) . Derivatives E26, E27, E31, E33, and E37 showed a positive predictive result for the AMES test, a method that assesses the mutagenic potential of chemical compounds (Kauffmann et al., 2020) . The absence of a methyl at the carbon 5 of phthalimide ring and the presence of 1H-benzo[d]imidazole group may be related to the toxic effect of these compounds. In addition, changes in benzimidazole ligands can modulate their binding to deoxyribonucleic acid and the selectivity of these compounds towards the host genetic material (G€ um€ uş et al., 2009) . Besides the mutagenic potential, another important variable taken in account in our predictive analysis was the evaluation of the hepatotoxic potential of selected phthalimide-1,2,3-triazole derivatives. Although the compounds A01, A02, A04, A06, A09, A10, A11 and A12, that have phthalimide-1,2,3-triazol-phenyl as the main structural group, were negative for AMES test, they showed important hepatotoxic potential. The presence of the methyl-benzyl group linked to 1,2,3-triazole-phthalimide nucleus, as well as the insertion of binders such as fluorine and bromine, may be related to the deleterious potential of these compounds. Interestingly, the compounds from the same series that did not show a predictive hepatotoxic effect, contained chlorine at the carbon 2 or 3 of phenyl group. However, this same result was not observed when 2,3-dichloro or 3,4-dichloro was present. Like the derivatives from the A series, the molecules E25, E45, G05, and G07 that also presented predictive hepatotoxic potential, have substituted halogenated groups in their structures. In order to better understand the structure-toxicity relationship of the compounds containing halogen atoms, further evaluation of other chemical-structural parameters is required (Kortagere et al., 2008) . The derivatives B01, B02, B06, B07, and B08 also showed positive predictive results for hepatotoxicity. The difference in the number of carbons in the aliphatic chain between phthalimide and triazole, also present in the molecules C04, C05 and C06, H01 and H02, could explain, at least in part, the predicted hepatotoxic potential found for compounds belonging to this series. Consistently, the number of carbons present in the aliphatic chain, that joins pharmacophoric groups in hybrid derivatives of 1,2,3-triazole-phthalimide, has been associated with the occurrence in vivo hepatotoxicity (Da Silva et al., 2019) . The presence of the benzothiazole group in compounds of the E series did not appear to be determinant for the predicted hepatotoxicity of these molecules, since other triazole-phthalimide derivatives containing the same group, like E26, E27, E38, E31 and E33 did not present hepatotoxic potential. Heterocyclic molecules containing the benzothiazole group have shown to be more promising drug candidates to treat infectious diseases, reinforcing the pharmacological potential of benzothiazole-containing molecules (Papadopoulou et al., 2013) . It has been reported that molecules containing thalidomide group in its structure, as those of series F, presented high hepatotoxic potential (Kamiya et al., 2020) . In this regard, only the derivatives A7, A8, B5, E35, E38, E39, and E40 (Figure 2) showed good pharmacokinetic and toxicological profiles, and were therefore chosen for molecular docking. Computational simulations of ligand-protein docking are an important component of drug discovery, being mainly used for the virtual screening of hit compounds from large databases and to identify and evaluate the effects of chemical changes during leader optimization (Bordogna et al., 2011) . In this regard, the accuracy of modeled structures should be taken into account. Our stereochemical analysis of SAR-CoV-2 target proteins, obtained from PDB ( Figures S1-S5) , showed that the percentage of the sum of amino acid residues in the most favoured and additional allowed regions was ! 95%. This result confirmed the high quality of our selected target models (Laskowski et al., 2006) . In order to validate our molecular docking, the co-crystal standard ligand (RGZ, methyl 4-sulfamoylbenozate) of M pro complex (5R80) was redocked against this protein and the root-mean-square deviation (RMSD) was calculated for predicting the stability of the protein and protein-ligand complexes. Our results showed that Autodock program was able to generate a successful pose (RMSD ¼ 1.9 Å) ( Figure S6 ). It is assumed that scored poses with an RMSD of less 2.0 Å are considered to be successful (Bordogna et al., 2011; Razzaghi-Asl et al., 2020) . The docking analysis using iGemDock software revealed that the derivatives A7, A8, B5, E35, E38, E39, and E40 showed favourable interactions energies for all the tested proteins from SARS-CoV-2 (Table 3) . E38 presented higher binding energy valued for spike protein in its closed state (Spike-C) (À116.6 kcal/mol) and spike protein in prefusion conformation (À107.1 kcal/mol). The higher binding energy value for main protease (M pro ) was found for its interaction with E38 (À117.2 kcal/mol). Among the tested compounds, E39 presented highest affinity to RNA binding domain of nucleocapsid phosphoprotein with binding energy of À108.8 kcal/mol. For all ligand-protein complexes, the interaction energies were predominantly due to Van der Waals, followed by hydrogen bonds. No electrostatic interactions were observed for all interactions of compounds with SARS-CoV-2 proteins. The use of molecular docking has been largely applied as a strategy for the selection of possible leader candidates against the new coronavirus (Karypidou et al., 2018) . Since the SARS-CoV outbreak in 2002, in silico molecular docking and structural studies have demonstrated a relationship between the SARS-CoV-2 spike receptor-binding domain and the angiotensin-2 receptor (ACE2) in the host cells. This association is strictly linked to the infection and transmission of new corona virus in humans (Utomo & Meiyanto, 2020) . The wide distribution of the ACE2 receptor in human tissues and organs could be related to the appearance of symptoms in the severe phase of the disease, mainly in the pulmonary system, where it is largely expressed at the surface of type II alveolar cells (Sanchis-Gomar et al., 2020) . In our analysis we docked the selected compounds on different conformational states of SARS-CoV-2 spike protein. Overall, our results showed that all the selected compounds bound with high affinity to spike proteins independently of its conformational state (open, close, or in prefusion conformation). However, slight differences in binding energy values were observed. The lowest binding energy value found for the interaction of E38, and E35 derivatives with different conformational states of spike protein, revealed the binding plasticity of 1,2,3-triazole-phthalmide derivatives towards this protein. Liu et al. (2004) described that the inhibition of peptides in the heptad repeat regions of the SARS-CoV spike protein results in the interference of the fusogenic mechanism. In this regard, it is possible that the selected Top-7 derivatives may interfere with the SARS-CoV-2 binding, entrance and replication processes in the host cells. The predictive analysis of the inhibition constant values for best binding pose of protein-ligand complexes was performed using AutoDock software (Tables S11-S17). Our results showed that A7, E35, E38 and E40 presented higher binding affinity for at least one of the viral target SARS-CoV-2 proteins, with inhibition constant values (Ki) in the nanomolar range. All of these compounds showed high affinity for SARS-CoV-2 main protease. Comparing all ligand-protein interactions tested, the lowest values of inhibition constant (Ki) were found for the interaction of the compound E38 with M pro with Ki ¼ 23.85 nM and binding energy of À10.4 kcal/mol for the best pose. The second-best value of Ki was obtained for the E40-M pro complex with Ki ¼ 30.04 nM and a binding energy of À10.26 kcal/mol. Compound E40 also bound with high affinity to nucleocapsid protein, with Ki ¼ 91.09 nM and binding energy of À9.6 kcal/mol (Figure 3 ). In addition to the spike protein, coronavirus proteases are also essential for viral transmission and virulence by processing viral proteins involved in viral replication (B aez-Santos et al., 2015) . Compounds that act as inhibitors of these proteins can be considered as promising antiviral agents by reducing the severity of the infection (Hall & Ji, 2020) . All the top-7 compounds bound with high affinity to SARS-CoV-2 main protease, being E38 and E40 the compounds that presented the highest affinity (i.e. the lowest binding energy) towards this target. The analogous performance of these compounds toward M pro can be attributed to chemical-structural similarities of the two derivatives, which are only distinguished from each other by the presence of the phenyl and pyridin-4-yl groups in the N1 of triazole ring. Molecules containing 1,2,3-triazole group have already been described as having anti-coronavirus activity, by interacting with viral proteases, corroborating the relevance of these proteins as the main target of these compounds (Karypidou et al., 2018) . The main amino acids involved in E40-SARS-CoV-2 binding were MET165, MET49, HIS41, CYS145, GLU166 and GLN189 (M pro - Figure 4) and ALA134, GLY69, VAL133, TRP132, ILE130, ALA125, ILE131, LYS65, PHE66, ARG68 and TYR123 (NCP- Figure 5 ). The most prevalent interaction was Van der Waals and hydrogen bonds. The electrostatic binding energy values were insignificant when compared to the contributions of van der Waals and hydrogen bonds. In this study we also docked the Top-7 1,2,3-triazolephthalimide derivatives on viral nucleocapsid protein. NCPs have been considered strategic targets to fight acute respiratory infections caused by SARS-CoV-2 due to their multiple functions in the viral replication cycle (Kang et al., 2020) . All the compounds tested bound with high efficiency on NCP. Comparing the best-pose of selected molecules (1/10) for the NCP, the E40 presented the most favorable binding energy (À9.6 kcal/mol) and was the only one that achieved an inhibition constant in the nanomolar range for this target. The interaction of these compounds with nucleocapsid proteins may interfere with essential functions of the viral cycle, such as the formation of the helical ribonucleoprotein complex, during the packaging of the viral genome, and the control of basic cellular functions in the host cell, like the deregulation of the cell cycle (Cong et al., 2020; Surjit et al., 2006) . Although further in vitro analysis are needed to better understand the biological effects of the interaction of E40-SARS-CoV-2 nucleocapsid proteins complex, the high predicted binding affinity of this compound to these proteins suggests that E40 may prevent nucleocapsid proteins from participating in the replication and release of viral particles (Zeng et al., 2008) . A characteristic of RNA viruses is the high rate of genetic mutation, which can result in the evolution of new viral strains and the inefficacy of treatment. Therefore, from a public-health perspective, the ability of our top-7 selected 1,2,3-triazole-phthalimide derivatives to bind to different viral proteins, may circumvent the drug resistance caused by single-target mutations or rare simultaneous mutations of several targets in different positions . Although some compounds were excluded from our study, it is important to take in mind that these molecules can be redesigned for more rational chemical-structural modifications, decreasing its toxic potential and/or improving their biological activity against target molecules. Because our molecular docking analysis has pointed the compound E40 as the most promising compound, the best complexes obtained between this compound and the targets were subjected to quantum energy calculations by the MFCC technique. The results of the interactions between E40 and amino acids of the RNA binding domain of nucleocapsid phosphoprotein (PDB-ID 6VYO) are described in Figure 6 and Table 4 . Overall, 30 amino acids of NCP showed some kind of interaction with E40, 11 showing repulsion and 19 attraction with the ligand. The PRO67 residue presented energy value of 0.00. Consequently, it cannot be confirmed whether the interaction occurring with the ligand is repulsive or attractive. The general energy balance points to a stronger attraction (À8.03 eV) between the connection site of protein and the E40 compound. The residues GLY85, TYR86, TYR87, LEU121, PRO122, TYR123 and GLY124 presented very low energies (negative values), suggesting very strong attractions by these amino acids, which are probably the anchor residues. Figure 7 and Table 5 show MFCC results between E40 and the open state of SARS-CoV-2 spike ectodomain structure (PDB-ID 6VYB). A total of 45 residues of spike-O showed energetic interactions with the drug, 13 of which are interacting repulsively and 32 attractively. The residues VAL1040, ASP1041, LEU1049, VAL1068, ASN907, and LYS1038 demonstrated a strong attraction for the ligand, possibly acting as anchoring residues. Figure 8 and Table 6 describe the results of the MFCC for interactions between spike glycoprotein in closed state (PDB-ID 6VXX) and E40. There were 32 amino acids of spike protein interacting with E40, 5 with positive energies (repulsion), 26 with negative energies (attraction) and one with 0.0 energy (ASN394). The amino acids that stood out for having a strong attraction were: ALA522, THR523, VAL524 and PRO561. Although the structures obtained at inputs spike-C and spike-O represented different conformational states of the same structural protein of SARS-CoV-2 (spike glycoprotein), the compound E40 was strongly attached to both conformations but in different places. The results obtained for spike glycoprotein with a single receptor-binding domain up (6VSB) are shown in Figure 9 and Table 7. A total of 32 residues of spike glycoprotein (with a single receptor-binding domain up) residues, interacted with the E40 ligand, 6 of which presenting repulsion and 26 attraction interactions, with emphasis on the amino acids GLY548, THR549, GLY550, MET740 and SER750, which showed strong attraction for the compound. Figure 10 and Table 8 showed the MFCC results for E40 interactions with the main protease in complex. There are 49 amino acids of spike glycoprotein (with a single receptor-binding domain up) residues interacting with E40, 17 with slightly repulsive interactions, 3 with 0.0 energy and 29 with negative energy values. The CYS145, CYS44 and CYS85 amino acids residues, showed surprising attraction to E40, with energy values of À42.31 kcal/mol, À43.68 kcal/mol and À50.42 kcal/mol, respectively. These cysteine amino acids have already been identified as anchor residues in other studies on SARS-CoV-2 main protease (Komatsu et al., 2020) . Taking in account the general energy balance, this protein also showed the lowest energy value in this study (À8.63 eV). In a viral pandemic with rapid dispersion of cases, drug replacement strategy is promising to fight the disease, because substances that are commercially available in the pharmaceutical industry have already undergone several toxicity and safety tests. In chemistry, when atoms are held together, the energy needed to separate them is called binding energy. In covalent bonds, where the electrons are shared between atoms, the energy needed to break these bonds can vary, but energies less than À1.0 eV (Tkatchenko & Scheffler, 2009 ) suggest covalent bond among atoms. E40 showed lower energy values in all analyses. Interestingly, but not surprisingly, E40 showed affinity with all the three spike conformers (6VYB, 6VXX and 6VSB). Although, in this study, the ligand interacted with all Spike conformers, the anchorage residues were different in each interaction. This raises some concerns: would the E40 function as an inhibitor of the Spike-ACE2 interaction in any of the three conformers? Is there a possibility of allosteric inhibition by the E40? Although our results suggest a drug repositioning for COVID-19, further in vivo and in vitro studies are needed to answer these questions. The final convergence and the stability of E38 and E40 bound with the main protease was assessed by MD simulation. The analysis of RMSD and RMSF plots showed that after 20 ns of convergence, E3, bound to the main protease, displayed a significant stability ( Figure 11A and 11B) . The interactions of this protein with E38 displayed much less RMSD differences ($0.25 Å) showing its stable conformation ( Figure 11A ). The average of H-bonds formed between E38 and the Figure 12 : Surface view of main protease before MD simulation (orange) displaying little modifications in the E38-M pro binding site (yellow ring) after simulation (cyan teal). Lower panel depicted the conformational differences between the structures of E38 in the binding site provided orientation that is more accurate for higher binding. main protease showed to be 1.0 throughout the simulation period for the stable conformation of both the main protease and E38 molecule ( Figure 11C ). The structural superimposition of the main protease-E38 complex before and after simulation displayed less significant changes in the overall conformation ( Figure 11D ). In addition, accommodation of E38 at the binding cavity of this protein showed smallaltered orientation after simulation, which seems to be due to a change in the visible secondary structure at the E38-M pro binding site ( Figure 12 ). After 20 ns of convergence, the E40-M pro complex displayed significant stability as demonstrated by RMSD and RMSF plots ( Figure 13A and 13B) . The interactions of this protein with E40 displayed much less RMSD differences ($0.50 Å) conferring its stable conformation ( Figure 13A ). The average of H-bonds formed between E40 and the main protease showed to be 1.5 throughout the simulation period, for the stable conformation of both the main protease and the E40 molecule ( Figure 13C ). Structural superimposition of the main protease-E40 complex before and after simulation displayed less significant changes in the overall conformation ( Figure 13D ). In addition, accommodation of E40 at the binding cavity of M pro showed small-altered orientation after the simulation which appeared to be due to the change in the visible secondary structure at the E40-M Pro binding site ( Figure 14) . The free energies of binding using MMGBSA were calculated for the E38 and E40-M pro complexes of SARS-CoV-2. The properties of MMGBSA calculations are displayed in Table 9 . The free energy avarage of binding for every 1 ns trajectory up to 20 ns, for the main protease-E38 complex displayed a dG ¼ À63.47 kcal/mol, with standard deviation of 3.0 kcal/mol, coulombic force of À8.70, solvent accessibility of 20.20 and high ligand binding affinity of À1.81. On the other hand, the E40-main protease complex displayed a dG ¼ À63.31 with standard deviation of 7.0 kcal/mol and high binding efficiency, solvent accessibility and significant coulombic as well covalent energies. All the properties calculated in the MD simulation point the significant role of E38 and E40 compounds as a potent inhibitor of the main protease of SARS-CoV-2. Taken together, our results showed that the seven selected 1,2,3-triazole-phthalimide derivatives showed an excellent predicted pharmacokinetic and ADME properties, which are directly related to the solubility, permeability and toxicological profiles of such compounds. The in silico docking results showed that these selected compounds are potential multi-target ligands of essential SARS-CoV-2 spike, protease and nucleocapsid proteins suggesting the usefulness of 1,2,3-triazole-phthalimide derivatives for the development of drugs addressed to these proteins. Because the E40 presented predicted inhibition constant values in the nanomolar concentration range for both main protease and nucleocapsid proteins, this compound was considered the most promising as multi-target agent against SARS-CoV-2. Our MD simulation corroborated E40 and E38 as potent inhibitors of M pro of SARS-CoV-2 However, we cannot rule out the possibility that other compounds, such as E38 can be included in further investigations alone or in combination with E40. Vanderlan Nogueira Holanda would like to thank Fundação de Amparo a Ciência e Tecnologia do Estado de Pernambuco (FACEPE) for the graduate scholarship (IBPG-0333-2.08/18). The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. Figure 14 . Surface view of main protease before MD simulation (green) displaying little modifications in the E40-M pro binding site (yellow ring) after simulation (cyan teal). Lower panel represented the conformational differences between the structures of E40 in the binding site provided orientation that is more accurate for higher binding. Amino acid derivatives. Part 6. Synthesis, in vitro antiviral activity and molecular docking study of new Na-amino acid derivatives conjugated spacer phthalimide backbone Synthesis and anti-inflammatory activity of new alkyl-substituted phthalimide 1H-1,2,3-triazole derivatives Design and synthesis of triazolephthalimide hybrids with anti-inflammatory activity The SARS-coronavirus papain-like protease: Structure, function and inhibition by designed antiviral compounds A summary of viral targets and recently released PDB IDs of SARS-CoV-2 Predicting the accuracy of protein-ligand docking on homology models Pharmacokinetic study with computational tools in the medicinal chemistry course pKa-critical interpretations of solubility-pH profiles: PG-300995 and NSC-639829 case studies Prediction of the 2019-nCoV 3C-like protease (3CLpro) structure: Virtual screening reveals velpatasvir, ledipasvir, and other drug repurposing candidates In silico investigation of phytoconstituents from Indian medicinal herb 'Tinospora cordifolia (giloy)'against SARS-CoV-2 (COVID-19) by molecular dynamics approach Nucleocapsid protein recruitment to replication-transcription complexes plays a crucial role in coronaviral life cycle Lopinavir; a potent drug against coronavirus infection: Insight from molecular docking study silico ADME/Tox profiling of natural products: A focus on BIOFACQUIM Evolving molecules using multi-objective optimization: Applying to ADME/Tox The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities Synthesis, cytotoxicity, and DNA interactions of new cisplatin analogues containing substituted benzimidazole ligands A search for medications to treat COVID-19 via in silico molecular docking models of the SARS-CoV-2 spike glycoprotein and 3CL protease Challenges and recent progress in oral drug delivery systems for biopharmaceuticals Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet Science resources. Chemists want NIH to curtail database Determination and prediction of permeability across intestinal epithelial cell monolayer of a diverse range of industrial chemicals/drugs for estimation of oral absorption as a putative marker of hepatotoxicity Crystal structure of SARS-CoV-2 nucleocapsid protein RNA binding domain reveals potential unique drug targeting sites SMIREP: Predicting chemical activity from SMILES Synthesis, biological evaluation and molecular modeling of a novel series of fused 1,2,3-triazoles as potential anti-coronavirus agents Alternative type of Ames test allows for dynamic mutagenicity detection by online monitoring of respiration activity Targeting SARS-CoV-2: A systematic drug repurposing approach to identify promising inhibitors against 3C-like proteinase and 2 0 -O-ribose methyltransferase Drug binding dynamics of the dimeric SARS-CoV-2 main protease, determined by molecular dynamics simulation Halogenated ligands and their interactions with amino acids: Implications for structure-activity and structure-toxicity relationships PROCHECK: Validation of protein-structure coordinates New 1,2,3-triazole-based analogues of benznidazole for use against Trypanosoma cruzi infection: In vitro and in vivo evaluations ADMET evaluation in drug discovery. 13. Development of in silico prediction models for P-glycoprotein substrates Current trends in drug metabolism and pharmacokinetics Lead-and drug-like compounds: The rule-of-five revolution Interaction between heptad repeat 1 and 2 regions in spike protein of SARS-associated coronavirus: Implications for virus fusogenic mechanism and identification of fusion inhibitors Solving molecular flexible docking problems with metaheuristics: A comparative study 1, 4-and 1, 5-di (Nphthalimidomethyl)-1, 2, 3-triazoles: Crystal structures and density functional theory studies of the alkyne and azide precursors Mercury CSD 2.0-new features for the visualization and investigation of crystal structures Substituted adamantylphthalimides: Synthesis, antiviral and antiproliferative activity Hydroxychloroquine or chloroquine with or without a macrolide for treatment of COVID-19: A multinational registry analysis Biological and epidemiological trends in the prevalence and mortality due to outbreaks of novel coronavirus COVID-19 Improved classification of blood-brain-barrier drugs using deep learning silico toxicology protocols. Regulatory Toxicology and Pharmacology Recent trends in click chemistry as a promising technology for virus-related research Novel 3-nitro-1H-1,2,4-triazole-based piperazines and 2-amino-1,3-benzothiazoles as antichagasic agents Synthesis, antitubercular evaluation and molecular docking studies of phthalimide bearing 1, 2, 3-triazoles The clinical benefit of instituting a prospective clinical communityacquired respiratory virus surveillance program in allogeneic hematopoietic stem cell transplantation pkCSM: Predicting small-molecule pharmacokinetic and toxicity properties using graphbased signatures Identification of a potential SARS-CoV2 inhibitor via molecular dynamics simulations and amino acid decomposition analysis Synthesis of 1, 2, 3-triazole 'click'analogues of thalidomide New phthalimide-benzamide-1, 2, 3-triazole hybrids; design, synthesis, a-glucosidase inhibition assay, and docking study Angiotensin-Converting Enzyme 2 and antihypertensives (angiotensin receptor blockers and angiotensin-converting enzyme inhibitors) in Coronavirus Disease Synthesis and bioactivity of phthalimide analogs as potential drugs to treat schistosomiasis, a neglected disease of poverty Optimization of parameters for semiempirical methods II Design, synthesis and structure-activity relationships of 4-phenyl-1H-1,2,3-triazole phenylalanine derivatives as novel HIV-1 capsid inhibitors with promising antiviral activities The nucleocapsid protein of severe acute respiratory syndrome-coronavirus inhibits the activity of cyclin-cyclin-dependent kinase complex and blocks S phase progression in mammalian cells Phthalimide-1, 2, 3-triazole hybrid compounds as tyrosinase inhibitors; synthesis, biological evaluation and molecular docking analysis Prediction of logP for Pt(II) and Pt(IV) complexes: Comparison of statistical and quantum-chemistry based approaches Efficient click chemistry towards novel 1H-1,2,3-triazole-tethered 4H-chromene-d-glucose conjugates: Design, synthesis and evaluation of in vitro antibacterial, MRSA and antifungal activities Accurate molecular van der Waals interactions from ground-state electron density and free-atom reference data Revealing the potency of citrus and galangal constituents to halt SARS-CoV-2 infection Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation Discovery of multitarget-directed ligands against influenza a virus from compound yizhihao through a predictive system for compoundprotein interactions Virtual screening and dynamics of potential inhibitors targeting RNA binding domain of nucleocapsid phosphoprotein from SARS-CoV-2 Chemical composition and pharmacological mechanism of Qingfei Paidu Decoction and Ma Xing Shi Gan Decoction against Coronavirus Disease 2019 (COVID-19): In silico and experimental study The nucleocapsid protein of SARS-associated coronavirus inhibits B23 The use of anti-inflammatory drugs in the treatment of people with severe coronavirus disease 2019 (COVID-19): The experience of clinical immunologists from China The nucleocapsid protein of severe acute respiratory syndrome coronavirus inhibits cell cytokinesis and proliferation by interacting with translation elongation factor 1alpha Structural characterization of the C-terminal domain of SARS-CoV-2 nucleocapsid protein