key: cord-0992846-85da3xce authors: Shu, Chuan-Jun; Huang, Xuan; Tang, Hui-Hao; Mo, Ding-Ding; Zhou, Jian-Wei; Deng, Cheng title: Mutations in spike protein and allele variations in ACE2 impact targeted therapy strategies against SARS-CoV-2 date: 2021-03-18 journal: Zool Res DOI: 10.24272/j.issn.2095-8137.2020.301 sha: 31266ebb1be03c7f543fba5abc885d51788e8598 doc_id: 992846 cord_uid: 85da3xce Coronavirus disease 2019 (COVID-19), which is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has spread rapidly worldwide with high rates of transmission and substantial mortality. To date, however, no effective treatments or enough vaccines for COVID-19 are available. The roles of angiotensin converting enzyme 2 (ACE2) and spike protein in the treatment of COVID-19 are major areas of research. In this study, we explored the potential of ACE2 and spike protein as targets for the development of antiviral agents against SARS-CoV-2. We analyzed clinical data, genetic data, and receptor binding capability. Clinical data revealed that COVID-19 patients with comorbidities related to an abnormal renin-angiotensin system exhibited more early symptoms and poorer prognoses. However, the relationship between ACE2 expression and COVID-19 progression is still not clear. Furthermore, if ACE2 is not a good targetable protein, it would not be applicable across a wide range of populations. The spike-S1 receptor-binding domain that interacts with ACE2 showed various amino acid mutations based on sequence analysis. We identified two spike-S1 point mutations (V354F and V470A) by receptor-ligand docking and binding enzyme-linked immunosorbent assays. These variants enhanced the binding of the spike protein to ACE2 receptors and were potentially associated with increased infectivity. Importantly, the number of patients infected with the V354F and V470A mutants has increased with the development of the SARS-CoV-2 pandemic. These results suggest that ACE2 and spike-S1 are likely not ideal targets for the design of peptide drugs to treat COVID-19 in different populations. progression is still not clear. Furthermore, if ACE2 is not a good targetable protein, it would not be applicable across a wide range of populations. The spike-S1 receptor-binding domain that interacts with ACE2 showed various amino acid mutations based on sequence analysis. We identified two spike-S1 point mutations (V354F and V470A) by receptorligand docking and binding enzyme-linked immunosorbent assays. These variants enhanced the binding of the spike protein to ACE2 receptors and were potentially associated with increased infectivity. Importantly, the number of patients infected with the V354F and V470A mutants has INTRODUCTION The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) epidemic has spread rapidly and widely since the end of 2019 (Holshue et al., 2020) . The World Health Organization (WHO) declared coronavirus disease 2019 , which is caused by SARS-CoV-2, a global pandemic (https://www. who.int/) in March 2020. As of mid-January 2021, there have been 89 416 559 confirmed cases and 1 935 028 deaths, affecting approximately 223 countries. COVID-19 symptoms can range from mild, self-limited respiratory disease to severe progressive pneumonia, multiple organ failure, and death . To date, no specific therapeutic agents or sufficient vaccines that have been approved for COVID-19 therapy (Shi et al., 2020b) . Several therapies, such as remdesivir, favipiravir, inactivated vaccine, chloroquine, hydroxychloroquine, convalescent plasma, and neutralizing antibodies from convalescent plasma, are under investigation, but the therapeutic efficacy of these drugs and treatments is not yet known, and clinical trial results remain controversial (Beigel et al., 2020; Chen et al., 2020; Gao et al., 2020; Shen et al., 2020; Shi et al., 2020b; Wang et al., 2020a) . For example, several studies have shown that chloroquine, hydroxychloroquine, and convalescent plasma are potential therapeutic options for COVID-19 patients (Huang et al., 2020b; Shen et al., 2020; Wang et al., 2020b) , whereas other studies have indicated that these three therapies do not significantly shorten the time to clinical improvement when compared with standard treatments alone (Geleris et al., 2020; Li et al., 2020a) . Peptide drugs are widely applied in medical treatment due to their relative safety, low production complexity, and remarkable curative action. Consequently, they have garnered increasing interest as antiviral agents (Boas et al., 2019) . The design of peptide drugs usually begins with functional protein fragments (Mirochnik et al., 2009) . Research has shown that the SARS-CoV-2 spike protein binds to receptor angiotensin converting enzyme 2 (ACE2) for viral entry (Wrapp et al., 2020) . The receptor binding domain (RBD) that interacts with ACE2 is located at the S1 region, i.e., spike-S1 Wrapp et al., 2020) . ACE2 and spike-S1 are considered attractive targets for the development of antiviral agents against human coronaviruses, such as SARS-CoV-2 and SARS-CoV. Therefore, peptide-based drugs could potentially be utilized for the treatment of SARS-CoV-2 infection by targeting viral proteins (such as spikes), mimicking ACE2, and disrupting spike-ACE2 interactions. In theory, a functional fragment of ACE2 and spike-S1 could be utilized against SARS-CoV-2 to decrease the receptor binding of SARS-CoV-2 for COVID-19 therapy. However, amino acid substitutions in SARS-CoV-2 and ACE2 could be major challenges for the above strategy. Notably, amino acid substitutions in spike-S1 have the potential to generate SARS-CoV-2 variants with increased infectivity. Furthermore, comorbidities have significant indications regarding COVID-19 outcome (Zheng et al., 2020) . Therefore, comorbidities should be considered for drugs designed to block SARS-CoV-2 entry into cells or clear SARS-CoV-2. In this study, certain underlying diseases that appear to aggravate COVID-19 patient condition were explored based on clinical data. Analysis identified several problems associated with designing peptide drugs that target ACE2. Furthermore, similar to the allele frequency in spike interaction domains of ACE2, genomic analysis of SARS-CoV-2 and binding enzyme-linked immunosorbent assay (ELISA) of the receptor-ligand pair (ACE2-spike-S1 interaction) identified two spike-S1 point mutations with increased ACE2 binding and associated infectivity. These results suggest that mutations in ACE2 and spike-S1 may pose a challenge for the exploration of potential broad-spectrum peptide drugs for the treatment of COVID-19 in different populations. Clinical data, including age, sex, clinical features, computed tomography (CT) imaging, comorbidities, complications, treatment processes, and time information for the first 37 patients who died of COVID-19 in China, were downloaded from the public database of the Hubei CDC (Center for Disease Control) in China (http://www.hbcdc.cn/). The three-dimensional (3D) structures of the ACE2 and spike protein of SARS-CoV-2 were downloaded from the Protein Data Bank (PDB ID: 6VSB) (Wrapp et al., 2020) . The complex structure was determined by cryogenic electron microscopy (Cryo-EM). Coding variants in ACE2 and corresponding allele frequency differences between populations were obtained from the China Metabolic Analytics Project (ChinaMAP) , Genome Aggregation Database (gnomAD) (Lek et al., 2016) , 1000 Genomes Project (1KGP) (The 1 000 Genomes Project Consortium, 2015), Trans-Omics for Precision Medicine (TOP Med) Program (Taliun et al., 2021) , and United Kingdom 10K (UK10K) Project (The UK10K Consortium, 2015) . Whole-genome sequences of SARS-CoV-2, SARS, bat-CoV, and pangolin-CoV were downloaded from the China National Center for Bioinformation (CNCB)/Beijing Institute of Genomics (BIG) database (https://bigd.big.ac.cn/ncov), Global Initiative of Sharing All Influenza Data (GISAID) EpiFlu TM database, and National Center for Biotechnology Information (NCBI) database. According to gene locations, the nucleotide sequences of RBD regions of coronavirus and their corresponding amino acids were acquired via BioEdit v2.0 (Tippmann, 2004) . The allele frequencies of variants located in the spike-binding regions of ACE2 were summarized. The database variants that affect amino acid sequences for spike-binding regions of ACE2 were then identified. Synonymous (dS)/nonsynonymous substitution (dN) sites between coronaviruses were calculated by DnaSP and BioEdit (Rozas et al., 2017) . The dN and dS values were also calculated using DnaSP (Rozas et al., 2017) . The nucleotide/amino acid sequences of proteins and similarities between sequences were aligned using BioEdit. Unrooted tree topology based on multiple alignments of amino acids was established with the neighbor-joining method in MEGA 6.06 (Lewis et al., 1995) . Branching consistency was tested using bootstrap analysis with 500 resamplings of the data in MEGA 6.06. We numbered the residues in spike-S1 with the deletion of the signal peptide. We also numbered S2 starting with 1. The main reason is that we wanted to unify residues between the sequence and corresponding structure. SWISS-Model was used to construct the structure of spike-S1 mutants based on the wild-type SARS-CoV-2 spike-S1 protein structure (Waterhouse et al., 2018) . We used the ProtParam tool (https://web.expasy.org/protparam/) to analyze the physical and chemical parameters of proteins (Garg et al., 2016) . The largest possible binding pocket of these proteins (ACE2, spike-S1 and its mutants) was predicted by Discovery Studio v3.0 (Gao & Huang, 2011) . These predicted pockets were utilized to construct an initial coarse model of the spike-S1-ACE2 complexes. The complex structures were refined by Rosetta software (RosettaDock and FlexPepDock modules) (Rohl et al., 2004) . The final structure was obtained based on energy scores. The interaction scores were calculated by Rosetta. High-quality 3D images of proteins were drawn by PyMOL (Ordog, 2008) . The SARS-CoV-2 spike-S1 and related mutants were cloned into the pcDNA3.1 vector and C-terminally fused to a flag-histag. Two point mutants (V354F and V470A) were constructed by site-directed mutagenesis, and the vectors were transfected into HEK293 cells. The expressed proteins were secreted into the medium due to the presence of their own signal peptides. Proteins were purified via Ni-NTA columns. The eluted protein was dialyzed against phosphate-buffered saline (PBS) for ligand-receptor binding assays. The human ACE2 protein (Nouweisen Company, China) was immobilized onto a microtiter plate at 2 μg/mL overnight. The spike-S1 recombinant protein was added as a ligand at different concentrations (0 μg/mL to 8 μg/mL) and then incubated for 1.5 h at 37 °C to allow receptor-ligand interaction. Horseradish peroxidase (HRP) anti-FLAG tag antibody (Bioworld, China) (diluted 1:8 000) was added for 1 h after washing three times with PBS. After adding PBS washing buffer another three times, the signal was visualized and optical density at 450 nm (OD450) was recorded using a microtiter plate reader. Experiments were repeated independently at least three times. The results were analyzed using GraphPad Prism 5 (www.graphpad.com). Data were analyzed using GraphPad Prism 8. Statistical significance (P<0.05) between the means of two groups was determined using Dunnett's t-test or chi-square test. Most COVID-19 patients who died exhibited abnormal regulation of renin-angiotensin system ACE2, which plays a critical role in the regulation of the reninangiotensin system (RAS), is a functional receptor for coronaviruses, including SARS-CoV, Middle East respiratory syndrome coronavirus (MERS-CoV), and SARS-CoV-2 (Zheng et al., 2020) . To explore the therapeutic potential of ACE2 inhibitors for COVID-19 patients, clinical data on the first 37 COVID-19 patients who died in China were downloaded from publicly available data of the Hubei CDC in China. Clinical data consisted of age, sex, clinical features, CT imaging, comorbidities, complications, treatment processes, and time point information for disease progression. According to the comorbidity data, COVID-19 deaths were associated with hypertension, diabetes, cardiocerebrovascular diseases, atherosclerosis, liver cirrhosis, myxoma, Parkinson's disease, and chronic bronchitis (patients: 23/37, 62.16%). All COVID-19 patients that died usually had more than one comorbidity. Hence, the ratio of COVID-19 death patients with the top four comorbidities to all COVID-19 death patients was 19/37 (51.35%). The top four comorbidities were associated with an abnormal RAS ( Figure 1A ). This suggests that most COVID-19 patients who died had comorbidities, especially RASassociated comorbidities. To analyze the effects of abnormal RAS on COVID-19 patients, the patients were divided into three groups: i.e., G1 (COVID-19 patients with comorbidities associated with abnormal RAS, 19 patients), G2 (COVID-19 patients with comorbidities but normal RAS, four patients), and G3 (COVID-19 patients without comorbidities, 14 patients). The main treatments were oxygen inhalation, anti-infective therapy, and antiviral therapy. The main complications were respiratory failure and shock, with CT images also indicating ground glass opacity and infectious lesions in the lungs. These results indicated that the treatments, complications, and CT data in the three groups were similar (Supplementary Table S1 ). Therefore, the three groups were compared under four variables, i.e., age, days from symptoms to death, sex, and type of early symptoms. The average ages (mean±SEM (standard error of the mean)) for G1, G2, and G3 were 75.00±2.57, 70.50±4.44, and 65.29±3.18, respectively ( Figure 1B ). Days from symptoms to death (mean±SEM) for G1, G2, and G3 were 15.47±1.71, 16.50±2.06, and 15.43±2.20, respectively ( Figure 1C ). The sex ratios (male/female) for G1, G2, and G3 were 2.17, 3.00, and 3.67, respectively ( Figure 1D ). These data suggest that a higher proportion of men than women died from COVID-19, which may be due to the higher expression levels of ACE2 in males . According to the average age of COVID-19 patients with death outcome as well as P-values in the three groups (p-G1 vs. G2: 0.79; p-G1 vs G3: 0.80; p-G2 vs G3: 0.70), most COVID-19 patients who died were older males. As shown in Figure 1E , COVID-19 patients with comorbidities associated with abnormal RAS had more early symptoms than those without comorbidities (G1: 7, G2: 4, G3: 4), including fever, cough, dyspnea, fatigue, diarrhea, chills, and headache. These results indicate that older males with abnormal RAS accounted for the largest proportion of COVID-19 deaths and COVID-19 patients who died often exhibited numerous early symptoms. Generally, ACE inhibitors are utilized to treat comorbidities with abnormal RAS (Fang et al., 2020) . ACE inhibitors can affect the presence of ACE2 by decreasing the receptor levels for usage by SARS-CoV-2 (Fang et al., 2020; Li et al., 2017) . The high expression of ACE2 in humans with comorbidities and abnormal RAS may facilitate SARS-CoV-2 entry into target cells in the respiratory and gastrointestinal systems and prolong viral clearance time (Zheng et al., 2020) . Hence, COVID-19 patients with comorbidities associated with abnormal RAS often showed many early symptoms and exhibited poor prognosis. However, the relationship between ACE2 expression and COVID-19 progression is not clear, as no significant differences were found between G1 and G3 in days from symptoms to death (P=0.67) and patients with abnormal RAS always showed high ACE2 expression. Several studies in mice have indicated that ACE2 plays a critical role in responding to injury in the lungs (Gu et al., 2016; Hamzelou, 2020; Imai et al., 2007) . Inhibiting ACE2 function likely prevents injuries from healing. Therefore, ACE2 can act as a good target for drugs that block SARS-CoV-2 infection; however, we do not yet know whether ACE2 is safe or applicable across a wide range of populations. Peptides that prevent interaction between SARS-CoV-2 and its ACE2 receptor, but exert no effect on receptor enzyme activity, could potentially be utilized to treat COVID-19 patients. Spike-S1 as a potential target for COVID-19 SARS-CoV-2 uses the S1 subunit in a densely glycosylated spike (spike-S1) protein to bind to the host cell receptor (ACE2) to gain entry into host cells (Wrapp et al., 2020) . Hence, to explore potential broad-spectrum peptide drugs for treating COVID-19 in different populations, variants in potential targets, i.e., ACE2 and spike-S1, should be analyzed. The cryo-EM complex structures of ACE2 and spike protein were downloaded from the PDB database (PDB ID: 6VSB) (Wrapp et al., 2020) . Binding sites between ACE2 and spike-S1 were analyzed, as shown in Figure 2A (Supplementary Table S2 ). There were 14 binding sites in spike-S1 and 12 binding sites in ACE2. The 12 binding sites in ACE2 were distributed among three interaction regions: i.e., 24-41 residues, 79-83 residues, and 330-353 residues ( Figure 2B ). In addition, 19 amino acid variants in the spikebinding regions of ACE2 were identified based on the ChinaMAP , gnomAD (Lek et al., 2016) , 1KGP (The 1 000 Genomes Project Consortium, 2015), TOP Med program (Taliun et al., 2021) , and UK10K project (Consortium et al., 2015) databases ( Figure 2C ). Only one amino acid variant, K26R (rs ID (identity document): rs4646116), was distributed in the spike-binding regions of ACE2, but the variant site did not belong to the interaction sites for ACE2-spike-S1 (Figure 2A, C) . As shown in Figure 2D , the allele frequencies for different populations were different (Supplementary Table S3 ). The highest allele frequency for this variant was 0.01 (gnomAD (ASJ (Ashkenazi Jewish))). The differences in allele frequencies of ACE2 coding variants among different populations suggest that diverse genetic backgrounds may affect ACE2 function. These results indicate that, in different populations, ACE2 may exhibit different binding capabilities for spike-S1 in SARS-CoV-2. Therefore, a drug that could inhibit the binding of one ACE2 variant in SARS-CoV-2 may not work in other patients with different ACE2 variants. To analyze amino acid variants in the spike-S1 RBD, 10 992 complete genomes of SARS-CoV-2 were downloaded from the GISAID EpiFlu TM and CNCB/BIG databases (https://bigd.big.ac.cn/ncov). There were 14 binding sites in spike-S1 distributed in one RBD, i.e., 417-505 residues ( Figure 2E ). In total, 26 amino acid variants were identified in the spike-S1 RBD ( Figure 2F ). These amino acid variants were distributed among 66 SARS-CoV-2 viruses A: Binding sites in ACE2 and spike-S1. Green and cyan schematic represents 3D structure of ACE2 and spike-S1, respectively. Yellow letters and (Supplementary Table S4 ). Although variant site Y449N belonged to the interaction sites for ACE2-spike-S1, the frequency of the spike-S1 variant was 9.10E-05. The highest frequency for spike-S1 variants was <2.09E-03 ( Figure 2F , Supplementary Table S4 ). The corresponding dN, dS, and dN/dS (ω) values of the genome and spike-S1 sequences for SARS-CoV-2 spike-S1 RBD variants were calculated by DnaSP6 (Rozas et al., 2017) . Compared to the first genome sequence of SARS-CoV-2 (GenBank ID: MN908947.3), the average dN, dS, and dN/dS (ω) values (mean±SEM) for the genome sequences of SARS-CoV-2 spike-S1 RBD variants were 3.08E-04±3.99E-5, 4.91E-4±4.36E-5, and 0.64±5.87E-2, respectively (Supplementary Table S5 ). Compared to spike-S1 of the reference SARS-CoV-2, the average dN, dS, and dN/dS (ω) values (mean±SEM) for spike-S1 sequences of the SARS-CoV-2 spike-S1 RBD variants were 1.13E-3±1.08E-4, 3.31E-4±1.48E-4, and 0.46±0.14, respectively (Supplementary Table S5 ). According to these values, there were no positive or negative amino acid substitutions. These results indicate the presence of many amino acid substitutions in the RBD of spike-S1, but these substitutions are probably random events. Therefore, spike-S1 could be a potential target for exploring broad-spectrum peptide drugs for COVID-19 if spike-S1 mutants have the same binding capability as ACE2. Enhanced receptor-ligand binding capability in two spike-S1 mutants Initially, we compared 154 complete genomes of SARS-CoV-2 downloaded from the GISAID EpiFlu TM (access date 22 February 2020) and CNCB/BIG databases (https://bigd.big. ac.cn/ncov, access date 22 February 2020) to explore variations in SARS-CoV-2 in the early stages of the epidemic. The aligned genome size of SARS-CoV-2 was 29 674 bp and was comprised of 10 protein-coding sequence (CDS) regions ( Figure 3A ). The SARS-CoV-2-Wuhan01 virus (GenBank ID: MN908947.3) was compared to bat-CoV-RaTG13 (GISAID ID: EPI_ISL_402 131), pangolin-CoV-Guangxi (GISAID ID: EPI_ISL_410 538), and pangolin-CoV-Guangdong (GISAID ID: EPI_ISL_410 721) genomes to assess the potential origin of SARS-CoV-2 before it infected humans. Compared with these three coronaviruses, the whole-genome sequence identity of SARS-CoV-2-Wuhan01 was 0.85 to 0.96 (bat-CoV-RaTG13: 0.96, pangolin-CoV-Guangxi: 0.85, pangolin-CoV-Guangdong: 0.90), which is higher than that between SARS-CoV-2 and SARS (~0.80). In addition, the dN/dS (ω) values between SARS-CoV-2 and other coronaviruses were: bat-CoV-RaTG13: 0.80E-02/0.17 (0.04); pangolin-CoV-Guangxi: 5.50E-2/0.72 (0.08), pangolin-CoV-Guangdong: 2.5E-2/0.47 (0.08); and SARS:0.11/0.93 (0.12). Compared to SARS-CoV-2-Wuhan01, there were 360/755, 1 215/3 003, and 804/2 033 synonymous/nonsynonymous substitution sites for bat-CoV-RaTG13, pangolin-CoV-Guangxi, and pangolin-CoV-Guangdong, respectively. As shown in Figure 3B , nonsynonymous substitution sites were mainly distributed in the ORF1ab and spike-S1 regions. These results indicate that the two regions had higher substitution rates than other SARS-CoV-2 regions in different hosts ( Figure 3B ). Furthermore, to explore the evolution of SARS-CoV-2, we further examined genomic variations of SARS-CoV-2 in humans. The substitution sites in five proteins, i.e., 3chymotrypsin-like protease (Mpro), papain-like protease, helicase, RNA-dependent RNA polymerase, and spike-S1, were calculated as they may be potential drug targets for the development of antiviral agents or vaccines against SARS-CoV-2, SARS, and MERS. Subsequently, we identified 138 substitution sites (synonymous sites: 52, nonsynonymous sites: 86) in the 10 CDS regions, and all substitutions could be classified as single nucleotide polymorphisms (SNPs). The substitution sites were also mainly distributed in the ORF1ab and spike-S1 regions ( Figure 3C ). However, ORF8a, ORF3a, ORF10, and spike-S1 had higher substitution rates in SARS-CoV-2 ( Figure 3D ). With a nonsynonymous/synonymous site ratio >1, our results indicate that SARS-CoV-2 underwent adaptive evolution after human host infection. The nonsynonymous substitutions could result in a SARS-CoV-2 variant with increased infectivity. To address potential increased SARS-CoV-2 infectivity, amino acid mutations caused by nonsynonymous substitutions in SARS-CoV-2-Wuhan01 versus SARS-CoV-2 mutants were analyzed. The corresponding amino acids in the bat-CoV-RaTG13, pangolin-CoV, and SARS viruses were then used as controls. As shown in Figure 3E , most amino acid substitutions from the SARS-CoV-2 viruses corresponded to conserved residues in the bat-CoV-RaTG13, pangolin-CoV, and SARS viruses, except for five amino acid substitutions in spike-S1 of SARS-CoV-2. Furthermore, the dN/dS (ω) values for the SARS-CoV-2 mutants were always below 1 (Table 1 ). These results indicate that most amino acid substitutions in SARS-CoV-2 are likely random events. However, the 10 amino acid substitutions recurred in SARS-CoV-2, i.e., I789V of papainlike protease, H36Y and V354F of spike-S1, G251V of ORF3a, D209H of membrane glycoprotein, V62L, L84S, and P85S of ORF8, and S194L, S202N, and P344S of nucleocapsid phosphoprotein ( Figure 3E ). The spike-S1 protein binds to the human cellular receptor ACE2 and is responsible for viral infection. Hence, attention should be paid to amino acid substitutions occurring in the RBD region of spike-S1 or substitutions occurring independently more than three times in the early stages of the epidemic. In addition, epidemiological and evolutionary data for H36Y, N341D, D351Y, and V354F of spike-S1, G251V of ORF3a, V62L and L84S of ORF8, and S194L and S202N of nucleocapsid phosphoprotein were analyzed. The amino acid substitution of ORF8, i.e., L84S, had the highest substitution rate (value=36) among the 10 mutant residues ( Figure 3E ). The dN/dS (ω) value for the SARS-CoV-2 mutant-L84S was 3.6E-3/0.00 (NA) compared to the wild-type ORF8. However, this site is a reverse mutation compared to its ancestor ( Figure 3E ). We selected the whole genome of these 36 SARS-CoV-2 mutant viruses and their potential ancestors (bat-CoV and pangolin-CoV viruses) to construct a phylogenetic tree (neighborjoining, bootstrap=500) ( Figure 3F ). As shown in Figure 3F , we found that the SARS-CoV-2 mutant with L84S in ORF8 had a closer phylogenetic relationship to its ancestors compared to that of SARS-CoV-2-Wuhan01 ( Figure 3F) . Hence, the SARS-CoV-2 mutant L84S was more likely be an ancestral variant than a virus with increased infectivity. Previous research has indicated that the SARS-CoV-2 mutant L84S has reduced infectivity (Tang et al., 2020) . Therefore, we focused on SARS-CoV-2 without L84S in ORF8. The amino acid substitution of ORF3a, i.e., G251V, had the second highest substitution rate (value=17) among the 10 mutant residues ( Figure 3E ). These SARS-CoV-2 mutants were mainly isolated from patients in France and Singapore. Interestingly, five patients from France with 100% identical protein sequences (Supplementary Table S6 ) shared the same V354F mutation, and this amino acid substitution was in the RBD region of spike-S1 (V354F). In addition, the SARS-CoV-2 mutant-V354F had no L84S mutant in ORF8. The dN/dS (ω) value for SARS-CoV-2 mutant-V354F was 7.0E-4/0.00 (NA) compared to the wild-type spike-S1. These results suggest that the SARS-CoV-2 mutant with the V354F-spike-S1 mutation potentially exhibits increased infectivity. Since spike-S1 is a key factor for viruses to enter human cells, we explored whether it might contribute to the increased infectivity of SARS-CoV-2 mutants based on receptor-ligand (ACE2-spike-S1 mutants) docking. Amino acid substitution in the RBD region of spike-S1 ( Figure 4A , B) could theoretically affect the infectivity of SARS-CoV-2. Therefore, three mutants found in spike-S1 (N341D, D351Y, and V354F) were utilized to investigate the docking of spike-S1 to ACE2. Based on sequence, physical, and chemical parameters, i.e., theoretical pI (isoelectric point), GRAVY (grand average of hydropathicity), and instability index, were analyzed using the ProtParam tool (https://web.expasy.org/protparam/) for wildtype and mutants. As shown in Figure 4C , the three amino acid substitutions affected the physical and chemical parameters of the spike-S1 protein, with the V354F mutant having the smallest impact among the three mutants. We further tested whether these three mutants could result in different interaction models between ACE2 and spike-S1. With the 3D structure of ACE2 and spike-S1 downloaded from the PDB database (PDB ID: 6ACJ and 6VSB), the spike-S1 mutant structures were obtained using SWISS-MODEL based on a homologous modeling algorithm (Song et al., 2018; Wrapp et al., 2020) . Complexes of the ACE2-spike-S1 mutants were obtained by Discovery studio and Rosetta package software. As shown in Figure 4D , V354F had the lowest energy and interaction score in the ACE2-spike-S1 complexes based on structural comparison and protein-protein interaction analysis ( Figure 4D, Supplementary Table S6 ). These results further suggest that spike-S1 with the V354F amino acid substitution had a higher affinity for ACE2 than wild-type spike-S1, which potentially led to increased infectivity. Therefore, given its potential increase in infectivity, SARS-CoV-2 mutant-V354F deserves close epidemic surveillance. Such action could reduce the worldwide threat of a potential explosion in SARS-CoV-2 (Shu et al., 2020) and increase the success rate of therapeutic drug design. To further explore the epidemic impact of the V354F mutant, spike-S1 sequences were downloaded from the GISAID EpiFlu TM and CNCB/BIG databases at three different time points (22 February, 25 March, and 10 April 2020) . As shown in Figure 5A and B, an increasing number of amino acid substitutions were seen in the RBD of spike-S1 with the spread of SARS-CoV-2, but most amino acid substitutions were random events, except for V354F, G463S, and V470A. Furthermore, the detected number of patients with these three mutants, i.e., V354F, G463S, and V470A, increased. By 10 April 2020, based on 16 491 complete genomes of SARS-CoV-2, the number of mutant viruses with V354F, G463S, and V470A of spike-S1 reached 20, 19, and 30, respectively ( Figure 5C ). According to interaction energy analysis, SARS-CoV-2 strains with the spike-S1 V354F or V470A substitutions were likely novel strains with increased infectivity (Figures 4D, 5A) . To examine the binding capacity of spike-S1 mutants to ACE2, we constructed expression plasmids for two mutants, V354F and V470A (additional variant subsequently reported in the USA in Figure 5A -C) and purified sufficient protein for receptor-ligand binding assay. Both the V354F and V470A variants exhibited increased binding to the human ACE2 receptor compared to the wild-type, consistent with our computer simulation (Figures 4D, 5A) . As shown in Figure 5D , both the V354F and V470A mutants had significantly lower EC 50 (concentration for 50% of maximal effect) values (EC 50 =1.32 μg/mL for V354F and EC 50 =1.24 μg/mL for V470A) than the wild-type (EC 50 =1.86 μg/mL), demonstrating that the V354F and V470A mutants had higher binding affinity and potentially increased infectivity. Therefore, the SARS-CoV-2 mutants (V354F and V470A) with potentially increased infectivity should be under close epidemic surveillance to avoid potential additional waves of SARS-CoV-2 infection worldwide. Furthermore, animals that are easily accessible to humans should be monitored to prevent cross-infection, which could result in a novel coronavirus with increased infectivity or a new recombination coronavirus (Shi et al., 2020a) . These findings highlight the challenge of designing drugs for COVID-19 therapy. As the coronavirus infects human cells via ACE2, corresponding sequences of 16 mammals were downloaded from the NCBI database. Their sequence similarities and phylogenetic relationships are shown in Figure 6A and B, respectively. The sequence logo for the 16 ACE2 sequences is presented in Figure 6C . The key amino acids of human ACE2 for interaction with SARS-CoV-2 spike-S1 were not conserved in the 15 other species. Subsequently, the interaction energy scores between SARS-CoV-2 and ACE2 of the different species were analyzed. We found that Felis catus, Paguma larvata, Pan troglodytes, Pongo abelii, and Sus scrofa may be more susceptible to SARS-CoV-2 infection, consistent with other studies ( Figure 6D ) (Shi et al., 2020a) . Hence, animals with different ACE2 sequences or key interaction amino acids in ACE2 could affect the infectivity of SARS-CoV-2. Some animals may be infected with SARS-CoV-2 because they have the same interaction residues as human ACE2. The aligned interaction sites in ACE2 were 24, 30, 34, 38, 83, and 353 (Figures 2A, 6C) . However, previous study has indicated that there is no direct evidence to support the existence of ACE2 variants that are resistant to coronavirus S-protein binding in different populations (Cao et al., 2020a) . Therefore, we should focus our attention on Figure 4 Structural pharmacological analysis among three spike-S1 protein mutants A: RMD domain in spike-S1. B: Distribution of three spike mutations, i.e., N341D, D351Y, and V354F, in 3D structure. C: Physical and chemical parameters for spike-S1 and its mutants. D: Energy and interaction scores for complex structures of ACE2-spike-S1 and its mutants. E: Number of COVID-19 patients in Europe and USA. SARS-CoV-2 mutants, especially mutants with amino acid substitutions in the RBD region of spike-S1. For example, the V354F and V470A mutants have potentially increased infectivity. These mutants may increase the difficulty of developing effective COVID-19 therapies. Based on clinical data obtained from COVID-19 patients, we found that patients with abnormal RAS often exhibited more early symptoms and poorer outcomes after standard treatment. ACE2, which is a functional receptor for SARS-CoV-2, plays a critical role in the process of RAS. However, the relationship between ACE2 expression and COVID-19 progression is not clear. In different populations, ACE2 demonstrates different binding capabilities for spike-S1 in SARS-CoV-2. Furthermore, the amino acid substitutions present in the RBD of spike-S1 are likely random events. Hence, compared to ACE2, the spike-S1 RBD acts as a good target for broad-spectrum peptide drugs that could block SARS-CoV-2 infection in different populations. Although SARS-CoV-2 genomic variations are still low, some nonsynonymous substitutions have led to SARS-CoV-2 variants with increased infectivity in humans. Here, we found that SARS-CoV-2 with V354F or V470A substitutions of spike-S1 showed increased infectivity. Both V354F and V470A are also known to increase the entry of pseudoviruses into cells expressing ACE2 . These findings increase the challenge of designing peptide drugs for COVID-19 therapy and represent potentially dangerous virus strains. Although the clinical COVID-19 data and number of mutants tested for infectivity are limited in this study, the challenges for the treatment of COVID-19 based on existing analyses are clear. As mutations or allele variations result in different target domains, ACE2 and spike-S1 do not appear to be ideal Figure 5 Epidemic situation for spike-S1 mutants and receptor-ligand binding of spike-S1 mutants A: Number of amino acid mutants in spike-S1 at different time points. B: Distribution of spike-S1 mutants worldwide. C: Number of V354F and V470A mutants at different times. D: Optical density at 450 nm (OD450) for wild-type spike-S1 and its mutants, i.e., V354F and V470A mutants. targets for the design of small-molecule drugs to treat COVID-19 across all populations. Supplementary data to this article can be found online. Remdesivir for the treatment of Covid-19-preliminary report Antiviral peptides as promising therapeutic drugs Comparative genetic analysis of the novel coronavirus (2019-nCoV/SARS-CoV-2) receptor ACE2 in different populations The ChinaMAP Favipiravir versus Arbidol for COVID-19: a randomized clinical trial. medRxiv Are patients with hypertension and diabetes mellitus at increased risk for COVID-19 infection A: Sequence similarity between different ACE2 sequences. B: Phylogenetic tree for different ACE2 sequences. C: Sequence logo for ACE2. Red, yellow, and blue triangles represent ACE2 residues that could potentially form a hydrogen bond with the spike-S1 protein of SARS-CoV-2-Wuhan01 and an ionic or PiPistack interaction with the corresponding residues in spike-S1. D: Binding interaction energy scores for ACE2 and spike-S1 MFPPI-Multi FASTA ProtParam interface Observational Study of Hydroxychloroquine in Hospitalized Patients with Does a cell protein explain covid-19 severity? First Case of 2019 Novel Coronavirus in the United States Clinical features of patients infected with 2019 novel coronavirus in Wuhan Angiotensin-converting enzyme 2 in acute respiratory distress syndrome MEGA: Molecular evolutionary genetics analysis, version 1.02 Effect of convalescent plasma therapy on time to clinical improvement in patients with severe and life-threatening COVID-19: a randomized clinical trial The impact of mutations in SARS-CoV-2 spike on viral infectivity and antigenicity The vasoprotective axes of the reninangiotensin system: Physiological relevance and therapeutic implications in cardiovascular, hypertensive and kidney diseases Short pigment epithelial-derived factor-derived peptide inhibits angiogenesis and tumor growth PyDeT, a PyMOL plug-in for visualizing geometric concepts around proteins Protein structure prediction using Rosetta DNA sequence polymorphism analysis of large data sets Treatment of 5 critically Ill patients with COVID-19 with convalescent plasma Susceptibility of ferrets, cats, dogs, and other domesticated animals to SARS-coronavirus 2 A human neutralizing antibody targets the receptor-binding site of SARS-CoV Structural basis for the high conductivity of microbial pili as potential nanowires Cryo-EM structure of the SARS coronavirus spike glycoprotein in complex with its host cell receptor ACE2 Sequencing of 53,831 diverse genomes from the NHLBI TOPMed program The 1000 Genomes Project Consortium. 2015. A global reference for human genetic variation The UK10K project identifies rare variants in health and disease Analysis for free: comparing programs for sequence analysis Development of an inactivated vaccine candidate, BBIBP-CorV, with potent protection against SARS-CoV-2 Remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus (2019-nCoV) in vitro SWISS-MODEL: homology modelling of protein structures and complexes Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation Single-cell RNA expression profiling of ACE2, the putative receptor of Wuhan 2019-nCov. bioRxiv COVID-19 and the cardiovascular system We gratefully acknowledge the GISAID EpiFlu TM (www.gisaid. org) and CNCB/BIG (https://bigd.big.ac.cn/ncov) databases. The authors declare that they have no competing interests.