key: cord-306085-gnrnsxej authors: Hassan, Sk. Sarif; Choudhury, Pabitra Pal; Roy, Bidyut title: SARS-CoV2 envelope protein: Non-synonymous mutations and its consequences date: 2020-07-05 journal: Genomics DOI: 10.1016/j.ygeno.2020.07.001 sha: doc_id: 306085 cord_uid: gnrnsxej Abstract In the NCBI database, as on June 6, 2020, total number of available complete genome sequences of SARS-CoV2 across the world is 3617. The envelope protein of SARS-CoV2 possesses several non-synonymous mutations over the transmembrane domain and (C)-terminus in 15 (0.414%) genomes among 3617 available SARS-CoV2 genomes. More precisely, it is to be mentioned that 10(0.386%) out of 2588 genomes from the USA, 3(0.806%) from Asia, 1 (0.348%) from Europe and 1 (0.274%) from Oceania contain the missense mutations over the envelope proteins of SARS-CoV2 genomes. The C-terminus motif DLLV has been changed to DFLV and YLLV in the proteins QJR88103 (Australia: Victoria) and QKI36831 (China: Guangzhou) respectively, which might affect the binding of this motif with the host protein PALS1. The present pandemic situation of the Severe Acute Respiratory Syndrome (COVID-19) is caused by the RNA virus SARS-CoV2 which is characterized by its rapid mutations up to a million times higher than that of their hosts [1] . [1] . Several mutations have been detected in various proteins of the SARS-CoV2 over a short period of time, which are recently reported in various The CoV envelope (E) protein is the smallest among the four structural proteins involved in several aspects of the virus life cycle, such as assembly, budding, envelope formation, and pathogenesis [7] . However, the molecular mechanism involving E in pathogenesis is not yet clearly understood. Notably, this protein interacts with other structural proteins such as membrane(M) and other accessory proteins viz. ORF3a, ORF7a and host cell proteins [8] . Envelope protein of SARS-CoV2 is 76 amino acids long and it possesses three important domains viz. (N)-terminus, transmembrane domain (TMD) and (C)-terminus (Fig.1) . The (C)-terminal domain of envelope protein in SARS-CoV2 binds to human PALS1, a tight junction-associated protein, which is essential for the establishment and maintenance of epithelial polarity in mammals [9, 10] . the host cell PALS1 protein to facilitate infection [9, 11, 12] . In this present study, non-synonymous mutations over the envelope protein of SARS-CoV2 across the available 3617 SARS-CoV2 genomes (as on 6th June, 2020), have been found and accordingly their probable consequences are discussed. Among these virus genomes from 3617 patients; there are 2588 were from the USA, 372 were from Asia, 287 were from Europe, 365 were from Oceania and 5 were from Africa. Here, we present the non-synonymous mutations of the envelope protein over the available 3617 SARS-CoV2 genomes (Table 1) . It is to be noted that 10 (0.386%) out of 2588 genomes from USA, 3 (0.806%) from Asia, 1 (0.348%) from Europe and 1 (0.274%) from Oceania) contain the missense mutations (Table 1) in the envelope proteins of SARS-CoV2 genomes. Changes of the R-group of each amino acid according to the mutations are also presented (Table-1) . It is to be noted that the mutation of an amino acid 1 A to an amino acid 2 A is denoted by 12 A pA where p denotes location in the reference amino acid sequence. property changes only in the case of the protein QHZ00381 for the mutation L37H in the TMD of the envelope protein. • TMD was also observed to be conserved over the SARS-CoV1 and COV2 genomes, the  In the NCBI database, as on June 6, 2020, total number of available complete genome sequences of SARS-CoV2 across the world is 3617 on which the present study of mutation over the envelope protein is made.  The envelope protein of SARS-CoV2 possesses several nonsynonymous mutations over the transmembrane domain and (C)terminus in 15 (0.414\%) genomes among 3617 available SARSCoV2 genomes. More precisely, it is to be mentioned that 10(0.386\%) out of 2588 genomes from the USA, 3(0.806\%) from Asia, 1 (0.348\%) from Europe and 1 (0.274\%) from Oceania contain the missense mutations over the envelope proteins of SARS-CoV2 genomes.  The C-terminus motif DLLV has been changed to DFLV and YLLV in the proteins QJR88103 (Australia: Victoria) and QKI36831 (China: Guangzhou) respectively, which might affect the binding of this motif with the host protein PALS1. J o u r n a l P r e -p r o o f Journal Pre-proof Emerging sars-cov-2 mutation hot spots include a novel rna-dependent-rna polymerase variant Severe acute respiratory syndrome coronavirus accessory protein 9b is a virion-associated protein Missense mutations in sars-cov2 genomes from indian patients Molecular conservation and differential mutation on orf3a gene in indian sars-cov2 genomes The divergence between sars-cov-2 and ratg13 might be overestimated due to the extensive rna modification Pathogenicity of severe acute respiratory coronavirus deletion mutants in hace-2 transgenic mice Coronavirus envelope protein: current knowledge The coronavirus nucleocapsid is a multifunctional The sars coronavirus e protein interacts with pals1 and alters tight junction formation and epithelial morphogenesis Emerging themes in pdz domain signaling: structure, function, and inhibition Pdzbase: a protein-protein interaction database for pdz-domains The sars coronavirus e protein interacts with pals1 and alters 5 tight junction formation and epithelial morphogenesis 6 The embl-ebi search and sequence analysis tools apis in 2019 Understanding the nature of variations in structural sequences coding for coronavirus spike, envelope, membrane and nucleocapsid proteins of sars-cov-2, Envelope, Membrane and Nucleocapsid Proteins of SARS-CoV-2 Genetic lesions within the 3a gene of sars-cov Structure, expression, and intracellular localization of the sars-cov accessory proteins 7a and 7b Coronavirus virulence genes with main focus on sars-cov envelope gene The authors do not have any conflicts of interest to declare.