key: cord-272986-ebgusf3o authors: Cao, Yipeng; Yang, Rui; Wang, Wei; Lee, Imshik; Zhang, Ruiping; Zhang, Wenwen; Sun, Jiana; Xu, Bo; Meng, Xiangfei title: Computational Study of Ions and Water Permeation and Transportation Mechanisms of the SARS-CoV-2 Pentameric E Protein Channel date: 2020-05-17 journal: bioRxiv DOI: 10.1101/2020.05.17.099143 sha: doc_id: 272986 cord_uid: ebgusf3o Coronavirus disease 2019 (COVID-19) is caused by a novel coronavirus (SARS-CoV-2) and represents the causative agent of a potentially fatal disease that is of public health emergency of international concern. Coronaviruses, including SARS-CoV-2, encode an envelope (E) protein, which is a small, hydrophobic membrane protein; the E protein of SARS-CoV-2 has high homology with that of severe acute respiratory syndrome coronavirus. (SARS-CoV) In this study, we provide insights into the function of the SARS-CoV-2 E protein channel and the ion and water permeation mechanisms on the basis of combined in silico methods. Our results suggest that the pentameric E protein promotes the penetration of monovalent ions through the channel. Analysis of the potential mean force (PMF), pore radius and diffusion coefficient reveals that Leu10 and Phe19 are the hydrophobic gates of the channel. In addition, the pore demonstrated a clear wetting/dewetting transition with monovalent cation selectivity under transmembrane voltage, which indicates that it is a hydrophobic voltage-dependent channel. Overall, these results provide structural-basis insights and molecular-dynamic information that are needed to understand the regulatory mechanisms of ion permeability in the pentameric SARS-CoV-2 E protein channel. COVID-19 is a severe and highly contagious respiratory illness that was first reported in China in early December 2019. Subsequently, the virus has spread worldwide. As of May 6, 2020, millions of cases have been confirmed, and hundreds of thousands have died. The World Health Organization (WHO) announced a global pandemic for COVID-19 in March 2020. In addition to the hazards of the disease itself, it has also led to a severe turbulence in international financial markets and may cause serious consequences such as a financial crisis. COVID-19 is a disease caused by a new coronavirus named SARS-CoV-2. It is speculated that it originated from bats and was transmitted to humans through an intermediate host (some kind of wildlife). Its symptoms include fever, general malaise, dry cough, shortness of breath, and respiratory distress. Comparing similar diseases, including severe acute respiratory syndrome (SARS) and Middle East respiratory syndrome (MERS), which are also caused by coronaviruses, the mortality rates for SARS and MERS were 10% and 36%, respectively 1 . Currently, COVID-19 has a mortality rate of 1 to 10% in different countries but appears to be more contagious than SARS and MERS 2 3 4 5 . Similar to other coronaviruses, SARS-CoV-2 is a long positive-sense, long single-stranded (30 kb) RNA virus. The structure of different human coronaviruses (HCoVs) is similar. The viral genome is packed by nucleocapsid (N) proteins, forming a helicoidal nucleocapsid protected by a lipid envelope 6 . Several viral proteins, including the spike (S), envelope (E), and membrane (M) proteins, are embedded within a lipid envelope 7 8 . Studies have shown that S, M, and N proteins play important roles in receptor binding and virion budding. For example, the M protein participates in virus germination and interacts with the N and S proteins. The S protein has immune recognition sites and can be used to design vaccines 9 . Currently, the importance of the E protein has not been fully revealed. Evidence suggests that the E protein maintains its morphology after virus assembly by interacting with the M protein 10 11 12 . When an E protein gene mutation occurs, it promotes apoptosis. Recent studies have shown that coronaviruses have a viroporin that can self-assemble into a pentameric structure and have ion selectivity. When there is a transmembrane voltage, the ion channel characteristics of viroporins are more significant. In addition, the Asn25Ala and Val18Phe mutations could destroy ion channel activity 13 . This indicates that the E protein may play an important role in regulating the ion equilibrium inside and outside the viral envelope. The ion channel activity of the E protein can lead to increased levels of the inflammatory cytokines IL-1β, TNF and IL-6 in the lungs, leading to the occurrence of an "inflammatory storm" 14 . This plays a key role in the progression of the disease and may cause the patient's condition to suddenly deteriorate and lead to death. In addition, its evolutionary conservatism may be an important cause of viral cross-host infection 15 16 17 . Although previous studies have suggested that the E protein of coronaviruses such as SARS and MERS oligomerizes and has ion permeability, the specific mechanism of ion permeability and channel properties remain to be explored due to the lack of a crystal structure for the E protein. In 2014, Li et al 17 identified the SARS-CoV E protein monomer structure. In 2018, Surya et al. extracted the pentamer structure of the SARS E protein by nuclear magnetic resonance (NMR) 18 , which provided strong support for the study of the SARS-CoV-2 E protein pentamer ion permeability mechanism. In this study, we obtained the amino acid sequence of the SARS-CoV-2 E protein from the National Center for Biotechnology Information (NCBI) database 19 . The E protein pentamer model of SARS-CoV-2 was built by using the homology modeling method, and the reasonableness of the model was evaluated. Subsequently, µs-level molecular dynamics (MD) simulations were performed to evaluate the pentamer's stability in the membrane environment. We tried to use potential mean force (PMF) to reveal the permeability of different physiological ions and water molecules in the pores of the E protein pentamer. The characteristics of the pentameric channel were analyzed by combining the channel diffusion coefficient and geometric properties. In addition, computational electrophysiology was applied for different transmembrane voltages of the system to reveal the effect of the voltage on the ion permeability of the pentameric E protein. Overall, exploring the mechanisms of the pentameric SARS-CoV-2 E protein not only provides valuable insights into the conduction of the channel but also has important implications for our understanding of the difference between SARS-CoV-2 and other coronaviruses. Sequence alignment and homology modeling Figure 1A shows the sequence alignment between the E proteins of two human coronaviruses (SARS-CoV and SARS-CoV-2 sequence) made by Clustal X software. The blue dotted rectangle in Figure 1B The Rampage online program 20 was used to evaluate the accuracy of the SARS-CoV-2 pentameric E protein model. More than 98.9% of the amino acids are within the acceptable range, suggesting that the SARS-CoV-2 E protein is similar to the SARS E protein NMR model. Subsequently, a 1000 ns (1 µs) MD simulation was performed to evaluate the stability of pentameric E protein embedded in the membrane environment. The root-mean-square deviation (RMSD) of the model is shown in Figure 2 , and the red and blue curves represent the whole protein and TM region, respectively. In the first 200 ns, the RMSD continued to rise, indicating that the model needed longer optimization (compared with other ion channels) to reach the pentameric structure equilibrium. During the last 800 ns, the curves plateaued. The RMSD of the whole protein and the TM region converged at ~0.4 and ~0.3 nm, respectively. There is a 0.1 nm difference between the whole protein and TM region. These findings are consistent with other ion channel data that show that the TM region has a higher stability than the other parts of the membrane protein 21 . The permeability of the SARS-CoV-2 pentameric E protein channel is very important for understanding the replication ability of viruses in cells and how they are secreted into the extracellular medium 22 . It is possible to examine a profile of the free energy of a single ion or water as a function of its position along the pore axis by calculation of PMF. In this simulation, the ions or water molecules are restrained to a continuous position along the z-axis and move freely in the pore's xy plane. Moreover, other parts of the system (proteins, ions, water molecules, and lipids) can move freely and reach equilibrium. The PMF of important physiological ions (Mg2 + ; Ca 2+ ; Cl-; K + and Na + ) and the water molecules as a function of their position along the pore (z)-axis were calculated separately. Figure 3A shows the PMF of ions and water molecules permeating through the SARS-CoV-2 E protein pentamer pore. Z M is defined as the axial direction along the pore, which is from -3 ~ 3 nm (the length of umbrella sampling is 6 nm in total). From the PMF curve, we found that the energy barriers of monovalent and divalent ions have a significant difference. The maximum energy barriers of the two divalent ions Ca 2+ and Mg 2+ are 60 kJ/mol and 70 kJ/mol, respectively. In contrast, the PMFs of the monovalent ions Na + and K + are ~ 12 kJ/mol and ~20 kJ/mol, respectively, while the Cl-energy barrier is ~ 30 kJ/mol, which is significantly smaller than that of Ca 2+ and Mg 2+ . From an energy perspective, the SARS-CoV-2 pentameric E protein is almost impermeable to divalent ions. This confirms the previous hypothesis that the SARS-CoV and MERS-CoV E protein channel is a monovalent cation channel 23 24 . The maximum energy barrier in descending order is as follows: Na + 0.3 V. Computational electrophysiology also explained that the transmembrane voltage led to an easier wetting transition for the channel. The range for keeping the channel open should be between 0.15 V and2 0.45 V. Na + and K + could be cotransported during water permeation, which is very similar to that in many hydrophobic channels 45 46 47 48 . Intriguingly, we did not observe Clconduction, which may be because chloride ions could not overcome the energy barrier under the maximum transmembrane voltage. In addition, the change in the pore radius at different voltages highlights two important sites, where the geometric radius changes the most, corresponding to Leu10 and Phe19. Phe19 contains a benzene ring group at the hydrophobic surface of the inner pore as a hydrophobic gate. The isopropyl group of Leu10 at the bottom of the pore prevents the reverse penetration (Fig 6A) . Previous studies indicated that Asn15Val and Val25Phe mutations will make the channel dysfunctional. This may be due to the additional side chain groups in the pores caused by these mutations (especially the benzene ring in Val25Phe), making 1) the radius of the pores decrease and 2) the hydrophobicity of the pores increase. In short, these mutations may increase the energy barrier, negatively affecting the wetting transition and causing the channel to be functionally closed. Therefore, the SARS-CoV-2 E protein pentamer is a voltage-dependent hydrophobic channel. We propose that the E protein may play an essential role in the virus infection and replication processes through the following mechanisms: 1) The monovalent selective permeation of the E protein pentamer ion channel may change the intracellular pH, providing a suitable microenvironment for virus replication. 2) Selective permeation can form a transmembrane voltage, creating feedback regulation and maintaining an intracellular microenvironment that is suitable for viral growth. 3) The disintegration of the ion equilibrium of the intracellular area affect the charges coming from the cell through ion channels in the cell membrane, changing the pH and making it easier for the virus to fuse with the cell membrane. This is the infection mechanism of the avian coronavirus as well as some types of influenza viruses 49 50 . 4) It has a certain signal transduction function. Although the pentameric SARS-CoV-2 E protein crystal structure is not yet available, Preparation of the SARS-CoV-2 pentameric E protein-membrane simulation system The amino acids sequence of the human coronaviruses E proteins were downloaded from the National Center for Biotechnology Information (NCBI To obtain the equilibrated pentameric SARS-CoV-2 E protein model, Charmm36 all-atom force field 53 was chosen for MD simulation. The MD time step was set at 2 fs. Electrostatic interactions were described using the Particle Mesh Ewald (PME) algorithm 54 with a cut-off of 1.2 nm. The LINCS algorithm 55 was used to constrain the bond lengths. The pressure was maintained semi-isotropically at 1 bar at both x and y directions using the Perinello-Rahman barostat algorithm 56 and the system temperature was maintained at 310 K by the Nose-Hoover thermostat 57 . Then, a 1000 ns (1 µs) MD simulations were performed for the E protein pentameric. Umbrella sampling The initial system for umbrella sampling simulations was derived from the equilibrated pentameric SARS-CoV-2 E protein mentioned above. A single ion that maintain physiological activity (Na + , K + , Ca 2+ , Mg 2+ , Cl -) or water molecule was placed at successive positions along the central pore axis by using GROMACS pull code. Energy minimization was performed before simulation for optimizing the water and ions position. The reaction coordinate defined from z +3 to -3 nm with the mass center at z = 0 nm, with a spacing of 0.1 nm between successive windows, resulting in 60 umbrella sampling simulation systems. The probe ion or water molecules were harmonically restrained by a force constant of 2000 kJ mol/nm 2 same with the pore direction. Each window was performed a 5 ns total umbrella sampling simulation. The initial 2 ns for system equilibration, then a subsequent 3 ns was applied for analysis. The PMFs were computed by the weighted histogram analysis method (WHAM) 58 , and the profile was generated by Gromacs protocol 'g_wham' 59 . Bootstrap analysis (N = 50) was used to estimate statistical error and the '-cycl' parameter was used to make the value equal. The Diffusion-coefficient was calculated using the method described by Shirvanyants et al. 60 . We restrained the E-protein pentamer helix backbones and made the side chains, membrane, water and ions free to move. Due to the ion selectivity is related to the side chain of inner pore 61 , the backbone restraint maintained the E-protein pentamer tertiary structure but the side-chain flexibility in the inner pore was not influenced. A K + as the probe ion, the calculation system was used in the same manner as for umbrella sampling. The ion mean-square displacement (MSD) was calculated along the pore's Z-axis. A total of 25 simulation systems were obtained, each widows interval distance of the K + being set as 0.24 nm. The umbrella restraint was used to maintain the K + ion's position on the x-y plane of the pore. The Einstein equation MSD=2D(z)t was used to calculate the diffusion coefficient by the protocol g_msd. The umbrella restraint can be disregarded for these analyses because of the restraint force was negligible compared to thermally induced RMS fluctuations. Computational electrophysiology (CE) is a good tool for simulating the ion conduction ability of a channel under transmembrane voltage. We established a sandwich structure including three parts (membrane-protein-water) as described by Kutzner et al. 62 as shown in Fig 4A, each water layer contained a different number of ions. Due to the imbalance of the ion distribution of the water layer, the ions gradient will produce transmembrane voltage. Specific transmembrane voltage can be applied to the simulation system by adjusting the number of ions between the water layers. In this study, we built the SARS-CoV-2 E protein sandwich system. The initial single layer system was equilibrium for 50 ns to make the structures stable. Then the single-layer system was duplicated along the z direction. The resulting system had a size of around 11×11×22 nm 3 SARS and MERS: recent insights into emerging coronaviruses Real estimates of mortality following COVID-19 infection Case-fatality rate and characteristics of patients dying in relation to COVID-19 in Italy Coronavirus: covid-19 has killed more people than SARS and MERS combined, despite lower case fatality rate Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study The SARS coronavirus nucleocapsid protein-forms and functions Characterization of a novel coronavirus associated with severe acute respiratory syndrome. Science (80-. ) Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia COVID-19 spike-host cell receptor GRP78 binding site prediction Infectious bronchitis virus E protein is targeted to the Golgi complex and directs release of virus-like particles Nucleocapsid-independent assembly of coronavirus-like particles by co-expression of viral envelope protein genes Efficient assembly and release of SARS coronavirus-like particles by a heterologous expression system Conductance and amantadine binding of a pore formed by a lysine-flanked transmembrane domain of SARS coronavirus envelope protein COVID-19: consider cytokine storm syndromes and immunosuppression A decade after SARS: strategies for controlling emerging coronaviruses Severe acute respiratory syndrome coronavirus envelope protein ion channel activity promotes virus fitness and pathogenesis Structure of a conserved Golgi complex-targeting signal in coronavirus envelope proteins Structural model of the SARS coronavirus E channel in LMPG micelles A new coronavirus associated with human respiratory disease in China Structure validation by Cα geometry: ϕ, ψ and Cβ deviation Molecular dynamics simulations of membrane channels and transporters A single polar residue and distinct membrane topologies impact the function of the infectious bronchitis coronavirus E protein SARS coronavirus E protein forms cation-selective ion channels MERS coronavirus envelope protein has a single transmembrane domain that forms pentameric ion channels Ion channels of excitable membranes Theoretical and computational models of biological ion channels Functional annotation of ion channel structures by molecular simulation Permeation process of small molecules across lipid membranes studied by molecular dynamics simulations Molecular transport through membranes: Accurate permeability coefficients from multidimensional potentials of mean force and local diffusion constants Effect of field direction on electrowetting in a nanopore Voltage gating of a biomimetic nanopore: Electrowetting of a hydrophobic barrier Viroporins: structure and biological functions Ion channel voltage sensors: structure, function, and pathophysiology Designing a hydrophobic barrier within biomimetic nanopores STIM1 activates CRAC channels through rotation of the pore helix to open a hydrophobic gate 376-Relaxation studies on cell membranes and lipid bilayers in the high electric field range CAVER 3.0: a tool for the analysis of transport pathways in dynamic protein structures COVID-19 infection: origin, transmission, and characteristics of human coronaviruses Repurposing therapeutics for covid-19: Supercomputer-based docking to the sars-cov-2 viral spike protein and viral spike protein-human ace2 interface Identification of the mechanisms causing reversion to virulence in an attenuated SARS-CoV for the design of a genetically stable vaccine Biophysical characterization of Vpu from HIV-1 suggests a channel-pore dualism Identification of an ion channel activity of the Vpu transmembrane domain and its involvement in the regulation of virus release from HIV-1-infected cells Evidence for the formation of a heptameric ion channel complex by the hepatitis C virus p7 protein in vitro Hydrophobic gating in ion channels Electric-field-induced wetting and dewetting in single hydrophobic nanopores Voltage-gated hydrophobic nanopores A role for hydrophobic residues in the voltage-dependent gating of Shaker K+ channels The avian coronavirus infectious bronchitis virus undergoes direct low-pH-dependent fusion activation during entry into host cells Variations in pH sensitivity, acid stability, and fusogenicity of three influenza virus H3 subtypes SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information CHARMM-GUI: a webbased graphical user interface for CHARMM CHARMM General Force Field (CGenFF): A force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields Particle mesh Ewald: An N⋅ log (N) method for Ewald sums in large systems LINCS: a linear constraint solver for molecular simulations Polymorphic transitions in single crystals: A new molecular dynamics method The nose-hoover thermostat The weighted histogram analysis method for free-energy calculations on biomolecules. I. The method A free weighted histogram analysis implementation including robust error and autocorrelation estimates Pore dynamics and conductance of RyR1 transmembrane domain Principles of selective ion transport in channels and pumps. Science (80-. ) Computational electrophysiology: the molecular dynamics of ion channel permeation and selectivity in atomistic detail VMD: visual molecular dynamics GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers This study was supported by the National Natural Science Foundation of China All other authors declare no competing interests.