key: cord-0839581-u3vbrclm authors: Dey, Debajit; Singh, Suruchi; Khan, Saif; Martin, Matthew; Schnicker, Nicholas J.; Gakhar, Lokesh; Pierce, Brian G.; Hasan, S. Saif title: An extended motif in the SARS-CoV-2 spike modulates binding and release of host coatomer in retrograde trafficking date: 2022-02-08 journal: Commun Biol DOI: 10.1038/s42003-022-03063-y sha: b28e79579b0b7f8373b382986e1f1f2808ef1e60 doc_id: 839581 cord_uid: u3vbrclm β-Coronaviruses such as SARS-CoV-2 hijack coatomer protein-I (COPI) for spike protein retrograde trafficking to the progeny assembly site in endoplasmic reticulum-Golgi intermediate compartment (ERGIC). However, limited residue-level details are available into how the spike interacts with COPI. Here we identify an extended COPI binding motif in the spike that encompasses the canonical K-x-H dibasic sequence. This motif demonstrates selectivity for αCOPI subunit. Guided by an in silico analysis of dibasic motifs in the human proteome, we employ mutagenesis and binding assays to show that the spike motif terminal residues are critical modulators of complex dissociation, which is essential for spike release in ERGIC. αCOPI residues critical for spike motif binding are elucidated by mutagenesis and crystallography and found to be conserved in the zoonotic reservoirs, bats, pangolins, camels, and in humans. Collectively, our investigation on the spike motif identifies key COPI binding determinants with implications for retrograde trafficking. β -Coronaviruses have been responsible for major human respiratory diseases in the last two decades. In 2002, the severe acute respiratory syndrome coronavirus (SARS-CoV) was implicated in an epidemic first reported in China before spreading to 27 countries, which resulted in 774 deaths 1 . A decade later, Middle East respiratory syndrome (MERS) was reported in Saudi Arabia in 2012 with over 30% fatality in patients 2, 3 . 
Most recently, the novel SARS-CoV-2 has been implicated in the COVID-19 global pandemic that has claimed over four million lives. Current efforts to contain the pandemic are focused primarily on vaccinations using the viral spike protein that is responsible for SARS-CoV-2 entry into host cells 4, 5 . Fundamental insights into spike biogenesis will advance the understanding of how β-coronaviruses exploit host resources during viral infection and may potentially lead to the development of novel therapeutics. The trimeric β-coronavirus spike is organized into an ectodomain, a transmembrane domain, and a cytosolic domain 6, 7 (Fig. 1) . In infected cells, the newly synthesized spike protein in ER is transported first to the Golgi, and then from Golgi to the ERGIC compartment, which is the site of β-coronavirus progeny assembly [8] [9] [10] . This retrograde trafficking of the posttranslationally modified spike from Golgi to ERGIC involves a cytosolic dibasic motif, K-x-H-x-x (Lys-x-His, where x is any amino acid) 9,10 ( Fig. 1 ). Such C-terminal dibasic motifs and variants such as K-x-K-x-x and K-K-x-x are widely reported in the cytosolic tail of host membrane proteins that undergo retrograde trafficking [11] [12] [13] [14] . As such, the β-coronavirus spike demonstrates molecular mimicry of dibasic trafficking motifs 9, 10 . This recycling of spike protein has been suggested to enhance interactions with the viral membrane (M) protein localized in ERGIC during progeny assembly and is crucial for spike maturation 9, 10, 15 . These observations establish a key role of the dibasic motif in SARS-CoV and SARS-CoV-2 infection and propagation cycles. Interestingly, the spike dibasic motif and adjacent residues are completely conserved in sarbecoviruses, i.e., SARS-CoV and SARS-CoV-2, although sequence divergence in residues neighboring the dibasic motif is noted in MERS-CoV 9, 10 . 
On the host side, retrograde trafficking is mediated by the interactions of dibasic motifs with the coatomer protein-I (COPI) complex 11, 16 . Seven subunits, namely, α, β, β', γ, δ, ε and ζ, assemble into a COPI complex on retrograde trafficking vesicles that carry cargo [17] [18] [19] [20] [21] [22] [23] [24] . Prior genetic, biochemical, biophysical, and structural investigations have shown that the binding site for host cargo dibasic motifs maps to the N-terminal β-propeller WD40 domains of α and β'COPI subunits, which are structural homologs 11, [25] [26] [27] [28] [29] . Mutagenesis analyses of α and β' subunit N-terminal WD40 domains have identified residues critical for binding of host protein dibasic motifs 27 . This and another study 28 provided important structural details of how dibasic host and viral peptides bind to αCOPI-WD40 and β'COPI-WD40 domains. Cellular and biochemical investigations in recent years and during the ongoing COVID-19 pandemic have suggested a role of COPI interactions in sarbecovirus spike trafficking, maturation, glycan processing, and syncytia formation during infection 10, 15, 30 . These studies have established a platform to investigate the underlying chemistry of spike-COPI interaction. For instance, it is not known whether the K-x-H motif is sufficient to determine the strength of this interaction, whether adjacent residues in the spike cytosolic tail play a role in this binding, and what are the sequence determinants of spike-COPI disassembly crucial for spike release in ERGIC. On the host side, it is presently not known which COPI residues are critical for spike interactions. As such, key facets of this initial binding event in spike trafficking remain largely unknown for SARS-CoV-2 as well as SARS-CoV. 
In the present investigation, we address these questions using a combination of bio-layer interferometry (BLI), molecular modeling, mutagenesis, X-ray crystallography, and an in silico analysis of the human membrane proteome. Employing a sarbecovirus spike hepta-peptide corresponding to the K-x-H-x-x motif, we identify critical residues in αCOPI-WD40 for hepta-peptide binding and demonstrate structural alterations in an αCOPI-WD40 mutant. Amino acid propensity is described in human dibasic motifs and adjacent downstream residues, and mutagenesis experiments driven by this analysis provide insights into how sarbecovirus spike modulates strength of binding to COPI. Collectively, our study advances the structural and biophysical understanding of how the dibasic motif hijacks COPI for spike retention in endo-membranes and trafficking to the plasma membrane during sarbecovirus infections.
Direct binding of sarbecovirus spike hepta-peptide is selective for αCOPI-WD40 domain. In this investigation, we heterologously expressed and purified the N-terminal WD40 domains of β'COPI (residues 1-301) from Saccharomyces cerevisiae and αCOPI (residues 1-327) from Schizosaccharomyces pombe (Supplementary Figs. 1, 2). Although SARS-CoV-2 infects mammals, we chose these yeast constructs for two reasons. First, the putative interaction interface for dibasic peptides is conserved between these constructs and the αCOPI and β'COPI orthologs in humans, COPA and COPB2, respectively 27, 28 . Second, these domains have been previously crystallized and structurally characterized 27, 28 . This is consistent with our aim of understanding the structural basis of spike-COPI interactions. Although β'COPI-WD40 was expressed in E. coli as described previously, αCOPI-WD40 has reportedly presented challenges in protein production 27, 28 . Hence, the αCOPI-WD40 construct was expressed in Expi293 cells. This generated yields of purified αCOPI-WD40 of 4 mg per 100 ml cell culture volume. For comparison with a previously published construct of αCOPI-WD40 expressed in E. coli 28 , the crystal structure of the purified αCOPI-WD40 domain was determined to 1.8 Å resolution (Supplementary Table 1). The αCOPI-WD40 domain is organized into a β-propeller and is consistent with previously described structures of αCOPI-WD40 (Cα root-mean-square-deviation is <0.5 Å) 28 . However, a peripheral loop and a short α-helix (Gly 168 -Ala 188 , shown with an arrow in Fig. 2a) demonstrate substantial differences from previously described αCOPI-WD40 structures, likely due to altered crystal packing. An N-terminal acetylation of the polypeptide was identified in the structure. Importantly, the αCOPI-WD40 domain interface for putative interactions with dibasic motifs is similar between the structure determined here and previously published structures 28 .
Fig. 1 Organization of the coronavirus spike protein. The spike protein is divided into an ecto-domain (gray), a trans-membrane domain (green), and a cytosolic domain (yellow-white-cyan). The cytosolic domain includes a cysteine rich tract (yellow) and a dibasic motif for COPI interactions (cyan). This overall organization and the dibasic motif are conserved in the spike protein of SARS-CoV, SARS-CoV-2, and MERS-CoV, which have been implicated in wide-spread human disease. The underlined residues correspond to the peptide sequence utilized in this manuscript.
Fig. 2 a Comparison of the αCOPI-WD40 structure in the present investigation (yellow) and a previous structure (magenta). Arrow highlights main chain differences between these two αCOPI-WD40 structures in Gly 168 -Ala 188 . b-h BLI assay of N-biotinylated spike hepta-peptide with COPI-WD40 domain. One representative experiment of three is shown in panels (b-h). Color code for concentrations is given at the bottom of the figure. The equilibrium K D is provided with each sensorgram for comparison. b The spike wild-type peptide sequence demonstrates dose-dependent binding to αCOPI-WD40 domain. c Scrambling of the hepta-peptide sequence abolishes binding, suggesting sequence-specific interaction. d β'COPI-WD40 demonstrates no interaction with the immobilized hepta-peptide. The mutant peptide, Gly-Val-Lys-Leu-Lys-Tyr-Thr, shows dose-dependent binding to (e) αCOPI-WD40 but not (f) β'COPI-WD40. g Acidification enhances binding between the wild-type spike hepta-peptide and αCOPI-WD40 domain. h β'COPI-WD40 shows weakly enhanced binding to the spike hepta-peptide upon acidification. "n.d." implies not determined for weak interactions.
Recent investigations of SARS-CoV-2 spike and previously of SARS-CoV spike with COPI have employed cellular lysates 10, 15, 30 . Hence, we asked if there is direct interaction between the purified components. To address this question, we first established a BLI assay to test this interaction. Monomeric constructs of the spike cytosolic domain demonstrate similar interactions with COPI as the trimeric cytosolic domain 10 . Hence, a hepta-peptide representing a monomeric dibasic motif in the sarbecovirus spike ( 1267 Gly-Val-Lys-Leu-His-Tyr-Thr 1273 ) was synthesized with an N-terminal biotin tag attached via a linker. The hepta-peptide C-terminus has a free carboxylate to mimic the C-terminus of a polypeptide. This hepta-peptide (or its sequence variants) was immobilized on a streptavidin biosensor for BLI analysis. The purified α or β'COPI-WD40 domain was provided as the analyte in the BLI assay (Fig. 2b-h, Table 1). It was observed that the spike hepta-peptide binds directly to the purified αCOPI-WD40 domain with an equilibrium dissociation constant (K D ) = 4.17 ± 0.04 μM and a kinetic K D = 2.75 ± 0.09 μM at pH 7.5 (Fig. 2b, Table 1). A scrambled sequence of this hepta-peptide showed no detectable interaction with αCOPI-WD40 (Fig. 2c), suggesting that the binding of the wild-type hepta-peptide was sequence specific.
This binding analysis demonstrates that the C-terminal peptide of the sarbecovirus spike contains sufficient sequence and structural information to interact directly with αCOPI-WD40. This is consistent with prior COPI binding analyses with peptides corresponding to host dibasic motifs 27, 28 . In contrast, this sarbecovirus hepta-peptide demonstrates a lack of binding to β'COPI-WD40 (Fig. 2d ). This selectivity for αCOPI-WD40 is consistent with that reported for a similar spike hepta-peptide ( 1377 Phe-Glu-Lys-Val-His-Val-Gln 1383 ) from porcine epidemic diarrhea virus (PEDV), an α-coronavirus 28 . Mutation of the K-x-H motif to K-x-K in the sarbecovirus mutant peptide, i.e., 1267 Gly-Val-Lys-Leu-Lys-Tyr-Thr 1273 , demonstrates similar selectivity for αCOPI-WD40 over β'COPI-WD40 (Fig. 2e, f) . Cellular studies suggest enhanced interactions of this mutant spike sequence with COPI subunits 15, 30 . It is likely that this mutation affects the local conformation of the full-length spike protein leading to modulation of COPI interactions in a cellular environment. One of the first analyses of COPI involvement in SARS-CoV spike trafficking employed cellular lysate pull-downs to show enhanced spike-COPI interactions under acidification to pH 6.5 10 . As the sarbecovirus hepta-peptide contains His 1271 in the K-x-H motif, we asked if this acidification would affect hepta-peptide interaction with αCOPI-WD40. A nearly 3-fold enhancement in binding between the wild-type hepta-peptide and αCOPI-WD40 (equilibrium K D = 1.40 ± 0.15 μM) was observed upon acidification to pH 6.5, likely due to partial protonation of the His residue in the hepta-peptide K-x-H motif (Fig. 2g , Table 1 ). Relative to pH 7.5, this lower pH accelerated the association rate of the hepta-peptide with αCOPI-WD40 by a factor of 1.7 while concomitantly suppressing complex dissociation by another 1.7-fold ( Table 1 ). 
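The link between the kinetic fold changes and the overall change in affinity quoted above can be checked with a short calculation. The sketch below assumes the standard 1:1 binding relation K_D = k_off / k_on; the function names are illustrative and are not part of the original analysis. Only fold changes and equilibrium K D values stated in the text are used.

```python
def kd_from_rates(k_on: float, k_off: float) -> float:
    """Kinetic dissociation constant for 1:1 binding: K_D = k_off / k_on."""
    return k_off / k_on


def kd_fold_tightening(kon_fold_faster: float, koff_fold_slower: float) -> float:
    """Fold decrease in kinetic K_D when association speeds up by
    kon_fold_faster and dissociation slows down by koff_fold_slower."""
    return kon_fold_faster * koff_fold_slower


# Fold changes quoted in the text for pH 7.5 -> pH 6.5.
kinetic_fold = kd_fold_tightening(1.7, 1.7)

# Equilibrium K_D values quoted in the text (micromolar).
equilibrium_fold = 4.17 / 1.40

print(f"kinetic estimate: {kinetic_fold:.2f}-fold tighter")
print(f"equilibrium estimate: {equilibrium_fold:.2f}-fold tighter")
```

Both estimates come out close to 3-fold, so the kinetic and equilibrium descriptions of the pH effect agree with each other.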
As such, acidification was inferred to be a key factor in stabilizing the hepta-peptide complex with αCOPI-WD40. All subsequent BLI assays were performed at pH 6.5. The binding of β'COPI-WD40 to the wild-type spike peptide is still substantially weaker than that with αCOPI-WD40 at low pH (Fig. 2h).
The terminal residues in the spike hepta-peptide are key modulators of αCOPI-WD40 binding and dissociation. Homology modeling was employed to analyze the structural basis of interaction between SARS-CoV-2 spike hepta-peptide and αCOPI-WD40 domain. This modeling was based on a prior co-crystal structure of αCOPI-WD40 domain with a dibasic peptide 28 . Apart from the Lys 1269 and His 1271 residues in the K-x-H motif, the terminal Tyr 1272 -Thr 1273 residues in the hepta-peptide are within interaction distance of αCOPI-WD40 surface residues (Fig. 3a). This suggests a role for these two spike residues in binding αCOPI. Interestingly, the two N-terminal residues in the hepta-peptide, i.e., Gly 1267 -Val 1268 , make no contact with the αCOPI-WD40 surface. To evaluate the role of the spike residues in binding αCOPI-WD40, in silico alanine scanning mutagenesis of the modeled spike hepta-peptide was performed (Table 2). The spike Lys 1269 and His 1271 residues that constitute the K-x-H dibasic motif are predicted to be most crucial for binding αCOPI-WD40 domain. The in silico mutations of these residues to Ala yield highly unfavorable free energy changes suggestive of substantially weakened binding to αCOPI-WD40 (Table 2). The Ala mutation of Tyr 1272 in the spike peptide implies a substantial role of this residue in stabilization of the spike-αCOPI-WD40 complex. This is likely due to the side chain interaction between the oxygen atom in the Tyr 1272 side-chain hydroxyl group and the αCOPI-WD40 His 31 side-chain NE2 atom, along with main chain interactions of Tyr 1272 .
The terminal residue in the spike, i.e., Thr 1273 , is predicted to contribute modestly to the stabilization of the complex with αCOPI-WD40 (Table 2 ). Next, we tested this in silico model of interactions between the spike hepta-peptide and αCOPI-WD40 using BLI assays ( Fig. 3b-g) . The mutagenesis of Lys 1269 or His 1271 in the spike K-x-H motif to Ala residues abolished binding to αCOPI-WD40 (Fig. 3b, c) . As expected, the dual Ala mutation of the K-x-H motif does not demonstrate any substantial binding to αCOPI-WD40 (Fig. 3d ). As such, both basic residues in SARS-CoV-2 spike K-x-H motif are individually and concomitantly required for αCOPI-WD40 binding. Replacement of either residue is sufficient to disrupt αCOPI-WD40 binding to the spike hepta-peptide. These data are consistent with the in silico predictions described above as well as with cellular assays on SARS-CoV and SARS-CoV-2 spike trafficking 10, 15, 30 . We next tested the contribution of spike hepta-peptide Tyr 1272 residue to αCOPI-WD40 binding. A BLI assay of a mutant Tyr 1272 → Ala spike heptapeptide ( 1267 Gly-Val-Lys-Leu-His-Ala-Thr 1273 ) yielded an equilibrium K D = 3.77 ± 0.34 μM, which is 2.7-fold weaker than the wildtype spike peptide (Fig. 3e , Table 1 ). Although this mutation only reduced the rate of complex formation by 1.3-fold relative to the wild-type hepta-peptide (Table 1) , it accelerated complex dissociation by 1.9-fold (Table 1 ). This suggests weakened interactions of the spike hepta-peptide with αCOPI-WD40 when the aromatic side chain interactions of Tyr 1272 are abrogated. Collectively, this BLI analysis indicates that Tyr 1272 is important for complex stability. These experimental results are consistent with the above described in silico model (Table 3) , (Table 4 ). Next, we evaluated the C-terminal position Thr 1273 in the spike. 
A BLI assay of a mutant Thr 1273 → Ala hepta-peptide ( 1267 Gly-Val-Lys-Leu-His-Tyr-Ala 1273 ) yielded an equilibrium K D = 1.13 ± 0.05 μM, which is similar to the wild-type hepta-peptide (K D = 1.40 ± 0.15 μM) (Fig. 3f, Table 1). This Thr 1273 → Ala mutation slowed αCOPI-WD40 association and dissociation kinetics by 2.3-fold and 1.9-fold, respectively (Table 1). This suggested that a β-branched residue at the C-terminus may be an important determinant in complex formation kinetics. To probe further, we generated a Thr 1273 → Val mutant hepta-peptide, which maintains a β-branched residue at the C-terminus. Val has methyl groups at the two side chain γ positions, which replace a methyl and a hydroxyl group at equivalent γ positions in Thr. A BLI assay of this mutant hepta-peptide showed a 2.3-fold weakened interaction with αCOPI-WD40 relative to the wild-type, with an equilibrium K D = 3.20 ± 0.04 μM (Fig. 3g, Table 1). Interestingly, this mutant demonstrated 1.5-fold slower association kinetics and 1.5-fold more rapid dissociation than the wild-type hepta-peptide (Table 1). Compared to the Thr 1273 → Ala hepta-peptide, this Thr 1273 → Val mutant weakened binding by 2.8-fold while accelerating αCOPI-WD40 complex association and dissociation kinetics by 1.5-fold and 2.8-fold, respectively (Table 1). These data suggest that the spike C-terminal Thr residue side-chain provides interactions that modulate dissociation of the complex with αCOPI-WD40. Interestingly, a prior analysis implicated β-branched residues at the penultimate position in the PEDV spike sequence in modulating interactions with COPI-WD40 domains 28 .
Electrostatics of spike hepta-peptide C-terminus drive dissociation from αCOPI-WD40. We performed an in silico analysis of the human proteome to gain insights into whether the spike extended dibasic motif demonstrates consistency with host dibasic motifs and their environment.
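A proteome scan of this kind can be illustrated with a minimal sketch: select sequences whose last five residues match K-x-K-x-x or K-x-H-x-x, then tally amino acid frequencies at each of those five positions. The regular expression, function names, and toy sequences are assumptions made for illustration. Only the first toy sequence (the spike hepta-peptide) comes from the text, and the membrane-protein prediction step used in the study is not reproduced here.

```python
import re
from collections import Counter

# C-terminal dibasic motif: Lys, any residue, Lys or His, any two residues, end.
DIBASIC_CTERM = re.compile(r"K.[KH]..$")


def has_cterm_dibasic(seq: str) -> bool:
    """True if the sequence terminates with K-x-K-x-x or K-x-H-x-x."""
    return DIBASIC_CTERM.search(seq) is not None


def positional_frequencies(seqs, tail_len=5):
    """Amino acid frequencies at each of the last tail_len positions,
    computed only over sequences that carry the C-terminal motif."""
    tails = [s[-tail_len:] for s in seqs if has_cterm_dibasic(s)]
    freqs = []
    for pos in range(tail_len):
        counts = Counter(tail[pos] for tail in tails)
        freqs.append({aa: n / len(tails) for aa, n in counts.items()})
    return freqs


# Toy input: the spike hepta-peptide plus three hypothetical sequences.
toy = ["GVKLHYT", "AAKAKDE", "LLKTKLD", "MMRRRRR"]
freqs = positional_frequencies(toy)

for offset, table in zip(range(-5, 0), freqs):
    print(offset, table)
```

In this layout the last entry of `freqs` describes the C-terminal position and the second-to-last entry describes the penultimate position, which are the two positions discussed in the text.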
We identified 119 sequences predicted to be membrane proteins that terminate with K-x-H-x-x and K-x-K-x-x dibasic motifs (Supplementary Table 2). These sequences were aligned and analyzed for the frequency of the 20 amino acids at each of the positions in the dibasic motif and the two terminal residues following the motif (Fig. 4a, Supplementary Table 3). This analysis revealed novel details about the dibasic motif. First, it was inferred that the predominant dibasic motif is K-x-K-x-x rather than K-x-H-x-x by nearly an order of magnitude. Second, only a low frequency (0.07) of the sequences has an aromatic residue at the penultimate position, which corresponds to Tyr 1272 in the SARS-CoV-2 spike. β-Branched residues Leu, Ile, Val, Ser, and Thr are found at a high frequency of 0.38 at this penultimate position. Third, acidic residues at the C-terminus are observed in nearly a quarter (frequency = 0.24) of the sequences. Overall, with a frequency of 0.42, the C-terminal position has a strong tendency to be occupied by charged residues such as Arg, Asp, Glu, His, and Lys. Thr, which contains a hydroxyl side chain and corresponds to Thr 1273 in SARS-CoV-2 spike, is a low frequency residue (0.05) at this position. In our in silico model of the hepta-peptide complexed with αCOPI-WD40, the side chain of this Thr 1273 residue is within interaction distance of a cluster of basic residues in αCOPI-WD40 (Arg 13 , Lys 15 , and Arg 300 , Fig. 4b). Hence, we hypothesized that the presence of a charged residue at this spike position would modulate interactions with αCOPI-WD40. This was supported by our in silico analysis, which predicted stabilization of the complex when an acidic Glu residue replaced Thr 1273 in the spike hepta-peptide (Table 2).
Fig. 3 Structure-guided mutagenesis of spike hepta-peptide and binding analysis with αCOPI-WD40 domain. a In silico model of the spike hepta-peptide complexed with αCOPI-WD40 domain (yellow surface). The hepta-peptide is shown as a ribbon in rainbow colors from N (blue) to C (red) terminus. The Cα-atoms in the hepta-peptide are shown as spheres. The side chains of residues that interact with αCOPI-WD40 are shown as sticks. b-g BLI analyses of αCOPI-WD40 binding to spike hepta-peptide mutants. One representative experiment of three is shown. Color code for concentrations is given at the bottom of the figure. The mutation in the spike hepta-peptide sequence is highlighted in bold and is underlined. The equilibrium K D is provided with each sensorgram for comparison. Mutagenesis of (b) Lys 1269 , (c) His 1271 , or (d) both abolishes binding to αCOPI-WD40. e In contrast, Tyr 1272 → Ala mutation only weakens binding to αCOPI-WD40. The middle panel shows weak binding of αCOPI-WD40 domain with a hepta-peptide wherein Lys 1269 has been mutated to Ala. f Mutagenesis of Thr 1273 to Ala in the spike hepta-peptide leads to moderately enhanced binding to αCOPI-WD40 whereas mutagenesis to Val 1273 weakens binding (g).
We next tested the role of this spike C-terminal residue in modulating αCOPI-WD40 binding using a BLI assay (Fig. 4c-e). We employed three distinct mutations of the spike hepta-peptide at this position, i.e., acidic (Glu), basic (Arg), and neutral (Gln) (Fig. 4c-e). The presence of an acidic Glu residue at the C-terminus was found to substantially strengthen binding of the hepta-peptide to αCOPI-WD40 with an equilibrium K D = 0.31 ± 0.01 μM, which is 4.5-fold tighter than the binding of the wild-type hepta-peptide sequence (Fig. 4c, Table 1). This is consistent with our in silico model and is strongly suggestive of an electrostatic interaction between the Glu 1273 side chain and αCOPI Arg 13 , Lys 15 , and Arg 300 side chains to stabilize the complex. Furthermore, the rate of dissociation of αCOPI-WD40 domain from the Glu 1273 hepta-peptide is 6-fold slower than that of the wild-type spike hepta-peptide (Table 1).
In fact, during the time course of our experiment, we did not observe complete dissociation of this complex with the Glu 1273 -containing hepta-peptide. To eliminate the possibility of non-specific interactions, we employed a hepta-peptide with a scrambled sequence (Supplementary Fig. 3). Next, we tested whether modifying side-chain charge at the C-terminal position of the spike hepta-peptide affects complex formation with αCOPI-WD40. Relative to Glu 1273 , the binding to αCOPI-WD40 domain was weakened when neutral Gln 1273 was substituted into the hepta-peptide (equilibrium K D = 0.78 ± 0.04 μM, Fig. 4d). However, the binding of the hepta-peptide with Gln 1273 was still 1.8-fold tighter than that of the wild-type hepta-peptide (Table 1). In contrast, basic Arg 1273 in the spike hepta-peptide (equilibrium K D = 1.73 ± 0.24 μM) yielded an interaction strength similar to the wild-type sequence (equilibrium K D = 1.40 ± 0.15 μM, Fig. 4e). The amide carbonyl group in the Gln 1273 side-chain likely interacts with the basic residue cluster on αCOPI-WD40 through hydrogen bonding. This stabilizing interaction is disrupted when Gln is replaced by Arg 1273 in the spike hepta-peptide. Intriguingly, the rate of association with αCOPI-WD40 was slowed down relative to the wild-type hepta-peptide by a factor of 2.6 for Glu 1273 and Gln 1273 , whereas the Arg 1273 -containing hepta-peptide associated at a rate similar to the wild-type (Table 1). Overall, these data establish a critical role of the C-terminal position in the SARS-CoV-2 spike in modulating binding to αCOPI-WD40.
Low frequency of acidic residues at the spike C-terminus. Given the inference that a C-terminal acidic residue strengthens binding to αCOPI-WD40, which likely interferes with spike release, we asked if a typical coronavirus spike demonstrates a paucity of such C-terminal acidic residues.
A sequence and phylogenetic analysis of the five C-terminal residues in the spike protein of coronaviruses was performed to determine the frequency of Asp and Glu at the C-terminus (Fig. 5). None of the coronavirus spike proteins that demonstrate the K-x-H-x-x, K-x-K-x-x, or K-x-R-x-x dibasic motif has a C-terminal acidic residue. It has been suggested that bats are the likely genetic source of human β-coronaviruses [31] [32] [33] . Apart from bats, during the SARS-CoV epidemic and the ongoing SARS-CoV-2 pandemic, zoonotic reservoirs such as civets and pangolins have been suggested to be involved in coronavirus transmission [33] [34] [35] [36] [37] [38] . Our phylogenetic analysis showed that the extended coatomer binding motif in the spike, i.e., 1269 Lys-Leu-His-Tyr-Thr 1273 , is conserved in coronavirus isolates from these animals (Fig. 5). Moreover, this conservation of the extended motif is seen in the SARS-CoV-2 variants of concern designated by the WHO, i.e., α (Pango lineage B.1.1.7), β (B.1.351), γ (P.1), δ (B.1.617.2), and the latest, ο (B.1.1.529) 39 . This is indicative of a strong selection pressure to maintain this COPI-interacting sequence in the spike protein.
A polar αCOPI-WD40 interface for spike hepta-peptide binding. We subsequently focused our attention on the spike binding residues in αCOPI-WD40. The in silico modeling of SARS-CoV-2 spike hepta-peptide shows that interaction with αCOPI-WD40 domain involves predominantly polar residues (Table 3). Amongst these residues, Arg 57 , Asp 115 , and Tyr 139 provide the highest level of stabilization to spike hepta-peptide binding (Table 4). The Arg 57 side-chain interacts with the main chain carbonyl of spike His 1271 , Tyr 1272 , and Thr 1273 (Fig. 6a). The Asp 115 side-chain forms a bond with the terminal NZ atom in the spike Lys 1269 side-chain (Fig. 6b). This side-chain of spike Lys 1269 is further stabilized by an interaction with the hydroxyl oxygen in Tyr 139 side-chain (Fig. 6c).
Hence, the side chains of αCOPI-WD40 Arg 57 , Asp 115 , and Tyr 139 residues provide an extensive and polar interaction network for binding of the spike hepta-peptide. Therefore, mutagenesis of these three αCOPI-WD40 residues to Ala is predicted to disrupt interactions with the spike hepta-peptide as suggested by our in silico analysis (Table 3) . Next, the role of αCOPI-WD40 Arg 57 , Asp 115 , and Tyr 139 residues in binding the spike hepta-peptide was tested. We generated three single-site mutants of αCOPI-WD40 wherein Arg 57 , Asp 115 , and Tyr 139 residues were individually mutated to Ala. These mutants were expressed heterologously and purified from Expi293 cells. Analysis by size-exclusion chromatography (SEC) suggested an overall similarity in hydrodynamic radius with the wild-type αCOPI-WD40 domain ( Supplementary Fig. 2) . These three mutants were analyzed for binding to the wild-type spike hepta-peptide by BLI assays. None of the three mutants demonstrates any binding to the wild-type sequence of the spike hepta-peptide (Fig. 6d-f ). This demonstrated that αCOPI-WD40 residues Arg 57 , Asp 115 , and Tyr 139 are individually critical for binding the spike hepta-peptide. Disruption of even one of these interactions is likely sufficient to destabilize the spike-COPI complex. Structural basis of conformational changes in an αCOPI-WD40 mutant. The results of αCOPI-WD40 Arg 57 , Asp 115 , and Tyr 139 mutagenesis led us to ask if the loss of binding to the spike hepta-peptide was due to disruption of a single critical interaction or due to larger alterations in the protein structure. This is because the surface of αCOPI-WD40 contains an intricate network of residues with charged and polar side-chains. Mis-sense mutations that alter charge balance could modify αCOPI-WD40 conformations. To address this question, we crystallized αCOPI-WD40 Arg 57 → Ala and Tyr 139 → Ala mutants. The αCOPI-WD40 Asp 115 → Ala mutant did not yield crystals in the conditions we tested. 
The crystal structures of αCOPI-WD40 Arg 57 → Ala and Tyr 139 → Ala mutants were determined by X-ray diffraction to a resolution of 1.2 Å and 1.5 Å, respectively (Supplementary Table 1). The αCOPI-WD40 Arg 57 → Ala mutant structure demonstrated novel structural alterations that had previously not been reported in the crystal structures of wild-type αCOPI-WD40 or the related β'COPI-WD40 (Supplementary Fig. 4a). The mutation of Arg 57 to an Ala residue generated a cavity in the spike hepta-peptide binding site. This change led to a 62° rotation of a nearby Tyr 97 residue side chain into the newly generated cavity in αCOPI-WD40 (Supplementary Fig. 4b). In parallel, the residue Asp 73 underwent a substantial conformational change. This residue interacts with the side chain of Arg 57 in the wild-type αCOPI-WD40 structure. However, the loss of stabilizing interactions from the Arg 57 side chain and the reorientation of Tyr 97 caused the Asp 73 side chain to rotate away by 73° from its initial position (Supplementary Fig. 4b). These conformational changes are accompanied by a 1.1 Å and 0.8 Å movement of Tyr 139 and His 31 side chains, respectively, closer towards the spike hepta-peptide as inferred from our in silico model. In contrast, the side chain of Lys 15 moves 1.7 Å away from the inferred hepta-peptide position. As such, the binding site and its vicinity demonstrate a substantially modified interaction network in the Arg 57 → Ala mutant. Next, we asked if the Arg 57 → Ala mutation and the associated rotameric changes caused any main chain reorganization in αCOPI-WD40. To obtain a global overview of changes in the main chain geometry, the differences in Ramachandran angles were calculated between corresponding residues in the wild-type and Arg 57 → Ala crystal structures (Supplementary Fig. 4c).
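A per-residue comparison of backbone dihedral angles can be sketched as follows. The exact metric behind the difference Ramachandran plot is not specified in the text, so this example assumes one common choice: wrap each angular difference into the range [-180°, 180°) and combine Δφ and Δψ as a Euclidean distance. The function names and the toy angles are hypothetical and are not taken from the crystal structures.

```python
import math


def wrapped_diff(a: float, b: float) -> float:
    """Smallest signed difference a - b between two angles in degrees,
    returned in the range [-180, 180)."""
    return (a - b + 180.0) % 360.0 - 180.0


def ramachandran_difference(wt: dict, mut: dict) -> dict:
    """Per-residue sqrt(dphi^2 + dpsi^2) for residues present in both
    structures. Inputs map residue number -> (phi, psi) in degrees."""
    result = {}
    for res in sorted(set(wt) & set(mut)):
        dphi = wrapped_diff(mut[res][0], wt[res][0])
        dpsi = wrapped_diff(mut[res][1], wt[res][1])
        result[res] = math.hypot(dphi, dpsi)
    return result


# Hypothetical angles for two residues; not values from the structures.
wt_angles = {71: (-60.0, -45.0), 72: (80.0, 10.0)}
mut_angles = {71: (-62.0, -43.0), 72: (-100.0, 170.0)}

diff = ramachandran_difference(wt_angles, mut_angles)
top_residue = max(diff, key=diff.get)
print(top_residue, round(diff[top_residue], 1))
```

Wrapping matters because φ and ψ are periodic: without it, a change from 170° to -170° would be scored as 340° rather than 20°, and spurious peaks would appear in the plot.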
The top peak in this difference Ramachandran plot, i.e., peak 1, corresponds to a substantial main chain twist at Gly 72 , which is near the Arg 57 → Ala mutation site. This conformational change is associated with the Asp 73 side chain rotation and repositioning of the main chain atoms from Gly 72 to Val 77 , which are pushed away from the domain core consistent with the reorientation of the Asp 73 side chain (Supplementary Fig. 4d). Peaks 2 and 4 in this difference Ramachandran plot correspond to changes in surface loops that are 19 Å and 25 Å from the mutation site and are likely due to crystal contacts. Peaks 3 and 6 correspond to main chain rearrangement at the mutation site, i.e., Arg 57 → Ala. This is likely a combined effect of the mutation itself and the side chain rearrangements in the neighborhood of Ala 57 described above. Peak 5 is associated with a surface loop 31 Å from the mutation site. This loop demonstrates weak electron density and is only partly ordered. Hence, three of the top six peaks, i.e., 1, 3, and 6, in this analysis are associated with considerable rearrangement of the αCOPI-WD40 surface upon the mutation of basic Arg 57 to neutral Ala, including the site for spike hepta-peptide binding. In contrast to the Arg 57 → Ala substitution, the crystal structure of the αCOPI-WD40 Tyr 139 → Ala mutant demonstrated no substantial changes as compared to the wild-type structure (Supplementary Fig. 4c, e). No major rearrangements of side chains or main chains were observed. This lack of conformational rearrangement contrasts with the structural changes in the αCOPI-WD40 Arg 57 → Ala mutant structure. It is likely that the electroneutral change from Tyr 139 to Ala does not perturb the local electrostatic surface sufficiently to alter protein conformation. Hence, disruption of spike hepta-peptide binding in this Tyr 139 → Ala mutant is due to the loss of a single side chain hydroxyl group.
Collectively, the crystal structures of αCOPI-WD40 Arg 57 → Ala and Tyr 139 → Ala mutants reveal distinct and contrasting structural principles by which spike hepta-peptide binding is disrupted. An analysis of crystal packing in the αCOPI-WD40 structures reported here showed that the peptide binding site residues are in contact with symmetry-related chains. We subsequently asked if distinct crystal packing may have contributed to these different structural consequences of Arg 57 → Ala and Tyr 139 → Ala mutations. However, similar crystal packing interactions are provided by residues His 267 and Lys 309 from a symmetry-related αCOPI-WD40 chain to the spike hepta-peptide binding site. As such, conformational differences in these two αCOPI-WD40 mutants are due to altered interaction chemistry in the hepta-peptide binding site.

[Figure caption, partial] This phylogram was generated from the alignment of five C-terminal residues in the spike protein. This penta-residue spike sequence is shown in italics to the right of each coronavirus. The residues in the dibasic motif are underlined. An acidic Asp is seen at the C-terminus of the spike proteins of only those coronaviruses that lack a dibasic motif.

Conservation of αCOPI-WD40 residues critical for spike hepta-peptide binding. The mutagenesis, BLI, and crystallographic analyses described here are focused on αCOPI. Hence, we asked if αCOPI Arg 57, Asp 115, and Tyr 139 residues are conserved in bats, pangolins, camels, and humans, which have been implicated as zoonotic reservoirs and hosts for β-coronaviruses [40] [41] [42] [43] [44]. Overall, αCOPI is 96.5-99.8% identical in these multicellular higher organisms (Supplementary Table 4). In contrast, αCOPI conservation is relatively moderate between these organisms and yeast at 46.8-47.1% sequence identity.
An analysis of αCOPI sequence conservation across 150 species using the CONSURF server 45 demonstrated that Arg 57 and Asp 115 are completely conserved whereas Tyr 139 is replaced by Phe or Trp in 5.3% and 0.7% of the sequences, respectively (Supplementary Data 1 and 2). Importantly, all three of these αCOPI residues are found to be 100% identical in yeast, bat, pangolin, camel, and human αCOPI. This suggests an evolutionary pressure on these three residues in binding dibasic motifs in host proteins, which is exploited by the sarbecovirus spike to hijack the host COPI machinery. This conservation of the three αCOPI residues extends to chicken, which has been suggested to be a host for γ- and δ-coronaviruses 31. αCOPI residues such as Lys 15 and His 31 that are suggested to be involved in spike hepta-peptide binding by our in silico analysis demonstrate complete conservation. Interestingly, this conservation extends to S. cerevisiae β'COPI wherein αCOPI residues Arg 57 and Asp 115 are replaced by Arg 59 and Asp 117 in β'COPI, respectively. However, αCOPI Tyr 139 is semi-conserved and is replaced by Phe 142 in β'COPI.

Once the sarbecovirus spike is delivered from ER to Golgi, its trafficking to the progeny virus assembly site consists of three distinct steps, i.e., spike-COPI binding in donor membranes such as cis-Golgi, inter-organelle trafficking, and dissociation of spike-COPI at the destination, which is ERGIC 8. This trafficking pathway can be disrupted either by weakening of spike-COPI binding, which leads to premature complex dissociation, or by enhanced stability of this complex, which interferes with spike release. This is supported by recent cellular imaging and biochemical analysis of the SARS-CoV-2 spike protein 15. Hence, elucidating the determinants of spike-COPI interactions is fundamental to understanding sarbecovirus assembly.
Employing a spike hepta-peptide and a purified αCOPI-WD40 domain, the present investigation expounds on the biophysical and structural bases of spike-COPI interactions. We demonstrate that direct binding of purified αCOPI-WD40 domain to the SARS-CoV-2 spike hepta-peptide is modulated by an extended coatomer binding motif that stretches beyond the spike K-x-H residues. Our data show that residues such as acidic Glu in the C-terminal position in the spike likely interact with complementary charged basic residues in αCOPI-WD40. This interaction strengthens spike binding to the host αCOPI. This analysis is consistent with a recent preprint that shows a key role of this SARS-CoV-2 spike C-terminal position in pull-down assays of the spike cytosolic domain with COPI subunits 30. A second cellular investigation has recently shown that the inferred stabilization of the spike-COPI complex by a Lys 1269-x-His 1271 → Lys 1269-x-Lys 1271 spike mutation has dramatic effects on SARS-CoV-2 spike processing and trafficking (Jennings et al., 2021). This functional analysis suggests a key role of spike-COPI complex dissociation in modulating spike trafficking and function. Hence, it is likely that residues that strengthen spike-COPI complex stability beyond that from wild-type interactions are avoided in the spike C-terminus. This includes the acidic Glu and unbranched Ala residues that stabilize the αCOPI-WD40 domain as demonstrated in the present investigation. Interestingly, our analysis of the human membrane proteome suggests that the occurrence of a charged residue such as Glu and β-branched residues is a high probability event at the C-terminus of dibasic motifs. This raises an intriguing question of whether such charged residues present a structural and biophysical disadvantage to spike-COPI interactions, and hence, are selected against in sarbecoviruses. Such a highly stabilized complex may not undergo dissociation in ERGIC to release the spike for processing and downstream virion assembly. Interestingly, acidic residues are absent from the C-terminus of coronavirus spike proteins that have a dibasic motif for COPI dependent trafficking. We note that complex formation of Glu 1273 or Gln 1273 containing spike hepta-peptide is substantially slower than in the wild-type. Even Arg 1273, which lacks charge complementarity with αCOPI-WD40 residues, shows slower association kinetics. Glu, Gln, and Arg have long side chains unlike Thr 1273, which is suggestive of a role of side chain size in modulating interactions with COPI.

[Figure caption, beginning missing] Tyr 139 residues in αCOPI-WD40 in an in silico model. These three interacting αCOPI-WD40 residues are shown as yellow-red-blue sticks whereas the other residues are shown as a yellow surface for simplicity. The corresponding interacting residues in the spike hepta-peptide are labelled and shown as green-red-blue sticks and spheres for Cα atoms. The BLI analysis of Arg 57 → Ala, Asp 115 → Ala, and Tyr 139 → Ala mutants with the wild-type spike hepta-peptide is shown in panels (d), (e), and (f), respectively. All three mutants demonstrate no substantial binding of the spike hepta-peptide. One representative experiment of three is shown in panels (d-f).

On the coronavirus side, the spike dibasic motif and adjacent residues demonstrate complete conservation in β-coronavirus isolates from bats, pangolins, civets, camels, and humans [33] [34] [35] [36] [37] [38] 46. However, variations in the K-x-H motif as well as the absence of a dibasic motif in the spike in animal isolates of coronaviruses suggest that this mechanism of direct interactions between COPI and the spike may not be universal. It has been suggested that dibasic motifs in coronavirus proteins other than the spike may be involved in modulating COPI dependent trafficking by oligomerization with the spike protein 9.
An example is bovine coronavirus, which demonstrates a dibasic motif in the enzyme hemagglutinin esterase but not in the spike protein 47. The interaction of this enzyme with the spike protein 48 offers a potential route for COPI dependent trafficking of the spike.

Our investigation identifies three αCOPI residues, Arg 57, Asp 115, and Tyr 139, as essential for spike hepta-peptide binding. These αCOPI residues are completely conserved across organisms associated with β-coronavirus infections such as bats, pangolins, camels, and humans and in chicken, which is infected by γ- and δ-coronaviruses. This suggests a likely conserved mechanism for COPI dependent spike trafficking. Interestingly, the critical αCOPI residues identified in the present analysis are broadly consistent with a prior genetic and biophysical study that implicated αCOPI Arg 57 and Lys 15, and β'COPI Arg 59 and Asp 117 (equivalent to αCOPI Asp 115) as critical for dibasic motif binding, retrograde trafficking, and growth of yeast cells 27. Building on this prior investigation, our crystallographic analysis of αCOPI Arg 57 → Ala and Tyr 139 → Ala mutants presents two complementary structural results to substantially advance the understanding of how these residues are critical for COPI architecture. The αCOPI Arg 57 → Ala mutant demonstrates a rearrangement of the spike hepta-peptide binding site and of neighboring residues whereas the Tyr 139 → Ala mutant structure is largely similar to the wild-type αCOPI structure. Yet, both mutants demonstrate the same functional outcome, i.e., loss of spike hepta-peptide binding. Given this structural sensitivity of αCOPI, and presumably β'COPI, to changes in electrostatics, this raises an interesting question about the structural basis of how mutations in these subunits alter normal retrograde trafficking.
It is relevant to note that the COPA gene, which encodes the human homolog of αCOPI, has been implicated in a set of clinical disorders collectively known as the COPA syndrome 49, 50. Here, mis-sense mutations including ones that modify side chain charge in the WD40 domains compromise COPA protein function in retrograde trafficking 49. Based on data presented here, it would be of interest to investigate the structural basis of this dysfunction to gain deeper insights into COPI biology. In conclusion, our present analysis and supporting prior investigations demonstrate that the extended dibasic motif in the sarbecovirus spike functions as an effective tool to hijack the COPI complex involved in retrograde trafficking. In broader terms, our structural analysis provides a basis to further investigate the structural and functional consequences of αCOPI and β'COPI mutations in disrupting retrograde trafficking.

Protein and peptide production. The S. pombe αCOPI-WD40 domain was synthesized by TOPGENE and cloned in pcDNA3.1(+) with a C-terminal strep-tag for affinity purification. Five mutations (Leu 181 → Lys, Leu 185 → Lys, Ile 192 → Lys, Leu 196 → Lys and Phe 197 → Lys) were incorporated in the gene to improve solubility as suggested previously 28. Expression was performed in Expi293 mammalian cells using the Thermo Fisher ExpiFectamine expression kit. Protein purification was performed by affinity chromatography of the clarified cellular lysate followed by SEC in a Superdex 75 chromatography column. Arg 57 → Ala, Asp 115 → Ala, and Tyr 139 → Ala mutants of the αCOPI-WD40 domain were expressed and purified as described for the wild-type protein. The purified αCOPI-WD40 domain in 150 mM NaCl, 5 mM dithiothreitol (DTT), 10% glycerol, and either 20 mM Tris-HCl (pH 7.5) or 50 mM MES-NaOH (pH 6.5) was flash-frozen in liquid nitrogen and stored until further experimentation. All mutations in pcDNA3.1(+)-αCOPI-WD40 were made by GenScript. β'COPI-WD40 (residues 1-301) from S.
cerevisiae was cloned in pSUMO vector with an N-terminal strep-tag, a Hisx8 tag, and a Ulp1 protease cleavage site, and expressed overnight in E. coli pLysS cells at 18°C. This fusion protein was purified by affinity chromatography and SEC in 150 mM NaCl, 5 mM dithiothreitol (DTT), 10% glycerol, and 20 mM Tris-HCl (pH 7.5) followed by overnight digestion with Ulp1 protease. The digested β'COPI-WD40 domain was subjected to negative purification by Ni-NTA and SEC and was flash-frozen in liquid nitrogen and stored until further experimentation. Peptide synthesis was performed by Biomatik (USA) with an N-terminal biotin tag and a (PEG)4 linker between the tag and the peptide. No modification was performed at the C-terminus of the peptides, thereby leaving a free terminal carboxylate group.

BLI assay. Biotinylated spike hepta-peptides were tethered to streptavidin (SA) biosensors (FortéBio) in a 96-well plate format. Purified αCOPI-WD40 domain was provided as the analyte. Kinetics measurements for determination of binding affinity were performed on an Octet RED96 system (FortéBio). Data acquisition was carried out using the Data Acquisition 11.1 suite. Briefly, SA biosensors were hydrated in 200 µL of kinetics buffer (20 mM Tris-HCl (pH 7.5) or 50 mM MES-NaOH (pH 6.5), 150 mM NaCl, 5 mM DTT, 10% glycerol, 0.2 mg/ml bovine serum albumin (BSA), and 0.002% Tween 20) for 10 minutes prior to binding. The spike hepta-peptide (5 µg/ml) was loaded on the biosensors for 15 s. A baseline was established by rinsing the biosensor tips in the kinetics buffer for 30 s. This was followed by association with αCOP in varying concentrations over 60 s and dissociation in the baseline well for 90 s. A temperature of 25°C and a shake speed of 1000 rpm were maintained during acquisition. All experiments were carried out in triplicate. A new sensor was used for each replicate. Data processing and analysis were performed in the FortéBio Data Analysis 11.1 software suite.
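For readers less familiar with BLI kinetics, the association and dissociation steps described above are commonly summarized with a 1:1 binding model, in which the response rises toward a concentration-dependent plateau and then decays exponentially once the sensor is returned to buffer. The Python sketch below illustrates that model only. It is not the FortéBio fitting routine, and the function names and all numerical values (`ka`, `kd`, `rmax`, `conc`) are hypothetical choices made for illustration, not results from this study.

```python
import math


def association(t, conc, ka, kd, rmax):
    """Response at time t (s) during association for a 1:1 binding model.

    conc is the analyte concentration (M), ka the association rate
    constant (1/(M*s)), kd the dissociation rate constant (1/s), and
    rmax the response at saturation.
    """
    kobs = ka * conc + kd              # observed rate at this concentration
    req = rmax * ka * conc / kobs      # plateau response at this concentration
    return req * (1.0 - math.exp(-kobs * t))


def dissociation(t, r0, kd):
    """Response at time t (s) after the start of dissociation from response r0."""
    return r0 * math.exp(-kd * t)


def equilibrium_constant(ka, kd):
    """Equilibrium dissociation constant KD = kd / ka (M)."""
    return kd / ka


# Hypothetical parameters, for illustration only
ka = 1.0e4     # 1/(M*s)
kd = 1.0e-2    # 1/s
rmax = 1.0     # arbitrary response units
conc = 2.0e-6  # M

r_end = association(60.0, conc, ka, kd, rmax)  # end of a 60 s association step
r_off = dissociation(90.0, r_end, kd)          # end of a 90 s dissociation step
print(f"KD = {equilibrium_constant(ka, kd):.2e} M")
print(f"response after association: {r_end:.3f}; after dissociation: {r_off:.3f}")
```

In a global fit, a single pair of `ka` and `kd` values is shared across all analyte concentrations, which is why the concentration appears only through `kobs` and the plateau term.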
The 0 µM αCOP signal was subtracted from the raw data as a reference. The baseline step immediately before the association step was used for the alignment of the y-axis. An interstep correction between the association and dissociation steps was performed. Reference-subtracted curves were processed with the Savitzky-Golay filtering method and subjected to global fitting using a 1:1 binding model. All fits to BLI data had an R² value (goodness of fit) > 0.9.

Crystallization and structure determination. Purified αCOPI-WD40 domain was concentrated to 2 mg/ml in 20 mM Tris-HCl (pH 7.5), 150 mM NaCl, 5 mM DTT and 10% glycerol buffer. Crystal trays were set up with the hanging drop vapor diffusion method with 0.5 µL of αCOPI-WD40 mixed with an equal volume of reservoir buffer. Crystals were obtained within 48 h at 22°C in 20% PEG3350 and 0.25 M sodium citrate tribasic dihydrate. Crystals were cryo-protected in mother liquor supplemented with 20% ethylene glycol and flash-frozen in liquid nitrogen. Purified Arg 57 → Ala and Tyr 139 → Ala mutants of αCOP were concentrated to ~2.2 mg/ml and crystallized as described for the wild-type protein. Crystals for Arg 57 → Ala were obtained in 22% PEG3350 and 0.2 M trisodium citrate and for Tyr 139 → Ala in 18% PEG3350 and 0.2 M potassium-sodium tartrate. The crystals for Arg 57 → Ala and Tyr 139 → Ala αCOPI-WD40 mutants were cryo-protected in 20% glycerol and 20% ethylene glycol, respectively. X-ray diffraction data for wild-type αCOPI-WD40 were collected at beamline GM/CA 23-ID-D of the Advanced Photon Source at the Argonne National Laboratory, and data for the mutants were collected at beamline 17-ID-1 AMX of the National Synchrotron Light Source II (NSLS II) at the Brookhaven National Laboratory. The X-ray diffraction data for the wild-type protein crystals were indexed, integrated and scaled using HKL3000 51 whereas those for the mutants were processed using XDS 52 as part of the data acquisition and processing pipeline at the beamline.
The data processing statistics are given in Supplementary Table 1. The scaled data were merged using AIMLESS in CCP4 suite 53. Molecular replacement was performed in Phenix using a previously determined αCOPI-WD40 domain structure (PDB ID 4J87) as the search model 54, 55. Iterative model building and refinement were performed in Phenix.refine 56.

Analysis of Ramachandran angles. The crystal structures of wild-type, Arg 57 → Ala, and Tyr 139 → Ala αCOPI-WD40 were analyzed in MolProbity 59. For each structure pair, i.e., wild-type with Arg 57 → Ala or wild-type with Tyr 139 → Ala, the per-residue difference in Ramachandran angles was determined using equation (1). Here, Ramachandran angles for wild-type and mutant structures are represented as (ψ_WT, ϕ_WT) and (ψ_m, ϕ_m), respectively. Each structure pair was superimposed in PyMOL and inspected to ensure consistency with the results of the Ramachandran angle analysis. We identified three instances of surface-exposed residues (Asp 96 and Asn 257 in the wild-type coordinates, and Ser 11 in the Arg 57 → Ala coordinates) where the corresponding main chain atoms in the wild-type and mutant structures were highly similar but the sign of a dihedral angle close to 180° had been flipped. The Cα rmsd of short penta-residue stretches of the polypeptide chain centered at each of these residues was 0.14 Å, 0.06 Å, and 0.18 Å, respectively. The signs of the Ramachandran angles for these residues were corrected manually.

In silico analysis of sarbecovirus spike hepta-peptide with αCOPI-WD40. Structural modeling of the SARS-CoV-2 spike C-terminus peptide (sequence: GVKLHYT) in complex with the αCOPI-WD40 domain was performed using homology modeling in Modeller 60 and the structure of the αCOP-WD40 complexed with Emp47p peptide (PDB ID 4J8B) as a template. Prior to computational mutagenesis, models were processed with FastRelax 61 in Rosetta (v. 3.5), with backbone and side chain atoms constrained to the input coordinates.
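Returning briefly to the Ramachandran angle analysis described above, a per-residue comparison of this kind can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the difference is taken here as the Euclidean distance between the (ϕ, ψ) pairs of corresponding residues, which is an assumed form and may differ from equation (1), and the function names and example angles are hypothetical. The optional periodic wrapping is one way to handle the situation noted above, where a dihedral angle close to 180° changes sign between two nearly identical conformations; the manual correction used in this work is not reproduced.

```python
import math


def wrap_angle(delta):
    """Wrap an angular difference in degrees into the interval [-180, 180)."""
    return (delta + 180.0) % 360.0 - 180.0


def ramachandran_difference(phi_wt, psi_wt, phi_m, psi_m, periodic=True):
    """Distance in (phi, psi) space between a wild-type and a mutant residue.

    Assumed form: sqrt(d_phi**2 + d_psi**2). With periodic=True, each
    angular difference is wrapped so that 179 and -179 degrees differ
    by 2 degrees rather than 358 degrees.
    """
    d_phi = phi_wt - phi_m
    d_psi = psi_wt - psi_m
    if periodic:
        d_phi = wrap_angle(d_phi)
        d_psi = wrap_angle(d_psi)
    return math.hypot(d_phi, d_psi)


def rank_residues(wt, mutant, top=6):
    """Return the residues with the largest differences, largest first.

    wt and mutant map residue number -> (phi, psi) in degrees; only
    residues present in both structures are compared.
    """
    common = sorted(set(wt) & set(mutant))
    diffs = [(res, ramachandran_difference(*wt[res], *mutant[res])) for res in common]
    diffs.sort(key=lambda item: item[1], reverse=True)
    return diffs[:top]


# Hypothetical angles for three residues, for illustration only
wt_angles = {1: (-80.0, 150.0), 2: (-65.0, -40.0), 3: (-120.0, 179.0)}
mutant_angles = {1: (60.0, 30.0), 2: (-70.0, -35.0), 3: (-121.0, -179.0)}

for residue, diff in rank_residues(wt_angles, mutant_angles, top=3):
    print(residue, round(diff, 1))
```

Without wrapping (`periodic=False`), residue 3 in this toy example would appear as a large peak even though its two conformations are nearly identical, which is the kind of artifact that motivates inspecting the superimposed structures.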
The command line parameter settings for FastRelax execution ("relax" executable) used were:

    -relax:constrain_relax_to_start_coords -relax:coord_constrain_sidechains -relax:ramp_constraints false -ex1 -ex2 -use_input_sc -correct -no_his_his_pairE -no_optH false -flip_HNQ -nstruct 1

Computational mutagenesis simulations to predict effects on binding affinities (ΔΔGs) for point substitutions were performed using a previously described protocol implemented in Rosetta (v. 2.3) 62. Default parameters were used, with the exception of extra rotamers allowed during packing of modeled side chains, specified by command line parameters:

    -extrachi_cutoff 1 -ex1 -ex2 -ex3

Sequence analysis of dibasic motifs in the human membrane proteome. UNIPROT identifiers of secreted and membrane-bound human proteins, as well as secreted/membrane-bound protein isoforms, were downloaded from the Human Protein Atlas (http://www.proteinatlas.org) 63. The corresponding protein sequences were obtained from UNIPROT, leading to ~6800 sequences, which were parsed using an in-house Perl script to identify C-terminal motif residues. Of these, 119 sequences that demonstrated a C-terminal dibasic motif were analyzed further for amino acid propensities in the dibasic motif and neighboring residues.
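The motif search described above can be illustrated with a short Python sketch. The in-house Perl script is not reproduced here; the regular expressions, function names, and all example sequences other than the spike hepta-peptide GVKLHYT are assumptions made for illustration. The patterns encode the K-x-K-x-x, K-x-H-x-x, and K-K-x-x motifs at the extreme C-terminus, and the tally counts which residue occupies the terminal position among motif-bearing sequences.

```python
import re
from collections import Counter

# K-x-K-x-x or K-x-H-x-x: Lys at position -5 and Lys/His at position -3
KXKXX = re.compile(r"K.[KH]..$")
# K-K-x-x: Lys at positions -4 and -3
KKXX = re.compile(r"KK..$")


def classify_c_terminus(seq):
    """Return the dibasic motif class at the C-terminus, or None if absent.

    The K-x-K/H-x-x pattern is tested first, so a tail that satisfies
    both patterns is reported under that class.
    """
    seq = seq.strip().upper()
    if KXKXX.search(seq):
        return "K-x-" + seq[-3] + "-x-x"
    if KKXX.search(seq):
        return "K-K-x-x"
    return None


def terminal_residue_counts(sequences):
    """Count the C-terminal residue among sequences carrying a dibasic motif."""
    counts = Counter()
    for seq in sequences:
        if classify_c_terminus(seq) is not None:
            counts[seq.strip().upper()[-1]] += 1
    return counts


# GVKLHYT is the spike hepta-peptide; the other tails are made up for illustration
tails = ["GVKLHYT", "AAKAKEE", "LLKKTN", "AAAAAAA"]
for tail in tails:
    print(tail, classify_c_terminus(tail))
print(dict(terminal_residue_counts(tails)))
```

Anchoring each pattern with `$` restricts the match to the last four or five residues, so internal dibasic stretches elsewhere in a sequence are ignored.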
K4JZD2; bat Rhinolophus alpha-coronavirus/HuB2013, A0A0U1WJW2; human coronavirus 229E, P15423; rodent coronavirus, A0A2H4MWY1; mink coronavirus strain WD1127, D9J1Z4; bat coronavirus HKU8, B1PHK2; bat alpha-coronavirus SAX2011, A0A0U1WHD7; bat alpha-coronavirus SC2013, A0A0U1UZD0; porcine epidemic diarrhea virus strain CV777, Q91AV1 CoV, P59594; avian infectious bronchitis virus strain Beaudette, P11223; bulbul coronavirus HKU11-934, B6VDW0; common moorhen coronavirus HKU21, H9BR35; porcine coronavirus HKU15, X2G836; munia coronavirus HKU13-3514, B6VDY7; night heron coronavirus HKU19, H9BR17; bat coronavirus RaTG13, A0A6B9WHD3; Middle East respiratory syndrome coronavirus isolate Camel/Qatar_2_2014, KJ650098.1; Civet SARS-CoV SZ3

All BLI experiments were performed in independent triplicates. A statistical correlation coefficient (CC1/2) between half datasets was used for crystallographic resolution estimation. 5% of the crystallographic reflections were omitted from refinement to calculate R free and to avoid over-fitting.

Data availability. Coordinates for the crystal structures have been deposited in the Protein Data Bank with IDs: 7S22 (wild-type), 7S16 (Arg 57 → Ala), and 7S23 (Tyr 139 → Ala). The plasmids for constructs pSUMO-β′-COPI-WD40, pcDNA3.1(+)-αCOPI-WD40, pcDNA3.1(+)-αCOPI-WD40 mutant Arg 57 → Ala, pcDNA3.1(+)-αCOPI-WD40 mutant Asp 115 → Ala, and pcDNA3.1(+)-αCOPI-WD40 mutant Tyr 139 → Ala will be

SARS-beginning to understand a new virus
The predictors of 3- and 30-day mortality in 660 MERS-CoV patients
Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia
Safety and efficacy of the BNT162b2 mRNA COVID-19 vaccine
Silver Spring, MD: U.S.
Food and Drug Administration, Vaccines and Related Biological Products Advisory Committee
Architecture of the SARS coronavirus prefusion spike
Synthesis and characterization of a native, oligomeric form of recombinant severe acute respiratory syndrome coronavirus spike glycoprotein
Coronavirus M proteins accumulate in the Golgi complex beyond the site of virion budding
Intracellular targeting signals contribute to localization of coronavirus spike proteins near the virus assembly site
The cytoplasmic tail of the severe acute respiratory syndrome coronavirus spike protein contains a novel endoplasmic reticulum retrieval signal that binds COPI and promotes interaction with membrane protein
Coatomer interaction with di-lysine endoplasmic reticulum retention motifs
Signal-mediated retrieval of a membrane protein from the Golgi to the ER in yeast
Retrieval of transmembrane proteins to the endoplasmic reticulum
The KKXX signal mediates retrieval of membrane proteins from the Golgi to the ER in yeast
A weak COPI binding motif in the cytoplasmic tail of SARS-CoV-2 spike glycoprotein is necessary for its cleavage, glycosylation, and localization
Coatomer is essential for retrieval of dilysine-tagged proteins to the endoplasmic reticulum
A structure of the COPI coat and the role of coat proteins in membrane vesicle assembly
β-COP, a 110 kd protein associated with non-clathrin-coated vesicles and the Golgi complex, shows homology to β-adaptin
En bloc incorporation of coatomer subunits during the assembly of COP-coated vesicles
A 102 kDa subunit of a Golgi-associated particle has homology to beta subunits of trimeric G proteins
Purification of a novel class of coated vesicles mediating biosynthetic protein transport through the Golgi stack
A coat subunit of Golgi-derived non-clathrin-coated vesicles with homology to the clathrin-coated vesicle coat protein β-adaptin
'Coatomer': a cytosolic protein complex containing subunits of non-clathrin-coated Golgi transport vesicles
beta'-COP, a novel subunit of coatomer
The α- and β′-COP WD40 domains mediate cargo-selective interactions with distinct di-lysine motifs
Bimodal interaction of coatomer with the p24 family of putative cargo receptors
Molecular basis for recognition of dilysine trafficking motifs by COPI
Rules for the recognition of dilysine retrieval motifs by coatomer
α-COP can discriminate between distinct, functional di-lysine signals in vitro and regulates access into retrograde transport
Sequences in the cytoplasmic tail of SARS-CoV-2 spike facilitate expression at the cell surface and syncytia formation
Discovery of seven novel mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavirus
Ecology, evolution and classification of bat coronaviruses in the aftermath of SARS
Bats are natural reservoirs of SARS-like coronaviruses
Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor
Isolation and characterization of viruses related to the SARS coronavirus from animals in southern China
Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats
Prevalence and genetic diversity of coronaviruses in bats from China
Isolation of SARS-CoV-2-related coronavirus from Malayan pangolins
SARS-CoV and emergent coronaviruses: viral determinants of interspecies transmission
Evidence for zoonotic origins of Middle East respiratory syndrome coronavirus
Origin and cross-species transmission of bat coronaviruses in China
MERS-CoV, and other emerging infections in east Africa
Probable pangolin origin of SARS-CoV-2 associated with the COVID-19 outbreak
ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules
Isolation of MERS coronavirus from a dromedary camel
Structure and orientation of expressed bovine coronavirus
hemagglutinin-esterase protein
Protein interactions during coronavirus assembly
COPA mutations impair ER-Golgi transport and cause hereditary autoimmune-mediated lung disease and arthritis
COPA syndrome: a novel autosomal dominant immune dysregulatory disease
HKL-3000: The integration of data reduction and structure solution-from diffraction images to an initial model in minutes
How good are my data and what is the resolution?
Phaser crystallographic software
The detection of sub-units within the crystallographic asymmetric unit
Towards automated crystallographic structure refinement with phenix.refine
Coot: model-building tools for molecular graphics
Collaboration gets the most out of software
MolProbity: more and better reference data for improved all-atom structure validation
Protein structure modeling with MODELLER
Relaxation of backbone bond geometry improves protein energy landscape modeling
A simple physical model for binding energy hot spots in protein-protein complexes
A subcellular map of the human proteome
New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0
Computational design of the affinity and specificity of a therapeutic T cell receptor

Nashville USA) for advice on protein purification, Dr. Edvin Pozharski (University of Maryland Baltimore USA) and Dr. Travis Gallagher (National Institute of Standards and Technology, Rockville USA) for advice on crystallographic data collection, Dr. Andrey Galkin (University of Maryland Baltimore USA) for advice on BLI assays, and Dr

This article was supported by funds through the Maryland Department of Health's Cigarette Restitution Fund Program, University of Maryland Marlene and Stewart Greenebaum Comprehensive Cancer Center (National Cancer Institute - Cancer Center Support Grant (CCSG) - P30CA134274), The Holden Comprehensive Cancer Center at The University of Iowa and its National Cancer Institute Award P30CA086862.
This research used resources of the Advanced Photon Source, a U.S. Department of Energy (DOE) Office of Science User Facility, operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE-AC02-06CH11357. GM/CA@APS has been funded by the National Cancer Institute (ACB-12002) and the National Institute of General Medical Sciences (AGM-12006, P30GM138396).

The authors declare no competing interests.

Supplementary information The online version contains supplementary material available at https://doi.org/10.1038/s42003-022-03063-y.

Correspondence and requests for materials should be addressed to S. Saif Hasan.

Peer review information Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: Gene Chong.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.