key: cord-1012568-x51dj8sk authors: Salvi, Nicola; Bessa, Luiza Mamigonian; Guseva, Serafima; Camacho-Zarco, Aldo; Maurin, Damien; Perez, Laura Marino; Malki, Anas; Hengesbach, Martin; Korn, Sophie Marianne; Schlundt, Andreas; Schwalbe, Harald; Blackledge, Martin title: (1)H, (13)C and (15)N backbone chemical shift assignments of SARS-CoV-2 nsp3a date: 2021-01-21 journal: Biomol NMR Assign DOI: 10.1007/s12104-020-10001-8 sha: 3fc8085f94012cdd2efda3ed9a0f35869ee6917c doc_id: 1012568 cord_uid: x51dj8sk The non-structural protein nsp3 from SARS-CoV-2 plays an essential role in the viral replication transcription complex. Nsp3a constitutes the N-terminal domain of nsp3, comprising a ubiquitin-like folded domain and a disordered acidic chain. This region of nsp3a has been linked to interactions with the viral nucleoprotein and the structure of double membrane vesicles. Here, we report the backbone resonance assignment of both domains of nsp3a. The study is carried out in the context of the international covid19-nmr consortium, which aims to characterize SARS-CoV-2 proteins and RNAs, providing for example NMR chemical shift assignments of the different viral components. Our assignment will provide the basis for the identification of inhibitors and further functional and interaction studies of this essential protein. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) represents a significant threat to human health and to the stability of contemporary societies. The development of vaccines or potential inhibitors is essential if the disease is to be eradicated. One important step in the development of effective therapeutic strategies is the characterization of potentially druggable targets that constitute the functioning virus. The international covid19-nmr research consortium is dedicated to the measurement and rapid dissemination of NMR-based parameters that will further deepen our understanding of the viral life cycle and in particular facilitate the identification of potential drug binding sites. With over 1900 amino acids the non-structural protein nsp3 is the largest of the coronavirus proteins (Snijder et al. 2003) . It comprises a large number of structured domains and disordered linker regions in SARS-CoV (Neuman 2016) , and is an essential component of the replication-transcription complex (Lei et al. 2018 ). As such this protein represents a promising target for inhibitory strategies. Nsp3a constitutes the N-terminal section of nsp3, comprising an N-terminal ubiquitin-like domain (Ubl1) and a hypervariable acidic intrinsically disordered region (IDR) (Neuman et al. 2014) . Nsp3a has been shown to interact with the nucleoprotein (N) in the related beta-coronavirus mouse hepatitis virus (MHV) (Keane and Giedroc 2013) . The acidic IDR domain has a high Glu/Asp content and is predicted to be 23 amino acids longer in SARS-CoV-2 compared to SARS-CoV. In SARS-CoV, pulldown experiments show nsp3a to associate with viral proteins nsp8, nsp9, components of the replication/transcription complex, as well as different domains of nsp3 (Imbert et al. 2008) . The ubiquitin-like domain colocalizes with the nucleoprotein in viral liquid droplets in SARS-CoV-2 (Carlson et al. 2020 ) and has been shown to bind single stranded RNA in SARS-CoV (Serrano et al. 2007). Recently nsp3 was shown to play a structural role in the formation of molecular pores in double membrane vesicles associated with the endoplasmic reticulum (Stertz et al. 2007; Wolff et al. 2020) . Here, we present the near-complete backbone resonance assignment of the two domains of nsp3a, providing the basic data required for screening of interaction partners and detailed mapping of the interaction using the homologous structures of nsp3a from SARS-CoV (Serrano et al. 2007) and MHV (Keane and Giedroc 2013) . The primary sequence of nsp3a (amino acids 1-206 of nsp3) from SARS-CoV-2 was extracted from NCBI genome entry NC_045512.2 [GenBank entry MN908947.3]. A commercially synthesized gene (GenScript Biotech) was codonoptimized for expression in Escherichia coli and subcloned in a pET21b( +) vector. Hexa-histidine and TEV-cleavage tags were included at the N-terminus to facilitate protein purification. After protease cleavage, the proteins contain N-terminal GAM-extensions. The nsp3a plasmid was transformed into BL21 (DE3) E. coli cells and the protein expressed heterologously with an N-terminal His 6 tag. Cells were grown at 37 °C until OD 600 of 0.6-0.8, at which point protein expression was induced with IPTG and incubated for 5 h at 37 °C. Bacteria were harvested by centrifugation and the cell pellet resuspended in buffer A (50 mM Tris-HCl pH 8.0 and 250 mM NaCl) with protease inhibitors (complete, Roche). Cell lysis was performed by sonication, followed by centrifugation (45 min, 18,000 rpm at 5 °C). The protein was purified by affinity chromatography on Ni-NTA agarose (ThermoFisher), washed with buffer A supplemented with 20 mM imidazole and eluted with buffer A supplemented with 500 mM imidazole. TEV cleavage was achieved by incubation with TEV protease at 4 °C coupled with dialysis into buffer A supplemented with 2 mM DTT, and the protein concentrated and subjected to size exclusion chromatography on a HiLoad 16/600 Superdex 75 column (GE Healthcare) in NMR buffer (50 mM Na-phosphate, pH 6.5, 150 mM NaCl). For 15 N and 13 C isotope labelling, cells were grown in M9-minimal medium supplemented with 15 N-NH 4 -Cl and 13 C 6 -d-glucose (1 g/L each). A suite of BEST-and BEST-TROSY (BT) double and triple resonance assignment experiments, including BEST-HNCA, BEST HN(CO)CA, BT-HNCO, BT-HN(CO)CACB and intraresidue BT-iHNCACB (Lescop et al. 2007; Solyom et al. 2013) , were recorded on 15 N, 13 C-labeled samples (632 μM) at 298 K using a Bruker Avance III spectrometer equipped with a cryoprobe at a 1 H frequency of 850 MHz. The experimental parameters for these acquisitions are summarized in Table 1 . All spectra were processed using NMRFx Analyst (Norris et al. 2016 ) and analysed using CCPNMR Analysis Assign (Skinner et al. 2016 ) and NMR-FAM-SPARKY (Lee et al. 2015) . Manual assignment of residues was assisted using I-PINE (Lee et al. 2019 ). The 15 N, 1 H-HSQC of Nsp3a(1-206) is typical of a twodomain protein comprising both folded and unfolded domains (Fig. 1) . The folded domain is well-resolved while the unfolded domain has a more restricted chemical shift dispersion in the 1 H dimension. Nevertheless, a high percentage of resonances could be assigned in both domains (92% 1 H N , 92% 15 N H , 93% 13 Cα, 38% 13 Cβ and 83% 13 C'). 13 Cβ assignments are low because of difficulty in detecting transfer to these nuclei in the 3D experiments. This assignment has been deposited in the biological magnetic resonance databank (BMRB ID: 50446). Secondary structural elements in Nsp3a from SARS-CoV-2, predicted on the basis of 1 H, 15 N and 13 C secondary chemical shifts appear in the same regions as in the homologous protein from SARS-CoV (Serrano et al. 2007) , suggesting that the three-dimensional fold is very similar (Fig. 2) . Nevertheless, we note the presence of a short helical propensity around residue 110 and apparent extended or beta-sheet sampling between residues 185 and 206. Further investigation will be required to ascertain the nature of this apparent non-random coil behaviour. Phosphorylation modulates liquid-liquid phase separation of the SARS-CoV-2 N protein The SARS-coronavirus PLnc domain of nsp3 as a replication/transcription scaffolding protein Solution structure of mouse hepatitis virus (MHV) nsp3a and determinants of the interaction with MHV nucleocapsid (N) protein NMRFAM-SPARKY: enhanced software for biomolecular NMR spectroscopy I-PINE web server: an integrative probabilistic NMR assignment system for proteins Nsp3 of coronaviruses: structures and functions of a large multi-domain protein A set of BEST triple-resonance experiments for time-optimized protein resonance assignment Bioinformatics and functional analyses of coronavirus nonstructural proteins involved in the formation of replicative organelles Atlas of coronavirus replicase structure NMRFx processor: a cross-platform NMR data processing program Nuclear magnetic resonance structure of the N-terminal domain of nonstructural protein 3 from the severe acute respiratory syndrome coronavirus Protein backbone and sidechain torsion angles predicted from NMR chemical shifts using artificial neural networks CcpNmr analysis assign: a flexible platform for integrated NMR analysis Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage BEST-TROSY experiments for time-efficient sequential resonance assignment of large disordered proteins The intracellular sites of early replication and budding of SARS-coronavirus Double-membrane vesicles as platforms for viral replication