key: cord-290948-cuu78cvl authors: Imbert, Isabelle; Snijder, Eric J.; Dimitrova, Maria; Guillemot, Jean-Claude; Lécine, Patrick; Canard, Bruno title: The SARS-Coronavirus PLnc domain of nsp3 as a replication/transcription scaffolding protein date: 2008-02-05 journal: Virus Res DOI: 10.1016/j.virusres.2007.11.017 sha: doc_id: 290948 cord_uid: cuu78cvl Many genetic and mechanistic features distinguish the coronavirus replication machinery from that encoded by most other RNA viruses. The coronavirus replication/transcription complex is an assembly of viral and, most probably, cellular proteins that mediate the synthesis of both the unusually large (∼30 kb) RNA genome and an extensive set of subgenomic mRNAs. The viral components of the complex are encoded by the giant replicase gene, which is expressed in the form of two polyproteins (pp1a and pp1ab) that are processed into 16 cleavage products (nonstructural proteins 1–16). Using the combination of yeast two-hybrid screening and GST pull-down assays, we have now analyzed all potential interactions between SARS-Coronavirus nonstructural proteins, which may contribute to the structure and/or function of the viral replication/transcription complex. We demonstrate the existence of a complex network of interactions involving all 16 nonstructural proteins. Our results both confirmed previously described associations and identified novel heterodimerizations. The interaction map thus provides a sum of the interactions that may occur at some point during coronavirus RNA synthesis and provides a framework for future research. The family Coronaviridae includes the genera Coronavirus and Torovirus that, together with the more distantly related Arteriviridae and Roniviridae families, form the order Nidovirales (for reviews see Gorbalenya et al., 2006) . In 2003, a novel coronavirus was identified as the etiological agent of the Severe Acute Respiratory Syndrome (SARS; Drosten et al., 2003; Ksiazek et al., 2003; Peiris et al., 2003) and is now known as the SARS-Coronavirus (SARS-CoV). Coronaviruses are enveloped viruses with large positivesense single-stranded RNA genomes ranging from 27 to 32 kb, the largest among the known RNA viruses. Like other nidoviruses, coronaviruses genomes are polycistronic with about two thirds being occupied by two large open reading frames (ORFs 1a and 1b) that specify the viral nonstructural proteins (nsps) or "replicase". The coronavirus genome is expressed using a variety of regulatory mechanisms, including the synthesis and proteolytic processing of replicase polyproteins and the generation of an extensive nested set of subgenomic (sg) mRNAs to express structural and accessory proteins from the 3 -proximal part of the genome (for recent reviews, see Pasternak et al., 2006; Sawicki et al., 2007) . Genome expression starts with the translation of replicase ORFs 1a and 1b, with expression of the latter involving a −1 ribosomal frameshift (Brierley et al., 1989; Brierley and Dos Ramos, 2006) occurring with an estimated efficiency of 20-40%. In the case of SARS-CoV, pp1a and pp1ab are 4382 and 7073 amino acids long, respectively (Marra et al., 2003; Rota et al., 2003; Snijder et al., 2003; Thiel et al., 2003) . These precursors are cleaved co-and post-translationally by two viral proteases, a papain-like protease (PLP2) residing in nsp3 and the 3C-like protease (3CLpro; nsp5), to produce the mature replicase subunits nsp1 to nsp16 (Fig. 1) . Most of these subunits are believed to associate with a poorly characterized membranebound complex (Bost et al., 2001; Gosert et al., 2002; Shi et al., 1999; Snijder et al., 2006; van der Meer et al., 1999 ) that mediates the synthesis of genome RNA (replication) and sg mRNAs (transcription). The sg mRNAs are produced by a unique mechanism involving discontinuous negative-strand RNA synthesis to produce subgenome-length templates for sg mRNA synthesis (Sawicki and Sawicki, 1995; reviewed in Pasternak et al., 2006; Sawicki et al., 2007) . Knowledge related to the functions and interactions of coronavirus nsps has remained limited thus far. nsp1 and nsp2 are rather variable in sequence, and possibly subgroup-specific replicase subunits Ziebuhr et al., 2001) . SARS-CoV nsp1 (20 kDa) was implicated in suppression of cellular gene expression by promoting host cell mRNA degradation (Kamitani et al., 2006) and in chemokine dysregulation (Law et al., 2007) . SARS-CoV nsp2 (70 kDa) is dispensable for replication in cell culture, but its deletion attenuates viral replication (Graham et al., 2005) . SARS-CoV nsp3 is a large multidomain protein of 1922 amino acids Thiel et al., 2003) that is thought to contain at least seven domains: (1) an N-terminal Glu-rich acidic domain (AD); (2) an X domain (XD) with poly(ADP-ribose) binding properties Saikatendu et al., 2005) ; (3) the SUD domain (for SARS-CoV Unique Domain, an insertion not found in any other coronavirus thus far) with a specific affinity for oligo(G)-strings (Tan et al., in press); (4) a papain-like protease (PLP2), recently shown to exhibit deubiquitinating activity (Barretto et al., 2005; Harcourt et al., 2004; Lindner et al., 2005; Ratia et al., 2006) ; (5) an unknown domain possibly extending the papain-like protease domain, termed PLnc for Papain-Like noncanonical (see below); (6) a transmembrane domain (Kanjanahaluethai et al., 2007) corresponding to the N-terminal of the Y domain; and (7) the remainder of the Y domain, the abbreviation "Y domain" will be used for this part in this study. The multi-spanning transmembrane domains in nsp3, nsp4 and nsp6 are presumed to serve as a scaffold for the assembly of the membrane-associated replication/transcription complex (RTC). nsp5 (34 kDa) is a 3C-like cysteine protease (3CLpro), also known as "main protease" (M pro ) (Anand et al., 2003; Gorbalenya et al., 1989) , cleaving the replicase polyproteins at 11 conserved sites containing a canonical Leu-Gln (Ser, Ala, Gly) signature. nsp8 protein (22 kDa) was recently postulated to be a processivity factor associated with the viral RNA-dependent RNA polymerase (RdRp) in nsp12. Indeed, the crystal structure of a hexadecameric nsp7-nsp8 complex provided a first glimpse of the sophisticated architecture of the coronavirus replication and transcription machinery (Zhai et al., 2005) . The properties of the central channel of this complex suggest that it may encircle double-stranded RNA. This hypothesis was strengthened by the finding that nsp8 displays primase activity in an in vitro assay (Imbert et al., 2006) and might thus catalyse the synthesis of RNA primers for the primer-dependent RdRp. nsp9 (13 kDa) is a ssRNA-binding protein (Egloff et al., 2004; Sutton et al., 2004) postulated to stabilise and protect nascent and template RNA strands during replication and/or transcription. Much like nsp8 and nsp9, nsp10 (18 kDa) is also a nucleic acid-binding protein with two zinc fingers (Joseph et al., 2006; Matthes et al., 2006) . The phenotype of a temperature-sensitive nsp10 mutant (Q65E) of the coronavirus mouse hepatitis virus (MHV) implicated nsp10 in minus-strand RNA synthesis (Sawicki et al., 2005; Siddell et al., 2001) . The C-terminal part of nsp12 (102 kDa) contains the poorly characterized RdRp domain that is significantly divergent from both cellular and viral polymerases. Only one study reported polymerase activity of a purified coronavirus RdRp (Cheng et al., 2005) . nsp13 (67 kDa) comprises a C-terminal helicase domain with a 5 -3 directionality and an N-terminal zinc-binding domain (Ivanov et al., 2004b; Seybert et al., 2000 Seybert et al., , 2005 . Finally, nsp14 (58 kDa), nsp15 (38 kDa) and nsp16 (33 kDa) contain exoribonuclease (Minskaia et al., 2006) , endoribonuclease (Bhardwaj et al., 2006; Ivanov et al., 2004a) and predicted 2 -O-ribose methyl transferase activities (von Grotthuss et al., 2003) , respectively, which are all essential for efficient coronavirus RNA synthesis (Almazan et al., 2006) . In addition, as illustrated above, coronavirus replicases have evolved to include a number of nonessential functions that may provide a selective advantage in specific host cells only. Most of the 16 coronavirus nsps are known or thought to be part of the RTC, which is formed following genome translation and replicase polyprotein processing. As for many plus-strand RNA viruses (Mackenzie, 2005; Salonen et al., 2004; Snijder et al., 2006) , it has been demonstrated that nidovirus RNA synthesis is associated with cytoplasmic membranes, in this case double-membrane structures that likely originate from the endoplasmic reticulum. Obvious benefits of complex formation are the possibility to couple functions residing in different subunits and the sequestration of viral components. The complex and sophisticated coronavirus replication/transcription strategy suggests that the composition of the complex may change during the life cycle of the virus or that -alternatively -partially different complexes dedicated to, e.g. either replication or transcription may co-exist, probably relying on the same central RNA synthesizing catalytic subunit, the nsp12-RdRp. The network of interactions within the SARS-CoV RTC has only begun to emerge. Dual-labelling studies in SARS-CoVinfected cells have confirmed the colocalization of a variety of replicase subunits (Harcourt et al., 2004; Ivanov et al., 2004a; Prentice et al., 2004; Snijder et al., 2006) and newly synthesized viral RNA (Harcourt et al., 2004) . Chemical cross-linking experiments demonstrated that SARS-CoV nsp10 could be cross-linked to nsp9 (Su et al., 2006) , which in turn interacts with nsp8 (Sutton et al., 2004) . Very recently, a SARS-CoV genome-wide analysis for intraviral protein-protein interactions was performed (von Brunn et al., 2007) . Interactions were tested by yeast two-hybrid system and subsequently confirmed by coimmunoprecipitation experiments in mammalian cells. nsp2 and nsp8 showed a rather large number of interactions. In this study, the nsp3 protein was subdivided into 2 fragments by cutting to the SUD domain. Coronavirus nsp12-RdRp is believed to play a central role in both replication and transcription. Here, we investigated interactions between all possible combinations of SARS-CoV nsps, including the nsp12-RdRp, which are potentially involved in the assembly and/or function of the RTC. The experiments were performed with in vitro translated nsps, as well as ex vivo, in Saccharomyces cerevisiae expressing these nsps in the context of a two-hybrid system. We observed a complex network of interactions involving all 16 nsps, confirming previously documented associations and identifying numerous novel heterodimerization events, and thus providing a framework for future research. The Frankfurt-1 isolate of SARS-CoV (GenBank accession number AY291315; Thiel et al., 2003) was amplified in Vero-E6 cells and used in this study for the production of the expression vectors. All protocols used for the yeast two-hybrid system were described previously (Walhout and Vidal, 2001 ). MaV103 and MaV203 yeast strains (leu2-3112, tryp-901, his3 200, ade2-1, gal4A, gal80A, SPAL10::URA3, GAL1::lacZ, GAL1::HIS3-@LYS2, can1R, cyh2R), which are MATa and MATalpha, respectively, were used for the yeast two-hybrid matrix. RT-PCR products encoding different SARS-CoV proteins/domains, i.e. nsp1, nsp2, six nsp3 subdomains (AD, XD, SUD, PLP2, PLnc, and Y), nsp5 to nsp10, and nsp12 to nsp16 were first cloned into pDONR201 using the Gateway system (Invitrogen). All constructs were verified by sequence analysis. The boundaries of the domains used are detailed in Table 1 . Using the Gateway system, each cDNA was subcloned into pAD and pDB vectors, which were used to transform MaV203 and MaV103 cells, respectively. Diploid cells were generated by mating MaV103 expressing genes fused to the DNA-binding domain (DB-ORF) and MaV203 expressing genes fused to the activation domain (AD-ORF). Selection was performed using selective complete (SC) medium lacking tryptophane and leucine. Patches representing potential interaction pairs were then examined for two-hybrid phenotypes by replica-plating onto selective plates, i.e. YPD (yeast extract, peptone, d-glucose) medium plate with a nitrocellulose filter on to perform the ␤-galactosidase filter lift assay, as well as on SC-Leu-Trp-His + 25 mM 3-AT and SC-Leu-Trp-Ura plates to assess the three two-hybrid phenotypes (␤-galactosidase, His and Ura) (Walhout and Vidal, 2001) . The matrix experiment was carried out three times, using clones derived from three different plasmid transformation and mating experiments. For GST fusion protein expression vectors, BamHI/EcoRI restriction fragments encoding the SARS-CoV PLnc domain, nsp8 and nsp9, and a BamHI restriction fragment encoding nsp12 were inserted in-frame into pGEX-4T-2 (GE Healthcare), to generate GST-PLnc, GST-nsp8, GST-nsp9 and GST-nsp12, respectively. GST-p55, encoding the HIV-1 Gag precursor, was constructed as described in Douaisi et al. (2004) . To create SARS-CoV protein expression vectors, EcoRI/KpnI restriction fragments encoding nsp1, XD, SUD, PLP2, PLnc, Y, nsp5, nsp6, nsp7, nsp10, nsp16, EcoRI/XbaI restriction fragments encoding nsp2, nsp8, nsp9, AD, a XbaI/NotI restriction fragment encoding nsp12, and KpnI/NotI restriction fragments encoding nsp13, nsp14, nsp15 were cloned into pTNT (Promega). All constructs were sequenced, and the proteins expressed from these plasmids did not carry any foreign tags, except an additional N-terminal Met residue. The GST-PLnc (69 kDa), GST-nsp8 (48 kDa), GST-nsp9 (38.8 kDa), GST-nsp12 (132 kDa) and GST-p55 (80 kDa) fusion proteins were expressed in E. coli strain C41(DE3) (Avidis SA, France), which contained the pLysS plasmid (Novagen). Bacterial cultures were grown at 37 • C in LB medium supplemented with ampicillin (100 g ml −1 ) and chloramphenicol (17 g ml −1 ) until the OD 600 reached 0.5. Protein expression was induced by adding 500 M isopropyl 1-thio-␤-d-galactopyranoside (IPTG) and cells were incubated for 4 h at 37 • C. In the case of GST-PLnc and GST-nsp12 expression, however, induction was performed with 100 M of IPTG and cells were incubated for 17 h at 17 • C. Cell pellets were resuspended in PBS supplemented with 0.25 mg ml −1 , of lysozyme, 0.1 mg ml −1 DNase, 10 g ml −1 RNaseA, 1 mM phenylmethylsulphonylfluoride (PMSF), protease inhibitor cocktail (Sigma), and 1% Triton X-100. The suspensions were sonicated and centrifuged at 25,000 × g for 30 min, after which 500 l of a 50% slurry of glutathione-sepharose 4B beads (GE Healthcare) was added to the clarified supernatants. The beads were allowed to bind proteins for 1 h, washed three times with PBS, and finally resuspended in 500 l of PBS. RNAs specifying SARS-CoV nsp1, nsp2, nsp3 domains (AD, XD, SUD, PLP2, PLnc, Y), nsp5 to nsp10, and nsp12 to nsp16 were in vitro transcribed from the T7 promoter present in the pTNT expression vector. Transcripts were in vitro translated in the presence of [ 35 S]methionine using the TNT T7 coupled reticulocyte lysate system (Promega) according to the manufacturer's instructions. In vitro binding analyses were performed as described previously (Dimitrova et al., 2003) . Each interaction was independently tested three times. Protein samples were resolved by SDS-PAGE, transblotted onto Hybond-P membranes (GE Healthcare) and probed with an anti-GST antibody that was kindly provided by Dr. Teillaud (INSERM Unité 255, Paris, France). The RTCs of mammalian positive-stranded RNA viruses, including coronaviruses, are commonly associated with intracellular membranes (Prentice et al., 2004; Snijder et al., 2006) and consist of multiple virus-encoded protein subunits as well as host proteins. To further characterize the molecular interactions between SARS-CoV RTC subunits, we performed a yeast two-hybrid matrix screen with all nsps encoded by this virus. An interaction was considered valid when it reproducibly produced a positive result for one or more of the three different yeast phenotypes tested (Fig. 2a) . Subsequently, as a general strategy, GST pull-down assays were used to confirm interactions (see below). Because of the large size of the membrane-associated nsp3 (213 kDa), six nsp3 subdomains were defined (Table 1) . The general bioinformatics approach that led to the design of nsp3 domains was described previously (Ferron et al., 2005) . Briefly, we performed a BLAST search against the VaZyMolO database with the full sequence corresponding to nsp3. The protein is predicted to be organized in 6 modules that are the F197 (AD) domain structurally confirmed by crystallography (Serrano et al., 2007) , the F200 (XD) domain structurally confirmed by crystal- lography , the F209 (SUD), the F203 (PLP2) domain also structurally confirmed by crystallography (Ratia et al., 2006) , the F250 (PLnc) (Tan et al., in press) , and a hydrophobic domain followed by the F202 (Y domain). The hydrophobic cluster analysis of module F250 (PLnc domain) can be interpreted as a single and globular domain flanked by the PLP2 in N-terminal and a hydrophobic domain in C-terminal. A total of 19 proteins/domains were used in our screening matrix. Out of the 361 protein pairs tested in the yeast two-hybrid analysis, 15 combinations produced a positive result, accounting for 4% of the possible protein interactions (Fig. 2) . In comparison, about 6% of nonstructural protein-protein interactions were positive in the von Brunn et al. (2007) Y2H study. Surprisingly, no interaction is common to both studies. This discrepancy could be explained by the fact that two-hybrid false positives may arise in two-hybrid arrays and by the difference between the Y2H systems used (Uetz, 2002) . With our Y2H system, to eliminate potential false-positive interaction, three different phenotypes were assessed (␤-galactosidase, His and Ura) and the transcription from each reporter gene (LacZ, HIS3 and URA3) is driven by independent promoters. At last, we have considered an interaction as positive if at least one phenotype in 2 out of the 3 matrix experiments was positive. Two SARS-CoV proteins (nsp3 and nsp14) were found to be involved in interactions with 4 or more partners. Domains from nsp3 interacted with nsp5, nsp6, nsp12, nsp13, nsp14 and nsp16, and in particular for the PLnc subdomain the unexpectedly large number of 5 interaction partners was identified. Since this domain is comprised of only 386 amino acids, it is tempting to speculate that it might sequentially interact with these partners rather than with all of them simultaneously. Consequently, nsp3 may serve as one of the RTC scaffolding proteins by bringing several nsps into close proximity of each other. The presence of a transmembrane domain within nsp3, and the observed interaction with the nsp6 transmembrane protein, reinforces the hypothesis of a membrane-associated scaffolding function for nsp3 in the infected cell. The nsp3 central role was not so clear in the von Brunn et al. (2007) study because nsp3 protein was subdivided only into two fragments (nt 2719-4431 and nt 4885-8484), splitting the SUD domain in two. The nsp14 protein is an exoribonuclease playing an elusive, yet essential, role in the SARS-CoV life cycle (Minskaia et al., 2006) . In addition to its interaction with nsp3, nsp14 interacted strongly with both nsp10, a protein with two zinc-fingers potentially involved in nucleic acid recognition, and nsp16, the 2 -O-ribose methyltransferase. Interestingly, the interactions between nsp10, nsp14 and nsp16 were the only reciprocal ones, meaning that they were also detected when the DB and AD fusion partners were swapped. It is tempting to speculate that these three proteins form a complex that is associated with nsp3 and that the other interactions that were detected may reflect different and/or alternative complexes existing at different time points. The surface of interaction of nsp10 with nsp16 was mapped by reverse yeast two-hybrid system and in mammalian cells, using the bioluminescence resonance energy transfer technology (BRET). Point mutations that abrogate the nsp10-nsp16 interaction also inhibit the interaction between nsp10 and nsp14, albeit with subtle differences, suggesting that the interacting surface is overlapping (PL, unpublished data). Two crystal structures were available for nsp10. One of them reveals a unique spherical dodecamer architecture (Su et al., 2006) and the other shows a monomer (Joseph et al., 2006) . nsp10 protein is able to bind single-and double-stranded RNA and DNA without apparent sequence specificity (Joseph et al., 2006) and was suggested to be involved in SARS-CoV replication based on structural similarities with nuclear proteins (Su et al., 2006) . The overall positive charge of the inner and outer surfaces of the dodecamer would be compatible with an RNA-binding function. Interestingly, nsp10 also interacts with nsp9, a single-stranded RNA binding protein (Egloff et al., 2004; Su et al., 2006; Sutton et al., 2004) that in turn interacts with nsp8, the putative RNA primase (Imbert et al., 2006) . In human coronavirus 229E, mutagenesis of the active site of the nsp14 exonuclease greatly reduced the accumulation of viral RNA, suggesting an essential role for this enzyme in replication and transcription (Minskaia et al., 2006) . This critical function of nsp14 was also supported by results obtained with a SARS-CoV replicon, in a study that also addressed the impact of mutations within or deletion of nsp15-and nsp16-coding sequences, clearly demonstrating that these proteins are required for efficient replication (Almazan et al., 2006) . Finally, by interacting with nsp3 through the PLnc domain, and with nsp7, nsp9 and nsp10, the nsp14 exoribonuclease may also be a key RTC scaffolding component, irrespective of its nucleic acid binding activity. A number of nsps (nsp1, nsp13, and nsp15) interacted with a single partner and these interactions were only detected when they were used as prey (Fig. 2) . SARS-CoV nsp1, a small protein , unique to SARS-CoV, believed to be involved in host cell RNA degradation, may not directly participate in viral RNA synthesis. Consequently, it may be not too surprising that its only interaction partner was nsp9, a singlestranded RNA binding protein that, in turn, was not densely connected to other nsps (Fig. 2) . In the case of the large nsp13 and nsp15 subunits, it is impossible at this stage to assess whether their size (or any other factor) may have impeded their expression or correct folding in yeast, thus preventing detection of protein-protein interaction in the yeast two-hybrid system. Alternatively, the toxicity of RNA-modifying enzymes that is often observed when they are expressed as recombinant proteins may have played a role. nsp13 and nsp15 could also belong to larger hetero-multimers, like the nsp7-nsp8 complex. Notably, and despite the documented interaction of nsp8 with nsp7, nsp15 and nsp8 formed the sole isolated protein pair. We failed to detect nsp10 dimerization in the yeast-two hybrid system, as in the von Brunn et al. (2007) study, even though one of the two crystal structures reveals a dodecameric arrangement based on nsp10-nsp10 interactions (Su et al., 2006) . The second crystal structure shows a monomer (Joseph et al., 2006) . The dodecamer supercomplex may lack biological relevance, because the second zinc-finger lacks the last cysteine residue and the remainder of the Cterminal tail is disordered. Unexpectedly also, in particular since helicases are known to interact with RdRps in RTCs (Dimitrova et al., 2003; Piccininni et al., 2002) , the nsp13 helicase stood alone and interactions between nsp12 and nsp13 were not detected. In the yeast two-hybrid matrix screening, a number of proteins did not interact with any other nsp or subdomain. These were nsp2 and the XD, SUD and Y domains of nsp3. It is interesting to note that nsp2, which appears to be unique to SARS-CoV, is dispensable for replication and transcription (Graham et al., 2005) . Also, the ADP-ribose-1 -monophosphatase (ADRP) activity of the XD subdomain was found to be dispensable for Fig. 4 . GST pull-down analysis to identify interaction partners of nsp12-RdRp. Each panel shows an autoradiogram from a SDS-PAGE analysis of in vitro-translated nonstructural proteins pulled down by binding to either glutathione-sepharose-bound GST-p55 (negative control; lanes A) or GST-nsp12 (lanes B). Equal amounts of [ 35 S]methionine-labelled target were used for lanes (A and B) and the input material was also run next to the pull-down samples, as indicated for each panel. coronavirus replication (Putics et al., 2005) . The SUD domain binds oligo(G) stretches and may be involved in the modulation of host-cell signalling pathways (Tan et al., in press ). Finally, the Y domain of nsp3 has no obvious role in RNA synthesis. Therefore, it is logical to propose that they may rather play a role in virus-host interactions to, e.g. modulate immunity and/or pathogenesis. Surprisingly, despite its presumed central role in RNA synthesis, the nsp12-RdRp was found to interact with only two other nsps, namely nsp7 and nsp3. nsp7-nsp8 could be part of a larger complex including nsp12 on the one hand and nsp14, nsp9 and nsp10 on the other (Fig. 2) . These results also reflected the potential limitations of the yeast-two-hybrid approach when analyzing large proteins like nsp12, prompting us to develop an alternative to confirm and/or extend our interaction map. It is well-known that -for technical reasons -yeast twohybrid screening may not reveal all interactions that normally occur in a cellular environment, owing, e.g. to steric hindrance of the fusion partners used in yeast two-hybrid assays or the inability of certain fusion proteins to be expressed and targeted to the yeast nucleus. Therefore, as an additional approach to identify potential interactions of the PLnc domain, nsp8, nsp9 and nsp12-RdRp with other SARS-CoV nsps, we carried out a set of pull-down assays using GST fusion proteins. This type of assay can be used to analyze the binding of a radiolabeled, in vitro-translated protein to an interaction partner presented in the form of a purified GST fusion protein bound to glutathione-sepharose beads. GST fusion proteins, as well as a HIV-1 gag fusion protein (GST-p55) that was used as negative control, were expressed in E. coli and immobilized on glutathione-sepharose beads (Fig. 3a) . GST-p55, GST-PLnc, GST-nsp8 and GST-nsp9 were much more soluble in bacteria Fig. 6 . Schematic summary of GST pull-down results. Nodes in blue and green represent nsp3 domains and ORF1b-encoded proteins, respectively. Proteins that did not interact with the bait are depicted at the left side of each diagram. than GST-nsp12 and could be purified in considerably larger amounts (Fig. 3a) . Therefore, only 1/50 of a standard GST-p55, GST-nsp8 and GST-nsp9 preparation was used in the assays, whereas the complete yield of a GST-nsp12 purification was used (Fig. 3a) . To obtain potential interaction partners, not containing any foreign tags, all SARS-CoV nsps and domains used before (Table 1) were in vitro translated in the presence of [ 35 S]-labelled methionine, except for nsp4 (an integral transmembrane domain) and nsp11 (which is only 13 amino acids long) (Fig. 3b) . Despite repeated attempts, we could not achieve the expression of nsp6 and nsp9 in the TNT T7-coupled reticulocyte lysate system. The in vitro synthesis of full-length proteins was confirmed by SDS-PAGE analysis (Fig. 3b) . A few truncated products were observed for the SUD, PLP2, nsp15 and nsp16 proteins. The AD protein corresponds to the lower band on the different gels, according to the protein molecular weight marker. We do not have an explanation regarding the other higher band observed for AD. nsp2 protein (70 kDa) appears to migrate more slowly than nsp12 protein (102 kDa) (Fig. 3b, middle panel) . This discrepancy may be related to the amino-terminal part of nsp2 protein which appears to be disordered. It is documented that disordered protein might have an atypical migration pattern, which is variable according to the percentage of acrylamide sieving. GST pull-down assays were carried out with approximately equimolar levels of input material. Pull-down assays with GST-nsp12 revealed that the SARS-CoV RdRp does not interact with nsp1, AD, nsp7, or nsp10 (Figs. 4 and 6) . Interactions between GST-nsp12 and nsp2, XD, SUD, PLnc, PLP2, and nsp8 were detected, in the absence of a signal for the GST-p55 negative control (Fig. 4, lanes B) . Interactions were also observed with the nsp3-Y domain, nsp5, nsp13, nsp14, nsp15 and nsp16. nsp3-Y domain, nsp12, nsp14 and nsp16 also showed a weak interaction with GST-p55 (Fig. 4 , lanes A) and GST alone (not shown), but the signal clearly increased when specific SARS-CoV GST-nsp fusion proteins were used. Therefore, these background signals were attributed to the large amount of the control proteins used in comparison with the amount of GST-nsp12. In contrast to the yeast twohybrid analysis, GST-nsp12 failed to interact with nsp7. The same interactions were detected after RNase treatment of the in vitro translation products used for these binding studies, indicating that the observed interactions were not mediated by RNA (data not shown). From these GST pull-down experiments with nsp12-RdRp, a clear pattern emerged: the RdRp-containing subunit is apparently able to built two interactomes: one with nsp3 domains and the other with products encoded by ORF1b, which had been missed in the yeast two-hybrid experiments. Notably, nsp12 is also able to dimerize, perhaps providing an explanation for the fact that many interactions involving nsp12 were missed using the yeast two hybrid system (see also below). The nsp12-nsp7 interaction could not be reproduced using the GST pull-down approach. The negative results for nsp1, AD, and nsp10 were consistent with the yeast two-hybrid data. The recently discovered primase activity of nsp8 made this protein a very interesting bait for an extended interaction analysis. The interaction pattern of GST-nsp8 was found to be related to that of GST-nsp12 (Fig. 6 ). Although interactions with mul-tiple nsp3 subdomains (XD, SUD, PLnc and Y) were missing, nsp2, PLP2, nsp5 and ORF1b-encoded products, were identified as interactors, yielding the same interaction tree obtained for GST-nsp12. The bands corresponding to SUD and Y domains did not migrate to the expected distance but much lower further (Fig. 5a, left panel) . For the SUD domain, a strong tendency to proteolytic (self-)degradation was shown by others (Tan et al., in press ). However, this discrepancy in size was not observed with the GST-PLnc pull-down assay (see below). Furthermore, these experiments confirmed the nsp8/nsp15 interaction previously identified by yeast two-hybrid screening (Fig. 2) and revealed that nsp8 dimerized, in accordance with structural, biochemical, and enzymatic data (Imbert et al., 2006; Zhai et al., 2005) . The yeast two-hybrid screening (Fig. 2) identified the nsp3 PLnc domain as a central interactor in the SARS-CoV RTC. Thus, PLnc pull-down assays were performed in order to confirm and perhaps extend these findings. Indeed, this alternative assay confirmed interactions of PLnc with nsp5, nsp12, nsp13, nsp14 and nsp16 (Figs. 5b and 6). Moreover, additional interactions were revealed with nsp2, several nsp3 subdomains (AD, SUD, PLP2 and Y), nsp7, nsp8 and nsp15. Taken together, these data supported the previously proposed scaffolding role for nsp3, in which the PLnc subdomain may fulfill a central organizing position. For unknown reasons, nsp9 expression in the TNT T7coupled reticulocyte lysate system remained unsuccessful. Therefore, a GST-nsp9 fusion protein was produced and used to complete our interaction screening. GST-nsp9 interacted with nsp1, nsp2, some nsp3 domains (AD, SUD, PLP2, Y), nsp5, nsp7, nsp8, nsp10, nsp12, nsp13, nsp14 and nsp15 (Figs. 5c and 6). Overall, like the nsp3 PLnc domain, nsp9 seems to have many partners and interacts with almost all nsps except for nsp16. The GST-nsp9 pull-down analysis confirmed the interactions identified above in the yeast two-hybrid approach (nsp9 with nsp1 and nsp14) and also the previously documented nsp9-nsp10 interaction (Su et al., 2006) . Both are RNA-binding proteins that may cooperate at some point in the virus life cycle. The nsp10/GST-nsp8 interaction and the nsp10/GST-PLnc interaction are much weaker than nsp10/GST-nsp9 interaction. In this study, we investigated in vitro and in cellulo the network of interactions that must exist amongst the viral proteins that assemble into the SARS-CoV RTC. These data may enhance our understanding of coronavirus replication and transcription at the molecular level. We have not only confirmed interactions previously deduced from a variety of experimental approaches, but have also identified a significant number of previously unknown (potential) interactions between SARS-CoV replicase cleavage products or domains. In the first place, we have provided evi-dence for physical interactions that may be the basis for the previously described co-localization of various key enzymes of the SARS-CoV RTC in infected cells (nsp3, nsp5, nsp12, nsp13 and nsp15; Snijder et al., 2006) . Co-immunoprecipitation in mammalian 293 cells confirmed interactions of nsp8 with nsp2, nsp7, nsp8, nsp9, nsp12, nsp13 and nsp14 (von Brunn et al., 2007) and highlighted its central role within the RTC machinery. Secondly, the nsp3 PLnc domain, nsp8 and nsp12 were able to form stable homodimers. Thirdly, the nsp8/nsp15 interactions initially detected by yeast two-hybrid screening could be confirmed in GST pull-down assays. Previously described nsp8/nsp9/nsp10 interactions (Su et al., 2006; Sutton et al., 2004) were also confirmed in the latter assay (Fig. 5, panels a and c) . Finally, three viral interaction partners for nsp2 (nsp8, nsp9 and nsp12) were identified in the pull-down analysis, although the yeast two-hybrid analysis had not revealed these interactions. The coronavirus RTC engages in a variety of RNA synthesis processes with replication and sg mRNA synthesis each proceeding through their specific minus-strand templates. How the RTC accomplishes and balances these various processes remains to be elucidated. It is unknown whether complexes with different composition exist that are specifically dedicated to, e.g. replication or transcription. Alternatively, one type of complex may be converted into another by the binding or release of specific protein partners. In this manner, such protein factors may regulate the balance between replication and transcription, if carried out by the same complex, or direct the formation of specialized "replicase"and "transcriptase"complexes. The molecular mechanism and factors governing this switch in coronaviruses are unknown. In the related arterivirus system, nsp1 was identified as a clear example of a protein with a transcription-specific function. Deletion of this replicase subunit did not affect replication, but reduced sg mRNA synthesis to undetectable levels (Tijms et al., 2001) . In alphaviruses, temporal regulation of replicase polyprotein cleavage by the nsP2 protease was found to be a key factor in regulating the switch from minus-to plusstrand RNA synthesis (Kaariainen and Ahola, 2002; Lemm et al., 1994) , illustrating another barely explored feature of the nidovirus/coronavirus system: replicase subunits may act either after complete maturation, or as part of larger processing intermediates, or both. Thus, RTC function may be temporally and spatially coordinated by the activity of the protease domains embedded in replicase polyproteins. Our interactome studies confirmed that the key enzymes encoded by replicase ORF1b likely exist as a complex, which may be organized around the dimerizing nsp12-RdRp. It is unknown whether dimerization is essential for RdRp activity, as, e.g. in the case of hepatitis C virus (Qin et al., 2002) . In general, RdRp error frequencies are estimated to be in the order of 10 −4 (Huang et al., 2000) . For many years, this number, in combination with the unusual size of the coronavirus genome, has triggered speculations about the presence of proofreading or other repair mechanisms. The GST-nsp12 pull-down experiments identified nsp13 to nsp16 as strong interactors (Fig. 4) , and the fact that this pattern overlaps with that of the nsp8primase is intriguing (Fig. 5a) . The interaction of these proteins, either simultaneously or successively, with the helicase (nsp13), 3 → 5 exonuclease (nsp14), endonuclease (nsp15) and (putative) methytransferase (nsp16) supports the hypothesis that these subunits may mediate successive steps of the same pathway and provides a basis for further experiments aimed at, for example, exploring the existence of a proofreading mechanism. It is interesting to note that the nsp13 helicase, with a 5 -3 directionality, appears to lack interactions with nsp14, nsp15 and nsp16 (our study and von Brunn et al., 2007) . In flaviviruses, NS3 helicase unwinds its substrates in a 3 -to-5 direction and physical mapping and structural studies indicate that the helicase most likely precedes the NS5 RdRp undergoing RNA synthesis (Malet et al., 2007) . Upon isolation of an active coronavirus RdRp, it will be interesting to determine if this is also the case in coronaviruses, in view of the different RNA unwinding polarities (Patel and Donmez, 2006) . In conclusion, our work confirms and defines several association groups among SARS-CoV nsps that will provide a basis to investigate cooperative enzymatic functions as well as currently unknown tertiary and quaternary structures. In this manner, these data may also contribute to the development of in vitro assays faithfully reproducing the different steps in coronavirus RNA synthesis. Furthermore, compounds affecting interactions between essential components of the RTC may inhibit enzymatic activities and may thus provide novel perspectives for antiviral drug design. Construction of a severe acute respiratory syndrome coronavirus infectious cDNA clone and a replicon to study coronavirus RNA synthesis Novel beta-barrel fold in the nuclear magnetic resonance structure of the replicase nonstructural protein 1 from the severe acute respiratory syndrome coronavirus Coronavirus main proteinase (3CLpro) structure: basis for design of anti-SARS drugs The papain-like protease of severe acute respiratory syndrome coronavirus has deubiquitinating activity RNA recognition and cleavage by the SARS coronavirus endoribonuclease Mouse hepatitis virus replicase protein complexes are translocated to sites of M protein accumulation in the ERGIC at late times of infection Programmed ribosomal frameshifting in HIV-1 and the SARS-CoV Characterization of an efficient coronavirus ribosomal frameshifting signal: requirement for an RNA pseudoknot Expression, purification, and characterization of SARS coronavirus RNA polymerase Protein-protein interactions between hepatitis C virus nonstructural proteins HIV-1 and MLV Gag proteins are sufficient to recruit APOBEC3G into virus-like particles Identification of a novel coronavirus in patients with severe acute respiratory syndrome The severe acute respiratory syndrome-coronavirus replicative protein nsp9 is a single-stranded RNA-binding subunit unique in the RNA virus world Structural and functional basis for ADP-ribose and poly(ADP-ribose) binding by viral macro domains VaZyMolO: a tool to define and classify modularity in viral proteins Coronavirus genome: prediction of putative functional domains in the nonstructural polyprotein by comparative amino acid sequence analysis Nidovirales: evolving the largest RNA virus genome RNA replication of mouse hepatitis virus takes place at double-membrane vesicles The nsp2 replicase proteins of murine hepatitis virus and severe acute respiratory syndrome coronavirus are dispensable for viral replication Identification of severe acute respiratory syndrome coronavirus replicase products and characterization of papain-like protease activity Misincorporation by wild-type and mutant T7 RNA polymerases: identification of interactions that reduce misincorporation rates by stabilizing the catalytically incompetent open conformation A second, non-canonical RNA-dependent RNA polymerase in SARS coronavirus Major genetic marker of nido viruses encodes a replicative endoribonuclease Multiple enzymatic activities associated with severe acute respiratory syndrome coronavirus helicase Crystal structure of nonstructural protein 10 from the severe acute respiratory syndrome coronavirus reveals a novel fold with two zinc-binding motifs Functions of alphavirus nonstructural proteins in RNA replication Severe acute respiratory syndrome coronavirus nsp1 protein suppresses host gene expression by promoting host mRNA degradation Membrane topology of murine coronavirus replicase nonstructural protein 3 A novel coronavirus associated with severe acute respiratory syndrome Role for nonstructural protein 1 of severe acute respiratory syndrome coronavirus in chemokine dysregulation Polypeptide requirements for assembly of functional Sindbis virus replication complexes: a model for the temporal regulation of minus-and plus-strand RNA synthesis The papain-like protease from the severe acute respiratory syndrome coronavirus is a deubiquitinating enzyme Wrapping things up about virus RNA replication Crystal structure of the RNA polymerase domain of the West Nile virus non-structural protein 5 The non-structural protein nsp10 of mouse hepatitis virus binds zinc ions and nucleic acids Discovery of an RNA virus 3 → 5 exoribonuclease that is critically involved in coronavirus RNA synthesis Nidovirus transcription: how to make sense? Mechanisms of helicases Coronavirus as a possible cause of severe acute respiratory syndrome Modulation of the hepatitis C virus RNA-dependent RNA polymerase activity by the non-structural (NS) 3 helicase and the NS4B membrane protein Identification and characterization of severe acute respiratory syndrome coronavirus replicase proteins ADP-ribose-1 -monophosphatase: a conserved coronavirus enzyme that is dispensable for viral replication in tissue culture Oligomeric interaction of hepatitis C virus NS5B is critical for catalytic activity of RNA-dependent RNA polymerase Severe acute respiratory syndrome coronavirus papain-like protease: structure of a viral deubiquitinating enzyme Characterization of a novel coronavirus associated with severe acute respiratory syndrome Structural basis of severe acute respiratory syndrome coronavirus ADP-ribose-1 -phosphate dephosphorylation by a conserved domain of nsP3 Viral RNA replication in association with cellular membranes Coronaviruses use discontinuous extension for synthesis of subgenome-length negative strands Functional and genetic analysis of coronavirus replicasetranscriptase proteins A contemporary view of coronavirus transcription Nuclear magnetic resonance structure of the N-terminal domain of nonstructural protein 3 from the severe acute respiratory syndrome coronavirus The human coronavirus 229E superfamily 1 helicase has RNA and DNA duplex-unwinding activities with 5 -to-3 polarity A complex zinc finger controls the enzymatic activities of nidovirus helicases Colocalization and membrane association of murine hepatitis virus gene 1 products and De novo-synthesized viral RNA in infected cells Identification of the mutations responsible for the phenotype of three MHV RNA-negative ts mutants Unique and conserved features of genome and proteome of SARS-Coronavirus, an early split-off from the coronavirus group 2 lineage Ultrastructure and origin of membrane vesicles associated with the severe acute respiratory syndrome coronavirus replication complex Dodecamer structure of severe acute respiratory syndrome coronavirus nonstructural protein nsp10 The nsp9 replicase protein of SARS-Coronavirus, structure and functional insights SARS-unique domain" (SUD) of SARS coronavirus is an oligo(G)-binding protein Mechanisms and enzymes involved in SARS coronavirus genome expression A zinc finger-containing papain-like protease couples subgenomic mRNA synthesis to genome translation in a positive-stranded RNA virus Localization of mouse hepatitis virus nonstructural proteins and RNA synthesis indicates a role for late endosomes in viral replication Analysis of intraviral protein-protein interactions of the SARS Coronavirus ORFeome High-throughput yeast two-hybrid assays for large-scale protein interaction mapping Insights into SARS-CoV transcription and replication from the structure of the nsp7-nsp8 hexadecamer The autocatalytic release of a putative RNA virus transcription factor from its polyprotein precursor involves two paralogous papain-like proteases that cleave the same peptide bond We thank Andrew Davidson and John Ziebuhr for helpful discussion and sharing preliminary data. We acknowledge Jessika Zevenhoven-Dobbe, François Ferron, Christian Cambillau, Valérie Campanacci and Sonia Longhi for material and their help in the initial phase of the project. We thank Claire Debarnot, Etienne Decroly and Bruno Coutard for excellent technical assistance. Dr. J.P. Borg is greatly acknowledged for lab space, equipment, and funding to P.L. This work was supported by the Structural Proteomics in Europe (SPINE) project of the European Union 5th framework research program (Grant QLRT-2001-00988) and subsequently by the Euro-Asian SARS-DTV Network (SP22-CT-2004-511064) from the European Commission specific research and technological development Programme "Integrating and strengthening the European Research area" and partially by Interdisciplinary CNRS 2007 program "Infectious Emerging Diseases".