key: cord-335377-zrbn637z authors: Ishimaru, Daniella; Plant, Ewan P.; Sims, Amy C.; Yount, Boyd L.; Roth, Braden M.; Eldho, Nadukkudy V.; Pérez-Alvarado, Gabriela C.; Armbruster, David W.; Baric, Ralph S.; Dinman, Jonathan D.; Taylor, Deborah R.; Hennig, Mirko title: RNA dimerization plays a role in ribosomal frameshifting of the SARS coronavirus date: 2012-12-26 journal: Nucleic Acids Res DOI: 10.1093/nar/gks1361 sha: doc_id: 335377 cord_uid: zrbn637z Messenger RNA encoded signals that are involved in programmed -1 ribosomal frameshifting (-1 PRF) are typically two-stemmed hairpin (H)-type pseudoknots (pks). We previously described an unusual three-stemmed pseudoknot from the severe acute respiratory syndrome (SARS) coronavirus (CoV) that stimulated -1 PRF. The conserved existence of a third stem–loop suggested an important hitherto unknown function. Here we present new information describing structure and function of the third stem of the SARS pseudoknot. We uncovered RNA dimerization through a palindromic sequence embedded in the SARS-CoV Stem 3. Further in vitro analysis revealed that SARS-CoV RNA dimers assemble through ‘kissing’ loop–loop interactions. We also show that loop–loop kissing complex formation becomes more efficient at physiological temperature and in the presence of magnesium. When the palindromic sequence was mutated, in vitro RNA dimerization was abolished, and frameshifting was reduced from 15 to 5.7%. Furthermore, the inability to dimerize caused by the silent codon change in Stem 3 of SARS-CoV changed the viral growth kinetics and affected the levels of genomic and subgenomic RNA in infected cells. These results suggest that the homodimeric RNA complex formed by the SARS pseudoknot occurs in the cellular environment and that loop–loop kissing interactions involving Stem 3 modulate -1 PRF and play a role in subgenomic and full-length RNA synthesis. A novel coronavirus was responsible for the sudden epidemic, severe acute respiratory syndrome (SARS) outbreak, in 2003. Coronaviruses are positive-strand RNA viruses with large genomes [$30 000 nucleotides (nt)] that serve as templates for translation of viral proteins and for replication. The production of proteins from these viral RNAs does not follow the usual rules governing translation. The first polyprotein encoded by open reading frame (ORF)1a, which encodes non-structural proteins, is defined by initiation and termination codons and is translated normally. Signals embedded within the RNA just before the termination codon of ORF1a redirect a fraction of translating ribosomes to bypass the stop codon and continue translation in the -1 reading frame, thus creating the larger ORF1ab polyprotein (1-3). These programmed -1 ribosomal frameshift (-1 PRF) stimulating signals are typically composed of a heptameric slippery site, on which the ribosome can change register by 1 nt in the 5 0 direction, followed by a pseudoknot. Slippery site sequence requirements have been characterized for several cell types (4) but the range and diversity of frameshift-stimulating pseudoknots continues to grow (5) . Most frameshift-stimulating pseudoknots are twostemmed H-type structures. However, we and others have shown that the SARS coronavirus (SARS-CoV) -1 PRF signal is composed of three stems [ Figure 1A ; (1, 2, 6) ]. Secondary structure predictions indicate that the potential to form the third stem is conserved among Group II coronavirus even though the RNA sequences themselves are not well conserved (1) . Interestingly, removal of the third stem from the coronavirus frameshift signal still allows for frameshifting (1, 2, 6, 7) . Thus, it is not clear what the molecular role of the additional stem-loop (Stem 3-Loop 2 [S3L2]) is, and this requires further study. Here we scrutinize features of the third stem of the SARS-CoV frameshift-stimulating pseudoknot that are important for RNA structure and frameshifting efficiency. We demonstrate the importance of the capping loop sequence in promoting Stem 3 stability and maintaining near-wild-type levels of frameshifting. Specifically, a hexanucleotide, self-complementary sequence in the loop capping Stem 3 raises the possibility that dimerization of the pseudoknot may play a role in viral lifecycle. While the palindromic sequence embedded in the SARS-CoV Stem 3 is not strictly conserved among severe acute respiratory syndrome-related coronaviruses (SARSr-CoV), genomic sequences from Rhinolophus Chinese horseshoe bats (SARSr-Rh-BatCoV) (8) that were previously identified as a natural reservoir of SARS-CoV-related CoV (9) also encode the hexanucleotide palindrome. 'Loop-loop kissing' interactions involving Watson-Crick base pairing between complementary RNA loops are common RNA-tertiary structural motifs and are used e.g. in retroviral replication (10) (11) (12) (13) (14) . Here, the retroviral nucleocapsid proteins can chaperone conversion of the non-covalently linked kissing dimer to a more thermodynamically stable extended duplex-mediated dimer linkage in a structural rearrangement suggested to be associated with viral particle maturation (15) . Pioneering work on Infectious Bronchitis Virus suggested that the genomic RNA (gRNA) of CoVs is not packaged as a dimer (16) . However, CoV replication is mediated by the synthesis of a negative-strand RNA and also includes a discontinuous step involving synthesis of five to eight nested subgenomic RNA (sgRNA) intermediates (17) . The three-stemmed wild-type SARS pseudoknot. Stems are labeled S1, S2 and S3 in the order that they occur 5 0 to 3 0 along the RNA. Accordingly, loops are labeled L1, L2 and L3. Note that L1 and L3 join adjacent stems, while L2 closes S3 (highlighted using gray box). Only the last two digits of the wild-type sequence numbering are used for clarity. The palindromic sequence 5 0 -ACUAGU-3 0 embedded into L2 is indicated using white circles. Dashes represent Watson-Crick and the dot GU Wobble base-pairing as confirmed by NMR spectroscopy. (b) Stem 3 deletion mutant ÁS3 pk. (c) S3L2 hairpin construct S3L2 spanning nucleotides G 37 to C 60 . (d) S3L2 hairpin constructs S3L2-ACUucc and S3L2-ACUAGc with L2 mutations that render the palindromic sequence asymmetrical (highlighted using gray circles) while conserving a Serine codon. (e) S3L2 hairpin constructs S3-cuug and S3-gaaa where the 9 nt L2 is replaced with the smaller tetraloops 5 0 -cuug-3 0 and 5 0 -gaaa-3 0 , respectively. (f) SARS pseudoknot variants S3-2 bp-cuug pk with a shortened Stem 3 is capped with a 5 0 -cuug-3 0 tetraloop. S3-cuug and S3L2-ACUucc variant constructs highlighted with an asterisk (*) were also generated in the context of full-length pk. In the current work, we demonstrate that a previously overlooked loop-loop kissing interaction involving the conserved Stem-loop 3 embedded within the SARS-CoV pseudoknot occurs under physiological conditions in vitro. We further show that kissing dimer formation plays a role in frameshift-stimulation and modulates the relative abundance of full-length and subgenomic viral RNAs. Plasmids containing wild-type pseudoknot as well as the ÁS3 pk mutant were described in Plant et al (1) . The wild-type pk plasmid was used as a template for the generation of S3L2 mutants. S3L2-only transcripts incorporated a cis-acting, 3 0 -hammerhead ribozyme. This plasmid was the template for site-directed mutagenesis replacing Loop 2 5 0 -GCACTAGTA-3 0 with GCACTAG cA, GCACTtccA, cttg or gaaa. In vitro transcriptions were optimized and performed as described (1, 18, 19) using unlabeled nucleotide triphosphates (MP Biomedicals). RNA transcripts were purified by fast-performance liquid chromatography with HiTrap Q column (GE Healthcare) (18, 19) . S3L2-only transcripts and variants were purified with a HiTrap Q (GE Healthcare), followed by a DNAPac PA200 columns (Dionex) (18) . Purified RNA was equilibrated with nuclear magnetic resonance (NMR) buffer [25 mM KCl unless otherwise noted, 10 mM sodium phosphate (pH 6.5), 500 mM EDTA (Ethylenediaminetetraacetic acid), 50 mM sodium azide, 9:1 H 2 O:D 2 O]. All NMR spectra were recorded at 288 K, 295 K or 298 K on Bruker Avance 900, 800, 700 or 600 MHz spectrometer equipped with either a triple resonance inverse detection cryoprobe (800 and 700) or standard triple resonance inverse detection-probeheads. NMR experiments were performed on samples of 500 ml volume containing 0.2-1.2 mM SARS-CoV pk and stem-loop S3L2 variant RNA. Data were processed using NMRPipe (20) and analysed using Sparky (21) . One-dimensional imino proton spectra were acquired using a jump-return echo sequence (22) . Imino resonances were assigned sequence specifically from water flip-back, WATERGATE 2D nuclear Overhauser effect spectroscopy (NOESY) spectra (t mix = 150-200 ms) (23) . Typically, for the 1 H, 1 H-NOESY spectra, 384 complex points were recorded with an acquisition time of 24 ms for 1 H (o 1 ), and 2048 complex points with an acquisition time of 127 ms for 1 H (o 2 ). Repetition delays ranging from 1.2 (pk variants) to 1.7 s (S3L2 variants) were used between transients, with 128 scans per increment (total measuring times 36-49 h, respectively). Unlabeled (150 mM) and 32 P-labeled (1 pM) SARS-CoV pk and S3L2 RNA transcripts were annealed in NMR buffer unless stated otherwise. When 5 mM MgCl 2 was added, samples were incubated for 6 h at 37 C. Temperature, time of incubation and RNA concentration varied, as specified in the text. RNA samples were separated on 10% native polyacrylamide gel in Tris Borate buffer pH 8.3 at 4 C when MgCl 2 was added to the dimerization reaction. Otherwise, Tris Borate EDTA pH 8.3 was used and the gel analysis performed at 4 C or 25 C, as specified in the text. Gels were dried and analysed by phosphorimaging. In Figures 3A and 3B , gels were analysed by ethidium bromide staining. RNA samples were incubated overnight at 37 C to promote dimerization. Samples were crosslinked in a ultraviolet (UV) crosslinker (Spectroline) for 30 min, 254 nm, 1.5 mW/cm 2 on ice (24) . Crosslinked RNA was separated on denaturing PAGE (polyacrylamide gel electrophoresis) and eluted with 0.3 M sodium acetate, 2% sodium dodecyl sulfate (SDS), pH 5.5 at 42 C for 18 h. Precipitated RNA was subjected to partial alkaline hydrolysis. Control, non-crosslinked, 32 P-labeled RNA was subjected to RNase T1 digestion or partial alkaline hydrolysis (Applied Biosystems). All samples were separated on denaturing 15% PAGE. Gels were dried and densitometry of bands was determined using ImageQuant TL (GE Healthcare) (25) . Transfected VeroE6 cells were grown overnight in Dulbecco's Modified Eagle Medium supplemented with 10% Fetal Bovine Serum at 37 C. Cells were disrupted using the passive lysis buffer (Promega). Luminescence reactions were initiated by addition of 10-20 ml of cell lysates to 100 ml of the Promega LAR II buffer and completed by addition of 100 ml Stop-n-Glo reagent. Luminescence was measured using a Turner Design TD20/20. At least three replicates were performed within each assay, and all assays were repeated at least three times until the data were normally distributed (26) . The frequency of frameshifting is expressed as a ratio of firefly to Renilla luciferase from a test plasmid divided by the analogous ratio from the read-through control plasmid multiplied by 100%. Fold change, standard error and estimates of the P-values for ratiometric analyses were performed as previously described (26) . VeroE6 cells inoculated with SARS-CoV or S3L2-ACU ucc pk (multiplicity of infection of 1) or were mockinfected and incubated at 37 C. Media were harvested at 0, 5, 8 or 12 h post-infection (pi) and titers assessed by plaque assay as previously published (27) . Viral detection limit was 50 pfu/ml. Error bars are the standard deviation of three measurements. Total RNA from SARS-CoV and S3L2-ACUucc pk was isolated from infected cell monolayers (Trizol Invitrogen) and purified using Oligotex mRNA spin column reagents (QIAGEN). RNA was separated on an agarose gel using Northern-Max-Gly (Ambion), transfered to a BrightStar-Plus membrane (Ambion) and cross-linked to the membrane with UV light. The blot was pre-hybridized and probed with a SARS-CoV nucleocapsid-specific oligodeoxynucleotide probe (5 0 -CTTGACTGCCGCCTC , where biotinylated nucleotides are designated with a superscript b. Blots were hybridized overnight and washed with low-and highstringency buffers. Filters were incubated with streptavidin-AP, washed and then developed using the chemiluminescent substrate CDP-STAR (New England Biolabs). At 5, 8 and 12 h pi, SARS-CoV and S3L2-ACUucc pk infected or mock-infected cells were washed and lysed in buffer containing 20 mM Tris-HCl (pH 7.6), 150 mM NaCl, 0.5% deoxycholine, 1% Nonidet P-40, 0.1% SDS and post-nuclear supernatants added to an equal volume of 5 mM EDTA and 0.9% SDS, resulting in a final SDS concentration of 0.5%. Samples were heat inactivated twice before usage. On 4-20% Criterion gradient gels (Bio-Rad), 10 mg of protein was loaded and transferred to a polyvinylidene difluoride membrane. Blots were probed with polyclonal rabbit antisera directed against nsp1 (diluted 1:500) or nsp16 (diluted 1:200) (28) and developed using enhanced chemiluminescence reagents (Amersham Biosciences). All CoV genome sequences were obtained from Gen Bank (http://www.ncbi.nlm.nih.gov/genbank/). Accession numbers for sequences discussed are summarized in Supplementary Table S1. Multiple sequence alignments were performed using ClustalW2 (version 2.0.12) (http:// www.ebi.ac.uk/Tools/msa/clustalw2/) (29) . Free energies of the proposed S3L2 structures at 37 C in 1 M NaCl were calculated with Mfold (30) . Our previous NMR analysis of exchangeable imino protons of the SARS-CoV pseudoknot ( Figure 1A , wild-type pk) provided unequivocal evidence for the existence of Stem 3 (1). In the present study, secondary structure analysis by NMR provided further insight into the complex global architecture of the wild-type pk. To establish experimentally whether Stem 3 interacts with the two-stemmed H-type structure, we prepared a transcript lacking the base-paired region of Stem 3 while retaining the hexanucleotide 5 0 -ACU AGU-3 0 palindromic sequence ( Figure 1B , ÁS3 pk) and compared this construct with wild-type pk. The imino NOESY spectrum of the ÁS3 pk transcript is virtually identical to the wild-type pk native construct with a few marked exceptions. The missing sequential imino assignment path for the mutant ÁS3 pk is indicated by dashed lines in the superposition of wild-type pk and ÁS3 pk NOESY spectra shown in Figure 2A . For clarity, these nuclear Overhauser effect (NOE) connectivities including the characteristic G38-U59 wobble pair are highlighted in the schematic secondary structure of the three-stemmed pseudoknot structure (dashed box) and correspond to Stem 3. No significant chemical shift or linewidth changes can be observed for imino protons located outside of Stem 3. The only notable exception is the severely broadened crosspeak for the G14-C25 base-pair in the ÁS3 pk construct located in the vicinity of the Stem 1-Stem2 junction (Figure 2A , red box). A possible explanation for this observation is that a longer unpaired Loop 3 consisting of 12 (ÁS3 pk) rather than 3 nt (wild-type pk) may affect the degree of overrotation at the S1-S2 junction. Addressing this question in more detail would require a complete structure determination. At this stage, the large intrinsic linewidth of a 67-nt wild-type pk RNA in combination with severe line-broadening observed for NMR samples concentrated to >200 mM made this procedure unrealistic (data not shown). However, the overall comparison of exchangeable imino proton spectra of the wild-type pk with ÁS3 pk suggests that Stem 3 does not noticeably engage in stable tertiary interactions involving the two-stemmed H-type structure and likely constitutes an autonomous substructure within the frameshift signal. The NMR line-broadening observed motivated a closer examination of the SARS pseudoknot and revealed a palindromic sequence in the loop capping Stem 3, designated Loop 2 (L2) ( Figure 1A ). To determine whether this palindromic sequence could mediate dimerization, Stem 3 transcripts ( Figure 1C , S3L2) were incubated at 37 C for 30 min in the presence of KCl and subjected to native PAGE. Lane 1 of Figure 3A shows two bands and demonstrates that S3L2 transcripts form homodimers when analysed at 4 C. To evaluate the role of the palindromic sequence for dimer formation, a series of mutations were generated ( Figure 1D -F) with the intention of altering the palindromic sequence ( Figure 1D ), replacing the entire Loop 2 ( Figure 1E ) or significantly reducing the size of both Stem 3 and Loop 2 ( Figure 1F ). When the palindromic sequence was mutated from 5 0 -A CUAGU-3 0 to ACUAGc or ACUucc, the resulting S3L2-A CUAGc and -ACUucc transcripts migrated as single species, indicating that dimer formation was abolished ( Figure 3A , lanes 2 and 3). Unexpectedly, replacement of the entire Loop 2 (5 0 -GCACUAGUA-3 0 ) with the stable tetraloop 5 0 -cuug-3 0 generated a transcript migrating as a dimeric species ( Figure 3A , lane 4). This could be explained by the formation of an extended duplex featuring two central U-U wobble mismatches and was further investigated using NMR methods. In contrast, a 5 0gaaa-3 0 tetraloop containing transcript, S3-gaaa, efficiently prevented dimer formation in vitro ( Figure 3A , lane 5). To verify whether dimers could be observed in the context of the full-length pseudoknot, SARS wild-type pk transcripts were incubated in the same conditions described above. Full-length pk homodimers were observed ( Figure 3C , lane 2) as demonstrated for S3L2 constructs ( Figure 3C , lane 1). To confirm that pk dimerization occurs via Loop 2, full-length pk, ÁS3 pk and S3-2bp-cuug pk were incubated with S3L2 transcripts ( Figure 3C ). Detection of heterodimers was performed by incubation of unlabeled pk with 32 P-labeled S3L2 (S3L2*) transcripts ( Figure 3C , lane 5), where the radioactive-labeled transcripts showed mobility shifts compatible with both homo-and heterodimers. Similarly, ÁS3 pk, a construct that lacks Stem 3 but retains Loop 2, was found to self-associate ( Figure 3C , lane 3) and to form heterodimers when incubated with S3L2 ( Figure 3C , lane 6), albeit weakly. S3L2 was unable to form heterodimers with S3-2bp-cuug pk ( Figure 3C, lane 7) , a variant lacking sequence complementarity in Loop 2. These results collectively indicate that the SARS pseudoknot can homodimerize in vitro via the palindromic sequence located in Loop 2. Because ÁS3 pk is able to self-associate weakly, we conclude that stable S3 formation facilitates dimerization but is not a requirement. The stability of loop-loop kissing interactions in mitochondrial transfer RNA (31) and viral RNA (32, 33) are highly dependent on cation concentration. To test whether SARS S3L2 dimer formation is favored in the presence of Mg 2+ -ions (34), [ 32 P] 5 0 -end-labeled transcripts were incubated in the absence or presence of MgCl 2 at 37 C and separated by native PAGE. Comparison of Figures 4A and B shows that S3L2 self-association is a Mg 2+ -dependent event. In the absence of Mg 2+ , no appreciable change in dimer population was observed between 0 and 48 h at 37 C ( Figure 4A ). Next, we investigated S3L2 self-association as a function of increasing monovalent potassium cation concentration ( Figure 4E ). We found that S3L2 self-association was predominantly driven by the presence of Mg 2+ because significant but slow dimerization resulted from the addition of 6-250 mM KCl in the absence of Mg 2+ ( Figure 4E , open circles), while responses to varying KCl concentrations were negligible in the presence of 5 mM Mg 2+ ( Figure 4E , closed circles). To examine the influence of temperature on dimer formation, S3L2 RNAs were incubated in the presence of MgCl 2 at 25 C and 37 C (Figures 4C and 4D ). Aliquots at various time-points were collected and stored at À20 C until all samples were collectively analysed by native PAGE. As shown in Figure 4D , S3L2 dimers readily formed at physiological temperatures (T ½ = 48 ± 12 min), while dimer formation was considerably slower at room temperature (T ½ = 19.8 ± 6.5 h; Figure 4C ). We also quantified the concentration dependence of S3L2 dimerization, in the presence of MgCl 2 , and determined the dissociation constant for the dimer to be 2.6 ± 0.15 mM at 37 C ( Figure 4F ). Taken together these results suggest that Stem 3/Loop 2 readily dimerizes under physiological conditions, i.e. at 37 C in the presence of Figure 2 . NMR secondary structure comparison of wild-type SARS-CoV pk, ÁS3 pk, S3 and S3L2-ACUucc mutants. (a) Imino regions of 2D 1 H, 1 H-NOESY experiments collected on wild-type SARS-CoV pk (black contours) and ÁS3 pk mutant (red contours), respectively. Dashed black lines show the imino proton walk in the S3 stem. The base-paired region of S3 is deleted in the ÁS3 pk mutant; however, L2 is left intact. Solid red lines show the sequential NOE correlations involving the imino proton U26 located in S1, and the red box highlights the cross peak connecting imino protons U26 and G14 adjacent to the S1-S2 junction, which is absent in the ÁS3 pk mutant. Only the last two digits of the wild-type sequence numbering are used for clarity. The schematic SARS-CoV pk inset highlights the corresponding S3 stem (dashed box) as well as the G14-C25 basepair location in S1 (solid red box). (b) Imino regions of 2D 1 H, 1 H-NOESY experiments collected on wild-type SARS S3 (black contours) and the S3L2-ACUucc mutant (red contours), respectively. Dashed black lines show the imino proton walk in the lower portion of the S3 stem. Solid red lines highlight the sequential cross peaks in the upper portion of S3 correlating imino protons G55, U54 and G43 adjacent to L2, which are broadened beyond detection in the S3L2-ACUucc mutant. The schematic SARS S3L2 inset highlights the corresponding lower S3 stem (dashed box) as well as the base-paired region in the upper S3 stem (solid red box). 5 mM MgCl 2 , and that homodimers tolerate a broad range of ionic strengths. Intermolecular loop-loop kissing of retroviral gRNA are initially metastable and subsequently converted to more stable mature duplexes catalysed by nucleocapsid proteins. Such processes involving palindromic 6-nt sequences have been extensively studied and described for Moloney murine leukemia virus (35), Hepatitis C virus (36) and Human immunodeficiency virus (HIV) dimers (15) . We thus asked the question whether S3L2 loop-loop kissing complexes ( Figure 5B ) can potentially form extended duplexes ( Figure 5C ). Seminal work by Laughrea and Jette´have established that loose (loop-loop kissing) dimers are unstable when subjected to native electrophoresis at 25 C, while tight (extended) duplexes resist these conditions (37, 38) . As shown in lane 1 of Figures 3A and B , S3L2 dimers, while detectable at 4 C, are not favored at 25 C, suggesting the formation of loop-loop kissing complexes. To further demonstrate that S3L2 dimerizes via L2-mediated loop-loop kissing interactions, UV-crosslinked dimers were subjected to partial hydrolysis. Normalized densitometry of bands in each lane revealed a significant reduction in 32 P-signal beginning at nucleotide 45 ( Figure 6B ), indicating that this nucleotide is in close proximity to the dimerization interface. The nucleotide at position 45 represents the adenosine 5 0 -neighbor of the 6-nt palindrome, consistent with SARS S3L2 loop-loop kissing formation ( Figure 5B ) and cannot be explained on the basis of extended duplex formation ( Figure 5C ). Detection of loop-loop kissing dimers for S3L2 transcripts prompted us to investigate whether pk, ÁS3 pk and S3-2bp-cuug pk homodimers formed loose or tight complexes. As shown in Figure 3D , all three homodimers remained stable when subjected to native gel at room temperature, an indication of tight dimer formation (37, 38) . However, as observed previously, only faint bands were detected for ÁS3 pk homodimers ( Figures 3C and 3D, lane 3) . Altogether, these results suggest that S3L2 initially forms loop-loop kissing dimers and, when embedded into larger SARS pk constructs, can mature to form tight dimers. Extended duplex formation can also be observed in case of constructs capped by stable 5 0 -cuug-3 0 tetraloops. To obtain direct structural information about the observed homodimers, a series of RNA transcripts were further investigated by NMR. Watson-Crick base pairing for the isolated Stem 3 RNA was confirmed by twodimensional nuclear NOESY spectra and a sequential imino walk is indicated in Figure 2B . As compared with the native S3L2 construct, a number of exchangeable imino resonances disappeared at 283 K in the NOESY spectrum of the asymmetric S3L2-ACUucc Loop 2 mutant. The missing sequential imino assignment path for the mutant S3L2-ACUucc corresponds to the upper portion of Stem 3 and is indicated by solid red lines in the superposition of S3L2 and S3L2-ACUucc imino NOESY investigation, we attempted to monitor the conversion by NMR with nucleotide-specific resolution in the presence of only potassium chloride. Therefore, we recorded a series of consecutive one-dimensional jump-return echo NMR spectra (Figure 7) to monitor S3L2 imino proton resonances in real-time. The overlapping G43 and U48 as well as G44 and G58 imino proton resonance assignments were confirmed using 2D heteronuclear 1 H, 15 N-HMQC correlations (Supplementary Figure S1 ). Dimer formation at 25 C was induced by addition of 125 mM potassium chloride to an NMR sample containing 250 mM S3L2 RNA and (interrupted) 1D spectra recorded over a period of >20 days. Figure 7 shows that half of the kissing complex is formed after $9 h and conversion progresses for >20 days before reaching a plateau characterized by a S3L2:(S3L2) 2 ratio of 2:8. No evidence for further progression to a potential extended duplex structure could be detected. Curiously, the replacement of the 9-nt loop L2 with the stable 5 0 -cuug-3 0 tetraloop (S3-cuug, Figure 1E ) designed to stabilize hairpin formation generated a transcript that efficiently dimerized ( Figure 3A, lane 2) . The imino assignments through the stem could be easily followed based on the resonance assignments and NOE patterns observed in S3L2 (Supplementary Figure S2) . Close inspection of the sequential assignments and NOE patterns revealed the existence of tandem U-U wobble pairs involving the two uridine nucleotides of the tetraloop. In addition, substantial line broadening was observed consistent with extended duplex formation of S3-cuug transcripts (Supplementary Figure S2 ). Subsequently, we examined the NOESY spectra of the larger, three-stemmed 5 0 -cuug-3 0 -tetraloop containing SARS mutants S3-2bp-cuug ( Figure 1F ) and S3-cuug pk that were subjected to functional frameshifting analysis. We confirmed the formation of the tandem U-U wobble pairs by 2D NOESY. The NOE patterns for the U-U wobbles are an almost perfect subspectrum of the same region of the NOESY for the isolated S3-cuug transcripts (Supplementary Figure S3) . Other exchangeable protons in the Stem 1 and Stem 2 regions comprising the H-type pseudoknot overlay with the corresponding imino protons observed in the context of wild-type pk. The excellent agreement in the overlay of the imino connectivities traced by the NOESY experiments indicates that the stable 5 0 -cuug-3 0 tetraloop capping an otherwise wild-type Stem 3 facilitates the formation of extended duplex structures in the context of larger S3-2bp-cuug and S3-cuug pk mutants and locally retains the same structure as S3-cuug in isolation. In contrast to the SARS S3L2 variants capped with 5 0 -cuug-3 0 tetraloops, S3-gaaa ( Figure 1E ) NMR samples showed pronounced differences and narrower imino proton linewidths. Based on NOESY spectra and the observation of an additional upfield-shifted guanosine resonance located in the sheared G-A loop basepair, it was concluded that a construct containing a 5 0 -gaaa-3 0 tetraloop predominantly adopts a hairpin structure (Supplementary Figure S4) , which is consistent with the native gel mobility, indicating a monomeric species (lane 3, Figure 3A ). To determine the importance of L2-mediated selfassociation for SARS-CoV -1 PRF, several pseudoknot variants were subjected to a dual luciferase reporter-based frameshifting analysis as previously described ( Figure 8A ) (1, 39) . A Loop 2 ACUucc silent (serine codon) mutation was made to investigate the role of the palindrome (S3L2-ACUucc; Figure 1D ) in promoting -1 PRF. Phylogenetic analysis indicated that the sequence UCC is present at a (1). Dual luciferase measurements of -1 PRF from the S3L2-ACUucc pk construct showed a 63% reduction in frameshift stimulation compared with wild-type pk (5.7% versus 15.0%, Figure 8A ). Next, we tested two S3 mutants for frameshift stimulation in the luciferase assay. S3-cuug ( Figure 1E ), which we showed to readily self-associate ( Figure 3A, lane 2) , demonstrated a nominal increase in frameshift frequency (17.5%), while a S3 truncation of the same construct (S3-2bp-cuug, Figure 1F ) exhibited near wild-type efficiency (15.2%). It is important to note that the previously characterized S3-deletion ÁS3 pk construct maintained dimerization capability albeit with reduced efficiency (Figures 3B and C, lane 3) . Our results are consistent with previous analyses by three different groups that have shown mutations to the third stem of a three-stemmed pseudoknot to have less impact on frameshifting efficiency than mutations to Stem 2 (1,2,6,7). However, our data indicate that silent codon changes to Loop 2 that disrupt the palindrome play a role in regulating the frequency of frameshifting. Having shown that the SARS-CoV pseudoknot dimerizes in vitro and alters frameshifting in Vero cells, we asked whether the ability to self-associate is important for viral propagation. Site-directed mutagenesis was used to introduce the Loop 2 ACUucc silent mutation into SARS subclone D (40) , and the recombinant virus mutants were recovered from full-length virus transcripts. The observed effects should therefore reflect changes related to RNA structure within the frameshifting signal, rather than modifying the encoded proteins. To determine the effects on viral growth kinetics, stocks of rescued wild-type SARS-CoV and S3L2-ACUucc pk (multiplicity of infection (MOI) of 1) were used to infect VeroE6 cells, media was harvested at 0, 5 and 8 h pi and tested by plaque assay. Viral titers for SARS-CoV increased by wild-type pk ΔS3 pk S3L2-ACUucc pk S3-cuug pk S3-2bp-cuug pk 1.00E+01 2.5 logs at 5 h pi and by 3.5 logs at 8 h pi, while the growth kinetics for S3L2-ACUucc, a mutant that does not form dimers, lagged behind at $1 log increase at 5 h pi and $2.5 logs by 8 h pi ( Figure 8B ). Our analysis demonstrates that disruption of the palindrome via silent codon changes does impede growth kinetics for S3L2-ACUucc pk at early times pi; however, by 12 h pi, similar titers were detected (data not shown). To determine if the differences in viral growth kinetics were due to reduced levels of RNA replication and/or synthesis, we harvested total RNA at 0, 5, 8 and 12 h pi and performed northern blot analysis. Total RNA was probed using a nucleocapsid specific biotinylated probe ( Figure 9A ) to examine levels of gRNA. S3L2-ACUucc gRNA levels were reduced $2-fold when compared with SARS-CoV at 5 and 8 h pi, suggesting that RNA replication contributed to the reduction in viral titers ( Figure 9B ). To determine the quantities of sgRNA species, the total RNA was enriched for poly A containing mRNA species ( Figure 9C ) before separation on agarose gel. All sgRNA species were readily detectible in both SARS-CoV and S3L2-ACUucc samples; however, $1.5-fold reduction in the quantity of sgRNA species was detected at 5 and 8 h pi ( Figure 9D ), suggesting that reduced levels of RNA transcription also contributed to the titer loss. Finally we compared the levels of ORF1a and ORF1b replicase proteins to determine if these protein levels were altered as well. Equivalent protein concentrations were probed with polyclonal rabbit antisera directed against nsp1 or nsp16 as indicated ( Figure 9E and F). Reduced ORF1a and ORF1b replicase protein levels were seen at 5 and 8 h pi but similar levels were detected by 12 h pi (Supplementary Figure S5 ). Together these data indicate that dimerization is not required for viral viability but it is important for maintaining efficient rates of virus growth and genomic and subgenomic mRNA synthesis. While Stem 3 is conserved in group 2 coronaviruses, the palindrome in loop 2 is not The common ancestor of the civet and human strains seems to have been a bat virus (9, 41) . Therefore, we attempted to obtain information about the combined conservation of sequential and structural features of Stem 3 from a compilation of 22 sequences from related human [GenBank accession codes AY304495 (42) and NC_004718 (43, 44) ], masked palm civet [GenBank accession code AY304488 (42) ] and bat coronaviruses. For this purpose, Mfold (30) was used to predict lowest energy structures of aligned CoV S3 sequences. While 22 out of 24 nt of the S3L2 region are absolutely invariant between the 22 viral species (Supplementary Figure S6) , the variation that does occur most frequently disrupts the hexanucleotide palindromic sequence at its 3 0 -end exhibiting a U51C substitution (Supplementary Figure S6) . The U51C substitution stabilizes the hairpin and lowers the calculated ÁG from À4.8 kcal/mol to À7.5 kcal/mol (Supplementary Table S1 ). We previously showed that this 'ancestor' S3L2-AGUAGc 51 variant was dimerization incompetent (lanes 2, Figure 3A, D) . On the basis of phylogeny clustering, CoVs are classified into three Groups (Group 1-3) . The SARS-CoV lineage has been proposed to cluster with Group 2 (44, 45) . To further analyse conservation patterns within the S3 element in a broader context, we aligned 28 complete genomic CoV sequences (summarized in Supplementary Table S1 ). As previously shown, pseudoknots containing a structured Stem 3 can be predicted for all Group 2 coronaviruses (1). The calculated average free energy for predicted Stem 3s of Group 2 CoV is ÁG = À6.8 ± 1.2 kcal/mol. Except for Feline CoV (ÁG = À5.6 kcal/ mol), all Group 1 sequences are not predicted to form stable Stem 3 substructures and are characterized by average free energies around zero (ÁG = À1.4 ± 1.7 kcal/mol). Similarly, in Group 3 coronaviruses the S3L2 region is most likely a single-stranded loop as indicated by a predicted ÁG of À1.8 kcal/mol (1). Remarkably, among the predicted S3L2 structures of all 17 Group 2 coronaviruses investigated, SARS-CoV codes for the second most labile Stem 3 with a ÁG of À4.8 kcal/ mol; only the Human CoV HKU1 Stem 3 is predicted to be less stable as indicated by its ÁG of À4.7 kcal/mol (Supplementary Figure S7) . Thus, SARS-CoV may form homodimers using its unique hexanucleotide palindromic sequence to compensate for a lack of thermodynamic stability. Although a coronavirus frameshift signal was among the first pseudoknots to be identified and has been extensively studied using mutagenesis (7) , the size of the CoV pseudoknots have limited structural analyses at the atomic level. NMR and X-ray crystallography have been used to describe several smaller pseudoknots but the differences between the pseudoknots makes it impossible to extrapolate features from the smaller pseudoknots to the larger ones (5) . From studies of HIV and other retroviruses, it is clear that modulation of frameshift efficiency can have a dramatic effect on virus viability, and the same is true for positive-strand RNA viruses (46, 47) . Recently, it was demonstrated that the SARS-CoV pseudoknot could potentially be a target for antiviral agents making it imperative to understand its structure and function in greater detail (48) . Our studies of the third stem of the SARS coronavirus pseudoknot illuminate features of Stem 3 that affect RNA structure, frameshifting frequency and viral replication. Typically, G-C-rich Watson-Crick complementarity is required for stable loop-loop base pairing interactions between two RNA hairpins. Among all natural HIV isolates, two palindromic hexanucleotide sequences are most commonly found: GCGCGC and GUGCAC (49, 50) . However, mutagenesis work investigating HIV RNA dimerization also demonstrated that the GUUAAC palindrome found in SIV mnd (Simian Immunodeficiency Virus) could yield up to 20% dimers in the presence of 5 mM MgCl 2 (51) . In good agreement with those studies, we confirm that weaker palindromic sequences such as AC UAGU can readily facilitate intermolecular loop-loop kissing RNA-RNA interactions. RNA pseudoknots implicated in -1 PRF stimulation are not static and are in fact dismantled during translation (52, 53) . If a frameshift event does not occur during the first round of translation then the pseudoknot will have to refold for frameshifting to occur in subsequent rounds. Long unstructured loops between the 3 0 portion of Stem 1 and 5 0 portion of Stem 2 would be expected to reduce the chances of Stem 2 forming and reduce frameshifting efficiency. If this sequence can form a stable restricting S3L2 substructure, as is the case for the SARS-CoV pseudoknot steadied by an intermolecular kissing complex, then the rapid re-formation of Stem 2, which is required for efficient frameshifting, will be more probable. In previous work, the functional importance of the 31 nt comprising Stem 3 and Loop 2 in the SARS-CoV pseudoknot (C36 through A67) were investigated and no obvious determinants for efficient frameshift stimulation for this specific element could be identified. Deletion analysis characterizing variants with 5(6) and 11 nt (2) crossing the minor groove side of Stem 1 yielded 35 and 52% of wild-type frameshifting levels. However, because of the common lack of precise structural information for pseudoknot varaints, determining the exact contribution of specific nucleotides of the pseudoknot to frameshifting is often difficult. There is a potential pitfall in the mutational methods because longer-range distortions that are difficult to predict using e.g. Mfold especially in complicated three-stemmed architectures of Group II CoV could lead to false hypotheses regarding the physical mechanism of frameshifting. Secondary structures and basepairing patterns for all SARS-CoV variants investigated in this study were characterized using NMR methods. Because the 5 0 -ACU AGU-3 0 palindromic sequence was left intact, even the S3-deletion ÁS3 pk mutant could dimerize and thus efficiently stimulated -1 PRF. On the other hand, the S3L2-ACUucc did not engage in loop-loop kissing interactions, was characterized by an open Loop 2 structure, which destabilized Stem 3, and its ability to promote -1 PRF was reduced almost 3-fold. When we capped Stem 3 with a stable CUUG tetraloop that normally facilitates formation of stable hairpin structures (54) and is highly conserved (55) , the frequency of frameshifting almost returned to wild-type levels ( Figure 8) . Surprisingly, in the context of the SARS-CoV Stem 3 sequence, 5 0 -cuug-3 0 tetraloop-capped mutants readily formed extended duplex structures as revealed by native gel and NMR analysis. Thus, the promotion of wild-type levels of frameshifting exhibited by S3-cuug and S3-2bp-cuug SARS pseudoknot variants is consistent with a mechanism involving dimer formation. The metastable SARS-CoV Stem 3 sequence, which features a G38 · U59 wobble pair and a bulged A56, apparently evolved to facilitate a certain degree of dimerization if seeded through loop-loop kissing interactions that can even tolerate tandem U-U mismatches. The S3L2-ACUucc pk mutant, in which frameshifting was reduced nearly 3-fold, was viable. However, while the mutant replicated to levels similar to the wild-type virus at later times pi, the inability to dimerize in infected Vero cells significantly changed the viral growth kinetics, RNA species and replicase protein levels. The fact that a silent codon change in the Loop 2 of SARS-CoV reproducibly affected the levels of gRNA and sgRNA strongly suggests that dimer formation occurs in the cellular environment and that loop-loop kissing interactions involving Stem 3 of the pseudoknot are important for accumulation of sgRNA. The in vitro findings of reduced total RNA levels and ORF1a/b protein following infection leading to a lag in replication are consistent with reduced levels of ORF1b translation products as a result of the 3-fold reduction in -1 PRF. The downregulation of the ORF1a gene product nsp1 is likely a result of the decrease in the amount of gRNA in the infected cells. Several studies have demonstrated that facile formation of the H-type skeleton involving Stem 1 and 2 of a frameshift-stimulating pseudoknot structure is critical for efficient frameshifting (56, 57) . Intriguingly, several long-distance intramolecular loop-loop kissing has also been implicated in -1 PRF stimulation. Specifically, the Group 1 Human CoV 229 E has been shown to use a kissing stem loop to promote frameshifting rather than a compact H-type pseudoknot (58) . Based on sequence alignments, such 'elaborated' pseudoknots could be used by Human CoV NL63, porcine epidemic diarrhea virus and transmissible gastroenteritis virus (6) . Similarly, the P1-P2 pseudoknot from the luteovirus barley yellow dwarf virus can be considered a variation on the 'elaborated' CoV pseudoknot structure. It contains a loop L2 of nearly 4 kb, with S2 formed by long-range kissing interactions with nucleotides near the 3 0 end of the genome (59) . Such a long-distance RNA-RNA kissing interaction is also implicated in -1 PRF of red clover necrotic mosaic virus (60) . Thus, an intermolecular dimerization in SARS-CoV may simply represent a variation of the 'elaborated' pseudoknot. In conclusion, we demonstrate that the structured third stem of SARS-CoV represents an integral part of the -1 PRF stimulating pseudoknot topology. While the exact functional interplay in the viral lifecycle and timing of Stem 3 folding-unfolding interconversions remain to be determined, our results provide important first insight into a palindromic sequence element embedded into Stem 3 that permit loop-loop kissing formation for efficient -1 PRF stimulation, efficient translation of ORF1a/b encoded proteins and relative amounts of sgRNA to remain close to wild-type levels. A three-stemmed mRNA pseudoknot in the SARS coronavirus frameshift signal An atypical RNA pseudoknot stimulator and an upstream attenuation signal for -1 ribosomal frameshifting of SARS coronavirus Mechanisms and enzymes involved in SARS coronavirus genome expression Comparative study of the effects of heptameric slippery site composition on -1 frameshifting among different eukaryotic systems Frameshifting RNA pseudoknots: structure and mechanism Programmed ribosomal frameshifting in decoding the SARS-CoV genome Mutational analysis of the RNA pseudoknot component of a coronavirus ribosomal frameshifting signal Ecoepidemiology and complete genome comparison of different strains of severe acute respiratory syndrome-related Rhinolophus bat coronavirus in China reveal bats as a reservoir for acute, self-limiting infection that allows recombination events A review of studies on animal reservoirs of the SARS coronavirus Requirements for kissing-loop-mediated dimerization of human immunodeficiency virus RNA Evidence that a kissing loop structure facilitates genomic RNA dimerisation in HIV-1 A 19-nucleotide sequence upstream of the 5 0 major splice donor is part of the dimerization domain of human immunodeficiency virus 1 genomic RNA Identification of the primary site of the human immunodeficiency virus type 1 RNA dimerization in vitro Impact of human immunodeficiency virus type 1 RNA dimerization on viral infectivity and of stem-loop B on RNA dimerization and reverse transcription and dissociation of dimerization from packaging Dimerization of retroviral RNA genomes: an inseparable pair Genome of infectious bronchitis virus Biochemical aspects of coronavirus replication and virus-host interaction RNA structure determination by NMR NMRPipe: a multidimensional spectral processing system based on UNIX pipes University of California Absorption mode two-dimensional NOE spectroscopy of exchangeable protons in oligonucleotides Use of a water flip-back pulse in the homonuclear noesy experiment Pattern of 4-thiouridine-induced cross-linking in 16S ribosomal RNA in the Escherichia coli 30S subunit Systematic analysis of bicistronic reporter assay data Severe acute respiratory syndrome coronavirus infection of human ciliated airway epithelia: role of ciliated cells in viral spread in the conducting airways of the lungs Identification and characterization of severe acute respiratory syndrome coronavirus replicase proteins Clustal W and Clustal X version 2.0 Mfold web server for nucleic acid folding and hybridization prediction Dimerization of a pathogenic human mitochondrial tRNA Dimerization of HIV-1 genomic RNA of subtypes A and B: RNA loop structure and magnesium binding Molecular dynamics simulations of RNA kissing-loop motifs reveal structural dynamics and formation of cation-binding pockets RNA LEGO: magnesium-dependent formation of specific RNA assemblies through kissing interactions Bipartite signal for genomic RNA dimerization in Moloney murine leukemia virus Hepatitis C virus genomic RNA dimerization is mediated via a kissing complex intermediate Kissing-loop model of HIV-1 genome dimerization: HIV-1 RNAs can assume alternative dimeric forms, and all sequences upstream or downstream of hairpin 248-271 are dispensable for dimer formation HIV-1 genome dimerization: kissing-loop hairpin dictates whether nucleotides downstream of the 5 0 splice junction contribute to loose and tight dimerization of human immunodeficiency virus RNA Achieving a golden mean: mechanisms by which coronaviruses ensure synthesis of the correct stoichiometric ratios of viral proteins Reverse genetics with a full-length infectious cDNA of severe acute respiratory syndrome coronavirus Recombination, reservoirs, and the modular spike: mechanisms of coronavirus cross-species transmission Isolation and characterization of viruses related to the SARS coronavirus from animals in southern China The genome sequence of the SARS-associated coronavirus Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage Maintenance of the Gag/Gag-Pol ratio is important for human immunodeficiency virus type 1 RNA dimerization and viral infectivity Analysis of natural variants of the human immunodeficiency virus type 1 gag-pol frameshift stem-loop structure Interference of ribosomal frameshifting by antisense peptide nucleic acids suppresses SARS coronavirus replication Structure and function of the human immunodeficiency virus leader RNA Dimerization of retroviral genomic RNAs: structural and functional implications Variant effects of non-native kissing-loop hairpin palindromes on HIV replication and HIV RNA dimerization: role of stem-loop B in HIV replication and HIV RNA dimerization Correlation between mechanical strength of messenger RNA pseudoknots and ribosomal frameshifting Triplex structures in an RNA pseudoknot enhance mechanical stability and increase efficiency of -1 ribosomal frameshifting Solution structure of the CUUG hairpin loop: a novel RNA tetraloop motif Architecture of ribosomal RNA: constraints on the sequence of ''tetra-loops Programmed translational frameshifting Structure, stability and function of RNA pseudoknots involved in stimulating ribosomal frameshifting An 'elaborated' pseudoknot is required for high frequency frameshifting during translation of HCV 229E polymerase mRNA A sequence required for -1 ribosomal frameshifting located four kilobases downstream of the frameshift site A long-distance RNA-RNA interaction plays an important role in programmed -1 ribosomal frameshifting in the translation of p88 replicase protein of Red clover necrotic mosaic virus We acknowledge the support of the Hollings Marine Laboratory NMR Facility for this work. Conflict of interest statement. None declared. Supplementary Data are available at NAR Online: Supplementary Table 1 and Supplementary Figures 1-7.