key: cord-0973018-jh4j3zk8 authors: Tse, Herman; Lung, David Christopher; Wong, Sally Cheuk-Ying; Ip, Ka-Fai; Wu, Tak-Chiu; To, Kelvin Kai-Wang; Kok, Kin-Hang; Yuen, Kwok-Yung; Choi, Garnet Kwan-Yue title: Emergence of a Severe Acute Respiratory Syndrome Coronavirus 2 virus variant with novel genomic architecture in Hong Kong date: 2021-03-02 journal: Clin Infect Dis DOI: 10.1093/cid/ciab198 sha: 296f5c485e8bce175d4dadd821fe62e54fbad412 doc_id: 973018 cord_uid: jh4j3zk8 Throughout the COVID-19 pandemic, divergent SARS-CoV-2 lineages have emerged continuously, mostly through the genomic accumulation of substitutions. We report the discovery of a SARS-CoV-2 variant with a novel genomic architecture characterized by absent ORF7a, ORF7b and ORF8, and a C-terminally modified ORF6 product resulting from partial 5’-UTR duplication and transposition. hopes to end the pandemic. Most studies on the genomic evolution of SARS-CoV-2 have focused on single nucleotide variants, as this remains the primary source of genetic diversity in sequenced genomes. In comparison, deletion variants are uncommon, with small deletions seen in ≤3% of viral genomes [2] . Notably, VOC lineage B.1.1.7 variants carried a ∆69/70 deletion in the spike protein, which enabled reverse transcriptase polymerase chain reaction (RT-PCR) assays targeting the spike gene to identify such variants based on the spike gene dropout. Large deletions are rarer, as they often cause significant disturbance of genomic architecture and/or functional loss of gene products. In January 2020, an ORF8 ∆382 SARS-CoV-2 variant was detected in three clusters in Singapore [3] , and was reportedly associated with a favorable clinical outcome. Here, we report three cases of infection caused by a SARS-CoV-2 variant with a novel genomic architecture resulting from deletion of an 882nucleotide region spanning ORF6 to ORF8 together with duplication and transposition of a 57-nucleotide segment from the 5'-untranslated region (UTR). Patient 1 was a 66-year old male local resident who presented with productive cough in late December 2020. He had untreated diabetes mellitus (HbA1c 8.2%) and no history of recent travel. On presentation, he was afebrile and did not report any dyspnea. A combined throat and nasopharyngeal swab (cT/NS) was performed, and tested positive (threshold cycle (C t ) = 15.7) for SARS-CoV-2 RNA by real-time RT-PCR using the Xpert Xpress SARS-CoV-2/Flu/RSV assay (Cepheid). Around day 10 from symptom onset, the patient developed fever and dyspnea with increasing peripheral haziness on serial chest X-rays. Complete blood count showed lymphopenia and thrombocytopenia with nadirs of 0.3×10 9 /L and 113×10 9 /L respectively. Routine biochemistry testing showed elevated liver parenchymal enzymes and lactate dehydrogenase (LDH) levels. C-reactive protein (CRP) progressively increased to a peak of 106 mg/L. He was given a 10-day course of dexamethasone with good clinical response, and was discharged after 18 days of in-patient management. A c c e p t e d M a n u s c r i p t Patient 2 was a 61-year old female local resident hospitalized for low grade fever and productive cough in late December 2020. She had history of diabetes mellitus (HbA1c 7.3%), hypertension, hyperlipidemia and vitamin B12 deficiency. There was no history of recent travel. Her cT/NS tested positive for SARS-CoV-2 RNA by RT-PCR (C t = 22.3) using the Xpert Xpress SARS-CoV-2/Flu/RSV assay. As the patient showed clinical and radiological progression of disease on day 6 from symptom onset, she was given a regimen of interferon beta-1b (3 doses) and ribavirin (7 days) on day 7, followed by dexamethasone (12 days) and tocilizumab (1 dose) starting on days 11 and 14, respectively. Complete blood count showed lymphopenia with a nadir of 0.8×10 9 /L. LDH level peaked at 417 U/L but alanine transaminase level remained within normal limits. CRP levels peaked at 58 mg/L on day 10 from symptom onset. She recovered fully and was discharged after 33 days of hospitalization. Patient 3 was a 46-year old female local nursing home staff who tested positive for SARS-CoV-2 RNA (C t = 30.6) in mid-January 2021 during a compulsory COVID-19 testing exercise organized by the government. She had good past health, and her symptoms were limited to mild cough and running nose. Complete blood counts were normal throughout. There was mild elevation of LDH and CRP levels to 234 U/L and 12 mg/L respectively. The patient was keen to receive early specific treatment for COVID-19, and was given a course of interferon beta-1b (5 doses). She had an uneventful hospital stay and was discharged 26 days after admission. All 3 patients were tested for antibodies against SARS-CoV-2 before discharge. Detection of IgG against the nucleocapsid protein (NP) was performed using the Architect SARS-CoV-2 IgG assay (Abbott), while testing for anti-Orf8 IgG was performed as described previously [4] . The sera samples from all 3 patients were positive for anti-NP IgG but not anti-Orf8, consistent with infection by a SARS-CoV-2 strain with absent ORF8 expression. We performed ongoing genomic surveillance of SARS-CoV-2 to guide local public health and infection control A c c e p t e d M a n u s c r i p t the presence of a major deletion, but some reads around this region also partially mapped to the 5'-UTR of the viral genome. We performed long-range PCR and Sanger sequencing on the cDNA to confirm the deletion (see Supplementary Methods). The sequencing results confirmed that the 882-nucleotide genomic region described above was indeed replaced by a 57-nucleotide segment that is completely identical to a segment of the 5'-UTR (nucleotide position 8 to 64) (Figure 1a) . Hence, ORF7a, ORF7b, ORF8 and the last 12 nucleotides from the 3'end of ORF6 were lost. The remaining partial ORF6 was extended by the duplicated 5'-UTR sequence and the non-coding region preceding the nucleocapsid gene ORF to produce a novel ORF6x (Figure 1b) . Since the Cterminus of the ORF6 protein is important for antagonizing interferon signaling [5] , we speculate that Orf6x might be non-functional in this respect. There is no significant sequence similarity between the deleted and the While small deletion variants are uncommon in SARS-CoV-2, those with selective advantage can achieve rapid spread and predominance, as exemplified by the lineage B.1.1.7 variants. Large deletion variants are rarer, though major deletions around ORF7a, ORF7b and ORF8 had been reported [3, [6] [7] [8] . Nonetheless, the presently described variant is significant in various ways. Firstly, it contained the largest genetic deletion that had occurred naturally in SARS-CoV-2 to date. The accessory proteins ORF6, ORF7a, ORF7b, and ORF8 are nonessential for viral replication and infection in vitro, but are suggested to have important functions like immune evasion [5, [8] [9] [10] . While we refrain from extrapolating its clinical significance in this report, it is somewhat remarkable that the present variant achieved effective transmission and caused disease in at least 2 out of 3 cases. Another feature of note is the duplication and transposition of the partial 5'-UTR in place of the deletion. According to the copy-choice model of genomic recombination in positive-sense single-stranded RNA viruses, A c c e p t e d M a n u s c r i p t 6 this genomic rearrangement could be achieved by the dissociation of a replication complex from the negativesense genomic RNA template and re-association with a negative-sense subgenomic RNA (sgRNA) template for the nucleocapsid gene. However, the re-association would have to occur near the 3'-end of the sgRNA template without any homologous segment for anchoring. Such an occurrence should be very rare, since the probability of successful template-switching correlates with the local sequence similarity between the donor and acceptor templates [11] . It is uncertain whether the present genomic rearrangement was the result of an accidental onetime occurrence or an alternative mechanism [11, 12] . The present discovery highlights the considerable evolutionary potential of SARS-CoV-2. In this case, the tolerance of a major and unusual genomic arrangement led to the formation of ORF6x, which involved recruitment of typically non-coding sequences into the open reading frame. These events greatly increased the genetic diversity of SARS-CoV-2, thereby accelerating the evolution of new potentially advantageous genotypes and compensating for the relatively low mutation rates in coronaviruses. On a more immediate note, caution is warranted when using diagnostic assays targeting only accessory genes or proteins such as Orf8 [4] , given the risk of false-negative results from the sporadic emergence of major deletion variants. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology Geographic and genomic distribution of SARS-CoV-2 mutations Discovery and genomic characterization of a 382-nucleotide deletion in ORF7b and ORF8 during the early evolution of SARS-CoV-2 Accurate diagnosis of COVID-19 by a novel immunogenic secreted SARS-CoV-2 Orf8 protein SARS-CoV-2 Orf6 hijacks Nup98 to block STAT nuclear import and antagonize interferon signaling Identification of multiple large deletions in ORF7a resulting in in-frame gene fusions in clinical SARS-CoV-2 isolates An 81-nucleotide deletion in SARS-CoV-2 ORF7a identified from sentinel surveillance in Arizona Lost in deletion: The enigmatic ORF8 protein of SARS-CoV-2 Evasion of type I interferon by SARS-CoV-2 Novel immunoglobulin domain proteins provide insights into evolution and pathogenesis of SARS-CoV-2-related viruses Why do RNA viruses recombine? RNA recombination in vivo in the absence of viral replication The authors thank the Molecular Laboratory of Department of Pathology, Queen Elizabeth Hospital, for technical support, and the Hospital Authority and the Food and Health Bureau for their administrative and policy support. A c c e p t e d M a n u s c r i p t