key: cord-304340-9mrtic2k
authors: Karacan, Ilker; Akgun, Tugba Kizilboga; Agaoglu, N. Bugra; Irvem, Arzu; Alkurt, Gizem; Yildiz, Jale; Kose, Betsi; Ozel, A. Serra; Altunal, L. Nilsun; Can, Nisan Denizce; Demirkol, Yasemin Kendir; Aydin, Mehtap; Dogan, Ozlem Akgun; Doganay, Levent; Doganay, Gizem Dinler
title: The origin of SARS-CoV-2 in Istanbul: Sequencing findings from the epicenter of the pandemic in Turkey
date: 2020-05-15
journal: North Clin Istanb
DOI: 10.14744/nci.2020.90532
sha:
doc_id: 304340
cord_uid: 9mrtic2k

OBJECTIVE: Turkey was among the last countries to report COVID-19, with its first case on March 11, 2020; since then, Istanbul has become the epicenter of the pandemic in Turkey. Here, we report the sequences of the virus isolated from three patients with differing clinical presentations.

METHODS: Nasopharyngeal swab specimens from the patients tested positive for SARS-CoV-2 by qRT-PCR. Viral RNA was extracted from the same swab samples. Amplicon-based libraries were prepared and sequenced on the Illumina NextSeq platform. Raw sequencing data were processed for variant calling and for generating near-complete genome sequences. All three genomes were evaluated and compared with other worldwide isolates.

RESULTS: The patients had differing clinical presentations (one asymptomatic, one with mild disease, and one with severe pulmonary infiltration). The amplicon-based next-generation sequencing approach was successfully applied to generate near-complete genomes with an average depth of 2,616×. All three viral genomes carried the D614G variant (G clade according to the GISAID classification), which suggests that the virus spread first from China to Europe and then to Istanbul.

CONCLUSION: Here, we report the viral genomes circulating in Istanbul for the first time. Further sequencing of virus isolates may help us understand variations in disease presentation and their association with viral factors, if any.
In addition, sequencing more viral genomes will delineate the spread of the disease and will guide and ease the measures needed to stem the spread of the novel coronavirus.

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which causes coronavirus disease 2019 (COVID-19), was first detected in Wuhan, China, around mid-December 2019. Following the spread of the outbreak to other countries within a short period, the World Health Organization declared COVID-19 a pandemic on 11 March 2020. As of May 2020, more than 3.5 million COVID-19 patients had been reported worldwide, and nearly 250 thousand deaths had resulted from the SARS-CoV-2 pandemic [1]. The novel SARS-CoV-2 belongs to the Betacoronavirus genus of the Coronaviridae family, whose members are single-stranded RNA viruses [2]. Genetic similarity analyses revealed that the bat coronavirus RaTG13 showed the highest similarity (96%) to SARS-CoV-2, and it was suggested that the zoonotic origin might include bats [3, 4]. Coronaviruses were thought to cause only mild seasonal respiratory illness in humans until the epidemics of SARS-CoV in 2003 and MERS-CoV in 2012 [5, 6]. As the number of COVID-19 cases has increased worldwide, new symptoms have been reported, such as nausea, diarrhea, skin rash, and loss of taste and/or smell [7, 8]. In addition to symptomatic patients, initial studies showed that a considerable proportion of COVID-19 patients are asymptomatic (18-30%) or have mild symptoms [9-11], whereas hospitalization and mortality rates increase with the patient's age [12]. The severity of the infection is related to the age of the patient and the presence of additional chronic diseases, such as hypertension, diabetes, and cancer [13]. Detecting changes in virulence is challenging for a pathogen that spreads this fast, and there is as yet no solid report of a mutation that affects the biological features of the virus.
Coronaviruses enter the host cell by means of the spike glycoprotein (encoded by the S gene). The S1 subunit of the SARS-CoV spike protein contains the receptor-binding domain, which plays an essential role in the recognition of angiotensin-converting enzyme 2 (ACE2) [14]. The novel SARS-CoV-2 virus has a similar surface spike protein, sharing 76% sequence identity with that of SARS-CoV [15]. The binding affinity of the spike protein for ACE2 is important for virulence, and it has been shown that the spike protein of the novel SARS-CoV-2 binds to ACE2 with a much higher affinity than that of SARS-CoV [16]. It can be hypothesized that a particular mutation in the spike protein may lead to conformational changes that affect virulence. Due to the lack of polymerase proofreading activity, RNA viruses have a relatively high mutation rate and are thus capable of becoming resistant to drugs and escaping host immunity. As mutations accumulate, they may alter the virulence, transmission capacity, infectivity, and pathogenicity of the viruses [17]. Although SARS-CoV-2 has a lower mutation rate than expected [18], real-time tracking of virus isolates in populations may aid epidemiological understanding of the disease and early detection of important mutational or recombination events.

The first case of COVID-19 in Turkey was reported on 11 March 2020, well after the virus had spread to European countries. As of April 1, the Ministry of Health of Turkey announced that COVID-19 had reached all parts of Turkey, with the highest spread in Istanbul. The first full-length SARS-CoV-2 genome from Turkey was obtained from a patient in Kayseri province and released on 13 April 2020. Herein, we analyzed full-length SARS-CoV-2 genomes from three patients in Istanbul together with their clinical findings.
Sample Collection: Nasopharyngeal swabs were collected from unrelated patients and tested for the presence of SARS-CoV-2 as part of the standard care protocol for routine diagnosis at Umraniye Training and Research Hospital (UEAH), Istanbul. Three patients who tested positive for SARS-CoV-2 by qRT-PCR were included in this study. This study was approved by the ethics committee of the Umraniye Training and Research Hospital (B. 10

Viral Genome Sequencing and Data Analysis: Sequencing libraries were generated using the CleanPlex SARS-CoV-2 Library Preparation Kit (Paragon Genomics Inc.) following the manufacturer's instructions. As input material, 50 ng of extracted RNA was used for each sample. Briefly, the RNA was reverse transcribed, and the viral genome was amplified by multiplex PCR, followed by an indexing PCR to add adapters and sample-specific barcode sequences. DNA clean-up steps were performed with Agencourt AMPure XP beads (Beckman Coulter Inc.) where required to maximize the recovery of fragments. Before sequencing, the quantity and quality of the final libraries were assessed using a Qubit dsDNA HS Assay Kit with a Qubit 4.0 Fluorometer (Thermo Fisher Scientific Inc.) and an Agilent Bioanalyzer 2100 with High Sensitivity DNA Chips (Agilent Technologies Inc.), following the manufacturers' protocols. Sequencing was performed in the joint Genomic Laboratory (GLAB) of UEAH and Istanbul Technical University [19] using an Illumina NextSeq500 instrument with paired-end 150 bp chemistry. Raw demultiplexed sequencing data were further processed to call variants and generate consensus genome sequences. First, the quality of the raw sequencing data was checked using FastQC [20], and adapter sequences were trimmed from the reads using cutadapt [21]. Processed reads were aligned to the reference SARS-CoV-2 genome (NC_045512.2) using BWA-MEM [22]. Primer sequences were then trimmed from the aligned BAM files.
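The processing chain in these methods (quality check, adapter trimming, alignment to NC_045512.2, primer trimming, then variant and consensus calling with Samtools and iVar) can be sketched as an ordered list of commands. This is only an illustrative sketch: the paper does not report the options, thresholds, or file names used, so the file names, the adapter prefix (the common Illumina adapter start `AGATCGGAAGAGC`), the primer BED file, and every tool option below are assumptions. The function only builds the command strings and does not run anything.

```python
import shlex


def build_pipeline(sample, r1, r2, ref_fasta, primer_bed, adapter="AGATCGGAAGAGC"):
    """Return the shell commands for one sample, in order, without running them.

    Assumptions (not from the paper): all file names, the adapter prefix,
    the primer BED file, and the options passed to each tool.
    """
    q = shlex.quote
    t1 = f"{sample}_R1.trimmed.fastq.gz"
    t2 = f"{sample}_R2.trimmed.fastq.gz"
    aligned = f"{sample}.sorted.bam"
    trimmed_prefix = f"{sample}.primertrim"
    final_bam = f"{sample}.primertrim.sorted.bam"
    return [
        # 1. Quality check of the raw reads
        f"fastqc {q(r1)} {q(r2)}",
        # 2. Adapter trimming on both mates of each pair
        f"cutadapt -a {q(adapter)} -A {q(adapter)} -o {q(t1)} -p {q(t2)} {q(r1)} {q(r2)}",
        # 3. Index the reference, align with BWA-MEM, sort and index the BAM
        f"bwa index {q(ref_fasta)}",
        f"bwa mem {q(ref_fasta)} {q(t1)} {q(t2)} | samtools sort -o {q(aligned)} -",
        f"samtools index {q(aligned)}",
        # 4. Trim amplicon primer sequences from the aligned reads, then re-sort
        f"ivar trim -i {q(aligned)} -b {q(primer_bed)} -p {q(trimmed_prefix)}",
        f"samtools sort -o {q(final_bam)} {q(trimmed_prefix + '.bam')}",
        f"samtools index {q(final_bam)}",
        # 5. Variant calling and consensus generation from the pileup
        f"samtools mpileup -aa -A -d 0 -B -Q 0 -f {q(ref_fasta)} {q(final_bam)}"
        f" | ivar variants -p {q(sample + '.variants')} -r {q(ref_fasta)}",
        f"samtools mpileup -aa -A -d 0 -Q 0 {q(final_bam)}"
        f" | ivar consensus -p {q(sample + '.consensus')}",
    ]


# Hypothetical file names for one of the three samples
for cmd in build_pipeline("COV-8", "COV-8_R1.fastq.gz", "COV-8_R2.fastq.gz",
                          "NC_045512.2.fasta", "primers.bed"):
    print(cmd)
```

Keeping the steps as plain command strings makes the order of operations easy to review before anything is executed; the manual check of called variants that the authors describe would follow the last step.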
Variant calling and consensus sequence generation were performed using Samtools [23] and iVar [24]. Finally, all detected variants were checked manually for possible sequencing errors. Viral genome sequences were examined phylogenetically together with worldwide isolates using Nextstrain [25, 26].

Patient COV-8: COV-8 was a 51-year-old man who presented to the emergency department of our hospital with a 2-day history of cough. He did not report fever or dyspnea. He had no known contact with a COVID-19-positive patient. He was an ex-smoker with a 50 pack-year history and had a medical history of diabetes mellitus and hypertension. The physical examination in the emergency department revealed a body temperature of 36.8°C, blood pressure of 120/70 mm Hg, a pulse of 100 beats per minute, a respiratory rate of 20 breaths per minute, and an oxygen saturation of 97% while the patient was breathing ambient air. Lung auscultation was normal. The remaining physical examination findings were unremarkable. Nasopharyngeal and oropharyngeal swab specimens were collected and sent for real-time PCR (RT-PCR) assay for SARS-CoV-2. The patient met the possible case definition of COVID-19 stated in the COVID-19 Guide of the Ministry of Health of Turkey [27]. Therefore, thorax computed tomography (CT) was performed in line with the COVID-19 Guide [27] and was reported as showing typical findings of COVID-19 pneumonia with mild involvement. Hospitalization was decided on the basis of the thorax CT result and the patient's comorbidities. Combination treatment with azithromycin and hydroxychloroquine was started. On the first day of hospitalization, vital signs were in the normal range, and the general clinical condition was good. Viral RT-PCR analysis of the nasopharyngeal swab was reported as positive. On the second and third days, a subfebrile body temperature fluctuating between 37.7°C and 37.8°C was detected (Figure 1).
In addition, the frequency of cough increased, and rales were detected on lung auscultation. An increase in infiltrations was also detected on the chest radiograph, together with an increase in acute phase reactants (Figure 2). Close follow-up was continued, and ceftriaxone was added to the treatment. On the 6th day, the subfebrile fever persisted and the patient reported fatigue. Oxygen saturation was <90% while the patient was breathing room air. Serum acute phase reactants had increased, and lymphopenia appeared in the complete blood count. The thorax CT on the 6th day also showed progression compared to the previous CT scan. Because these findings indicated progression, favipiravir treatment was started. Twenty-four hours after the start of favipiravir, the patient's body temperature returned to the normal range. Between days 7 and 14 of hospitalization, the patient's clinical findings and laboratory values gradually improved. The physical examination revealed a body temperature of 36.5°C, blood pressure of 125/70 mm Hg, a pulse of 98 beats per minute, a respiratory rate of 20 breaths per minute, and an oxygen saturation of 98% while the patient was breathing ambient air. The remainder of the examination was normal. Viral RT-PCR analysis of the control nasopharyngeal swab on the 14th day was reported as negative. He was discharged with a follow-up visit scheduled two weeks later.

Patient COV-12: COV-12 was a 49-year-old man who presented to the COVID-19 clinic of our hospital with a history of contact with a COVID-19-positive patient. The patient did not have any complaints. He was a healthy nonsmoker. The physical examination revealed a body temperature of 36.0°C, blood pressure of 130/70 mm Hg, a pulse of 100 beats per minute, a respiratory rate of 18 breaths per minute, and an oxygen saturation of 98% while the patient was breathing ambient air. Lung auscultation and thorax CT were both normal.
Complete blood count and serum acute phase reactants were in the normal range. Combination treatment with azithromycin and hydroxychloroquine was started, and the patient was isolated at home. The remaining physical examination findings were unremarkable. The control viral RT-PCR test for SARS-CoV-2 was reported as negative.

Patient COV-13: COV-13 was a 29-year-old woman who presented to the urgent care clinic with a 5-day history of cough, sore throat, fever, and loss of taste and smell. She was a nonsmoker and reported no comorbid disease. The physical examination revealed a body temperature of 36.2°C, blood pressure of 140/75 mm Hg, a pulse of 96 beats per minute, a respiratory rate of 20 breaths per minute, and an oxygen saturation of 96%. The remaining physical examination findings were unremarkable. Thorax CT revealed widespread patchy ground-glass opacities in both lungs. Complete blood count and serum acute phase reactants were in the normal range. She was hospitalized and started on azithromycin and hydroxychloroquine. During the hospital stay, the patient did not have a fever and did not develop respiratory distress; oxygen saturation ranged between 96% and 97%. The treatment was completed in five days, and she was discharged with a follow-up visit scheduled one week later. After one week, the control examination was completely normal for both vital signs and clinical findings. The control viral RT-PCR analysis was also reported as negative.

Raw sequencing data consisted of 600,676, 390,806 and 283,036 paired-end reads for the samples isolated from patients COV-8, COV-12 and COV-13, respectively. Nearly all reads mapped to the 29,903 bp reference genome, with a mean mapping ratio of 99.02% (±0.85), resulting in an average depth of coverage of 2,616 ± 1,011×. The variants identified in the three SARS-CoV-2 isolates are given in Table 1. Since all three isolates carry the D614G variant in the spike glycoprotein, they belong to the G clade based on the GISAID classification.
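The clade assignment above reduces to a lookup on marker substitutions. The sketch below uses the three markers named in this paper's discussion (L84S in ORF8 for clade S, D614G in the S gene for clade G, G251V in ORF3a for clade V). It is a hedged illustration, not the GISAID procedure itself: the function names and the "other"/"ambiguous" labels are made up here, and the S gene start coordinate (21563) is taken from the NC_045512.2 annotation rather than from the paper.

```python
# Marker substitutions named in the paper's discussion: (gene, amino-acid change) -> clade
CLADE_MARKERS = {
    ("ORF8", "L84S"): "S",
    ("S", "D614G"): "G",
    ("ORF3a", "G251V"): "V",
}


def assign_clade(variants):
    """Assign a clade from an iterable of (gene, amino-acid change) pairs.

    Returns "S", "G" or "V" when exactly one marker is present, "other" when
    none is present, and "ambiguous" when markers of several clades co-occur.
    The last two labels are illustrative, not GISAID terminology.
    """
    hits = {CLADE_MARKERS[v] for v in variants if v in CLADE_MARKERS}
    if len(hits) == 1:
        return hits.pop()
    return "other" if not hits else "ambiguous"


def codon_span(cds_start, aa_position):
    """Return the 1-based genome coordinates (first, last) of a codon,
    given the 1-based start of the coding sequence."""
    first = cds_start + 3 * (aa_position - 1)
    return first, first + 2


S_GENE_START = 21563  # assumption: S gene start in the NC_045512.2 annotation

print(assign_clade([("S", "D614G")]))
print(codon_span(S_GENE_START, 614))
```

Under that coordinate assumption, codon 614 of the spike gene spans reference positions 23402 to 23404, so a nucleotide change at the middle position of that codon is the one a variant table would show for D614G.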
COV-8 and COV-12 each had ten, and COV-13 had nine, nucleotide changes compared to the reference genome (NC_045512.2). Notably, most of the single nucleotide variants were C-to-T conversions. Phylogenetic analysis of the three isolates in this study showed that COV-8 and COV-12 clustered closely with isolates from Belgium, the Netherlands, and Latvia, whereas COV-13 clustered with isolates from Sweden, Belgium and Wales (Figure 3).

SARS-CoV-2 is a novel coronavirus that had infected more than 3 million people, leading to approximately 250,000 deaths globally, as of May 2020 [1]. As of May 2020, more than 3,500 patients had died in Turkey due to COVID-19, and most of the reported patients were located in Istanbul. Herein, we report three virus genomes isolated in Istanbul for the first time, together with the patients' clinical findings. Patients with differing clinical presentations (one asymptomatic, one moderate, and one with severe pulmonary infiltration) were included in this study. Since the first cases and most of the current cases in Turkey are located in Istanbul, the characterization of virus samples may help to identify the origins of the initial entry into Istanbul. The Nextstrain website (www.nextstrain.org) provides real-time monitoring of viral isolates around the world, based mainly on publicly accessible GISAID data [25, 26]. As of 1 May 2020, more than ten thousand genomes had been uploaded to the GISAID database, and nearly five thousand different genomes were available for analysis in the Nextstrain online tool. Phylogenetic analysis in the Nextstrain online tool showed that the three isolates from Istanbul in this study clustered closely with samples isolated mostly in Belgium (Figure 3). This close relationship with Belgian isolates suggests that early viral entry, at least into Istanbul, may have originated from travelers from European countries. GISAID defines three large clades, namely S, G and V.
The clades are named after the variants L84S in ORF8 (S clade), D614G in the S gene (G clade), and G251V in ORF3a (V clade). The three isolates in this analysis carried the D614G variant in the S gene, indicating that they all belong to the G clade, which has mostly been detected in European countries. A larger number of patients should be analyzed to clarify the effects of viral genetic changes on clinical outcomes.

To conclude, we analyzed samples from three SARS-CoV-2-positive individuals in Istanbul, the epicenter of the pandemic in Turkey. All three viral isolates carried the D614G marker variant, indicating that the isolates belong to clade G, which encompasses mostly European countries according to the GISAID classification. All three virus samples were located in clusters that include isolates from Belgium. As virus surveillance studies are ongoing worldwide, country-wide efforts would not only support understanding of the local spread of the disease but also help evaluate the effectiveness of precautions, such as travel restrictions, on disease spread in the country.

Figure note: Retrieved and adapted from www.nextstrain.org

Viral genome sequences in this research were deposited in the Global Initiative on Sharing All Influenza Data (GISAID; www.gisaid.org), with accession numbers EPI_ISL_427391, EPI_ISL_428346, and EPI_ISL_428368.

1. World Health Organization
2. Coronaviridae Study Group of the International Committee on Taxonomy of Viruses. The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2
3. Bioinformatic analysis indicates that SARS-CoV-2 is unrelated to known artificial coronaviruses
4. A pneumonia outbreak associated with a new coronavirus of probable bat origin
5. Coronaviruses post-SARS: update on replication and pathogenesis
6. Middle East respiratory syndrome
7. Epidemiological, clinical and virological characteristics of 74 cases of coronavirus-infected disease 2019 (COVID-19) with gastrointestinal symptoms
8. Sudden and Complete Olfactory Loss Function as a Possible Symptom of COVID-19
9. The Novel Coronavirus Pneumonia Emergency Response Epidemiology Team. The epidemiological characteristics of an outbreak of 2019 novel coronavirus diseases (COVID-19) - China
10. Estimation of the asymptomatic ratio of novel coronavirus infections (COVID-19)
11. Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship
12. Hospitalization Rates and Characteristics of Patients Hospitalized with Laboratory-Confirmed Coronavirus Disease 2019 - COVID-NET, 14 States
13. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study
14. Activation of the SARS coronavirus spike protein via sequential proteolytic cleavage at two distinct sites
15. Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV
16. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation
17. Evolution of virulence in emerging epidemics
18. SARS-CoV-2 is well adapted for humans. What does this mean for re-emergence?
19. Integrating personalized genomics into Turkish healthcare system: A cancer-oriented pilot activity of Istanbul Northern Anatolian Public Hospitals with GLAB
20. FastQC: a quality control tool for high throughput sequence data
21. Cutadapt removes adapter sequences from high-throughput sequencing reads
22. Aligning sequence reads, clone sequences and assembly contigs with
23. Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools
24. An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar
25. Nextstrain: real-time tracking of pathogen evolution
26. TreeTime: Maximum-likelihood phylodynamic analysis
27. COVID-19 (SARS-CoV-2 enfeksiyonu) rehberi

We acknowledge the authors and the originating and submitting laboratories of the sequence data shared through GISAID's EpiCoV Database.