key: cord-0790946-uxeljiv3 authors: Ul-Rahman, Aziz; Shabbir, Muhammad Abu Bakr; Aziz, Muhammad Waqar; Yaqub, Saima; Mehmood, Asif; Raza, Muhammad Asif; Shabbir, Muhammad Zubair title: A comparative phylogenomic analysis of SARS-CoV-2 strains reported from non-human mammalian species and environmental samples date: 2020-10-26 journal: Mol Biol Rep DOI: 10.1007/s11033-020-05879-5 sha: de66db96f18d7a4372aebfb4395ccb4537cb4e5f doc_id: 790946 cord_uid: uxeljiv3 Coronaviruses (CoVs) infect a wide range of domestic and wild mammals. These viruses have a potential and tendency to cross-species barriers and infect humans. Novel human coronavirus 2019-nCoV (hCoV-19) emerged from Wuhan, China, and has caused a global pandemic. Genomic features of SARS-CoV-2 may attribute inter-species transmission and adaptation to a novel host, and therefore is imperative to explicate the evolutionary dynamics of the viral genome and its propensity for differential host selection. We conducted an in silico analysis of all the coding gene sequences of SARS-CoV-2 strains (n = 39) originating from a range of non-human mammalian species, including pangolin, bat, dog, cat, tiger, mink, mouse, and the environmental samples such as wastewater, air and surface samples from the door handle and seafood market. Compared to the reference SARS-CoV-2 strain (MN908947; Wuhan-Hu-1), phylogenetic and comparative residue analysis revealed the circulation of three variants, including hCoV-19 virus from humans and two hCoV-19-related precursors from bats and pangolins. A lack of obvious differences as well as a maximum genetic homology among dog-, cat-, tiger-, mink-, mouse-, bat- and pangolin-derived SARS-CoV-2 sequences suggested a likely evolution of these strains from a common ancestor. Several residue substitutions were observed in the receptor-binding domain (RBD) of the spike protein, concluding a promiscuous nature of the virus for host species where genomic alternations may be required for the adaptation to novel host/s. However, such speculation needs in vitro investigations to unleash the influence of substitutions towards species-jump and disease pathogenesis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s11033-020-05879-5) contains supplementary material, which is available to authorized users. Coronaviruses (CoVs) have been identified from a diverse range of birds and mammals, including humans [1, 2] . Most of the emerging CoVs, those have infected humans, are originated from animals or birds and have the potential to cause major outbreaks [2] . The CoVs have higher mutation rates and genetic variability in their genomes as a result of either recombination, genetic drift (point mutations), and lack of proofreading capacity by RNA-dependent polymerases (replicative fidelity and diversity) [3] [4] [5] [6] . Indeed, the potential of CoVs for genetic variability has long been considered to be fundamental for their ever-lasting evolution, the emergence of new variants within a certain species, and even to new species resulting in epidemics and/or zoonoses. This phenomenon may simply be exemplified by the emergence of two zoonotic CoVs; the Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) and the Middle East Respiratory Syndrome Coronavirus (MERS-CoV) [2] . Since, it is believed that both CoVs have been transmitted to humans through intermediate mammalian host from a natural reservoir host (bats) [7, 8] . Therefore, bats are now considered as a potential reservoir host of CoVs that could be associated with future CoV-related pandemics [2] . Novel mammalian coronaviruses are now regularly identified [9] [10] [11] . Recently, a novel human coronavirus (hCoV-19) emerged in late December 2019 from Wuhan, China, and spread to more than 190 countries. This virus possesses a positive-sense, single-stranded RNA genome of approximately 30 kilobases in length. The genome encodes ten structural [ORFlab, Spike (S), Envelope (E), Membrane (M), Nucleocapsid (N), ORF3, ORF6, ORF7, ORF8, ORF10] proteins [12] . Though S protein is a larger type-I membrane surface glycoprotein that plays an important role in tissue tropism and host adaptation [13, 14] , other structural proteins are also reported to contribute significantly to the adaptation of CoVs to a novel host. For instance, a sequence of 29-nucleotides in the open reading frame (ORF10) could be considered an example that may play a role in the trans-species jump [15] . Deletion of the said sequence has been observed in most of the human-originated SARS-CoV strains; however, it has been reported vice versa for animal-originating strains. Similarly, insertion of a 29-nucleotide sequence in ORF8 protein of the bat-originating SARS-CoV strain was suggestive of a species-jump from bat to civet [15] . Although much of the genomic characteristics of SARS-CoV-2 are described so far; nevertheless, analysis revealing potential genetic properties underlying inter-and intra-species jump has largely remained elusive. With this background, we explored the genome of SARS-CoV-2 strains originating from a range of non-human mammalian species and environmental samples for the possible emergence of new variants and adaptation to a novel environment/host. To date (2-July 2020), a total of 51 genome sequences corresponding to novel human coronavirus (hCoV-19; SARS-CoV-2) are reported from non-human hosts and the environment at Global Initiative Sharing of All Influenza Data website (https ://www.gisai d.org/). Only full-length nucleotide sequences were retrieved and screened for quality of sequences. The sequences with a long stretch of NNNs in coding genes were excluded from the phylogenetic, evolutionary, and comparative genomic analysis. Because of the greatest difference in spike protein, noises or recombination, or mixed reads of different viruses in metagenomic feces sample collected in 2019, two bat-derived hCoV-19 sequences (EPI ISL 412976 and EPI ISL 412977) were also excluded. Final dataset (n =39), analysed in this study, included SARS-CoV-2 strains originating from mink (n = 13), pangolin (n = 7), dog (n =2), bat (n =1), cat (n =1), tiger (n =1), mouse (n =1) and environment including air, wastewater and surface samples (n = 13). The human-originated SARS-CoV-2 sequence (NC_045512; MN908947; Wuhan-Hu-1) is considered as a reference sequence and retrieved from the GenBank public database (https ://www. ncbi.nlm.nih.gov/). All coding gene sequences (ORFlab, S, E, M, N, ORF3a, ORF6, ORF7, ORF8, and ORF10) were individually aligned using the ClustalW method in BioEdit® version 5.0.6 [16] . To assess the phylogenetic relationship among human-and non-human-derived SARS-CoV-2 strains, a complete S gene-based phylogenetic tree was constructed using Maximum Likelihood (ML) statistical method of Tamura-Nei model with ML heuristic tree interference of Nearest Neighbor-Interchange (NNI) in MEGA ® X [17] , where reliability in the topology of the tree was assured with 1000 number of bootstrap replicas and p-distance substitution model. To determine the genomic divergence, a pairwise sequence comparison analysis was performed with the maximum composite likelihood method (d: transitions + transversions model) using all coding genes in MEGA ® X [17] . The rate and pattern of substitutions among sites were modeled with a gamma distribution (parameter = 1 with homogenous lineage pattern). Moreover, a comparative residue analysis of the receptor-binding domain (RBD) (399-518 aa) at S protein was also performed to elucidate a presumptive role of substitutions in host adaptation. The phylogenetic analysis revealed a close relationship between SARS-CoV-2 strains originating from humans, dog, cat, tiger, mouse, mink, and environmental sources including air, surface, and wastewater. However, bat-and pangolinderived sequences clustered in separate clades. Noteworthy is the fact recently reported pangolin-derived sequences (2019) clustered closer to human-derived SARS-CoV-2 sequences than those reported in 2017 (Fig. 1) . Compared to the human-originated SARS-CoV-2 sequence, pairwise residue similarity analysis of pangolin-derived SARS-CoV-2 strains revealed maximum identity for spike protein (90.15%) and ORFlab protein (86.04%). However, for batderived hCoV-19, a maximum residue identity was found Table 1) . While comparing all structural proteins of reference hCoV-19 sequence, a maximum number of residue substitutions (n = 971) was observed in pangolin-derived hCoV-19 sequence followed by 313 substitutions in bat-derived hCoV-19 sequence and 7 substitutions each in dog-and wastewater-derived hCoV-19 sequences. A variable range of substitutions (n = 1-8) was observed in dog-, cat-, tiger-, mink-, mouse-and environment samples-derived hCoV-19 sequences (Table 1) . Interestingly, there was an absolute lack of substitution in structural proteins of SARS-CoV-2 strains originating from the environment sample of the seafood market, China. A 30-bp (10 aa) deletion was observed in each of the ORFlab and spike proteins of pangolin-derived sequence whereas, 12-bp (4 aa) deletion was observed in the spike protein of bat-derived sequence. Pangolin-derived hCoV-19 sequences had shared maximum similar substitutions (n = 189) with other under-studied hCoV-19 sequences followed by bat (R. affinis)-derived hCoV-19 sequence (n = 173), bat (R. malayanus)-derived hCoV-19 sequence (n = 115), wastewater-derived hCoV-19 sequence from USA (n = 5), tiger-derived hCoV-19 sequence (n = 4), air-and dog-derived hCoV-19 sequences (n = 3 in each), cat-and door handle-derived hCoV-19 sequences (n = 2 in each) and mink-, mouse and wastewater-derived hCoV sequences from Italy and China (n = 1 in each) ( Table 1) . While comparing critical binding residues ( 455 L, 486 F, 493 Q, 494 S, 501 N, and 505 Y) in the receptor-binding domain (RBD) of the spike protein of human-derived reference sequence, five substitutions ( 486 L, 493 Y, 494 R, 501 D, and 505 H) were observed in the bat-derived hCoV-19 sequence. On the other hand, four substitutions ( 486 L, 493 E, 494 R, and 501 T) were observed in pangolin-derived hCoV-19 sequences. Similarly, a range of substitutions was also observed for interacting residues (ACE2-RBD) among non-humanderived hCoV-19 sequences. Nine substitutions ( 403 K, 417 V, 439 V, 445 T, 483 Q, 484 V, 490 Y, 498 H and 204 N) were observed in pangolin-derived hCoV-19, eight substitutions ( 403 T, 439 K, 445 E, 449 F, 483 Q, 484 T, 490 Y and 498 Y) in bat-derived hCoV-19, and one each in mink-( 453 F) and mouse-( 498 H) derived hCoV-19 sequences. All binding and contact residues for ACE2-RBD interaction were found conserved in all other under-studied hCoV-19 sequences (Table 2 ). The analysis revealed that except for bat-and pangolinderived sequences, non-human mammalian-and environment-derived SARS-CoV-2 strains were more identical to human-originated strain. A close relationship between cat-, dog-, tiger-, mouse-, mink-and human-derived SARS-CoV-2 strains may reflect a plausible spill-over of the virus among a wide range of species while living in close vicinity. This is not surprising because there are shreds of evidence suggesting transmission of CoVs from one animal to another or humans living in companionship and/or vicinity [18] [19] [20] , and this is particularly true for pets and animals (e.g., Tiger) kept in captivity such as Zoo. It is now speculated that dogs and cats can be infected presumably after contracting the virus from infected pet-owners. Similarly, the tiger was found to have contracted the virus, likely from an infected but asymptomatic zookeeper. Either having a history of clinical or subclinical infection, the detection of SARS-CoV-2 genome in pets or animals living with and/or close vicinity to humans suggests human-to-animal transmission. Nevertheless, there does not exist or reported evidence for the potential animal-to-human transmission of the SARS-CoV-2 virus so far. Hence, this particular aspect needs further investigations that can explore the identification of biological or other relevant factors for the presence of virus in non-human mammalian hosts. Given the phylogenetic pattern, the study suggests batand pangolin-derived sequences as prototypes or ancestral variants of the SARS-CoV-2 virus. Such observations highlight the evolutionary dynamics of the SARS-CoV-2 virus for the potential emergence of multiple variants or escape mutants as has significantly been evidenced for other CoVs [3, 21, 22] . Pangolin-, bat-and human-derived SARS-CoV-2 strains were distinct or distantly related in the phylogenetic analysis. However, as for the genetic divergence is concerned, bat-and human-derived SARS-CoV-2 strains were closer than pangolin-derived SARS-CoV-2 strains, indicating the emergence of variants from a common ancestor. Based upon genomic heterogeneity and clustering pattern, the study findings are in agreement with previous observations, suggesting a non-mosaic nature of the SARS-CoV-2 and its close relationship with bat-derived strain [12] . The analysis suggests that SARS-CoV-2 is not an exact variant which is causing a global pandemic in humans; however, the hypothesis of the emergence of the new variant from bats-derived CoVs is very likely. However, a controversy exists for a conceivable relationship among bats-and pangolin-derived hCoV-19 to human-originating SARS-CoV-2 strains. Instead of direct transmission from bats to humans, a study suggests the role of another animal as an intermediate host of the SARS-CoV-2 virus [23] . Taking into consideration the recent history of emerged CoVs (e.g., SARS-CoV and MERS-CoV) where bat acted as a natural reservoir, while other animals (e.g., civet and camels) served as intermediate hosts before their transmission to terminal host (human) [7, 8] , it seems reasonable to speculate that there may involve an intermediate host other than bats for potential transmission of SARS-CoV-2 to human. This is Table 1 Comparative residue analysis of structural proteins of SARS-CoV-2 sequences originating from non-human mammalian species and environmental samples A38V, I39L, N93E, V95I, T93I, T114I, Y143F, Y157F, G166S, Y190F, L222I, D235E, D246E, T267N, T280I, R290K, N324D, R338K, D352E, P376S, I378V, T396I, S405A, A443S, S445G, D457E, T498A, I544V, A546S, H557N, Q564K, V591A, D593N, M608L, I627V, E629D, D633E, K643R, V703L, I710V, A711T, P723S, V740I, I757V, E762D, C771S, I774V, D775E, S840N, V849I, N859A, G901S, N915K, Y931E, D934G, E935D, Q940E, Y941F, S959P, S964A, S967A, S969L, I971P, P975Q, E981D, G983D, N984S, E985Q, E986Q, I987T, A988V, M989G, E991Q, E992D, T1007I, S1010P, I1014M, D1015E, S1016L, I1023T, N1024I, N1029S, T1031S, N1038D, F1041Y, Q1053K, N1091D, N1100K, N1115H, I1124V, S1129D, T1135S, Y1143H, D1144E, V1161I, Q1162H, T1178A, D1183N, E1186D, A1199K, Q1204K, V1208I, M1211E, E1214K, P1216F, T1217I, N1220S, K1227R, Q1228K, A1230D, E1231D, V1235I, M1272T, K1275S, T1277I, T1279I, I1294V, K1295Q, T1317A, K1321R, L1323V, V1325T, N1337L, L1342V, V1363I, E1368Q, M1396V, N1443D, T1537S, I1542S, I1545T, A1546T, T1549L, E1550D, Q1552E, I1553V, D1558N, V1580I, A1632V, A1643P, A1685T, D1709E, N1736S, L1782F, V1788I, V1791T, Q1798K, D1816Q, V1825T, A1908S, A1948T, V1993I, T2000A, E2032K, T2036A, H2049P, I2064V, V2078I, D2085N, G2086S, V2087L, E2104D, S2105N, I2125L, I2133V, V2142A, N2154S, V2155T, K2187R, V2214L, V2232I, V2233I, T2281I, V2312I, T2408N, V2493I, S2498T, A2559S, T2587A, A2614T, A2634S, I2718V, L2785F, T2789I, A2806S, S2824A, V2857I, A2879T, S2947A, I2999V, H3013Y, I3059V, V3146A, V3148I, T3446N, V3590I, V3609L, M3757L, I4427T, T4435I, R4453K, K4454R, M4455T, A4458T, T4462I, I4464T, S4465L, I4475T, R4478K, T4482I, T4483I, F4496S, L4520Q L4522S, C4526STOP, R4531K, T4533I, L4542S, L4543H, T4545I, S4550I, L4552S, T4553I, K4555R, T4565I A38V, N93E, Y110H, T114I, V117A, D172E, T280I, P376S, P395I, Y417H, I424V, T498A, A561V, V591A, Q623K, P723S, K724R, T859A, I891T, G901S, D940E, P944S, S959P, V967A, T968A, P969L, L975Q, V989G, E991Q, D993G, V996D, I1001T, T1002I, S1004T, A1006V, P1016L, I1058V, H1091D, G1104S, N1115H, R1126K, D1144E, V1161I, T1205I, S1215P, S1216F, L1222P, Q1228K, V1230D, N1275S, T1317A, R1346K, M1396V, I1545T, I1561T, K1566R, A1632V, N1736S, L1782F, Q1798K, I1825T, I1869V, I1877T, I1884T, L1940F, F1958L, N2018S, T2036A, I2064V, D2085N, G2086S, V2126A, F2227S, T2228K, T2281I, I2283T, G2292S, T2408N, S2533N, T2587A, V3146A, V3148I, T3362P, T3446N, V3609L, A4422V, C4426Y, K4454R, R4456K, T4494M, T4530M, A4532V, I4535T, E4538K, T4548M, K4555R, T4616I, L4621R, L4641P, L4643P, H4645L, P4659L, L4670S, T4709M, L4723H, G4744E, S4791F, P4844L, S4864Y, S4909F, T4911M, T4916M, S4984L, F4989S, I4994T, T5001I, I5008T, T5014I, A5040V, S5088F, L5095S, T5114M, M5155T, N5178S, T5182I, T5186M, S5224F, R5227Q, H5230Q, L5235P, T5277I, I5293T, A5353V, T5360M, L5363Q, R5409H, R5421K, T5434M, I5440T, S5460L, P5479L, C5490Y, S5537F, T5550I, F5585S, K5608R, T5633I, T5682M, L5694Q, L5706P, L5716P, Q5733L, S5755L, L5765S, L5807S, L5808R, F5826S, I5830T, S5838L, V5865A, R5878K, L5893P, R5918K, F5935S, R5937K, T5947I, Q5948L, V5972A, S5988L, T5992I, I5999T, V6004A, R6005K, P6007L, L6017S, L6031P, A6038V, A6051V, H6069R, S6073N, I6076T, L6077S, I6082T, I6100T, C6110Y, N6119S, L6134P, N6140D, M6163T, A6187C, R6190K, F6191S, D6206G, T6224I, F6254S, M6258T, T6280M, T6392I, A6401V, L6423S, T6426I, T6445I, I6449T, K6467R, P6480L, Y6497C, S6512F, I6517T, F6540S, I6569R, S6578L, S6582L, C6588Y, L6612P, P6615L,T6634I, F6641S, P6650L, K6651R, S6658L, V6684A, S6691F, G6698V, I6732T, T6733I, R6763P, R6764K, W6772L, P6776L, D6791A, L6793STOP, S6815F, S6853N, I6856T, L6858STOP, L6859H, G6867E, L6881Q, H6894R, Y6895C, I6934T, I6945T, P6946L, D6949V, D6956G, P6963L, L6967P, S6974N, P6979L, L6984S, I6987T, L6992Q, L6993P, H7000R, L7001H, G7009D, T7012I, R7016H, V7017A, R7019K, T7036I, S7051F, Q7059L, I7087T/3.71% Deletion at 682-686, S1127N, I1230V S32F, S50L, I76T, P218Q, D324E, T346R, T372A, T403R, K439N, H440N, I441L, A443S, E445V, F449Y, A459S, K478T, C483V, T484E, L486F, Y490F, Y493Q, R494S, Y498Q, D502N, H505Y , N519H, A604T, S1125N, I1228V/2.43% Bat-derived hCoV-19 (China) A38V, D93E, T280I, N335D, I350T, P376S, R501Q, S557N, T591A, E629D, G902S, E935D, R956Q, S959P, P969L, V1051A, N1115H, V1161I, S1222P, R1346K, I1396V, N1408K, E1484D, T1508I, F1513S, Y1548H, K1552E, K1569R, A1632V, N1736S, T1766K, L1782F, Q1798K, I1819T, A1908S, G2029D, R2032K, T2036A, I2064V, D2085N, I2096T, I2140T, F2182L, K2187R, F2191S, I2303T, T2408N, F2503S, V2532A, I2612L, V3055I, V3146A, S3374P, C3389S, T3446N, V3609L, V3608A, I3656V, R3681K, S3836N, K4454R, I4660T, N4662S, I4669T, I4699T, L4723H, S4796L, P4801L, T4815M, S4864Y, F4955S, S4964D, S4966F, I4994T, T5001I, L5004STOP, I5006T, T5014I, S5031L, A5040V, L5095S, M5155T, W5158L, L5161S, T5165M, K5218R, S5224F, I5226T, R5229H, H5230Q, L5235P, L5242S, L5254S, I5293T, P5318L, P5322R, V5335A, A5353V, T5360M, H5374R, I5391T, S5408F, R5409H, T5429M, S5433L, L5437Q, I5440T, S5460L, P5479L, T5491I, S5537F, I5544T, T5550I, K5608R, P5637L, L5639P, S5660L, Q5693R, L5694Q, S5703F, L5706P, S5755L, L5765S, V5798A, L5807S, T5808M, A5829V, S5838L, S5839F, V5865A, V5901A, I5907T, F5935S, V5945G, P5984L, S5988L, T5992I, I5999T, R6005K, P6007L, L6017S, P6024L, A6051V, S6275N, L6423S, I6449T, P6480L, S6578L, G6624E, I6732T, L7001H, L7052P, D7066V, I7087T Added to this, there was no selling of bat at the Huanan seafood market, whereas other non-aquatic animals such as pangolins were available. Therefore, this particular aspect needs further epidemiological investigations. Such supposition is also linked to the detection of SARSlike coronavirus in pangolins [24] . On the other hand, the genetic makeup and similarity between spike protein of pangolin-and human-derived SARS-CoV-2 strains is higher (90.15%) than what (99%) is observed between humanand civets-derived SARS-CoV strains, which is why civets were considered as intermediate host but such scenario is not applicable in case of pangolin as intermediate host of SARS-CoV-2 virus. Regarding this aspect, the current study pointed out that pet animals including dogs and cats may act as an intermediate or carrier host of SARS-CoV-2 in the future because of a high level of genomic similarity between dog-, cat-and human-derived hCoVs sequences. This speculation is also supported by the reports about the pet animals as intermediate/carrier hosts of other human emerging viruses [25, 26] . However, it should be cautioned that such speculation is biased because of a limited number of dog-and cat-originating SARS-CoV-2 strains. Recent studies also claimed very low or permissive susceptibility of dogs and cats to SARS-CoV-2 infection [27] and found negative for the existence of SARS-CoV-2-specific antibodies [28] . There are still controversies about the source of the virus and its potential intermediate host which still needs more evidence to be confirmed. Therefore, it is quite earlier to conclude or proclaim any pet animal as an intermediate or carrier host of the SARS-CoV-2 virus and a slew of deep genomic analysis along with disease surveillance, sero-surveillance, molecular epidemiology, pathobiology, and transmissibility investigations has yet to find a conclusive proof. An occurrence of residue substitutions in coding regions of SARS-CoV-2 sequences reported from different hosts highlight the potential of the virus to mutate rapidly for the emergence of host-specific variants. Considering SARS-CoV-2 as a typical RNA virus, there is likely a chance of high mutation rate and subsequent speciation and evolution of novel variants. It is interesting to note that, among all the CoVs, SARS-CoV-2 possesses the longest genome that makes it more prone to errors during RNA transcription, and therefore the potential occurrence of mutations for the adaptation to novel hosts may very frequent. A significant number of residue substitutions were observed in RBD at spike protein of bat-and pangolin-derived SARS-CoV-2 strains. A vital role of mutations in the RBD region has been indicated for potential inter-and intra-species transmission of SARS-CoV [29] . Amino acid substitutions within the RBD can significantly affect the binding affinity of S protein to its receptor. Any substitution at critical binding sites significantly lowers the electrostatic stabilizing interactions and binding affinity between ACE2-RBD and subsequently alter tissue tropism [30] . Indeed, a minimum of 1-2 substitutions in the RBD is sufficient for a virus to alter its host receptor specificity and subsequent inter-species transmission [31] . Beside it was reported that the SARS-CoV-2 virus has a unique peptide (PRRA) insertion at 680 amino acid position, which may be involved in the proteolytic cleavage of the S protein by cellular proteases, and impact host range and transmissibility [32] . In the current comparative study, it is not found in the S proteins of the bat-and pangolin-derived SARS-CoV-2 strains. Comparative investigations to determine an in vitro impact of the mutations are prerequisites to explore species jump of the SARS-CoV-2 virus. In conclusion, taken into consideration the intrinsic capacity of CoVs to adapt novel host species [33] , the current study suggests the potential evolution of all SARS-CoV-2 strains from a common ancestor. Comparative analysis of human-and non-human-derived SARS-CoV-2 strains highlights the positive selection and continuous evolution in different hosts. Hence, the emergence of different variants with varying pathobiological characteristics is very likely for different host species living across the globe. In this regard, increased genetic diversity among SARS-CoV-2 infected patients suggests an adaptation as well as the evolution of the virus to the environment and the human population worldwide [34] . Therefore, an in-depth analysis of residue substitutions and genetic evolution of SARS-CoV-2 from a wide geographical region is ascertained in the future. Summarizing together, such comparative analysis enhances our understanding of the molecular epidemiology of the SARS-CoV-2 virus in a wide range of host species and future intervention strategies. Prevalence and genetic diversity of coronaviruses in bats from China SARS and MERS: recent insights into emerging coronaviruses Epidemiology, genetic recombination, and pathogenesis of coronaviruses Coronaviruses: an RNA proofreading machine regulates replication fidelity and diversity Genetic drift of human coronavirus OC43 spike gene during adaptive evolution Genotype shift in human coronavirus OC43 and emergence of a novel genotype by natural recombination Possible role of an animal vector in the SARS outbreak at Amoy Gardens Dromedary camels and the transmission of Middle East respiratory syndrome coronavirus (MERS-CoV) Discovery of seven novel Mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavirus Identification of a novel coronavirus in bats From SARS coronavirus to novel animal and human coronaviruses Genomic characterization and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding Coronavirus spike proteins in viral entry and pathogenesis Cooperative involvement of the S1 and S2 subunits of the murine coronavirus spike protein in receptor binding and extended host range Bat-to-human: spike features determining 'host jump'of coronaviruses SARS-CoV, MERS-CoV, and beyond BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT MEGA X: molecular evolutionary genetic analysis across computing platforms Pet dogs-A transmission route for human noroviruses? Influenza virus reservoirs and intermediate hosts: dogs, horses, and new possibilities for influenza virus exposure of humans Cross host transmission in the emergence of MERS coronavirus Reliable typing of MERS-CoV variants with a small genome fragment Deletion variants of Middle East respiratory syndrome coronavirus from humans Probable pangolin origin of SARS-CoV-2 associated with the COVID-19 outbreak Viral metagenomics revealed sendai virus and coronavirus infection of malayan pangolins (Manis javanica) Animals as reservoir for human norovirus Investigation of the role of healthy dogs as potential carriers of rabies virus Susceptibility of ferrets, cats, dogs, and other domesticated animals to SARS-coronavirus 2 Serological survey of SARS-CoV-2 for experimental, domestic, companion and wild animals excludes intermediate hosts of 35 different species of animals SARS-CoV and emergent coronaviruses: viral determinants of interspecies transmission Receptor recognition and cross-species infections of SARS coronavirus Mechanisms of zoonotic severe acute respiratory syndrome coronavirus host range expansion in human airway epithelium Evolutionary history, potential intermediate animal host, and cross-species analyses of SARS-CoV-2 Transmission of morbilliviruses within and among marine mammal species Genomic diversity of SARS-CoV-2 in coronavirus disease 2019 patients The authors thank the anonymous reviewer for valuable and critical comments. Conflict of interest The authors declare that they have no conflict of interest.