key: cord-0859626-u7l0fd98 authors: Su, Mingjun; Li, Chunqiu; Qi, Shanshan; Yang, Dan; Jiang, Ning; Yin, Baishuang; Guo, Donghua; Kong, Fanzhi; Yuan, Dongwei; Feng, Li; Sun, Dongbo title: A molecular epidemiological investigation of PEDV in China: Characterization of co‐infection and genetic diversity of S1‐based genes date: 2019-12-13 journal: Transbound Emerg Dis DOI: 10.1111/tbed.13439 sha: 21ba542fefca5fb7962b51f16a9f01d23470cfc2 doc_id: 859626 cord_uid: u7l0fd98 Porcine epidemic diarrhoea virus (PEDV) is an emerging and re‐emerging epizootic virus of swine that causes substantial economic losses to the pig industry in China and other countries. The variations in the virus, and its co‐infections with other enteric viruses, have contributed to the poor control of PEDV infection. In the current study, a broad epidemiological investigation of PEDV was carried out in 22 provinces or municipalities of China during 2015–2018. The enteric viruses causing co‐infection with PEDV and the genetic diversity of the PEDV S1 gene were also analysed. The results indicated that, of the 543 diarrhoea samples, 66.85% (363/543) were positive for PEDV, and co‐infection rates of PEDV with 13 enteric viruses ranged from 3.58% (13/363) to 81.55% (296/363). Among these enteric viruses, the signs of diarrhoea induced by PEDV were potentially associated with co‐infections with porcine enterovirus 9/10 (PEV) and torque teno sus virus 2 (TTSuV‐2) (p < .05). The 147 PEDV strains identified in our study belong to Chinese pandemic strains and exhibited genetic diversity. The virulence‐determining S1 proteins of PEDV pandemic strains were undergoing amino acid mutations, in which S58_S58insQGVN–N135dup–D158_I159del‐like mutations were common patterns (97.28%, 143/147). When compared with 2011–2014 PEDV strains, the amino acid mutations of PEDV pandemic strains were mainly located in the N‐terminal domain of S1 (S1‐NTD), and 21 novel mutations occurred in 2017 and 2018. Furthermore, protein homology modelling showed that the mutations in pattern of insertion and deletion mutations of the S1 protein of PEDV pandemic strains may have caused structural changes on the surface of the S1 protein. These data provide a better understanding of the co‐infection and genetic evolution of PEDV in China. Porcine epidemic diarrhoea virus (PEDV) is an enveloped, single-stranded, positive-sense RNA virus, belonging to the genus Alphacoronavirus. PEDV was identified as the causative agent of porcine epidemic diarrhoea (PED) in 1978 (Pensaert & de Bouck, 1978) . This disease is characterized by acute watery diarrhoea, vomiting and dehydration, with high mortality that often reaches 100% in neonatal piglets. From 1984 to early 2010, PEDV circulated in the pig population in China, but there were no large-scale outbreaks (Tamura, Stecher, Peterson, Filipski, & Kumar, 2013) . At the end of 2010, a PEDV outbreak occurred in several pig-producing provinces in southern China (Li et al., 2012) . Since then, the disease has spread throughout other provinces of China and has led to enormous economic losses within the pork industry. Currently, PEDV infection is widespread in swine-farming countries in Asia, Europe and North America (Lin, Saif, Marthaler, & Wang, 2016; Sun, Wang, Wei, Chen, & Feng, 2016; Wang et al., 2016) . The emergence and re-emergence of PEDV cause severe economic losses and pose significant public health concerns throughout the world. In addition to PEDV, a large variety of porcine enteric pathogens has been found in diarrhoea samples from pigs, including porcine teschovirus (PTV), porcine transmissible gastroenteritis virus (TGEV), porcine sapelovirus (PSV), porcine enterovirus 9/10 (PEV), mammalian reovirus (MRV), porcine group A rotavirus (GARV), porcine astrovirus (PAstV), porcine torovirus (PToV), torque teno sus virus 2 (TTSuV-2), porcine bocavirus (PBoV), porcine kobuvirus (PKV) and porcine deltacoronavirus (PDCoV); however, data have indicated that PEDV is the major cause of viral diarrhoeal disease in swine in China (Decaro et al., 2005; Song et al., 2015; Wang, Zhou, et al., 2014; Zell et al., 2000; Zhang, Tang, Yue, Ren, & Song, 2014; Zhang et al., 2013) . Viral diarrhoeal diseases caused by a variety of porcine enteric pathogens, and co-infection with multiple enteric pathogens, are very prevalent in piglets with diarrhoea (Chen et al., 2018; Zhang et al., 2014 Zhang et al., , 2013 . In the case of PEDV, the variations in the virus and its co-infections with multiple pathogens make it difficult to control the infection. However, limited data are available on co-infections during PEDV infection in piglets with diarrhoea. Therefore, it is necessary to investigate the co-infection of PEDV with other pathogens. The S protein of coronaviruses is responsible for induction of neutralizing antibodies, specific receptor binding and cell membrane fusion (Li, 2015; Sun et al., 2008) . Deletion and/or insertion mutations of the S protein are potentially associated with the pathogenicity and tissue tropism of coronaviruses (Lin et al., 2016) . In December 2013, an S-INDEL strain, OH851, with reduced virulence, was reported in the USA (Wang, Byrum, & Zhang, 2014) . Subsequently, S-INDEL strains similar to OH851 have been frequently reported in other countries (Yamamoto, Soma, Nakanishi, Yamaguchi, & Niinuma, 2015) . In a recent study, Wang et al. (2016) reported that three PEDV strains, non-S-INDEL, 'CV777' S-INDEL and 'US' S-INDEL co-circulated in the swine population in China . The S1 region of the S protein is under greater immune pressure than the S2 region (Aydin, Al-Khooly, & Lee, 2014; Jarvis et al., 2016) . Widespread epidemiological investigations have demonstrated that the virulence-determining S1 gene of PEDV is undergoing rapid mutation. Additionally, sialic acid binding activity is located in the N-terminal domain (NTD) of PEDV S1 protein (Li, Kuppeveld, He, Rottier, & Bosch, 2016) ; the S1-NTD of Chinese PEDV pandemic/variant strains exhibits strong sialic acid binding activity when compared with the classic PEDV strain (Deng et al., 2016) . Thus, patterns of insertion and deletion mutations of the S1 protein (S1-IDMPs) are an important target for better understanding of the emergence and re-emergence of PEDV in Asia, Europe and North America. Although genotyping of PEDV pandemic strains has been widely undertaken in the pig population in China (Chen et al., 2019; Wang et al., 2016; Wen et al., 2018) , there is fairly limited information available on the S1-IDMPs of PEDV pandemic strains. In our study, to investigate the co-infection of PEDV with other pathogens and the evolution of S1-IDMPs of PEDV circulating in China, 543 intestinal samples from diarrhoeic piglets were collected from 2015 to 2018. All the samples were used to investigate the co-infection of PEDV with other enteric pathogens, and the S1 genes obtained were subjected to analysis of S1-IDMPs, phylogenetic analysis and homology modelling of the protein. Our aim was to provide a better understanding of the epidemiology of PEDV in China. The study was approved by the Animal Experiments Committee of the Heilongjiang Bayi Agricultural University (registration protocol 201401002). The field study did not involve endangered or protected species. No specific permissions were required for the collection of samples because the samples were collected from public areas or nonprotected areas. All sampling and publication of the data were approved by the farm owners. In this study, 543 samples of intestinal tissues from piglets (aged The RNA extraction and cDNA synthesis were performed as described previously by Wang et al. (2016) . The DNA extraction was carried out according to the protocol described by Qi et al. (2019) . PEDV and 12 other enteric viruses, PTV, TGEV, PSV, PEV, MRV, GARV, PAstV, PToV, TTSuV-2, PBoV, PKV and PDCoV, were detected by nested PCR as described in previous studies (Chu, Poon, Guan, & Peiris, 2008; Decaro et al., 2005; Elschner, Prudlo, Hotzel, Otto, & Sachse, 2002; Song et al., 2015; Van et al., 2016; Wang et al., 2016 Wang et al., , 2009 Wang, Zhou, et al., 2014; Zell et al., 2000; Zheng et al., 2016) or as designed in this study (Table S1 ). The map of China was drawn with an online map-generation website (http://pixel map.amcha rts.com/). Amplification of the S1 gene of PEDV was carried out according to the protocol described by Wang et al. (2016) . The amplified S1 genes were cloned into the vector pMD18-T according to the manufacturer's instructions (TaKaRa Biotechnology Co., Ltd). Three positive clones from each sample were subjected to Sanger sequencing. All nucleotide sequences generated in our study were submitted to GenBank. Sequence analysis was carried out using the EditSeq tool in Lasergene DNASTAR™ 5.06 software (DNASTAR Inc). Multiple sequence alignments were carried out using the multiple sequence alignment tool of DNAMAN 6.0 software (Lynnon BioSoft). The sequence variation was described according to the nomenclature system (den Dunnen & Antonarakis, 2001). A divergence analysis of the S1 protein of PEDV was performed with WebLogo (http://weblo go.three pluso ne.com/), online software for sequence logo generator (Crooks, Hon, Chandonia, & Brenner, 2004) . The comparison of point mutations in the S1 protein of PEDV was made with GraphPad Prism ® 8.0 (GraphPad Software, Inc). For the phylogenetic analysis, the S1 genes of PEDV strains were retrieved from GenBank (Table S2 ). These nucleotide sequences were used to generate a neighbour-joining phylogenetic tree of the S1 gene using the ClustalX alignment tool in MEGA6.06 software (Tamura et al., 2013) . A neighbour-joining phylogenetic tree was built using the p-distance model and 1,000 bootstrap replicates. The phylogenetic tree was annotated with the Interactive Tree Of Life (iTOL) software (http://itol.embl.de/), an online tool for the display and annotation of phylogenetic trees (Letunic & Bork, 2016 ). The dominant S1 amino acid sequences from each year during 2015-2018 were aligned (named 2015_PEDV-strain, 2016_PEDV-strain, 2017_PEDV-strain, 2018_PEDV-strain, respectively) (Table S3) and then selected for investigation of the effect of S1-IDMPs on the conformation of the S1 protein of PEDV. The predicted tertiary structures of the S1 region were modelled using the open-source modelling server SWISS-MODEL (https ://swiss model.expasy.org/) from the Swiss Institute of Bioinformatics (Biasini et al., 2014) . The S1 monomer tertiary structures were prepared by using the spike protein of Human coronavirus NL63 (PDB ID: 5SZS) as a template. Illustrations and comparisons of these modelled tertiary structures were obtained using the python-based molecular viewer PyMOL (The PyMOL Molecular Graphics System, Version 1.7.4 Schrödinger, LLC). In addition, the asparagine (N)-linked glycosylation sites were predicted using the NetNGlyc 1.0 Server (http:// www.cbs.dtu.dk/servi ces/NetNG lyc/) (Gupta, Jung, & Brunak, 2004 ). The correlation of PEDV infection with other pathogens (PTV, TGEV, PSV, PEV, MRV, GARV, PAstV, PToV, TTSuV-2, PBoV, PKV, PDCoV and PCV-3) was carried out in 2 × 2 contingency tables using the chi- The significance level in all analyses was 5%, with a confidence interval of 95%. The difference in values was considered statistically significant or highly significant if the associated p value was <.05 or <.01, respectively. The results for PCV-3 detection in these 543 diarrhoea samples have been published in our previous study . (Figure 2b ). In addition, a statistical analysis showed that PEDV-induced diarrhoea symptoms were potentially associated with co-infection of PEV and TTSuV-2 (p < .05) ( Table 1) . In our study, a total of 147 S1 genes of PEDV were successfully sequenced from 2016-2018 PEDV-positive samples (Table S5) . The 147 S1 genes identified were 2,376 bp in length, except for three strains: JL/2016/47a (2,388 bp), JL/2016/47b (2,388 bp) and LN/ DT/2016/510a (2,352 bp). Nucleotide identity of 95.5%-99.9% and amino acid (aa) identity of 94.2%-99.7% were revealed among the 147 S1 genes identified. The 147 PEDV strains exhibited 90.7%-91.8% nucleotide sequence homology and 89.3%-90.7% amino acid homology when compared with the PEDV CV777 strain (GenBank accession no. AF353511). The S1-IDMPs of the 147 PEDV strains identified in our study were analysed using the S1 protein of the PEDV CV777 strain as the comparator. The results indicated that 143 S1 proteins exhibited S1-IDMP S58_S58insQGVN-N135dup-D158_I159del, two S1 proteins from the same farm (JL/2016/47a and JL/2016/47b) showed S1-IDMP S58_S58insQGVN-N135dup-D158_I159del-T380_ V380insGQRS and one S1 protein (LN/DT/2016/510a) exhibited S1-IDMP S58_S58insQGVN-N135dup-D158_I159del-N553_Y560del. From the same farm, the LN/DT/2016/510b strain exhibited a different S1-IDMP S58_S58insQGVN-N135dup-D158_I159del when compared with the LN/DT/2016/510a strain. For PEDV, the sialic acid binding activity is located in the NTD (aa 9-433) of the S1 protein; the cellular receptor binding is located in the receptor-binding domain (RBD, aa 501-629) of the S1 protein . In addition, the S1-IDMPs of 147 Chinese PEDV pandemic strains occurred in the NTD of the S1 protein; only the S1-IDMP of the LN/DT/2016/510a strain, S58_S58insQGVN-N135dup-D158_I159del-N553_Y560del, was located in the NTD and RBD of the S1 protein ( Figure 3a) . These data indicate that the S1-IDMP S58_S58insQGVN-N135dup-D158_I159del, accounting for 97.28% (143/147), was the pandemic insertion and deletion mutation pattern in the 147 PEDV strains identified. To explore further the evolution of S1 proteins of PEDV strains, the deduced amino acids of the 54 S1 genes from NCBI (2011-2014 = 24, 2015 = 29) , and the 147 S1 genes identified in this study (2016 = 72, 2017 = 46, 2018 = 29) For the phylogenetic analysis, a phylogenetic tree was constructed based on the S1 genes of the 147 PEDV strains identified in this study and 131 reference PEDV strains from GenBank. The phylogenic tree was divided into groups GI and GII, and the GII group was composed of two subgroups (GIIa and GIIb). The 147 PEDV strains identified in our study had a close relationship with Chinese PEDV pandemic ref- Analysis of the S1 protein modelling indicated that the S1-NTD of 2018_PEDV-strain showed four major changes, in residues 55-64, 113-120, 130-142 and 157-165, when compared with the classical PEDV CV777 strain ( Figure 5 ). When compared with 2015_PEDVstrain, 2016_PEDV-strain has an obvious structural change in residues 558-575, located in the S1-RBD region (Figure 6a ). Compared with 2016_PEDV-strain, 2017_PEDV-strain exhibited an obvious structural change in residues 130-143, located in the S1-NTD region (Figure 6b ). The PEDV-induced diarrhea symptoms associated-viruses (PEV, TTsuV-2) are shown in Bold. Abbreviations: CI, confidence interval; OR, odd ratio. Sequence analysis of S1 proteins of the PEDV strains identified in our study. (a) Divergence analysis of S1 proteins of PEDV strains identified in our study during 2015-2018. (b) The comparison of amino acid mutations of S1 proteins of PEDV strains identified in our study during 2015-2018 with PEDV CV777 strain. (c) The comparison of amino acid mutation positions of S1 proteins of PEDV strains identified in our study during 2015-2018 with PEDV CV777 strain The S1-based phylogenetic analysis of the PEDV strains identified in our study. Black circle diagram represents the 147 PEDV strains identified in this study F I G U R E 5 Comparative analysis of the predicted S1 protein modelling between PEDV CV777 strain and 2018_PEDV-strain. The S1 protein modelling of PEDV CV777 strain was shown as surface and cartoon by blue. The S1 protein modelling of 2018_PEDV-strian was shown as surface and cartoon by red. The mutant amino acid residues of S1 protein of 2018_PEDV-strain were shown as surface by red. The S1-NTD of PEDV CV777 strain was shown as surface by green In addition, the predicted S1 structure of HLJ/2015/DP1-1, which was identified in our previous study , has changed when compared with S1 modelling of 2015_PEDV-strain. JL/2016/47a and LN/DT/2016/510a exhibited changes to varying degrees when compared with S1 modelling of 2016_PEDV-strain, among which the deletion mutation N 553 VTNSYGY of the LN/DT/2016/510a strain was located in the RBD of the S1 protein of 2016_PEDV-strain (Figure 7 ). In addition, the predicted N-linked glycosylation sites of the S1 glycopro- (Wang et al., 2009) , and PEV infections remain asymptomatic or show mild pathogenicity in pigs (Zell et al., 2000) . Both TTSuV-2 and PEV exhibited a high prevalence in diarrhoeic piglets in this study, and when compared with PEDV-negative samples, a highly significant association of PEV (p = .033) and TTSuV-2 (p = .032) infection with PEDV-positive samples was observed. Although these data suggest that co-infections of PEDV with PEV and TTSuV-2 may result in the aggravation of clinical signs of diarrhoea in piglets, the evidence is limited, and further studies will need to be undertaken. In the current study, the data indicate that ongoing variation of the S1 gene is occurring in the PEDV strains circulating in China, which is the one of most important risk factors for the failure of classic PEDV vaccines and the changes in pathogenesis and tissue tropism. Direct evidence has been reported that insertion and deletion mutations of the S protein change the pathogenicity and tissue tropism of PEDV (Deng et al., 2016; Lin et al., 2016) . In this study, we summarized the S1-IDMPs of the 147 PEDV strains identified using the S1 protein of prototype PEDV CV777 strain as the comparator. Among these Chinese PEDV pandemic strains, the S58_S58insQGVN-N135dup-D158_I159del or S1-IDMPs similar to S58_S58insQGVN-N135dup-D158_I159del were the dominant insertion and deletion mutation pattern. Meanwhile, the S1-IDMPs of nearly all of the 147 strains occurred in the NTD of the S1 protein of PEDV; the S1-IDMP of only the LN/DT/2016/510a strain was located in both the NTD and RBD of the S1 protein. The evidence from S1-IDMPs indicated that, although ongoing variation has been occurring in the S1 genes of Chinese PEDV pandemic strains in the past three years, the S1-IDMPs of PEDV show genetic stability in the pig population in China. The sialic acid binding activity of the S protein facilitates infection by PEDV and other coronaviruses (Deng et al., 2016; Li et al., 2016; Schwegmann-Wessels et al., 2011) . In our study, the pandemic S1-IDMP S58_S58insQGVN-N135dup-D158_I159del of Chinese PEDV variant strains was also located in the NTD of the S1 protein, where it is associated with the sialic acid binding activity. Similar to that reported by Chen et al. (2019, the homology modelling of the S protein showed that, when compared with PEDV CV777, the 2018 PEDV strains exhibited four major structural changes at residues 55-64, 113-120, 130-142 and 157-165, which all reside on the surface of the S1-NTD region. In addition, the amino acid mutations of the S1 protein are mainly located in the region of S1-NTD. These data suggest that the S1-IDMP S58_S58insQGVN-N135dup-D158_ I159del, which is located in the S1-NTD, may enhance the cellular entry of Chinese PEDV pandemic strains via alteration of the sialic acid binding activity. The S1-IDMP of the LN/DT/2016/510a strain identified in our study is a unique and novel S1-IDMP because it is located not only in the NTD but also in the RBD; the S1-IDMPs of the JL/2016/47 (a, b) strain and the HLJ2015/DP1-1 strain identified in our previous study were also novel because they exhibit insertion and deletion mutations at four sites of the S1-NTD. The homology modelling of the S protein suggested that these S1-IDMPs, occur- were identified in our study. Moreover, when compared with PEDV CV777, 10 novel mutation positions of the S1 gene were found in our study. These results could be of significance in understanding the co-infection and genetic evolution of PEDV, and in the development of new strategies to control the disease. The study was supported by the National Natural Science Comparative analysis of the predicted S1 protein modelling of PEDV HLJ/2015/DP1-1, JL/2016/47a and LN/DT/2016/510a strains. (a) The compared S1 protein modelling between PEDV HLJ/2015/DP1-1 strain and 2015_PEDV-strain. (b) The compared S1 protein modelling between PEDV JL/2016/47a strain and 2016_PEDV-strain. (c) The compared S1 protein modelling between PEDV LN/ DT/2016/510a strain and 2016_PEDV-strain. The HLJ/2015/DP-1-1 strain was shown as surface and cartoon by purpleblue. The 2015_ PEDV-strain was shown as cartoon by yellow. The JL/2016/47a strain was shown as surface and cartoon by wheat. The LN/DT/2016/510a strain was shown as surface and cartoon by lightblue. The 2016_PEDV-strain was shown as surface and cartoon by pink. The S1-RBD of LN/ DT/2016/510a strain was shown as surface by limon The authors declare no conflict of interest. The study was approved by the Animal Experiments Committee of the Heilongjiang Bayi Agricultural University (registration protocol 201401002). All sampling and publication of the data were approved by the farm owners. https://orcid.org/0000-0003-3144-3763 Influence of hydrophobic and electrostatic residues on SARS-coronavirus S2 protein stability: Insights into mechanisms of general viral fusion and inhibitor design SWISS-MODEL: Modeling protein tertiary and quaternary structure using evolutionary information Genetic evolution analysis and pathogenicity assessment of porcine epidemic diarrhea virus strains circulating in part of China during Metagenomic analysis of the RNA fraction of the fecal virome indicates high diversity in pigs infected by porcine endemic diarrhea virus in the United States Novel astroviruses in insectivorous bats WebLogo: A sequence logo generator Virological and molecular characterization of a mammalian orthoreovirus type 3 strain isolated from a dog in Italy Nomenclature for the description of human sequence variations Identification and comparison of receptor binding characteristics of the spike protein of two porcine epidemic diarrhea virus strains Nested reverse transcriptase-polymerase chain reaction for the detection of group A rotaviruses NetNGlyc 1.0 Server. Center for Biological Sequence Analysis, Technical University of Denmark avail Specific asparagine-linked glycosylation sites are critical for DC-SIGN-and L-SIGN-mediated severe acute respiratory syndrome coronavirus entry Genomic and evolutionary inferences between American and global strains of porcine epidemic diarrhea virus Interactive tree of life (iTOL) v3: An online tool for the display and annotation of phylogenetic and other trees Receptor recognition mechanisms of coronaviruses: A decade of structural studies New variants of porcine epidemic diarrhea virus, China Cellular entry of the porcine epidemic diarrhea virus Evolution, antigenicity and pathogenicity of global porcine epidemic diarrhea virus strains Kobuvirus shedding dynamics in a swine production system and their association with diarrhea A new coronavirus-like particle associated with diarrhea in swine Molecular detection and phylogenetic analysis of porcine circovirus type 3 in 21 Provinces of China during 2015-2017 The sialic acid binding activity of the S protein facilitates infection by porcine transmissible gastroenteritis coronavirus Newly emerged porcine deltacoronavirus associated with diarrhoea in swine in China: Identification, prevalence and full-length genome sequence analysis Identification of two novel B cell epitopes on porcine epidemic diarrhea virus spike protein Epidemiology and vaccine of porcine epidemic diarrhea virus in China: A mini-review MEGA6: Molecular evolutionary genetics analysis version 6.0. Molecular Biology and Evolution Large-scale screening and characterization of enteroviruses and kobuviruses infecting pigs in Vietnam New variant of porcine epidemic diarrhea virus Molecular characterization of the ORF3 and S1 genes of porcine epidemic diarrhea virus non S-INDEL strains in seven regions of China Identification and survey of Torque teno virus in pigs in China Establishment and application of a nest RT-PCR method for detection of porcine torovirus Genetic epidemiology of porcine epidemic diarrhoea virus circulating in China in 2012-2017 based on spike gene Isolation and experimental inoculation of an S INDEL strain of porcine epidemic diarrhea virus in Japan Detection of porcine enteroviruses by nRT-PCR: Differentiation of CPE groups I-III with specific primer sets Viral metagenomics analysis demonstrates the diversity of viral flora in piglet diarrhoeic faeces in China Occurrence and investigation of enteric viral infections in pigs with diarrhea in China Identification of N-linked glycosylation sites in the spike protein and their functional impact on the replication and infectivity of coronavirus infectious bronchitis virus in cell culture Development and validation of a multiplex conventional PCR assay for simultaneous detection and grouping of porcine bocaviruses A molecular epidemiological investigation of PEDV in China: Characterization of co-infection and genetic diversity of S1-based genes