key: cord-344364-vu389d88 authors: Wang, Wei; Zhang, Wei; Zhang, Jingjing; He, Ji; Zhu, Faming title: Distribution of HLA allele frequencies in 82 Chinese individuals with coronavirus disease‐2019 (COVID‐19) date: 2020-06-02 journal: HLA DOI: 10.1111/tan.13941 sha: doc_id: 344364 cord_uid: vu389d88 COVID‐19 is a respiratory disease caused by a novel coronavirus and is currently a global pandemic. HLA variation is associated with COVID‐19 because HLA plays a pivotal role in the immune response to pathogens. Here, 82 individuals with COVID‐19 were genotyped for HLA‐A, ‐B, ‐C, ‐DRB1, ‐DRB3/4/5, ‐DQA1, ‐DQB1, ‐DPA1, and ‐DPB1 loci using next‐generation sequencing (NGS). Frequencies of the HLA‐C*07:29, C*08:01G, B*15:27, B*40:06, DRB1*04:06, and DPB1*36:01 alleles were higher, while the frequencies of the DRB1*12:02 and DPB1*04:01 alleles were lower in COVID‐19 patients than in the control population, with uncorrected statistical significance. Only HLA‐C*07:29 and B*15:27 were significant when the corrected P‐value was considered. These data suggested that some HLA alleles may be associated with the occurrence of COVID‐19. COVID-19 is a respiratory disease caused by a novel coronavirus and is currently a global pandemic. HLA variation is associated with COVID-19 because HLA plays a pivotal role in the immune response to pathogens. Here, 82 individuals with COVID-19 were genotyped for HLA-A, -B, -C, -DRB1, -DRB3/4/5, -DQA1, -DQB1, -DPA1, and -DPB1 loci using next-generation sequencing (NGS). Frequencies of the HLA-C*07:29, C*08:01G, B*15:27, B*40:06, DRB1*04:06, and DPB1*36:01 alleles were higher, while the frequencies of the DRB1*12:02 and DPB1*04:01 alleles were lower in COVID-19 patients than in the control population, with uncorrected statistical significance. Only HLA-C*07:29 and B*15:27 were significant when the corrected P-value was considered. These data suggested that some HLA alleles may be associated with the occurrence of COVID-19. A new type of pneumonia with an unknown causative agent broke out in Wuhan, Hubei province, China in late December 1, 2019. 1,2 A novel coronavirus was subsequently confirmed as the agent causing the disease. 1, 2 The novel coronavirus was named 2019-nCoV by the World Health Organization (WHO) on January 12, 2020, and was later named severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) by the International Classification Committee of Viruses on February 11, 2020. 3, 4 Now, the associated disease has been named coronavirus disease 2019 (COVID-19) by the WHO. 3, 4 The transmission mode for COVID-19 has been shown to be human to human. 3 COVID-19 is currently a global pandemic, and over 2 740 000 cases of COVID-2019 and 191 000 deaths have been reported globally as of April 24, 2020. The HLA system participates in immune regulation in humans and it plays an important role in the occurrence and development of infectious diseases. [5] [6] [7] Chen et al reported an odds ratio of 4.4 for SARS-CoV infection in individuals homozygous or heterozygous for HLA-C*08:01. 7 We hypothesized that HLA variation in the population may be associated with the occurrence of COVID-19, because HLA plays a pivotal role in the immune response to pathogens. Here, we report HLA allele frequencies in Chinese Han individuals with COVID-19 and a comparison between HLA allele distribution in COVID-19 patients and healthy individuals. Eighty-two Han individuals from Zhejiang with confirmed COVID-19 were tested. All patients had mild or severe COVID-19, with none in a critical condition. Individuals were recruited for plasma donation after recovery and the plasma was used for treatment of other COVID-19 patients with severe or critical symptoms. The ages of the individuals ranged from 20 to 54. All samples were collected during the plasma donation process and informed consent was obtained from all individuals. The samples were genotyped at the HLA-A, -B, -C, -DRB1, -DRB3/4/5, -DQA1, -DQB1, -DPA1, and -DPB1 loci by NGS-based typing, using an AllType NGS 9-Loci Amplification Kit (One Lambda Inc., Canoga Park, California), according to the manufacturer's instructions. HLA genotypes were assigned using TypeStream Visual Software version 2.0 (One Lambda Inc.). The Chinese HLA common and well-documented principle was used to solve ambiguous HLA allele combination assignments. 8 Hardy-Weinberg equilibrium (HWE) was assessed by maximum likelihood, using the method of Guo and Thompson, which has been implemented in Arlequin version 3.5. 9 Odds ratios (95% confidence interval [CI]) and P-values were calculated using Fisher's exact test or Yates continuity-corrected χ 2 test in Prism 5.0 software (GraphPad, San Diego, California). Corrected P-values (Pc) were obtained by multiplying the number of alleles at each locus using the Benjamini-Hochberg method. The significance of the Pc-value was set at a level of .05. The results of HLA-A, -B, -C, -DRB1, -DRB3/4/5, -DQA1, -DQB1, -DPA1, and -DPB1 genotyping were fitted for HWE after considering the Pc-value (Table S1) (Table S2 ). The allele distributions of HLA-A, -C, -B, -DRB1, -DQB1, and -DPB1 loci were compared between COVID-19 patients and control individuals. The resulting ORs (95% CI) and P-values are presented in Table S2 . Data for control individuals were obtained from our previous studies of bone marrow donors in the Zhejiang Han population. 10, 11 There were 3548 individuals in the control group with genotyping data for the HLA-A, -B, -C, -DRB1,-DQB1 loci, and 242 with data for HLA-DPB1 locus. 10, 11 HLA genotyping of the control individuals was performed using polymerase chain reaction at two-field resolution, as in our previous reports. 10, 11 The frequencies and ORs of the HLA alleles, with uncorrected significance, for COVID-19 patients are listed in Table 1 . HLA-C*07:29, C*08:01G (including C*08:01 and C*08:22), B*15:27, B*40:06, DRB1*04:06, and DPB1*36:01 frequencies were higher in COVID-19 patients than in the control population, with uncorrected statistical significance (P < .05). The ORs were 130.20, 1.65, 3.59, 2.43, 2.39, and 12.08, respectively. Meanwhile, the frequencies of the DRB1*12:02 and DPB1*04:01 alleles were significantly lower in COVID-19 patients (Pc < .05), with ORs of 0.44 and 0.40, respectively. However, only the frequencies of HLA-C*07:29 and B*15:27 remained significantly different after P-value correction. The HLA-C*07:29 allele is a well-documented allele in the Chinese population, with five cases in the 816 486 bone marrow donors. 8 In the present study, HLA-C*07:29 was found in one COVID-19 patient, but in no individuals in the control group. Therefore, the significance of C*07:29 should be interpreted with caution and this result needs to be confirmed in further studies with larger sample sizes. Two alleles, B*07:02 (1.38%) and B*27:04 (1.92%), with frequencies greater than 1% in the control group, were not found in COVID-19 patients. The odds ratios for B*07:02 and B*27:04 were 0.216 (95% CI: 0.01335-3.494, P = .2408) and 0.155 (95% CI: 0.009598-2.503, P = .134). Because there were no data for HLA-DRB3/4/5, -DQA1, or -DPA1 loci in the control group, 10,11 the allele frequencies of these loci could not be compared between the two groups. The allele frequency data for the HLA-DRB3/4/5, -DQA1, and -DPA1 loci are summarized in Table S3 . The HLA system is an important host genetic factor that plays an important role in determining the outcome of many infectious diseases, including human immunodeficiency virus (HIV) and severe acute respiratory syndrome (SARS). 6, 12, 13 Associations between HLAs and the development and/or severity of SARS have been found in some populations. 7,13,14 HLA-B*07:03, B*46:01, DRB1* 03:01, DRB1*12:02 alleles have been reported to be associated with susceptibility to SARS, but some studies have Note: #C*08:01G ( 9.02%) was included C*08:01 and C*08:22. The number of the individuals in the control group were 3548 for HLA-B, -C, -DRB1 loci, and 242 for HLA-DPB1 locus. not confirmed these results. 15 The sequence of SARS-CoV-2 shows some homology with SARS-CoV, but there are distinct differences between the two viruses. 4 Therefore, the association of HLA alleles with COVID-19 warrants further research. 15, 16 In the present study, these SARS-susceptibility alleles were not found to occur at a significantly different frequency in COVID-19 patients after P-value correction. Nguyen et al 16 predicted the binding affinity of SARS-CoV-2 to 145 HLA class I alleles, and HLA-A*02:02, HLA-B*15:03, and HLA-C*12:03 were found to be the top presenters of conserved peptides. We found that B*15:27 alleles may be associated with the occurrence of COVID-19. HLA-B*15:03 and B*15:27 belong to the B*15 group and have 10 nucleotide differences. Prediction of the peptide-binding groove of these alleles may help to explain their association with COVID-19. Although the number of samples in the present study was small, these data will still be useful for exploring the influence of HLA gene polymorphisms on susceptibility to COVID-19 and patient outcomes. Additional supporting information may be found online in the Supporting Information section at the end of this article. Estimating clinical severity of COVID-19 from the transmission dynamics in Wuhan The outbreak of COVID-19: an overview Spread of SARS-CoV-2 Systematic comparison of two animal-to-human transmitted human coronaviruses: SARS-CoV-2 and SARS-CoV HLA variation and disease HIV and host immunogenetics: unraveling the role of HLA-C Epidemiological and genetic correlates of severe acute respiratory syndrome coronavirus infection in the hospital with the highest nosocomial infection rate in Taiwan in 2003 HLA common and well-documented alleles in China. HLA Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and windows The distributions of HLA-A, HLA-B, HLA-C, HLA-DRB1 and HLA-DQB1 allele and haplotype at high-resolution level in Zhejiang Han population of China Establishment of a polymerase chain reaction sequencing based typing method for HLA-DPB1 exons 2 and 3 and investigation of their polymorphisms Lack of association between HLA-A, -B and -DRB1 alleles and the development of SARS: a cohort of 95 SARS-recovered individuals in a population of Guangdong, southern China Association of HLA class I with severe acute respiratory syndrome coronavirus infection Human-leukocyte antigen class I Cw 1502 and class II DR 0301 genotypes are associated with resistance to severe acute respiratory syndrome (SARS) infection HLA studies in the context of coronavirus outbreaks Human leukocyte antigen susceptibility map for SARS-CoV-2 This work was sponsored by the Science Research Foundation of Zhejiang Province (LGF19H100005) and the Science Research Foundation of Zhejiang Healthy Bureau (2020RC053). The authors have declared no conflicting interests. Data sharing is not applicable to this article as no new data were created or analyzed in this study. Faming Zhu https://orcid.org/0000-0002-1963-9176