key: cord-0274029-bdta241e authors: Lehmann, Alexander; Zhang, Ting; Reche, Pedro A; Lehmann, Paul V. title: Discordance between the predicted vs. the actually recognized CD8+ T cell epitopes of HCMV pp65 antigen and aleatory epitope dominance date: 2020-11-07 journal: bioRxiv DOI: 10.1101/2020.11.06.371633 sha: 98ab97a944501a559c4503107a299c6fd387e082 doc_id: 274029 cord_uid: bdta241e CD8+ T cell immune monitoring aims at measuring the numbers and functions of antigen-specific CD8+ T cell populations engaged during immune responses, providing insights into the magnitude and quality of cell-mediated immunity operational in a test subject. The selection of peptides for ex vivo CD8+ T cell detection is critical, however, because for each restricting HLA class I molecule present in a human individual there is a multitude of potential epitopes within complex antigens, and HLA diversity between the test subjects predisposes CD8+ T cell responses to individualized epitope recognition profiles. We report here on a brute force CD8+ T cell epitope mapping approach for the human cytomegalovirus (HCMV) pp65 antigen on ten HLA-A*02:01-matched HCMV infected human subjects. In this approach, in each test subject, every possible CD8+ T cell epitope was systematically tested; that is 553 individual peptides that walk the sequence of the HCMV pp65 protein in steps of single amino acids. Highly individualized CD8+ T cell response profiles with aleatory epitope recognition patterns were observed. We compared the actually detected epitope utilization in each individual with epitope prediction ranking for the shared HLA-A*02:01 allele, and for additional HLA class I alleles expressed by each individual. No correlation was found between epitopes’ ranking on the prediction scale and their actual immune dominance. The data suggest that accurate CD8+ T cell immune monitoring might depend on the agnostic reliance on mega peptide pools, or brute force mapping, rather than individualized epitope predictions. For the past decades, clinical immune diagnostic has relied on the detection of serum antibodies, with interrogation of T cells confined to the scientific discovery. While antibody measurements have become invaluable clinical tests for detecting infections and autoimmune diseases, they have the shortcoming of providing insights into humoral immunity only. There is an increasing number of conditions, however, in which serum antibodies fail to reveal even the immune status of an individual. These include tuberculosis 1 , Lyme disease 2 , human immunodeficiency virus (HIV) elite controllers 3 , up to 30% of human cytomegalovirus (HCMV) infected individuals 4 , and SARS-CoV-2 infection: in the latter, as with other human coronavirus infections, only a transient antibody response tends to occur, after which specific serum antibodies decline to below detection limit 5 . In all the aforementioned conditions, however, antigen-specific T cell immunity could be detected (see the corresponding references above), verifying that an in immune response had indeed occurred, suggesting that cell-mediated immunity can occur, and render protection, in the absence of detectable humoral immunity. Measurement of humoral immunity, therefore, is neither a reliable surrogate for the mere existence of cell mediated immunity, nor does it provide insight into its quality and magnitude. CD4+ and CD8+ T cells and their subpopulations make distinct contributions to host defense however, that, dependent on the pathogen, can be protective or disadvantageous 6 . This realization has led to an increasing need to progress with T cell immune monitoring, recently precipitated by the SARS-CoV-2 pandemic 7 . T cell immune monitoring has a long and successful track record in murine models in which defined experimental conditions, small model antigens, and work with inbred mouse strains expressing few restriction elements (MHC molecules) simplified the task 8 . The magnitude of scope is entirely different when the outbred human population is to be studied, largely due to the immense diversity in restriction elements (human leukocyte antigens, HLA) and the complexity of the antigenic systems, such as viruses. To comprehensively monitor T cell immunity to SARS-CoV-2, for example, this virus' entire proteome, 9,871 amino acids long 9 , would need to be considered. In this report, we confined ourselves to a single protein of HCMV, pp65, which is "only" 561 amino acid long, and to subjects who shared a common HLA-A * 02:01 restriction element, but differed in the remaining HLA class I alleles. Monitoring CD4+ T cell immunity is relatively simple. When the test antigen of interest is added as a protein to peripheral blood mononuclear cells (PBMC), the antigen presenting cells (APC) contained in the PBMC will acquire, process, and present the antigen 10 . Therefore, natural antigen processing and presentation mechanisms will select which peptide fragments (epitopes) of the antigen will be presented on all HLA class II molecules expressed by an individual. The same spectrum of epitopes that triggered the induction of a CD4+ T cell response in vivo will therefore be displayed by the APC in the recall assay ex vivo, resulting in the activation of the entire antigen-specific CD4+ T cell repertoire, making sure that no CD4+ T cell epitope will be left behind. For CD4+ T cell immune monitoring, it is therefore not required to know what the potential CD4+ T cell epitopes are in a given donor, nor is there a need to tailor peptides to each test subject's HLA type -the APC fulfill this function. Unfortunately, this is not the case for ex vivo CD8+ T cell detection. CD8+ T cells evolved to recognize antigens actively bio-synthetized within host cells, as opposed to antigens that APC acquire from their surroundings. Thereby CD8+ T cells can survey ongoing protein synthesis in the cells of the body, permitting them to identify virally-infected or malignant cells, so as to kill them. During protein synthesis, defective byproducts also arise that are degraded by the proteasome into peptide fragments. Such peptides are loaded onto HLA class I molecules, and transported to the cell surface to be displayed to CD8+ T cells 11 . Protein antigens are not suited to recall in vivo-primed CD8+ T cells within PBMC because exogenously added proteins are not efficiently presented to CD8+ T cells in the context of class I molecules. Instead, the CD8+ T cell epitopes need to be added as 8-11 amino acid long peptides that they can bind directly to cell surface expressed HLA class I molecules. From this requirement the need arises to select the "right" peptides for ex vivo CD8+ T cell immune monitoring: those very same epitopes that have induced a CD8+ T cell response in vivo. Missing the "right" peptides, or only partially covering them, has the consequence that the antigen-specific CD8+ T cell repertoire could go partially or entirely undetected. Selecting the "right" peptides for CD8+ T cell immune monitoring is an inherently intricate task. HLA class I molecules are encoded by three genetic loci, HLA-A, HLA-B, and HLA-C, for which a multitude of alleles exist in the human population 12 . Each allelic HLA class I molecule has a unique peptide binding specificity 13 . As there are barely two humans with an identical HLA-type, there should be barely two humans who present the same array of epitopes. Protecting the species, T cell epitope recognition evolved to be highly individualized 14 . Peptide selection for comprehensive CD8+ T cell immune monitoring must therefore account for the unique HLA allele composition in each test subject. A mainstream effort for identifying the "right" peptides for CD8+ T cell monitoring is reliant upon in silico epitope predictions. As the peptide binding motifs for most HLA alleles are well-defined, predictions can be made as far as which peptide sequence of an antigen can bind to a given HLA allele, thus constituting a potential T cell epitope. Search engines have been made available to the scientific community to rank peptide sequences for their predicted binding strength to most HLA alleles, thus narrowing in on a finite set of epitopes. A critical assumption for epitope predictions is that peptides that rank high in their respective HLA allele binding score will be those that are being targeted most by CD8+ T cells. The data presented in this study challenge this hypothesis supporting the conclusions reached by Mei et al 15 . Beyond doubt, a peptide needs to be able to bind to an HLA allele to be a candidate for T cell recognition. However, whether a peptide sequence of a protein antigen that has HLA-binding potential indeed becomes an epitope recognized by T cells is defined by many additional factors 16 . Limitations exist on the level of antigen presentation, including whether that exact peptide is indeed generated through natural antigen processing, and whether it is produced in quantities that can outcompete other peptides, including self-peptides, for binding to the respective HLA molecules. Limitations also exist on the level of the pre-immune T cell repertoire available to engage in antigen recognition. The duration and abundance of epitope presentation will also affect the ensuing CD8+ T cell response, being regulated both by a virus' replication biology, and the host's ability to control the virus. Therefore, it can be expected that only a fraction of peptides with HLA class I binding properties will elicit strong CD8+ T cell responses, becoming dominant epitopes. Other presented peptides might induce a weaker, subdominant, barely detectable, cryptic, or no CD8+ T cell responses at all. As all antigen-specific CD8+ T cells can be expected to contribute equally to the host's defense, irrespective of their fine specificity, comprehensive immune monitoring must not focus on a single or few epitopes, but should instead accommodate all epitopes of an antigen targeted by CD8+ T cells in an individual in order to assess the entire antigen-specific T cell pool. Next to predictions in silico, experimentally verified epitopes have been used as a guide to select peptides for CD8+ T cell immune monitoring. Over the years, T cell lines and clones specific for many viral antigens have been isolated and their epitope specificity compiled in databases 17, 18 . Selection of such previously verified epitopes for immune monitoring is based on the assumption that epitope recognition, including epitope hierarchy, is constant in subjects who express the corresponding HLA allele. In other words, if an HLA-X-restricted peptide Y has been identified as an immune dominant epitope in an HLA-X positive subject Z, this peptide Y will also be immune dominant in HLA-X positive subjects V and W. Such predictable immune dominance prevails in simple murine models when inbred mice are studied that express minimal restriction element diversity 19 . However, predictable epitope dominance is lost as soon as restriction element diversity rises through interbreeding these inbred mouse strains. In such F1 mice, aleatory epitope recognition prevails 20 : T cells in each F1 mouse respond in an unpredictable, dice-like fashion (alea means dice in Latin) to epitopes to which the parental strains responded predictably. Aleatory epitope dominance may also apply to humans due to their diverse restriction element makeup 21 . Therefore, in the present study of HCMV pp65 epitope recognition in HLA-A * 02:01-positive individuals, we also compare the peptides that the CD8+ T cells actually target in our cohort with previously verified epitopes. The third approach for CD8+ T cell immune monitoring is not to select peptides at all, but to systematically test all possible peptides of the antigen. This can be done by using mega peptide pools consisting of hundreds of peptides that cover entire proteins of a virus. By necessity, this approach has become standard recently in the first real world challenge on clinical T cell immune monitoring: trying to study T cell immunity induced by SARS-CoV-2 infection. This crude approach is simple and practical, yet permits comprehensive assessment of the entire expressed antigen-specific T cell repertoire in outbred populations, without requiring customization to HLA types of individuals, but it does not reveal the epitope specificity of the antigen-reactive T cells. In this study, we applied an agnostic approach in which all possible peptides were tested individually on each subject in a "brute force" high-throughput manner 22 . The ability to test hundreds, even thousands of peptides individually on a subject is a recent technological advancement. The hurdles that needed to be overcome included limitations in PBMC numbers available from a subject, access to extensive custom peptide libraries, high-throughput-capable T cell assay platforms, and automated data analysis. We have developed and report here large-scale epitope mapping strategies that can be readily adopted even in small academic laboratories operating on tight budgets. Empowered by the ability to experimentally verify CD8+ T cell epitope utilization at the highest possible resolution in the rather well-studied HCMV pp65 T cell immune monitoring model, we set out to compare the epitopes actually recognized with those that are predicted, or assumed to be recognized based on existing data. We draw attention to how incomplete our appreciation of an individual's expressed epitope space currently is, and suggest that neither epitope predictions, nor reliance on known epitopes suffice, but rather that the agnostic route is best suited for comprehensive CD8+ T cell immune monitoring. PBMC from healthy adult human donors were from CTL's ePBMC library (CTL, Shaker Heights, OH, USA). The PBMC had been collected by HemaCare (Van Nuys, CA) under HemaCare's IRB and sold to CTL concealing the subjects' identities. The donors' age, sex, ethnicity, and HLA type are shown in S. Table 1 . HLA typing was contracted to the University of Oklahoma Health Science Center (Oklahoma City, OK). The ten subjects for this study were selected according to their HCMV-positive status. The frozen cells were thawed following an optimized protocol 23 resulting in viability > 90% for all samples. The PBMC were resuspended in CTL-Test™ Medium (from CTL), developed for low background and high signal performance in ELISPOT assays. The number of PBMC plated into the ImmunoSpot® (ELISPOT) experiments was 3 × 10 5 PBMC per well. 553 nonamer peptides, spanning the entire amino acid (a.a.) sequence of the HCMV pp65 protein in steps of single a.a. were purchased from JPT (Berlin, Germany) as a FastTrack CD8 epitope library. These peptides were not further purified following their synthesis, however, individual peptides were analyzed by JPT using LC-MS. The average purity of these peptides was 56%. These peptides were delivered as lyophilized powder with each peptide present in a dedicated well of a 96-well plate, distributed across a total of six 96-well plates. Individual peptides were first dissolved in 50μL DMSO, followed by addition of 200μL of CTL-Test™ Medium so as to generate a "primary peptide stock solution" at 100μg peptide/mL with 20% v/v DMSO. From each of these wells, a "secondary, 10X peptide stock solution" was prepared using a 96-well multichannel pipette, in which peptides were at a concentration of 2μg/mL, with DMSO diluted to 0.4%. On the day of testing, 20μL from each well was transferred "en block," with a 96-well multi-channel pipette into pre-coated ImmunoSpot® assay plates containing 80μL CTL-Test™ Medium. Finally, 100μL of PBMC (containing 3 × 10 5 cells) in CTL-Test™ Medium was added resulting in a test peptide concentration of 0.2μg/mL with DMSO present at 0.04% v/v. UV-inactivated entire HCMV virions (HCMV Grade 2 antigen from CTL) at 10μg/mL was used to recall HCMV-specific CD4 cells. CPI (from CTL) was used as a positive control because, unlike CEF peptides, CPI elicits T cell recall responses in all healthy donors 24 . CPI is a combination of protein antigens derived from CMV, influenza and parainfluenza viruses, and was used at a final concentration of 6μg/mL in ImmunoSpot® assays. Single-color enzymatic ImmunoSpot® kits from CTL were used for the detection of antigen-induced IFNγproducing CD8+ T cells. Peptides or pp65 were plated at the above specified concentrations into capture antibody-precoated assay plates in a volume of 100μL per well. These plates with the antigen were stored in a CO2 incubator for less than 1h until the PBMC were thawed and ready for plating. The PBMC were added at 3 × 10 5 cells/well in 100μL CTL-Test™ Medium followed by a 24h activation culture at 37°C and 9% CO2. Thereafter the cells were removed, IFNγ detection antibody was added, and the plate-bound cytokine was visualized by enzyme-catalyzed substrate precipitation. After washing, the plates were air-dried prior to scanning and counting of spot forming units (SFU). ELISPOT plates were analyzed using an ImmunoSpot® S6 Reader, by CTL. For each well, SFU were automatically calculated by the ImmunoSpot® Software using its Autogate™ function 25 . The data are expressed as SFU per 3 × 10 5 PBMC, whereby each SFU corresponds to the cytokine footprint of an individual IFNγ-producing T cell 26 . As SFU counts follow Gaussian (normal) distribution among replicate wells, the use of parametric statistics is appropriate to identify positive and negative responses, respectively 27 . The 553 individual peptides of the pp65 nonamer peptide library were tested in single wells. For these peptides, the threshold for a positive response was set at SFU counts exceeding 3 SD of the mean SFU count detected in 18 replicate media control wells, the latter defining the background noise of the test system. This cut off criterion for weak (cryptic) responses renders the likelihood for false positive results at 0.3%. Dominant responses were defined by exceeding 10 SD, and subdominant responses between 5 and 10 SD of the negative control. We assessed peptide-HLA I presentation by predicting peptide-HLA I binding using HLA I allele-specific profile motif matrices 28 . We considered that a given peptide binds to a specific HLA I molecule when its binding score ranks within the top 3% percentile of the binding scores computed for 1,000 random 9mer peptides (average amino acid composition of proteins in the SwissProt database). Peptide binding to experimentally defined HLA-A*02:01 restricted epitopes was predicted using netMHCIpan 28 at IEDB analysis resource 17 , reporting percentile binding score. The lower the percentile binding score the better the binding. Epitope data for HLA-A2-restricted CD8 cell recognition was obtained from IEDB 17 with the following search settings: positive response only; host human; MHC I allele, HLA-A*0201, source species HCVM, source antigen: pp65. Only peptides 9 a.a. long were considered. We took advantage of the fact that ImmunoSpot® assays require as few as 300,000 PBMC per antigen stimulation condition, and that these assays lend themselves to high-throughput testing and analysis. Utilizing only 173 million PBMC per subject, we therefore could test individually 553 nonamer HCMV pp65 peptides, along with 18 negative control replicate wells to establish the background noise, and the CPI positive control in triplicate. Nonamer peptides were selected because the peptide binding groove of HLA class I molecules accommodates peptides 8-11 amino acid (aa) in length, with the most common peptide size being 9 aa residues 10 . In contrast, HLA class molecules present longer peptides 29 . As such, the usage of nonamer peptides in our assays permitted selective activation of CD8+ T cells. Moreover, because the peptide binding groove of HLA class I molecules is closed on both ends 10 , it is intolerant for frame shifts. The peptide library was designed therefore to walk the pp65 protein in steps of single a.a. with each nonamer peptide overlapping by 8 a.a. with the previous one (S. Figure 1) . Importantly, this approach enabled systematic coverage of every possible CD8+ T cell epitope within the pp65 antigen. To reduce assay variables, all peptides used in this study were from the same vendor, and were synthetized, stored, dissolved, and tested in the same way. Moreover, all the peptides were tested on each PBMC donor in a single experiment, which rendered the peptides the only assay variable. Standard IFNγ ImmunoSpot® assays with 24h antigen exposure of PBMC were performed; a time period required for blast transformation and CD8+ T cell activation-driven IFNγ secretion to occur, but too short to permit CD8+ T cell proliferation or differentiation during the cell culture. Thus, we measured at singlecell resolution the frequencies of antigen-specific IFNγ-producing CD8+ T cells as they occurred in vivo at isolation of the PBMC. This approach, therefore permitted us to firmly measure within each PBMC sample the number of CD8+ T cells responding to each peptide, and thus to compare the frequencies of peptide-reactive CD8+ T cells to establish epitope hierarchies for each donor. Moreover, adding up all peptide-induced IFNγ SFU permits one to assess the cumulative magnitude of the antigen-specific CD8+ T cell population in each donor, in turn allowing for determination of the relative percentage of antigenspecific CD8+ T cells targeting individual epitopes in each test subject. Such actual measurements of the epitope-specific CD8+ T cells were then compared to (a) the recognition of published HLA-A * 02:01-restricted epitopes, and (b) epitope predictions not only for HLA-A * 02:01, an allele that all 10 test subjects shared by design, but also for all other class I molecules expressed by the test subjects. Eighteen replicate wells containing media alone were included for all 10 individuals in our HCMV positive, HLA-A * 02:01 positive cohort in order to firmly establish the background noise of the respective PBMC. The mean and standard deviation (SD) of this negative control is shown for all subjects in Table 1 , also specifying the cut off values used for analyzing the peptide-induced SFU counts. The 533 individual pp65 nonamer peptides were also tested on each subjects' PBMC, and the resulting peptideinduced SFU-counts graded: peptides triggering SFU counts larger than 3 and less or equal to 5 SD over the medium control were considered weakly positive or cryptic (highlighted in beige in Table 1 ). Of note, with the mean plus 3 SD definition utilized in this study, the chance for a datapoint being false positive was negligible, less than 0.3%. Peptides triggering SFU counts more than 5 and less than or equal 10 SD over the medium background (highlighted in yellow) were labelled subdominant, and peptides eliciting SFU counts exceeding 10 SD over the medium control were called dominant (and are labelled in orange in Table 1 ). We also introduced a fourth category for peptides that recalled CD8+ T cells in frequencies exceeding 100 SFU/300,000 PBMC, calling them super-dominant epitopes (shown in red in Table 1 ). Table 1 lists peptides that induced at least one dominant or super-dominant recall response in at least one of the ten test subjects in our cohort. Only for these 56 select peptides of the 553 tested are SFU counts shown for the ten donors. Additionally, a color-coding system was utilized in Table I to delineate whether the peptide recalled a super-, dominant, subdominant, cryptic, or no response in the test subject. As revealed by the color code at a glance, epitope recognition followed highly individual patterns, that are closer dissected below. As Table 1 lists only super-and dominant recall responses (>10 SD over background), in S. Table 2 we list 58 additional peptides that induced subdominant recall responses (5-10 SD over the background) in at least one of the ten test subjects in our cohort. At a glance, the color code reveals that these peptides are also recognized in a highly individualized pattern. Peptides that recalled cryptic responses (3-5 SD over background) are not listed individually, but their number is specified for each test subject in Table 2 , along with the number of subdominant, and dominant and super-dominant epitopes recognized in each donor. Adding up the number of epitopes in all four categories permits one to establish the cumulative number of CD8+ T cell epitopes recognized in each subject, which varied between 5 and 47 HCMV pp65-derived peptides in this cohort ( = 29 + 17). Thus, of the 553 peptides covering the 561 amino acid long pp65 protein, between 1% and 8% ( = 5%) of the peptides constituted a CD8+ T cell epitope in each individual, but for the entire cohort 114 peptides (21% of 553 peptides tested) were needed to recall all dominant (56 peptides) and subdominant (58 peptides) CD8+ T cell epitopes. As T cells recognize processed peptides of antigens there is no reason to assume that a T cell specific for one peptide of the antigen will contribute differently to host defense than T cells recognizing another. Immune monitoring therefore needs to assess all antigen-specific CD8+ T cells irrespective of their fine specificity. In our systematic assessment of CD8+ T cell immunity to pp65, we defined this number as the sum of all SFU counts elicited by the individual epitopes in a subject. This cumulative number of pp65-specific CD8 cells is shown for each subject in Table 2 as "Cum. Spec. SFU". From this number, one can calculate what percentage of the pp65-specific CD8 + T cells occurs in each of the four response categories. As seen in Table 2 , although the number of super-dominant epitopes was low in each subject (between 4 and 1), in eight of ten donors the majority of pp65-specific CD8+ T cells targeted these super-dominant epitopes. The percentage of CD8+ T cells specific for individual dominant and superdominant epitopes is shown in S. Table 3 . The data in Table 1 and S. Table 2 show that each subject in our cohort displayed a unique CD8+ T cell epitope recognition pattern. This might come as a surprise, as all these subjects were HLA-A * 02:01 positive, and one might have expected that among the epitopes recognized there should be at least a shared subset, those restricted by the HLA-A * 02:01 allele. To narrow our investigation on such HLA-A * 02:01-restricted epitopes, we searched the IEDB database for HLA-A * 02:01-restricted nonamer epitopes identifying 31 that have been experimentally verified so far: these are listed in S. Table 4 with the corresponding reference citations. With the exception of the epitope, pp65495-503, of all these 31 previously defined HLA-A * 02:01-restricted epitopes 5 peptides recalled super-dominant CD8+ T cells responses in only 2 of the 10 test subjects, while 7 additional peptides triggered occasional dominant recall responses. The rest of the 31 previously defined peptides elicited sporadic subdominant (n=4), cryptic (n=6) or no recall responses (n=8) at all. Importantly, donors who did not respond strongly or at all to these previously defined epitopes responded vigorously to other peptides of pp65 (Table 1) . These previously defined HLA-A * 02:01 -restricted peptides were therefore also targeted in a dice like, aleatory manner in HLA-A * 02:01 positive subjects. Only one previously defined HLA-A * 02:01 -restricted epitope, pp65495-503, induced a dominant, or superdominant recall response in 8 of 10 subjects in our cohort (Table 1 and S. Table 4 ). However, this peptide was not targeted in two donors (ID3# and ID#9) who exhibited responses to other pp65-derived peptides in a super-dominant fashion. Intrigued by this finding, we tested 42 additional (52 in total) HCMV positive, HLA-A * 02:01 positive subjects for their recall response to the pp65495-503 peptide. As shown in S. Figure 2 , the numbers of CD8+ T cells responding to the pp65495-503 peptide did not correlate (r 2 = 0.01) to the numbers of T cells recalled by inactivated HCMV virus; which primarily activates HCMVspecific CD4+ T cells. Even though all these subjects have developed T cell immunity to HCMV, about one fourth of them either did not respond to the pp65495-503 peptide, or displayed a low frequency of pp65495-503-specific CD8+ T cells. This finding is consistent with the notion that the CD8+ T cell response to pp65495-503 peptide is also aleatory. Interestingly, although the HLA-A * 02:01 restriction element was shared by all test subjects in our cohort, and despite the pp65495-503 peptide being displayed in vivo via natural processing and presentation, in some individuals this epitope triggered a super-dominant CD8+ T cell response while in other subjects it did not induce a detectable respond at all. Moreover, in yet other donors, the magnitude of the CD8+T cell response induced by this peptide was anywhere between these two extremes. However, the higher prevalence of recognition for this epitope (pp65495-503) compared to other previously defined HLA-A * 02:01-restricted epitopes might have an unexpected reason: in addition to HLA-A * 02:01, pp65495-503 received a top binding score for several additional HLA-class I alleles (see next). The participation of the other HLA class I alleles, beyond the shared HLA-A * 02:01 restriction element, might explain the highly individual CD8+ T cell response pattern observed in our cohort. Based on extensive knowledge on the peptide binding properties of individual HLA alleles, reference search engines have been established that permit in silico predictions of which peptides fit the binding criteria of a given allele, and moreover, the strength of peptide binding can also be ranked. It has been widely anticipated that such in silico models will suffice to predict epitope utilization. In particular, when there is the need to select one or a few candidate epitopes, e.g. for multimer analysis, it is tempting to pick peptides that have the highest predicted binding score for the HLA allele of interest. In the following we address the validity of such an approach from three angles. In the first two approaches we focused on predictions for HLA-A * 02:01, the most studied HLA allele that is shared by all subjects in our cohort. We introduced into the netMHCIpan 28 search engine of the IEDB analysis resource 17 the individual sequences of our pp65 nonamer peptide library, resulting in the predicted pp65 ranking shown in Table 3 (in which only the top 30 predicted peptides of 553 are shown). In Approach 1, we compared this in silico predicted epitope hierarchy for pp65 with the actual peptide recognition we detected in our cohort. As can be seen in Table 3 , pp65495-503 ranked as the top binder, and indeed induced a CD8+ T cell response in the majority of our HLA-A * 02:01 positive cohort (albeit in an aleatory manner, see above). Most of the other predicted peptides with high HLA-A * 02:01 binding scores recalled CD8+T cells in low frequencies, and in an aleatory manner with each of these predicted peptides eliciting SFU in only one or two of the 10 test subjects. Except for the pp65495-503 peptide, none of the super-dominant, and few of the dominant responses were recalled by these top 30 HLA-A * 02:01 binding peptides (compare with Table 1 ). One might rightfully argue that this is because those dominant peptides were restricted by, and are binders of alternative class I molecules present in our cohort. We will address this hypothesis below, in Approach 3. In Approach 2, we looked up the predicted HLA-A * 02:01 binding scores for those peptides that have been identified experimentally as HLA-A * 02:01-restricted pp65 epitopes. In S. Table 4 these peptides have been listed according to their predicted HLA-A * 02:01 binding ranking along with CD8+ T cell recall responses they induced in our HLA-A * 02:01 positive cohort. With the exception of the pp65495-503 peptide, none of these peptides were among the predicted top 20 binders. Seeking for a correlation between the predicted HLA-A * 02:01 binding of these peptides, and their actual immune dominance, these data are also represented graphically in S. Figure 3 . No significant correlation was seen. The fact, however, that these peptides were targeted by CD8+ T cells in HLA-A * 02:01 positive subjects establishes that immune dominant epitopes do not need to rank high in peptide binding score. The score apparently needs to be just high enough to enable stable HLA allele binding. In Approach 3, we matched binding predictions for all super-dominant and dominant epitopes detected in each of the 10 subjects with all HLA class I alleles expressed in the subject. The results shown for Subject ID 7 in Figure 1 are fully representative for all other subjects in our cohort (see S. Figure 4) . As can be seen, the pp65495-503 peptide ranked as one of the top binders for several of the subject's alleles (Figure 1, S. Figure 4 ), but few of the other actually targeted CD8+ T cell epitopes ranked amongst the top binders for the class I alleles expressed in these respective subjects, and many super-dominant peptides ranked low. A binding score oriented in silico model would not have sufficed to predict the hierarchy of actual epitope recognition. All three of our above approaches suggest that, at least in the case of CD8+ T cell immunity induced by HCMV infection against its pp65 antigen, in silico predicted high binding scores for a specific HLA class I allele neither predict whether those peptides will indeed induce a CD8+ T cell response, nor the magnitude of it. This finding raises the question how generalizable it is. Mei et al.'s recent report 15 suggests that it may be generalizable as they came to the same conclusion studying the prediction performance of databases containing 21,101 experimentally verified epitopes across 19 HLA class I alleles. We studied HCMV pp65 antigen recognition by CD8+ T cells in HCMV infected subjects at the highest possible resolution, testing every potential epitope and measuring the numbers of all epitope-specific CD8+ T cells. Our data show that fixed epitope hierarchies do not exist even in an HLA-A * 02:01 allele matched cohort. Instead, different super-dominant and dominant epitopes were targeted by the individual test subjects (Table 1) . Previously defined epitopes, and peptides predicted to be high HLA-A * 02:01 binders also were also targeted in some, but not other individuals, if at all (S. Table 4 and Table 3 ). If generalizable, the notion of such unpredictable, aleatory epitope recognition patterns in individuals makes it obsolete for CD8+ T cell immune monitoring to rely on testing a select few peptides. Rather, comprehensive CD8+T cell immune monitoring must be all inclusive, accommodating all potential epitopes on all restriction elements of each test subject. Such can be accomplished by brute force epitope mapping, as we did here, or by the use of mega peptide pools. While in silico epitope ranking may have limited value in predicting immune dominant peptides, it should be helpful for narrowing in on the subset of peptides on an antigen that has sufficient HLA-allele binding affinity to constitute an epitope. As it is impractical to tailor a multitude of variable peptides to each individual's HLA-type, it might be more realistic for immune monitoring to develop rules for identifying peptides that do not bind to any HLA class I allele, so as to exclude those peptides from testing. Being able to narrow in on peptides should be helpful, as the ultimate goal of immune monitoring is to assess the CD8+ T cell response to the entire proteome of complex antigenic systems, such as all protein antigens of viruses and tumors. Table 1 . HCMV ppp65 epitope recognition by CD8+ T cells in HCMV positive, HLA-A * 02:01 positive subjects. Ten subjects' PBMC at 300,000 cells per well were challenged with a library of 553 nonamer peptides that systematically covered all possible CD8+ T cell epitopes of the HCMV ppp65 antigen. An IFN-γ ImmunoSpot assay was performed with the spot forming units (SFU) elicited by each peptide recorded. The mean and SD for 18 negative control media wells, and the cut-off values for the colorcoded response categories are specified on the bottom of this Table. Only peptides that induced at least one dominant (orange) or super-dominant (red) recall response in at least one subject are listed. Peptides that have been described as HLA-A * 02:01 restricted nonamer epitopes in the literature are highlighted in green. Table 2 . HCMV ppp65 epitope category distribution in HCMV positive, HLA-A * 02:01 positive subjects. The number of cryptic, subdominant, dominant and super-dominant epitopes, as defined in the text, are shown for the individual test subjects, along with the sum of epitopes in each category (Total Epitopes Recognized) for each PBMC donor. The absolute number of CD8+ T cells targeting peptides in each category (Cum. SFU) is also shown. From the number of all pp65-specific CD8+ T cells detected in each subject (Cumulative Spec. SFU) the percentage of CD8+ T cells targeting peptides in each of the four response categories has been calculated (% of total SFU). The SFU counts shown are after subtracting the mean + 3 SD specificity cut off value. Because SFU counts for the cryptic category frequently were at the cut-off value, or barely exceeded it, they shown with two decimal places. Table 3 . CD8+ T cell recognition of predicted high HLA-A * 02:01 -binding peptides. All 553 nonamer pp65 peptides in our library were run on the netMHCIpan search engine of the IEDB Analysis Resource for predicting their binding to the HLA-A * 02:01 allele, resulting in "pp65 Rank" shown, with the top binder peptide ranked No. 1. The 30 highest ranking peptides are listed. Additionally, an Percentile Binding Score that compares a peptide's binding relative to the binding scores computed for 1,000 random nonamer peptides, reporting the percentile binding score, is also listed. A lower percentile binding score denotes better peptide binding to the HLA-A * 02:01 allele. Peptides that have been described as HLA-A * 02:01 restricted nonamer epitopes in the literature are highlighted in green and the color-coded response categories defined in Table 1 were applied. Table 1 . The corresponding Percentile Binding Score as established by the netMHCIpan search engine is shown comparing a peptide's binding relative to the binding scores computed for 1,000 random nonamer peptides. A lower percentile binding score denotes better peptide binding to the specified HLA allele. The knowns and unknowns of the immunopathogenesis of tuberculosis. The International Journal of Tuberculosis and Lung Disease An Enhanced ELISPOT assay for sensitive detection of antigen-specific T cell responses to Borrelia burgdorferi A role for antibodies in natural HIV control Direct detection of T-and B-memory lymphocytes by ImmunoSpot® assays reveals HCMV exposure that serum antibodies fail to identify. Cells What are the roles of antibodies versus a durable, high quality T-cell response in protective immunity against SARS-CoV-2? Vaccine Viral and host factors related to the clinical outcome of COVID-19 Not just antibodies: B cells and T cells mediate immunity to COVID-19 Dominance and crypticity of T cell antigenic determinants The UCSC SARS-CoV-2 Genome Browser Towards a systems understanding of MHC class I and MHC class II antigen presentation Translating DRiPs: MHC class I immunosurveillance of pathogens and tumors Human leukocyte antigen gene polymorphism and the histocompatibility laboratory Sequence variability analysis of human class I and class II MHC molecules: functional and structural correlates of amino acid polymorphisms Why peptides? Their possible role in the evolution of MHC-restricted T-cell recognition A comprehensive review and performance evaluation of bioinformatics tools for HLA class I peptide-binding prediction Confronting complexity: real-world immunodominance in antiviral CD8+ T cell responses IEDB-AR: immune epitope database-analysis resource in 2019 EPIMHC: a curated database of MHC-binding peptides for customized computational vaccinology Heterogeneity of the T cell response to immunodominant determinants within hen eggwhite lysozyme of individual syngeneic hybrid F1 mice: implications for autoimmunity and infection Aleatory Epitope Recognition Prevails in Human T Cell Responses? Comprehensive evaluation of the expressed CD8+ T cell epitope space using high-throughput epitope mapping Optimal thawing of cryopreserved peripheral blood mononuclear cells for use in high-throughput human immune monitoring studies A positive control for detection of functional CD4 T cells in PBMC: the CPI pool Log-normal ELISPOT spot size distribution permits count harmonization among different laboratories Normal distribution of CD8+ T-cell-derived ELISPOT counts within replicates justifies the reliance on parametric statistics for identifying positive responses NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data Present Yourself! By MHC Class I and MHC Class II Molecules Analysis of the human cytomegalovirus pp65-directed T-cell response in healthy HLA-A2-positive individuals Combinatorial tetramer staining and mass cytometry analysis facilitate T-cell epitope mapping and characterization Multiplex identification of antigen-specific T cell receptors using a combination of immune assays and immune receptor sequencing A HCMV pp65 polypeptide promotes the expansion of CD4+ and CD8+ T cells across a wide range of HLA specificities CMV-specific T cells generated from naïve T cells recognize atypical epitopes and may be protective in vivo Monitoring of cytomegalovirusspecific CD8+ T-cell response with major histocompatibility complex pentamers in kidney transplant recipients Identification of novel CTL epitopes of CMV-pp65 presented by a variety of HLA alleles Ex vivo profiling of CD8+-T-cell responses to human cytomegalovirus reveals broad and multispecific reactivities in healthy virus carriers Induction of pluripotent protective immunity following immunisation with a chimeric vaccine against human cytomegalovirus Ex vivo expansion of human cytomegalovirus-specific cytotoxic T cells by recombinant polyepitope: implications for HCMV immunotherapy Identification of three HLA-A*0201-restricted cytotoxic T cell epitopes in the cytomegalovirus protein pp65 that are conserved between eight strains of the virus Characterization of antigen-specific repertoire diversity following in vitro restimulation by a recombinant adenovirus expressing human cytomegalovirus pp65 Herpesvirus-specific CD8 T cell immunity in old age: cytomegalovirus impairs the response to a coresident EBV infection Optimization of an elispot assay to detect cytomegalovirus-specific CD8+ T lymphocytes Dysfunctional CMVspecific CD8(+) T cells accumulate in the elderly Immediate early effector functions of virus-specific CD8+CCR7+ memory cells in humans defined by HLA and CC chemokine ligand 19 tetramers Major contribution of codominant CD8 and CD4 T cell epitopes to the human cytomegalovirus-specific T cell repertoire Highthroughput determination of the antigen specificities of T cell receptors in single cells We thank Ruliang Li of Cellular Technology Limited for expert technical assistance, Greg A. Kirchenbaum and Alexey Y. Karulin for valuable discussions, and Diana R. Roen for editorial assistance. Percentile Binding Score S. Table 1 . HLA class I allotypes and other characteristics of human subjects tested in this study.S. Figure 1 : Schematic representation of brute force CD8+ T cell epitope mapping. The amino acid sequence of the protein, illustrated on the top, is covered with nonamer peptides that walk the sequence in steps of single amino acids.S. Figure 2 . Frequency of HCMV pp65495-503 peptide-specific CD8+ T cells vs. HCMV grade 2 antigenreactive T cells. Fifty-two subjects were selected from the ePBMC database for being HLA-A * 02:01 positive and responding to HCMV Grade 2 antigen with more than 100 SFU/300,000 PBMC. Each of these subjects' PBMC (represented by a dot) were re-tested in an ImmunoSpot assay for the numbers of SFU triggered by HCMV Grade 2 antigen (shown on the X axis), and the numbers of SFU elicited by the pp65495-503 peptide (shown on the Y axis).S. Table 2 : Subdominant pp65 epitopes recognized in our cohort of ten HCMV positive, HLA-A * 02:01 positive subjects. Peptides that elicited CD8+ T cell recall responses between 5-10 SD over the media background in at least one of the test subjects are highlighted in yellow. Cryptic recall responses (SD 3-5 over background) triggered by these peptides are highlighted in beige. Peptides that elicited SFU counts exceeding 10 SD over background are not shown in this Table, as they are listed in Table 1 . Peptides that only elicited cryptic recall responses (mean plus 3-5 SD) are not shown here, but are summarized in Table 2 . Otherwise the legend to Table 1 applies. S. Table 3 : Percentage of pp65-specific CD8+ T cells targeting individual epitopes. The total number of pp65-specific CD8+ T cells was calculated from the sum of SFU triggered by all epitopes in each subject as detailed in Table 2 . The percentages of Cumulative Specific SFU counts elicited by individual peptides in each test subjects are shown. For the corresponding absolute SFU counts see Table 1 . Otherwise, the legend to Table 1 applies.S. Table 4 . Actual recognition of previously identified HLA-A * 02:01-restricted nonamer epitopes in our HCMV positive, HLA-A * 02:01 positive cohort. The listed peptides have been identified in the IEDB database as HLA-A * 02:01-restricted epitopes and the corresponding publications are specified. All 553 nonamer pp65 peptides in our library were run on IEDB's netMHCIpan search engine for predicting their binding to the HLA-A * 02:01 allele, resulting in the "pp65 Rank" shown, with the top binding peptide ranked No. 1. The corresponding Percentile Binding Score is shown comparing each peptide's binding relative to the binding scores computed for 1,000 random nonamer peptides. A lower percentile binding score denotes better peptide binding to HLA-A * 02:01. Otherwise, the legend to Table 1 Table 1 . The IEBD Rank shown for each peptide and allele was established using the netMHCIpan search engine predicting the peptides' binding score to the respective HLA allele, whereby a lower Percentile Binding Score binding score denotes better peptide binding.