key: cord-0980881-sd2uvd3u authors: Cason, John title: Strategies for mapping and imitating viral B-cell epitopes date: 1994-09-30 journal: Journal of Virological Methods DOI: 10.1016/0166-0934(94)90045-0 sha: eeeebd8a55d2db50df0c856ff092020400b34ef7 doc_id: 980881 cord_uid: sd2uvd3u Abstract Identification of viral B-cell epitopes is of importance in the selection of peptides for inclusion into subunit vaccines, the development of virus-specific serological tests and understanding the interaction of antibodies with viruses at a molecular level. B-cell epitopes can often be determined unequivocally by X-ray analyses of antibody-antigen complexes. This technique is, however, time-consuming and alternative strategies have now been developed for identifying epitopes. This article provides an overview of approaches which are currently available for mapping and imitating B-cell epitopes. Viral infection usually results in the production of specific glycoproteins (antibodies, assembled from four constant and four variable polypeptide chains) by the host's B-lymphocytes. The variable regions of antibodies form two receptors (paratopes) which are able to bind, predominantly via hydrogen bonds, to discrete regions (epitopes, antigenic determinants) of a foreign agent and often promote its elimination from the body. Whilst a concise definition of what constitutes an epitope is a contentious issue, for the purposes of this paper an (proteinacious) epitope is presumed to comprise of those amino acids of an antigen which are involved in bond formation with residues of a corresponding paratope. There are also conflicting views upon how a paratope actually gains access and binds to an epitope. As reviewed by Van Regenmortel (1989) , some believe this interaction may be viewed as a 'lock-and-key' system (whereby a paratope recognizes a fixed region of an antigen which has a directly complementary shape or chemical characteristic). Others envisage an 'induced-fit' model, such that antibody binding is dependent upon slight movements of side chains, or backbone, of the antigen to accommodate antibody (i.e., akin to the adaptation in shape necessary to position a hand inside a glove). What is clear is that during a natural infection the vast majority of antigen-specific antibodies are directed against conformation-dependent (assembled) epitopes present on native st~ctures @aver et al., 1990) . That is, epitopes which depend upon inter-and/or intra-molecular interactions which permit distant (in terms of linear sequence) amino acids to come into close proximity to form an antigenic determinant. It is, however, equally apparent that another type of epitope also exists which bears a much closer resemblance to a protein's primary structure. Such linear (or continuous) epitopes consist of five to eight contiguous residues of a protein's amino acid sequence. This division between conformation-dependent and linear epitopes may be somewhat artificial in the case of antibodies induced by an intact protein, since any linear epitopes recognized may represent constituent parts of co~ormation-dependeni epitopes. Antibodies to linear epitopes are of considerable value as they are easy to prepare. Many workers have used denatured proteins or synthetic peptides (i.e., both essentially FSR: further amino acid or DNA sequencing needed in order to identify the epitope; C: ability to mimic conformation-dependent or, L: linear eptitopes. * Whilst all mapping techniques produce a defined epitope, those techniques dependent upon the derived or random deduction of an epitope may not identify a sequence actually present in the original protein immunogen. linear immunogens) to produce specific antisera or monoclonal antibodies (mAb) in experimental animals. Such antibodies are useful as molecular probes and can sometimes have biological activity. Identification of linear immunogens that can induce neutralizing antibodies to intact virus particles may be of considerable interest to those concerned with the design of subunit viral vaccines. Currently, sequence data of medically important antigens are being accrued at a hitherto unprecedented rate and it is now relatively easy to produce recombinant DNA-derived antigens and their corresponding antibodies. The established method of epitope mapping, X-ray crystallography, suffers from being relatively slow. Accordingly, the techniques of molecular biology and organic chemistry have been utilized to develop strategies for identifying antigenic determinants. Most approaches are currently biased toward the identification of proteinacious epitopes on viruses, due to extensive pre-existent knowledge of protein chemistry and the limited size of viral genomes (which facilitates rapid sequencing and deduction of potential antigens). Here, techniques for mapping epitopes are reviewed and subdivided into those applicable to three scenarios, namely where native viral antigen is: (1) characterized and readily available, (2) known, but unavailable and, (3) completely unknown. Some of the relative merits of the different epitope mapping methods described below are summarized (Table 1) . The structure of conformation-dependent epitopes can be often be elucidated with certainty by X-ray crystallographic studies of antigen-Fab (Fab: the Fragment antigen binding of an antibody) complexes (Laver et al., 1990) . This approach permits detailed analyses of antigen-antibody interactions and has been used to solve the three-dimensional structure of an idiotype/anti-idiotype complex (Bentley et al., 1990) and an epitope on rotavirus (Prasad et al., 1990) . However, X-ray crystallography is expensive, slow, restricted to specialized departments and does not always provide concise results: for example the neutralizing epitopes on foot-and-mouth disease virus could not be clearly established by this technique (Parry et al., 1992) . Furthermore, on occasions antigen-Fab crystals can prove difficult -if not impossible -to prepare. Despite these problems X-ray crystallography, in comparison to reductive epitope mapping methods (below), is believed to provide the most accurate view of epitope/paratope interactions, since it permits the topographical localization of epitopes in the context of an intact native antigen. Infectious agents can be investigated for their ability to mutate after exposure to a neutralizing antibody (in vitro or in vivo). The rationale behind this approach is that any nucleic acid changes which result in amino acid coding differences between an escape mutant and the original isolate contribute to an epitope recognized by a neutralizing Form: Drapeau et al., 1972; Lehninger, 1976; Boehringer, 1991 . X: any amino acid; * non-enzymic cleavage. antibody. This method is particularly useful for studies of RNA viruses as they have no proof-reading functions, which results in greater variation in RNA genomes (whilst DNA viruses may have a similar propensity to mutate any changes are corrected). Interpretation of such results may conceivably be complicated by the occurrence of random mutations at sites which form no part of a neutralizing epitope. Nevertheless, this technique has successfully been used to define neutralizing epitopes of poliovirus (Minor et al., 1985 (Minor et al., , 1986 , foot-and-mouth disease virus (Xie et al., 1987) and avian bronchitis virus (Kant et al., 1992) . Protein antigens can be dissected using enzymes that cut proteins between specified amino acid residues (e.g., Table 2 ) to produce fragments whose molecular weight can be calculated from the protein's amino acid sequence. By subjecting the resultant polypeptides to polyacrylamide gel electrophoresis and Western blotting, epitopes can be localized on an intact protein from the apparent molecular weight of immunoreactive fragments. Analyses of such proteolytic cleavage products have led to the identification P: prokaryotic or, E: eukaryotic expression system. Epitopes, L: predominantly linear, C: conformation-dependent. VLP: virus-like protein structures. Corn: various kits and vectors available commercially; * common plasmids for expression include pEX produces a P-galactosidase fusion protein; PATH: produces a tryptophan-E-synthetase fusion protein; pGEX: produces a glutathione-S-trnsferase fusion protein; EBV: Epstein-Barr virus; BPV: bovine papillomavirus. of group-specific antigenic determinants on the major capsid protein of group A rotaviruses (Kohli et al., 1992) . It should, however, be noted that this and other (see below) similar epitope mapping techniques are limited in that what is measured is actually the cross-reactivity of an antibody (to a complete protein) with polypeptide fragments. It is purely an assumption that such fragments assume structures even similar to those which they adopt in the context of their position within an intact protein. DNA or RNA coding for an otherwise unavailable (or even unknown) protein antigen can usually be amplified, cloned and sequenced. Recombinant proteins and protein sub-regions may then be constructed for use in epitope mapping studies. The most striking example of the effectiveness of this strategy is the isolation, characterization, and development of a serological test for a previously unidentified pathogenic virus, namely hepatitis C (Choo et al., 1989; Alter et al., 1989 ). Nucleic acid coding for a protein can be cloned into one of a variety of prokaryotic or eukaryotic expression vectors, many of which are now available commercially (Table 3) . The most easily manipulated and commonly used systems are those which involve expression of proteins in E. coli recombinant plasmids. Such expression systems are advantageous, as large quantities of recombinant protein can be prepared quickly and cheaply. However, other systems may prove to be more appropriate if, for example, it is suspected that an epitope may be influenced by post-translational modifications such as glycosylation, phosphorylation or myristilation. In such a case, eukaryotic expression via the commercially available baculovirus/insect cell or yeast kits, permit the production of high levels of post-transcriptionally modified viral proteins which can sometimes self-assemble to form virus-like particles (Carter et al., 1991; Kirnbauer et al., 1992) . Panels of subregions of a protein antigen can be produced from restriction endonuclease digests of coding DNA in order to map B-cell epitopes. Such recombinant protein subregions have been used to explain serological differences between strains of human cytomegalovirus (Basgoz et al., 1992) . Alternatively, proteolytic cleavage (above) can be employed to determine the position of epitopes, though this may be complicated if fusion proteins are used. Another approach is to digest an antigen's coding DNA with BAL-31 exonuclease (which selectively cleaves nucleotides from the 3' terminus) and remove samples at intervals prior to insertion into a plasmid. This permits the production of a panel of polypeptides of varying length (successively truncated from the carboxylic terminus) which can be used to screen for an immunoreactive region. DNA coding for this region can then be sequenced and an epitope determined. Use of BAL-31 deletion mutants of the VP2 protein of human rhinovirus type-2 have resulted in the identification of a neutralizing epitope @kern et al., 1987). With only a rudimentary knowledge of organic chemistry it is possible to construct small peptides corresponding to subregions of a protein antigen using f-mot or t-hoc chemistry. Peptides are often synthesized as a series of nested 15-mers with a 5 amino acid overlap, so as to encompass the entire protein under investigation. Peptides can then be used as antigens in enzyme-linked immunosorbent assays (EIA) to deduce an epitope: this approach is expensive but has been used to define a type-restricted epitope on the major coat protein of human papillomavirus type-16 (Cason et al., 1989) . The development by Geysen et al. (1987) of the pin system (PEPSCAN) has revolutionized this form of epitope mapping as it permits the rapid and cheap construction of hundreds of peptides. Briefly, peptides are constructed on polyethylene pins which are then used in situ as antigens in EIAs. Whilst peptides are generally considered to be primarily of value for the identification of linear antigenic determinants, recent work has shown that aggregated peptides may also be able to imitate conformation-dependent epitopes (Rao et al., 1992) . Having identified an epitope the relative contribution of each of its amino acids to antibody binding can be investigated by synthesizing artificial mutant peptides. For a hexameric epitope, a set of 120 peptides may be produced in which each constituent amino acid is sequentially replaced by the remaining 19 possible amino acids whilst keeping the other five amino acids unchanged (Geysen et al., 1987) . Similarly, a minimum binding site for an antibody can be identified by creating panels of peptides sequentially truncated from the amino and/or carboxylic terminus. Synthetic peptides have been employed to localize antigenic determinants recognized by neutralizing mAbs to foot-and-mouth disease virus (Meloen et al., 1987) , rhinovirus- (Sherry et al., 1986) , human immunodeficiency virus (Papsidero et al., 1989) , murine coronaviruses (Luytjes et al., 1989 ) and a type-restricted epitope on human papillomavirus type 16 (Cason et al., 1989) . Whilst peptide mapping techniques are particularly useful for identifying epitopes recognized by mAb or polyclonal antisera derived from hyperimmune animals, the application of this technology for identifying epitopes recognized by antibodies in human sera can often prove to be problematic due to high levels of antibody binding to peptides per se. Synthetic peptides to selected regions of a protein can also be used as immunogens to raise epitope-defined antibodies. Such anti-peptide probes can then be used to investigate the expression, or biologic function, of a particular epitope on a native protein or virion. This method has been used to study the neutralizing activity of mAbs to selected peptide subregions on human immunodeficiency virus (Langedijk et al., 1991) , and antibodies to certain linear B-cell epitopes have neutralizing activity for foot-and-mouth disease virus (Doe1 et al., 1988) influenza virus (Muller et al., 1990) and bovine papillomavirus type-2 (Cason et al., 1993) . Situations can arise when a mAb of interest is obtained using a crude mixture of proteins such as a cell homogenate from a clinical lesion. Whilst the original immunogenic protein may be unknown, epitope mapping strategies have been devised which are independent of any prior knowledge of an antigen's characteristics. In practice these techniques involve either presenting a large number of peptide shapes to the antibody under study, or deriving an epitope mimic (mimotope) by a process of iteration. The spectrum of shapes capable of acting as epitopes is limited by size since -by definition -they must be able to fit into the corresponding paratope. The repertoire of shapes is further reduced by the host's down-regulation of autoreactive paratopes and also by the acquisition of tolerance to other determinants. Despite these restraints, an individual animal is still thought to be capable of producing specific antibodies to about lo8 unique epitopes (Roitt et al., 1985) . As noted above, the majority of antibodies are usually directed against conformationdependent epitopes. However, there is no reason to believe that random peptides are unable to adopt shapes that are similar -or even identical -to conformation-dependent and linear epitopes. The approximate number of residues of an epitope in contact with a paratope is about six amino acids, thus if one creates a random-sequence hexameric peptide library comprising the 20 amino acids coded for by DNA (i.e., the l-stereoisomers), some 6.04 X 10' unique amino acid sequences may be generated. If it is assumed that each of these sequences forms an unique three-dimensional shape, random hexameric peptides may mimic approximately 60% of the epitopes recognized by the B-cell repertoire. Such approaches may be advantageous, since it may also be possible to mimic non-proteinacious epitopes. As it has been shown that antibodies are capable of reproducibly binding to dipeptides, a bank of antigenic shapes can be prepared by constructing just 480 peptides. The first two amino acids of a hexamer are defined by constructing 400 peptides which J. Cason/Journal of Virological Methods 49 (1994) 209-220 contain all possible amino acid combinations of the first two positions, whilst the remaining four positions are synthesized using an equimolar mixture of all 20 amino acids. This panel of peptides is then screened using the antibody under investigation and the most reactive defined dimeric sequence is retained. The third amino acid position is varied for all 20 amino acids and the remaining 3 residues are produced from an equimolar mix of all amino acids. After rescreening and deduction of the most appropriate third amino acid, the 20 variants of the fourth position are tested and one selected and so on, until the complete optimal-binding hexameric epitope is derived (Houghten et al., 1991) . Random pentameric peptide sequences are synthesized onto small resin beads, so that each bead is coated with a single peptide species. This is achieved by coupling 19 amino acids (cysteine being omitted to eliminate disulphide cross-linking) individually to 19 aliquots of beads. The aliquots are mixed, redivided into 19 samples, the second amino acid coupled and the process repeated until a pentapeptide library is completed. In this way a library of 195 (2.47 X 106) random peptides can be produced. This method has the advantage that an individual bead should bear only a single peptide species. Antigenicity of peptide-coated beads can then be identified by a localized colour reaction by EIA upon microscopic examination. Immunoreactive bead(s) are then retrieved, washed free of antibody with urea, and the amino acid sequence of the antigenic determinant/antigen mimic can be identified using a microsequencer (Lam et al., 1991) . Random peptides may also be expressed using a fd vector which is related to the Ml3 family of filamentous bacteriophages. DNA encoding a peptide can be introduced between the two domains of the pII1 protein (Smith, 1985) or, near the 5' terminus of pII1 (Parmley and Smith, 1988 ). Short oligonucleotides encoding peptide sequences inserted at these regions are expressed on the phage surface. By inserting random oligonucleotides coding for peptide sequences of 6 to 15 amino acids, it is possible to generate libraries of recombinant phages which express small random amino acid sequences on their surface (Scott and Smith, 1990; Cwirla et al., 1990) . Samples of individual colonies can be transferred to nitrocellulose sheets and tested with the antibody under investigation by a dot-blot assay. Colonies expressing the appropriate epitope can then be identified and, after DNA sequencing, the amino acid sequence derived. A potential problem associated with the random peptide-based approaches described above is that it is not uncommon for an antibody to bind to epitopes other than the one against which it was originally raised, a phenomenon known as heterospecificity or heterocliticity (Makela, 1965; Al-Moudallal et al., 1982; Harper et al., 1987) . Hence, it is possible to identify antigenic determinants whose amino acid sequence does not occur in that of the original protein immunogen. Furthermore, as these techniques have not proved popular it is presently difficult to assess how effective they actually are. As noted above, antibodies bind reproducibly to dipeptides, thus it is possible to select (from all 400 possible dipeptides) one dimer which is best recognized by a mAb. To increase the spectrum of antigenic shapes which can be imitated in a given peptide unnatural amino acids (i.e., o-stereoisomers) can also be included. For an identified dipeptide 'epitope', all four possible combinations of L-and D-(the unnatural stereoisomers) amino acids for a dipeptide can be tested and the sequence exhibiting highest mAb binding selected. This derived dipeptide can then be tested after the addition of each of the 40 possible D-or L-amino acids added to the carboxylic or amino terminus (with, or without, alanine spacers) to determine the optimal third and fourth residues. The resulting 4-mer peptide is then selected as the mimotope (Geysen et al., 1986 (Geysen et al., , 1987 . Again this epitope mimicking approach has not proved popular amongst experamentalists and its value is hence difficult to evaluate. This article provides an overview of the approaches currently available for the identification of B-cell epitopes. At present the newer (reductive) B-cell mapping techniques appear to be very effective for mapping epitopes recognized by mAbs or polyclonal antisera raised against linear proteinacious epitopes -an obvious limitation. Whilst the random peptide and mimotope approaches may be of value for deriving mimics of conformation-dependent epitopes, new techniques are required so that this form of antigenic determinant can be mapped with accuracy. One possible answer could involve the generation of large numbers of mutant proteins to localize those amino acids, which are crucial for antibody binding. Advances in molecular modelling techniques using computer graphics may also enable the identification of an antigenic determinant on a sequenced protein which is recognized by a (DNA sequenced) mAb paratope. Another area which remains relatively unexplored is that of mapping epitopes on non-proteinaceous antigens: this will no doubt progress in parallel with advances in sugar and lipid biochemistry. Paradoxically, peptide strategies for mapping B-cell epitopes, as described above, are now realized to be most appropriate for the identification of T-cell epitopes, since these are to consist of short peptides rather than conformation-dependent structures. Indeed, the PEPSCAN method has now been modified to permit cleavage of peptides from polyethylene pins so that they can be used to determine T-helper and T-cytotoxic cell determinants on a foreign protein. Monoclonal antibodies as probes of the antigenic structure of tobacco mosaic virus Detection of antibodies to hepatitis C virus in prospectively followed transfusion recipients with acute and chronic non-A non-B hepatitis The amino terminus of human cytomegalovirus glycoprotein B contains epitopes that vary among strains Th ree dimensional structure of an idiotope/anti-idiotope complex Boehringer Mannheim catalogue Expression of human papillomavirus proteins in yeast Saccharomyces cereuisiae Mapping of linear B-cell epitopes on capsid proteins of bovine papillomavirus: identification of three external type-restricted epitopes Identification of immunogenic regions of the major coat protein of human papillomavirus type 16 that contain type-restricted epitopes Isolation of a cDNA clone derived from a blood borne non-A non-B viral hepatitis genome Peptides on phage, a vast library of peptides for identifying ligands Immunization against foot-and-mouth disease virus with synthetic peptides representing the C-terminal region of VPl Purification and properties of an extracellular protease of Staphlococcus aureus A priori delineation of a peptide which mimics a discontinuous antigenic determinant Strategies for epitope analysis using synthetic peptide synthesis Antigen specificity and cross reactivity of monoclonal antilysozyme antibodies Generation and use of synthetic peptide combinatorial libraries for basic research and drug discovery Localization of antigenic sites defined by neutralizing monoclonal antibodies on the Sl avian infectious bronchitis virus glycopeptide Papillomavirus Ll major capsid protein self assembles into virus like particles that are highly immunogenic Localization of group specific antigens on the major capsid protein of group A rotavirus A new type of synthetic peptide library for identifying ligand-binding activity Neutralizing activity of antipeptide antibodies against the principle neutralization domain of human immunodeficiency virus type 1 Epitopes on protein antigens: misconceptions and realities Amino acid sequence of a conserved neutralizing epitope of murine coronaviruses Single lymph node cells producing heteroclitic bacteriophage antibody Antigenicity and immunogenicity of synthetic peptides of foot-and-mouth disease virus Antigenic structure of polioviruses of serotypes 1, 2 and 3 Principal and subsidiary antigenic sites of VP1 involved in the neutralization of poliovirus type 3 Antigenic properties and protective capacity of a cyclic peptide corresponding to site A of influenza virus haeagglutinin Structural and serological evidence of a novel mechanism of antigenic variation in foot-and-mouth disease virus Human immunodeficiency virus type-l neutralizing monoclonal antibodies which react with p-17 core protein: characterization and epitope mapping Antibody-selectable filamentous fd phage vectors: affinity purification of target genes Localization of vp4 neuytralization in rotavirus by three dimensional cryo-electron microscopy Macromolecular self-association of a synthetic peptide derived from hepatitis B surface antigen: construction of a quatemary epitope Immunology. The generation of antibody diversity Searching for peptide ligands with an epitope library Use of monoclonal antibodies to identify four neutralization immunogens on a common cold picomavirus, human rhinovirus 14 A neutralizing epitope on human rhinovirus type 2 includes amino acid residues between 153 and 164 of virus capsid protein VP2 Filamentous fusion phage: novel expression vectors that display cloned antigens on the virion surface Structural and functional approaches to the study of protein antigenicity Neutralization of foot-and-mouth disease virus can be mediated through any of at least three separate antigenic sites