key: cord-0856047-0vgesp97 authors: Fogarty, Carl A; Fadda, Elisa title: Oligomannose N-Glycans 3D Architecture and Its Response to the FcγRIIIa Structural Landscape date: 2021-03-04 journal: J Phys Chem B DOI: 10.1021/acs.jpcb.1c00304 sha: afdfad59734d65ecece96a714cd5573c1b63f2e7 doc_id: 856047 cord_uid: 0vgesp97 [Image: see text] Oligomannoses are evolutionarily the oldest class of N-glycans, where the arms of the common pentasaccharide unit, i.e., Manα(1–6)-[Manα(1–3)]-Manβ(1–4)-GlcNAcβ(1–4)-GlcNAcβ1-Asn, are functionalized exclusively with branched arrangements of mannose (Man) monosaccharide units. In mammalian species oligomannose N-glycans can have up to 9 Man; meanwhile structures can grow to over 200 units in yeast mannan. The highly dynamic nature, branching complexity, and 3D structure of oligomannoses have been recently highlighted for their roles in immune escape and infectivity of enveloped viruses, such as HIV-1 and SARS-CoV2. The architectural features that allow these N-glycans to perform their functions are yet unclear, due to their intrinsically disordered nature that hinders their structural characterization. In this work we will discuss the results of over 54 μs of cumulative sampling by molecular dynamics (MD) simulations of differently processed, free (not protein-linked) oligomannose N-glycans common in vertebrates. We then discuss the effects of a protein surface on their structural equilibria based on over 4 μs cumulative MD sampling of the fully glycosylated CD16a Fc γ receptor (FcγRIIIa), where the type of glycosylation is known to modulate its binding affinity for IgG1s, regulating the antibody-dependent cellular cytotoxicity (ADCC). Our results show that the protein’s structural constraints shift the oligomannoses conformational ensemble to promote conformers that satisfy the steric requirements and hydrogen bonding networks demanded by the protein’s surface landscape. More importantly, we find that the protein does not actively distort the N-glycans into structures not populated in the unlinked forms in solution. Ultimately, the highly populated conformations of the Man5 linked glycans support experimental evidence of high levels of hybrid complex forms at N45 and show a specific presentation of the arms at N162, which may be involved in mediating binding affinity to the IgG1 Fc. Complex carbohydrates (or glycans) are the most abundant biomolecules in nature. Within a human biology context, glycans coat cell membranes and protein surfaces, mediating a myriad of essential biological processes in health and disease states. 1−6 N-glycosylation is one of the most abundant and diverse type of post-translational modification that can affect protein trafficking and structural stability and mediate interactions with different receptors. 6−11 N-glycan recognition and binding affinities are often highly specific to their sequence, intended as the types of monosaccharides, their stereochemistry, and branching patterns, 12 a principle that has been successfully exploited in the development of glycan microarray technology. 13 Molecular recognition is fundamentally dependent, among other considerations, on structural and electrostatic complementarity between the ligand and the receptor's binding site. Within this framework, the prediction and characterization of glycan binding specificity are an extremely difficult task, due to their high degree of flexibility or intrinsic disorder, which hinders our ability to determine their 3D structure by means of experimental techniques. Indeed, glycans can only be structurally resolved in their entirety only when tightly bound to a receptor, thus when their conformational degrees of freedom are heavily restrained. Because of their inherent flexibility, free glycans can adopt different 3D structures within a weighted conformational ensemble, which cannot be determined with currently available experimental methods, although very promising steps forward have been recently made in advancing imaging techniques for single glycans. 14, 15 High performance computing (HPC) molecular simulations can contribute a great deal toward our understanding of the relationships between glycans' sequence, structure, and function. Indeed, conformational sampling through conventional and/or enhanced molecular dynamics (MD) schemes allows us to characterize the dynamic behavior of different glycoforms at the atomistic level of details. Within this context, for the past few years our lab contributed to the knowledge of N-glycans dynamics by providing information on their 3D architecture and relative flexibility from extensive MD-based conformational sampling. 16, 17 As an example, we have shown how the sequence (and branching) of complex N-glycans determines the 3D structure, which in turn drives their recognition. 16, 17 In this work we extend our data set of free (unlinked) N-glycans structures to the vertebrate oligomannose type, where, as shown in Figure 1 , the common pentasaccharide unit, i.e., Manα(1−6)-[Manα(1−3)]-Manβ(1−4)-GlcNAcβ(1−4)-GlcNAcβ1-Asn, is functionalized by a branched arrangement of only Man units. In addition, we also determine how the protein surface landscape affects their conformational dynamics, which is a very important question in terms of its impact on molecular recognition and function while challenging to answer in absolute terms because of the site-specific character. Oligomannoses are often defined as "immature" N-glycans, as they are processed toward complex functionalization in the Golgi 6 and are not abundant in vertebrates. Nevertheless, these N-glycans are a common post-translational modification of viral envelope proteins expressed in human cell lines; 20,21 for example, it is the prevalent type of glycosylation of the HIV-1 fusion trimer. 22−25 Furthermore, an increase in large oligomannose-type N-glycosylation in humans has been linked to breast cancer progression 26−28 and can occur where the protein landscape at the N-glycan site does not allow easy access to the required glycohydrolases and glycotransferases for further functionalization. 6, 29, 30 Interestingly, recent work has shown that oligomannose N-glycans functionalizing CD16a low-affinity Fc γ receptors (FcγIIIa) determine an increase in IgG1-binding affinity by 51-fold, 31 relative to the more common complex N-glycans, 32 although the Nglycosylation composition varies depending on the glycosylation site. 32 In this work we have studied the effect of the FcγIIIa protein surface landscape on the intrinsic conformational propensity of different oligomannose N-glycans we determined for the unlinked forms. Our results show that the two FcγIIIa Nglycosylation sites, N45 and N162, affect the oligomannose dynamics rather differently, in function of the structural constrains of the sites and of the 3D architecture of the glycan. More specifically, we find that the protein landscape affects the glycans conformational equilibrium by promoting structures that are complementary to it and not by actively changing their intrinsic architecture. Indeed, all the 3D conformers observed in the analysis of the bound oligomannoses are always identified in the simulations of the corresponding unlinked forms in solution, although in different populations. Interestingly, we also determined that the progressive elongation of the arms/branches promotes interarm contacts, where the Man9 3D architecture is almost entirely structured with interacting arms. Finally, these findings fit very well within the framework of our recently proposed "glycoblocks" glycans structure representation, 16 whereby groups of specifically linked monosaccharides within N-glycans represent independent structural elements (or glycoblocks), whose exposure, or presentation in function of the particular protein landscape, drives molecular recognition. 36, 37 and also because of consistency with our previous work, 16, 17 we consider the choice of GLYCAM06-j1/TIP3P parameter set as appropriate. All simulations were run in 200 mM NaCl salt concentration, with counterions represented by AMBER parameters 38 in a cubic simulation box of 16 Å sides. Long range electrostatic were treated by particle mesh Ewald (PME) with cutoff set at 11 Å and a B-spline interpolation for mapping particles to and from the mesh of order of 4. van der Waals (vdW) interactions were cut off at 11 Å. The MD trajectories were generated by Langevin dynamics with collision frequency of 1.0 ps −1 . Pressure was kept constant by isotropic pressure scaling with a pressure relaxation time of 2.0 ps. After an initial 500.000 cycles of steepest descent energy minimization, with all protein/glycans heavy atoms restrained by a harmonic potential with a force constant of 5 kcal mol −1 Å −2 , the system was heated in two stages, i.e., from 0 to 100 K over 500 ps at constant volume and then from 100 to 300 K over 500 ps at constant pressure. After the heating phase, all restraints were removed and the system was allowed to equilibrate for 5 ns at 300 K and at 1 atm of pressure. Production and subsequent analysis were done on 500 ns trajectories run in parallel for each uncorrelated starting structure, i.e., each conformer generated with GLYCAM-Web. Analysis was done using the cpptraj tool and with VMD 39 Figure 1c , to obtain two systems, one with only Man5 and the other with only Man9 at both positions. As a note, the structure of the N-glycan at N165 from the PDB structure is quite distorted with uncommon ring conformations of some of the monosaccharides, probably resulting from the fitting to the electron density; therefore it was disregarded and only the chitobiose was used for structural alignment. These systems were run in duplicates from uncorrelated starting structures with the same simulation protocol used for the free glycans. Production runs were extended to 1 μs for each trajectory for a total of 4 μs of cumulative sampling time. All simulations were run on NVIDIA Tesla V100 16GB PCIe (Volta architecture) GPUs on resources from the Irish Centre for High-End Computing (ICHEC) (www.ichec.ie). We used conventional MD simulations, run in parallel for 500 ns from nine uncorrelated starting points, 16, 17 to characterize the 3D structure and dynamics of human oligomannose Nglycans, when unlinked; see Figures 1 and S.1. The effects of the protein on their intrinsic dynamics were studied on two models with Man5 and Man9 linked to the human FcγIIIa protein on the two N-glycosylation sites, namely, N45 and N162; see Figure 1c . This section is organized as follows, first we present the results obtained for the unlinked oligomannoses, starting with Man5 that we used as a reference to describe sequence-to-structure changes in the larger forms. The subset of representative isomers shown in Figure 1 is presented here for simplicity, while the complete analysis of all positional isomers with heat maps and tables is included as Supporting Information. The section concludes with the results obtained for Man5 and Man9 when linked to the FcγRIIIa. Man5 is the simplest oligomannose found in vertebrates and the substrate of GlcNAc transferase I (GnTI), responsible for starting the N-glycan complex functionalization in the Golgi. 6 As found for complex biantennary N-glycans, 16,17 the Man5 The Journal of Physical Chemistry B pubs.acs.org/JPCB Article chitobiose core and the following Manβ(1−4)-GlcNAc linkage are rigid with only one conformation significantly occupied, see Figure 2 and Table S .1, while the (1−3) arm adopts an outstretched conformation with flexibility in a range of 40°a round the ψ torsion angle; see Figure 2 and Table 1 . The Man5 (1−6) arm has a relatively more complex dynamics, hinging around the preferential "open" conformation, 16, 17 populated at 82%, where the Manα(1−3)-Man branch can be orientated toward the front of the page and the Manα(1− 6)-Man branch toward the back of the page or vice versa. We also identified two alternative, less populated conformers, namely, a "front fold" (ϕ = 79°, ψ = 87°) with a relative population of 12% and a "back fold" (ϕ = 83°, ψ = −76°) with a relative population of 6%; see Figure 2 and Table 1 Figure 4 and Tables S.2 and S.6, both Manα(1−2)-Man linkages occupy two conformers, one at (ϕ = 74°, ψ = 151°) and the other at (ϕ = 70°, ψ = 107°) with a relative population of 73% and 27% for Man 6, respectively, and one at (ϕ = 74°, ψ = 151°) and the other at (ϕ = 70°, ψ = 106°) with population of 76% and 24% for Man 7, respectively. As shown by the population analysis in Tables S.2 Table 2 and Tables S.9 and S.13, with a slightly more pronounced preference for the back vs front fold in Man9, due to the interactions of the longer (1−6) branch with the chitobiose, see Figure 5 and Table 2 . The structures of all Manα(1−2)-Man linkages are the same as described for Man6(I) and Man7(I), yet the elongation of both branches on the (1−6) arm with relatively rigid FcγRIIIa-Linked Man5/9. The FcγRIIIa (CD16a) is a cellbound receptor responsible for modulating antibody-dependent cellular cytotoxicity (ADCC) through its interaction with the IgG1 Fc region. 4 Recent studies have shown that the FcγRIIIa glycosylation contributes to the binding to IgG1s by stabilizing the interaction to a degree that is highly dependent on the type of the N-glycans present. 31, 40, 41 Human FcγRIIIa is glycosylated on two sites, namely, N45 and N162; see Figure 1 . These two sites are very different in terms of their surrounding protein landscape; while N162 is highly exposed to the solvent, N45 is located in the core of one of the two structural domains. To understand the effect of the protein surface landscape on the oligomannoses structure and dynamics, we studied two FcγRIIIa glycoforms, one with Man5 at N45 and N162 and the other with Man9 at N45 and N162. As shown in Figure 7 , results obtained from 2 μs of cumulative sampling from two independent runs show that the conformational dynamics of the Man5 at N45 is significantly restrained compared to the unlinked form. Indeed, a network of hydrogen bonds connects the terminal Man on the (1−3) branch of the (1−6) arm within a protein's cleft located conformation; see Figure 7 and Table 3 . The flexibility of the (1−6) branch and of the (1−3) arm, not interacting with the protein, is the same as found for the unlinked Man5; see also Figure 2 . Man9 has two Manα(1−2)-Man linkages elongating both branches on the (1−6) arm, denying the pose found for Man5, which indeed disappears; see Tables 3 and S.13. Despite a higher flexibility relative to Man5, the N45-linked Man9 is less dynamic relative to the unlinked form due to the protein's landscape. Indeed, as shown in Table 3 , only three out of the seven populated conformers are accessible. As shown in Figure 1 , the N162 position is much more exposed to the solvent relative to N45. Consequently, the intrinsic dynamics of the N162-Man5 is almost entirely retained, with a shift promoting the open (cluster 2) relative to the open (cluster 1) as the dominant conformer; see Table 4 . Meanwhile in the case of a N162-linked Man9, the dynamics of the longer arms is limited due to the proximity to the protein's surface, see Figure 8 , and in particular due to the presence of Lys 128, which because of its position denies a number of conformers due to steric hindrance and also potentially stabilizes the open (cluster 1) conformation through a hydrogen bonding interaction with the α(1−6)linked Man on the (1−6) arm. In this work we analyzed the 3D structure and dynamics of human oligomannose N-glycans, from Man5 to Man9, when free (unlinked) in solution and also determined how the effect of FcγRIIIa (CD16a) surface landscape modulates their structural equilibria. Despite similarities with complex Nglycans, 16, 17 in terms of the core chitobiose rigidity and of the relatively low degree of flexibility of the (1−3) arm, oligomannoses have a very unique architecture, which changes with the progressive functionalization of the arms. More specifically, Man5 shows a clear propensity for an "open" structure, where the (1−6) arm is outstretched orientating the Table 3 , with the terminal Man on the (1−3) branch of the (1−6) arm restrained by hydrogen bonds to residues T167, R18 and E85, labeled in the figure. Heat maps were made with RStudio (www.rstudio.com), and structure was rendered with PyMol (www.pymol.org). N-glycan is colored according to the SNFG convention. Figure 6 ; so the structure of Man9 is quite compact, or more "tree-like", relative to smaller oligomannoses, where the arms are shorter but characterized by a more independent dynamics. As a further step in the analysis, a direct comparison of the results we obtained for Man9 and Man8(II) with NMR-validated REMD analysis 42 shows a very good agreement, supporting that the trimming of terminal residues allows for more extended arm structures, which expose embedded glycotopes; see Figure S . 18 . The results obtained for the unlinked oligomannoses also confirm an earlier observation we made in the context of complex N-glycans, 16 whereby the overall 3D architecture is determined by the local spatial arrangement of independent groups of monosaccharides we named "glycoblocks". The oligomannoses dynamics can be also discretized in terms of these structural units, 16 with the addition of a unique Manα(1−2)-Man glycoblock that can be added to the arms with, as we have seen, minimal effect to the dynamics of the underlying units it builds on. This observation can offer a practical advantage to the study of glycan recognition through molecular docking, for example, where the receptor binds a specific glycoblock unit and recognition depends only on its accessibility within a specific glycoform. To understand how the protein affects the presentation of the glycans to potential receptors, we have looked at the human FcγRIIIa (CD16a). Human FcγRIIIa has two Nglycosylation sites, namely, N45 and N162, where the type of glycosylation affects the receptor's binding affinity to IgG1s. 31, 32, 43 The surface landscape around these two sites is quite different, with N162 exposed to the solvent while N45 is located in the core of one of the two structural domains; see Figure 1 . Conformational sampling of a Man5 at N45 shows that the (1−6) arm dynamics is heavily restrained to one of its two open conformations accessible in solution; see Figure 7 . More specifically, we found that the terminal Man on the (1− 3) branch is engaged in a network of hydrogen bonding interactions involving a number of residues near the glycosylation site, namely, Arg 18, Glu 85, and Thr 167. The stabilization of this glycoform by the FcγRIIIa surface landscape renders the (1−3) branch on the (1−6) arm virtually inaccessible for further functionalization. This result agrees with recent work highlighting the unique prevalence of hybrid and oligomannose type N-glycans at N45. 32, 43 The N162 position determines very little steric hindrance to the dynamics of Man5, which retains most of the degrees of freedom characterized for the glycan free in solution. Meanwhile, the dynamics of the larger Man9 is greatly affected by the presence of Lys 128, which forces the glycan to adopt only two of the conformations accessible to the unlinked form; Table 4 and Figure 8 . Ultimately, the comparison between the conformational propensity of the unlinked Man5 and Man9 oligomannoses relative to their FcγRIIIa-linked counterparts suggests that the protein landscape affects the glycans structure by shifting their intrinsic conformational equilibria toward forms that complement it, yet it does not actively morph the glycan into unnatural conformers. In this work we have characterized the 3D structure and dynamics of human oligomannose N-glycans unlinked and linked to FcγRIIIa through extensive sampling based on conventional MD simulations. The simulations of the unlinked oligomannose N-glycans show a complex architecture that is derived from a progressively intricate network of transient hydrogen bonding interactions involving the terminal residues on the arms, all linked through rigid Manα(1−2)-Man glycoblocks. The protein landscape affects the conformational equilibrium of the N-glycans favoring conformations that complement it, but it does not actively distort the oligomannoses' structure. Indeed, the two FcγRIIIa glycosylation sites studied in this work present different sets of constraints to different glycoforms and accordingly shift each conformational equilibrium specifically. This determines a diverse degree of accessibility of the arms for further functionalization by glycotransferases and glycohydrolases at N45, 32,43 which has been found to have an unusually high degree of hybrid N-glycoforms, and ultimately exposure of the arms at N162 for contact with the IgG1 Fc N-glycans, which is implicated in modulating ADCC. 19, 31, 44, 45 Work in this direction is currently ongoing in our lab. Biological roles of glycans Plant protein glycosylation Cell surface protein glycosylation in cancer The history of IgG glycosylation and where we are now Vertebrate protein glycosylation: diversity, synthesis and function Global view of human protein glycosylation pathways and functions Regulation of integrin functions by Nglycans Biological significance of complex N-glycans in plants and their impact on plant physiology. Front Comparisons of N-glycans across invertebrate phyla Protein glycosylation pathways in filamentous fungi Virus recognition of glycan receptors N-glycan structures: recognition and processing in the ER Glycan microarrays for decoding the glycome Imaging single glycans Exploring the Molecular Conformation Space by Soft Molecule-Surface Collision How and why plants and human N-glycans are different: Insight from molecular dynamics into the "glycoblocks" architecture of complex carbohydrates Sequence-to-structure dependence of isolated IgG Fc complex biantennary N-glycans: a molecular dynamics study Updates to the Symbol Nomenclature for Glycans guidelines Unique carbohydrate-carbohydrate interactions are required for high affinity binding between FcgammaRIII and antibodies lacking core fucose Exploitation of glycosylation in enveloped virus pathobiology Site-specific glycan analysis of the SARS-CoV-2 spike The glycan shield of HIV is predominantly oligomannose independently of production system or viral clade Envelope glycans of immunodeficiency virions are almost entirely oligomannose antigens l Trimeric HIV-1-Env Structures Define Glycan Shields from Clades Site-Specific Glycosylation of Virion-Derived HIV-1 Env Is Mimicked by a Soluble Trimeric Immunogen Highmannose glycans are elevated during breast cancer progression Comprehensive N-Glycome Profiling of Cells and Tissues for Breast Cancer Diagnosis Site-specific glycoproteomics confirms that protein structure dictates formation of N-glycan type, core fucosylation and branching N-glycoprotein macroheterogeneity: biological implications and proteomic characterization CD16a with oligomannose-type Site-specific N-glycan Analysis of Antibody-binding Fc γ Receptors from Primary Human Monocytes GLYCAM06: a generalizable biomolecular force field. Carbohydrates Comparison of simple potential functions for simulations of liquid water On the Role of Water Models in Quantifying the Binding Free Energy of Highly Conserved Water Molecules in Proteins: The Case of Concanavalin A Solution Properties of Hemicellulose Polysaccharides with Four Common Carbohydrate Force Fields Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulations VMD: visual molecular dynamics Identification of Fc Gamma Receptor Glycoforms That Produce Differential Binding Kinetics for Rituximab The Structural Role of Antibody N-Glycosylation in Receptor Interactions Exploration of conformational spaces of high-mannosetype oligosaccharides by an NMR-validated simulation Allotype-specific processing of the CD16a N45-glycan from primary human natural killer cells and monocytes Antibody Fucosylation Lowers the FcγRIIIa/CD16a Affinity by Limiting the Conformations Sampled by the N162-Glycan Glycan:glycan interactions: High affinity biomolecular interactions that can mediate binding of pathogenic bacteria to host cells Carl A Fogarty − Department of Chemistry and Hamilton Institute, Maynooth University, Maynooth, Kildare, Ireland Complete contact information is available at: https://pubs.acs.org/10.1021/acs.jpcb.1c00304 The authors declare no competing financial interest. Electronic Data Sharing. All structures and trajectories will be made available through a database currently under development in our lab. In the meantime, distribution is done based on requests to the corresponding author. The Irish Centre for High-End Computing (ICHEC) is gratefully acknowledged for generous allocation of computational resources. The Irish Research Council (IRC) is gratefully acknowledged for funding CAF studies through the Government of Ireland Postgraduate Scholarship Programme.