key: cord-0756081-2paaybd0 authors: Mascarenhas, Nahren Manuel; Gosavi, Shachi title: Understanding protein domain-swapping using structure-based models of protein folding date: 2016-11-17 journal: Prog Biophys Mol Biol DOI: 10.1016/j.pbiomolbio.2016.09.013 sha: 128de1d0186d200abb0c50069fae5ff6d2dfbe26 doc_id: 756081 cord_uid: 2paaybd0 In domain-swapping, two or more identical protein monomers exchange structural elements and fold into dimers or multimers whose units are structurally similar to the original monomer. Domain-swapping is of biotechnological interest because inhibiting domain-swapping can reduce disease-causing fibrillar protein aggregation. To achieve such inhibition, it is important to understand both the energetics that stabilize the domain-swapped structure and the protein dynamics that enable the swapping. Structure-based models (SBMs) encode the folded structure of the protein in their potential energy functions. SBMs have been successfully used to understand diverse aspects of monomer folding. Symmetrized SBMs model interactions between two identical protein chains using only intra-monomer interactions. Molecular dynamics simulations of such symmetrized SBMs have been used to correctly predict the domain-swapped structure and to understand the mechanism of domain-swapping. Here, we review such models and illustrate that monomer topology determines key aspects of domain-swapping. However, in some proteins, specifics of local energetic interactions modulate domain-swapping and these need to be added to the symmetrized SBMs. We then summarize some general principles of the mechanism of domain-swapping that emerge from the symmetrized SBM simulations. Finally, using our own results, we explore how symmetrized SBMs could be used to design domain-swapping in proteins. Domain-swapping is a process in which two or more identical protein monomers exchange structural elements and fold into dimers or multimers (Bennett et al., 1994a; Liu and Eisenberg, 2002) . The individual units of such oligomers are structurally similar to the original monomer (Fig. 1) . The earliest evidence for domainswapping was seen in RNase A (Crestfield et al., 1962) and the first determination of a domain-swapped structure was that of the dimeric diphtheria toxin (Bennett et al., 1994a,b) . Since then, it has become clear that domain-swapping is common, and several domain-swapped proteins have been crystallized (Shameer et al., 2011) . Recent data indicates that RNA structures may also domain-swap (Suslov et al., 2015) . In this section, we first define several terms used in the study of domain-swapping (Liu and Eisenberg, 2002; Gronenborn, 2009; Rousseau et al., 2012) and then summarize the biological consequences of domain-swapping (Rousseau et al., 2012; Wodak et al., 2015) . In order for a protein to be considered domain-swapped the structures of both the monomer and the domain-swapped oligomer need to be present. If one of the structures is that of a homologous protein then the protein is said to be quasi-domain swapped. If only the domain-swapped structure is available then the protein is considered a candidate for domain-swapping. Several inter-protein interfaces are formed upon domain-swapping. The interface whose inter-protein interactions mimic those present in the monomer is called the primary interface. However, new interfaces which are not present in the monomer can also be formed and such interfaces are called secondary interfaces. Many proteins swap only a single secondary structural element and in such cases this "domain" can be thought of as being swapped. However, there are proteins where the size of the "swapped" region is similar to that of the other region. To account for both cases, in this review, we call both parts of the monomer swapped domains. So, a domainswapping protein usually consists of two swapping-domains connected by a peptide segment (of about 4e5 residues) called the hinge ( Fig. 1B and C) . The hinge undergoes a conformational change upon swapping, usually being a loop or a turn in the monomer (Fig. 1B) and adopting an extended conformation after domainswapping (Fig. 1C ). There are also examples where two different hinges induce two different modes of domain swapping in the same protein (Liu et al., 1998 (Liu et al., , 2001 Chen et al., 2010) . In some proteins (Nilsson et al., 2004) , one swapped domain is inserted into the sequence of the other domain and there are two peptide segments connecting the two domains. Upon swapping, both segments undergo a conformational change. Given the diversity of structural hinge types, there have been efforts to understand the sequence determinants of hinges. Early efforts concentrated on the proline composition of hinges but although prolines contribute to domain-swapping in some proteins (Bergdoll et al., 1997; Rousseau et al., 2001; Miller et al., 2010) this is not a universal effect (Barrientos et al., 2002; Cho et al., 2005) . A recent study analyses the distribution of all amino acids in the hinge regions of domain swapping proteins and finds that the amino acid distribution of hinge residues is similar to that of other loop regions. However, there are some amino acids such as valine which are more likely to be found in hinges . Open-ended domain-swapping ( Fig. 1D ) can lead to the formation of large protein aggregates (Esposito et al., 2010) . Such aggregation can lead to the loss of protein function and cause disease (Law et al., 2006) . Domain-swapping has also been implicated in the formation of disease-causing fibrillar aggregates (Chiti and Dobson, 2006; Herczenik and Gebbink, 2008; Bennett et al., 1995 Bennett et al., , 2006 Janowski et al., 2005) . For instance, mutants of cystatin-C with reduced domain-swapping also have reduced fibrillar aggregation (Nilsson et al., 2004) . The question that arises, then, is, "Why are the amino acids that cause increased domain-swapping retained through evolution?" In a recent study, we showed that in stefin-B, residues that increase the propensity of domainswapping are part of the protease binding-site and thus, domainswapping can be a by-product of the need to conserve protein function (Mascarenhas and Gosavi, 2016) . In some proteins, domain-swapping is itself involved in protein function and we discuss this next. Domain-swapping has been implicated in the regulation of protein function (Saint-Jean et al., 1998; Vitagliano et al., 1999; Baxter et al., 2012) . The addition of glutathione converted the Pseudomonas putida glyoxylase I from an active domain-swapped dimer with two metal binding sites to a metastable less active monomer with one metal binding site (Saint-Jean et al., 1998) . Domain-swapping was also observed to induce allosteric communication in the N-terminal swapped bovine seminal ribonuclease A (Vitagliano et al., 1999) . Additionally, domain swapping has been suggested as a mechanism for the evolution of larger complex folds when it is followed by gene duplication and fusion (Bennett et al., 1995) . Further support for this hypothesis was found in directed evolution experiments on NF-kB p50, where domain-swapping was able to rescue both structure and function from destabilizing mutations (Chirgadze et al., 2004) . Finally, domain-swapping has been seen to induce protein self-assembly (Baker et al., 2016) . In order to understand the consequences of domain-swapping, it is important not only to be able to predict the final domainswapped structure but also to understand the mechanism of domain-swapping. Here, we review a class of computational protein models called G o (G o and Taketomi, 1979) or structure-based models (SBMs) (Clementi et al., 2000; Whitford et al., 2009) . SBMs use information derived only from the monomeric protein structure to model domain-swapping (Ding et al., 2002; Yang et al., 2004) . Molecular dynamics (MD) simulations of such SBMs have been able to correctly predict hinge regions and the structure of the domain-swapped dimers (Ding et al., 2002; Yang et al., 2004; Cho et al., 2005; Ding et al., 2006) . They have also been used to understand whether swapping starts from a transition-state ensemble, an intermediate ensemble or an unfolded ensemble (Yang et al., 2004; Cho et al., 2005; Ono et al., 2015) . We summarize insights about the mechanism of domain swapping gained from SBM simulations and then briefly describe how this information might be used to design domain-swapping in proteins. Despite having a vast conformational space, proteins fold on a biologically reasonable timescale because of a funnel shaped energy landscape (Bryngelson et al., 1995) . This funnel shape arises because the interactions present in the native or the folded structure of the protein are stabilizing while non-native interactions, which would need to be broken before further folding, are mostly destabilizing. SBMs (Clementi et al., 2000; Hyeon and Thirumalai, 2011; Whitford et al., 2012) use this funnel shape of the energy landscape to simplify the potential energy function and make it easier to sample the conformational space. The potential energy functions of SBMs encode the structure of the protein and in doing so, ignore all non-native interactions and assume that only native interactions contribute to folding. Further, solvation is taken into account implicitly. Both these simplifications speed up computation and make it possible to reach folding timescales in a reasonable amount of computer time. Molecular dynamics (MD) simulations of SBMs of proteins have been successfully used to describe various aspects of protein folding such as folding mechanisms, trends in folding barrier heights, structures of folding intermediate ensembles, etc (Hyeon and Thirumalai, 2011; Whitford et al., 2012) . SBMs have been reviewed in detail elsewhere (Hyeon and Thirumalai, 2011; Whitford et al., 2012; Yadahalli et al., 2014) . Here, we summarize a few general features of the models. The protein structure is encoded into different flavours of SBMs in different ways. For instance, some SBMs are coarse-grained and represent each residue by only its Ca (see for example: Clementi et al., 2000) or its Ca and its Cb atoms (see for example : Cheung et al., 2005) . In general, in all SBMs, constraints derived from chemical bonding such as bond distances ( Fig. 2A) , angles ( Fig. 2B ) or improper dihedrals maintaining chirality are encoded using strong harmonic potentials which do not allow these terms to deviate much from their values in the folded structure. Some SBMs encode a cosine-like dihedral potential which mimics secondary structure formation (Fig. 2C ). The energetics of tertiary structure is encoded through Lennard-Jones-like potentials between atoms which are in "contact" in the folded structure of the protein (Fig. 2D ). The strength of this contact potential is such that contacts can break and form (the distance between the atoms in contact is similar to that in the folded state) allowing the protein to unfold and fold. Atom pairs which are in contact and have an attractive interaction potential between them are usually calculated using one of two recipes. In the first, a contact is said to be present between two atoms if the second atom is within a pre-set cut-off radius (say 4.5 Å) of the first atom (see for example: Karanicolas and Brooks, 2002; Chen and Chan, 2015) . Such contact maps are called cut-off contact maps. In the second recipe, an additional screening term is imposed which removes contacts where the contact is occluded by a third atom. Such screened contact maps (e.g. shadow (Noel et al., 2012) and CSU (Sobolev et al., 1999) ), are advantageous because they reduce the sensitivity of contact lists to the cut-off radius. In several coarse-grained SBMs, contacts are calculated using the atomistic representation of the protein and then projected onto the coarse-grained representation (Clementi et al., 2000; Karanicolas and Brooks, 2002) . Some coarse-grained SBMs weight the strengths of contacts (see Yadahalli and Gosavi, 2016 for a detailed description of several such SBMs) or change the shape of the contact potential (Azia and Levy, 2009; Chen and Chan, 2015) to account for sequence specific effects. Such heterogeneous contacts can be used to represent salt bridges (Azia and Levy, 2009 ), disulphide bonds (Cho et al., 2005) , single effective contacts between large amino acids (Yadahalli and Gosavi, 2016) , etc. Finally, excluded volume interactions keep atoms which are not in contact from passing through each other. SBMs are computationally efficient and can be used to simulate long timescales for multiple protein chains. Thus, SBMs can be used to study domain-swapping. However, SBMs used for protein folding use only native-like interactions in the potential energy function. Although such SBMs can be used to understand the mechanism of domain-swapping using the domain-swapped structure, they cannot be used for predicting the domain-swapped structure or for understanding the causes of domain-swapping. In order to address these issues, symmetrized SBMs, which use information only from the monomer structure, were invented (Ding et al., 2002; Yang et al., 2004) . The potential energy functions of symmetrized SBMs are constructed as follows: The energy function of each identical protein chain (say A or B) is derived from the monomer structure in the same manner as in the folding SBMs. A weak harmonic restraint of the form k(x-x 0 ) 2 is usually applied between the centres of mass of the two protein chains (A and B) to maintain a given protein concentration. Additionally, for every pair of intra-molecular contacts between atoms i A and j A of chain A and i B and j B of chain B present in the monomer SBMs, a pair of inter-molecular contacts (i A ,j B ) and (i B ,j A ) are introduced into the symmetrized SBM. Thus, atoms can form the same set of contacts either within their own chain, which leads to a pair of monomers or with atoms of another chain, which can lead to a diversity of domain-swapped structures. However, it was observed that the largest population of dimers seen in symmetrized SBM simulations had structures which were similar to the domain-swapped dimer crystal structure (Yang et al., 2004; Cho et al., 2005) . In some proteins, specific features of the amino acid sequence promote or repress domain-swapping. Such sequence effects can either be present (as stress) in the hinges (e.g. isomerization of hinge prolines (Gronenborn, 2009) For several proteins, even symmetrized SBMs which include only a coarse-grained representation can correctly predict the structure of the domain-swapped dimers (Yang et al., 2004; Cho et al., 2005; Ding et al., 2006) . Further, the physical understanding of domain-swapping from symmetrized SBMs has also led to a method for predicting the unfolding hotspots and hinge regions using only the monomer structure (Ding et al., 2006) . This method calculates the difference in contact energies (DE) between the folded structure and an "open" (or domain-swapped) structure of the monomer which assumes that a particular hinge contributes to the domain-swapping. The gain in entropy (DS) upon going from the folded structure to the open structure is modelled using polymer theory. A hinge is predicted to be the likely cause of domainswapping if its DE/DS is the lowest in the protein. This method was able to correctly predict the hinges for several proteins including the two different hinges present in RNase-A which give rise to two different domain-swapped dimers. It was also able to predict an entirely new domain-swapped structure for the focal adhesion targeting domain which was subsequently experimentally confirmed. Thus, in general, the structure of the domainswapped dimer is determined by the overall topology of the monomer. We next discuss potential protein fates that have been seen in symmetrized SBM simulations and how these fates are determined by the energetics of the protein structure. At low protein concentrations (a low inter-protein interaction force constant, k, and a high inter-protein mean interaction distance, x 0 in symmetrized SBM simulations), proteins interact little and mainly fold to monomers. At higher protein concentrations (a larger inter-protein interaction force constant, k and a small mean interaction distance, x 0 ), there is a competition between folding to a monomer structure and domain-swapping (Yang et al., 2005) . In simulations of proteins which do not domain-swap and have a high barrier to folding (e.g. WT CI2 (Yang et al., 2004; Cho et al., 2005) ), a high protein concentration promotes domain-swapping but induces a diversity of swapped structures with no particular structure being given a preference (Yang et al., 2004; Cho et al., 2005) . In high concentration simulations of proteins which domain-swap, the correct domain-swapped structure is predominant (Yang et al., 2004; Cho et al., 2005; Ding et al., 2006) . It has also been observed that "hotspots" of unfolding are present around the hinge region (Ding et al., 2006; Dehouck et al., 2003) . Such hotspots can promote domain-swapping from the unfolded state by remaining unfolded while the rest of the protein folds (Yang et al., 2004; Ding et al., 2006; Liu and Huang, 2013) . In some proteins, the free energy barrier to folding is low potentially due to the formation of folding intermediates (Tsytlonok and Itzhaki, 2013) . In such cases, partial monomer folding can still occur even at high protein concentrations. The (Mascarenhas and Gosavi, 2016) . We note in passing that this effect is not accounted for in the algorithm to predict domain-swapping hotspots (Ding et al., 2006) described in the previous section. Domainswapping can also occur from the folded state where unfolding hotspots in the structure promote local unfolding in the folded protein. Such local unfolding events can give rise to near-native intermediates (Straub and Thirumalai, 2011; Das et al., 2011) whose role in domain swapping has recently been demonstrated using SBM simulations of the SH3 domain (Zhuravlev et al., 2014) . An altogether different mechanism of domain swapping has been proposed for a quadruple mutant of GB1 using simulations of non-structure based forcefields (Malevanets et al., 2008) . Since natural GB1 is not known to domain swap, it is possible that the hydrophobic core mutations that promote domain-swapping (Byeon et al., 2003) alter the funnel-shaped energy landscape of natural GB1. So, the domain-swapping mechanism of mutant GB1 may not be generally applicable to the domain-swapping of natural proteins and we do not elaborate on it here. Finally, more complex mechanisms which may involve non-native interactions (interactions not present in the folded monomer structure) have been proposed for the C-terminal domain of the SARS coronavirus main protease (Kang et al., 2012) . We do not comment on the underlying energetics of such mechanisms here because this protein is yet to be studied using SBMs. We next demonstrate how an understanding of the mechanism of domain-swapping might help in the design of domain-swapping. Design of domain-swapping is likely to have applications in protein assembly (see for example Ogihara et al., 2001) and functional regulation (Ha et al., 2015) . Several protein-specific methods have been used to design domain-swapping (Green et al., 1995; Murray et al., 1998; Pica et al., 2013; Rousseau et al., 2001; Ogihara et al., 2001; Reis et al., 2014; Kuhlman et al., 2001; Orlikowska et al., 2011; Picone et al., 2005) . More recently, insertion of a lever protein (ubiquitin) into a non-domain swapping protein (ribose binding protein: RBP) was used to force RBP open and enable domain swapping. This method is likely to be generalizable to other proteins. Using this RBP construct it was found that a primary hinge existed in the protein and if the entropic constraints of intra-protein interactions were lifted then domain-swapping occurred at this hinge (Ha et al., 2015) . Symmetrized-SBM simulations of non-domain swapping proteins such as CI2 at high protein concentrations have shown that such proteins lead to diverse groups of domain-swapped structures each of which uses a different loop for swapping (Yang et al., 2004) . Here, we define the primary hinge as that loop which leads to the domain-swapped structure with the highest population. It follows that it should be easiest to destabilize the loop conformation of this hinge to engineer domain-swapping into the protein. In fact, it was seen in simulations that experimental mutations that led to a domain-swapped version of the protein CI2 were made in the loop that was the primary hinge of CI2 (Yang et al., 2004; Cho et al., 2005) . To understand how simulations can guide design, we discuss domain-swapping simulations of the symmetrized SBMs of two proteins of the cystatin fold, one of which domain-swaps and the other does not. Proteins of the monellin-cystatin fold are composed of five antiparallel strands docked to a single helix with an overall topology of b1-a1-b2-b3-b4-b5 (Fig. 3A) . Here, we compare domainswapping simulations of the cysteine protease inhibitor, stefin-B (Turk and Bode, 1991), which readily domain swaps (Jenko Kokalj et al., 2007) , and a single chain version of the sweet tasting protein monellin (scMn) (Tancredi et al., 1992) , where domainswapping has not been observed in experiment (Patra and Udgaonkar, 2007; Udgaonkar, 2014, 2015) . We recently showed using the simulations of symmetrized SBMs which encode all heavy atoms of the protein that the primary cause for domain-swapping in stefin-B is the presence of a frustrated loop whose composition is QVVAG (Mascarenhas and Gosavi, 2016). The backbone dihedrals of the second Val in this loop lie in the disallowed region of the Ramachandran plot (Ramachandran et al., 1963) . Domain-swapping is favoured because it eases the destabilization created by this disallowed dihedral. We also found that the energetics of this destabilization could only be described correctly if structural information from the dimer structure was encoded to simulate this hinge (QVVAG). In particular, when the hinge dihedrals were extracted from the dimer structure and encoded at ten times the strength of all other dihedrals, we observed that over 60% of the quenched structures folded to the correct domainswapped dimer structure ( Fig. 3B and C) (Mascarenhas and Gosavi, 2016). Here, we show results from the domain-swapping simulations of wild-type monellin (scMn) and a variant which encodes the energetics (and the structure) of the stefin-B loop (QVVAG). As before (Mascarenhas and Gosavi, 2016), we used symmetrized SBMs which encode all the heavy atoms of the protein (Whitford et al., 2009 ) for these simulations. Two unfolded chains (with different structures in each replicate) were used as starting structures in the simulations. These were then simulated for 1 Â 10 7 simulation steps at~0.9T f , where T f is that temperature below which the protein is folded. The chains were restrained to be within interacting distance of each other by a weak harmonic restraint as explained in the section on symmetrized SBMs. Simulations were performed using the GROMACS 5.0 program suite (Abraham et al., 2015) with the leap frog stochastic dynamics integrator and a time step of 0.0005 in reduced units. The basic energy scale of the simulations, ε, is set to 1.0 kJ/mol, the basic length scale to 1 nm and the basic time scale to 1 ps, units which are used in GROMACS. We performed 100 replicate simulations and visually analysed the final structures from these simulations. The scMn mutant was created by deleting residues 47e56 of scMn and replacing them by a loop with residues QVVA. This loop was built using I-TASSER (Yang et al., 2015) . The 57th residue of scMn, a Gly, is not deleted and thus, the mutant scMn has the QVVAG segment from stefin-B in its loop1 (Fig. 3) . The resulting sequence has 90 residues instead of the 96 of scMn. Further, the minima of the backbone dihedrals of the QVVAG sequence are encoded from the dimer structure of stefin-B (PDB ID: 2OCT). The potential energies of these dihedrals are also strengthened to 10 times those of the normal dihedrals (Mascarenhas and Gosavi, 2016) . Table 1 summarizes the analysis of final structures from the simulations of both wild-type and mutant proteins. As expected, the simulations of scMn show a diversity of domain-swapped structures with the highest populations (Table 1) being those with loopA swapped (Fig. 4A ) and loop1 swapped (Fig. 4B) . At a lower protein concentration (here a lower value of the strength of the harmonic potential, k), the population of the loop1 swapped mutant reduces while that of the two monomers increases. Thus, loopA is likely to be the primary hinge in scMn. However, loopA is a long loop and it is not apparent what mutations will destabilize the monomer but stabilize the dimer. The primary hinge in stefin-B is loop1 which is a b-turn and we introduce this into scMn. This loop mutation almost completely suppresses the population of all domain-swapped structures except that of the loop1 swappeddimer (Table 1) . Interestingly, this mutation suppresses the formation of two monomers even at lower protein concentrations (Table 1) and thus, promotes domain-swapping quite significantly. Experiments with the loop1 mutant of scMn can be easily performed to test our predictions. A general feature of swapping proteins is that the hinge region is stressed in the monomer structure and this stress is relieved upon domain swapping. Several structurally homologous pairs of proteins exist where only one of the proteins is known to domain-swap . A detailed understanding of the mechanism of domain-swapping made possible by symmetrized-SBMs can be used to predict whether hinge-swap mutations between such pairs of proteins can induce domain-swapping in the protein which does not domain-swap. The domain-swapping hinges could also be used to create a library of loops which introduce different kinds and degrees of destabilization into the monomer structure. For example, the QVVAG creates destabilization due to the tradeoff between the hydrophobic effect and excluded volume while adding prolines might create destabilization due to the population of a proline isomer in the monomer form which is less stable than the isomer populated in the domainswapped structure (Bergdoll et al., 1997; Rousseau et al., 2001; Kuhlman et al., 2001; Miller et al., 2010) . Finally, changes in (both increases and decreases in) the length of hinges can also promote swapping (Green et al., 1995; Murray et al., 1998; Picone et al., 2005) . Such a library could then be used along with symmetrized-SBM simulations to understand the effect of hingeswap mutations of any given loop in a protein and whether such mutations are likely to promote domain-swapping and to what degree. This loop library could then be used to guide mutation experiments designed to introduce domain-swapping into proteins Table 1 Summary of the results from domain swapping simulations. A total of 100 quenching runs were performed for each protein and the final snapshots of each simulation were categorized by visual inspection. The underlined numbers give the populations of the most commonly occurring structures for each protein. All structural elements have been marked in Fig. 3A Loop1 is the loop between b2-b3 and loopA is the loop between a1-b2. loop1-hinge (one) or loopA-hinge (one): loop1 (or loopA) is the hinge, but only one subunit of the domain-swapped dimer is formed while the other is unfolded. Monomers (one): Only one of the monomers is folded. K is the strength of the harmonic potential in units of ε/nm 2 and x 0 is the equilibrium distance between the centres of mass of the two chains. Table 1 ). (A) Dimer with loopA swapped. This structure is rarely observed in stefin-B simulations because the b1-b2 interactions are weak in stefin-B and this reduces the likelihood that they are formed between proteins. (B) Dimer with loop1 swapped. Similar to Fig. 3C . In simulations, where the monellin loop1 is replaced by the stefin-B loop1, this is the predominant structure observed. (C) Other complex swapped-dimers are also observed. A structure with b1-a1-loopA swapped between proteins is shown. The hinge is the peptide segment between loopA and b2. that do not domain-swap. Protein domain-swapping can have both beneficial (protein evolution, functional regulation, etc.) and detrimental (fibrillar and amorphous aggregation and loss of function) effects. We review symmetrized-SBMs and their simulations, which can both predict the domain-swapped structure and identify the mechanism of domain-swapping. Such simulations have been useful for understanding the consequences of domain-swapping in specific proteins. The design of domain-swapping has applications in proteinassembly and in the regulation of protein function. In this review, we also outline how symmetrized-SBMs could be used to tune the energetics of domain-swapping in order to achieve the required dimer-structure and the degree of domain-swapping. GROMACS: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1e2 Nonnative electrostatic interactions can modulate protein folding: molecular dynamics with a grain of salt Strand swapping regulates the ironsulfur cluster in the diabetes drug target mitoNEET Domain swapping: entangling alliances between proteins Refined structure of dimeric diphtheria toxin at 2.0 A resolution 3D domain swapping: a mechanism for oligomer assembly Prolinedependent oligomerization with arm exchange Funnels, pathways, and the energy landscape of protein folding: a synthesis. Proteins 21, 167e195. Byeon, I Native contact density and nonnative hydrophobic effects in the folding of bacterial immunity proteins MolProbity: all-atom structure validation for macromolecular crystallography Snapshot of protein structure evolution reveals conservation of functional dimerization through intertwined folding Topological and energetic factors: what determines the structural details of the transition state ensemble and "enroute" intermediates for protein folding? An investigation for small globular proteins Aggregation of g-crystallins associated with human cataracts via domain swapping at the C-terminal b-strands Sequencestructure signals of 3D domain swapping in proteins Molecular dynamics simulation of the SH3 domain aggregation suggests a generic amyloidogenesis mechanism Topological determinants of protein domain swapping Studies on protein folding, unfolding and fluctuations by computer simulation. IV. Hydrophobic interactions One-step evolution of a dimer from a monomeric protein Protein acrobatics in pairsedimerization via domain swapping 3D domainswapped human cystatin C with amyloidlike intermolecular beta-sheets. Proteins 61, 570e578 Foldon unfolding mediates the interconversion between M(pro)-C monomer and 3D domain-swapped dimer The origins of asymmetry in the folding transition states of protein L and protein G A breakdown of symmetry in the folding transition state of protein L Conversion of monomeric protein L to an obligate dimer by computational protein design A domain-swapped RNase A dimer with implications for amyloid formation The crystal structure of a 3D domain-swapped dimer of RNase A at a 2.1-A resolution Tuning cooperativity on the free energy landscape of protein folding High-energy intermediates in protein unfolding characterized by thiol labeling under nativelike conditions A hinge region cis-proline in ribonuclease A acts as a conformational gatekeeper for C-terminal domain swapping Engineering an intertwined form of CD2 for stability and assembly Prevention of domain swapping inhibits dimerization and amyloid fibril formation of cystatin C: use of engineered disulfide bridges, antibodies, and carboxymethylpapain to stabilize the monomeric form of cystatin C The shadow map: a general contact definition for capturing the dynamics of biomolecular folding and function Single-site mutations induce 3D domain swapping in the B1 domain of protein L from Peptostreptococcus magnus Three-dimensional domain swapping and supramolecular protein assembly: insights from the X-ray structure of a dimeric swapped variant of human pancreatic RNase Stereochemistry of polypeptide chain configurations Three-dimensional domain swapping in p13suc1 occurs in the unfolded state and is controlled by conserved proline residues Analysis of domain-swapped oligomers reveals local sequence preferences and structural imprints at the linker regions and swapped interfaces Toward a molecular theory of early and late events in monomer to amyloid fibril formation Structural determination of the active site of a sweet protein. A 1H NMR investigation of pMNEI Biomolecular dynamics: orderedisorder transitions and energy landscapes The landscape of intertwined associations in Homo-oligomeric proteins Functionally relevant specific packing can determine protein folding routes The I-TASSER Suite: protein structure and function prediction Domain swapping is a consequence of minimal frustration Protein oligomerization through domain swapping: role of inter-molecular interactions and protein concentration Propensity to form amyloid fibrils is encoded as excitations in the free energy landscape of monomeric proteins