key: cord-286537-7ri2p5b8
authors: Lee, Ting-Wai; Cherney, Maia M.; Huitema, Carly; Liu, Jie; James, Karen Ellis; Powers, James C.; Eltis, Lindsay D.; James, Michael N.G.
title: Crystal Structures of the Main Peptidase from the SARS Coronavirus Inhibited by a Substrate-like Aza-peptide Epoxide
date: 2005-11-11
journal: Journal of Molecular Biology
DOI: 10.1016/j.jmb.2005.09.004
sha: 
doc_id: 286537
cord_uid: 7ri2p5b8

The main peptidase (Mpro) from the coronavirus (CoV) causing severe acute respiratory syndrome (SARS) is one of the most attractive molecular targets for the development of anti-SARS agents. We report the irreversible inhibition of SARS-CoV Mpro by an aza-peptide epoxide (APE; k inact/K i=1900(±400)M−1 s−1). The crystal structures of the Mpro:APE complex in the space groups C2 and P212121 revealed the formation of a covalent bond between the catalytic Cys145 Sγ atom of the peptidase and the epoxide C3 atom of the inhibitor, substantiating the mode of action of this class of cysteine-peptidase inhibitors. The aza-peptide component of APE binds in the substrate-binding regions of Mpro in a substrate-like manner, with excellent structural and chemical complementarity. In addition, the crystal structure of unbound Mpro in the space group C2 revealed that the “N-fingers” (N-terminal residues 1 to 7) of both protomers of Mpro are well defined and the substrate-binding regions of both protomers are in the catalytically competent conformation at the crystallization pH of 6.5, contrary to the previously determined crystal structures of unbound Mpro in the space group P21.

Severe acute respiratory syndrome (SARS) first emerged in China in November 2002. This highly transmissible, infectious and often fatal disease spread to 32 countries across five continents, causing close to 8500 infections and over 900 deaths, until being contained by the summer of 2003. Several infections in Asia were reported subsequently, alerting the world that it remains at risk of another outbreak of SARS (World Health Organization: Severe acute respiratory syndrome †). Although the development of anti-SARS vaccines and drugs is in progress, these agents are still far from clinical use. 1 Additional efforts in these areas of study therefore remain paramount.

SARS is caused by a novel coronavirus (CoV); 2-4 it is an enveloped positive-sense single-stranded RNA virus infecting respiratory and gastrointestinal epithelial cells, macrophages, and other cell types, thereby causing systemic changes and damaging many vital organs such as lung, heart, liver, kidney and adrenal gland. 4, 5 Anti-SARS therapeutics could target several major steps in the viral life-cycle, such as virus-cell interactions, virus entry, intracellular viral replication, and virus assembly and exit. 1 The intracellular replication of SARS-CoV is mediated by a "replicase" complex derived from two virally coded polyprotein precursors, pp1a (486 kDa) and pp1ab (790 kDa). 6, 7 The formation of this replicase complex requires the extensive processing of the two polyproteins by two cysteine peptidases within them, namely the main peptidase (Mpro), also known as the 3C-like peptidase (3CLpro) because of its similarity to the 3C peptidases of Picornaviridae, 8 and the accessory papain-like peptidase 2 (PL2pro). PL2pro cleaves at three sites in the N-proximal regions of the two polyproteins, whereas Mpro cleaves at 11 sites in the central and C-proximal regions of the two polyproteins. Mpro releases the key proteins in viral replication, such as the RNA-dependent RNA polymerase and the helicase. 7 Playing such an essential role, SARS-CoV Mpro is an attractive molecular target for the development of anti-SARS drugs acting as inhibitors of the peptidase.

SARS-CoV Mpro has a molecular mass of 33.8 kDa per protomer; it exists as a homodimer over a wide range of concentrations in solution. 9-12 The crystal structures of Mpro in the space group P21 showed that the two protomers of the dimeric peptidase are oriented almost perpendicular to each other and that each protomer consists of three domains. Domain I (residues 8 to 101) and domain II (residues 102 to 184) comprise a two-β-barrel fold similar to that of the chymotrypsin-type serine peptidases. Domain III (residues 201 to 300) has five α-helices and is connected to domain II by a long loop (residues 185 to 200). Each protomer has its own substrate-binding region situated in the cleft between domains I and II. 13 A recent mutagenesis study has confirmed that Mpro is a cysteine peptidase with a Cys-His catalytic dyad at the active site. 14 As suggested by the structure-based sequence alignment of the main peptidases (including their flanking residues in the polyproteins) from SARS-CoV and other coronaviruses, 15 and confirmed by in vitro studies, 7,9 these peptidases preferentially cleave at a consensus sequence for the P4 to P1′ residues of substrates (nomenclature based on that of Schechter and Berger 16): (amino acid with a small side-chain)-(any amino acid)-Leu-Gln↓(Ala, Ser, Gly).
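As a rough illustration of this consensus, the following Python sketch scans a one-letter amino-acid sequence for candidate cleavage sites. The regular expression, the function name `find_cleavage_sites` and the choice of "small" P4 residues (Ser, Thr, Val, Pro and Ala, the side-chains this paper lists for the S4 pocket) are assumptions made for illustration; this is not a validated cleavage-site predictor.

```python
import re

# Assumed encoding of the consensus: P4 small residue, P3 any residue,
# P2 Leu, P1 Gln, followed by a P1' residue that is Ala, Ser or Gly.
# The outer lookahead lets overlapping candidate sites be reported.
CONSENSUS = re.compile(r"(?=([STVPA].LQ)[ASG])")


def find_cleavage_sites(sequence):
    """Return the 0-based index of the P1' residue for each candidate site."""
    # m.start() is the index of P4, so P1' sits four positions later.
    return [m.start() + 4 for m in CONSENSUS.finditer(sequence.upper())]


# Peptide portion of the fluorogenic substrate used in this study:
# Ser-Val-Thr-Leu-Gln-Ser-Gly
sites = find_cleavage_sites("SVTLQSG")
print(sites)
```

For the substrate peptide, the single candidate site places the scissile bond between Gln and the following Ser.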

A number of small-molecule inhibitors of SARS-CoV Mpro have been proposed using various methodologies, such as knowledge-based discovery 17, 18 and high-throughput screening (experimental 19-21 or virtual 22, 23). Although the efficacies of many of these inhibitors were supported by assay results, the modes of action are unknown for most of them. Also, the lack of structural information for inhibitors binding to Mpro impedes the structure-based optimization of these inhibitors. To our knowledge, the crystal structure of Mpro bound by chloromethyl ketone (CMK) has so far been the only published structure for an inhibitor-bound Mpro. 13 Aza-peptide epoxides (APEs) were synthesized as a new class of inhibitors apparently specific for clan CD cysteine peptidases 24 (based on the classification by Barrett and Rawlings 25), including the legumains 26 and the caspases. 27 Each APE has an aza-peptide component, with an epoxide moiety attached to the carbonyl group of the P1 residue. The side-chain of the P1 residue predominantly determines the target-peptidase specificity of an APE. The substituent on the epoxide C2 atom also allows some tuning of both the inhibitory activity and specificity of APE towards a particular target peptidase. The aza-peptide component resembles a peptide, except that the Cα atom of the P1 residue in the former is replaced by a nitrogen atom to form an aza-amino acid residue. This introduces trigonal planar geometry to the α-atom of the P1 residue and reduces the electrophilicity of the carbonyl C atom of the P1 residue, thereby making the carbonyl group of the P1 residue resistant to nucleophilic attack. 28 It has been proposed that APEs inhibit their target peptidases irreversibly by a mechanism in which the catalytic Cys Sγ atom nucleophilically attacks one of the two epoxide carbon atoms (C2 or C3) of APE (Figure 1(a)). 24, 26, 27 This results in the opening of the conformationally strained epoxide ring, and the formation of a covalent bond between the Cys Sγ atom and the attacked APE atom.

Figure 1. Inhibition of SARS-CoV Mpro by aza-peptide epoxides (APEs). (a) APEs synthesized for our study, Cbz-Leu-Phe-AGln-EP-COOEt. The epoxide carbon atoms are numbered and their stereochemistries are omitted for simplicity. The proposed mechanism for the irreversible inhibition of clan CD cysteine peptidases by APEs is indicated by arrows. Cbz, the benzyloxycarbonyl group; AGln, aza-glutamine; EP, epoxide; COOEt, ethyl ester. (b) Progress curves for the steady-state cleavage of a fluorogenic peptide substrate observed using 64 μM peptide and either no APE (dotted line) or 5 μM Cbz-Leu-Phe-AGln-(S,S)EP-COOEt (continuous line). The appearance of product was followed using excitation and emission wavelengths of 320 and 420 nm, respectively. Analysis of these data using equation (1), in which P i and P ∞ represent the initial and final product concentrations, respectively, yielded inactivation rates, j s, of 0.07 min−1 and 0.29 min−1, respectively. (c) The rate of Mpro inhibition was determined using 16 μM (□), 32 μM (■), 64 μM (△) and 100 μM (○) peptide. Equation (2) was fit to the data using the least-squares, dynamic weighting options of LEONORA, 37 yielding the following parameters: k inact = 35(±17) × 10^−3 s−1, K i = 18(±9) μM and K m = 96(±31) μM. Additional experimental details are provided in Materials and Methods.
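The parameters in the Figure 1 legend can be cross-checked with a few lines of arithmetic. The sketch below takes the reported central values (with K i and K m read as micromolar) and computes k inact/K i. It also evaluates a rate law of the form j s = k inact[I]/(K i(1 + [S]/K m) + [I]); this competitive form is an assumption on our part, since equation (2) itself is not reproduced in this text, and the function names are hypothetical.

```python
# Central values reported in the Figure 1 legend (uncertainties omitted).
K_INACT = 35e-3  # s^-1
K_I = 18e-6      # M (assumed to be micromolar in the legend)
K_M = 96e-6      # M (assumed to be micromolar in the legend)


def inactivation_efficiency(k_inact, k_i):
    """Second-order inactivation constant k_inact/K_i in M^-1 s^-1."""
    return k_inact / k_i


def inactivation_rate(inhibitor, substrate, k_inact=K_INACT, k_i=K_I, k_m=K_M):
    """ASSUMED rate law in which substrate competes with the inhibitor.

    j_s = k_inact*[I] / (K_i*(1 + [S]/K_m) + [I]), concentrations in M.
    """
    return k_inact * inhibitor / (k_i * (1.0 + substrate / k_m) + inhibitor)


efficiency = inactivation_efficiency(K_INACT, K_I)
j_s = inactivation_rate(inhibitor=5e-6, substrate=64e-6)
print(f"k_inact/K_i = {efficiency:.0f} M^-1 s^-1")
print(f"j_s at 5 uM APE, 64 uM peptide = {j_s * 60:.2f} min^-1")
```

Under these assumptions the ratio comes to roughly 1.9 × 10^3 M−1 s−1, in line with the reported 1900(±400) M−1 s−1, and the predicted rate at 5 μM APE and 64 μM peptide is about 0.3 min−1, close to the 0.29 min−1 obtained from the progress curve.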

We hypothesized that an APE possessing an aza-glutamine (AGln) as the P1 residue to mimic the S1 specificity of SARS-CoV Mpro for Gln (Figure 1(a)) would irreversibly inhibit the peptidase. Accordingly, we synthesized Cbz-Leu-Phe-AGln-(S,S)EP-COOEt and Cbz-Leu-Phe-AGln-(R,R)EP-COOEt. Micromolar quantities of the S,S diastereomer strongly inhibited the cleavage of a peptidic substrate, manifesting itself as a pronounced slowing of the reaction velocity as the reaction progressed (Figure 1(b)). Under these conditions, the R,R diastereomer did not detectably inhibit Mpro. Analysis of the rates of inactivation (j s) at different concentrations of substrate and inhibitor indicated that APE inhibited Mpro with a k inact/K i = 1900(±400) M−1 s−1.

Table 1, footnote c: R work = Σ||F o| − |F c||/Σ|F o|, where |F o| and |F c| are the observed and calculated structure factor amplitudes of a particular reflection, respectively, and the summation is over 95% of the reflections in the specified resolution range. The remaining 5% of the reflections were randomly selected before the structure refinement and not included in the structure refinement. R free was calculated over these reflections using the same equation as for R work. 49
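The R work definition given in the Table 1 footnote can be written out as a short function. The sketch below is a minimal illustration, assuming the amplitudes are already on a common scale; the function name `r_factor` and the amplitude values are hypothetical and are not data from this study.

```python
def r_factor(f_obs, f_calc):
    """R = sum(| |Fo| - |Fc| |) / sum(|Fo|) over the supplied reflections."""
    if len(f_obs) != len(f_calc) or not f_obs:
        raise ValueError("need two non-empty amplitude lists of equal length")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    denominator = sum(abs(fo) for fo in f_obs)
    return numerator / denominator


# Hypothetical amplitudes for illustration only.
# "work" stands for the reflections used in refinement, "free" for the
# randomly selected test set that was held out of refinement.
work_obs, work_calc = [100.0, 50.0, 25.0, 80.0], [90.0, 55.0, 20.0, 85.0]
free_obs, free_calc = [60.0, 40.0], [50.0, 46.0]

r_work = r_factor(work_obs, work_calc)
r_free = r_factor(free_obs, free_calc)
print(f"R_work = {r_work:.3f}, R_free = {r_free:.3f}")
```

The same function serves for both statistics; only the set of reflections passed in differs.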

We have determined the crystal structures of SARS-CoV Mpro in three forms: wild-type peptidase in the absence and presence of APE in the space group C2, and a variant of the peptidase with an Ala added to the N terminus of the wild-type sequence, Mpro+A(−1), bound by APE in the space group P212121. The parameters and statistics derived from X-ray diffraction data processing and structure refinement are summarized in Table 1. For unbound Mpro and the Mpro:APE complex in the space group C2, each asymmetric unit has only one protomer of the dimer. The two protomers of each dimer are related by the crystallographic 2-fold symmetry (Figure 2). All residues of the protomer (residues 1 to 306) were identified in the electron density maps. In the Ramachandran plot for the structure of unbound Mpro, Asp33, Ala46 and Glu47 are in the generously allowed regions, whereas Asn84, Tyr154 and Ile286 are in the disallowed regions. The Asp33 Oδ2 atom forms a hydrogen bond with the Tyr101 phenolic OH group (2.9 Å). The Asn84 Nδ2 atom forms a hydrogen bond with the Glu178 carbonyl O atom (3.2 Å), and possibly there are hydrogen bonds and van der Waals forces between the side-chains of Asn84 and Lys180 as well. Hydrophobic interactions occur between the side-chains of Thr285 and Ile286 of opposite protomers at the dimer interface. The electron densities for the side-chains of Ala46, Glu47 and Tyr154 are not well defined. Similarly, in the Ramachandran plot for the structure of the Mpro:APE complex, Asp33, Asn84, Tyr154 and Asn277 are in the generously allowed regions […]. For the Mpro+A(−1):APE complex in the space group P212121, there is a dimer in each asymmetric unit. Only the residues 1A to 304A and residues 1B to 300B were identified in the electron density maps. In the Ramachandran plot for this structure, Asp33 and Asn84 of both protomers are in the generously allowed regions, whereas Tyr154 and Ile286 of both protomers are in the disallowed regions. The electron density for Tyr154 is not well defined. Superimposition (see Materials and Methods) of protomers A and B yielded a root-mean-square difference (rmsd) of 0.29 Å. Positional differences (up to 4.18 Å) occur mainly among the N and C-terminal residues as well as those poorly defined residues on the flexible loops. The three structures are in close agreement (Table 2). With regard to the protomer orientation and protein fold, these structures, in general, are identical with the crystal structures of Mpro previously determined in the space group P21. 13
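To make the two superimposition statistics concrete, the sketch below computes an rmsd and a maximum positional difference for two matched sets of atomic coordinates. It assumes the coordinates have already been superimposed (the fitting step performed by ALIGN in this study is not reproduced here), and the coordinates and function names are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square difference between matched (x, y, z) positions."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = sum(math.dist(a, b) ** 2 for a, b in zip(coords_a, coords_b))
    return math.sqrt(total / len(coords_a))


def max_displacement(coords_a, coords_b):
    """Largest distance between any pair of matched atoms."""
    return max(math.dist(a, b) for a, b in zip(coords_a, coords_b))


# Hypothetical, already-superimposed main-chain atoms (in Angstrom).
protomer_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
protomer_b = [(0.0, 0.0, 0.3), (1.0, 0.4, 0.0), (0.0, 1.0, 0.0)]

print(f"rmsd = {rmsd(protomer_a, protomer_b):.2f} A")
print(f"max difference = {max_displacement(protomer_a, protomer_b):.2f} A")
```

A small rmsd alongside a much larger maximum difference, as reported for protomers A and B, indicates that the deviations are concentrated in a few atoms rather than spread over the whole fold.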

Crystals of SARS-CoV Mpro and Mpro+A(−1) were soaked in the solutions of Cbz-Leu-Phe-AGln-(S,S)EP-COOEt, Cbz-Leu-Phe-AGln-(R,R)EP-COOEt, and a racemic mixture of the S,S and R,R diastereomers (trans). Only the S,S diastereomer showed up in the electron density maps. APE binds in the substrate-binding regions of Mpro (Figure 3(a) and (b)). As visualized in all three structures, the residues forming the substrate-binding regions of both protomers of the peptidase are in the catalytically competent conformation, similar to their counterparts in the structures of the main peptidases from other coronaviruses, 15, 29 and to those in protomer A of the P21 structures of SARS-CoV Mpro. 13 In the structure of unbound Mpro, the catalytic dyad has a distance of 3.7 Å between the His41 Nε2 atom and the Cys145 Sγ atom, and the Cys145 Sγ atom is coplanar with the atoms of the His41 imidazole ring. Superimposition of the structures of unbound Mpro and the Mpro:APE complex shows that the binding of APE does not cause any major changes in the structure of the peptidase (Table 2). The Cys145 Cα-Cβ bond undergoes a 95° rotation (χ1: from −64.0° to +30.7°) accompanying the formation of a covalent bond with a distance of 2.01 Å between the Cys145 Sγ atom of the peptidase and the epoxide C3 atom of APE (Figure 4(a) and (b)). The length of a C-S single bond is normally about 1.8 Å. However, with an estimated overall coordinate error (based on maximum likelihood) of 0.12 Å for the structure of the Mpro:APE complex, the difference between the refined and expected distances (0.2 Å) is not considered significant. This new covalent bond makes a torsion angle, […] (Figure 5(b)). With the error in atomic coordinates considered, the trigonal plane centered at the P1-AGln Nα atom is coplanar with that centered at the P1-AGln carbonyl C atom, allowing the Nα atom to reduce the electrophilicity of the carbonyl C atom by π-electron delocalization. The equivalent to […]

Compared to the structure of the Mpro:APE complex, the structure of the Mpro+A(−1):APE complex shows some differences in the geometry of binding. In the latter, the rotation of the Cys145 Cα-Cβ bond of the peptidase reaches a more positive value of χ1 (protomer A, 47.5°; protomer B, 46.5°). The length of the covalent bond between the Cys145 Sγ atom of the peptidase and the epoxide C3 atom of APE is 2.09 Å in protomer A and 2.05 Å in protomer B (Figure 4(a) to (c)). Note that the estimated overall coordinate error (based on maximum likelihood) for this structure is 0.17 Å. O=C(P1-AGln)-C3(epoxide)-Sγ(Cys145) makes a torsion angle of 72.0° in protomer A and 85.5° in protomer B (Figure 5(g)). In both protomers, the configurations of the C2 and C3 atoms of APE are inverted from S,S to R,R (Figure 5 […] Figure 4(a) to (c)).
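Torsion angles such as χ1 (defined for cysteine by the atoms N, Cα, Cβ and Sγ) are computed from four atomic positions. The sketch below is one standard way of doing this in plain Python, with the sign following the usual convention that a clockwise rotation of the front bond onto the back bond is positive. The helper names and the test coordinates are hypothetical; only the χ1 values in the final lines come from the text.

```python
import math


def _sub(p, q):
    return tuple(a - b for a, b in zip(p, q))


def _dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def _cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dihedral(p1, p2, p3, p4):
    """Signed torsion angle in degrees about the p2-p3 bond."""
    b1, b2, b3 = _sub(p2, p1), _sub(p3, p2), _sub(p4, p3)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)  # normals of the two planes
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))


# Hypothetical geometry: a right-angle (+90 degree) torsion.
angle = dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1))

# Change in Cys145 chi1 on APE binding, from the values quoted in the text.
chi1_unbound, chi1_complex = -64.0, 30.7
rotation = chi1_complex - chi1_unbound
print(f"test torsion = {angle:.1f} deg, chi1 rotation = {rotation:.1f} deg")
```

The difference of 94.7° between the two χ1 values is the rotation quoted as 95° for the Mpro:APE complex.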

The predominant S1 specificity of SARS-CoV Mpro for Gln is determined primarily by His163. In the structure of unbound Mpro, the His163 Nε2 atom interacts with a chloride ion at a distance of 3.3 Å and in the plane of the His163 imidazole ring, whereas in the structures of the Mpro:APE and Mpro+A(−1):APE complexes, the chloride ion is displaced by the P1-AGln side-chain amide group of APE with its Oε1 atom accepting a hydrogen bond from the His163 Nε2 atom of the peptidase (2.6 Å to 2.8 Å). Additional hydrogen bonds may be donated, though not of ideal geometry, by the P1-AGln Nε2 atom of APE to the Phe140 carbonyl O atom and the Glu166 Oε2 atom of the peptidase (Figure 4(b) and (c)). In both protomers of all three structures, the Phe140 phenyl ring interacts with the His163 imidazole ring through π-stacking (distance between the geometric centers of the rings: 3.7 to 3.8 Å), and the latter is properly oriented for its Nδ1 atom to accept a hydrogen bond from the Tyr161 phenolic OH group (2.9 to 3.1 Å). This hydrogen bond maintains the neutral tautomeric state of the His163 imidazole ring with its Nε2 atom protonated over a broad range of pH. This is crucial for the interaction of the His163 of the peptidase with the P1-Gln of the substrate. 13, 15, 29 In each peptidase dimer, the integrity of the S1 specificity pocket in one protomer requires the protonated amino group of Ser1 from the other protomer. This residue is at the tip of the N-finger (N-terminal residues 1 to 7) propagating between the domain III of its parent protomer and the domain II of the partner protomer. In the structures of unbound Mpro and the Mpro:APE complex, the Ser1 amino group of each protomer forms an ionic interaction with the Glu166 side-chain carboxyl group of the other protomer (2.7 Å). The Ser1 of each protomer also interacts with the Phe140 of the other protomer through amide hydrogen-carbonyl oxygen hydrogen bonding (3.2 Å). These three residues are thus held together to form the "floor" of the S1 specificity pocket (Figure 6(a)). In both protomers of the Mpro+A(−1):APE complex, however, Ser1 is N-terminally blocked by the additional Ala, and the ionic interaction between the Ser1 amino group and Glu166 side-chain carboxyl group is lost. This is also followed by the loss of the amide hydrogen-carbonyl oxygen hydrogen bond between Ser1 and Phe140 because Ser1 is no longer properly oriented. The additional Ala is disordered, leaving Ser1 unanchored and the floor of the S1 specificity pocket partly disrupted (Figure 6(b)).

Despite the disruption of the S1 pocket by the presence of a single additional N-terminal residue, the presence of a ten-residue affinity tag at the N terminus of Mpro reduced the specific activity of the peptidase by less than an order of magnitude (results not shown). Mpro has greatest preference for Leu and Ile at P2, followed by Phe, Val and Met in that order. 7, 9 In the structures of the Mpro:APE and Mpro+A(−1):APE complexes, the P2-Phe side-chain of APE fits snugly in the S2 specificity pocket of the peptidase, where the interactions are mainly hydrophobic. The P2-Phe phenyl ring of APE interacts with the His41 imidazole ring of the peptidase through π-stacking (distance between the geometric centers of the two rings: 4.3 to 4.6 Å). Superimposition of the structures of unbound Mpro and the Mpro:APE complex shows that, upon the binding of APE, the side-chain of Met49 undergoes a large conformational change, thereby opening the S2 specificity pocket for the P2-Phe side-chain of APE. Also, the side-chain of Gln189 is no longer disordered after its reorientation and formation of a hydrogen bond through its Oε1 atom with the P2-Phe amide hydrogen atom of APE (2.8 Å to 3.4 Å). This appears to secure the P2-Phe of APE in the S2 specificity pocket of the peptidase (Figures 3(a) and (b) and 4(a)).
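The ring-to-ring distances quoted for the π-stacking interactions are distances between the geometric centers of the ring atoms. A minimal sketch of that measurement follows, assuming idealized planar rings; the ring coordinates and function names are hypothetical and do not come from the deposited structures.

```python
import math


def centroid(points):
    """Geometric center of a list of (x, y, z) positions."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(3))


def ring_center_distance(ring_a, ring_b):
    """Distance between the geometric centers of two rings."""
    return math.dist(centroid(ring_a), centroid(ring_b))


def regular_ring(n_atoms, radius, z):
    """Hypothetical planar ring of n_atoms centered on the z axis."""
    step = 2.0 * math.pi / n_atoms
    return [(radius * math.cos(k * step), radius * math.sin(k * step), z)
            for k in range(n_atoms)]


# Idealized six-membered (phenyl-like) and five-membered (imidazole-like)
# rings stacked face to face, 4.3 Angstrom apart along z.
phenyl = regular_ring(6, 1.39, 0.0)
imidazole = regular_ring(5, 1.15, 4.3)

distance = ring_center_distance(phenyl, imidazole)
print(f"ring center distance = {distance:.2f} A")
```

In a real structure the ring atoms would be read from the coordinate file, and the relative tilt and offset of the rings would also need to be considered before calling the contact a stacking interaction.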

In the structures of the Mpro:APE and Mpro+A(−1):APE complexes, the P3-Leu side-chain of APE extends into the solvent and does not have any significant interactions with the peptidase (Figures 3(a) and (b) and 4(a)). This lack of interactions is consistent with the fact that no S3 specificity of Mpro could be established. 7 The aza-peptide component of APE consists of only three residues, so its benzyloxycarbonyl (Cbz) group partly takes up the space for the P4 residue of a substrate. Mpro has a shallow S4 specificity pocket that accommodates small side-chains (Ser, Thr, Val, Pro and Ala). 7 In the Mpro:APE complex and protomer B of the Mpro+A(−1):APE complex, the binding of the Cbz group of APE does not use the S4 specificity pocket of the peptidase. The position for the Cα atom of a P4 residue of a substrate is occupied by the O2 atom in the Cbz group of APE. In this conformation, the benzyl group of APE makes contacts with Pro168 and with residues 190 to 192 of the peptidase; the Cbz group is exposed to the solvent (Figures 3(a) and 4(b)). In contrast, in protomer A of the Mpro+A(−1):APE complex, the benzyl group of APE squeezes into and thereby widens the S4 specificity pocket of the peptidase, so that it is snugly accommodated in this enlarged pocket now formed by the residues 165 to 168, Phe185, Gln192 and the main-chain atoms of Val186 (Figures 3(b) and 4(c)).

The S1′ specificity pocket of Mpro is also shallow, accommodating only small side-chains (Ser, Ala, Gly, Asn and Cys). 7, 9 In the design of APEs, several different epoxide derivatives were attached to the aza-peptide component to modulate the interactions of APE with the S1′ specificity pocket of a clan CD cysteine peptidase. 24, 26, 27 In the structures of the Mpro:APE and Mpro+A(−1):APE complexes, however, the S1′ specificity pocket of the peptidase is apparently not used in the binding of APE. The epoxide C2 atom of APE sits close to the position for the Cα atom of the P1′ residue of a substrate and the hydroxyl group on the C2 atom is exposed to the solvent. The ethyl ester group of APE lies against the "ceiling" of the active site lined by Leu27, Pro39, His41, and the peptide group between His41 and Val42.

The structures reported here show similar features at their dimer interfaces. The solvent-accessible surface area (per protomer) buried upon dimerization of SARS-CoV Mpro is from 1250 Å² to 1260 Å². In the structures of unbound Mpro and the Mpro:APE complex, the crystallographic 2-fold axis passes through the dimer interface and brings the opposite interacting residue-pairs into exact 2-fold symmetry (Figure 2), whereas in the structure of the Mpro+A(−1):APE complex, even in the absence of a crystallographic 2-fold axis, the dimer interface still exhibits approximate 2-fold symmetry. In all three structures, the dimer interface concentrates on one face of each protomer: that containing the residues on the N-finger, and domains II and III. The majority of the interactions occur between the residues on the N-finger and domain II. Two ionic interactions, one between the Ser1 amino group and the Glu166 side-chain carboxyl group of opposite protomers, and one between the side-chains of the Arg4 and Glu290 of opposite protomers, are observed in the structures of unbound Mpro and the Mpro:APE complex (Figure 6(a)). However, in the structure of the Mpro+A(−1):APE complex, only the latter ion pair is observed (Figure 6(b)). In contrast to the N-finger immobilized at the dimer interface, the C-terminal loop (residues 301 to 306) is highly mobile as it is exposed to the solvent and anchored by only three or four solvent-exposed hydrogen bonds to some residues along the rim of the dimer interface. Interestingly, in the structures of unbound Mpro and the Mpro:APE complex, the C-terminal loop of one protomer extends towards the S1 specificity pocket of the other protomer, whereas in the structure of the Mpro+A(−1):APE complex, the C-terminal loop of protomer A (the only one visualized in this structure) propagates away from the S1 specificity pocket of protomer B.

The kinetic data and crystal structures reported herein indicate that APEs have excellent potential as inhibitors of SARS-CoV Mpro and are worthy of further evaluation in the development of lead compounds for anti-SARS agents. Thus, the k inact/K i of Mpro for Cbz-Leu-Phe-AGln-(S,S)EP-COOEt is similar in magnitude to that of the first generation APEs produced to inhibit other cysteine peptidases. 24, 26, 27 Optimization of the latter has yielded inhibitors of caspases with k inact/K i well over 10^6 M−1 s−1. 27 The excellent specificities of APEs for clan CD cysteine peptidases 24, 26, 27 suggest that APEs have better potential as inhibitors of Mpro than do chloromethyl ketones (CMKs), the first class of potential inhibitors proposed on a structural basis. The structure of the Mpro:CMK complex previously determined in the space group P21 shows different and unexpected modes of binding for CMK to the two protomers of the peptidase. 13 CMKs are highly active alkylating agents and react with good nucleophiles, such as hydroxyl and thiol groups. They therefore inhibit serine peptidases as well as cysteine peptidases. 28 A recent study showed that CMKs efficiently inhibit some clan CA cysteine peptidases, such as papain and the cathepsins. 30 This casts doubt on the utility of CMKs as specific inhibitors for Mpro. In contrast, the structures of unbound Mpro and the Mpro:APE complex show that the aza-peptide component of APE binds to the peptidase in a substrate-like manner. The main-chain of the aza-peptide component of APE forms amide hydrogen-carbonyl oxygen hydrogen bonds with the main chain of the residues 164 to 166 of the peptidase in the manner of an anti-parallel β-sheet. The P1 and P2 side-chains of APE occupy the S1 and S2 specificity pockets of the peptidase, respectively. Based on the definitions for the binding of epoxysuccinyl peptides to clan CA cysteine peptidases, 28 this corresponds to the S and S′ binding mode, with inclination to the S binding mode because the pre-cleavage portion of the substrate-binding region of the peptidase makes the major contribution to Mpro:APE interactions.

The structures of the Mpro:APE and Mpro+A(−1):APE complexes substantiate the mechanism by which APEs have been proposed to irreversibly inhibit their target peptidases (Figure 1(a)). Nucleophilic attack at the epoxide C3, rather than the C2, atom of APE, by the Cys145 Sγ atom is consistent with the expected transition-state geometry for proteolysis catalyzed by Mpro. In caspase-3, the epoxide C3 atom is attacked (M. Grutter, unpublished results), whereas in caspase-1, the C2 atom is attacked (R. Rubin, unpublished results). In the case of epoxysuccinyl peptides binding to clan CA cysteine peptidases, the position of attack depends on the orientation of the epoxysuccinyl peptide in the substrate-binding region. 28 E-64 binds to papain in the S binding mode, similar to the mode of APE binding to Mpro. In the papain:E-64 complex, however, the epoxide C2 atom is the one attacked. 31 Nucleophilic attack at the epoxide C3 atom is observed in the S′ binding mode, as exemplified by CA-074 binding to cathepsin B. 32 The kinetic data and crystal structures indicate that Mpro reacts only with the S,S diastereomer of the APE and not its R,R diastereomer. Interestingly, the order of inhibitory activities for APEs towards most clan CD cysteine peptidases is S,S > R,R > trans > cis (the racemic mixture of the S,R and R,S diastereomers). 24, 26, 27 Based on the Mpro:APE structures, we built models of each of the four diastereomers of APE at the active site of Mpro to explain the results of our trials (Figure 7(a) to (d)). These models show that, for APE to be accommodated in the substrate-binding regions of the peptidase, the epoxide C3 atom of APE must be in the S configuration, otherwise the epoxide moiety sterically clashes with the "back-wall" of the active site of the peptidase and with the aza-peptide component of APE itself (Figure 7(b) and (c)). In the S configuration, the epoxide C3 atom of APE is also in better geometry with respect to the Cys145 Sγ atom for the nucleophilic attack. The epoxide C2 atom of APE must be in the S configuration as well to allow the interactions between the epoxide moiety and the active site of the peptidase, otherwise the epoxide moiety sterically clashes with the loop constituting the oxyanion hole of the peptidase (Figure 7(d)). The model with the S,S diastereomer of APE also indicates that the distance between the His41 Nε2 atom of the peptidase and the epoxide O atom is 4 Å to 5 Å, and that these two atoms are not well aligned for proton transfer (Figure 7(a)). This suggests that the opening of the epoxide ring likely involves two steps separated by a conformational rearrangement of APE: (1) the protonation of the epoxide O atom and (2) the nucleophilic attack at the epoxide C3 atom. From the model, it is not possible to determine the order in which these steps occur. However, it would be energetically more favorable for protonation to be the first step. On the other hand, Mpro may be sufficiently flexible in solution to allow the alignment of the His41 Nε2 atom and the epoxide O atom for proton transfer. In such a scenario, protonation and nucleophilic attack could occur in a concerted manner, enabling the epoxide ring to open in a single step. Much of the mechanism for the inhibition of SARS-CoV Mpro by APE remains to be elucidated. Rigorous treatment of this issue using methodologies in organic chemistry will be required.

All three structures successfully visualize the N-fingers of both protomers of Mpro. Despite the appreciable participation by the N-fingers in dimer interactions, it was shown experimentally that the deletion of the N-fingers inactivates the peptidase but has little effect on its dimerization properties. Molecular dynamics simulations suggested that the main role of the N-finger is to direct the peptidase to dimerize at an orientation facilitating the formation of substrate-binding regions in the catalytically competent conformation. 10 This very likely relies on the two ionic interactions observed in the structures of unbound Mpro and the Mpro:APE complex, one between the protonated amino group of Ser1 and the Glu166 side-chain carboxyl group of opposite protomers, and one between the side-chains of the Arg4 and Glu290 of opposite protomers. In the structure of the Mpro+A(−1):APE complex, the former ion-pair does not exist but both substrate-binding regions are still capable of accommodating APE in a manner similar to that exhibited by the structure of the Mpro:APE complex. This suggests that the former ionic interaction, possibly because it is relatively accessible to the solvent and therefore weaker, is of less importance.

The crystal structure of Mpro previously determined in the space group P21 at pH 6.0 showed the collapse of the active site and S1 specificity pocket of one of the protomers, whereas the P21 structures at pH 7.6 and 8.0 showed the recovery of the collapsed parts. Based on this trend, a pH-triggered switch for the catalytic activity of the peptidase was proposed. 13 In our study, all crystals were grown at pH 6.5 and the resulting structures show that the substrate-binding regions of both protomers are in the catalytically competent conformation. This suggests the possibility of an alternative or additional mechanism underlying the pH-dependence of the activity of the peptidase, especially at pH 6.5 or above. The change in protonation/deprotonation state of the catalytic dyad with pH is one of the possible second mechanisms. Insights into this possibility could be provided by the direct determinations of the pKa values of the catalytic dyad by nuclear magnetic resonance (NMR) spectroscopy, as exemplified by the similar studies of the catalytic triad of α-lytic peptidase. 33

Mpro+A(−1) was cloned, overexpressed and purified as described. 21 A clone expressing Mpro was generated using oligonucleotide-directed mutagenesis to delete the codon corresponding to the N-terminal Ala of Mpro+A(−1). Using this clone, Mpro was overexpressed and purified essentially as described for Mpro+A(−1). Cbz-Leu-Phe-AGln-(S,S)EP-COOEt and Cbz-Leu-Phe-AGln-(R,R)EP-COOEt were synthesized using the methods established to synthesize other APEs 24,26,27 with minor modifications.

Enzymatic activity was measured by following the increase in fluorescence due to the cleavage of a fluorogenic peptide: Abz-Ser-Val-Thr-Leu-Gln-Ser-Gly-(NO 2 )Tyr-Arg, where Abz is aminobenzoate and (NO 2 )Tyr is nitrotyrosine. Fluorescence was measured using a Cary Eclipse Fluorescence spectrophotometer (Varian Canada, Mississauga, Ontario, Canada) equipped with a circulating water-bath. Experiments were performed using a 100 ml quartz cuvette. The standard assay contained 25 nM M pro , 20 mM Bis-Tris (pH 7.0), 2 mM DTT, and was performed at 37.0(G0.1) 8C. The reaction was monitored using an excitation wavelength of 320 nm (5 nm bandpass) and an emission wavelength of 420 nm (10 nm bandpass). Initial velocities were determined from a least-squares analysis of the linear portion of the progress curves (at least 1 min) using Excel 2003 (Microsoft, Redmond, WA). All rates were corrected for the inner filter effect using an empirical correction. 34 In inhibition studies, the concentration of APE was varied from 0 mM to 10 mM and the concentration of peptidic substrate was varied from 16 mM to 100 mM. These substrate and inhibitor concentrations were dictated by solubility limitations and the observed rates of inhibition. The rate of inactivation at each concentration of substrate and inhibitor, j s , was determined by fitting equation (1) 35 to the corresponding progress curve using SCIENTIST version 2.01 (Micromath Scientific Software, Salt Lake City, UT). The parameters of inactivation, k inact and K i , were evaluated by fitting equation (2) to the j s obtained at each concentration of S and I, 36 using the least-squares and dynamic weighting options of LEONORA. 37 Crystallization, crystal soaking and cryo-protection Before crystallization, both SARS-CoV M pro and M pro CAðK1Þ were dialyzed against 20 mM NaCl, 20 mM Tris-HCl (pH 7.5), and concentrated to 10 mg/ml. All crystals were grown at ambient temperature by the hanging-drop, vapor-diffusion method. 
For the C2 crystals, the reservoir solution contained 50 mM ammonium acetate, 5% (w/v) polyethylene glycol (M r 10,000), 3% ethylene glycol, 3% dimethyl sulfoxide, 1 mM dithiothreitol and 0.1 mM Mes (pH 6.5). The drop contained equal amounts of the M pro solution and the reservoir solution. Block-shaped crystals grew in two to three days to a size of about 0.1 mm × 0.1 mm × 0.1 mm. For the P2₁2₁2₁ crystals, the reservoir solution had essentially the same composition as that for the C2 crystals, except for the replacement of 5% polyethylene glycol (M r 10,000) and 3% dimethyl sulfoxide by 6% polyethylene glycol (M r 8000). The drop contained equal amounts of the M pro +A(−1) solution and the reservoir solution. Needle-shaped crystals grew in three to five days to a size of about 0.05 mm × 0.05 mm × 0.5 mm. Crystals of good quality were selected and soaked overnight in drops with the same compositions as their reservoir solutions plus the APE chosen for this study at 3 mM. Cryo-protectants had essentially the same compositions as the reservoir solutions, except for the inclusion of 25% (v/v) glycerol in the case of the C2 crystals and the increase of ethylene glycol to 25% in the case of the P2₁2₁2₁ crystals. Crystals were briefly soaked and then immediately frozen in liquid nitrogen for storage and shipment to the synchrotron beamline.

Data collection and processing, structure solution, refinement and analysis

The X-ray diffraction data from all crystals were collected at the synchrotron Beamline 8.3.1 (equipped with an ADSC-Q210 CCD detector) at the Advanced Light Source in the Lawrence Berkeley National Laboratory. All data sets were indexed, scaled and merged using DENZO and SCALEPACK. 38 Structure solution and refinement were carried out in CCP4. 39,40 All structures were solved by the molecular replacement method, 41 using the structure of unbound SARS-CoV M pro at pH 8.0 in the space group P2₁ (PDB accession code 1UK2) 13 as the search model for the structure of unbound M pro in the space group C2, and the structure of unbound M pro in the space group C2 as the search model for the structure of the M pro:APE complex in the space group C2 and the structure of the M pro +A(−1):APE complex in the space group P2₁2₁2₁. The structures of unbound M pro and the M pro:APE complex were solved using AMoRe, 42 and the structure of the M pro +A(−1):APE complex was solved using MOLREP. 43 APE was located as outstanding electron density in the substrate-binding region of the peptidase in both the F o − F c (contoured at 3σ and 4σ) and 2F o − F c (contoured at 1σ) maps for each structure. All structures were iteratively refined using REFMAC 44 and adjusted using XtalView/Xfit. 45 The stereochemical qualities of the final structures were assessed using PROCHECK. 46 Graphical representations of the structures were prepared using PyMOL †. Superimpositions of structures were carried out using ALIGN, 47,48 based on the main-chain atoms (amide N, Cα, and carbonyl C and O). The surface areas of structures were calculated using NACCESS. ‡ APE-peptidase interactions and dimer interactions were analyzed using LIGPLOT and DIMPLOT, 49 respectively.
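The main-chain atom selection used for the superimpositions above can be sketched in Python as follows. This is not a reimplementation of ALIGN: it only selects the N, Cα, C and O atoms from PDB-format ATOM records and reports the root-mean-square deviation between two coordinate sets that are assumed to be already superimposed. The function names are hypothetical, and alternate conformations and insertion codes are ignored for brevity.

```python
import math

# Main-chain atom names as they appear in PDB ATOM records.
MAIN_CHAIN = {"N", "CA", "C", "O"}


def main_chain_coords(pdb_lines):
    """Map (chain, residue number, atom name) to (x, y, z) for main-chain atoms.

    Uses the fixed-column PDB layout: atom name in columns 13-16, chain in
    column 22, residue number in columns 23-26, coordinates in columns 31-54.
    """
    coords = {}
    for line in pdb_lines:
        if not line.startswith("ATOM"):
            continue  # skip HETATM, REMARK and other records
        name = line[12:16].strip()
        if name not in MAIN_CHAIN:
            continue
        key = (line[21], int(line[22:26]), name)
        coords[key] = (float(line[30:38]), float(line[38:46]), float(line[46:54]))
    return coords


def rmsd(coords_a, coords_b):
    """RMSD over the atoms present in both sets; no fitting is performed."""
    shared = sorted(set(coords_a) & set(coords_b))
    if not shared:
        raise ValueError("no shared main-chain atoms")
    total = sum(math.dist(coords_a[k], coords_b[k]) ** 2 for k in shared)
    return math.sqrt(total / len(shared))
```

Restricting the comparison to main-chain atoms keeps the measure insensitive to side-chain rotamer differences, so that it reports backbone displacements such as those between protomers or between unbound and inhibitor-bound structures.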

The atomic coordinates and structure factors of all structures have been deposited in the RCSB Protein Data Bank. The accession codes are 2A5A for the structure of unbound SARS-CoV M pro, 2A5I for the structure of the M pro:APE complex and 2A5K for the structure of the M pro +A(−1):APE complex.

Molecular mechanisms of severe acute respiratory syndrome (SARS)

Coronavirus as a possible cause of severe acute respiratory syndrome

Identification of a novel coronavirus in patients with severe acute respiratory syndrome

A novel coronavirus associated with severe acute respiratory syndrome

The clinical pathology of severe acute respiratory syndrome (SARS): a report from

Viral replicase gene products suffice for coronavirus discontinuous transcription

Mechanisms and enzymes involved in SARS coronavirus genome expression

Virus-encoded proteinases and proteolytic processing in the Nidovirales

Biosynthesis, purification, and substrate specificity of severe acute respiratory syndrome coronavirus 3C-like proteinase

Severe acute respiratory syndrome coronavirus 3C-like proteinase N terminus is indispensable for proteolytic activity but not for enzyme dimerization: biochemical and thermodynamic investigation in conjunction with molecular dynamic simulations

Quaternary structure of the severe acute respiratory syndrome (SARS) coronavirus main protease

Dissection study on the severe acute respiratory syndrome 3C-like protease reveals the critical role of the extra domain in dimerization of the enzyme: defining the extra domain as a new target for design of highly specific protease inhibitors

The crystal structures of severe acute respiratory syndrome virus main protease and its complex with an inhibitor

3C-like proteinase from SARS coronavirus catalyzes substrate hydrolysis by a general base mechanism

Coronavirus main proteinase (3CLpro) structure: basis for design of anti-SARS drugs

On the size of the active site in proteases

Identification of novel inhibitors of the SARS coronavirus main protease 3CLpro

Synthesis and evaluation of keto-glutamine analogues as potent inhibitors of severe acute respiratory syndrome 3CLpro

Small molecules targeting severe acute respiratory syndrome human coronavirus

Identification of novel small-molecule inhibitors of severe acute respiratory syndrome-associated coronavirus by chemical genetics

High-throughput screening identifies inhibitors of the SARS coronavirus main proteinase

Virtual screening of novel noncovalent inhibitors for SARS-CoV 3C-like proteinase

Generation of predictive pharmacophore model for SARS-coronavirus main proteinase

Aza-peptide epoxides: a new class of inhibitors selective for clan CD cysteine proteases

Evolutionary lines of cysteine peptidases

Aza-peptide epoxides: potent and selective inhibitors of Schistosoma mansoni and pig kidney legumains (asparaginyl endopeptidases)

Design, synthesis, and evaluation of aza-peptide epoxides as selective and potent inhibitors of caspases-1

Irreversible inhibitors of serine, cysteine, and threonine proteases

Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra alpha-helical domain

Inhibition of papain-like cysteine proteases and legumain by caspase-specific inhibitors: when reaction mechanism is more important than specificity

Crystal structure of a papain-E-64 complex

Substrate specificity of bovine cathepsin B and its inhibition by CA074, based on crystal structure refinement of the complex

Nitrogen-15 nuclear magnetic resonance spectroscopy of the state of histidine in the catalytic triad of α-lytic protease. Implications for the charge-relay mechanism of peptide-bond cleavage by serine proteases

Use of a fluorescence plate reader for measuring kinetic parameters with inner filter effect correction

Transient-phase kinetics of enzyme inactivation induced by suicide substrates

Fundamentals of Enzyme Kinetics

Analysis of Enzyme Kinetic Data

Processing of X-ray diffraction data collected in oscillation mode

The CCP4 suite: programs for protein crystallography

A graphical user interface to the CCP4 program suite

The detection of sub-units within the crystallographic asymmetric unit

AMoRe: an automated package for molecular replacement

MOLREP: an automated program for molecular replacement

Refinement of macromolecular structures by the maximum-likelihood method

XtalView/Xfit-A versatile program for manipulating atomic coordinates and electron density

PROCHECK: a program to check the stereochemical quality of protein structures

Phosphocholine binding immunoglobulin Fab McPC603: an X-ray diffraction study at 2.7 Å

ALIGN: a program to superimpose protein coordinates, accounting for insertions and deletions

LIGPLOT: a program to generate schematic diagrams of protein-ligand interactions