key: cord-0996867-l1qwfxvd authors: Kozak, Maciej; Jankowska, Elzbieta; Janowski, Robert; Grzonka, Zbigniew; Grubb, Anders; Alvarez Fernandez, Marcia; Abrahamson, Magnus; Jaskolski, Mariusz title: Expression of a selenomethionyl derivative and preliminary crystallographic studies of human cystatin C date: 2007-09-27 journal: Acta Crystallogr D Biol Crystallogr DOI: 10.1107/s090744499901121x sha: ea8d90b44f0f85f0f495de8b6ee5ebadf994c29d doc_id: 996867 cord_uid: l1qwfxvd Human cystatin C, a protein with amyloidogenic properties and a potent inhibitor of papain‐like mammalian proteases, has been produced in its full‐length form by recombinant techniques and crystallized in two polymorphic forms: cubic and tetragonal. A selenomethionyl derivative of the protein, obtained by Escherichia coli expression and with complete Met→Se‐Met substitution confirmed by mass spectrometry, amino‐acid analysis and X‐ray absorption spectra, was crystallized in the cubic form. A truncated variant of the protein, lacking ten N‐terminal residues, has also been crystallized. The crystals of this variant are tetragonal and, like the two polymorphs of the full‐length protein, contain multiple copies of the molecule in the asymmetric unit, suggesting oligomerization of the protein. Cystatins are single-chain reversibly binding protein inhibitors of enzymes belonging to the papain family of cysteine proteases. They have been found in plants and animals (Turk & Bode, 1991) and in protozoa (Irvine et al., 1992) . Human cystatin C (HCC) is a low molecular weight protein (13 343 Da) composed of 120 amino-acid residues (Grubb & Lo È fberg, 1982; Fig. 1 ). It has high af®nity for cathepsin B and other human cysteine proteases (reviewed in Abrahamson, 1993) . HCC plays an important role in the development of cerebral haemorrhage in patients with a hereditary form of amyloid angiopathy, where a variant of cystatin C is deposited as amyloid ®brils in the cerebral arteries. This molecular pathology, caused by a point mutation and resulting in an amino-acid substitution in the HCC sequence (Leu68Gln; Fig. 1 ), leads to brain haemorrhage and death in young adults Olafsson et al., 1996) . Crystallographic and NMR studies of three cysteine-protease inhibitors, chicken cystatin C (Bode et al., 1988; Dieckmann et al., 1993; Engh et al., 1993) , cystatin B (in complex with papain; Stubbs et al., 1990 ) and cystatin A (Martin et al., 1995) , have shown similar overall structure, with three regions implicated in interactions with the target enzymes. These regions include the N-terminal segment and two hairpin loops. The 11 N-terminal aminoacid residues of HCC are important for its very high af®nity binding to papain (K i = 11 fM; Lindahl et al., 1992) and to other cysteine proteases (Abrahamson et al., 1987 (Abrahamson et al., , 1991 . It has been shown that leukocyte elastase can speci®cally cleave the single N-terminal Val10±Gly11 bond of HCC, resulting in seriously compromised af®nities for such target enzymes as cathepsins B, H and L (Abrahamson et al., 1991) . Peptide inhibitors structurally based on the N-terminal segment of cystatin C show interesting biomedical activities, including protection against streptococcal infections in mice (Bjo È rck et al., 1989) and inhibition of the replication of coronavirus and herpes simplex virus (Collins & Grubb, 1991; Bjo È rck et al., 1990) . Development of a second generation of more effective speci®c cysteineprotease peptide inhibitors would be greatly facilitated by a three-dimensional structure of HCC. Similarly, such a model would be very useful in the elucidation of the pathophysiological background of the cerebral haemorrhage produced by the Leu68Gln variant of HCC. Our initial attempts to solve the crystal structure of HCC by molecular replacement using as a probe the available crystallographic model of N-truncated chicken cystatin C (Bode et al., 1988 ; 41% sequence identity, 62.5% homology) were unsuccessful. We have, therefore, produced an Se-Met derivative of HCC for crystal structure determination by the MAD method (Hendrickson, 1991) . Production of wild-type human cystatin C was accomplished in a previously described Escherichia coli expression system (Abrahamson et al., 1988) . For production of Se-Metenriched cystatin C, the plasmid pHD313 was transformed into competent E. coli BL834(DE3) cells (Novagen). Single colonies were picked and positive transformants were selected by their ability to express cystatin C in crystallization papers small-scale cultures (3 h induction at 315 K in LB medium) as assessed by SDS±polyacrylamide gel electrophoresis. A subclone of the transformed E. coli BL834(DE3) cells was grown in minimal medium composed of 87 mM Na 2 HPO 4 , 46 mM KH 2 PO 4 , 18 mM NaCl, 7.5 mM (NH 4 ) 2 SO 4 , 0.2%(w/v) glucose, 1.7 mM MgSO 4 , 0.117 mM CaCl 2 , 0.015 mM FeSO 4 , 0.075 mM thiamine and 0.3 mM methionine or selenomethionine (Sigma). Initial cultures were grown in minimal medium containing normal methionine. Prior to induction of expression, the cells were centrifuged for 15 min at 2700g and resuspended in Se-Metcontaining minimal medium. Growth conditions and conditions for induction of expression were as described previously (Hall et al., 1993) , except that the culture time after induction of expression was prolonged to 6 h. Normal or Se-Met-enriched cystatin C was puri®ed in a two-step procedure as described previously (Abrahamson et al., 1988) , resulting in a better than 95% pure protein preparation as assessed by SDS±polyacrylamide and agarose gel electrophoreses. The molecular mass of normal wild-type and Se-Met-enriched HCC was determined by electrospray mass spectrometry using a quadrupole mass spectrometer (Quattro; VG Masslab, Altrincham, Cheshire, England). The mass spectra of the selenomethionyl protein con®rmed that the three Met residues ( Fig. 1 ) have been fully substituted by Se-Met. A successful Met3Se-Met substitution was additionally con®rmed by analysing the amino-acid composition of the Se-Met protein using the Biochrom 20 amino-acid analyser (Pharmacia Biotech) and its program for analysis of physiological amino acids. The Se-Met protein was hydrolyzed in vacuo for 24 h in 6 M HCl. For crystallized Se-Met HCC, X-ray absorption spectra con®rmed the crystals to have a signi®cant selenium content. Cystatin C devoid of ten N-terminal residues was obtained by incubation of recombinant wildtype human cystatin C with leukocyte elastase and was isolated as described previously (Abrahamson et al., 1991) . Three variants of HCC were used in the present studies: (i) the native wild-type 120-residue protein, (ii) its Se-Met derivative and (iii) a truncated form of the native protein from which the ®rst ten N-terminal amino acids have been removed by processing with leukocyte elastase (Fig. 1) . Two crystal forms (cubic and tetragonal) of the full-length protein and one crystal form (tetragonal) of the N-truncated protein were obtained. Both forms of the full-length protein were produced at 277 K using the vapour-diffusion method and the hangingdrop technique. Initial crystallization conditions (precipitant, pH) were determined using the sparse-matrix method (Jancarik & Kim, 1991) and Crystal Screen (Hampton Research). Crystals of the cubic form were grown using protein concentration of 10 mg ml À1 . Lyophilized protein was dissolved in 100 mM sodium acetate buffer pH 4.8 containing 20 mM CaCl 2 . 4±6 ml droplets were equilibrated against 1 ml of reservoir with analogous buffer/CaCl 2 solution. After 2 d, the reservoir solution was supplemented with 100±200 ml MPD. Cubeshaped crystals (Fig. 2a) appeared within a week, reaching maximum dimensions of 0.5 Â 0.5 Â 0.5 mm after eight weeks. Crystals of the selenomethionyl derivative of HCC were grown under identical conditions. They reached maximum dimensions of 0.15 Â 0.15 Â 0.15 mm within about two weeks. The tetragonal form of full-length human cystatin C was obtained using a modi®cation of the conditions described above. Lyophilized protein was dissolved to a ®nal concentration of 15 mg ml À1 in a solution containing 100 mM sodium acetate buffer pH 4.8, 20 mM CaCl 2 and 40±45% MPD. 10 ml droplets were equilibrated against reservoir solution (buffer, CaCl 2 , MPD) for 2 d, after which time the concentration of MPD in the reservoir solution was increased to 50±60% (addition of 100±200 ml of MPD). Bipyramidal tetragonal crystals (Fig. 2b ) appeared after 4±8 weeks, reaching maximum dimensions of 0.25 Â 0.25 Â 0.4 mm after a further 12 weeks. Crystallization conditions for the N-truncated form of HCC were also determined using the sparse-matrix method and Crystal Screen. Protein samples (3 ml) at a concentration of 6 mg ml À1 were mixed with equal amounts of reservoir solution [0.4 M (NH 4 )H 2 PO 4 ]. Slab-like crystals (Fig. 2c ) grew in hanging drops at room temperature within two weeks to maximum dimensions of 0.2 Â 0.2 Â 0.1 mm. Diffraction data for the cubic form of the full-length protein were collected at 82 K using synchrotron radiation (Brookhaven National Laboratory, beamline X9B, ! = 0.98 A Ê ) and a 345 mm MAR Research image-plate scanner. The crystal was mounted in a nylon-®bre loop and¯ash-frozen in a nitrogen-gas stream (Teng, 1990) . A 3.1 A Ê data set was obtained by measuring the intensities of 68 891 re¯ections. The ®nal merged data set of 4071 unique re¯ections with I/'(I) > 0.0 is 93% complete (95% in the last resolution shell) and is characterized by R int = 0.086 and hI/'(I)i = 17.6. For the tetragonal form of full-length HCC, the data were collected at 120 K using synchrotron radiation (EMBL, c/o DESY, Hamburg, beamline BW7A, ! = 0.8885 A Ê ) and a 180 mm MAR Research image-plate scanner. The crystal was mounted and¯ash-frozen as above. 192 024 re¯ections with I/'(I) > 0.0 were measured to 3.03 A Ê resolution. After merging, they were reduced to a unique data set consisting of 12 273 re¯ections. This data set is 98% complete (98% in the last resolution shell) and is characterized by R int = 0.123 and hI/'(I)i = 14.0. X-ray diffraction data for the crystals of N-truncated HCC were measured at 293 K using a 300 mm MAR Research detector and Cu K radiation generated from an SRA2 rotating-anode generator (Siemens) operated at 45 kV and 112 mA. A total of 79 910 re¯ections with I/'(I) > 0.0 were measured and reduced to 22 081 unique data extending to 2.7 A Ê . This data set is 84% complete (91% in the last resolution shell), with R int = 0.108 and hI/'(I)i = 6.9. Indexing and integration of all images was performed in DENZO and scaling of the intensity data was performed in SCALEPACK; both are from the HKL program package (Otwinowski, 1993) . An X-ray absorption spectrum for selenomethionyl HCC was recorded using synchrotron radiation (EMBL, c/o DESY, Hamburg, beamline X31) and a crystalline sample of the protein (1 mm 3 ) packed in a thin-walled capillary. The spectrum had a steep absorption edge at ! e = 0.9803 A Ê (f H min = À8.87e) and an absorption peak at ! p = 0.9800 A Ê (f HH max = 9.75e). Amino-acid sequence of human cystatin C. The Met residues are shown in bold and the N-terminal residues, missing in the N-truncated variant, are shown in italics. Leu68, which in an amyloid-forming naturally occurring mutant is replaced by glutamine, is underlined. The masses, con®rmed by mass spectrometry, for the native and Se-Met full-length HCC (13 343 and 13 484 Da, respectively) indicate that in variant (ii) all three S-methionine residues in the sequence (Fig. 1) were substituted by Se-methionine. Additionally, the amino-acid compositions of normal wild-type and Se-Met-enriched cystatin C were analyzed by amino-acid analysis after hydrolysis. Se-Met (Sigma) showed a retention time in a coupled HPLC chromatogram that coincided with the retention time for leucine. Upon analysis, the hydrolyzed Se-Met-enriched recombinant cystatin C showed a complete loss of the methionine peak in the chromatogram, with a concomitant increase of the leucine peak compared with normal recombinant cystatin C. The X-ray absorption spectrum, in which the Se K absorption edge is clearly visible, also con®rms the presence of selenium. The cubic crystals of full-length HCC were very unstable in the X-ray beam at room temperature. An attempt to collect X-ray diffraction data at 297 K was unsuccessful, as the crystals rapidly lost their diffraction power and stopped diffracting within 1 h. Lowering the temperature of the environment to 282 K signi®cantly increased the stability of the crystals and allowed an incomplete data set to be collected to 4.5 A Ê from one crystal. At 82 K, the crystals were stable enough to allow the measurement of a complete data set to 3.1 A Ê resolution. These cubic crystals belong to the I432 space group and have a unit-cell parameter a = 140.4 A Ê . Attempts to collect useful MAD data for the cubic crystals of the selenomethionyl derivative of HCC have failed because of the very limited resolution (worse than 4.5 A Ê ). This might be attributable to the relatively small size of the Se-Met HCC crystals and to the weak intensity of the synchrotron beamline available for these experiments. The crystals of the tetragonal form of fulllength HCC were more stable in the beam. A room-temperature diffraction experiment resulted in an incomplete data set to 4.3 A Ê resolution, while at 120 K a complete data set to 3.03 A Ê resolution was obtained. The crystals belong to the P4 1 2 1 2 space group (or its enantiomorph P4 3 2 1 2) and have unit-cell dimensions a = 91.5, c = 144.5 A Ê . Crystals of the N-terminally truncated HCC were very stable in the X-ray beam and diffracted to 2.7 A Ê resolution. They belong to the tetragonal system, with unitcell parameters a = 68.8, c = 206.5 A Ê . The symmetry and systematic absences are consistent with the P4 2 space group. Analysis of the Matthews volume (Matthews, 1968) for the cubic form of fulllength HCC indicates rather unambiguously that two monomers of the protein are present in the asymmetric unit (2.16 A Ê 3 Da À1 ). In the large asymmetric unit of the tetragonal form of the full-length protein, between three (3.78 A Ê 3 Da À1 ) and seven (1.62 A Ê 3 Da À1 ) monomers can be accommodated. In the asymmetric unit of the tetragonal form of N-truncated HCC, between ®ve (3.66 A Ê 3 Da À1 ) and 11 (1.66 A Ê 3 Da À1 ) independent monomers may be present. The propensity of HCC to crystallize with multiple copies of the molecule in the asymmetric unit, in combination with the additional possibilities offered by the point-symmetry elements of the unit cells, may be indicative of the tendency of the protein to oligomerize. Such oligomerization might re¯ect the amyloid-forming property of Leu68Gln cystatin C, as earlier observations demonstrate that both wild-type and Leu68Gln-substituted cystatin C are capable of forming dimers (Abrahamson & Grubb, 1994; Ekiel & Abrahamson, 1996; Ekiel et al., 1997) . Proc. Natl Acad. Sci. USA, 91 Proc. Natl Acad. Sci. USA, 79 Single crystals of HCC: (a) cubic and (b) tetragonal for the full-length protein and (c) tetragonal for the N-terminally truncated protein Brain Pathol. 6, 121±126. Otwinowski