key: cord-0002878-lg93okwg authors: Aik, Wei Shen; Lin, Min-Han; Tan, Dazhi; Tripathy, Ashutosh; Marzluff, William F.; Dominski, Zbigniew; Chou, Chi-Yuan; Tong, Liang title: The N-terminal domains of FLASH and Lsm11 form a 2:1 heterotrimer for histone pre-mRNA 3’-end processing date: 2017-10-11 journal: PLoS One DOI: 10.1371/journal.pone.0186034 sha: 1a52cac849bdb5840d115cf513ec8cc63dc2d56c doc_id: 2878 cord_uid: lg93okwg Unlike canonical pre-mRNAs, animal replication-dependent histone pre-mRNAs lack introns and are processed at the 3’-end by a mechanism distinct from cleavage and polyadenylation. They have a 3’ stem loop and histone downstream element (HDE) that are recognized by stem-loop binding protein (SLBP) and U7 snRNP, respectively. The N-terminal domain (NTD) of Lsm11, a component of U7 snRNP, interacts with FLASH NTD and these two proteins recruit the histone cleavage complex containing the CPSF-73 endonuclease for the cleavage reaction. Here, we determined crystal structures of FLASH NTD and found that it forms a coiled-coil dimer. Using solution light scattering, we characterized the stoichiometry of the FLASH NTD-Lsm11 NTD complex and found that it is a 2:1 heterotrimer, which is supported by observations from analytical ultracentrifugation and crosslinking. In eukaryotic cells, histones play important roles in genomic DNA packaging as well as epigenetic regulation of gene expression. The levels of histone mRNAs are carefully controlled throughout the cell cycle and they dramatically increase during S phase to meet the growing demand for packaging the newly replicated DNA from the replicating genome [1] . In metazoans, histone proteins for packaging of newly synthesized DNA are encoded by the replicationdependent histone genes. They are distinct from the replication-independent histone genes which are expressed constitutively [2] . Unlike canonical pre-mRNAs, replication-dependent histone pre-mRNAs lack introns and undergo 3'-end processing that differs from cleavage coupled to polyadenylation. Histone pre-mRNAs contain two sequence elements essential for their 3'-end processing: a highly-conserved stem-loop structure and a purine-rich histone downstream element (HDE). Cleavage occurs between these two sequence elements and the polyadenylation step is omitted, giving rise to mature histone mRNAs that end with the stemloop followed by a 4-5 nucleotide tail. PLOS Residues 71-137 observed in the structure form a single α-helix. The length of the coiledcoil FLASH NTD dimer is approximately 100 Å, excluding the C-terminal hexahistidine tag observed for one of the protomers. The protomers do not superimpose perfectly onto each other (r.m.s.d.~1.5 Å), and one of them appears to adopt a straighter conformation (S1 Fig). For each protomer, the buried surface area is~1800 Å 2 (~25% of its total surface area, calculated using the program PISA [29] ). The majority of the FLASH dimer interface residues are leucines and isoleucines, forming the bulk of the hydrophobic interactions (Fig 2A) . The leucines and isoleucines are interspersed with other residues that form either polar or non-polar interactions. Other hydrophobic interactions are formed by bulky residues such as Tyr73, Tyr80, and Phe94, as well as Met87 (selenomethionine in this structure). Hydrophilic interactions include residues Gln100/Asn101 and Glu107/Asn108 near the mid-section of the structure (Fig 2B) , and an ion pair between Lys129 and Asp130 at the C-terminal end of the coiledcoil (Fig 2A) . We also observed that Cys83 is situated in the dimer interface with the thiol side chains from the two protomers positioned near one another (Fig 2C) . While the electron density did not provide conclusive evidence for the existence of a disulfide bond, and the two sulfur atoms are separated by 3.4 Å distance in the current model, the structure raises the possibility that the observed FLASH NTD dimer might be mediated by a disulfide connection, which is unlikely to occur in the reducing environment in the nucleoplasm, the site of 3'-end processing. To rule out the possibility that the observed FLASH NTD dimer is a crystallographic artifact caused by oxidized cysteine residues, we determined the structures of the FLASH NTD C54S/ C83A double mutant in two different crystal forms at 2.1 and 2.6 Å resolution, respectively (Table 1 ). In addition to Cys83, we also mutated the other Cys residue in the FLASH NTD, Cys54, in case it formed a disulfide as well even though the residue was disordered in the structure. The two structures adopt the same coiled-coil dimer (Fig 3A and 3B ) as the wild-type FLASH NTD (Fig 3C) , confirming that FLASH NTD dimer formation does not require the Cys83 disulfide. The individual protomers of these two mutant dimers also show differences, similar to those observed for the wild-type dimer (S1 Fig). Interestingly, residues 53-70 from one of the protomers in the structure at 2.6 Å resolution are stabilized by crystal packing, showing strong electron density corresponding to an α-helix ( Fig 3B) . This protomer appears to form a single long α-helix from residues 53 to 137. The The N-terminal domains of FLASH and Lsm11 form a 2:1 heterotrimer for histone pre-mRNA 3'-end processing other protomer showed weak electron density for residues 56-62, while residues 51-55 and 63-66 are not observed (Fig 3B) . These N-terminal residues are disordered in the other two structures ( Fig 3C) . Overall, our structural data suggest that FLASH NTD alone forms a stable coiled-coil dimer from residues 71-137 while residues 53-70 can form another helix. It appears that the helix for residues 53-70 does not dimerize, and it may also be structurally independent of Two views of the FLASH NTD forming a coiled-coil dimer with respective protomers colored in green and yellow orange. The hexahistidine-tag was observed in protomer 1. Side chains of residues involved in the dimer interface are shown as sticks, while the other side chains are shown as thin sticks. Colors of atoms: red, oxygen; blue, nitrogen; yellow, sulfur/selenium. (B) Hydrophilic interactions (black dashes) formed by Gln100/Asn101 and Glu107/Asn108 respectively. (C) 2F o -F c electron density for the Cys83 side chains, contoured at 1σ (blue). https://doi.org/10.1371/journal.pone.0186034.g002 The N-terminal domains of FLASH and Lsm11 form a 2:1 heterotrimer for histone pre-mRNA 3'-end processing residues 71-137, even though a single long helix (residues 53-137) is observed in one crystal form. The LDLY motif (residues 55-58) essential for binding the HCC is situated in the N-terminal helix (Fig 3B) . Whether the flexibility of this helix is a feature needed for binding the HCC will need to await further investigation. The N-terminal domains of FLASH and Lsm11 form a 2:1 heterotrimer for histone pre-mRNA 3'-end processing FLASH NTD mutations can affect Lsm11 NTD binding but not dimerization The region of FLASH NTD that interacts with Lsm11 has been previously mapped to residues 100-137 [3, 13, 28] . Because our structural data indicated that this region forms a dimer, we investigated the role of FLASH NTD dimerization in Lsm11 binding. Previous pull-down studies showed that substituting Leu118 and Ile119 with alanines abolished the ability of FLASH to bind Lsm11 [28] . According to our FLASH NTD structures, both Leu118 and Ile119 are situated at the dimer interface ( Fig 4A) . It was possible that these mutations disrupt FLASH NTD dimerization, thereby affecting Lsm11 binding. To test this possibility, we generated the FLASH NTD L118A/I119A mutant in the background of C54S/C83A mutations and investigated its oligomeric state by analytical gel filtration (Fig 4B) . Our results showed that this mutant had a similar migration behavior as the C54S/C83A mutant control, indicating that the deleterious effect of L118A/I119A mutation on binding Lsm11 is not due to the disruption of FLASH NTD dimerization. We next investigated the oligomeric states of the mutants Y73A/L76A/Y80A and N101A/ L104A/N108A, each containing substitutions of three consecutive residues in the dimer interface (Fig 2A) . The first mutated cluster is located closer to the N-terminal end, while the second cluster is located near the middle of the NTD. In addition, since Lys129 and Asp130 form an ion pair in the dimer interface, we also replaced these two charged residues as well as the preceding Arg128 with alanines, generating the R128A/K129A/D130A mutant. Our gel filtration results showed that the Y73A/L76A/Y80A mutant had essentially the same migration behavior as the C54S/C83A control, while the N101A/L104A/N108A and R128A/K129A/ D130A mutants actually migrated faster ( Fig 4B) . As a control, we made the K88A/K92A/ K95A mutant, changing three residues located outside of the dimer interface. As expected, the migration behavior of this mutant was nearly the same as that for the C54S/C83A protein ( Fig 4B) . Our analytical ultracentrifugation studies on the N101A/L104A/N108A mutant suggested that it might be trimeric in solution ( Table 2 , see below), suggesting that the mutation has perturbed the structure of the NTD. Therefore, the N101A/L104A/N108A and R128A/K129A/ D130A mutants will not be described further. While the mutations were not able to disrupt the FLASH NTD dimer, we tested whether they affect the interactions with Lsm11 NTD. For these experiments, we co-expressed Histagged Lsm11 NTD with un-tagged FLASH NTD in E. coli and monitored whether FLASH NTD could be co-purified by the nickel-NTA agarose beads. The results showed that the Y73A/L76A/Y80A and K88A/K92A/K95A mutants still interacted with Lsm11 NTD, while the L118A/I119A mutant could no longer interact with Lsm11 NTD (Fig 4C) , consistent with earlier data [28] . These experiments further demonstrated that the loss of binding between the FLASH NTD mutant and Lsm11 NTD is not linked to the dissociation of FLASH dimer. Given the ability of the FLASH NTD to dimerize, we next characterized the stoichiometry of the FLASH-Lsm11 complex. We co-expressed FLASH NTD C54S/C83A double mutant and His-tagged Lsm11 NTD (residues 23-130) in E. coli and purified their complex, demonstrating a stable interaction between the two proteins (S2 Fig) . Extensive efforts at producing diffraction quality crystals of the FLASH NTD-Lsm11 NTD complex have so far been unsuccessful. To obtain estimates for the molar masses of the FLASH NTD-Lsm11 NTD complex as well as the FLASH NTD C54S/C83A mutant alone, we performed size exclusion chromatography multi-angle light scattering (SEC-MALS) experiments using buffers with high (500 mM) and low (250 mM) NaCl concentrations ( Fig 5, S3 Fig) . At high-salt concentration, both the FLASH NTD-Lsm11 NTD complex and FLASH NTD alone eluted in single peaks. However, the FLASH NTD-Lsm11 NTD complex peak had a trailing edge, suggesting some dissociation of the complex during chromatography. The weight-averaged molar masses of the samples eluting in the peaks are 31 kDa for the FLASH NTD-Lsm11 NTD complex (with a Stokes radius of 3.9 nm) and 21 kDa for FLASH NTD C54S/C83A mutant alone (with a Stokes radius In the low-salt buffer, the positions of the peaks for both the FLASH NTD-Lsm11 NTD complex and FLASH NTD C54S/C83A mutant were slightly shifted to the left, suggesting a more extended structure for both. A small amount of higher order structures was present for the FLASH NTD-Lsm11 NTD complex suggesting the formation of aggregates in low-salt buffer condition. As in the high-salt buffer, the FLASH NTD-Lsm11 NTD complex peak had a trailing edge. The weight-averaged molar masses of the samples eluting in the peaks are 34.3 kDa for FLASH NTD-Lsm11 NTD complex (with a Stokes radius of 4.3 nm) and 21.8 kDa for FLASH NTD C54S/C83A (with a Stokes radius of 3.5 nm). Due to the presence of higher order structures, the molar mass of the FLASH NTD-Lsm11 NTD complex decreased gradually from 55 kDa and higher (leading edge of peak) to 26 kDa (trailing edge of peak). The FLASH NTD C54S/C83A molar mass decreased from 23.5 kDa (leading edge) to 19 kDa (trailing edge). Overall, the polydispersity of the FLASH NTD-Lsm11 NTD complex was slightly higher in low-salt buffer. Based on the calculated molecular weights for Lsm11 NTD and FLASH NTD, the results from SEC-MALS showed that the FLASH NTD-Lsm11 NTD complex is a heterotrimer consisting of 2 molecules (a dimer) of FLASH and 1 molecule of Lsm11, while FLASH NTD C54S/C83A is a dimer ( Table 2) . The N-terminal domains of FLASH and Lsm11 form a 2:1 heterotrimer for histone pre-mRNA 3'-end processing We also performed analytical ultracentrifugation (AUC) sedimentation velocity and sedimentation equilibrium experiments on the FLASH NTD-Lsm11 NTD complex, as well as on wildtype FLASH NTD, FLASH NTD C54S/C83A double mutant, and Lsm11 NTD alone as controls in the low-salt condition (Fig 6, S4 Fig). Our AUC results showed that wild-type FLASH NTD and the FLASH NTD C54S/C83A double mutant have similar sedimentation coefficients of 1.61 and 1.59 and frictional ratios of 1.81 and 1.86, consistent with them forming dimers in solution (Table 3 ). Both proteins have Stokes radius of 3.2-3.4 nm, while the long-axis radius is about 6.5 nm, based on the crystal structures. The K d values for the wild-type and C54S/ C83A NTD dimers are 0.18 and 0.05 μM, respectively (Table 3) . Wild-type FLASH NTD showed serious aggregation below the concentration of 0.1 mg/ml, where some of the proteins apparently dissociated into a monomeric but more elongated form (sedimentation coefficients of 0.89, frictional ratio of 3.09 and Stokes radius of 5.5 nm). Lsm11 NTD alone showed a sedimentation coefficient of 1.22 with a frictional ratio of 1.81, suggesting that it exists as a monomer in solution. The FLASH NTD-Lsm11 NTD complex showed a shift in its sedimentation coefficient value from 1.36 to 3.2 with increasing concentration of the complex (Table 3 , Fig 6D) . This observation indicates that the dissociation and association of the complex is a rapid event [30] and the exact molar mass of the complex cannot be accurately estimated. The broad range of observed molecular weight by AUC (17.8 to 49.7 kDa, Table 3 ) is similar to that seen in the SEC-MALS experiment. Nevertheless, we were able to obtain a K d value of 2.4 μM for the complex (Table 3) . Crosslinking studies. We used glutaraldehyde to crosslink the FLASH NTD-Lsm11 NTD complex, FLASH NTD C54S/C83A, and FLASH NTD and analyzed it on SDS-PAGE (S5 Fig). We observed the strong presence of dimers for both FLASH wild-type and C54S/C83A double mutant and weaker presence of higher oligomers (trimer, tetramer etc.). The higher oligomers became less apparent at lower concentration (0.01 mg/mL) of the protein, suggesting that they are probably due to random collisions of monomer/dimer in solution. Dimer and trimer species were also observed for the FLASH NTD-Lsm11 NTD complex but Lsm11 NTD did not appear to be substantially crosslinked to FLASH or to itself, possibly due to the fact that it has only one lysine residue. Therefore, we conclude that the dimer and trimer species for the complex were probably crosslinked FLASH, as observed for the FLASH alone samples, and the cross-linking experiments by themselves did not provide conclusive information about the stoichiometry of FLASH NTD-Lsm11 NTD complex. Human FLASH is a protein of 220 kDa that has been implicated in a broad spectrum of cellular processes. In spite of these diverse and important functions, the structural organization of FLASH remains largely unknown. Although FLASH consists of nearly 2,000 amino acid residues, the functions of only three small regions of the protein are understood: a 100 residue segment in the N-terminus required for histone pre-mRNA processing, the C-terminal segment which forms a SANT/Myb-like domain, interacts with the C-terminal region of NPAT, and is required for localization to the histone locus body [31] ; and a small central region which binds Ars2 [32] is essential for cell cycle progression. Our crystallographic studies of FLASH NTD demonstrate that residues 71-137 adopt a continuous and stable α-helical fold and mediate the formation of a coiled-coil dimer between two FLASH molecules. This α-helical fold might also extend to residues 53-70, encompassing the LDLY motif, but this region is unlikely to contribute to the dimerization interface. Our [33] , which showed that residues 75-136 underwent slow H/D exchange, indicative of extensive secondary structure in this region. Residues 58-62 exchanged significantly faster than the 75-136 region but slower than the directly surrounding sequences, suggesting the presence of a more dynamic secondary structure in the vicinity of the LDLY motif [33] . That amino acids in the N-terminal region of FLASH may fold into a coiled-coil domain was first predicted by bioinformatics [34] . In addition, biochemical studies demonstrated that ectopically-expressed FLASH can self-associate in tissue culture cells and that this self-association requires the N-terminal 200 residues [32] . These data, in conjunction with our current crystallographic study, strongly support the notion that the N-terminal domain of FLASH exists in solution as a coiled-coil dimer. We changed up to three consecutive residues in the dimer interface but failed to convert FLASH into monomers. The dimer interface of the FLASH NTD wild-type, C54S/C83A mutant, FLASH NTD-Lsm11 NTD complex and Lsm11 NTD. The distributions of the proteins at concentrations of 1 mg/ml (B-D) and 0.8 mg/ml (E) are shown by solid lines and those at concentrations of 0.1 mg/ml (B) and 0.05 mg/ml (D) are shown by dashed lines and that at 8 mg/ml (D) are showed by dotted line. The y-axis on the right is for the protein at a concentration of 0.1 mg/ml (B) and 8 mg/ml (D). The vertical dashed lines on the left and right indicate the monomer position of Lsm11 NTD and the dimer position of FLASH NTD, respectively. The residual bitmap of the raw data and the best-fit results are shown in the insets. The data are summarized in Table 2 . https://doi.org/10.1371/journal.pone.0186034.g006 a The values were derived from fitting the AUC data to a continuous c(s) distribution model using the SEDFIT program [57] . b The Stokes radii (R S ) were calculated by the Sednterp program, based on the sedimentation coefficient and molecular weight (M r ) from c(s) distribution analysis. c The values were derived from a global fit of the AUC data to a monomer-dimer self-association model by SEDPHAT [58] . d The values were derived from fitting the sedimentation velocity and sedimentation equilibrium data to a global two discrete species model using the SEDPHAT [58] . e The value was derived from a global fit of the AUC data to a A + B <-> AB hetero-association model by SEDPHAT [58] . https://doi.org/10.1371/journal.pone.0186034.t003 The N-terminal domains of FLASH and Lsm11 form a 2:1 heterotrimer for histone pre-mRNA 3'-end processing FLASH NTD is extensive and local structural disturbances, such as the three consecutive residues that we mutated, are insufficient to prevent dimerization. Interestingly, the L118A/I119A mutation in the interface of the coiled-coil dimer failed to disrupt FLASH dimerization but was sufficient to abolish the ability of FLASH to interact with Lsm11. The N-terminal α-helical region that mediates FLASH dimerization overlaps substantially with the core Lsm11 binding site in FLASH mapped to amino acids 100-140, prompting the hypothesis that Lsm11 may interact with a FLASH dimer. SEC-MALS experiments on the complex provided strong evidence that the FLASH NTD-Lsm11 NTD complex is a 2:1 heterotrimer. While our AUC data confirmed that FLASH is a dimer, the stoichiometry of FLASH and Lsm11 in the FLASH NTD-Lsm11 NTD complex was less clear, likely due to dissociation of Lsm11 NTD from the FLASH NTD dimer during prolonged ultracentrifugation. Some dissociation of the complex was observed during the short time scale of the SEC-MALS experiment. That Lsm11 interacts with a FLASH dimer is also consistent with the data from H/D exchange experiments. While the region between amino acids 100-120 showed the slowest H/ D exchange within the entire FLASH NTD (which we show here can form a dimer), this region underwent slower H/D exchange in the presence of Lsm11 and the reduced rate of exchange extended to amino acid 130 in FLASH [33] . Since H/D exchange occurs when hydrogen bonds are temporarily destabilized, this region of FLASH (residues 100-130) is in a more stable structure in the heterotrimer than in the homodimer. Additional studies are required to determine the structure of the FLASH-Lsm11 heterotrimer and identify potential mechanisms that may regulate the binding of a FLASH dimer to Lsm11 to form the FLASH-Lsm11 heterotrimer. In animal cells, components of the transcription and 3'-end processing machinery are localized in Histone Locus Bodies (HLBs), nuclear domains that assemble at histone gene loci and are present throughout the cell cycle. Strikingly, histone gene expression is repressed during G1 phase and becomes activated only with the onset of S phase and DNA replication in response to cell cycle signals, including cyclin E/ CDK2-mediated phosphorylation of NPAT, a universal coactivator of histone gene expression [35] [36] [37] [38] [39] . A growing body of evidence suggests that FLASH is targeted to HLBs as a separate entity rather than a subunit of the U7 snRNP. For example, mutations in either Lsm11 or FLASH that disrupt binding between their N-terminal domains do not affect localization of either FLASH or U7 snRNP to the Drosophila HLB, but abolishes processing in vivo [40, 41] . Our findings suggest that N-terminus of FLASH may be present as a homodimer throughout the cell cycle. In recent studies [33] , we have found that a second region of Lsm11 interacts with the C-terminal region of FLASH, the same region of FLASH that binds to NPAT [31] . This interaction strengthens the overall binding between FLASH and Lsm11 and could be part of an extensive reorganization of the factors in the HLB to activate histone gene expression as a result of phosphorylation of NPAT by cyclin E/CDK2. C-terminally hexahistidine-tagged FLASH NTD (residues 51-137) and FLASH NTD C54S/ C93A mutant constructs were cloned into pET26b vector and over-expressed in Escherichia coli BL21 Star (DE3) strains (Novagen). The cells were induced using 0.4 mM isopropyl β-D-1-thiogalactopyranoside and grown for 18 h at 20˚C. The cells were harvested by centrifugation and the pellets were re-suspended in lysis buffer (20 mM Tris (pH 7.5), 500 mM NaCl, 10 mM imidazole, 5% (v/v) glycerol, 17.8 μg/mL phenylmethane sulfonyl fluoride (PMSF) and 10 mM β-mercaptoethanol) and lysed by sonication. Cell lysates were then centrifuged at 25,000 x g for 40 min at 4˚C. The supernatant was incubated with nickel beads for 1 h before being loaded onto a gravity flow column (Bio-Rad). The nickel beads were washed with buffer containing 20 mM Tris (pH 7.5), 500 mM NaCl, 40 mM imidazole, and 10 mM β-mercaptoethanol. The proteins were eluted with 20 mM Tris (pH 7.5), 500 mM NaCl, 500 mM imidazole and 10 mM β-mercaptoethanol. The eluted proteins were further purified using size-exclusion chromatography (Sephacryl S-300; GE Healthcare) with a buffer containing 20 mM Tris (pH 8.5), 250 mM NaCl, and 5 mM dithiothreitol (DTT). Relevant fractions from size-exclusion chromatography were pooled and the proteins were concentrated to 9.4 mg/mL (wild-type) and 11 mg/mL (mutant), and stored at -80˚C. The selenomethionyl FLASH NTD protein was prepared using the Escherichia coli B834 methionine-auxotroph strain grown in the LeMaster media supplemented with selenomethionine [42] . The protein was purified using the same protocol as the native protein, concentrated to 10 mg/mL and stored at -80˚C. Selenomethionyl FLASH NTD and native FLASH NTD C54S/C83A mutant were crystallized in a sitting drop by vapor diffusion. The sitting drops were set up by mixing 1 μL of 4 mg/mL selenomethionyl FLASH NTD or 5 mg/mL FLASH NTD C54S/C83A mutant protein with 1 μL of well solution. The well solution for selenomethionyl FLASH NTD crystals contained 100 mM Tris (pH 8.0) and 18% (w/v) PEG 4000; for FLASH NTD C54S/C83A crystal form 1, 4% (w/v) tacsimate pH 7.0, 11% (w/v) PEG 3350; and for FLASH NTD C54S/C83A crystal form 2, 100 mM sodium formate, 15% (w/v) PEG 3350, 3% (v/v) 1,6-hexanediol. The crystals were harvested and soaked in mother liquor supplemented with 15% (v/v) (selenomethionyl FLASH NTD) or 20% (v/v) (FLASH NTD C54S/C83A) ethylene glycol as cryoprotectant before being flash frozen in liquid nitrogen. Initial X-ray diffraction data for selenomethionyl FLASH NTD were collected at APS beamline 24 ID-C with wavelength 0.9792 Å. Three datasets from three different crystals were processed using XDS and merged with XSCALE [43, 44] . The structure was solved by SAD using ShelxCDE [45] and the model built manually with the program Coot [46] . The final structure was then refined using a higher resolution dataset (2.6 Å) collected at ALS beamline 501 (wavelength: 0.9774 Å) and processed with XDS. Data for the structures of FLASH NTD C54S/ C83A were collected using single crystals at APS beamline 24-ID-E (0.9792 Å wavelength for both). The datasets were processed using HKL2000 [47] (FLASH C54S/C83A crystal form 1) and XDS (FLASH C54S/C83A crystal form 2). Both structures were solved by molecular replacement using Phaser [48] with the selenomethionyl FLASH NTD structure as the search model. All three structures were refined using Phenix [49] . N-terminally hexahistidine-tagged Lsm11 NTD (residues 23-130) construct was cloned into pET28a vector. FLASH NTD C54S/C83A mutant construct (without tag) was cloned into MCS2 of pCDF Duet vector. Both plasmids were co-transformed into E. coli BL21 (DE3) Star and the genes were co-expressed. The proteins were purified using the same protocol as for the FLASH NTD proteins with the exception of using 20 mM Tris (pH 7.5), 500 mM NaCl, and 5 mM DTT as the size exclusion chromatography buffer. Relevant fractions corresponding to Lsm11 NTD-FLASH NTD C54S/C83A complex and excess Lsm11 NTD alone were pooled separately and concentrated to 5.3 mg/mL and 2.4 mg/mL respectively. Mutant FLASH NTD constructs were generated using site-directed mutagenesis PCR. Primers (see S1 Table) designed to mutate designated residues were used in PCR reactions to amplify plasmid templates encoding for wild type or mutant FLASH NTD (see S1 Table for templates used). 25 cycles of thermal cycling (98˚C for melting, 55˚C for annealing, 72˚C for elongation) were performed using Phusion polymerase. PCR products were then digested with DpnI for 1 h at 37˚C before being transformed into E. coli DH5α. Mutant constructs were confirmed by DNA sequencing. The C-terminally hexahistidine-tagged FLASH NTD mutant constructs were cloned into pET26b vector and expressed in E. coli BL21 Star (DE3) cells. The mutant proteins were purified by nickel affinity column as detailed in the earlier section. The eluted protein from nickel affinity purification was then injected into Superose 12 analytical gel filtration column, preequilibrated with 20 mM Tris (pH 7.5), 250 mM NaCl, and 5 mM DTT. Analytical gel filtration was performed with a flow rate of 0.5 mL/min using 20 mM Tris (pH 7.5), 250 mM NaCl and 5 mM DTT as buffer. Co-purification of FLASH NTD mutants with His-tagged Lsm11 NTD N-terminally hexahistidine-tagged Lsm11 NTD (residues 23-130) construct was cloned into pET28a vector. FLASH NTD mutant constructs (without tag) were cloned into MCS2 of pCDFDuet vector. Both plasmids were co-transformed into E. coli BL21 Star (DE3) and the genes were co-expressed in 5 mL LB media. The expressed proteins were co-purified with 15 μL of Ni-NTA agarose beads using the same buffers that were used for large-scale purification of FLASH NTD-Lsm11 NTD complex, and analyzed using SDS-PAGE. The AUC experiments were performed on a XL-A analytical ultracentrifuge (Beckman Coulter) using an An-50 Ti rotor [50] [51] [52] [53] [54] [55] [56] . The sedimentation velocity experiments were performed using a double-sector epon charcoal-filled centerpiece at 20˚C with a rotor speed of 42,000 rpm. Protein solutions of 0.05 to 1 mg/ml (330 μl) in a buffer containing 20 mM Tris (pH 8.5), 250 mM NaCl, and 5 mM DTT (low-salt condition) and reference (370 μl) solutions were loaded into the centerpiece, respectively. The absorbance at 280 nm was monitored in a continuous model with a time interval of 300 s and a step size of 0.003 cm. Multiple scans at different time intervals were then fitted to a continuous c(s) distribution model using the SEDFIT program [57] . Additionally, the results with the various different protein concentrations were globally fitted to monomer-dimer self-association or A + B <-> AB hetero-association model using the SEDPHAT program to calculate the dissociation constant (K d ) [58] . To determine the precise molecular weight of the protein, the sedimentation equilibrium experiment was performed [59] . Three different samples (0.10-0.12 ml) were loaded into the sample channels of six-channel epon charcoal-filled centerpieces, and 0.11-0.13 ml buffers were loaded into the reference channels. The cells were then loaded into the rotor and run at speed of 10,000, 15,000, and 25,000 rpm each for 12 h at 20˚C. Ten A 280 nm scans with time interval of 8-10 min were measured for every different rotor speed to check the status of sedimentation equilibrium. Global analyses of combined sedimentation equilibrium and sedimentation velocity data were conducted with SEDPHAT using species analysis model [55] . FLASH NTD C54S/C83A-Lsm11 NTD complex and FLASH NTD C54S/C83A were loaded sequentially onto a Superdex 200 size exclusion column (24 mL) pre-equilibrated with 20 mM Tris pH 7.5, 500 mM NaCl, 5 mM DTT (high salt buffer) or 20 mM Tris pH 7.5, 250 mM NaCl, and 5 mM DTT (low salt buffer). The eluted samples first passed through a Wyatt multiangle light scattering system (DAWN HELEOS-II) and then a Wyatt Trex refractometer. The data were analyzed using ASTRA version 6 software (Wyatt Technology, Santa Barbara, CA). The monomer peak of 3 mg/ml BSA was used for normalization, delay time determination, and band broadening correction using ASTRA. Cross-linking reactions were carried out in 20 mM HEPES (pH 7.5), 500 mM NaCl. A final concentration of 0.1% (w/v) of glutaraldehyde was added to 0.1 mg/mL (total volumẽ 100 μL) and 0.01 mg/mL (total volume~1 mL) of FLASH NTD C54S/C83A-Lsm11 NTD complex, FLASH NTD C54S/C83A, and FLASH NTD wildtype. Controls with protein concentrations of 0.1 mg/mL without glutaraldehyde were set up for each sample type. All samples were incubated at 37˚C for 3 min then chilled on ice. A final concentration of about 100 mM of Tris pH 8.0 was added into each sample to quench the cross-linking reaction. All samples were concentrated to a volume~20 μL using Sartorius Vivaspin 1 500 centrifugal concentrators with a molecular weight cut off of 10 kDa and finally analyzed by SDS-PAGE. The speed of centrifugation for sedimentation equilibrium experiment (A) was 10,000 rpm (squares), 15,000 rpm (circles), and 25,000 rpm (triangles) at 20˚C each for 14 h. The velocity experiment (B) was 42,000 rpm (circles) at 20˚C for 6 h. The solid lines in two panels are the best fit results from global analysis of the two discrete species models by SEDPHAT (57) . The residuals of each fit are shown below the panels. The calculated sedimentation coefficients and M r from the best fit results are shown in Table 2 In the FLASH NTD-Lsm11 NTD complex, FLASH NTD is the lower band (as it lacks a His tag compared to FLASH NTD alone), and Lsm11 NTD is the upper band. While the FLASH NTD band disappeared in the presence of glutaraldehyde, the Lsm11 NTD band mostly stayed the same. Therefore, probably very small amount of Lsm11 NTD (if any) got crosslinked in the reaction, consistent with the fact that it has only 1 Lys residue. (TIF) S1 Metabolism and regulation of canonical histone mRNAs: life without a poly(A) tail Metazoan replication-dependent histone mRNAs: a distinct set of RNA polymerase II transcripts FLASH, a proapoptotic protein involved in activation of caspase-8, is essential for 3' end processing of histone pre-mRNAs Emergence of the beta-CASP ribonucleases: highly conserved and ubiquitous metallo-enzymes involved in messenger RNA maturation and degradation U7 snRNP-specific Lsm11 protein: dual binding contacts with the 100 kDa zinc finger processing factor (ZFP100) and a ZFP100-independent function in histone RNA 3' end processing A novel zinc finger protein is associated with U7 snRNP an interacts with the stem-loop binding protein in the histone pre-mRNP to stimulate 3 '-end processing The stem-loop binding protein forms a highly stable and specific complex with the 3' stem-loop of histone mRNAs The protein that binds the 3' end of histone mRNA: a novel RNA-binding protein required for histone pre-mRNA processing A 3' exonuclease that specifically interacts with the 3' end of histone mRNA Structure of histone mRNA stem-loop, human stem-loop binding protein, and 3'hExo ternary complex Three Proteins of the U7-Specific Sm Ring Function as the Molecular Ruler To Determine the Site of 3 '-End Processing in Mammalian Histone Pre-mRNA Eri1 degrades the stem-loop of oligouridylated histone mRNAs to induce replication-dependent decay Unique Sm core structure of U7 snRNPs: assembly by a specialized SMN complex and the role of a new component, Lsm11, in histone RNA processing Purified U7 snRNPs lack the Sm proteins D1 and D2 but contain Lsm10, a new 14 kDa Sm D1-like protein Identification of the human U7 snRNP as one of several factors involved in the 3' end maturation of histone premessenger RNA's Genetic complementation in the Xenopus oocyte: co-expression of sea urchin histone and U7 RNAs restores 3' processing of H3 pre-mRNA in the oocyte U7 snRNP is recruited to histone pre-mRNA in a FLASH-dependent manner by two separate regions of the Stem-Loop Binding Protein The CED-4-homologous protein FLASH is involved in Fas-mediated activation of caspase-8 during apoptosis FLASH acts as a co-activator of the transcription factor c-Myb and localizes to active RNA polymerase II foci FLASH meets nuclear bodies: CD95 receptor signals via a nuclear pathway A Conserved Interaction That Is Essential for the Biogenesis of Histone Locus Bodies Interaction of FLASH with Arsenite Resistance Protein 2 Is Involved in Cell Cycle Progression at S Phase Mapping the Interaction Network of Key Proteins Involved in Histone mRNA Generation: A Hydrogen/Deuterium Exchange Study Searching for FLASH domains FLASH and NPAT positive but not Coilin positive Cajal Bodies correlate with cell ploidy Staged assembly of histone gene expression machinery at subnuclear foci in the abbreviated cell cycle of human embryonic stem cells Cell cycle-regulated phosphorylation of p220(NPAT) by cyclin E/Cdk2 in Cajal bodies promotes histone gene transcription The cyclin E/Cdk2 substrate p220(NPAT) is required for S-phase entry, histone gene expression, and Cajal body maintenance in human somatic cells NPAT links cyclin E-Cdk2 to the regulation of replication-dependent histone gene transcription Interaction between FLASH and Lsm11 is essential for histone pre-mRNA processing in vivo in Drosophila Concentrating pre-mRNA processing factors in the histone locus body facilitates efficient histone mRNA biogenesis Selenomethionyl Proteins Produced for Analysis by Multiwavelength Anomalous Diffraction (Mad)-a Vehicle for Direct Determination of 3-Dimensional Structure Integration, scaling, space-group assignment and post-refinement Experimental phasing with SHELXC/D/E: combining chain tracing with density modification Features and development of Coot Processing of X-ray diffraction data collected in oscillation mode Phaser crystallographic software PHENIX: a comprehensive Python-based system for macromolecular structure solution Structural and functional characterization of MERS coronavirus papain-like protease Structural basis for catalysis and ubiquitin recognition by the severe acute respiratory syndrome coronavirus papain-like protease Mechanism for controlling the monomerdimer conversion of SARS coronavirus main protease Mutation of Glu-166 blocks the substrate-induced dimerization of SARS coronavirus main protease Structural and functional characterization of human apolipoprotein E 72-166 peptides in both aqueous and lipid environments Differential domain structure stability of the severe acute respiratory syndrome coronavirus papain-like protease Critical Assessment of the Important Residues Involved in the Dimerization and Catalysis of MERS Coronavirus Main Protease Size-distribution analysis of macromolecules by sedimentation velocity ultracentrifugation and lamm equation modeling On the analysis of protein self-association by sedimentation velocity analytical ultracentrifugation Structural and functional variations in human apolipoprotein E3 and E4 A new bioinformatics analysis tools framework at EMBL-EBI Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega Deciphering key features in protein structures with the new ENDscript server Protein secondary structure prediction based on position-specific scoring matrices We thank Aleksandra Skrajna for helpful discussions and comments on the manuscript; S. Banerjee, K. Perry, R. Rajashankar, J. Schuermann, N. Sukumar for access to NE-CAT 24-ID-C and 24-ID-E beamlines at the Advanced Photon Source; and M. Allaire, N. Smith, and S. Ortega for access to Beamline 5.0.1 at the Advanced Light Source; and J. Seetharaman for assistance in X-ray data collection. This research is supported by NIH grants R35GM118093 and S10OD012018 (to LT) and GM29832 (to WFM and ZD) and Taiwan MOST grant 105-2320-B-010-012 and 106-2320-B-010-013 (to CYC). This work is based upon research conducted at the Northeastern Collaborative Access Team beamlines (NIH P41 GM103403), using the Pilatus 6M detector (NIH-ORIP HEI grant S10 RR029205). This research used resources of the