Structural analysis of the core COMPASS family of histone H3K4 methylases from yeast to human
Abstract
Histone H3 lysine 4 (H3K4) methylation is catalyzed by the highly evolutionarily conserved multiprotein complex known as Set1/COMPASS or MLL/COMPASS-like complexes from yeast to human, respectively. Here we have reconstituted fully functional yeast Set1/COMPASS and human MLL/COMPASS-like complex in vitro and have identified the minimum subunit composition required for histone H3K4 methylation. These subunits include the methyltransferase C-terminal SET domain of Set1/MLL, Cps60/Ash2L, Cps50/RbBP5, Cps30/WDR5, and Cps25/Dpy30, which are all common components of the COMPASS family from yeast to human. Three-dimensional (3D) cryo-EM reconstructions of the core yeast complex, combined with immunolabeling and two-dimensional (2D) EM analysis of the individual subcomplexes reveal a Y-shaped architecture with Cps50 and Cps30 localizing on the top two adjacent lobes and Cps60-Cps25 forming the base at the bottom. EM analysis of the human complex reveals a striking similarity to its yeast counterpart, suggesting a common subunit organization. The SET domain of Set1 is located at the juncture of Cps50, Cps30, and the Cps60-Cps25 module, lining the walls of a central channel that may act as the platform for catalysis and regulative processing of various degrees of H3K4 methylation. This structural arrangement suggested that COMPASS family members function as exo-methylases, which we have confirmed by in vitro and in vivo studies.
Keywords: MLL1, single particle analysis, negative stain EM
Methylation of histone lysines, including H3K4, H3K9, H3K27, H3K36, H3K79, and H4K20, plays a crucial role in the regulation of key biological processes, such as cell cycle progression, transcription, and DNA repair (1–3). Except for H3K79 methylation, which is catalyzed by the Dot1 family proteins, all other histone lysine methylations are carried out by the SET [Su (var), Enhancer of Zeste, and Trithorax] domain-containing enzymes. While the majority of the SET domains can act as histone methyltransferases in an isolated form, the MLL/Set1 family methyltransferases and Polycomb group proteins, which respectively catalyze histone H3K4 and H3K27 methylation, must assemble within their respective complexes for maximal catalytic and biological activities (2).
The founding member of the MLL/Set1 family protein, Set1, forms a multiprotein complex named COMPASS (COMplex of Proteins ASsociated with Set1) in Saccharomyces cerevisiae (4). Set1/COMPASS was the first identified histone H3K4 methylase capable of mono-, di-, and trimethylating H3K4, and several of the COMPASS subunits are required for proper methylation (4–8). In addition to the evolutionarily conserved SET domain located at the C terminus of Set1, most associating subunits are also conserved from yeast to human, forming Set1/COMPASS and MLL/COMPASS-like complexes (9, 10).
Histone H3K4 methylation on chromatin, in particular di- and trimethylation, correlates with actively transcribed genes (9, 10). Previous work by us and others has shown that H3K4 methylation is subject to various layers of regulation mechanisms that are highly conserved from yeast to human (9). Different subunits of yeast and human COMPASS have also been shown to regulate H3K4 di- and/or trimethylation as well as Set1 stability. These subunits include Cps60 (Ash2L in human), Cps50 (RbBP5), Cps40 (Cfp1), Cps30 (WDR5), and Cps25 (Dpy30).
The recently determined crystal structure of the MLL1/SET domain in complex with histone H3 peptide substrate (11) revealed that the active site was relatively solvent exposed due to an outward shifted inserted SET motif (iSET). The isolated MLL1/SET domain, free of any associated components of the natural MLL1 complex, exhibits a weak in vitro histone methyltransferase activity (11). However, the addition of residual components of the MLL1 complex, namely WDR5, RbBP5, Ash2L, and Dpy30, greatly stimulated catalytic activity. This observation led to a model, in which the association of the MLL1 complex subunits induces reorientation of iSET to the optimal conformation of its catalytic active site (11). Given the highly conserved C-terminal catalytic SET domain between yeast Set1 and human MLL1, and the common subunit composition of the COMPASS family, the above model might be applicable to yeast COMPASS as well. However, until now there has been no structural information regarding the architectural arrangement of any COMPASS or COMPASS-like complex.
Here we present the identification, reconstitution, as well as the structural and biochemical characterization of a minimal core COMPASS for both yeast and human complexes capable of high levels of H3K4 methylase activity towards histone H3.
Results
Reconstitution of Core COMPASS.
To obtain stable COMPASS for biochemical characterization, we reconstituted the yeast and human complex through baculovirus-mediated cotransfection and overexpression of different subunit combinations in insect cells. Based on affinity purification of the recombinant Set1 and Cps proteins, we obtained nearly pure preparations of reconstituted COMPASS complexes (SI Appendix, Figs. S1–S4) that are enzymatically active in in vitro H3K4 methylation assays with full-length histone H3 (Fig. 1A). This approach enabled us to prepare any combination of wild-type (WT) or modified COMPASS components, including truncated forms of Set1, for studies that have been previously hampered by the absence of subunits such as Cps50 or Cps30 (8). Accordingly, COMPASS containing full-length Set1, truncated Set1 (762–1,080), Set1 (780–1,080), Set1 (938–1,080), or without Set1 were successfully prepared (SI Appendix, Figs. S1–S2). Curiously, full-length Set1 purified from yeast COMPASS consistently runs on SDS/PAGE as a doublet, although the reasons behind this property of Set1 or its biological significance remain unclear.
Fig. 1.
Recombinant Set1/COMPASS complexes and in vitro histone methyltransferase activities. (A) Purifications of recombinant COMPASS consisting of full-length Set1 with various combinations of Cps60, Cps50, Cps40, Cps35, Cps30, and Cps25. Subunit composition of purified COMPASS was confirmed by Western blotting using anti-FLAG (top box) and anti-Set1 (middle box) antibodies. In vitro H3K4 methyltransferase activities toward free histone H3 of various recombinant COMPASS complexes were examined by Western blotting using anti-H3K4me1, me2, and me3 antibodies (bottom box). (B) Recombinant COMPASS complexes were prepared using Set1 (938–1,080) instead of full-length Set1. COMPASS composition and in vitro H3K4 methyltransferase activities were analyzed in the same way as (A).
As a first step, we investigated the effects of specific Cps subunits on in vitro H3K4 methylation by preparing full-length Set1 (1–1,080) with various Cps combinations (Fig. 1A). Each introduced component was purified through FLAG affinity purification as shown by anti-FLAG or anti-Set1 Western blots (Fig. 1A, top and middle boxes, respectively). Consistent with our previous reports of in vivo and in vitro H3K4 methylation using cps60Δ and cps25Δ strains (8), the absence of Cps60 and Cps25 significantly impaired the catalytic activity of Set1 with the total loss of H3K4 trimethylation, and more than an 80% reduction of H3K4 mono- and dimethylation (Fig. 1A, lane 7). In contrast to earlier in vivo studies (8, 12–14), almost no monomethylation or very marginal di- and trimethylation changes of H3K4 were observed in the absence of Cps40 and Cps35 under conditions where Set1 levels are not a limiting reagent in our in vitro experiments (Fig. 1A). This study suggests that full-length Set1 stability in vivo is reduced in either cps40Δ or cps35Δ strains, compared to the strain expressing all five subunits. To this end, we found that Cps40 coeluted with other COMPASS components in the presence of the N-terminally extended region of Set1, including full-length Set1, Set1 (762–1,080), Set1 (780–1,080), but not Set1 (938–1,080) (SI Appendix, Fig. S2). Cps35 was also shown to require a region beyond 762–1,080 of Set1, because it did not coelute in the macromolecular fractions with any other truncated Set1 variants (SI Appendix, Fig. S2).
Next, we evaluated the intrinsic enzymatic activity of soluble Set1 protein in isolation. We did not detect any H3K4 methylation by Set1 alone or in the presence of putative Set1-stabilizing components Cps50 and Cps30 (Fig. 1A, lanes 2 and 3), thereby further confirming that Cps60 and Cps25 are required for H3K4 methylation activity by Set1. In the reconstituted system, Set1 (938–1,080) was sufficient in replacing the full-length protein in terms of stability and enzymatic activity (Fig. 1 A and B), except in the absence of Cps60 and Cps25 (Fig. 1 A, lane 7 and B, lane 7). Whereas full-length Set1-containing COMPASS supported low levels of residual in vitro H3K4 mono- and dimethylation in the absence of Cps60 and Cps25, this activity was completely abolished in Set1 (938–1,080)-containing COMPASS.
Recent studies with H3 peptide substrates have suggested that WDR5 RbBP5, Ash2L, and Dpy30 (also known as the WRAD complex), the human homologs of Cps30, Cps50, Cps60, and Cps25, respectively, display histone H3K4 methyltransferase activity independent of the Set1 domain of MLL1 (15–17). Despite the highly conserved components between the yeast COMPASS and the human MLL/COMPASS-like complexes, we could not detect any H3K4 methylation signals in a reconstituted Cps30, Cps60, Cps50, and Cps25 complex in the absence of Set1 (Fig. 1 A and B; see lane 10 in both figures). This observation is consistent with our previous in vivo studies where we were unable to detect any histone H3K4 methylation in strains with SET1 deletions (8), suggesting that Set1 is the only histone H3K4 methylase in yeast, and that the yeast WRAD complex does not demonstrate substantial HMTase activity associated with it in vivo or in vitro. Indeed, our enzymatic analyses (SI Appendix, Fig. S5) demonstrated the presence of less than 1% of H3K4me1 and virtually undetectable levels of H3K4me2-3 when using WRAD as compared to wild-type Set1/COMPASS or MLL/COMPASS-like complexes. Given the extremely weak catalytic rates reported for the reconstituted WRAD complex as a SET-domain independent histone H3K4 methylase (15–17), the biological relevance of such activity needs to be further demonstrated.
Insect cell-expressed core COMPASS also enabled us to investigate the regulation of histone H3K4 methylation by Cps50 and Cps30. Systematic analysis for in vitro H3K4 methyltransferase activity (SI Appendix, Fig. S3) shows that core COMPASS is rather insensitive to the lack of Cps30 with comparable H3K4 mono- and dimethylation, but partially defective trimethylation activity (SI Appendix, Fig. S3D, lanes 2 and 10). This finding suggests that COMPASS is enzymatically active in the absence of Cps30, but the processivity of the SET domain from H3K4 di- to trimethylation is regulated by Cps30. In contrast, reconstituted core COMPASS lacking either Set1 (938–1,080), Cps60, Cps50, or Cps25 completely lacked enzymatic activities (SI Appendix, Fig. S3D, lanes 1, 6, 11, and 12; note that the low levels of Set1 in lane 6 may also contribute to the absence of activity observed in that condition). Thus, we have identified Set1 (938–1,080), Cps60, Cps50, Cps30, and Cps25 as the necessary core COMPASS components for high levels of in vitro H3K4 mono-, di- and trimethylation using histone H3 as a substrate. Both Cps60 and Cps25 are essential for in vitro enzymatic activity regardless of the combination of other core COMPASS subunits (Fig. 1A lanes 3, 4, and 7, SI Appendix, Fig. S3D, lanes 3–8, 11–14). It is noted that the Cps60-Cps25 subcomplex (6) coeluted with Set1 (938–1,080) upon size exclusion chromatography when reconstituted together (fractions 21 to 23; SI Appendix, Fig. S4B). This result demonstrates a direct interaction between the SET domain and the Cps60-Cps25 module. In contrast, in the absence of Cps60 and Cps25, soluble Set1 (938–1,080) elutes broadly on the same size exclusion chromatography system (fractions 13 to 29; SI Appendix, Fig. S4A). Given that coexpression of neither Cps50 nor Cps30 had any effects on Set1 (938–1,080) elution profile (SI Appendix, Fig. S4C), Cps60 and Cps25 may play an additional role in the correct assembly of COMPASS through interactions with Set1.
Electron Microscopic Mapping of Core COMPASS.
To gain insight into the architecture of yeast core COMPASS, we employed electron microscopy to visualize preparations consisting of purified Set1 (938–1,080), Cps60, Cps50, Cps30, and Cps25 (Fig. 2A, fraction 18; SI Appendix, Fig. S1B). Raw images of negative stained specimen revealed a monodisperse population of complexes with similar sizes (SI Appendix, Fig. 6A). To analyze these particles, 35,527 projections were interactively selected and grouped into 100 classes by reference-free alignment and classification (SI Appendix, Fig. S7A). The two-dimensional (2D) class averages revealed flexible conformers of two major populations of particle projections: one displays a Y-shaped particle with a triangular base connecting through an extension to two adjacent oval lobes, one of which is in closer proximity to the base (Fig. 2B, right; SI Appendix, Fig. S7A); the second population reveals a similar triangular base and an arm extension connecting to a single circular doughnut-like domain with a distinct stain accumulation region in the center (Fig. 2B, left). The doughnut-shaped domain appears identical to the projection profile of a WD40 domain (inset in Fig. 2B), which defines both Cps50 and Cps30, further suggesting that a subpopulation of the reconstituted COMPASS complexes might be missing one of the two WD40 domain-containing subunits.
Fig. 2.
EM mapping of core COMPASS. (A) Purification of Core Set1/COMPASS by size exclusion chromatography and analysis of the active fractions by HMTase assay. Core Set1/COMPASS consisting of Cps25, Cps60, Cps50, Set1 (938–1,080) was reconstituted and fractionated by size exclusion chromatography over a Superose-6 PC 3.2/30 (GE Healthcare). The H3K4 methylase activities of the resulting fractions were tested as shown. (B) Representative 2D averages of negative stained core COMPASS (fraction 18 in A) including Cps25, Cps60, Cps50, Set1 (938–1,080) which is denoted as SET938 for simplicity in Figs. 3 and 4, and Cps30. The inset shows the 2D projection profile of a WD40 domain for comparison (Scale bar, 10 nm). (C) Representative 2D averages of the Cps25-Cps60 assembly. (D) Representative 2D averages of negative stained Cps25-Cps60-Cps50-SET938 complex. (E) 2D average of Cps25-Cps60-Cps50-SET938 complex labeled with a Fab fragment against SET938. (F) Representative 2D class averages of negative stained human core MLL1/COMPASS-like complex. Schematic representations: Cps25/DPY30-Cps60/Ash2L (gray), SET938/MLL1 (blue), Cps50/RbBP5 (orange), and Cps30/WDR5 (green).
In order to identify each domain within the core COMPASS, we visualized individual subcomplexes by negative stain EM. We first examined the structure of the Cps60-Cps25 subcomplex, where classification and averaging (Fig. 2C; SI Appendix, Fig. S8 A and C) revealed a triangular structure with similar dimensions and shape to the triangular base found in the images of core COMPASS (Fig. 2B). This result suggests that Cps60 and Cps25 form the base of core COMPASS. We next analyzed EM images of reconstituted core COMPASS lacking the Cps30 subunit (core COMPASS/-Cps30) (SI Appendix, Fig. S8 B and D). 2D averages from this analysis show the well defined triangular base, with an arm extension and an additional oval lobe near the top of the complex (Fig. 2D). Missing, however, is the second, more distant oval lobe that is present in core COMPASS. This observation indicates that the Cps30 subunit is the more terminal of the two lobes away from the triangular base, while the lobe closer to the middle of the complex belongs to the WD40 domain of Cps50. In addition, a few averages reveal a small density extending from the connection between the Cps60-Cps25 base and the Cps50 domain, which might be attributed to the SET domain (Fig. 2D, right box). To test this hypothesis, we incubated core COMPASS/-Cps30 with a Fab fragment conjugated to SET and analyzed the complexes by single-particle EM. 2D projection averages of Fab-labeled complexes clearly reveal the extra Fab density at the level of the region connecting the Cps60-Cps25 base and Cps50. (Fig. 2E). Next, we examined the architecture of the reconstituted human core COMPASS consisting of MLL SET domain, RbBp5, Ash2L, WDR5, and Dpy30. Negative stain 2D averages from this preparation revealed an architecture with striking resemblance to the yeast complex (Fig. 2F). Thus, both the structure and function and COMPASS-like complexes appear to be highly conserved from yeast to human.
3D Architecture of Core COMPASS.
To further characterize the architecture of the core COMPASS, we sought to examine the three-dimensional (3D) structure of the yeast assembly. In the first step, we calculated several 3D models from negative stained particles belonging to the individual groups produced by classification (SI Appendix, Fig. S9). For this approach, we used the corresponding 60° tilted projections and the random conical tilt method to calculate initial reconstructions, which were further refined after the inclusion of 0° particle views. The 3D reconstructions of particles belonging to classes with only one globular domain connected to the Cps60-Cps25 base reveal that these particles have indeed only one WD40-like domain. In contrast, 3D reconstructions from Y-shaped particles reveal the Cps60-Cps25 module having two globular domains connected to it at different distances (SI Appendix, Fig. S9). In these 3D reconstructions, we notice variability in the disposition of the Cps60-Cps25 base as well as in the proximity between the two WD40-like lobes of Cps50 and Cps30. We reasoned that the various conformations might be the result of inherent complex flexibility, resulting in deformations due to the negative stain preparation on the carbon support of the grids. Therefore, we focused our efforts on cryo-EM of the complex in holey carbon grids, and successfully obtained particle images from specimen suspended in thin vitreous ice (SI Appendix, Fig. S6B). Because our negative stain analysis of core COMPASS indicated the presence of two major populations (+/- one WD40 domain), we subjected 21,583 cryo-EM images to multiple reference-supervised alignment (18, 19) using two initial 3D references (SI Appendix, Fig. S10A): a 3D volume representing only the Cps60-Cps25 base and one WD40-like domain (SI Appendix, Fig. S10A, left), and a 3D volume that included the base and both WD40-like domains (SI Appendix, Fig. S10A, right). This approach allowed us to effectively separate the two major populations within the cryo-EM dataset, resulting in stable particle assignments after several cycles of multireference alignment. We then employed single reference alignment and reconstruction to produce a final cryo-EM map of the core COMPASS at a resolution of 24 Å (FSC = 0.5) (SI Appendix, Fig. S10B). The cryo-EM 3D map displayed features that are clearly distinct from those of the starting reference 3D map from negative stain EM. To validate our reconstruction, we also employed different initial references for cryo-EM projection alignment, thereby, producing very similar final maps (SI Appendix, Fig. S11). Furthermore, 2D classification of the cryo-EM projections (SI Appendix, Fig. S12A) revealed averages that are in very good agreement with reprojections of the 3D map (SI Appendix, Fig. S12B).
Molecular Modeling of Core COMPASS.
To obtain deeper insights to the architecture of the core COMPASS, we generated a model by docking the available or homologous crystal structures into their corresponding positions within the cryo-EM map and according to our assignment from the 2D projection analysis (Fig. 3). Due to the limits in resolution, we performed rigid body manual docking based on visual inspection of the fit. The two neighboring globular domains of the 3D map represent the two WD40 domains, with the lobe more distal to the Cps60-Cps25 base belonging to Cps30 (Figs. 3 and 4B, green) and the other to Cps50 (orange). To model Cps50, which contains a β-propeller fold and a C-terminal tail of ∼80 amino acids, we docked a homologous WD40 domain (orange; PDBID:2XL2) in its corresponding position. A recent crystal structure of WDR5, a Cps30 homolog, has been obtained with peptides of both the C-terminal tail of RbBP5 (Cps50 homolog) and the Win motif that is N-terminally adjacent to the SET domain of MLL1 (Set1 homolog) bound onto the opposite faces of WDR5 (20, 21). We thus docked the cocrystal structure (PDBID:3P4F) of the MLL1 Win peptide-WDR5-RbBP5 tail peptide in the position of Cps30 with an orientation that places the RbBP5 tail peptide close to the Cps50 position on the cryo-EM map. Our Fab labeling experiments position the SET domain in the region connecting the two WD40 domains and the Cps60-Cps25 base. Accordingly, the Set1/SET domain from the homologous crystal structure (11, 22) was docked in this position. Although yeast Set1 does not have the canonical Win motif upstream of the SET domain, we identified a similar Ala-Arg-Ser motif at positions 943–945 of the Set1 primary sequence, and found that core COMPASS harboring either Set1 (R944A) or Set1 (S945A) substitutions almost completely loses its H3K4 methyltransferase activity (SI Appendix, Fig. S13). This observation suggests that the Ala-Arg-Ser motif is important in complex formation like the Win motif in the MLL1 core complex. Based on this result, we oriented the N terminus of Set1 close to Cps30 and opposite to Cps50 (SI Appendix, Fig. S14). According to these interactions, and given the spatial constraints of this region, we could only dock the Set1 domain in a single orientation. In this model, one side of Set1 (938–1,080) appears to extensively interact with Cps30, while another side is forming a bridge with density stemming from the Cps50 WD40 domain (Fig. 4A). Thus, our modeling places the active site of Set1 in the middle of a central channel that runs through the complex, starting from the connection between Cps60-25 and Cps50, and running adjacent to Set1, exiting behind the interface of Cps30 with Cps50 (Fig. 4 A and B). In this configuration, the peptide bound to the active site of the SET domain, as shown in the crystal structure (PDBID:2W5Z), would reside directly in the middle of the COMPASS channel (shown in red in Fig. 4A). Thus, this configuration may limit the substrates that COMPASS can recognize and methylate.
Fig. 3.
3D cryo-EM reconstruction and modeling of core COMPASS. Different views of the cryo-EM 3D map for the Cps25-Cps60-Cps50-SET938-Cps30 COMPASS complex. Each view shows the solid rendered map accompanied by a transparent map with modeled crystal structures of the WD40 domain of Cps50 (orange), Cps30 (green), and MLL1/SET938 (cyan). The red arrow indicates the expected position of the Cps50 arm that bridges with the Cps25-Cps60 module.
Fig. 4.
COMPASS family members are exomethylases. (A) Zoom-in view of the central channel formed in the complex, with the histone peptide of the MLL1 (as cocrystallized) shown in the red space-filling model. (B) Schematic model of core COMPASS family architecture. The red star indicates the histone peptide-binding region of SET938. (C) The N-terminally extended version histone H3 was constructed by inserting FLAG sequence (DYKDDDDK) between the Met start codon and the second Ala codon. COMPASS shows no H3K4 methylase activity with the FLAG-extended H3 tails in vivo (this figure) and in vitro (SI Appendix, Fig. S15).
COMPASS Family Members Are Exo-Methylases.
Our structural analysis and modeling suggest that the centrally located active site within the COMPASS channel may only be reached by flexible peptide terminals, indicating that COMPASS family may function primarily as exo- and not endo-methylases. To test this hypothesis, we engineered a Flag sequence to the N terminus of the only copy of histone H3 in our yeast strain and tested H3K4 methylation by Set1/COMPASS. As shown in Fig. 4C (lanes 1, 3), WT H3 can be methylated on K4 by Set1/COMPASS in vivo. However, H3 bearing a single Flag sequence on its N terminus (making the H3K4 site an internal site) is no longer methylated, although this H3 can be fully methylated on K36 and K79 (Fig. 4C lanes 5, 6). This finding illustrates that Set1/COMPASS specifically methylates the N-terminal tail of histone H3 and that the addition of a short heterologous sequence to the substrate’s N terminus, making the site of methylation an internal site, blocks methylation by Set1/COMPASS in yeast cells. To further confirm this observation in a reconstituted system, we also tested whether N-terminally 10-His-tagged H3 can be methylated either by Set1/COMPASS or MLL/COMPASS-like complex. Unlike WT H3, an internally engineered H3K4 site can no longer serve as a substrate for COMPASS (SI Appendix, Fig. S15, lanes 1–14) further suggesting that the COMPASS family members preferentially methylate N-terminal and not internal lysine sites.
Discussion
In this study, we established the 3D architecture of S. cerevisiae and human COMPASS complexes through characterization of the fully functional core assembly, consisting of the SET domain of Set1, Cps60/Ash2L, Cps50/RbBP5, Cps30/WDR5, and Cps25/DPY30 (Fig. 3). In vitro reconstitution of yeast COMPASS allowed us to prepare multiple truncated forms of Set1 in combination with other subunits (Fig. 1), and analyzed the generation of H3K4 methylation by these complexes (Fig. 1, SI Appendix, Fig. S3). The characterization of core COMPASS clearly shows that five components are required and sufficient for all forms (mono-, di-, and tri-) of H3K4 methylation in vitro, and therefore this complex likely mediates this modification on active genes in vivo. Furthermore, the obtained 3D architecture of core COMPASS, revealing the centrally located SET domain of Set1 that directly contacts Cps50, Cps30, and the Cps60-Cps25 subcomplex, provides the fundamental architectural blueprint mediating its enzymatic function and stability.
The core COMPASS subunits identified here are present in all six Set1/COMPASS and MLL1-4 COMPASS-like complexes (9). While a partially reconstituted core MLL complex lacking Dpy30 (Cps25) was able to trimethylate H3K4 in vitro (23), another study showed that the presence of Dpy30 enhanced the catalytic activity of the MLL complexes (16). Recent studies (16, 24) reveal a conserved physical interaction between Cps60/Ash2 and Cps25/Dpy30 both in yeast and humans. Using both biochemical and structural data, we provide evidence that the Cps60-Cps25 subcomplex interacts directly with the SET domain of Set1 (SI Appendix, Fig. S2E) and activates the in vitro H3K4 methyltransferase activity of core COMPASS (SI Appendix, Fig. S3D). We propose that our reconstituted core yeast and human COMPASS can serve as a model for COMPASS family members in metazoans.
The single-particle EM 2D and 3D analysis of core COMPASS and its subcomplexes have provided insights to the underlying molecular mechanisms of H3K4 methylation. The position of the SET domain, immediately adjacent to each component of core COMPASS, suggests a centralized organization that reflects the importance of the SET motif in catalysis, together with complex stabilization by its closely associated subunits. The histone methyltransferase active site within the SET domain is likely located in the middle of a central channel formed by flanking Cps50, Cps30, and the Cps60-25 subcomplex, suggesting that H3K4 methylation takes place inside the channel. This interpretation provides a structural explanation for the regulation of H3K4 di- and trimethylation by Cps60-Cps25 observed in vivo and in vitro. The Cps60-Cps25 subcomplex could directly alter the structure of the SET domain to allow an inward shift of iSET, the precise positioning of which was previously proposed to allow MLL1’s SET domain to trimethylate H3K4 (11). Additionally, the 3D map and modeling of the complex presented here have suggested that COMPASS family members may function as exo-methylases. We have shown that both in a reconstituted system and in vivo the COMPASS family does indeed prefer N-terminal lysines as substrate (Fig 4C).
Our reconstitution studies further suggest a previously unknown regulatory role for Cps30 in the progression from H3K4 di- to trimethylation by COMPASS (SI Appendix, Fig. S3). Consistent with our in vitro findings with yeast COMPASS, metazoan Cps30 (WDR5) has been shown to be specifically required for H3K4 trimethylation, but not for dimethylation, in Xenopus laevis embryos as well as in human HOX genes (25). Although the mechanistic aspect of Cps30-mediated regulation is unclear, it is possible that Cps30, through its proximity and interactions, directly affects the structure of the SET domain in a distinct way from that of Cps60-25, leading to an optimal conformation for H3K4 trimethylation.
Repressive H3K27 methylation, which has an apparently opposite function to H3K4 methylation, is also introduced by the SET domain-containing multiprotein complex PRC2 (26). COMPASS and PRC2 share common features, such as closely related SET-containing catalytic subunits that require complex formation for activity in vivo and in vitro (27, 28), and also the presence of multiple WD40 proteins (RbBP5 and WDR5 in MLL1/COMPASS, and EED and RbAP46/48 in PRC2) (29). It will thus be interesting to investigate whether and how substrate specificities and/or enzymatic mechanisms are common or diverged between COMPASS and PRC2.
Experimental Procedures
Plasmids and Yeast Strains.
Full-length Set1, Set1 (762–1,080), Set1 (780–1,080), Set1 (938–1,080), and the Cps subunits of COMPASS (Cps60, Cps50, Cps40, Cps35, Cps30, Cps25) except for Cps15 were fused with the FLAG epitope tag on their N termini and cloned into the transfer vector pBacPAK8 (Clontech). The MLL1 core complex components including MLL1-Win-SET(3,745–3,969), full-length WDR5, Ash2L, RbBP5, and Dpy30 were tagged with FLAG epitope tag on their N termini and cloned into the pBacPAK8 vector in the same manner. Yeast shuffle strain YBL574 was transformed with modified pWZ414-F12 plasmid, which encodes an N-terminally FLAG-tagged hht2 gene, and used for the analysis of exo-methylation by COMPASS.
Protein Preparation.
Recombinant COMPASS and MLL1 complexes were prepared through the BacPAK Baculovirus Expression System (Clontech). To prepare COMPASS complexes of various combinations of the Set1 and Cps subunits, exponentially growing Sf9 insect cell cultures were cotransfected with a mixture of viruses expressing specific combinations of COMPASS components and subsequently FLAG-purified. The MLL1 core complex was prepared in the same manner. For further details refer to SI Appendix.
In Vitro and In Vivo H3K4 Methylation Analysis.
Recombinant COMPASS complexes were incubated with 0.5 μg of free histone H3 and 200 μM S-adenosylmethionine in methyltransferase reaction buffer (50 mM Tris-HCl [pH 8.8], 20 mM KCl, 5 mM MgCl2, 0.5 mM dithiothreitol) for at least 2 h at 30 °C. The methylation of histone H3 was examined by Western analysis using anti-H3K4me1, me2, and me3 specific antibodies. Histone methylation in vivo was determined by Western analysis of cleared cell lysates using anti-H3K4me1, me2, me3, H3K79me2, me3, and H3K36me3 specific antibodes.
Specimen Preparation, EM Imaging, 2D Classification, and 3D Reconstruction of Negative Stained COMPASS Complexes.
Core COMPASS and subcomplexes were prepared for electron microscopy using the conventional negative staining protocol (30), and imaged at room temperature with a Tecnai T12 electron microscope operated at 120 kV using low-dose procedures. 2D reference-free alignment and classification of particle projections was performed using SPIDER (31). The random conical tilt technique (32) was used to calculate a first back projection map from individual classes using the images of the tilted specimen. FREALIGN (33) was then used for further refinement of the orientation parameters and for correction of the contrast transfer function to produce final 3D reconstructions. For further details refer to SI Appendix.
Cryo-EM Specimen Preparation, Imaging, and 3D Reconstruction.
Vitrified specimen was visualized on a Tecnai F20 electron microscope (FEI) equipped with a field emission electron source operated at 200 kV. Particles from cryo-EM images were excised using Boxer [part of the EMAN 1.9 software suite] (34). A total of 21,583 particles were initially subjected to multiple reference-supervised alignment with EMAN (18, 19), and the separated particle projections were then subsequently submitted to single reference refinement and 3D reconstruction. For further details refer to SI Appendix.
Supplementary Material
Supporting Information
Acknowledgments.
We thank Dr. J.F. Couture for his comments and insight into the assembly of COMPASS. We are also grateful to Laura Shilatifard for editorial assistance. G.W. is supported by the National Institute of General Medical Sciences (NIGMS) Molecular Biophysics Training Grant GM008270-23. R.C.T. is supported by R01-GM073839. Studies in the Shilatifard laboratory are supported by funds provided from the National Institute of Health R01-CA150265 and R01-GM069905. G.S. is supported by R01-DK090165.
Footnotes
The authors declare no conflict of interest.
*This Direct Submission article had a prearranged editor.
References
- 1.Bhaumik SR, Smith E, Shilatifard A. Covalent modifications of histones during development and disease pathogenesis. Nat Struct Mol Biol. 2007;14:1008–1016. doi: 10.1038/nsmb1337. [DOI] [PubMed] [Google Scholar]
- 2.Shilatifard A. Chromatin modifications by methylation and ubiquitination: implications in the regulation of gene expression. Annu Rev Biochem. 2006;75:243–269. doi: 10.1146/annurev.biochem.75.103004.142422. [DOI] [PubMed] [Google Scholar]
- 3.Zhang Y, Reinberg D. Transcription regulation by histone methylation: interplay between different covalent modifications of the core histone tails. Genes Dev. 2001;15:2343–2360. doi: 10.1101/gad.927301. [DOI] [PubMed] [Google Scholar]
- 4.Miller T, et al. COMPASS: a complex of proteins associated with a trithorax-related SET domain protein. Proc Natl Acad Sci USA. 2001;98:12902–12907. doi: 10.1073/pnas.231473398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Krogan NJ, et al. COMPASS, a histone H3 (Lysine 4) methyltransferase required for telomeric silencing of gene expression. J Biol Chem. 2002;277:10753–10755. doi: 10.1074/jbc.C200023200. [DOI] [PubMed] [Google Scholar]
- 6.Roguev A, et al. The Saccharomyces cerevisiae Set1 complex includes an Ash2 homologue and methylates histone 3 lysine 4. EMBO J. 2001;20:7137–7148. doi: 10.1093/emboj/20.24.7137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Schlichter A, Cairns BR. Histone trimethylation by Set1 is coordinated by the RRM, autoinhibitory, and catalytic domains. EMBO J. 2005;24:1222–1231. doi: 10.1038/sj.emboj.7600607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Schneider J, et al. Molecular regulation of histone H3 trimethylation by COMPASS and the regulation of gene expression. Mol Cell. 2005;19:849–856. doi: 10.1016/j.molcel.2005.07.024. [DOI] [PubMed] [Google Scholar]
- 9.Shilatifard A. Molecular implementation and physiological roles for histone H3 lysine 4 (H3K4) methylation. Curr Opin Cell Biol. 2008;20:341–348. doi: 10.1016/j.ceb.2008.03.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Eissenberg JC, Shilatifard A. Histone H3 lysine 4 (H3K4) methylation in development and differentiation. Dev Biol. 2010;339:240–249. doi: 10.1016/j.ydbio.2009.08.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Southall SM, Wong PS, Odho Z, Roe SM, Wilson JR. Structural basis for the requirement of additional factors for MLL1 SET domain activity and recognition of epigenetic marks. Mol Cell. 2009;33:181–191. doi: 10.1016/j.molcel.2008.12.029. [DOI] [PubMed] [Google Scholar]
- 12.Dehe PM, et al. Protein interactions within the Set1 complex and their roles in the regulation of histone 3 lysine 4 methylation. J Biol Chem. 2006;281:35404–35412. doi: 10.1074/jbc.M603099200. [DOI] [PubMed] [Google Scholar]
- 13.Lee JS, et al. Histone crosstalk between H2B monoubiquitination and H3 methylation mediated by COMPASS. Cell. 2007;131:1084–1096. doi: 10.1016/j.cell.2007.09.046. [DOI] [PubMed] [Google Scholar]
- 14.Nedea E, et al. The Glc7 phosphatase subunit of the cleavage and polyadenylation factor is essential for transcription termination on snoRNA genes. Mol Cell. 2008;29:577–587. doi: 10.1016/j.molcel.2007.12.031. [DOI] [PubMed] [Google Scholar]
- 15.Cao F, et al. An Ash2L/RbBP5 heterodimer stimulates the MLL1 methyltransferase activity through coordinated substrate interactions with the MLL1 SET domain. PLoS One. 2010;5:e14102. doi: 10.1371/journal.pone.0014102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Patel A, Dharmarajan V, Vought VE, Cosgrove MS. On the mechanism of multiple lysine methylation by the human mixed lineage leukemia protein-1 (MLL1) core complex. J Biol Chem. 2009;284:24242–24256. doi: 10.1074/jbc.M109.014498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Patel A, Vought VE, Dharmarajan V, Cosgrove MS. A novel non-SET Domain multi-subunit methyltransferase required for sequential nucleosomal histone H3 methylation by the mixed lineage leukemia protein-1 (MLL1) core complex. J Biol Chem. 2011;286:3359–3369. doi: 10.1074/jbc.M110.174524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Brink J, et al. Experimental verification of conformational variation of human fatty acid synthase as predicted by normal mode analysis. Structure. 2004;12:185–191. doi: 10.1016/j.str.2004.01.015. [DOI] [PubMed] [Google Scholar]
- 19.Menetret JF, et al. Architecture of the ribosome-channel complex derived from native membranes. J Mol Biol. 2005;348:445–457. doi: 10.1016/j.jmb.2005.02.053. [DOI] [PubMed] [Google Scholar]
- 20.Odho Z, Southall SM, Wilson JR. Characterization of a novel WDR5-binding site that recruits RbBP5 through a conserved motif to enhance methylation of histone H3 lysine 4 by mixed lineage leukemia protein-1. J Biol Chem. 2010;285:32967–32976. doi: 10.1074/jbc.M110.159921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Avdic V, et al. Structural and biochemical insights into MLL1 core complex assembly. Structure. 2011;19:101–108. doi: 10.1016/j.str.2010.09.022. [DOI] [PubMed] [Google Scholar]
- 22.Takahashi YH, et al. Regulation of H3K4 trimethylation via Cps40 (Spp1) of COMPASS is monoubiquitination independent: implication for a Phe/Tyr switch by the catalytic domain of Set1. Mol Cell Biol. 2009;29:3478–3486. doi: 10.1128/MCB.00013-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Dou Y, et al. Regulation of MLL1 H3K4 methyltransferase activity by its core components. Nat Struct Mol Biol. 2006;13:713–719. doi: 10.1038/nsmb1128. [DOI] [PubMed] [Google Scholar]
- 24.South PF, Fingerman IM, Mersman DP, Du HN, Briggs SD. A conserved interaction between the SDI domain of Bre2 and the Dpy-30 domain of Sdc1 is required for histone methylation and gene expression. J Biol Chem. 2010;285:595–607. doi: 10.1074/jbc.M109.042697. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wysocka J, et al. WDR5 associates with histone H3 methylated at K4 and is essential for H3 K4 methylation and vertebrate development. Cell. 2005;121:859–872. doi: 10.1016/j.cell.2005.03.036. [DOI] [PubMed] [Google Scholar]
- 26.Cao R, et al. Role of histone H3 lysine 27 methylation in Polycomb-group silencing. Science. 2002;298:1039–1043. doi: 10.1126/science.1076997. [DOI] [PubMed] [Google Scholar]
- 27.Kuzmichev A, Nishioka K, Erdjument-Bromage H, Tempst P, Reinberg D. Histone methyltransferase activity associated with a human multiprotein complex containing the Enhancer of Zeste protein. Genes Dev. 2002;16:2893–2905. doi: 10.1101/gad.1035902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Nekrasov M, Wild B, Muller J. Nucleosome binding and histone methyltransferase activity of Drosophila PRC2. EMBO Rep. 2005;6:348–353. doi: 10.1038/sj.embor.7400376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kuzmichev A, Jenuwein T, Tempst P, Reinberg D. Different EZH2-containing complexes target methylation of histone H1 or nucleosomal histone H3. Mol Cell. 2004;14:183–193. doi: 10.1016/s1097-2765(04)00185-6. [DOI] [PubMed] [Google Scholar]
- 30.Ohi M, Li Y, Cheng Y, Walz T. Negative staining and image classification—powerful tools in modern electron microscopy. Biol Proced Online. 2004;6:23–34. doi: 10.1251/bpo70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Frank J, et al. SPIDER and WEB: processing and visualization of images in 3D electron microscopy and related fields. J Struct Biol. 1996;116:190–199. doi: 10.1006/jsbi.1996.0030. [DOI] [PubMed] [Google Scholar]
- 32.Radermacher M, Wagenknecht T, Verschoor A, Frank J. Three-dimensional reconstruction from a single-exposure, random conical tilt series applied to the 50S ribosomal subunit of Escherichia coli. J Microsc. 1987;146:113–136. doi: 10.1111/j.1365-2818.1987.tb01333.x. [DOI] [PubMed] [Google Scholar]
- 33.Grigorieff N. FREALIGN: high-resolution refinement of single particle structures. J Struct Biol. 2007;157:117–125. doi: 10.1016/j.jsb.2006.05.004. [DOI] [PubMed] [Google Scholar]
- 34.Ludtke SJ, Baldwin PR, Chiu W. EMAN: semiautomated software for high-resolution single-particle reconstructions. J Struct Biol. 1999;128:82–97. doi: 10.1006/jsbi.1999.4174. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supporting Information