pubmed.ncbi.nlm.nih.gov

Structural Basis for Guide RNA Processing and Seed-Dependent DNA Targeting by CRISPR-Cas12a - PubMed

  • ️Sun Jan 01 2017

Structural Basis for Guide RNA Processing and Seed-Dependent DNA Targeting by CRISPR-Cas12a

Daan C Swarts et al. Mol Cell. 2017.

Abstract

The CRISPR-associated protein Cas12a (Cpf1), which has been repurposed for genome editing, possesses two distinct nuclease activities: endoribonuclease activity for processing its own guide RNAs and RNA-guided DNase activity for target DNA cleavage. To elucidate the molecular basis of both activities, we determined crystal structures of Francisella novicida Cas12a bound to guide RNA and in complex with an R-loop formed by a non-cleavable guide RNA precursor and a full-length target DNA. Corroborated by biochemical experiments, these structures reveal the mechanisms of guide RNA processing and pre-ordering of the seed sequence in the guide RNA that primes Cas12a for target DNA binding. Furthermore, the R-loop complex structure reveals the strand displacement mechanism that facilitates guide-target hybridization and suggests a mechanism for double-stranded DNA cleavage involving a single active site. Together, these insights advance our mechanistic understanding of Cas12a enzymes and may contribute to further development of genome editing technologies.

Keywords: CRISPR RNA; CRISPR-Cas; Cas12a; Cas9; Cpf1; R-loop; crRNA processing; nuclease; seed sequence; target DNA cleavage.

Copyright © 2017 Elsevier Inc. All rights reserved.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Structure of FnCas12a in complex with crRNA guide

(A) Sequence of the crRNA guide (see also Table S1). Nucleotides that are ordered in the structure are colored orange, unordered nucleotides are colored grey. (B) Schematic diagram of the domain organization of FnCas12a. REC: Recognition. PI: Protospacer Adjacent Motif (PAM) interacting. WED: Wedge. BH: Bridge Helix. Nuc: Nuclease. Both the WED and RuvC domains are formed by three discontinuous segments of the protein sequence. For the genomic context and purification details of FnCas12a, see Figure S1. (C) Overall structure of the FnCas12a-crRNA complex. Domains are colored according to the scheme in panel B. For alignments of the structures of the binary complexes of FnCas12a and LbCas12a, see Figure S2. (D) Close-up view of the crRNA seed sequence. Hydrogen bonds between FnCas12a residues and the seed sequence are displayed as dashed lines. For details of FnCas12a-crRNA binding interactions, see Figure S3. For a sequence alignment of Cas12a orthologs in which the conserved residues involved in crRNA seed sequence pre-ordering are indicated, see Figure S4. (E) Target DNA complementarity to the crRNA seed sequence is critical for cleavage. Top: Schematic representation of the crRNA (orange) and target DNA sequences used in the nuclease activity assays (see also Table S1 and Table S2). Numbering indicates the position of the mismatches between the crRNA and DNA targets used in the nuclease activity assays. Bottom: FnCas12a-crRNA complexes were incubated with pre-linearized plasmid DNA substrates containing a wild type (WT) target sequence or sequences with mismatches in the target site. Cleavage products were resolved by agarose gel electrophoresis. LIN: linearized plasmid substrate. M: 1kb DNA Ladder. For EMSA binding assays demonstrating the importance of the crRNA seed sequence in target binding, see Figure S5.

Figure 2
Figure 2. Pre-crRNA processing by FnCas12a

(A) Schematic representation of pre-crRNA substrates. Pre-crRNAX contains a deoxyuridine nucleotide at position (-19). Cy5-pre-crRNA is covalently labeled with a Cy5 fluorophore at the 5’ end of a truncated pre-crRNA sequence. (B) FnCas12a-mediated pre-crRNA processing depends on the presence of a ribose 2'-hydroxyl group in the nucleotide upstream of the scissile phosphate. FnCas12a was incubated with pre-cRNA1 or pre-crRNAX in the presence of chelating agents or divalent cations. Processing products were analyzed by denaturing polyacrylamide gel electrophoresis. (C) FnCas12a-mediated pre-crRNA processing generates a 5' product containing a 2',3'-cyclic phosphate. FnCas12a was incubated with Cy5-pre-crRNA, after which processing products were treated with Alkaline Phosphatase (AP; removes 3’-end phosphates) or T4 Polynucleotide Kinase (PNK; removes 2’,3’-cyclic phosphates). Reaction products were analyzed by denaturing polyacrylamide gel electrophoresis and fluorescence detection. (D) Close-up view of the pre-crRNA processing site in FnCas12a bound to pre-crRNAX. For a sequence alignment of Cas12a orthologs in which the conserved residues involved in pre-crRNA processing are indicated, see Figure S4. (E) Structural model of bona fide pre-crRNA substrate in which nucleotide U(-19) contains a (modeled) 2’-hydroxyl group, based on panel D. The 2’-hydroxyl group is positioned in close proximity of Lys869, suggesting that Lys869 acts as a general base in the catalytic mechanism. (F) Proposed model for the acid-base catalytic mechanism of pre-crRNA processing by FnCas12a.

Figure 3
Figure 3. Structure of FnCas12a-crRNA in complex with a dsDNA target

(A) Sequences of the crRNA guide and DNA target (see also Table S1). Structurally disordered nucleotides are colored grey. (B) Overall structure of the FnCas12a R-loop complex. The nucleic acid strands are colored as in panel A. For alignments of the structures of the ternary complexes of FnCas12a and AsCas12a, see Figure S2. For conformational rearrangements in FnCas12a upon transition from the binary to the ternary complex, see Figure S6. For details of FnCas12a-crRNA-DNA interactions, see Figure S7. (C) Close-up view of the interactions with the non-target DNA strand at the PAM-proximal end of the R-loop. (D) Close-up view of the interactions with the target DNA strand at the PAM-proximal end of the R-loop. (E) Mutations of the FnCas12a residues interacting with the displaced non-target DNA strand impair substrate DNA cleavage. Wild-type and mutated FnCas12a was incubated with crRNA and a pre-linearized plasmid DNA substrate. Cleavage products were analyzed by agarose gel electrophoresis. LIN: linearized plasmid substrate. M: 1kb DNA Ladder. ΔRLI: Ala-Gly3 substitution of R-loop interacting residues Thr698–Ser702. 1 and 2: products of linear DNA cleavage.

Figure 4
Figure 4. The PAM-distal DNA rehybridizes within the confines of FnCas12a

(A) Close-up view of the PAM-distal end of the crRNA-target DNA heteroduplex. Tyr410 stacking caps the crRNA-target DNA heteroduplex at base pair U20-dA(-20) and prevents duplex formation beyond nucleotide U20 of the crRNA. (B) Close-up view of the PAM-distal flank of the R-loop. The PAM-distal segments of the target and non-target DNA strands hybridize in the central cavity of FnCas12a, between the REC2 and Nuc domains. (C) Exonuclease III footprinting analysis of the FnCas12a–R-loop complex. Top: schematic representation of the crRNA sequence (orange) and target DNA sequences used in the footprinting assay (see also Table S1). FnCas12aDM-crRNA was bound to target DNA duplexes in which either the target strand (TS) or the non-target strand (NTS) was 5’-labeled with ATTO532. The resulting complexes were treated with Exonuclease III (ExoIII) and cleavage products were analyzed by denaturing polyacrylamide gel electrophoresis and fluorescence detection. M: Marker generated by S1 treatment of the labeled DNA strand. * and ** indicate the TS and NTS products formed by ExoIII when the dsDNA is protected by the FnCas12aDM-crRNA1 complex. Nucleotides removed from the TS and from the NTS by ExoIII are colored red in the schematic diagram at the top of the panel.

Figure 5
Figure 5. FnCas12a cleaves the target and non-target DNA strands with the same catalytic site

(A) Target strand DNA cleavage is inhibited by specific mutations in the RuvC domain (right panel), but not by any of the mutations in the Nuc domain (left panel). Wild-type and mutated FnCas12a were incubated with crRNA and a dsDNA oligonucleotide substrate in which the target strand was labeled with 3’-ATTO532. Cleavage products were analyzed by denaturing polyacrylamide gel electrophoresis and fluorescence detection. (B) Non-target strand DNA cleavage is completely inhibited by specific mutations in the RuvC domain (right panel), but not by any of the mutations in the Nuc domain (left panel). Wild-type and mutated FnCas12a were incubated with crRNA and a dsDNA oligonucleotide substrate in which the non-target strand was labeled with 5’-Cy5. Cleavage products were analyzed as in (A). (C) Cleavage of both target and non-target DNA strands generates products containing a 5’-phosphate and 3’-hydroxyl groups. dsDNA oligonucleotide substrates, labeled as indicated with covalently attached fluorophores, were cleaved with the FnCas12a-crRNA complex. Cleavage products were further incubated in the presence or absence of alkaline phosphatase and analyzed by denaturing polyacrylamide gel electrophoresis and fluorescence detection. (D) Close-up view of the RuvC and Nuc domain residues involved in DNA cleavage and the PAM-distal part of the R-loop. Arrow indicates the displacement of the scissile phosphate in the target DNA strand from the RuvC domain active site. For a modeled non-target strand in the RuvC catalytic site of FnCas12a, see Figure S8.

Figure 6
Figure 6. Schematic model of crRNA-guided target DNA binding and cleavage by Cas12a enzymes

Cas12a catalyzes the processing of its own crRNA guide and pre-orders crRNA nucleotides 1-5 in the seed sequence. PAM recognition by a preformed binding site in Cas12a positions the target DNA in register with the pre-ordered crRNA seed sequence. Local strand separation in the vicinity of the PAM allows the target strand to base pair to the crRNA seed sequence to initiate R-loop formation. The R-loop is formed by directional guide-target hybridization. The RNA-DNA heteroduplex is terminated after 20 base pairs at the PAM-distal end. The resulting R-loop structure can adopt two alternative conformations that result in the insertion of either the target or the non-target DNA strand in a single catalytic site located at the interface of the RuvC and Nuc domains.

Similar articles

Cited by

References

    1. Afonine PV, Grosse-Kunstleve RW, Echols N, Headd JJ, Moriarty NW, Mustyakimov M, Terwilliger TC, Urzhumtsev A, Zwart PH, Adams PD. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr D Biol Crystallogr. 2012;68:352–367. - PMC - PubMed
    1. Anders C, Niewoehner O, Duerst A, Jinek M. Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease. Nature. 2014;513:569–573. - PMC - PubMed
    1. Charpentier E, Marraffini LA. Harnessing CRISPR-Cas9 immunity for genetic engineering. Curr Opin Microbiol. 2014;19C:114–119. - PMC - PubMed
    1. Charpentier E, Richter H, van der Oost J, White MF. Biogenesis pathways of RNA guides in archaeal and bacterial CRISPR-Cas adaptive immunity. FEMS Microbiol Rev. 2015;39:428–441. - PMC - PubMed
    1. Cho SW, Kim S, Kim JM, Kim J-S. Targeted genome engineering in human cells with the Cas9 RNA-guided endonuclease. Nat Biotechnol. 2013;31:230–232. - PubMed

MeSH terms

Substances