Progress and challenges in CRISPR-mediated therapeutic genome editing for monogenic diseases

2021-04-09ColinKonishiChengzuLong

THE JOURNAL OF BIOMEDICAL RESEARCH 2021年2期

Colin T. Konishi, Chengzu Long,2,3,4,✉

1Leon H. Charney Division of Cardiology, 2Helen and Martin Kimmel Center for Stem Cell Biology, 3Department of Neurology, 4Department of Neuroscience and Physiology, New York University Grossman School of Medicine, New York, NY 10016, USA.

Abstract There are an estimated 10 000 monogenic diseases affecting tens of millions of individuals worldwide. The application of CRISPR/Cas genome editing tools to treat monogenic diseases is an emerging strategy with the potential to generate personalized treatment approaches for these patients. CRISPR/Cas-based systems are programmable and sequence-specific genome editing tools with the capacity to generate base pair resolution manipulations to DNA or RNA. The complexity of genomic insults resulting in heritable disease requires patientspecific genome editing strategies with consideration of DNA repair pathways, and CRISPR/Cas systems of different types, species, and those with additional enzymatic capacity and/or delivery methods. In this review we aim to discuss broad and multifaceted therapeutic applications of CRISPR/Cas gene editing systems including in harnessing of homology directed repair, non-homologous end joining, microhomology-mediated end joining, and base editing to permanently correct diverse monogenic diseases.

Keywords: gene editing, CRISPR-associated protein 9, CRISPR-Cas system, genetic disease, medical genetics,genetic therapy

Introduction

Monogenic diseases are single gene diseases which affect tens of millions of individuals across the globe[1]. Of the approximately 20 000 protein coding genes in the human genome, almost 4000 genes have been implicated as causal in a monogenic disease.Though each instance is rare, the sum of individuals suffering from monogenic diseases accumulates to a high mortality and disease burden[2]. Intrinsic characteristics of the patterns of inheritance have allowed founder mutations to persist in populations,such as theBRCA1185delAG variant found in Ashkenazi Jews, thereby increasing the frequency of specific disease variants in specific populations[3].Through genetic mapping, extensive efforts have been made to link causal pathogenic variants to disease.Genetic linkage of genotype to clinical phenotype is a crucial component in understanding pathophysiology to better develop treatment approaches, such as the development of statins for controlling cholesterol levels in individuals with monogenic hypercholesterolemia. With the increasing implementation of sequencing technology in the clinic, the detection of rare variants of known monogenic diseases and previously unknown ultrarare monogenic diseases have increased, outpacing treatment approaches identified through traditional interventions[2]. The heterogeneity of clinical phenotypes arising from sequence-specific pathogenic mutations further complicates our understanding of the role of individual genes in monogenic diseases[4]. In combination with drug resistant monogenic diseases,such as in epilepsy with Dravet Syndrome, these barriers to treatments underscore the necessity of novel and personalized therapeutic approaches[5]. In this review, we focus on the implementation of CRISPR/Cas genome editing systems to target pathogenic variants underlying monogenic disease at the basic, translational, and clinical stages of biomedical research. Gene editing is the technology through which precise modifications are made to specific DNA sequences. Gene editing approaches are utilized to enact corrective and permanent changes to a patient's genome in gene therapy.

CRISPR/Cas systems

Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated proteins (Cas)-based genome editing has emerged as one of the most powerful tools for sequence-specific gene editing due to the programmable nature of its sequence specificity,broad targeting scope, efficacy of editing, and its functionality as a platform for engineering novel editing modalities[6]. Initially identified as an anti-viral adaptive immune system in prokaryotes, CRISPR/Cas systems have been repurposed for a multitude of functions including genome editing, transcriptional regulation, and visualization[7-9]. Present in approximately half of bacteria and nearly all archaea surveyed, CRISPR/Cas adaptive immune systems are broadly categorized as either Class 1 or Class 2. Class 1 CRISPR systems are comprised of a multi-subunit effector protein complex, while Class 2 CRISPR systems are comprised of a single effector protein[10-11]. The classification further delineates into type Ⅰ, Ⅲ, and Ⅳ systems categorized under Class 1,and type Ⅱ, Ⅴ, and Ⅵ categorized under Class 2.Despite the vast distinctions and evolutionary divergence between CRISPR/Cas types, CRISPR/Cas adaptive immunity shares the same core mechanistic steps: adaptation, CRISPR-RNA synthesis, and target elimination[11]. First, the genetic material of an invading viral infection is recognized and incorporated into the host CRISPR array. Second, when a subsequent viral infection from the same viral member is initiated and recognized by the host, the host uses the previously incorporated DNA sequence as a template to create a CRISPR-RNA (crRNA), which goes through type specific processes to form a mature crRNA. Third, the crRNA in association with the Cas effector protein or effector protein complex,recognizes and cleaves the invading viral DNA or RNA, thereby eliminating the viral threat[9-10,12]. By adopting the function of the crRNA and effector protein to target and cleave viral nucleic acids,researchers developed highly programmable and sequence-specific nucleases, with many breakthroughs generated through studies on the CRISPR/Cas9 system fromStreptococcus pyogenes(SpCas9)[8,13-15].

SpCas9 is the prototypic and most frequently used CRISPR/Cas nuclease in basic and clinical research settings. The Cas9 protein is targeted to a locus of interest through a 20-bp homologous sequence mediated by a crRNA-trans-activating CRISPR RNA(tracrRNA) hybrid or an engineered single-guide RNA (sgRNA)[8,16]. The sgRNA confers sequencespecific targeting directly upstream of a protospacer adjacent motif (PAM) sequence, a unique string of nucleotides recognized by the Cas protein for effector functionality (Fig. 1A). SpCas9 recognizes a 5′-NGG-3′ PAM, while orthologous Cas9 proteins such as Cas9 fromStaphylococcus aureus(SaCas9) andNeisseria meningitidisCas9 (NmeCas9) require PAM sequences of 5′-NNGRRT-3′ and 5′-NNNRRT-3′ respectively[17-18].Cas12a or Cpf1 (CRISPR fromPrevotellaandFrancisella1) is a type V Cas system that uses a 25-bp gRNA for sequence specificity which is downstream of a 5′-TTTV-3′ PAM sequence. As PAM sites are strict requirements for recognition by the Cas protein,PAMs serve as both a safeguard against off-target editing but also as a restriction limiting the scope of targetable sites. To address the limitations imposed by PAM sites, many groups have engineered Cas protein variants recognizing novel and flexible PAMs, such as xCas9 capable of recognizing the 5′-NG-3′, 5′-GAA-3′, and 5′-GAT-3′ PAMs and SpCas9-NG which recognizes the 5′-NG-3′ PAM[19-21]. Following sequence targeting and PAM recognition, SpCas9 generates a blunt double-strand break (DSB) in the DNA between the 4thand 3rdbase pair upstream of the PAM site,while other Cas proteins, such as Cpf1, creates a staggered DSB[22-24]. DNA DSBs are a highly cytotoxic event, prompting a quick DNA damage response in a biochemical, temporal, and cell cycle dependent manner[25-26].

Fig. 1 Genome editing strategies with Cas9, base editors, and prime editor. A: Cas9 generates a DNA double-strand break (DSB)through guide RNA (gRNA) facilitated complementarity to a target DNA locus. Protospacer adjacent motif (PAM) recognition by the Cas9-gRNA complex is required for cleavage. B: Cytosine base editor (CBE) mediates C-G to T-A transition mutations at the editing window through a cytidine deaminase fused to a Cas9 nickase (nCas9). Following deamination, a fused uracil glycosylase inhibitor (UGI) inhibits base excision repair by uracil N-glycosylase, thereby increasing the desired T-A outcome. A DNA single-strand break (SSB) on the nonedited strand induces repair of the non-edited strand using the edited strand as a template. C: Adenine base editors (ABE) mediate A-T to GC transition mutations at the editing window through an evolved adenosine deaminase fused to nCas9. An SSB on the non-edited strand induces repair of the non-edited strand using the edited strand as a template. D: Prime editors (PE) incorporate novel sequence through a reverse transcriptase (RT) fused to a nCas9 complexed to a prime editing guide RNA (pegRNA). The pegRNA is an extended gRNA with a primer-binding site and a RT template. The primer-binding site anneals to the nicked PAM strand, and novel DNA containing the edit is directly incorporated through the RT using the RT template.

HDR-mediated therapeutic genome editing

Homology directed repair (HDR) is an error-free DNA damage response initiated following a DSB.HDR is an accurate DSB repair process due to the high degree of sequence homology required for templating DNA synthesis[26-27]. Following a DSB, the 5′ ends of the DNA are resected creating a 3′ overhang and an adequate scaffold for DNA damage response proteins to engage. Strand invasion by a homologous template generates a double Holliday junction allowing for polymerase-mediated repair of lost DNA sequence. HDR is active during the G2/S phase of the cell cycle due to the preference for sister chromatids as a homologous template[28]. Additionally, exogenous templates can serve as homologous sequences for HDR, thereby resulting in a mechanism to generate sequence-specific targeted insertions or replacement[29].Targeted insertionsviaexogenous template DNA have been mediated through various constructs including plasmid DNA, single-stranded oligodeoxynucleotides(ssODN), and double-stranded oligodeoxynucleotides(dsODN) donor (Fig. 2A)[30-31]. Comparative analysis of SpCas9, NmCas9, SaCas9,Acidaminococcussp.BV3L6 Cpf1 (AsCpf1), andLachnospiraceaebacterium ND2006 Cpf1 (LbCpf1) demonstrated distinct editing and HDR characteristic preferences including the use of 5′ asymmetrically extended ssODN donor templates for SpCas9 and LbCpf1[22,32]. HDRmediated sequence insertions have become ubiquitous methods of generating research models of specific genotypes, as well as correcting pathogenic variants of disease[33-34].

Leber congenital amaurosis (LCA) is an autosomal recessive disease, causing childhood-onset blindness,with 6% of cases linked to mutations inRPE65[35].Current treatments are limited, with voretigene neparvovec, a gene therapy approach delivering an intactRPE65gene by adeno-associated virus (AAV)to patient retinas, as the only treatment approved by the FDA[36]. However, though visual restoration was incomplete, voretigene neparvovec progressed retinal gene therapy approaches, and established practical methods for the delivery of genetic material to patient retinas. Building upon these delivery methods, 3-week-oldrd12mice, a rodent model of LCA, were treated with a dual AAV system containing a separate Cas9 and sgRNA-Rpe65 donor construct, for delivery of the Cas9 endonuclease and an HDR template.Following AAV delivery through subretinal injection,Rpe65was permanently corrected in approximately 1% of targeted cells, with functional restoration of retinal function exceeding genetic correction. Treatedrd12mice responded to bright stimuli, and quantification of a- and b- wave amplitudes demonstrated significant increases toward wildtype control counterparts[37].

Fig. 2 DNA repair pathways following a double-strand break. A: Homology directed repair (HDR) can facilitate targeted sequence modification by providing a repair template with homology arms to the target sequence (indicated in blue and yellow). Repair templates can be single-stranded oligodeoxynucleotides (ssODN) as illustrated (sequence modification indicated in green), or double-stranded such as plasmid template donors or sister chromatids. B: Non-homologous end joining (NHEJ) is an error prone DNA damage response often resulting in insertions and deletions near the cleavage site (indels indicated in red). C: Microduplications can be precisely deleted by microhomology-mediated end joining (MMEJ). Cas9 generates a double-strand break between repeat sequences, and the subsequent processes of 5′ end resection followed by annealing of the homologous regions result in the deletion of the microduplication.

Mutations in ornithine transcarbamylase (OTC) in patients results in hyperammonemia leading to neurological dysfunction, and in neonatal males, is often fatal[38]. A mouse model for OTC deficiencies,spfash, has a G-to-A mutation resulting in aberrant splicing and a 20-fold reduction in OTC mRNA and protein expression. As genetic liver disease often presents in newborns, postnatal day 2 (p2)spfashmice were treated with AAV8 separately packaging SaCas9 and a sgRNA along with anOTCrepair template for dual deliveryviatemporal vein injection. Deep sequencing analysis of the targetedOTCgene identified 6% to 20% templated gene correction, with insertions and deletions (indels) identified in approximately 30% ofOTCalleles. Most treated mice had higher OTC protein levels than their untreated counterparts but failed to reach the levels of wildtype controls. However, by challenging the treated miceviaa high-protein diet, restoration of OTC function was demonstrated by a reduction in indicators of pathology and toxicity, in addition to complete survival within the treated cohort, contrary to the 70% survival of untreated or untargeted control mice[39]. Due to HDR restrictions in adult liver, adult mice treated with the same dual AAV gene editing system had significantly lower corrective editing and harbored more large deletions resulting in a loss of the minimal expression of OTC which remained in pathogenic cells.

To address the limitations ofin vivoHDR, many groups have utilized anex vivoediting approach for targeted insertions.Fah−/−mice, a rodent model of hereditary tyrosinemia type I (HTI), lack expression of fumarylacetoacetate hydrolase (Fah), an enzyme responsible for the final step of tyrosine catabolism.Fah−/−hepatocytes harbor a G-to-A point mutation in the splice donor of exon 7 resulting in splicing defects and impaired Fah enzymatic activity[40]. Fah deficiency is fatal owing to the accumulation of toxic metabolites in the hepatic system resulting in cell death, and eventually liver failure[41].Ex vivotemplated gene correction was enhanced due to an upregulation of HDR-related genes in cultured hepatocytes compared toin vivocounterparts which was further evidenced by an increase in cellular proliferation marker, Ki-67[42]. CulturedFah−/−hepatocytes were treated with a dual AAV gene editing strategy. Separate packages of an SaCas9 and gRNA, and a homologous repair template and gRNA resulted in a low fraction of HDR corrected hepatocytes. Once transplanted into theFah−/−liver,engrafted cells demonstrated a proliferative advantage leading to functional restoration as indicated by a reduction in hallmark indicators of liver impairment[42]. As hepatocytes are polyploidy and only a single correctedFahallele can restore enzymatic activity, the functional correction efficiencies are greater than editing efficiencies as quantified by sequencing bulk samples[43].

NHEJ-mediated therapeutic genome editing

Non-homologous end joining (NHEJ) is the primary mechanism responsible for repairing DNA DSBs in mammalian cells[44]. Unlike HDR, NHEJ is an error prone response, often resulting in indels due to the direct ligation of separated DNA strands following end resection (Fig. 2B)[45]. NHEJ has become a useful editing strategy in addition to HDR,with advantages including increased efficiency and being template free[15]. Because indels generated by CRISPR/Cas genome editing are site-specific, targeted indel formation by NHEJ is an efficacious mechanism for introducing frameshift mutations and premature stop codons. This strategy has been utilized in research and clinical contexts to disrupt gene activity or bypass mutated genomic regions.

The adaptive capacity of tumors to evade immune recognition and response can be in part facilitated through upregulation of programmed death-ligand 1(PD-L1). PD-L1+tumors are indicators of poor prognosis in multiple tumor types due to interactions with programmed death-1 receptors (PD-1) on activated T-cells to suppress immune response[46].Clinical studies have demonstrated that inhibition of PD-1 by PD-1 inhibitors significantly enhances antitumor immunity[47]. Disruption of the PD-1 gene,PDCD-1, in mesothelin-targeted CAR T cells (Meso CAR T) by NHEJ generated a 60% reduction in PD-1hicells. When PD-1 disrupted Meso CAR T cells were introduced into a xenograft mouse model, thein vivotumor response of PD-1 disrupted Meso CAR T cells demonstrated a significant reduction in tumor burden, and increased genomic copy number,suggesting increased expansion of PD-1 disrupted Meso CAR T cells[48]. In accordance with this study,phase 1 clinical trial to evaluate the effect of CRISPR/Cas9 disrupted PD-1 TCR knock-out CAR T cells in patients with mesothelin positive multiple solid tumors has been initiated[49].

β-hemoglobinopathies are hematologic diseases,encompassing in part, the genetic diseases βthalassemia and sickle cell disease (SCD)[50].Treatments for patients with β-hemoglobinopathies are incomplete with allogeneic hematopoietic stem cell (HSC) transplants as the only definitive treatment[51]. Insufficient availability of HSCs for transplantation has focused efforts on genetic therapies for generating new treatment avenues.During human development, there exist two transitions in globin regulation: from embryonic to fetal globin (HbF) during gestation, and fetal to adult globin (HbA) following birth[52]. Owing to this regulatory change, symptoms of β-thalassemia and SCD begin to manifest in the first few months of life,due to the silencing of HbF expression (Fig. 3A).Individuals with hereditary persistence of fetal globin(HPFH), a condition in which HbF is not silenced following birth, are largely protected by the deleterious outcomes of SCD or β-thalassemia[53]. A strategy to reactivate endogenous expression at the HbF locus was developed by disrupting a transcriptional inhibitor,BCL11A(Fig. 3B)[54]. Human CD34+hematopoietic stem and progenitor cells(HSPC) targeted for disruption ofBCL11Ashowed increased levels of HbF in erythroid cells, and a phase 1/2 clinical trial transplanting Cas9-mediatedBCL11Adisrupted autologous CD34+human HSPCs, termed CTX001, has been initiated[55-56]. Early evidence has shown a successful proof of concept with the first two patients independent of blood transfusions following CTX001 infusion[57].

Mutations that disrupt normal RNA splicing patterns are known contributors to disease, with the most prevalent pathogenic variants altering ciselements including the 5′ splice site, 3′ splice site,branch point or regulatory sites that facilitate spliceosome assembly[58-59]. Pathogenic mis-splicing has been implicated in β-thalassemia, cystic fibrosis,and LCA arising in the aberrant inclusion or exclusion of coding sequence[58]. LCA type 10 (LCA10) is a form of autosomal recessive retinal dystrophy caused by mutations inCEP290. IVS26, the most common mutation in LCA10, creates a novel splice donor site due to an A-to-G mutation in intron 26. Aberrant splicing generates a cryptic pseudoexon, resulting in an mRNA with a premature stop codon[60]. By targeting the IVS26 site with a pair of gRNAs flanking the mutation, a corrective inversion or deletion event is possible such that normal mRNA splicing is restored. The editing components were packaged in an AAV5 vector containing the pair of gRNAs and a SaCas9 expressed under a retinaspecific promoter. Editing in fully mature retinal explant cultures resulted in a corrective editing rate of 16.6%±6.5%. Further experiments in a humanizedCEP290IVS26 knock-in mouse model and nonhuman primate models showed dosage responses exceeding the targeted 10% minimal threshold of corrected foveal cone photoreceptors needed to restore normal vision[61]. This proof of concept study has led to the breakthrough phase 1/2 clinical trial titled EDIT-101 leading to the direct administration of CRISPR/Cas genome editing systems into a human patient for the first time[62].

Fig. 3 Rescue of fetal globin expression by non-homologous end joining-mediated gene disruption alleviates sickle cell disease. A: βhemoglobinopathies are caused by pathogenic variants in the adult β-globin gene. HBBS, the variant underlying sickle cell disease, produces mutant beta subunits of adult globin resulting in sickle shaped red blood cells (RBCs). B: Non-homologous end joining-mediated disruption of BCL11A derepresses expression of fetal globin (HGB2 and HGB1). Activation of fetal globin expression can alleviate the disease severity of β-hemoglobinopathies.

Duchenne muscular dystrophy (DMD) is a fatal disease of the muscle affecting approximately 1 in every 5000 males at birth. DMD is caused by mutations in dystrophin, a protein responsible for linking the dystroglycan complex with the cytoskeleton to maintain structure during muscle cell contraction[63]. Despite the heterogeneity of mutations causal in DMD, it has been estimated that 80% of individuals with DMD could be treated with exonskipping strategies to restore partial dystrophin expression[64]. Myoediting is a strategy to correctDmdmutations by NHEJ-mediated exon skipping. The strategy uses paired gRNAs to target the 5′ and 3′ ends of exon 23 to remove the premature termination codon present in themdxmouse, an animal model of DMD.Notably, deletion of exon 23 rescued normal open reading frame (exon 22 to exon 24), thereby generating a partial restoration of dystrophin protein expression following intraperitoneal, intramuscular, or retro-orbital injection of AAV9 package gene editing systems[65]. As partial restoration of dystrophin expression can improve patient cardiomyopathy and protect against contraction induced injury to skeletal muscle, myoediting has the potential to offer therapeutic benefit to majority of DMD patients.

Strikingly, recent publications have demonstrated that the indels produced following NHEJ are a predictable event, dictated largely by the Cas9 cleavage site and the surrounding local sequence[66-68].Single base duplications, short deletions, and microhomology-mediated deletions are among the most common indel occurrences and can be predicted with small cell type-specific variations. Fanconi anemia (FA) is a genetic disorder of the hematopoietic system caused by mutations in theFANCgenes, key components maintaining genome stability through involvement in DNA repair and cell division processes[69-70]. HDR is impaired in FA due to the intrinsic inefficiency of HDR in primitive HSPCs, and increased efficiency of NHEJ in FA cells. Patients with mutations inFANCgenes occasionally harbor compensatory mutations restoring the pathogenic frameshift mutations, demonstrating a therapeutic modality generated by corrective indels. NHEJmediated repair of prevalent patient-specific mutations ofFANCAincluding the c.295C＞T point mutation and the c.3558insG indel variant, produced healthy donor like cells with anin vitroandin vivorodent transplantation model proliferative advantage[71]. These findings demonstrate an efficient and precise method of targeted sequence reversion despite limitations imposed by disrupted DNA repair pathways.

MMEJ-mediated therapeutic genome editing

Microhomology-mediated end joining (MMEJ),also called alternative end joining, is a DNA damage response occurring following DNA DSBs[72]. MMEJ is an alternative repair pathway to HDR, initiated following DNA end resection[72]. Based on a sufficient region of sequence homology flanking a DSB,approximately 5 to 25 bp, a DSB is repaired through annealing the homologous regions together, thereby deleting one repeat and the intermediate sequence(Fig. 2C)[73]. Microduplications and sequence repeats are a common DNA replication error resulting in nascent genetic disease[74]. Inducing targeted DSB at a site flanked by these repeats meets the criteria to initiate the MMEJ DNA damage response, thereby having the potential to revert pathogenic microduplications and sequence repeats into a wildtype allele.

Limb-girdle muscular dystrophy type 2G(LGMD2G) is a systemic disease affecting multiple tissues including skeletal and cardiac muscle[75]. A pathogenic 8-bp duplication in theTCAPgene is found in approximately 1 in 1000 alleles in Eastern Asian populations. Patient derived induced pluripotent stem cells (iPSC) homozygous for the pathogenic microduplication at theTCAPgene were targeted for MMEJ facilitated gene editing. By targeting ribonucleoprotein complex of SpCas9 and gRNA to the mutant allele, a double-strand cleavage was induced at the boundary of the 8-bp duplication resulting in a precise 8-bp deletion in 57% of all cells targeted for editing. The same study demonstrated an MMEJ based gene editing approach for correction of a pathogenic 16-bp microduplication inHPS1, in which pathogenic variants are associated with Hermansky-Pudlak syndrome type 1 (HPS1). PAM site restrictions prevented cleavage at the center of the microduplication, but an array of gRNAs designed to induce cleavage at staggered locations along the microduplication resulted in variable wild-type reversion with the desired 16-bp deletion occurring at higher frequencies based on proximity of the cleavage to the center of the microduplication[76].

MMEJ gives rise to the ability to generate precise DNA integrationviaproviding an exogenous MMEJplasmid construct. MMEJ-plasmids contain donor templates for MMEJ-mediated targeted insertions where microhomology arms span approximately 5 to 25 bp[77]. Eight-week-old mice homozygous for a pathogenic frameshift insertion in exon 5 of theFahgene were treated with an MMEJ plasmid containing cDNA for exons 5 through 14 of theFahgene and an sgRNA targeting intron 4 ofFah, in tandem with a plasmid encoding SpCas9, delivered through hydrodynamic tail-vein injection. A week afterin vivoediting, NTBC, a pharmacological treatment for HTI,was withdrawn from treated mice. Targeted integration was evidenced by sequencing analysis,FAH protein expression, and drastically increased survival following 14 weeks without NTBC. The study further demonstrated an increase in MMEJmediated sequence integration 10-fold greater than HDR-based knock-in strategies, likely owing to differences in cell cycle status[78].

The 5 to 25 bp of sequence homology required between a target sequence and an MMEJ donor is advantageous considering the limitations in packaging size of common viral delivery methods[79].Recombinant AAVs are especially promising genetic delivery tools due to the safe and stable nature of transgene expression and variant tropism in tissues of biomedical interest[80-81]. As the packaging load of recombinant AAVs is limited to approximately 4.7 kb,a dual AAV delivery system is often utilized, which narrows the cells recipient of all gene editing components in appropriate stoichiometry[39,82]. In a mouse model of blindness caused by the pathogenic

Gnat1IRD2/IRD2andPde6ccpfl1/cpfl1mutations, a single AAV vector utilizing an MMEJ-mediated repair gene editing approach was developed. The MMEJ donor was packaged along with an SaCas9, two gRNAs targeting theIRD2mutation inGnat1, and appropriate regulatory elements including the neural retinaspecific promoterGRK1driving SaCas9 expression,all in a single AAV8 plasmid vector. Subretinal injections of 6-month old mice resulted in Gnat1 expression in photoreceptors at both the protein and mRNA levels, with sequencing ofin vivointegration showing 11%[83].

Base editor-mediated therapeutic genome editing

Base editing systems are a precise editing modality mediated by CRISPR/Cas gene editing tools, capable of generating single nucleotide transitions in DNA.Cytosine base editors (CBE) convert C-G to T-A base pairs, while adenine base editors (ABE) generate A-T to G-C base pair edits (Fig. 1B and C)[84-85]. These precise base pair edits occur within the editing window, a defined region corresponding to positions along the gRNA sequence, which is intrinsic to each base editing system.

CBE is an engineered gene editing tool built upon a modified enzymatically inactive Cas9 (dCas9). The APOBEC1 cytosine deaminase protein was fused to dCas9 which facilitates a C-to-U conversion, which through the endogenous mismatch repair pathway, is repaired to T-A, or reverted to the original C-G. To facilitate the C-to-T transitions, an uracil glycosylase inhibitor (UGI), which impairs the uracil base excision repair pathway, was fused onto the C-terminal of the dCas9 platform, resulting in the optimized system, to facilitate double-stranded edits, in optimized BE3 system, dCas9 was replaced with a Cas9 nickase(D10A), which biases the genomic repair machinery to preferentially generate the T-A edited base pair by generating a single-stranded cut on the non-edited strand[84]. To increase efficiency and reduce indel formation, a bacteriophage Mu protein, Gam, which binds to DSBs was added, resulting in the formation of BE4, with further optimization steps to improve expression and activity resulted in BE4max[86-87].

The generation of ABEs was initially met with difficulty due to the absence of an endogenous ssDNA adenosine deaminase. Targeted evolution of anEscherichia colitRNA adenosine deaminase, known as TadA, led to the generation of mutant TadA proteins, which when fused to a dCas9, can convert A to deoxyinosine (I), which are subsequently recognized as G in the RNA[85]. Initial engineering reports indicated ABE7.10 as the most efficient and specific ABE editing system, which contains a wildtype noncatalytic TadA monomer and the evolved TadA adenosine deaminase on the dCas9 platform.Similar to BE4max, ABEmax was generated with further optimizations to improve expression and activity[87].

Base editing strategies are a promising therapeutic approach for correcting pathogenic variants due to the possibility of generating precise base pair resolution edits without potentially harmful DSBs. Marfan syndrome (MFS) is an autosomal dominant disorder of the connective tissue arising from mutations inFBN1, a gene coding for the protein, fibrillin 1[88].Fibrillin 1 is a critical component of microfibrils, with pathogenic variants in theFBN1causing skeletal abnormalities and cardiovascular manifestations resulting in major morbidity and mortality[89].Embryos harboring a patient-specific heterozygous MFS mutation,FBN1T7498C, were generated byin vitrofertilization of an immature oocyte with sperm from the corresponding MFS patient.FBN1T7498Cis an eligible candidate for CBE targeted gene correction with an appropriate SpCas9 PAM 5′-NGG-3′ to ensure that only the causal mutation resides within the 4thto 8thgRNA position corresponding to the editing window of BE3, enabling a corrective edit while reducing the potential for deleterious bystander mutations[90]. BE3 mRNA and locus-specific sgRNA were microinjected into the zygotes. 6 of 7 injected embryos corrected to the wildtypeFBN1sequence,while 1 of 7 injected embryos harboring an unintended transition mutation. Furthermore, deep sequencing of potential gRNA dependent off-target sites indicated no detectable off-target editing events.A subsequent whole-genome analysis of 419 possible off-target sites as determined by a tolerance of 5 mismatches against the corrective gRNA resulted in no off-targets identified[91].

In addition to correcting pathogenic alleles by reverting disease-causing variants to wildtype sequences, expression of pathogenic transcripts can be halted through nonsense mediated decay by generating stop codons in a coding sequence. CBEs can generate stop codons within a coding sequence through C-to-T conversions of CAG, CGA, or CAA codons[92-93].Approximately 20% of amyotrophic lateral sclerosis(ALS) cases are caused by mutations in Cu-Zn superoxide dismutase 1 (SOD1). In these cases,pathogenicSOD1expression in motor neuron and non-neural cells are thought to underlie disease progression. ALS is a complex and fatal disorder affecting multiple tissues due in part to a loss of motor neurons in the spinal cord and brain. The G93A-SOD1 mouse model of ALS harbors 25 copies of a pathogenic humanSOD1G93Atransgene, thereby resulting in an aggressive neurodegenerative disorder characterized by the accumulation of mutant SOD1 protein, and ALS symptoms including motor neuron loss and muscular atrophy[94]. Targeted conversions ofSOD1G93Acoding sequence to generate stop codons can disrupt pathogenicSOD1expression. The sequence length of CBE systems requires additional considerations when developingin vivodelivery approaches as they exceed the packaging capacity of AAVs. Therefore, to targetSOD1G93A, a split-intein approach that facilitates trans-splicing of two halves of a split CBE system was developed which allows BE3 and aSOD1-specific gRNA to be separately packaged into a dual AAV9 system. An intrathecal injection of the AAVs into theSOD1G93Amice resulted in improved survival, and alleviation of symptoms including motor coordination, hindlimb grip strength,and increased weight. Molecular analyses indicated a reduction in SOD1 reactive inclusions, and increased motor neurons and decreased denervated neuromuscular junctions in treated mice, thereby demonstrating the potential of CBE-mediated correction strategies to prevent pathogenesis[94].Mirroring the generation of a stop codon to disrupt pathogenic transcript expression, ABE systems can convert ATG start codons to GTG or ACG in a strategy called i-Silence[95]. ClinVar analysis reveals 247 human diseases with aberrant start codon mutations, with 147 of these mutations targetable by the i-Silence strategy.

Cystic fibrosis (CF) is an autosomal recessive disorder caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene,which affects multiple organs with primary morbidity and mortality associated with the lungs[96]. A biobank of 664 patients, representing approximately half of the Dutch CF population, was described and surveyed revealing that approximately 20% of mutations could be corrected with base editors[97]. To test the efficacy of base editors on CF mutations, organoids from patient derived tissues were generated and assessed for molecular signatures of correction throughCFTRgene, transcript, and protein analysis as well as restoration of organoid function. Intestinal organoids from a patient harboring a homozygousCFTRc.2353C＞T were generated from a rectal biopsy. The organoids were dissociated into single cells and treated with an ABEmax with the pathogenic C-to-T(G-to-A) mutation. Editing efficiency by ABEmax was 9.3%, 5-fold greater than targeted correction by HDR-mediated correction, and a forskolin induced swelling assay to assess functional rescue of CFTR function in the intestinal organoids showed restoration to wildtype organoid levels. Patient derived intestinal organoids were generated from a patient harboring a homozygous c.3846G＞A mutation, in addition to a patient heterozygous for the third most prevalent nonsense CF mutation, c.1657C＞T. The c.3846G＞A and c.1657C＞T mutations did not have appropriate 5′-NGG-3′ PAMs, but could be targeted using mutationspecific gRNAs and a xABEmax plasmid built upon the xCas9 system which recognizes more flexible 5′-NGN-3′ PAM. After the dissociated organoids were treated with the xABEmax system and were expanded into intestinal organoids where forskolin induced swelling assays demonstrated rescue of organoid function to wildtype levels, despite correction efficiencies of about 1.5%. As 10% of residual CFTR function is associated with a milder form of disease,the editing efficiencies demonstrated, specifically of the ABEmax correction strategy for the c.2353C＞T mutation, has the potential to be an impactful therapeutic approach[98-99].

Recent innovations in base editing technologies have led to the generation of the dual-deaminases, a CRISPR/Cas gene editing platform capable of both A＞G and C＞T transition mutations within the editing window[100-102]. Dual-deaminases are built by combining ABE and CBE components onto the same nCas9 to target a locus for dual-deamination with a single gRNA. Dual-deaminases have the capacity to model and correct multinucleotide variants, as well as generate novel codon changes previously outside the capacity of ABE or CBE, with a single gRNA targeted process[100-102]. Within the promoter of the γ-globin genes are two mutations known to alleviate symptoms of β-hemoglobinopathies through increasing HbF production. The −113A-to-G mutation creates a GATA1 binding site, activatingHBG1transcription,while the -114C-to-T mutation disrupts binding of the transcriptional repressor, BCL11A[103-104]. By using the dual-deaminase A&C-BEmax, it was demonstrated that the simultaneous −113A-to-G and −114C-to-T mutations generated erythroid precursor cell lines with the greatestHBGmRNA expression[101]. Further analysis of edited clones revealed that additional mutations disrupting the promoter region did not increase induction ofHBG, and that the single −114Cto-T did not match the restoration of the combined−113A-to-G and −114C-to-T mutations. This proof of principle experiment demonstrates single nucleotide resolution dissection of gene sequence in a single experiment, and the potential application of a dualdeaminase system in therapeutic gene editing.

Prime editing

Prime editors (PE) are a gene editing system capable of generating targeted insertions, deletions,and all single base pair changes without a DSB or donor DNA template (Fig. 1D)[105]. PEs were generated by fusing a reverse transcriptase (RT) to a nCas9, and by utilizing a modified gRNA, known as a prime editing guide RNA (pegRNA), which includes a distinct primer-binding region followed by RT template which includes the desired edit. Once localized to the target site, the PAM strand of the DNA is nicked, followed by hybridization of the primer-binding region of the pegRNA to the corresponding region in the PAM strand. New DNA is synthesized by the RT templated by the pegRNA RT template resulting in a 5′ flap containing the original unedited sequence, and a 3′ flap harboring the edited sequence. 5′ flap cleavage is mediated through endonucleases including flap endonuclease 1, leading to ligation of newly synthesized DNA. In the optimized PE variant PE3, the unedited strand is nicked by a nCas9 to preferentially repair the unedited strand to match the edited strand. Initial results indicate that prime editors are a robust editing modality with situational advantageous uses over base editors and traditional CRISPR/Cas systems. Base editors can be constrained by multiple substrates within an editing window leading to bystander activity, or a lack of PAM sites to place intended targets within an editing window. PE can circumvent these limitations due to the templated nature of PE edits and the additional distance of possible edits due to the extended length supplied by the pegRNA. PEs can generate targeted insertions with comparable efficiencies to HDR with fewer unintended indels, though the size of the insertion can be a preventative factor for the current generation of PEs. The multiple mechanistic steps underlying the editing capacity of PE and the large genomic size of the PE system limitsin vivoediting.PE is a promising technology to expand the catalog of CRISPR/Cas gene editing tools for therapeutic purposes with further optimization.

Discussion and perspectives

CRISPR/Cas gene editing systems have the potential to correct thousands of monogenic diseases.Functional correction, partial restoration, and conversion of debilitating disease to milder variants is regulated, in part, at two stages: the corrective capacity of the specific gene editing strategy, and the delivery of the gene editing components to targeted tissues. A suitable CRISPR/Cas gene editing strategy can be chosen based on characteristics underlying the mutant variant including the type of mutation and the cell cycle status of the target cells, as outlined in a decision tree (Fig. 4). Following the path of the decision tree results in one of seven possible gene editing strategies. For many mutations, multiple gene editing strategies are possible and should be considered to identify the optimal correction strategy.The practical scope of gene editing is dictated by concerns regarding efficiency, which is inherent to each scenario and requires consideration of corrective doses. For example, 10% of corrected foveal cone photoreceptors restores functional vision in LCA, as does correction of 30% to 50% of DMD cardiomyocytes for rescue of contractile phenotype[61,65,106].

Fig. 4 Decision tree for choosing gene editing strategy for diverse mutations. CRISPR/Cas gene editing strategies can be chosen based on characteristics underlying the mutant variant. Following the decision tree will result in one of seven possible editing strategies. Homology directed repair (HDR) knock-in can produce a diverse range of modifications through a templated repair process, e.g., correction of leber congenital amaurosis through providing a donor oligo for templated repair. Non-homologous end joining (NHEJ) predictable indel generation is a method of introducing specific indels to restore the sequence or open reading frame, e.g., restoring the disrupted reading frame in Fanconi anemia. Microhomology-mediated end joining (MMEJ) knock-in and prime editor-mediated sequence editing can facilitate sequence modification in post-mitotic cells, e.g., correction of hereditary tyrosinemia type I through targeted insertion of a MMEJ plasmid.MMEJ repeat deletion can precisely delete short sequence repeats and microduplications through annealing of repeat sequences following MMEJ double-strand break repair, e.g., precise deletion of a 8-bp duplication in limb-girdle muscular dystrophy type 2G. Cytosine base editors can mediate C-to-T transition mutations in defined editing windows, e.g., correction of patient-specific Marfan syndrome mutations in patient embryos. Adenosine base editors can mediate A-to-G transition mutations in defined editing windows, e.g., correction of diverse cystic fibrosis mutations in intestinal organoids. Dual-deaminases can mediate both C-to-T and A-to-G transition mutations in a defined editing window, e.g., simultaneous generation of an activator binding site and disruption of an inhibitor binding site to alleviate symptoms of β-thalassemia.

Delivery of CRISPR/Cas genome editing systems into target cells and tissues is critical to successful therapeutic genome editing. Adeno associated viruses(AAV) are the leading approach for delivery of gene editing systems, gRNAs, and donor templates. AAVs are single-stranded viral vectors capable of delivering genomic payloads to multiple tissues, with minimal immunogenicity[107]. Despite the prevalence of AAVs,there are considerable obstacles limiting their widespread use, namely genomic packaging limitations (approximately 4.7 kb). Additionally,integration of AAV vectors into Cas9 induced DSB sites can occur at high frequencies across pathologically relevant tissue types including in the mouse brain, skeletal muscle, and cochlea[108].Integration occurred in multiple sites across the genome, enriched at the Cas9 cleavage site, as well as at favored sites independent of gRNA homology. To address limitations imposed by AAVs, multiple groups have bioengineered delivery approaches using polymers, nanoparticles, and modified viral platforms[109-111].

Off-target edits are unintended mutations generated at any region not specified by a gene editing modality[112]. These unintended edits are a major concern when considering the potential applications of CRISPR/Cas systems in basic, translational, and clinical settings due to the uncertain consequences of off-target mutations. Efforts to reduce off-target edits have focused on both gRNA-based and Cas proteinbased interactions. Multiple groups have developed tools to predict gRNA efficiency and specificity building on an understanding of gRNA and target characteristics including base pair composition, gRNA length, similarity of on- and off-target sequences,presence of PAMs, and mismatch locality[113-115].High-fidelity variants of SpCas9 such as SpCas9-HF1 and eSpCas9 with comparable on-target editing efficiency but reduced or undetectable off-target edits have been generated through sequence and structurally driven engineering approaches[116-117].

The additional components of the CBE and ABE systems result in different off-target profiles to Cas9 alone[118]. CBEs have been demonstrated to have increased off-target DNA editing, in addition to RNA editing, due in part to gRNA independent off-target mutagenesis by APOBEC1[119-121]. Engineered CBE variants have been generated to drastically reduce offtarget editing, with no minimal differences in ontarget editing demonstrating strategies to increase the on- to off-target differential[121-123]. Strikingly, offtarget analysis of ABEs reveals far fewer off-target edits, with identified mutations at similar frequencies to background[119,124-125].

As the numbers and types of gene editing technologies built upon CRISPR/Cas systems expands, our understanding of their applications follows. Each gene editing approach employs strategies unique to the architecture of the disease, and through collaborative efforts among the scientific and medical communities, our ability to generate the appropriate tools and develop the right understanding flourishes. The test to prove the potential of CRISPR/Cas systems to correct pathogenic mutations and alleviate disease has begun with multiple FDA approved clinical trials underway. With the pace of development and the ingenuity of the field, we are certain of a bright future for CRISPR/Cas gene editing approaches for the treatment of monogenic diseases.

Acknowledgments

We thank Dr. Joseph Miano for constructive criticism of the manuscript. This work was supported by grants from DHPS Foundation and Kids Connect Charitable Fund.

THE JOURNAL OF BIOMEDICAL RESEARCH

2021年2期