APP下载

Combinatorial genetics methods for discovering high-order regulatory combinations and engineering genetic drivers for neural differentiation

2021-12-05DawnTheanAlanWong

中国神经再生研究(英文版) 2021年12期

Dawn G. L. Thean, Alan S. L. Wong

Diving into the search for effective cell differentiation factors:Researchers are still striving to find better therapeutics to revert or slow down the progression of neurological disorders such as Parkinson’s disease, Alzheimer’s disease, and Huntington’s disease. These disorders are the result of neuronal cell death in different parts of the brain. Medications or treatments used to relieve the symptoms in patients have not been well established. They do not recover the damaged neural tissues and can result in unwanted side effects. Therefore, an increasing number of studies look to stem cells as a promising therapeutic, because of their selfrenewal capabilities and flexibility of differentiation into desired cell lineages for engraftment into the patient to recover the lost neural tissues. However,before stem cells can be clinically used in treating neurological disorders, there are still areas that require a better understanding to unlock their full potential.

Stem cells are regenerative and malleable, with the propensity to become any cell-type given the right concoction of factors to drive differentiation.However, there is a depth of complexity in the large network of factors. It is important to determine the combinations of transcription factors (TF),small molecules, and/or growth factors to achieve the optimal synergism for differentiating stem cells including embryonic stem cells (ESCs) and induced pluripotent stem cells at the fastest rate,resulting in the purest population of a neuronal lineage. However, there could be a strategic way to screen for these factors in a more cost-efficient and time-effective manner. This would help optimize the already established protocols that are still laborious or for further characterization of neuronal subtypes such as medium spiny neurons,sensory neurons, and serotonergic neurons.It remains a challenge to profile the drivers of neuronal fate comprehensively and define how these elements interact with one another in the regulatory network.

A few groups have used new high-throughput screening approaches to elucidate the regulatory network of cell fate and provided informative insights on factors that drive neuronal conversion and survival. Liu et al. (2018) used CRISPR activation (CRISPRa) to search for TFs or DNAbinding factors that promote the neuronal fate of ESCs. However, the work was limited to study single factors or pairwise combinations. Another systematic screen was conducted by Tsunemoto et al. (2018), which investigated the relationship between TFs. Most of their selected single TF showed no effects in the study, however, they identified 76 pairwise TF combinations to promote differentiation of mouse fibroblasts to induced neurons. Notably, their results lacked some consistency with previous studies, such as not detecting the dopamine active transporter Slc6a3 in induced neurons, possibly due to missing out on additional factors. Another study determined the essential genes to maintain or improve the survival of neurons by conducting a CRISPR interference(CRISPRi)-based screening (Tian et al., 2019).And again, they were limited to the screening of single sgRNAs. Further studies are required for an unbiased screening for other genetic factors, and to increase the number of factor combinations, as it has been shown that a cocktail of more than two factors may be required in different contexts to efficiently drive differentiation and reprogramming.A good example is the combination of Oct4, Sox2,Klf4, and c-Myc that is necessary for regulating the developmental signaling network for the pluripotency of ESCs (Takahashi and Yamanaka,2006). In such complex systems, there is a need to comprehensively characterize the functions of high-order genetic combinations in a highthroughput fashion, and Combinatorial GeneticsEn Masse(CombiGEM) method may just be able to do that.

CombiGEM method for a high-throughput,high-order, and systematic screen of factor combinations:CombiGEM offers not only a systematic way of conducting large-scale pooled screens but also has the capability of scalable assembly of high-order combinatorial genetic libraries (Wong et al., 2015, 2016; Zhou et al.,2020). The one-pot process allows the user to screen for a multitude of genetic combinations;1-way, 2-way, 3-way, and in theory, n-way libraries. This provides a rapid alternative to the conventional process of building and testing individual candidate combinations of interest.While several other combinatorial CRISPR screening strategies were also developed for studying pairwise genetic combinations (Han et al., 2017; Shen et al., 2017; Najm et al., 2018;Truong et al., 2019; DeWeirdt et al., 2020),CombiGEM offers a unique opportunity to evaluate interactions between three or more genetic combinations. For instance, if the user aims to use CRISPRa or CRISPRi for overexpressing or repressing, respectively, a list of candidate TF combinations, one could screen a barcoded library of TF-targeting sgRNA combinations using CombiGEM-CRISPR v2.0 (Wong et al., 2016; Zhou et al., 2020) as illustrated inFigure 1. By using one-pot ligation steps, the library of sgRNAs and their respective barcodes are incorporated into a lentiviral-destination vector that reports the expression of a fluorescent protein upon activation of a neuron cell type-specific promoter,such as tubulin α1. The library of sgRNAs and a separate lentiviral vector harboring the enzymatically deficient Cas9 either fused with a transcriptional activator or repressor can be delivered into the desired starting cells. Overtime, as cells begin to differentiate, only induced neuron-like cells will express fluorescence, and these cells can be isolated using fluorescenceactivated cell sorting for retrieving their harboring TF combinations via barcode sequencing. To gain a better understanding of the combinations that drive specific cell lineages, these cells can also be probed for known cell-lineage reporters or markers based on the preference of differentiation stages the user chooses to study. Magnetic-activated cell sorting could be used to separate cells of interest depending on the cell surface marker expressions.Single-cell RNA sequencing could be coupled to the combinatorial CRISPR screening as a readout to profile the type and levels of gene up or downregulation and the combination of targeted TFs can be determined via the barcode reads(Replogle et al., 2020). High-content imaging could be used to monitor the cell morphology changes from early to later stages of neuron development.The physiological relevance or functionality of the induced neuron-like cells can be determined by immunocytochemical labeling specific for neuron maturation, such as NeuN, TUJ1, and MAP2, and electrophysiological measures.

As mentioned earlier, studies have combined TF and small molecules to manipulate cell fate.CombiGEM can also be used to identify small molecule combinations that can maintain selfrenewal of stem cells, induce cell-lineage differentiation, or facilitate reprogramming by increasing the efficiency and potentially replacing the genetic factors. CombiGEM can also be applied by first determining the small molecule targets of signaling pathways, epigenetic, or cellular process factors. Then via designing a library of sgRNA to activate or inactivate the druggable targets, one can then identify the effective combinations of small molecules to enhance the differentiation or reprogramming. This concept is well reflected by the studies we have earlier conducted to discover drug target combinations against cancer and Parkinson’s disease (Zhou et al., 2020).

The use of CombiGEM is not only limited to CRISPR-based perturbations but could also be applied to other DNA or RNA regulatory factors to study the loss-of-function screens with RNA interference, as well as gain-of-function ones with the expression of microRNAs (Wong et al.,2015) and sequence-verified human open reading frames such as those reported in the human TFome library (Ng et al., 2020).

CombiSEAL method for high-throughput engineering of transcription factors:Finding the ideal combination of naturally occurring genetic factors can still pose challenges, such as being limited to characterized TFs of the genome with prior knowledge about their expression and roles in differentiation. Studies have demonstrated that engineering TFs or generating artificial transcription factors opens up a new avenue of options to speed up the rate of differentiation through alternate pathways, surpassing the dependence of prerequisite expression of other endogenous co-factors within the gene regulatory network, or altogether replacing natural TFs with more efficient ones (Jauch, 2018). This enables researchers to tailor the required factors according to their experimental needs.

We propose that if one aims to conduct highthroughput mutagenesis on multiple sites to a TF that comprises of different domains, CombiSEAL could prove to be a useful platform (Choi et al., 2019). The CombiSEAL method assembles amino acid-coding DNA fragments that are combinatorially tagged with barcodes (Figure 1).CombiSEAL begins with first sectioning parts of the protein where mutagenesis is desired. The user can generate any number of variants for each section, then tagging them with barcodes to identify the position and combination of mutations. Each pool of sections is then assembled sequentially by inserting them into a destination vector harboring the wild-type protein sequence modified with flanking type IIS enzymes at the region of intended mutagenesis. This scarlessfusion scheme of linking multiple parts of a protein is important to avoid adding unwanted amino acids to the protein. The CombiSEAL method allows determining the amino acid sequences of the selected pool of variants of interest more efficiently and cost-effectively, avoiding the need for long-read sequencing over the entire protein to identify the types of mutations and where they are situated across the protein. This methodical setup allows easier analysis by conducting a high-throughput sequencing of the pool of short concatenated barcodes that infers the combination of types and positions of amino acid mutations,or other desired modifications such as domain swapping, insertion, and deletion, that had been initially designed and installed on the TF.

Conclusion:In comparison to trial-and-error methods, advances have been made in employing more systematic approaches to determine the essential factors or combinations required to drive the conversion of one cell type to a neuronallineage. However, the ability to identify higherorder combinations has been lacking, and here we propose that the CombiGEM method may be able to address such limitations. CombiGEM uses a barcoded, one-pot ligation system to piece together higher-order combinations of DNA binding factors to target DNA in one-go, circumventing the process of multiple rounds of screening to narrow down the number of hits to a manageable size for downstream validations. Furthermore,less efficient natural TFs can be substituted with artificial or engineered TFs to better regulate gene expression. We describe a protein mutagenesis method, CombiSEAL, that will enable users to create large pools of transcription factor variants by straightforwardly assembling multiple site mutations tagged with barcodes within the protein and their different domains. This will speed up and lower the cost of screening procedures to retrieve the information on the types of mutations of the variants of interest. We hope that the proposed methods can assist with further understanding and broadening the possibilities of promoting neural regeneration and can be widely applicable to other areas of research.

Dr. Alan S. L. Wong would like to declare that patent applications have been filed based on our published work on the presented combinatorial genetics platforms.

The present work was supported by the University of Hong Kong Internal funds, Croucher Foundation Start-up Allowance, and NSFC 2020 Excellent Young Scientists Fund (to ASLW).

Dawn G. L. Thean, Alan S. L. Wong*

Laboratory of Combinatorial Genetics and Synthetic Biology, School of Biomedical Sciences,The University of Hong Kong, Pokfulam, Hong Kong Special Administrative Region, China(Thean DGL, Wong ASL)Department of Electrical and Electronic Engineering, The University of Hong Kong,Pokfulam, Hong Kong Special Administrative Region, China (Wong ASL)

*Correspondence to:Alan S. L. Wong, PhD,aslw@hku.hk.https://orcid.org/0000-0003-1790-3233(Alan S. L. Wong)

Date of submission:November 18, 2020

Date of decision:January 6, 2021

Date of acceptance:February 5, 2021

Date of web publication:April 23, 2021

https://doi.org/10.4103/1673-5374.313038 How to cite this article:Thean DGL, Wong ASL(2021) Combinatorial genetics methods for discovering high-order regulatory combinations and engineering genetic drivers for neural differentiation. Neural Regen Res 16(12):2403-2404.

Copyright license agreement:The Copyright License Agreement has been signed by both authors before publication.

Plagiarism check:Checked twice by iThenticate.

Peer review:Externally peer reviewed.

Open access statement:This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.

Open peer reviewers:Meng How Tan, Nanyang Technological University, Singapore; Abraam M.Yakoub, Stanford University, USA.

Additional file:Open peer review report 1.