sci-RNA seq 2017；sci-CAR 2018； s

作者: 韧_7e6f | 来源:发表于2023-03-13 21:22 被阅读0次

题目：
Comprehensive single-cell transcriptional profiling of a multicellular organism [sci-RNA seq; 2017 年science]
Joint profiling of chromatin accessibility and gene expression in thousands of single cells [sci-CAR（RNA+ ATAC）; 2018 年science]
The single cell transcriptional landscape of mammalian organogenesis [sci-RNA seq3 ; 2019年nature]

一作：Junyue Cao 现在Laboratory of Single Cell Genomics and Population Dynamics, The Rockefeller University, New York, NY, USA独立，2022年发表Easy-Sci。曹俊越，先后获得北京大学生物学学士学位、华盛顿大学基因组科学博士学位。博士期间曾在杰伊·申杜尔（Jay Shendure）的实验室工作，开发出新颖的高通量单细胞基因组技术。利用这项技术，曹俊越对包括蠕虫，小鼠和人类在内的整个生物的数百万个单细胞转录组进行了分析。通过在单细胞分辨率下量化哺乳动物胚胎发育的动力学，他绘制出在整个生物体范围内控制细胞增殖和分化的全球遗传计划。

通讯作者：Jay Shendure 华盛顿大学基因组研究所教授。Jay Shendure对开发和应用遗传学、基因组学和分子生物学的新技术和方法感兴趣。Shendure和他的团队的大部分工作广泛利用下一代DNA测序作为测量生物现象的有效方法。他们目前的项目大致分为六个方面:开发新的分子方法;发育生物学的基因组方法;大规模并行功能基因组学;将基因组学应用于临床;人类疾病的遗传基础;还有基因组测序技术。最终，Shendure的目标是开发创新，对我们对生物学和疾病的理解产生重大影响。

1 背景

sci-RNA seq 2017
虽然单细胞RNA测序(RNA seq)的方法已经激增，但它们依赖于在物理分隔室中分离单个细胞(2,5,8,12 - 17)。因此，用这些方法制备单细胞RNA-seq库可能是昂贵的，成本与处理的细胞数量成线性比例(18,19)。单分子组合索引（Single-molecule combinatorial indexing）可用于单倍型解析基因组测序和全新基因组组装(20,21)，而单细胞组合索引(single-cell combinatorial indexing, “sci”)可用于分析大量单细胞中的染色质可及性(sci- atac -seq)(22)、基因组序列(sci-DNA-seq)(23)、全基因组染色体构像(sci- hi -c)(24)和DNA甲基化(sci- met)(25)。在这项工作中，我们开发了一种组合索引方法来唯一地标记大量单细胞或细胞核的转录组，称为sci-RNA-seq。

sci-CAR 2018
同时分析单细胞内的多类分子(例如RNA和DNA)有可能揭示因果调节关系，并丰富生物尺度单细胞图谱的效用。然而，迄今为止，核酸“coassays”依赖于物理分离每个细胞，限制了每次研究的通量到几个细胞。

sci-RNA seq3 2019
老鼠发育很快，从受精到出生只需要21天。胚泡着床(E4.0)之后是原肠胚形成和胚层形成(E6.5-E7.5)1,2。在早期体细胞阶段，胚胎从原肠胚发育过渡到早期器官发生，形成神经板和心管(E8.0-E8.5)。在接下来的几天里(E9.5-E13.5)，胚胎从几十万个细胞扩展到1000多万个细胞，同时发育出几乎所有的主要器官系统。事实上，大多数导致重大发育缺陷的基因都可以在这个窗口中研究3,4。单细胞的转录谱分析(scRNA-seq)为获得发育过程的全局视图提供了一种有前途的途径5 - 7。例如，scRNA-seq最近揭示了小鼠发育过程中神经元和心肌细胞的显著异质性8,9。然而，尽管最近发布了两组小鼠scRNA-seq图谱(10,11)，但它们大多局限于成人器官，并没有试图表征发育过程中细胞类型的出现和动态。
为了提高通量，我们探索了> 1000个实验条件(扩展数据图1ab;方法)。由此产生的方法sci-RNA-seq3的主要改进包括:
(i)细胞核直接从新鲜组织中提取，无需酶处理，然后固定和储存。
(ii)对于第三轮indexing，改为发夹连接。
(iii)优化各个酶促反应。
(iv)稀释，并增加了超声和过滤步骤以最大限度地减少聚集来取代了FACS分选。
即使没有自动化，sci-RNA-seq3文库的制备也可以通过一个人的密集努力在一周内完成，每个细胞的成本不到0.01美元（费人，呵呵）。

2.问题及目的

问题：虽然单细胞RNA测序(RNA seq)的方法已经激增，但它们依赖于在物理分隔室中分离单个细胞。因此，用这些方法制备单细胞RNA-seq库可能是昂贵的，成本与处理的细胞数量成线性比例。
目的：大大提高通量。

3.工作流程

sci-RNA seq 两轮标记

(A) sci-RNA-seq工作流程示意图. AAAAA，多腺苷尾;NVTTTTT，聚胸腺嘧啶引物。(B)用于Illumina测序的sci-RNA-seq文库扩增子示意图。Bp，碱基对;R: Illumina测序引物的退火位点;P, Illumina P5或P7适配器序列。(C)人类和小鼠细胞唯一分子标识符(UMI)计数的散点图，由384 × 384 sci-RNA-seq确定。蓝色，推断小鼠细胞(n = 5953)。红色，推断的人类细胞(n = 3967)。灰色，collisions (n = 884). (D) 人与小鼠细胞UMI计数的散点图，采用优化的协议，通过96 × 96 sci-RNA-seq测定。蓝色，推断小鼠细胞(n = 129)。红色，推断的人类细胞(n = 160)。灰色, collisions (n = 5). 在(C)和(D)中，只显示了来自含有混合人类和小鼠细胞的孔的细胞。

sci-CAR

sci-RNA seq3

文库结构 Single Cell Genomics Library Structure

sci-RNA seq

sci-RNA seq3

其中值得注意的是实现高通量的方法不同以及通量不一样。
sci-RNA seq 两轮标记：RT barcode + （i5和i7）； ~104 single cells per experiment.
sci-RNA seq 三轮标记：RT barcode + indexed Tn5 +（i5和i7）； >106 single cells per experiment.
sci-RNA seq3 ： 384 * 384 * 768 = ~113 million possible combinations.

4.方案

4.1 细胞或细胞核处理

sci-RNA seq for 细胞：5M细胞在-20°C下5 mL冷的100%甲醇中固定10分钟，用1 mL冷的1X含1%DEPC的PBS(0.1%用于秀丽隐杆线虫细胞)，用含1% SUPERase In RNase Inhibitor (20 U/μL, Ambion)和1% BSA (20mg/ mL, NEB)的1ml冰冷PBS洗三次。细胞在最终浓度为5000 Cells /ul的洗涤缓冲液中重新悬浮。对于所有洗涤，细胞通过300xg离心3分钟，在4°C。
sci-RNA-seq for 细胞核：使用1 mL冰冷裂解缓冲液(10 mM Tris-HCl, pH 7.4, 10 mM NaCl, 3 mM MgCl2和0.1% IGEPAL CA-630 from(58，W. J. Greenleaf，2013 ATAC seq)，修改为还包括1% SUPERase In和1% BSA)组合并裂解5M细胞。将分离得到的核离心沉淀，用含1% DEPC的1X PBS 1 mL冰水洗涤2次，500 μL冷裂解液洗涤2次，500 μL不含IGEPAL CA-630的冷裂解液洗涤1次，在不含IGEPAL CA-630的裂解液中重悬，最终浓度为5000个核/μL。所有洗涤均在4℃下300xg离心3 min。后续使用的核，未做固定。
sci-CAR for 细胞核：将5M细胞结合并使用1 mL冷冻细胞裂解缓冲液(10 mM Tris-HCl, pH 7.4, 10 mM NaCl, 3 mM MgCl2和0.1% IGEPAL CA-630 from(ATAC seq)，修改后还包括1% SUPERase In RNase inhibitor和1% BSA)裂解。将分离得到的核沉淀，用500 μL不含IGEPAL CA-630的冷裂解液洗涤2次，以5000核/μL的终浓度重悬于不含IGEPAL CA-630的裂解液中。对于所有洗涤，通过500xg离心5分钟(4°C)将核沉淀。后续使用的核，未做固定。
sci-RNA-seq3 for 细胞核 ：每个小鼠胚胎用刀片在1 mL冰冷的细胞裂解缓冲液(10 mM Tris-HCl, pH 7.4, 10 mM NaCl, 3 mM MgCl2和0.1% IGEPAL CA-630 from53（ATAC seq），修改为也包括1% SUPERase in和1% BSA)中切碎成小块，并转移到40 μm细胞过滤器(Falcon)的顶部。用注射器柱塞(5ml, BD)的橡胶头在4ml细胞裂解缓冲液中均质组织。然后将过滤后的细胞核转移到一个新的15毫升试管(Falcon)中，在500×g上用离心机离心5分钟将其沉淀，并用1毫升细胞裂解缓冲液清洗一次。核被固定在4ml冰4%多聚甲醛(EMS)上15分钟。固定后用1 ml细胞核洗涤液(不含IGEPAL的细胞裂解液)洗涤细胞核2次，再用500 μl细胞核洗涤液重新悬浮。将样品分成两管，每管250 μl，在液氮中快速冷冻。我们根据每个胚胎中提取的核数与预期的总核数来估计核提取效率。估计核萃取效率为60% ~ 85%。用0.2% TritonX-100(在核洗涤缓冲液中)对解冻的核进行透化3分钟，并在低功率模式下进行短暂超声(Diagenode, 12秒)以减少核团块。然后用细胞核洗涤缓冲液清洗细胞核一次，并通过1 ml Flowmi细胞过滤器(Flowmi)过滤。过滤后的核在500×g下旋转5分钟，并在核洗涤液中重悬。

总结：在技术发展过程中，作者肯定使用更好的固定条件，所以对细胞来说，100%methanol是可以的；而对细胞核来说4%PFA是更好的。另外，提核方法值得关注，与10x genomics 标准不同，仅仅用了 IGEPAL CA-630（NP40），并没有用tween20和digitonin。此外，细胞核成团，可选用FACS或着超声(Diagenode, 低功率12秒)。以及离心力改为500g。

4.2 sci-RNA-seq 实验方案两轮标记法

4.2.1 RT

oligo-dT primer (5ʹ- ACGACGCTCTTCCGATCT (partial Truseq R1) NNNNNNNN [10bp index] T30 VN-3ʹ,
(“N” is any base and “V” is either “A”, “C” or “G”)
对于每个孔，将1000 -10,000个细胞或核(2µL)与1µL 25µm oligo-dT primers 和0.25µL 10 mM dNTPs，在55°c下变性5分钟，立即置于冰上。(注意为了优化RT反应效率，我们每孔使用2000个细胞或5000个核。)
stopped by adding 5 μl 2X stop solution (40 mM EDTA, 1 mM spermidine) to each well。所有细胞pool一起，DAPI染色，FACS分选单个的细胞到well里。
注意：这里方案和https://www.protocols.io/里不一样（是2020写，2021修改），那里没有终止RT反应，且RT反应程序为 2 minutes at 4°C, 2 minutes at 10°C, 2 minutes at 20°C, 2 minutes at 30°C, 2 minutes at 40°C, 2 minutes at 50°C, 15 minutes at 55°C, Forever at 4°C.

4.2.2 Second Strand Synthesis

(NEB；E6111L) 16°C for 150 min. The reaction was then terminated by incubation at 75°C for 20 min.

4.2.3 Tagmentation

用Nextera DNA Sample Preparation kit (Illumina；20018705)进行打断，且每孔加5 ng Human Genomic DNA (Promega)作为载体，以避免过度打断和减少纯化过程中的损失，请注意，由于用于扩增文库的pcr引物对rt产物是特异性的，因此该载体dna没有明显的扩增或测序。加入12 μ l DNA结合缓冲液(Zymo)停止反应，室温孵育5 min。每孔用36 ul ampure xp珠(beckman coulter)纯化，16 μ l缓冲液EB (Qiagen)洗脱，转移到新的孔板上。
注意：这里方案和https://www.protocols.io/里不一样（是2020写，2021修改），那里没有加carrier DNA。

4.2.4 Index PCR and sequencing

P5 primer (5ʹAATGATACGGCGACCACCGAGATCTACAC[i5]ACACTCTTTCCCTACACGACGCTCTTCCGATCT-3ʹ),
P7 primer (5ʹ-CAAGCAGAAGACGGCATACGAGAT[i7]GTCTCGTGGGCTCGG-3ʹ),
NEBNext High-Fidelity 2X PCR Master Mix (NEB).

4.3 sci-RNA-seq 实验方案 [三轮标记法]

4.3.1 RT and Second Strand Synthesis

RT 同4.2.1 RT
Second Strand Synthesis：16°C for 2 hours.
这里改进second strand synthesis的体系（是之前1.32倍）；反应时间缩短30min。而且，不用FACS分选，直接pool一起后，等体积分。

4.3.2 Tagmentation

用 Indexed TDE1 enzyme。所有细胞pool一起，DAPI染色，FACS分选单个的细胞到well里。

4.3.3 Index PCR and sequencing

先加 P7 primer (5ʹ-CAAGCAGAAGACGGCATACGAGAT[i7]GTCTCGTGGGCTCGG-3ʹ) and incubated at 55°C for 15 min. 然后还加了 1 μL 10% Tween-20, 1 μL nuclease-free water, 1μL of 10 μM indexed P5 primer (5ʹ AATGATACGGCGACCACCGAGATCTACAC[i5]ACACTCTTTCCCTACACGACGCTCTTCCGATCT-3ʹ;),
and 10 μL NEBNext High-Fidelity 2X PCR Master Mix (NEB). Amplification program and following steps were the same with sci-RNA-seq with two-level indexing。
比之前，多加了一步P7 primer孵育。

4.4 sci-CAR

4.4.1 RT

For each well, 5,000 nuclei (2 µL) were mixed with 1 µl of 25 µM anchored oligo-dT primer (同sci-RNA seq) and 0.25 µL 10 mM dNTP mix (Thermo), denatured at 55°C for 5 min and immediately placed on ice. 反应体系同sci-RNA seq，反应程序改成: 4°C 2 minutes, 10°C 2 minutes, 20°C 2 minutes, 30°C 2 minutes, 40°C 2 minutes, 50°C 2 minutes and 55°C 10 minutes.

4.4.2 Tn5 tagmentation for gDNA

3 μL cell lysis buffer without IGEPAL CA-630, 10 μL Nextera TD buffer (Illumina) and 1 to 2 μL
indexed TDE1 enzyme (Illumina) were added to each well. Tagmentation was performed at 55°C
for 30 min and stopped by adding 20 μl 2X stop solution (40 mM EDTA, 1 mM spermidine) to
each well. 所有细胞核pool一起，DAPI染色，FACS分选单个的细胞到well里。
注意：此时认为Tn5只能打断DNA双链；而后2020年一篇PNAS 证明Tn5也能打断RNA/cDNA杂交链。

4.4.3 Second Strand Synthesis

16°C for 180 min. The reaction was stopped by adding 6 μL DNA binding buffer (Zymo) and incubating at room temperature for 5 min.
注意：反应时间又增加，从原来150min改为180min；说明作者可能认为这一步反应不够充分。

4.4.4 i7 only Tn5 tagmentation for ds cDNA and index PCR for RNA part

tagmentation at 55°C for 5 min to carry out tagmentation. After tagmentation, each well was mixed with 0.4 μL 1% SDS, 0.4
μL BSA (NEB), and 2 μL of 10 μM P7 primer (5′-
CAAGCAGAAGACGGCATACGAGAT[i7]GTCTCGTGGGCTCGG-3′, IDT), and incubated at
55°C for 15 min. Then, to each well, we added 2 μL 10% Tween-20, 1.2 μL nuclease-free water,
2μL of 10 μM indexed P5 primer (5′-AATGATACGGCGACCACCGAGATCTACAC[i5]ACACTCTTTCCCTACACGACGCTCTT
CCGATCT-3′; IDT), and 20 μL NEBNext High-Fidelity 2X PCR Master Mix (NEB).
(~100,000 reads per cell) and the sci-RNA-seq library was sequenced to ~16,000 reads per cell.

4.4.4 index PCR for ATAC part

For the other plate (ATAC-seq dedicated portion), each well was mixed with 1 μL of 10 μM indexed P5 primer (5′-AATGATACGGCGACCACCGAGATCTACAC[i5]TCGTCGGCAGCGTC-3′; IDT), 1 μL of 10μM P7 primer (5′-CAAGCAGAAGACGGCATACGAGAT[i7]GTCTCGTGGGCTCGG-3′,IDT), Amplified libraries were then gel extracted to remove amplicons shorter than 200 bp, presumably primer dimers. ATAC-seq libraries was sequenced to ~55,000 reads per cell.

4.5 sci-RNA seq3

4.5.1 RT

For each well, 80,000 nuclei (16 μL) were mixed with 8 μl of 25 μM anchored oligo-dT primer (5′-/5Phos/CAGAGCNNNNNNNN[10bp barcode] T30-3′, where “N” is any base; IDT) and 2 μL 10 mM dNTP mix (Thermo), denatured at 55°C for 5 min and immediately placed on ice. SuperScript IV reverse transcriptase ，反应程序 (4°C 2 minutes, 10°C 2 minutes, 20°C 2 minutes, 30°C 2 minutes, 40°C 2 minutes, 50°C 2 minutes and 55°C 10 minutes).

4.5.2 ligation reaction

each well including 4 μL T4 ligation buffer (NEB), 2 μL T4 DNA ligase (NEB), 4 μL Betaine solution (5M, Sigma�Aldrich), 6 μL nuclei in nuclei wash buffer, 8μL barcoded ligation adaptor (100 uM, 5’-GCTCTG[9 bp or 10 bp barcode A]/dideoxyU/ACGACGCTCTTCCGATCT[reverse complement of barcode A]-3’) and 16 μL 40% PEG 8000 (Sigma-Aldrich). The ligation reaction was done at 16°C for 3 hours.
注意：特殊结构的oligo替代 index Tn5.

step1-3

4.5.3 Second Strand Synthesis

2,500 nuclei in 5 μL nuclei wash buffer and 5 μL elution buffer (Qiagen). 1.33 μl mRNA Second Strand Synthesis buffer (NEB) and 0.66 μl mRNA Second Strand Synthesis enzyme (NEB) were then added to each well, and second strand synthesis was carried out at 16°C for 180 min.

4.5.4 i7 only Tn5 tagmentation

55°C for 5 min to carry out tagmentation. The reaction was then stopped by adding 24 μL DNA binding buffer (Zymo) per well and incubating at room temperature for 5 min. Each well was then purified using 1.5× AMPure XP beads (Beckman Coulter).

4.5.5 USER enzyme treatment

In the elution step, each well was added with 8 μL nuclease free water, 1 μL 10× USER buffer (NEB), 1 μL USER enzyme (NEB) and incubated at 37°C for 15 min. 每孔再加6.5 μL洗脱缓冲液。AMPure XP珠被磁性支架除去，洗脱产物转移到一个新的96孔板中。

step5-6

4.5.6 Index PCR

For PCR amplification, each well (16 μL product) was mixed with 2 μL of 10 μM indexed
P5 primer (5′-AATGATACGGCGACCACCGAGATCTACAC[i5]ACACTCTTTCCCTACACGACGCTCT
TCCGATCT-3′; IDT), 2 μL of 10 μM P7 primer (5′-CAAGCAGAAGACGGCATACGAGAT[i7]GTCTCGTGGGCTCGG-3′, IDT), NEBNext High-Fidelity 2× PCR Master Mix (NEB).
Of note, 在单个实验中，我们在反转录步骤引入了384个条形码，发夹连接引入了384个条形码，PCR引入了768个条形码。.This corresponds to 384 * 384 * 768 = ~113 million possible combinations.

5 数据处理部分

主要针对物种混合实验：
Cells with over 85% of UMIs assigned to one species were regarded as species-specific cells。（sci-RNA seq and sci-RNA seq3）
Cells with over 80% of UMIs assigned to one species were regarded as species-specific cells, with the remaining cells classified as mixed cells or “collisions”. （sci-CAR）

6 总结对实验方案的选择

细胞/核固定：对细胞来说，100%methanol是可以的；而对细胞核来说4%PFA是更好的，且0.2% TritonX-100(在核洗涤缓冲液中)对解冻的核进行透化3分钟，但并没有解交联。另外，提核方法值得关注，与10x genomics 标准不同，仅仅用了 IGEPAL CA-630（NP40），并没有用tween20和digitonin。**此外，细胞核成团，可选用FACS或着超声(Diagenode, 低功率12秒)。以及离心力改为500g。
RT：25 µM for 1万细胞/核；SuperScript IV reverse transcriptase；反应程序：(4°C 2 minutes, 10°C 2 minutes, 20°C 2 minutes, 30°C 2 minutes, 40°C 2 minutes, 50°C 2 minutes and 55°C 10 minutes).
第二轮的标记：用特殊结构oligo替代Index Tn5
Second Strand Synthesis：反应条件要充分，2,500 nuclei 对应1.33 μl mRNA Second Strand Synthesis buffer (NEB) and 0.66 μl mRNA Second Strand Synthesis enzyme (NEB)。反应时间16°C for 180 min。
Tagmentation：用两个相同adaptor 的i7 only Tn5 tagmentation ，减少产物损失。
Index PCR：这一步，在直接用细胞/核时，会增加 P7 primer 孵育 55°C for 15 min这一步骤。