美文网首页RNA-seq
RNAseq教程(2.5)

RNAseq教程(2.5)

作者: 周小钊 | 来源:发表于2020-12-29 17:09 被阅读0次

目录

1.Module 1 - Introduction to RNA sequencing

  1. Installation
  2. Reference Genomes
  3. Annotations
  4. Indexing
  5. RNA-seq Data
  6. Pre-Alignment QC

2.Module 2 - RNA-seq Alignment and Visualization

  1. Adapter Trim
  2. Alignment
  3. IGV
  4. Alignment Visualization
  5. Alignment QC

3.Module 3 - Expression and Differential Expression

  1. Expression
  2. Differential Expression
  3. DE Visualization
  4. Kallisto for Reference-Free Abundance Estimation

4.Module 4 - Isoform Discovery and Alternative Expression

  1. Reference Guided Transcript Assembly
  2. de novo Transcript Assembly
  3. Transcript Assembly Merge
  4. Differential Splicing
  5. Splicing Visualization

5.Module 5 - De novo transcript reconstruction

  1. De novo RNA-Seq Assembly and Analysis Using Trinity

6.Module 6 - Functional Annotation of Transcripts

  1. Functional Annotation of Assembled Transcripts Using Trinotate

2.5 Alignment QC

使用samtools和FastQC来评估比对结果

使用samtools view查看SAM/BAM比对文件的格式

samtools view -H UHR.bam
samtools view UHR.bam | head

尝试过滤BAM文件以排除某些标记。这可以通过samtools view -f -F选项实现

-f INT required flag -F INT filtering flag

Try requiring that alignments are 'paired' and 'mapped in a proper pair' (=3). Also filter out alignments that are 'unmapped', the 'mate is unmapped', and 'not primary alignment' (=268)

samtools view -f 3 -F 268 UHR.bam | head

现在要求只对“PCR or optical duplicate”进行比对。有多少reads符合这个标准?为什么?

samtools view -f 1024 UHR.bam | head

使用samtools flagstat获取比对的基本情况。比对到上reads的百分比是多少?

samtools flagstat UHR.bam
1174953 + 0 in total (QC-passed reads + QC-failed reads)
24539 + 0 secondary
0 + 0 supplementary
0 + 0 duplicates
1173741 + 0 mapped (99.90% : N/A)
1150414 + 0 paired in sequencing
575207 + 0 read1
575207 + 0 read2
1143860 + 0 properly paired (99.43% : N/A)
1148598 + 0 with itself and mate mapped
604 + 0 singletons (0.05% : N/A)
6 + 0 with mate mapped to a different chr
6 + 0 with mate mapped to a different chr (mapQ>=5)

samtools flagstat HBR.bam
793963 + 0 in total (QC-passed reads + QC-failed reads)
7597 + 0 secondary
0 + 0 supplementary
0 + 0 duplicates
793356 + 0 mapped (99.92% : N/A)
786366 + 0 paired in sequencing
393183 + 0 read1
393183 + 0 read2
783124 + 0 properly paired (99.59% : N/A)
785496 + 0 with itself and mate mapped
263 + 0 singletons (0.03% : N/A)
0 + 0 with mate mapped to a different chr
0 + 0 with mate mapped to a different chr (mapQ>=5)

SAM/BAM格式的详细信息可以在这里找到:http://samtools.sourceforge.net/SAM1.pdf

Using FastQC

你可以使用FastQC来执行基本的BAM文件的QC(Pre-Alignment QC)。输出非常类似于在fastq文件上运行FastQC。

相关文章

网友评论

    本文标题:RNAseq教程(2.5)

    本文链接:https://www.haomeiwen.com/subject/xaqqoktx.html