目录
1.Module 1 - Introduction to RNA sequencing
- Installation
- Reference Genomes
- Annotations
- Indexing
- RNA-seq Data
- Pre-Alignment QC
2.Module 2 - RNA-seq Alignment and Visualization
- Adapter Trim
- Alignment
- IGV
- Alignment Visualization
- Alignment QC
3.Module 3 - Expression and Differential Expression
- Expression
- Differential Expression
- DE Visualization
- Kallisto for Reference-Free Abundance Estimation
4.Module 4 - Isoform Discovery and Alternative Expression
- Reference Guided Transcript Assembly
- de novo Transcript Assembly
- Transcript Assembly Merge
- Differential Splicing
- Splicing Visualization
5.Module 5 - De novo transcript reconstruction
- De novo RNA-Seq Assembly and Analysis Using Trinity
6.Module 6 - Functional Annotation of Transcripts
- Functional Annotation of Assembled Transcripts Using Trinotate
1.6 Pre-Alignment QC
在对齐之前,你可以使用FastQC来确定你的数据质量:
http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
还有一个视频教程
http://www.youtube.com/watch?v=bz93ReOv87Y
尝试在fastq文件上运行FastQC:
fastqc *.fastq.gz
查看一下结果
HBR_Rep1_ERCC-Mix2_Build37-ErccTranscripts-chr22.read1_fastqc.html
练习4
作业:在前面的实践练习中下载的其他fastq文件中运行FASTQC。
提示:请记住,将这些数据存储在一个名为“practice”的独立工作目录中。
在文件hcc1395_normal_1.fastq上运行FASTQC。并通过检查输出来回答这些问题。
问题
How many total sequences are there? 331,958
What is the range (x - y) of read lengths observed? 151
What is the most common average sequence quality score? 41
What does the Adaptor Content warning tell us? 有一些证据表明,Illumina接头在reads的3'端。这表明去除对于该数据来说可能是必须的。
网友评论