1.下载三个index:
![](https://img.haomeiwen.com/i13325312/8a7375554913c0d9.png)
2.重命名为:
hisat2_grcm38_genome_index/genome [1-sam]
hisat2_grcm38_genome_snp_tran_index/genome_snp_tran [1-sam]
hisat2_mm10_genome_index/genome [1-sam]
3.hisat2比对命令:
hisat2 -p 10 -x ../hisat2_grcm38_genome_index/genome -1 R1.fq -2 R2.fq -S 1.sam
hisat2 -p 10 -x ../hisat2_grcm38_genome_snp_tran_index/genome_snp_tran -1 R1.fq -2 R2.fq -S 2.sam
hisat2 -p 10 -x ../hisat2_mm10_genome_index/genome -1 R1.fq -2 R2.fq -S 3.sam
4.比对率:
![](https://img.haomeiwen.com/i13325312/2b3346607f4cef23.png)
![](https://img.haomeiwen.com/i13325312/fe776049235caec2.png)
![](https://img.haomeiwen.com/i13325312/67598677e5eb4123.png)
嘿嘿:比对发现转录本的比对率高
![](https://img.haomeiwen.com/i13325312/0683a5c5802c6dbc.png)
![](https://img.haomeiwen.com/i13325312/9aa68984347d9ba1.png)
5.查看sam文件寻找差异:
![](https://img.haomeiwen.com/i13325312/bbe66c4b222ab5cf.png)
6.看chr有哪些?
![](https://img.haomeiwen.com/i13325312/679382d53b5e5fe8.png)
![](https://img.haomeiwen.com/i13325312/94179a1ee4f0115c.png)
结论:基因组还是有差别的,也就是说后续的gtf不能混用!
7.下载查看gtf:
gencode:
axel -n 50 ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_mouse/release_M19/gencode.vM19.chr_patch_hapl_scaff.annotation.gtf.gz
gencode.vm19.GRCm38.all.ano.gtf
![](https://img.haomeiwen.com/i13325312/746fe3c5263f6dc1.png)
ensembl:
axel -n 20 ftp://ftp.ensembl.org/pub/release-94/gtf/mus_musculus/Mus_musculus.GRCm38.94.chr_patch_hapl_scaff.gtf.gz
ensembl.GRCm38.94.all.ano.gtf
![](https://img.haomeiwen.com/i13325312/c92bc3903dcc2535.png)
NCBI:
wget -c ftp://ftp.ncbi.nih.gov/genomes/M_musculus/GFF/ref_GRCm38.p4_top_level.gff3.gz
ncbi.GRCm38.p4.all.ano.gff3
![](https://img.haomeiwen.com/i13325312/ee17f8c8e215a442.png)
UCSC:
![](https://img.haomeiwen.com/i13325312/ce6861ee442a95da.png)
![](https://img.haomeiwen.com/i13325312/55bf20c416cd0d2e.png)
结论:
hisat2-GRCm38-index比对后:
若无--add-chrname,则肯定用enseml的gtf
若加了--add-chrname,则用gencode的gtf改造“+chr”/或者ensembl改造也行“+chr”
hisat2-mm10-index比对后:
使用UCSC的gtf
就转录本而言,其实用哪个都行...
![](https://img.haomeiwen.com/i13325312/5aa329f43ede0e2b.png)
8.下载查看fasta:
gencode:
axel -n 100 ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_mouse/release_M19/GRCm38.p6.genome.fa.gz
GRCm38.p6.genome.fa
![](https://img.haomeiwen.com/i13325312/b928120dc0951d3b.png)
ensembl:
axel -n 100 ftp://ftp.ensembl.org/pub/release-94/fasta/mus_musculus/dna/Mus_musculus.GRCm38.dna_sm.toplevel.fa.gz
Mus_musculus.GRCm38.dna_sm.toplevel.fa
![](https://img.haomeiwen.com/i13325312/f7366e8dc8a3d30a.png)
ucsc:
axel -n 30 ftp://hgdownload.soe.ucsc.edu/goldenPath/mm10/bigZips/chromFa.tar.gz
cat *.fa > mm10.fa
![](https://img.haomeiwen.com/i13325312/dc682fdf61a2459e.png)
查看具体序列:
![](https://img.haomeiwen.com/i13325312/397fcc6014b4664b.png)
结论:三者大部分相同,也有略微不同
gencode和ucsc有chr
ensembl没有chr
相比较而言,gencode和ensembl比较像,但是基本的fasta并不影响。
总结论:
使用哪个基因组的fasta都ok
使用gencode和ensembl的gtf也是一样的
差别在chr这块
![](https://img.haomeiwen.com/i13325312/fc31dc4b354839e0.png)
网友评论