blastn的用法

作者: Yanglook | 来源:发表于2020-05-04 10:23 被阅读0次

blastn的用法
windows本地BLAST（二）：操作及应用
blastn跟blastp用法（linux）
用blast进行短序列（20nt左右）的搜索
blast notes
Biostar_handbook||charpter 12. B
这或许是我写的最全的BLAST教程
biostar handbook(七)| BLAST
Blastdbcmd,makeblastdb,blastn.使用
2022-04-10Diamond比对软件学习1

方法参考：https://blog.csdn.net/likelet/article/details/7567426
http://blog.sciencenet.cn/blog-508298-621460.html
https://www.ncbi.nlm.nih.gov/books/NBK279684/#appendices.Options_for_the_commandline_a

1. 以input的reads为query，contigs(fa格式，包含在Clustering的输出文件中)为database.

2. blastn使用参数如下：

-db 数据库文件名前缀
-query
-out 输出文件名
-outfmt
-num_threads
-evalue
-num_alignments
-word_size
-dust
-gapopen
-gapextend
-penalty
-reward

3. 进行blastn之前先执行makeblastdb，参数说明如下：

-in 后接输入文件，你要格式化的fasta序列
-dbtype 后接序列类型，nucl为核酸，prot为蛋白
-title 给数据库起个名，好看~~(不能用在后面搜索时-db的参数)
-parse_seqids 推荐加上，现在有啥原因还没搞清楚
-out 后接数据库名，自己起一个有意义的名字，以后blast+搜索时要用到的-db的参数

/public/home/ztchen/bio_soft/ncbi-blast-2.10.0+/bin/makeblastdb -in contigs.fasta -input_type fasta -dbtype nucl -title CL -out CL_db #contigs.fasta即为数据库，CL为数据库名，CL_db为以后blast+搜索时要用到的-db的参数，可以自己起一个有意义的名字
/public/home/ztchen/bio_soft/ncbi-blast-2.10.0+/bin/blastn -db $CL_db  -query $input_paired.fa -out input_blastn.out -outfmt 6 -num_threads 20  -evalue 1e-8 -num_alignments 1 -word_size 9 -dust no -gapopen 5 -gapextend 2 -penalty -3 -reward 2 #此处参数适用于计算ChIP-reads/input-reads ratio, input_paired.fa文件要求forward和reverse交错排列，可以通过RepeatExplorer处理