Reference: https://www.biostars.org/p/147148/
The sra-toolkit has a utility, vdb-validate, which reports any errors in the data and performs an md5 checksum comparison.
Raw SRA data
Install the vdb-validate command:
conda install -c bioconda sra-tools
For SRA data downloaded with prefetch --option-file id, use the code below:
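# Start from an empty list so reruns do not pick up stale entries
: > list.txt
# Collect one name per .sra file, dropping anything after the first underscore
for i in *.sra
do
    SRR=${i%%_*}
    echo "$SRR" >> list.txt
done
# Validate each unique entry once
for j in $(sort -u list.txt)
do
    vdb-validate "$j"
done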
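The loop above prints the vdb-validate reports to the terminal, which is hard to scan when there are many runs. The following is a minimal sketch of one way to collect the results. It assumes that vdb-validate exits with a non-zero status when a check fails. The function name validate_sra and the .validate.log suffix are made up for this example.

```shell
# Validate every .sra file in a directory and report which ones fail.
# Assumption: vdb-validate returns a non-zero exit status on failure.
# validate_sra and the ".validate.log" suffix are hypothetical names.
validate_sra() {
    dir=${1:-.}
    failed=0
    for f in "$dir"/*.sra; do
        [ -e "$f" ] || continue            # skip when the glob matches nothing
        # Keep the full report next to the file for later inspection
        if vdb-validate "$f" > "$f.validate.log" 2>&1; then
            echo "OK $f"
        else
            echo "FAILED $f"
            failed=1
        fi
    done
    return "$failed"                       # 1 if any file failed, else 0
}
```

Calling validate_sra . in the download directory prints one OK or FAILED line per file, and the return status can be used in a script to stop before fastq-dump is run on damaged data.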
Checking the processed fastq files
For fastq files produced with fastq-dump --split-files --gzip, use the code below:
# Test each archive; any error message from gunzip goes to <file>.err
for i in *fastq.gz
do
    gunzip -t "$i" 2> "$i.err"
done
# List the non-empty .err files, i.e. the archives that failed the test
find . -name "*.err" -type f -size +0c -exec ls -larth {} \;
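The same check can be done without intermediate .err files. The sketch below relies on the exit status of gzip -t, which runs the same integrity test as gunzip -t. The function name check_fastq_gz is made up for this example. Note that this only tests the compressed stream; it does not check that the content is valid FASTQ.

```shell
# Print the name of every *fastq.gz file in a directory that fails
# gzip's integrity test. check_fastq_gz is a hypothetical helper name.
check_fastq_gz() {
    dir=${1:-.}
    bad=0
    for f in "$dir"/*fastq.gz; do
        [ -e "$f" ] || continue            # skip when the glob matches nothing
        # gzip -t exits non-zero for corrupt or truncated archives
        if ! gzip -t "$f" 2>/dev/null; then
            echo "$f"
            bad=1
        fi
    done
    return "$bad"                          # 1 if any archive failed, else 0
}
```

Running check_fastq_gz . prints nothing when all archives are intact, so an empty output together with a zero return status means every file passed.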