美文网首页程序员
sratoolkit安装和简单使用

sratoolkit安装和简单使用

作者: 少年英雄小猪熊 | 来源:发表于2018-11-17 16:47 被阅读0次

    [TOC]


    安装环境

    Ubuntu18.10
    sratoolkit2.9.2-ubuntu64


    安装过程

    1、从https://www.ncbi.nlm.nih.gov/sra/docs/toolkitsoft/中的NCBI SRA Toolkit latest release compiled binaries and md5 checksums下载安装包。
    2、将安装包sudo mv sratoolkit.current-ubuntu64.tar.gz -t /opt
    然后

    cd /opt
    tar xzvf sratoolkit.current-ubuntu64.tar.gz
    echo "export PATH=\$PATH:/opt/sratoolkit.2.9.2-ubuntu64/bin" >> ~/.bashrc
    source ~/.bashrc
    fastq-dump -h
    

    这里就安装完毕了。


    简单使用

    1、命令prefetch 可以从远程站点下载文件prefetch SRR1553610
    2、将sra转换成fastq:fastq-dump SRR1553610
    3、sra转换成fasta:fastq-dump --fasta 50 SRR1553610
    4、将双端测序文件分开:fastq-dump --split-files SRR1553610


    What is the purpose of the SRA toolkit?

    The SRA Toolkit, and the source-code SRA System Development Kit (SDK), will allow you to programmatically access data housed within SRA and convert it from the SRA format to the following formats:

    ABI SOLiD native (colorspace fasta / qual)

    fasta

    fastq

    sff

    sam (human-readable bam, aligned or unaligned)

    Illumina native

    You can also use the toolkit to convert from the formats listed below into the SRA format (not required for submission, but will allow you to use the SRA Toolkit to archive or analyze your data):

    fastq or fasta/qual pairs

    AB SOLiD-SRF

    AB SOLiD-native

    Illumina SRF

    Illumina native

    sff

    Aligned bam

    The SRA toolkit is available in versions compatible with Linux, Windows and Mac operating systems.


    How do I use the SRA Toolkit to convert data into a particular format?

    The SRA Toolkit contains a series of independent data-“dump” utilities that will allow you to convert SRA data into different file formats. As of version 2.3.2, the list of “dumpers” that are included with the toolkit include:

    fastq-dump: Converts data to fastq and fasta format.

    sam-dump: Converts data to sam (human-readable bam). Data submitted as aligned bam are output as aligned sam, while other formats are output as unaligned sam.

    sff-dump: Converts data to sff format. Note that only data submitted as sff can be converted back to this format.

    abi-dump: Converts data to csfasta/csqual format. Note that data submitted in base-space can be represented in color-space, but please be aware of the advantages / disadvantages of converting between different encodings.

    illumina-dump: Converts data to Illumina native and qseq formats.

    vdb-dump: Exports the vdb-formatted data of the .sra file.

    Each of the above links will open the current documentation / help page for the respective utility, which include frequently used options and their definitions, usage examples, and common errors messages / solutions.


    https://github.com/ncbi/sra-tools
    https://blog.csdn.net/sunchengquan/article/details/79781366
    https://blog.csdn.net/xubo245/article/details/50510026

    相关文章

      网友评论

        本文标题:sratoolkit安装和简单使用

        本文链接:https://www.haomeiwen.com/subject/tnxifqtx.html