2023-07-23 upload_to_NCBI

Author: 麦冬花儿 | Published: 2023-08-14 10:17

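    For eukaryotes, only the genome sequence file needs to be uploaded; for prokaryotes, an annotation file must be uploaded as well.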

    # Installing tbl2asn (https://www.ncbi.nlm.nih.gov/genbank/tbl2asn2/)
    # Uncomment the next line if ~/software/linux64.tbl2asn.gz has not been downloaded yet
    #wget ftp://ftp.ncbi.nih.gov/toolbox/ncbi_tools/converters/by_program/tbl2asn/linux64.tbl2asn.gz -P ~/software/
    mkdir -p /opt/biosoft/tbl2asn
    gzip -dc ~/software/linux64.tbl2asn.gz > /opt/biosoft/tbl2asn/tbl2asn
    chmod 755 /opt/biosoft/tbl2asn/tbl2asn
    # Make tbl2asn available in future shells, then reload the current one
    echo 'export PATH=$PATH:/opt/biosoft/tbl2asn/' >> ~/.bashrc
    source ~/.bashrc
    
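    # Workaround for a tbl2asn binary that looks for libidn.so.11 when the system provides libidn.so.12
    sudo dnf install libidn libidn-devel
    sudo ln -s /usr/lib64/libidn.so.12 /usr/lib64/libidn.so.11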
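One way to confirm the installation is to check that the binary is on `PATH` and that none of its shared libraries are reported as missing. The sketch below assumes a Linux system where `ldd` is available; `check_binary_libs` is a hypothetical helper name, not part of the tbl2asn distribution.

```shell
# Check that a command is on PATH and that ldd reports no missing libraries.
# check_binary_libs is a hypothetical helper, not an NCBI tool.
check_binary_libs() (
    bin=$(command -v "$1") || { echo "$1 not found on PATH" >&2; exit 1; }
    # ldd prints "not found" next to every shared library it cannot resolve
    if ldd "$bin" 2>/dev/null | grep 'not found'; then
        echo "unresolved shared libraries for $bin" >&2
        exit 1
    fi
    echo "OK: $bin"
)

# Example:
#   check_binary_libs tbl2asn
```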
    
    mkdir -p /home/train/11.upload_to_NCBI/
    cd /home/train/11.upload_to_NCBI/
    
    # Create the ASN file
    # Convert the GFF3 annotation to a tbl (feature table) file
    gff3_remove_UTR.pl ~/00.incipient_data/data_for_gene_prediction_and_RNA-seq/Malassezia_sympodialis_V01.bestGeneModels.gff3 > Malassezia_sympodialis.gff3
    gff3_to_tbl_for_antismatsh.pl Malassezia_sympodialis.gff3 > Malassezia_sympodialis.tbl
    # Prepare the genome sequence file
    perl -p -e 's/>(\S+).*/>$1 [organism=Malassezia sympodialis] [gcode=1]/' ~/00.incipient_data/data_for_genome_assembling/assemblies_of_Malassezia_sympodialis/Malassezia_sympodialis.genome_V01.fasta > Malassezia_sympodialis.fsa
    # Create the sbt (submission template) file on http://www.ncbi.nlm.nih.gov/WebSub/template.cgi
    cp ~/00.incipient_data/data_for_genome_assembling/template.sbt Malassezia_sympodialis.sbt
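Before running tbl2asn it can help to confirm that the sequence file and the feature table refer to the same sequence IDs. The following is a minimal sketch, assuming the tbl file uses `>Feature <SeqID>` header lines and that each FASTA header starts with that same ID; `check_tbl2asn_inputs` is a hypothetical helper, and the output of `gff3_to_tbl_for_antismatsh.pl` may need a different ID extraction.

```shell
# Pre-flight check before running tbl2asn (sketch).
# check_tbl2asn_inputs is a hypothetical helper, not an NCBI tool.
check_tbl2asn_inputs() (
    fsa=$1
    tbl=$2
    # Every FASTA header is expected to carry an [organism=...] modifier.
    if grep '^>' "$fsa" | grep -v '\[organism=' >/dev/null; then
        echo "header without [organism=...] in $fsa" >&2
        exit 1
    fi
    work=$(mktemp -d) || exit 1
    # IDs: first word of each FASTA header, second field of each ">Feature" line.
    grep '^>' "$fsa" | sed 's/^>//; s/[[:space:]].*//' | sort -u > "$work/fasta.ids"
    awk '/^>Feature/ {print $2}' "$tbl" | sort -u > "$work/tbl.ids"
    # comm -13 keeps lines that appear only in the second file.
    missing=$(comm -13 "$work/fasta.ids" "$work/tbl.ids")
    rm -rf "$work"
    if [ -n "$missing" ]; then
        echo "IDs in $tbl that are absent from $fsa:" >&2
        echo "$missing" >&2
        exit 1
    fi
    echo "OK: $fsa and $tbl agree"
)

# Example:
#   check_tbl2asn_inputs Malassezia_sympodialis.fsa Malassezia_sympodialis.tbl
```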
    # Run tbl2asn to generate the ASN file with the .sqn suffix
    tbl2asn -t Malassezia_sympodialis.sbt -p ./ -a a -V vb -M n -Z discrep
    # real  0m29.280s
    # user  0m29.159s
    # sys   0m0.093s
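Once tbl2asn has finished, the validator output is worth scanning before submission. The sketch below assumes that the `-V` option wrote a `.val` file next to the `.sqn` file and that serious findings appear as lines beginning with `ERROR`, `REJECT`, or `FATAL`. The exact severity labels and their capitalization may differ between tbl2asn versions, so the match is case-insensitive. `count_val_errors` is a hypothetical helper.

```shell
# Count validator lines whose severity is ERROR, REJECT, or FATAL.
# Assumption: each finding sits on its own line and starts with its severity.
count_val_errors() {
    # grep -c exits non-zero when the count is 0, so the status is masked
    grep -c -i -E '^(error|reject|fatal)' "$1" || true
}

# Example (file name assumed from the run above):
#   count_val_errors Malassezia_sympodialis.val
```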
    
    # Upload the sequencing data
    md5sum testreads.fastq
    lftp -e "put -c testreads.fastq; exit" -u sra,VfOiVJn1 ftp-private.ncbi.nlm.nih.gov
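When several read files are uploaded, the checksums can be kept in a manifest file rather than read off the terminal. This is a small sketch using GNU `md5sum`; `write_md5_manifest` and the manifest name `upload.md5` are hypothetical.

```shell
# Record MD5 checksums for the files to be uploaded (sketch).
# write_md5_manifest is a hypothetical helper name.
write_md5_manifest() (
    manifest=$1
    shift
    md5sum "$@" > "$manifest"
)

# Example:
#   write_md5_manifest upload.md5 testreads.fastq
#   md5sum -c upload.md5    # re-check the local copies later
```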
    
