美文网首页基因组学
Roary分析泛基因组

Roary分析泛基因组

作者: 胡童远 | 来源:发表于2021-10-07 17:51 被阅读0次

    文章:Roary: rapid large-scale prokaryote pan genome analysis. bioinformatics 2015
    引用:1780
    GITHUB: http://sanger-pathogens.github.io/Roary/
    Tutorial:https://github.com/microgenomics/tutorials/blob/master/pangenome.md

    conda安装

    conda create -n pantools
    conda activate pantools
    conda config --add channels r
    conda config --add channels defaults
    conda config --add channels conda-forge
    conda config --add channels bioconda
    conda install roary
    roary -h
    

    运行roary

    roary -e -n -v -i 80 -p 4 \
    -f ./result_roary/ \
    ./prokka_gff/*.gff
    

    参数:
    f: output directory
    e: create a multiFASTA alignment of core genes using PRANK
    n: fast core gene alignment with MAFFT, use with -e
    p: number of threads [1]
    v: verbose output to STDOUT
    r: create R plots, requires R and ggplot2
    i: minimum percentage identity for blastp [95]

    运行过程

    Fixing input GFF files
    Extracting proteins from GFF files
    Combine proteins into a single file
    Iteratively run cd-hit
    Parallel all against all blast
    Cluster with MCL
    Running command: pan_genome_post_analysis
    Running command: FastTree -fastest -nt acces sory_binary_genes.fa > accessory_binary_genes.fa.newick
    Running command: protein_alignment_from_nucleotides  -v  --mafft pan_genome_sequences/lexA.fa
    Running command: mafft --auto --quiet pan_genome_sequences/group_537.fa > pan_genome_sequences/group_537.fa.aln
    

    roary结果

    gene_presence_absence.Rtab文件即是PAV

    core/soft/shell/cloud genes

    number_of_genes_in_pan_genome.Rtab (10种pan组合)

    number_of_conserved_genes.Rtab (10种core组合)

    相关文章

      网友评论

        本文标题:Roary分析泛基因组

        本文链接:https://www.haomeiwen.com/subject/yreznltx.html