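The gene rows in this GFF file carry a Name attribute. The script did not account for this case and took Name as the gene ID; the resulting gene IDs were too long, and MCScanX could not run.
Solution: remove Name from the ninth column of the gene rows.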
awk -F '[\t;]' '{if($3~/gene/) {print $1"\t"$2"\t"$3"\t"$4"\t"$5"\t"$6"\t"$7"\t"$8"\t"$9";"} else{print $0}}' Dod.chr.gff3
Output (the gene row now ends after ID):
chr07 . gene 28275981 28288196 . - . ID=evm.TU.scaffold_30.17;
chr07 . mRNA 28275981 28288196 . - . ID=evm.model.scaffold_30.17;Parent=evm.TU.scaffold_30.17;Name=EVM%20prediction%20scaffold_30.17
chr07 . five_prime_UTR 28288064 28288196 . - . ID=evm.model.scaffold_30.17.utr5p1;Parent=evm.model.scaffold_30.17
chr07 . five_prime_UTR 28287389 28287395 . - . ID=evm.model.scaffold_30.17.utr5p2;Parent=evm.model.scaffold_30.17
chr07 . exon 28288064 28288196 . - . ID=evm.model.scaffold_30.17.exon1;Parent=evm.model.scaffold_30.17
chr07 . exon 28287253 28287395 . - . ID=evm.model.scaffold_30.17.exon2;Parent=evm.model.scaffold_30.17
chr07 . CDS 28287253 28287388 . - 0 ID=cds.evm.model.scaffold_30.17;Parent=evm.model.scaffold_30.17
chr07 . exon 28281681 28282520 . - . ID=evm.model.scaffold_30.17.exon3;Parent=evm.model.scaffold_30.17
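The command above keeps only the first attribute of each gene row, so anything after ID is discarded along with Name. Where other attributes should survive, a narrower variant could remove just the Name attribute. The following is a minimal sketch, not the original script: the function name strip_gene_name is hypothetical, and it assumes a tab-separated GFF3 file whose gene rows have the type exactly equal to "gene".

```shell
# Hypothetical helper: drop only the Name=... attribute from gene rows of a GFF3 file.
# Assumes tab-separated columns; all other rows are printed unchanged.
strip_gene_name() {
    awk 'BEGIN { FS = OFS = "\t" }
         $3 == "gene" {
             sub(/;?Name=[^;]*/, "", $9)   # remove Name=... and the semicolon before it
             sub(/^;/, "", $9)             # tidy up if Name was the first attribute
         }
         { print }' "$1"
}
```

It might be used as `strip_gene_name Dod.chr.gff3 > Dod.chr.noName.gff3`, where the output file name is only an example.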
Original file, for comparison (the gene row carries Name):
chr07 . gene 28275981 28288196 . - . ID=evm.TU.scaffold_30.17;Name=EVM%20prediction%20scaffold_30.17
chr07 . mRNA 28275981 28288196 . - . ID=evm.model.scaffold_30.17;Parent=evm.TU.scaffold_30.17;Name=EVM%20prediction%20scaffold_30.17
chr07 . five_prime_UTR 28288064 28288196 . - . ID=evm.model.scaffold_30.17.utr5p1;Parent=evm.model.scaffold_30.17
chr07 . five_prime_UTR 28287389 28287395 . - . ID=evm.model.scaffold_30.17.utr5p2;Parent=evm.model.scaffold_30.17
chr07 . exon 28288064 28288196 . - . ID=evm.model.scaffold_30.17.exon1;Parent=evm.model.scaffold_30.17
chr07 . exon 28287253 28287395 . - . ID=evm.model.scaffold_30.17.exon2;Parent=evm.model.scaffold_30.17
chr07 . CDS 28287253 28287388 . - 0 ID=cds.evm.model.scaffold_30.17;Parent=evm.model.scaffold_30.17
chr07 . exon 28281681 28282520 . - . ID=evm.model.scaffold_30.17.exo
Gene IDs extracted before the fix (taken from Name):
LIU_sa07 EVM20prediction20scaffold_30.17 28275981 28288196
LIU_sa07 EVM20prediction20scaffold_30.1 27505683 27508995
LIU_sa07 EVM20prediction20scaffold_30.33 28962370 28967589
LIU_sa07 EVM20prediction20scaffold_30.74 29946477 29988627
LIU_sa07 EVM20prediction20scaffold_30.62 29734064 29735281
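The rows above follow a four-column layout: chromosome, gene ID, start, end. As an alternative to editing the GFF3 file, the gene ID could be read from the ID attribute directly, whether or not Name is present. The sketch below is an assumption about how that might look, not the script used here: the function name gff3_gene_table is hypothetical, the input is assumed to be tab-separated, and chromosome names are passed through unchanged (no renaming such as chr07 to LIU_sa07).

```shell
# Hypothetical helper: print chromosome, gene ID, start, end for every gene row.
# The gene ID is taken from the ID= attribute in column 9, never from Name=.
gff3_gene_table() {
    awk 'BEGIN { FS = OFS = "\t" }
         $3 == "gene" {
             n = split($9, attrs, ";")          # split column 9 into attributes
             for (i = 1; i <= n; i++) {
                 if (attrs[i] ~ /^ID=/) {
                     print $1, substr(attrs[i], 4), $4, $5
                     break                      # one ID per gene row is enough
                 }
             }
         }' "$1"
}
```

Calling `gff3_gene_table Dod.chr.gff3` on the unmodified file would then list IDs such as evm.TU.scaffold_30.17 rather than the Name value.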