美文网首页
如何简单上手序列突变分析?

如何简单上手序列突变分析?

作者: 生信雀 | 来源:发表于2020-08-22 20:10 被阅读0次

无论是小尺度(少量)的序列集,还是大尺度(大量)的序列集,无论是基因片段的突变分析,还是全基因组的突变分析。

又或者,序列为非编码基因,或非编码基因+编码基因,或单纯的编码基因,或是蛋白质的氨基酸序列。

一图一个软件应该够了。

附BioAider下载地址:https://github.com/ZhijianZhou01/BioAider


BioAider的基因突变分析功能

This function could be used for analysis of the mutations characteristicson on large numbers of sequenced strains.  The sequence datas for analysis needs to be aligned in advance, and   they could be nucleotides, proteins ( amino acid )sequences or simply   coding gene fragments. For nucleotides and proteins sequences, BioAider   could summarizes all the mutation sites with corresponding frequency and  strains.

Of course, if the datas is codon gene, BioAider provides multiple  sets of different codon tables for users, and could scan each condon   sites in aligned sequence datasets, and identifies the type of mutation,   including synonymous, non-synonymous, insertions and deletions and   early termination. Finally, BioAider will automatically summarize and  output the relevant analysis results.

Note: The codon gene sequences for mutations analysis have to   be aligned by translation-alignment methon in advance, It is worth   mentioning that BioAider packed three multiple-sequence-alignment   software (mafft, muscle and clsutal-omega) in the graphical interface,  and provided translation-alignment additionally.

Whether it’s nucleotides or amino acids or coding genes, BioAider  could plot the frequency distribution graph for mutation sites through   specifing groups of substitution frequencey in custom.

Eaxmple of mutations analysis for aligned SARS-CoV-2 ORF3a gene (一个编码基因) sequences.

First, create frequency grouping in a table editor:

The each groups of substitution frequencey contains start value and end value  which are separated by tab symbol. Note, the start value of each group is not included in the range of frequency, and the frequencies of different groups need to be consecutive integers.

Then copy them to the textedit box of BioAider,and select "Codon" single button in  "Datas type":

After the run is over, these analysis result could be found in the  directory where the source file is located, you could scan the *_mutation site summary file then know the overall variation and mutation hotspots.

You could also konw the number of mutation sites under each mutation frequency group through view *_substitution frequency distribution.png.

It is not difficult to find that more than half of the mutation sites  only appear in a single strain, although there are many mutation sites  in ORF3a gene. Of course,BioAider  additionally  provides  vector  graphics  (*_substitution  frequency distribution.pdf), users can edit them and facilitate publication.

Besides, users could obtain the corresponding mutant strains of these variant sites in the detailed *_log.txt file.

Of note, if these sequences are much divergent, such as from different family enver order and contain a lot of gaps ("-") in the aligned sequence, I usually don't recommend using them for mutation analysis. On the one hand, they would make a lot of calculations, on the other hand, they are inherently highly variable and have no value of analysis.

相关文章

  • 如何简单上手序列突变分析?

    无论是小尺度(少量)的序列集,还是大尺度(大量)的序列集,无论是基因片段的突变分析,还是全基因组的突变分析。 又或...

  • 11.21 interview

    如何评价facebook开源的prophet时间序列预测工具? 时间序列分析 时间序列预测之--ARIMA模型 通...

  • Business Analytics with SQL in a

    写这个系列的初衷非常简单——写一个通俗、易懂、易上手的,如何使用SQL进行商业分析(business analyt...

  • 序列比对软件 MUMmer 简单上手(一)

    MUMmer 4 是一个快速比对 DNA 或蛋白序列的软件,最常用的 program 就是 nucmer 和 pr...

  • 学习笔记

    基于核酸和蛋白质序列如何研究生物进化?主要步骤是什么?蛋白质序列分析主要内容:(1)蛋白质序列的基本性质分析理化性...

  • 基因敲除技术(gene knockout)

    基因敲除技术,即是可以针对一个 DNA 碱基序列已知但功能未知的基因,通过抑制该基因的表达,分析突变体表型变化后来...

  • TCGA SNP突变数据分析

    一.基因突变的基础知识 二.SNP突变分析 (等更新)

  • 2020 时序分析随笔

    多元时间序列分析 通过分析序列自身的变化规律,构建时间序列模型 一元时间序列分析方法 多元时间序列分析 在 198...

  • 使用oncoPrint绘制瀑布图

    -by 杀杀 生信分析中,我们在全基因组测序后会获得分子突变谱信息,如何直观地将各种疾病分子亚型下的基因突变特征,...

  • what's mutant(基因敲除突变株)?

    基因敲除,即是针对一个 DNA 碱基序列已知但功能未知的基因,通过抑制该基因的表达,分析突变体表型变化后来确定该基...

网友评论

      本文标题:如何简单上手序列突变分析?

      本文链接:https://www.haomeiwen.com/subject/fuldjktx.html