生信理论及操作

作者: 豆沙了 | 来源:发表于2020-07-25 17:46 被阅读0次
bsub -q interactive -Is bash
for i in *.lsf;do bsub < ${i};done

RNA-seq
【系列】 转录组入门(1):软件准备 https://www.plob.org/article/11457.html

SRA下载
SRA 数据下载自救指南 https://www.jianshu.com/p/19ae29e54c59
NGS小技能(2):如何进行SRA到fastq格式的快速转换 https://www.jianshu.com/p/4779b7103ae1
SRA Run Selector https://www.ncbi.nlm.nih.gov/Traces/study/?
GEO http://www.ncbi.nlm.nih.gov/geo/
都8102年了,还用fastq-dump,快换fasterq-dump吧 https://www.jianshu.com/p/5c97a34cc1ad

blast
这或许是我写的最全的BLAST教程 https://www.jianshu.com/p/de28be1a3bea

       qaccver - Query accesion.version
       saccver - Subject accession.version
        pident - Percentage of identical matches
        length - Alignment length
      mismatch - Number of mismatches
       gapopen - Number of gap openings   
        qstart - Start of alignment in query
          qend - End of alignment in query
        sstart - Start of alignment in subject
          send - End of alignment in subject
        evalue - Expect value
      bitscore - Bit score

[-db database_name]
[-taxids taxids]
[-query input_file]
[-out output_file]
[-evalue evalue]
[-outfmt format]
[-max_target_seqs num_sequences]
[-num_threads int_value]
 -html
 -sorthits <Integer, (>=0 and =<4)>

植物转录因子数据库
PlantRegMap http://plantregmap.gao-lab.org/
PlantTFDB http://planttfdb.gao-lab.org/

文本数据处理
shell 第一列相同 则拼接第二列 https://blog.csdn.net/weixin_33700350/article/details/92654968
EXCEL判断第一列如果相同,则合并第二列中的值到一个单元格,中间用逗号隔开 https://zhidao.baidu.com/question/1860633839081791987.html

PCA
一文看懂主成分分析 https://cloud.tencent.com/developer/article/1167713
PCA-弱水三千,取哪一瓢饮? https://www.jianshu.com/p/f20bd286ce32
Plotting PCA (Principal Component Analysis) https://cran.r-project.org/web/packages/ggfortify/vignettes/plot_pca.html

IGV
测序数据可视化 (二)- IGV https://cloud.tencent.com/developer/article/1036706

bedtools igv -i input.bed

保存igv的信息,igvtools接口的使用:将igvtools查看bam文件的结果保存为txt文件,并提取信息 https://blog.csdn.net/a_giant_pig/article/details/103011731
bam文件的可视化(测序深度) | IGV https://www.jianshu.com/p/c0635eb9ac24
基因组浏览器IGV实践 https://www.jianshu.com/p/4089d07ba239
学IGV必看的初级教程 https://cloud.tencent.com/developer/article/1050826
Running IGV with a batch file https://software.broadinstitute.org/software/igv/batch
File Formats https://software.broadinstitute.org/software/igv/IGV

如何快速找到某个基因的结构域和保守位点 https://www.sohu.com/a/216315762_419916
snapgene入门视频 https://www.snapgene.com/support/tutorial-videos/?referrer=SnapGene

RPKM vs FPKM vs TPM http://www.genek.tv/article/23
TPM、read counts、RPKM/FPKM你选对了吗? http://blog.sciencenet.cn/blog-3372875-1089851.html

表达矩阵的归一化和标准化,去除极端值,异常值 https://mp.weixin.qq.com/s/KQgWCdZ42EcqV_Qzc2wmTw
Z-Score数据标准化 https://www.cnblogs.com/itboys/p/10861492.html

STDEV是样本标准差的函数,STDEVP是总体标准差的函数。
如是总体(即估算总体方差),根号内除以n(对应excel函数:STDEVP);
如是抽样(即估算样本方差),根号内除以(n-1)(对应excel函数:STDEV);
因为我们大量接触的是样本,所以普遍使用根号内除以(n-1)。

Question: Error while doing indexing of fasta file using SAMTOOL faidx https://www.biostars.org/p/229449/

GWAS
GWAS | 原理和流程 | 全基因组关联分析 | Linkage disequilibrium (LD)连锁不平衡 | 曼哈顿图 Manhattan_plot | QQ plot https://www.cnblogs.com/leezx/p/9013615.html
动植物GWAS群体知多少 https://www.sohu.com/a/207046986_761120
全基因组关联分析 (GWAS) - 简介 https://www.jianshu.com/p/acdc4a22e30a

相关文章

网友评论

    本文标题:生信理论及操作

    本文链接:https://www.haomeiwen.com/subject/qbsmlktx.html