表观基因组

作者: 思考问题的熊 | 来源:发表于2018-02-11 18:21 被阅读86次

不同层次的表观

img
  • Broadly, features at different levels of chromatin organization are generally associated with inactive (off) or active (on) transcription.
  • From the top, genomic DNA is methylated(Me) on cytosine bases in specific contexts and is packaged into nucleosomes, which vary in histone composition and histone modifications (for example, histone H3 lysine 9 trimethylation (H3K9me3)); these features constitute the primary layer of chromatinstructure.
  • Here, different histone modifications are indicated by coloured dots and histone variants such as H2A.Z are brown.
  • DNA in chromatin may remain accessible to DNA-binding proteins such as transcription factors (TFs) and RNA polymerase II (RNAPII)or may be further compacted.
  • Chromatin can also organize into higher-order structuressuch as nuclear lamina-associated domains and transcription factories.
  • DOI:10.1038/nrg2905

表观实例

基因印迹

img

表观遗传定义

  • A mitotically or meiotically heritable state of different gene activity and expression (phenotype) that is independent of differences in DNA sequence (genotype) – based on Conrad Waddington, 1942
  • The sum of the alterations to the chromatin template that collectively establish and propagate different patterns of gene expression (transcription) and silencing from the same genome.
  • Epigenetic changes influence the phenotype without altering the genotype.
  • While epigenetics often refers to the study of single genes or sets of genes, epigenomics refers to more global analyses of epigenetic changes across the entire genome.

表观基因组 epigenome

  • The complete number,location, and types ofepigenetic modificationsthat occur in a given cell

Epigenetic mechanisms

DNA methylation

  • Normal cells — role in gene expression and chromosome stability
  • Cancer cells — consequences of aberrant hypo- and hyper-methylation

Histone modification

  • Normal cells — the histone code
  • Cancer cells — consequences of altered histone modifying enzymes

Interaction between DNA methylation, histone modifications, and other players such as non-coding RNAs

Cell and tissue type specificity

Gene-environment interaction, disease susceptibility

DNA methylation

human 特征

img
  • DNA methylation occurs mostly in CG context, but also non-CG sequences (mCHG, mCHH)
  • DNA methylation is generally depleted from promoters, and enriched within gene bodies
  • DNA methylation is partially depleted at regulatory element

植物中有特殊的RdDM

bisulfite sequencing

基本原理

img
  • DNA的重亚硫酸盐转化来检测未甲基化的胞嘧啶
  • 在文库制备过程中,重亚硫酸盐将未甲基化的胞嘧啶转化成尿嘧啶
  • 转化后的碱基在测序中被识别为胸腺嘧啶(PCR后),并通过序列计数来确定甲基化胞嘧啶的比例

比对的过程需要进行转换

img

常用mapping软件算法

  • bismark
img
  • bsmap
img
  • Bisulfite seed table, using the original seed and bisulfite variants as keys and corresponding coordinates in the reference genome as values. Each read was looked up in the seed table for potential mapping positions.

  • A positional specific mask of the corresponding reference sequence was generated by setting 01 to C(light blue) and 11 to A, G, T(black). The original read was masked by a bitwise AND operation with the positional specific mask.

  • The reference sequence and the masked read were compared with a bitwise XOR operation. Non-zero XOR results were counted as mismatches (red). Bisulfite alignment is marked in green.

Methylated DNA immunoprecipitation (MeDIP)

  • 甲基化 DNA 免疫共沉淀测序
  • use 5-methylcytosine antibody
  • 富集甲基化的DNA序列
  • 有甲基化的地方会有peak
  • 不能精确到单碱基分辨率,这能知道一段区域的甲基化程度

Methyl-sensitive restriction enzymes (MREs)

  • Unmethylated CpGs are identified by sequencing size selected fragments from parallel DNA digestions with the methyl-sensitive restriction enzymes (MREs)

  • 没有甲基化的地方会有peak

  • 甲基化敏感性限制酶测序(MRE-seq) 技术主要利用甲基化敏感性限制性酶(methylation-sensitive restriction enzyme, MRE) 对基因组 DNA 进行切割,未甲基化位点可以被酶切而产生不同长度的片段,然后,结合高通量测序技术获得序列的甲基化信息

  • 和MeDIP结合使用

    img
  • plant DOI:10.1146/annurev-arplant-043014-114633

  • human DOI:10.1038/nature08514

组蛋白修饰

H3K4me3含义

img

组蛋白修饰功能

img
  • Promoters, gene bodies, an enhancer and a boundary element are indicated on a schematic genomic region.
  • Active promoters are commonly marked by H3K4me2, H3K4me3, acetylation (ac), and H2A.Z.
  • Transcribed regions are enriched for H3K36me3 and H3K79me2.
  • Repressed genes may be located in large domains of H3K9me2 and/or H3K9me3 or H3K27me3.
  • Enhancers are relatively enriched for H3K4me1, H3K4me2, H3K27ac and the histone acetyltransferase p300.
  • CTCF binds many sites that may function as boundary elements, insulators or structural scaffolds

如何整合众多的ChIP-seq数据

半自动基因注释

img

两种常用算法

ChromHMM

img

输入参数

img
  • 算法是HMM,具体隐马尔可夫模型参考另一篇笔记

  • 属于HMM中的learning类问题

  • 使用迭代方法是EM

  • 200 bp resolution

  • Bernoulli to model emission

  • Posterior decoding

  • Single model for all cell types

Segway

img
  • 基于贝叶斯动态网络
  • 1 bp resolution
  • Gaussian to model emission
  • Viterbi decoding
  • One model per cell type

基因浏览器展示

img

具体学习地址

human表观数据库

encode

roadmap


加入靠谱熊基地,和大家一起交流

相关文章

网友评论

    本文标题:表观基因组

    本文链接:https://www.haomeiwen.com/subject/qjkltftx.html