BED格式文件

作者: JeremyL | 来源:发表于2018-06-04 15:13 被阅读1194次

BED (Browser Extensible Data) format provides a flexible way to define the data lines that are displayed in an annotation track.

BED (Browser Extensible Data)格式文件就是通过规定行的内容来展示注释信息

BED文件每行至少包括chromchromStartchromEnd三列;另外还可以添加额外的9列,这些列的顺序是固定的。

BED文件必须的3列:

  1. chrom - 染色体号.
  2. chromStart - feature在染色体上起始位置. The first base in a chromosome is numbered 0.
  3. chromEnd - feature在染色体上末尾位置。染色体上前100个碱基位置标记为chromStart=0, chromEnd=100.The chromEnd base is not included in the display of the feature. For example, the first 100 bases of a chromosome are defined as chromStart=0, chromEnd=100, and span the bases numbered 0-99.

BED文件可选的9列:

  1. name - BED行名,在基因组浏览器左边显示;
  2. score - 在基因组浏览器中显示的灰度设定,值介于0-1000;
gray score
  1. strand - 正负链标记. Either "." (=no strand) or "+" or "-".

  2. thickStart - feature起始位置(for example, the start codon in gene displays)。 When there is no thick part, thickStart and thickEnd are usually set to the chromStart position.

  3. thickEnd - feature编码终止位置 (for example the stop codon in gene displays).

  4. itemRgb - R,G,B (e.g. 255,0,0)值,当itemRgb 设置为 "On",BED的行会显示颜色.

  5. blockCount - blocks (exons)数目.

  6. blockSizes - blocks (exons)大小列表,逗号分隔,对应于blockCount.

  7. blockStarts -blocks (exons)起始位置列表,逗号分隔,对应于blockCount.;这个起始位置是与chromStart的一个相对位置。

Example: a complete BED definition

track name=pairedReads description="Clone Paired Reads" useScore=1
chr22 1000 5000 cloneA 960 + 1000 5000 0 2 567,488, 0,3512
chr22 2000 6000 cloneB 900 - 2000 6000 0 2 433,399, 0,3601

The first three required BED fields are

  1. chrom - The name of the chromosome (e.g. chr3, chrY, chr2_random) or scaffold (e.g. scaffold10671).
  2. chromStart - The starting position of the feature in the chromosome or scaffold. The first base in a chromosome is numbered 0.
  3. chromEnd - The ending position of the feature in the chromosome or scaffold. The chromEnd base is not included in the display of the feature. For example, the first 100 bases of a chromosome are defined as chromStart=0, chromEnd=100, and span the bases numbered 0-99.

The 9 additional optional BED fields are:

  1. name - Defines the name of the BED line. This label is displayed to the left of the BED line in the Genome Browser window when the track is open to full display mode or directly to the left of the item in pack mode.

  2. score - A score between 0 and 1000. If the track line useScore attribute is set to 1 for this annotation data set, the score value will determine the level of gray in which this feature is displayed (higher numbers = darker gray). This table shows the Genome Browser's translation of BED score values into shades of gray:

    gray score
  3. strand* - Defines the strand. Either "." (=no strand) or "+" or "-".

  4. thickStart - The starting position at which the feature is drawn thickly (for example, the start codon in gene displays). When there is no thick part, thickStart and thickEnd are usually set to the chromStart position.

  5. thickEnd - The ending position at which the feature is drawn thickly (for example the stop codon in gene displays).

  6. itemRgb - An RGB value of the form R,G,B (e.g. 255,0,0). If the track line itemRgb attribute is set to "On", this RBG value will determine the display color of the data contained in this BED line. NOTE: It is recommended that a simple color scheme (eight colors or less) be used with this attribute to avoid overwhelming the color resources of the Genome Browser and your Internet browser.

  7. blockCount - The number of blocks (exons) in the BED line.

  8. blockSizes - A comma-separated list of the block sizes. The number of items in this list should correspond to blockCount.

  9. blockStarts - A comma-separated list of block starts. All of the blockStart positions should be calculated relative to chromStart. The number of items in this list should correspond to blockCount

参考:

相关文章

  • 如何构建fasta文件及其index索引文件

    需要准备的文件是序列信息bed格式文件或者GRanges格式文件 比如有一个bed文件: 我习惯保存为GR格式: ...

  • bed文件

    BED (Browser Extensible Data)格式文件就是通过规定行的内容来展示注释信息 BED文件每...

  • BED 文件格式

    BED (Browser Extensible Data)格式文件就是通过规定行的内容来展示注释信息 BED文件每...

  • 生物数据格式 - bed

    格式 BED格式文件全称是 Browser Extensible Data,通过规定行的内容来展示注释信息。bed...

  • BED格式文件

    BED (Browser Extensible Data) format provides a flexible ...

  • deeptools intersect -wa 结果竟然有重复

    此记录完全是由一次不仔细阅读软件参数导致的踩坑记录 #需要的两个Bed格式文件 A.bed B.bed #bedt...

  • 2022-05-13

    将vcf文件转换为bed格式文件。这里注意一点!!!!:应该是软件的问题,需要把染色体/contig名称变成连续的...

  • 接着

    stay in bed 待在床上 (in bed 常指生病在床上, on the bed 常指物品在床上) 5. ...

  • 随口摇篮曲一

    it's time for bed time for bed. It's time to sleep ...

  • [试译]Talking In Bed 卧谈

    Talking In Bed 卧谈 Philip Larkin Talking in bed ought to b...

网友评论

    本文标题:BED格式文件

    本文链接:https://www.haomeiwen.com/subject/utcysftx.html