写在前面

目前针对二代测序数据开发的CNV检测方法，根据数据类型分为针对WES和WGS的方法，其中还有很多方法是针对大量gene panel的方法，类似于WES。

cnvCapSeq是目前找到的唯一一个针对长片段捕获测序检测CNV的方法，虽然发表在NAR上，但是根据网上查到的信息，软件更新到0.1.2版本（2014）后就没有更新了，而且文献的引用率较低（Google scholar显示引用率不到10）。即便如此，考虑到没有其他更合适的方法，还是先试试看。

实战-失败告终

不需要安装，下载压缩包后解压就能用。README文档里有详细的使用步骤，强烈建议看看每一步所用的sh脚本里的内容，可以更好地理解具体整个过程。

###1. pre-processing pipeline
  #extract a region of interest from a large BAM file 
  sh extract_region_from_BAM.sh "sample.bam" "chr1:10000000-20000000" "bam"

  #默认输入文件(提取的bam在文件夹BAM/中)
  #The read pair pipeline requires the FASTA sequence file against which the the multi-mapped reads will be re-aligned.
  #mv chr1.fa aux/
  #bowtie2-build chr1.fa chr1.fa
  nohup sh RD_pipeline_capture.sh 1 10000000 20000000 >RD.log & >RD.log 
  nohup sh RP_pipeline_capture.sh 1 10000000 20000000 >RP.log & >RP.log 
  #结果在./CNV_detection_capture/data中

###2. CNV detection
 #这里使用默认参数文件，仅修改了--mid 参数（指定区域），如果需要更改，可以修改相应文件里的参数（param.txt, data.csv)
 #./CNV_detection_capture/analysis/run_cnvCapSeq.sh 
 java -Xmx3800m -cp ./cnvCapSeq.jar lc1.dp.appl.CNVHap --paramFile param.txt --mid 1:10.00mb:20.00mb --baseDir ./data/ --index 1 --column 1 --numIt 5 --include RD:RP --plot 0 --r_panel true --b_panel true --showScatter false --showHMM false  --weightedInsert true --expModelIntHotSpot 1:1:1e3 --collapseDupStates true --nbdMeanPseudo 700 --nbdDispersionPseudo 8 > output_msg.txt 2>&1

唉，报错了，不懂java，没找到解决方法，所以放弃了，略忧伤。