日更80

作者: 深度学习模型优化 | 来源:发表于2019-07-04 06:00 被阅读0次

今天偷个懒,看下TGS找盐比赛中的一些好的idea。

代码看这里: https://github.com/SeuTao/Kaggle_TGS2018_4th_solution

Solution development:

1.单模型设计:

  1. input: 101 random pad to 128*128, random LRflip;
  2. encoder: resnet34, se-resnext50, resnext101_ibna, se-resnet101, se-resnet152, se resnet154;
  3. decoder: scse, hypercolumn (not used in network with resnext101ibna, seresnext101 backbone), ibn block, dropout;
  4. Deep supervision structure with Lovasz softmax (a great idea from Heng);
  5. We designed 6 single models for the final submission;

2. 模型训练:

  • SGD: momentum -- 0.9, weight decay -- 0.0002, lr -- from 0.01 to
    0.001 (changed in each epoch);
  • LR schedule: cosine annealing with snapshot ensemble (shared by
    Peter), 50 epochs/cycle, 7cycles/fold ,10fold;

3.模型集成: +0.001 in public LB/+0.001 in private LB

  • voting across all cycles

4. Post processing: +0.010 in public LB/+0.001 in private LB

According to the 2D and 3D jigsaw results (amazing ideas and great job from @CHAN), we applied around 10 handcraft rules that gave a 0.010~0.011 public LB boost and 0.001 private LB boost.

5.Data distill (Pseudo Labeling): +0.002 in public LB/+0.002 in private LB

We started to do this part since the middle of the competetion. As Heng posts, pseudo labeling is pretty tricky and has the risk of overfitting. I am not sure whether it would boost the private LB untill the result is published. I just post our results here https://github.com/SeuTao/Kaggle_TGS2018_4th_solution
, the implementation details will be updated.

6.Ideas that hadn't tried:

  • mean teacher: We have no time to do this experiment. I think mean
    teacher + jigsaw + pseudo labeling is promising.

7. Ideas that didn't work:

  • oc module: The secret weapon of @alex's team. Can't get it work.

Related papers:

相关文章

  • 日更80

    看到小宝对每一件事情都充满了好奇,而且在做的过程中都带着一种充满着快乐无比的灿烂笑容,这个时候我时常都会问他,你为...

  • 日更.80

    工作上犯了错,小问题,是被别人发现后补救的,心理有点难受。 每个人犯的那些小错,其实都是一个人做事不认真不仔细不负...

  • 日更80

    今天偷个懒,看下TGS找盐比赛中的一些好的idea。 代码看这里: https://github.com/SeuT...

  • 日更80

    1参加早会 早上起床后,就犹豫是否去公司参加早会,问下丫头的意见,她建议我去。那我就去,顺便把通关搞定。 今天又做...

  • 原理 80/日更

    书中提到母性原理和父性原理的教育,怎样将这样的教育实施。在我看来没有绝对的母性原理教育和绝对的父性原理教育。...

  • 日更80天

    今晚下班和朋友一起去吃了个大餐,好久都没有这样几个朋友一起出来吃顿饭,喝喝啤酒了。算是在工作之余犒劳犒劳自己,放松...

  • 日更80天

    日更刚开始,从写相声段子、写歌词、心灵感悟、到现在得每日计划,说实话过程很难熬,谁坚持谁知道,我想说的是,无论写什...

  • 建德——日更80

    建德是个好名字,可惜姓卜。 卜建德在都城出生,在都城长大。 他爹是个读书人,希望于儿子能建功立业,积德向善,于是给...

  • 日更80无题

    如果梦想是一串项链,那么我们把每天做的小事情都当做一颗珍珠。随着时光,你会积累好多散落的珍珠。当你用心思考后,终有...

  • 日更80天

    日更80天,小小声地给自己鼓个掌。 春节期间,是忙还是懒散,把上一次已经坚持日更七十多天的行动中断了。可惜遗憾之后...

网友评论

      本文标题:日更80

      本文链接:https://www.haomeiwen.com/subject/lgartctx.html