Written on 2016/08/31
Application:
This simple approach builds on the recent success of **residual networks (ResNets)** to reduce training time and improve test error.
Challenge:
- Very deep models become worse at function approximation (the **degradation** problem); this is caused not by overfitting but by vanishing training signals (e.g., vanishing gradients).
- Effective and efficient training methods for very deep models need to be found.
Problem:
Motivated by **ResNets**, which simplify **Highway Networks**, the authors propose a new method called Stochastic Depth that goes a step further to reduce ResNets' test error and training time.
Solution:
- Shrink the depth of the network during training, while keeping it unchanged at test time.
- During training, randomly drop entire ResBlocks according to a per-block survival probability, bypassing their transformations through the identity skip connections.
- Survival probabilities can be uniform across all blocks or follow a linear decay from the first to the last block (linear decay works better); see the sketch after this list.
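The mechanism is simple enough to sketch. Below is a minimal PyTorch-style illustration (the names `StochasticDepthBlock`, `body`, and `linear_decay_survival` are mine, not from the paper's code): during training, block l is kept with survival probability p_l, and at test time its residual branch is scaled by p_l; the paper's linear decay rule is p_l = 1 - (l/L) * (1 - p_L) with p_L = 0.5.

```python
import torch
import torch.nn as nn

class StochasticDepthBlock(nn.Module):
    """Residual block that is skipped entirely with probability 1 - p_l
    during training; at test time its residual branch is scaled by p_l."""

    def __init__(self, body: nn.Module, survival_prob: float):
        super().__init__()
        self.body = body                    # the block's transformation f_l
        self.survival_prob = survival_prob  # p_l

    def forward(self, x):
        if self.training:
            # Bernoulli gate: with probability p_l run the block,
            # otherwise fall back to the identity skip connection alone.
            if torch.rand(1).item() < self.survival_prob:
                return x + self.body(x)
            return x
        # Test time: the full network is used, with each residual
        # branch weighted by its expected participation p_l.
        return x + self.survival_prob * self.body(x)

def linear_decay_survival(l: int, L: int, p_L: float = 0.5) -> float:
    """Linear decay rule from the paper: p_l = 1 - (l / L) * (1 - p_L)."""
    return 1.0 - (l / L) * (1.0 - p_L)

# Example: the middle block of a 54-block ResNet survives with p = 0.75.
p_l = linear_decay_survival(l=27, L=54)
```

For brevity the sketch omits the ReLU that the paper applies after the addition.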
Insights:
- This method (stochastic depth) is designed for ResNets; networks without ResBlocks are therefore not compatible with it.
- This method can be regarded as an implicit model ensemble.
- A newer, more competitive method has since been proposed (http://arxiv.org/pdf/1603.05027.pdf), which can be applied to even deeper models and achieves lower test error.
One sentence to summarize:
This paper proposes a deep network with stochastic depth, a procedure to train very deep neural networks effectively and efficiently.