前言
Google Inception Net在2014年的 ImageNet Large Scale Visual Recognition Competition (ILSVRC)中取得第一名,该网络以结构上的创新取胜,通过采用全局平均池化层取代全连接层,极大的降低了参数量,是非常实用的模型,一般称该网络模型为Inception V1。随后的Inception V2中,引入了Batch Normalization方法,加快了训练的收敛速度。在Inception V3模型中,通过将二维卷积层拆分成两个一维卷积层,不仅降低了参数数量,同时减轻了过拟合现象。
一、多少层?
Inception V3究竟有多少层呢?某书籍上说42层,某书籍上说46层。参考实现的源代码,仔细数一数,应该是47层。
data:image/s3,"s3://crabby-images/19760/1976001b1215c9f73011f4a247542e3633c75542" alt=""
5(前面)+
3(block1_module1)+3(block1_module2)+3(block1_module3)+
3(block2_module1)+5(block2_module2)+5(block2_module3)+5(block2_module4)+5(block2_module5)+
4(block3_module1)+3(block3_module2)+3(block3_module3)
= 47层
Tips:上面的这张层次结构图出现在某些帖子和书籍中,根据实现的源码,标注的红色方框处应该是5个卷积层,而不是4个。
二、详细网络结构
详细的网络结构及其子网络结构如下。
data:image/s3,"s3://crabby-images/1730c/1730c6ce4b830003227e08287c8b7ad3b334131f" alt=""
data:image/s3,"s3://crabby-images/e258f/e258f5f10ce05a5319caf8bdf1b4c5edc03366b7" alt=""
data:image/s3,"s3://crabby-images/2f1d1/2f1d1df78888987921336f26c0cd923e3a48fcd4" alt=""
data:image/s3,"s3://crabby-images/a41b0/a41b03483f1d54a5e604d5bd9d2edcdefafacbf3" alt=""
data:image/s3,"s3://crabby-images/c4958/c49582a8f39833268ebcd1260aaa32f29fa86b50" alt=""
data:image/s3,"s3://crabby-images/b95f9/b95f9985abbb2e0ec4d23c4da186a330d97ead6c" alt=""
data:image/s3,"s3://crabby-images/88424/884243f525e71cfd3f0e9f430183dc33de881d0b" alt=""
data:image/s3,"s3://crabby-images/8f6a0/8f6a026be67c385ad928c74e06b4a570cb996bce" alt=""
data:image/s3,"s3://crabby-images/3e87d/3e87deff9b48d886d0e16dabb822bab2c622dade" alt=""
data:image/s3,"s3://crabby-images/692c9/692c95ace95bee0cb3039a072d77f3108979036c" alt=""
data:image/s3,"s3://crabby-images/8ce38/8ce38fc68b0c976fc8f189bbd465d867ab473b39" alt=""
这么复杂精巧的网络结构是怎么设计出来的呢?是不断的进行数值实验吗?
还是靠爱... ...
data:image/s3,"s3://crabby-images/68daa/68daad85eb4944fa7c375180aa23abe1a83bb3d2" alt=""
网友评论