A Neural Algorithm of Artistic Style:https://arxiv.org/pdf/1508.06576v2.pdf
一. 介绍
1. Content reconstructions:
CNN被广泛运用于训练目标识别,随着训练的进行,它们能够有一种将图片抽象的能力。最终,图片被抽象为只含有实际内容的信息,而丢掉许多像素细节。越多层的CNN越能捕捉主要的图片内容。所以我们将深层的CNN抽象出来的内容称为content representation.
2. Style reconstructions:
为了获得图片的风格特征,我们利用feature space去捕捉图片的质地特征。这些feature space建立于每层网络的filter之上。它由特征图(feature map)的空间范围内的不同滤波器响应之间的相关性组成。
data:image/s3,"s3://crabby-images/26655/266555d9769ed3c90700d5b416aadb1304cdab69" alt=""
这篇论文的发现是style reconstructions和content reconstructions是可分的。那就是说,可以分别从不同的图片得到style和不同的图片得到content,然后将它们结合起来,构成新的图片。
二. 方法
1. 结构
(1) style reconstructions
We used the feature space provided by the 16 convolutional and 5 pooling layers of the 19 layer VGG-Network. We do not use any of the fully connected layers.
(2) image synthesis
we found that replacing the max-pooling operation by average pooling improves the gradient flow and one obtains slightly more appealing results, which is why the images shown were generated with average pooling.
2. 误差函数
data:image/s3,"s3://crabby-images/c293f/c293fe61d453831953fdc3b9b6229eb81a3d46bb" alt=""
(1) content reconstructions loss function:
每层有N_l个不同的滤波器,有N_l个M_l大小的feature map, M_l指的是feature map的高度乘以宽度。 所以feature在l层的表示被存在一个F_L的矩阵里,F_l(i,j)指的是在l层,位置j的第i个filter的激活值。P_I(i,j)就是在上面位置的原图的值。
data:image/s3,"s3://crabby-images/2b861/2b861d0a880a7d3b3ae60540aad72f67a343931d" alt=""
(2) style reconstructions loss function:
(3) image synthesis loss function:
data:image/s3,"s3://crabby-images/fcf79/fcf7946de7aa5134ee2c1f4e5e09bd60faac8a2d" alt=""
网友评论