tensorflow入门 -CNN

作者: leo567 | 来源:发表于2018-11-10 15:55 被阅读73次

tensorflow入门 -CNN
SkySeraph AI 实践到理论系列
Faster-RCNN-TensorFlow-Python3 训
CNN Network quantization 阶段总结
十二.报错：logits and labels must be
2018-12-27制作自己的数据集
tensorflow cnn
CNN on TensorFlow
TensorFlow Faster R-CNN 实现详解（未完）
TensorFlow 卷积层

传统的神经网络，权值太多，计算量太大，需要大量样本进行训练。（建议: 网络中需要确定的权值数为 x，最好需要5到10倍的数据进行训练）

局部感受野

1962年哈佛医学院神经生理学家Hubel和Wiesel通过对猫视觉皮层细胞的研究，提出了感受野(receptive field)的概念，1984年日本学者Fukushima基于感受野概念提出的神经认知机(neocognitron)可以看作是卷积神经网络的第一个实现网络，也是感受野概念在人工神经网络领域的首次应用。

卷积神经网络CNN

CNN通过感受野和权值共享减少了神经网络需要训练的参数个数。

卷积

黄色方块为卷积核的大小（这里横向和纵向的步长都是 1），每次都原数据进行采样，生成特征图

卷积.gif

多个卷积核

（滤波器）

不同的卷积核生成不同的特征图，对不同的特征进行采样。

池化

还有一个随机的池化

对于卷积操作

SAME PADDING：
给平面外部补0
卷积窗口采样后得到一个跟原来平面大小相同的平面

SAME PADDING

VALID PADDING：
不会超出平面外部
卷积窗口采样后得到一个比原来平面小的平面

VALID PADDING

对于池化操作

SAME PADDING：可能会给平面外部补0
VALID PADDING：不会超出平面外部

假如有一个28x28的平面，用2x2并且步长为2的窗口对其进行pooling操作
使用SAME PADDING的方式，得到14x14的平面
使用VALID PADDING的方式，得到14x14的平面
假如有一个2x3的平面，用2x2并且步长为2的窗口对其进行pooling操作

使用SAME PADDING的方式，得到1x2的平面
使用VALID PADDING的方式，得到1x1的平面

CNN的基本结构

CNN运用于MNIST数据集分类

import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data
import os

os.environ['TF_CPP_MIN_LOG_LEVEL'] = '2'

mnist = input_data.read_data_sets('MNIST_data', one_hot=True)

# 每个批次的大小
batch_size = 100
# 计算一共有多少个批次
n_batch = mnist.train.num_examples // batch_size


# 初始化权值
def weight_variable(shape, name):
    initial = tf.truncated_normal(shape, stddev=0.1)  # 生成一个截断的正态分布
    return tf.Variable(initial, name=name)


# 初始化偏置
def bias_variable(shape, name):
    initial = tf.constant(0.1, shape=shape)
    return tf.Variable(initial, name=name)


# 卷积层
def conv2d(x, W):
    # x input tensor of shape `[batch, in_height, in_width, in_channels]`
    # W filter / kernel tensor of shape [filter_height, filter_width, in_channels, out_channels]
    # `strides[0] = strides[3] = 1`. strides[1]代表x方向的步长，strides[2]代表y方向的步长
    # padding: A `string` from: `"SAME", "VALID"`
    return tf.nn.conv2d(x, W, strides=[1, 1, 1, 1], padding='SAME')


# 池化层
def max_pool_2x2(x):
    # ksize [1,x,y,1]
    return tf.nn.max_pool(x, ksize=[1, 2, 2, 1], strides=[1, 2, 2, 1], padding='SAME')


# 定义两个placeholder
x = tf.placeholder(tf.float32, [None, 784], name='x-input')
y = tf.placeholder(tf.float32, [None, 10], name='y-input')
# 改变x的格式转为4D的向量[batch, in_height, in_width, in_channels]`
x_image = tf.reshape(x, [-1, 28, 28, 1], name='x_image')

# 初始化第一个卷积层的权值和偏置
W_conv1 = weight_variable([5, 5, 1, 32], name='W_conv1')  # 5*5的采样窗口，32个卷积核从1个平面抽取特征
b_conv1 = bias_variable([32], name='b_conv1')  # 每一个卷积核一个偏置值

# 把x_image和权值向量进行卷积，再加上偏置值，然后应用于relu激活函数
conv2d_1 = conv2d(x_image, W_conv1) + b_conv1
h_conv1 = tf.nn.relu(conv2d_1)
h_pool1 = max_pool_2x2(h_conv1)  # 进行max-pooling

# 初始化第二个卷积层的权值和偏置
W_conv2 = weight_variable([5, 5, 32, 64], name='W_conv2')  # 5*5的采样窗口，64个卷积核从32个平面抽取特征
b_conv2 = bias_variable([64], name='b_conv2')  # 每一个卷积核一个偏置值

# 把h_pool1和权值向量进行卷积，再加上偏置值，然后应用于relu激活函数
conv2d_2 = conv2d(h_pool1, W_conv2) + b_conv2
h_conv2 = tf.nn.relu(conv2d_2)
h_pool2 = max_pool_2x2(h_conv2)  # 进行max-pooling

# 28*28的图片第一次卷积后还是28*28，第一次池化后变为14*14
# 第二次卷积后为14*14，第二次池化后变为了7*7
# 进过上面操作后得到64张7*7的平面

# 初始化第一个全连接层的权值
W_fc1 = weight_variable([7 * 7 * 64, 1024], name='W_fc1')  # 上一场有7*7*64个神经元，全连接层有1024个神经元
b_fc1 = bias_variable([1024], name='b_fc1')  # 1024个节点

# 把池化层2的输出扁平化为1维。 -1代表占位值
h_pool2_flat = tf.reshape(h_pool2, [-1, 7 * 7 * 64], name='h_pool2_flat')
# 求第一个全连接层的输出
wx_plus_b1 = tf.matmul(h_pool2_flat, W_fc1) + b_fc1
h_fc1 = tf.nn.relu(wx_plus_b1)

# keep_prob用来表示神经元的输出概率
keep_prob = tf.placeholder(tf.float32, name='keep_prob')
h_fc1_drop = tf.nn.dropout(h_fc1, keep_prob, name='h_fc1_drop')

# 初始化第二个全连接层
W_fc2 = weight_variable([1024, 10], name='W_fc2')
b_fc2 = bias_variable([10], name='b_fc2')
wx_plus_b2 = tf.matmul(h_fc1_drop, W_fc2) + b_fc2
# 计算输出
prediction = tf.nn.softmax(wx_plus_b2)

# 交叉熵代价函数
cross_entropy = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(labels=y, logits=prediction),
                               name='cross_entropy')

# 使用AdamOptimizer进行优化
train_step = tf.train.AdamOptimizer(1e-4).minimize(cross_entropy)

# 求准确率
# 结果存放在一个布尔列表中
correct_prediction = tf.equal(tf.argmax(prediction, 1), tf.argmax(y, 1))  # argmax返回一维张量中最大的值所在的位置
# 求准确率
accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for i in range(10001):
        # 训练模型
        batch_xs, batch_ys = mnist.train.next_batch(batch_size)
        sess.run(train_step, feed_dict={x: batch_xs, y: batch_ys, keep_prob: 0.5})

        if i % 100 == 0:
            test_acc = sess.run(accuracy, feed_dict={x: mnist.test.images, y: mnist.test.labels, keep_prob: 1.0})

            print("Iter " + str(i) + ", Testing Accuracy= " + str(test_acc))

7500次达到99%

网友评论

自己的世界自己码

本文标题：tensorflow入门 -CNN

本文链接：https://www.haomeiwen.com/subject/jgswxqtx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

tensorflow入门 -CNN

相关文章