手写神经网络

作者: azorazz | 来源:发表于2019-09-29 15:25 被阅读0次

神经网络实现手写数字识别（吴恩达课程Octave代码用pytho
Pytorch基础篇--3
2018-06-26
TensorFlow学习01-SoftmaxRegression
MNIST手写字体识别（机器学习）
[神经网络这次真的搞懂了!] 系列
神经网络算法
Pytorch学习之LSTM识别MNIST数据集
《自己动手写神经网络》PDF+源代码+葛一鸣
《神经网络与深度学习》笔记7-编码实现神经网络

假设：
输入层 L0 (输入X矩阵，4*3 有3个特征)
隐藏层 L1 （有4个特征）
输出层 L2 （有1个特征）

image.png

初始化

输入数据：

X = np.array([[0, 0, 1],
              [0, 1, 1],
              [1, 0, 1],
              [1, 1, 1]])

标签值：

y = np.array([[0],
              [1],
              [1],
              [0]])

初始W（权重）

w0 = 2 * np.random.random((3, 4)) - 1   # w0:  shape(3,4)
w1 = 2 * np.random.random((4, 1)) - 1    # w1:  shape(4,1)

流程

用激活函数sigmoid非线性化

sigmod函数：
$f(x) = \frac{1}{1+exp(-x)}$
sigmod求导：
$f'(x) = f(x)\cdot (1-f(x))$

sigmod函数及求导：

def nonlin(x, deriv=False):
    if (deriv == True):
        return x * (1 - x)  # sigmoid函数求导

    return 1 / (1 + np.exp(-x))   # sigmoid函数

1. 正向传播

随机初始化权重W，求出预测值
L1 层数据：
$\frac{1}{1+exp(-w_{0}x)}$
L2 层数据：
$L2 = \frac{1}{1+exp(-w_{1}L_{1})} = \frac{1}{1+exp(-w_{1}\frac{1}{1+exp(-w_{0}x)})}$

L1，L2表示：

l1 = nonlin(np.dot(l0, w0))  # l0:  shape(4,3)    l1: shape(4,4)
l2 = nonlin(np.dot(l1, w1))   # l2:  shape(4,1)

2. 计算误差

$loss = (y-L_{2})^{2}$

3. 反向传播

通过误差值，不断更新权重W

计算w1的梯度：
$\triangledown w_{1}=\frac{\partial (loss)}{\partial (w_{1})} = \frac{\partial (loss)}{\partial (L_{2})}\cdot \frac{\partial (L_{2})}{\partial (w_{1})} = 2(y-L_{2})\cdot L_{2}(1-L_{2})\cdot L_{1}$
计算w0的梯度：
$\triangledown w_{0}=\frac{\partial (loss)}{\partial (w_{0})} = 2(y-L_{2})\cdot L_{2}(1-L_{2})\cdot w_{1}\cdot L_{1}(1-L_{1})\cdot L_{0}$
梯度更新：
$w_{1} = w_{1} - \triangledown w_{1}$
$w_{0} = w_{0} - \triangledown w_{0}$

l2_error = 2*(y - l2)      # l2_error:  shape(4,1)

l2_delta = l2_error * nonlin(l2, deriv=True)   # l2_delta:  shape(4,1)

l1_error = l2_delta.dot(w1.T)   # l1_error:  shape(4,4)

l1_delta = l1_error * nonlin(l1, deriv=True)

w1 += l1.T.dot(l2_delta)
w0 += l0.T.dot(l1_delta)

代码实现

import numpy as np


def nonlin(x, deriv=False):
    if (deriv == True):
        return x * (1 - x)  # sigmoid函数求导

    return 1 / (1 + np.exp(-x))   # sigmoid函数


X = np.array([[0, 0, 1],
              [0, 1, 1],
              [1, 0, 1],
              [1, 1, 1]])


y = np.array([[0],
              [1],
              [1],
              [0]])

np.random.seed(1)

# randomly initialize our weights with mean 0
w0 = 2 * np.random.random((3, 4)) - 1
w1 = 2 * np.random.random((4, 1)) - 1

for j in range(60000):
    # 输入层
    l0 = X

    # 隐藏层1
    l1 = nonlin(np.dot(l0, w0))

    # 输出层
    l2 = nonlin(np.dot(l1, w1))

    l2_error = 2*(y - l2)

    if (j % 10000) == 0:
        print("Error:" + str(np.mean((y - l2)*(y - l2))))

    l2_delta = l2_error * nonlin(l2, deriv=True)

    l1_error = l2_delta.dot(w1.T)

    l1_delta = l1_error * nonlin(l1, deriv=True)

    w1 += l1.T.dot(l2_delta)
    w0 += l0.T.dot(l1_delta)

神经网络实现手写数字识别（吴恩达课程Octave代码用pytho
详细代码参考github 神经网络实现手写数字识别实例：利用神经网络实现手写数字的识别，网络已经训练好，权重参数...
Pytorch基础篇--3
源码：github codepytorch简单神经网络手写数字识别
2018-06-26
手写数字识别卷积神经网络版参考代码：
TensorFlow学习01-SoftmaxRegression
SoftmaxRegression识别手写数字整个神经网络的流程：定义算法公式，也就是神经网络的forward...
MNIST手写字体识别（机器学习）
练习：使用 CNN（卷积神经网络）识别 MNIST手写字体— Tensorflow 本文利用卷积神经网络将 MNI...
[神经网络这次真的搞懂了!] 系列
[神经网络这次真的搞懂了!] (1) 使用神经网络识别手写数字 - 感知器[https://www.jianshu...
神经网络算法
神经网络 Neural Networks 本文介绍使用前向反馈传播神经网络，并使用该算法来预测手写数字。代价函数...
Pytorch学习之LSTM识别MNIST数据集
实验RNN循环神经网络识别MNIST手写数字集本文主要是讲述pytorch实现的RNN神经网络去识别MNIST手...
《自己动手写神经网络》PDF+源代码+葛一鸣
神经网络是一种模拟人脑的神经网络，以期能够实现类人工智能的机器学习技术。学习神经网络知识，推荐学习《自己动手写神经...
《神经网络与深度学习》笔记7-编码实现神经网络
用python 动手实现一个可以识别手写数字的简单神经网络。