循环神经网络pytorch实现

作者: 直接往二 | 来源:发表于2019-09-29 21:03 被阅读0次

Pytorch学习之LSTM识别MNIST数据集
循环神经网络pytorch实现
Pytorch Geometric中的图神经网络GAT是如何实现
动手学深度学习(十一) NLP循环神经网络
2020-02-25
动手学深度学习Task2笔记
纯用NumPy实现神经网络
Pytorch教程
PyTorch RNN Regression
【Note】MV-机器学习系列之神经网络 PyTorch

RNN

RNN
前向过程：

$h_t = g(Uh_{t-1} + Wx_t +b_h)$
$y_t = g(W_yh_t + b_y)$

pytorch 实现

import torch
import torch.nn as nn
import torch.nn.functional as F


class RNNCell(nn.Module):

    def __init__(self, input_size, hidden_dim):
        super(RNNCell, self).__init__()
        self.input_size = input_size
        self.hidden_dim = hidden_dim
        self.linear1 = nn.Linear(hidden_dim, hidden_dim)
        self.linear2 = nn.Linear(input_size, hidden_dim)

    def forward(self, x, h_pre):
        """
        :param x:       (batch, input_size)
        :param h_pre:   (batch, hidden_dim)
        :return: h_next (batch, hidden_dim)
        """
        h_next = torch.tanh(self.linear1(h_pre) + self.linear2(x))
        return h_next


class RNN(nn.Module):

    def __init__(self, input_size, hidden_dim):
        super(RNN, self).__init__()
        self.input_size = input_size
        self.hidden_dim = hidden_dim
        self.rnn_cell = RNNCell(input_size, hidden_dim)

    def forward(self, x):
        """
        :param x: (seq_len, batch,input_size)
        :return:
           output (seq_len, batch, hidden_dim)
           h_n    (1, batch, hidden_dim)
        """
        seq_len, batch, _ = x.shape
        h = torch.zeros(batch, self.hidden_dim)
        output = torch.zeros(seq_len, batch, self.hidden_dim)
        for i in range(seq_len):
            inp = x[i, :, :]
            h = self.rnn_cell(inp, h)
            output[i, :, :] = h

        h_n = output[-1:, :, :]
        return output, h_n

LSTM

LSTM
前向过程：

输入门: $i_t = \sigma (W_ix_t + U_ih_{t-1} + b_i)$
遗忘门: $f_t = \sigma (W_fx_t + U_fh_{t-1} + b_f)$
输出门: $o_t = \sigma (W_ox_t + U_oh_{t-1} + b_o)$
$\hat{c}_t = tanh(W_cx_t + U_ch_{t-1} + b_c)$
$c_t = f_t \odot c_{t-1} + i_t \odot \hat{c} _t$
$h_t = o_t \odot tanh(c_t)$

pytorch 实现

import torch
import torch.nn as nn
import torch.nn.functional as F
import copy


class Gate(nn.Module):
    def __init__(self, input_size, hidden_dim):
        super(Gate, self).__init__()
        self.linear1 = nn.Linear(hidden_dim, hidden_dim)
        self.linear2 = nn.Linear(input_size, hidden_dim)

    def forward(self, x, h_pre, active_func):
        h_next = active_func(self.linear1(h_pre) + self.linear2(x))
        return h_next


def clones(module, N):
    "Produce N identical layers."
    return nn.ModuleList([copy.deepcopy(module) for _ in range(N)])


class LSTMCell(nn.Module):

    def __init__(self, input_size, hidden_dim):
        super(LSTMCell, self).__init__()
        self.input_size = input_size
        self.hidden_dim = hidden_dim
        self.gate = clones(Gate(input_size, hidden_dim), 4)

    def forward(self, x, h_pre, c_pre):
        """
        :param x: (batch, input_size)
        :param h_pre: (batch, hidden_dim)
        :param c_pre: (batch, hidden_dim)
        :return: h_next(batch, hidden_dim), c_next(batch, hidden_dim)
        """
        f_t = self.gate[0](x, h_pre, torch.sigmoid)
        i_t = self.gate[1](x, h_pre, torch.sigmoid)
        g_t = self.gate[2](x, h_pre, torch.tanh)
        o_t = self.gate[3](x, h_pre, torch.sigmoid)
        c_next = f_t * c_pre + i_t * g_t
        h_next = o_t * torch.tanh(c_next)

        return h_next, c_next


class LSTM(nn.Module):

    def __init__(self, input_size, hidden_dim):
        super(LSTM, self).__init__()
        self.input_size = input_size
        self.hidden_dim = hidden_dim
        self.lstm_cell = LSTMCell(input_size, hidden_dim)

    def forward(self, x):
        """
        :param x: (seq_len, batch,input_size)
        :return:
           output (seq_len, batch, hidden_dim)
           h_n    (1, batch, hidden_dim)
           c_n    (1, batch, hidden_dim)
        """
        seq_len, batch, _ = x.shape
        h = torch.zeros(batch, self.hidden_dim)
        c = torch.zeros(batch, self.hidden_dim)
        output = torch.zeros(seq_len, batch, self.hidden_dim)
        for i in range(seq_len):
            inp = x[i, :, :]
            h, c = self.lstm_cell(inp, h, c)
            output[i, :, :] = h

        h_n = output[-1:, :, :]
        return output, (h_n, c.unsqueeze(0))

GRU

GRU
前向过程：

更新门:

$r_t = \sigma (W_{xr}x_t + W_{hr}h_{t-1} + b_r)$
$z_t = \sigma (W_{xz}x_t + W_{hz}h_{t-1} + b_z)$

候选隐含状态：

$\hat{h}_t = tanh(W_{xh}x_t + r_t \odot W_{hh}h_{t-1} + b_h)$

隐含状态：

$h_t = z_t \odot h_{t-1} + (1-z_t) \odot \hat{h}_t$

输出:

$y_t = softmax(W_{hy}h_t + b_y)$

Pytorch学习之LSTM识别MNIST数据集
实验RNN循环神经网络识别MNIST手写数字集本文主要是讲述pytorch实现的RNN神经网络去识别MNIST手...
循环神经网络pytorch实现
RNN pytorch 实现 LSTM 输入门: 遗忘门: 输出门: pytorch 实现 GRU 更新门: 候选...
Pytorch Geometric中的图神经网络GAT是如何实现
最近在使用Pytorch Geometric, 这个包收集了最新的图神经网络的Pytorch实现。这篇文章想研究下...
动手学深度学习(十一) NLP循环神经网络
循环神经网络本节介绍循环神经网络，下图展示了如何基于循环神经网络实现语言模型。我们的目的是基于当前的输入与过去的...
2020-02-25
循环神经网络本节介绍循环神经网络，下图展示了如何基于循环神经网络实现语言模型。我们的目的是基于当前的输入与过去的...
动手学深度学习Task2笔记
循环神经网络什么是循环神经网络下图展示了如何基于循环神经网络实现语言模型。我们的目的是基于当前的输入与过去的输入...
纯用NumPy实现神经网络
摘要：纯NumPy代码从头实现简单的神经网络。 Keras、TensorFlow以及PyTorch都是高级别的深...
Pytorch教程
Pytorch 神经网络基础 1.1 Pytorch & Numpy 1.1.1 用Torch还是Numpy To...
PyTorch RNN Regression
循环神经网络RNN及时预测时间序列. 更多可以查看官网 :* PyTorch 官网载入数据假设想要用 sin...
【Note】MV-机器学习系列之神经网络 PyTorch
一、PyTorch 简介 1、Why PyTorch？ PyTorch 的优势是建立的神经网络是动态的，比如 RN...

循环神经网络pytorch实现

RNN

pytorch 实现

LSTM

pytorch 实现

GRU

相关文章

Pytorch学习之LSTM识别MNIST数据集

循环神经网络pytorch实现

Pytorch Geometric中的图神经网络GAT是如何实现

动手学深度学习(十一) NLP循环神经网络

2020-02-25

动手学深度学习Task2笔记

纯用NumPy实现神经网络

Pytorch教程

PyTorch RNN Regression

【Note】MV-机器学习系列之神经网络 PyTorch

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读