Learning PyTorch Geometric

Author: 魏鹏飞 | Published 2020-03-04 20:50

    1. Installation

    Replace ${CUDA} with cpu, cu92, cu100, or cu101 to match your local setup; e.g., for CUDA 10.1, use torch-scatter==latest+cu101.

    $ pip install torch-scatter==latest+${CUDA} torch-sparse==latest+${CUDA} -f https://s3.eu-central-1.amazonaws.com/pytorch-geometric.com/whl/torch-1.4.0.html
    $ pip install torch-cluster       # optional
    $ pip install torch-spline-conv   # optional
    $ pip install torch-geometric
    
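    After installation, a quick import check confirms that the wheels match the local PyTorch build (a minimal sketch; the printed versions are illustrative):

    import torch
    import torch_geometric

    # The wheels above are built against PyTorch 1.4.0
    print(torch.__version__)            # e.g. 1.4.0
    print(torch_geometric.__version__)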

    2. Theory

    Graph representation:

    (Figure: an example directed graph on nodes 0-3; node 0 has edges to nodes 1 and 3.)
    Each node in the graph carries a feature vector, and the connectivity between nodes is stored as an edge index in COO format rather than as a dense adjacency matrix.

    The edge index has two rows: the first lists source nodes and the second lists target nodes, with entries paired position by position. For example, in the figure above, node 0 points to nodes 1 and 3, which is written as [[0, 0], [1, 3]].

    The key point is this positional correspondence: the i-th source entry and the i-th target entry together define the i-th edge.

    import torch

    # Node feature matrix: 4 nodes, 2 features each
    x = torch.tensor([[2, 1], [5, 6], [3, 7], [12, 0]], dtype=torch.float)
    # Per-node labels
    y = torch.tensor([0, 1, 0, 1], dtype=torch.float)

    # Edge index in COO format: row 0 = sources, row 1 = targets,
    # i.e. edges 0->1, 1->0, 2->1, 0->3, 3->2
    edge_index = torch.tensor([[0, 1, 2, 0, 3],
                               [1, 0, 1, 3, 2]], dtype=torch.long)
    
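    These tensors are typically bundled into a torch_geometric.data.Data object, the graph container that the rest of the library operates on (a minimal sketch using the tensors above):

    from torch_geometric.data import Data

    # Wrap features, labels, and connectivity into one graph object
    data = Data(x=x, edge_index=edge_index, y=y)
    print(data)            # Data(x=[4, 2], edge_index=[2, 5], y=[4])
    print(data.num_nodes)  # 4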

    Graph neural network definition:

    The new value of node i is a linear transform of its own previous value plus a weighted sum over its neighbors:
    x'_i = \theta_1 x_i + \sum_{j \in N(i)} \theta_2 x_j \tag{1}

    The GraphConv code below implements equation (1): message produces the neighbor term (h_j, optionally scaled by an edge weight), aggr='add' sums those messages, and update adds the transformed self term via self.lin:

    import torch
    from torch.nn import Parameter
    from torch_geometric.nn.conv import MessagePassing
    from torch_geometric.nn.inits import uniform


    class GraphConv(MessagePassing):
        def __init__(self, in_channels, out_channels, aggr='add', bias=True,
                     **kwargs):
            super(GraphConv, self).__init__(aggr=aggr, **kwargs)
    
            self.in_channels = in_channels
            self.out_channels = out_channels
    
            self.weight = Parameter(torch.Tensor(in_channels, out_channels))
            self.lin = torch.nn.Linear(in_channels, out_channels, bias=bias)
    
            self.reset_parameters()
    
        def reset_parameters(self):
            uniform(self.in_channels, self.weight)
            self.lin.reset_parameters()
    
        def forward(self, x, edge_index, edge_weight=None, size=None):
            # Transform node features once, then propagate along the edges
            h = torch.matmul(x, self.weight)
            return self.propagate(edge_index, size=size, x=x, h=h,
                                  edge_weight=edge_weight)
    
        def message(self, h_j, edge_weight):
            # Neighbor term of Eq. (1): h_j is theta_2 * x_j, optionally
            # scaled by a per-edge weight
            return h_j if edge_weight is None else edge_weight.view(-1, 1) * h_j

        def update(self, aggr_out, x):
            # Add the transformed self term theta_1 * x_i (plus bias)
            return aggr_out + self.lin(x)
    
        def __repr__(self):
            return '{}({}, {})'.format(self.__class__.__name__, self.in_channels,
                                       self.out_channels)
    
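    As a quick sanity check, the layer can be applied directly to the toy tensors from Section 2 (a minimal sketch; in_channels must match the node feature dimension):

    # 4 nodes with 2 features in, 8-dimensional embeddings out
    conv = GraphConv(in_channels=2, out_channels=8)
    out = conv(x, edge_index)
    print(out.shape)  # torch.Size([4, 8])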

    3. Practice

    # Imports
    import torch
    import torch.nn.functional as F
    from torch_geometric.datasets import TUDataset
    from torch_geometric.data import DataLoader
    from torch_geometric.nn import GraphConv, TopKPooling
    from torch_geometric.nn import global_mean_pool as gap, global_max_pool as gmp
    
    # Load the ENZYMES dataset and hold out 10% for testing
    dataset = TUDataset(root='/tmp/ENZYMES', name='ENZYMES')
    dataset = dataset.shuffle()
    n = len(dataset) // 10
    test_dataset = dataset[:n]
    train_dataset = dataset[n:]
    test_loader = DataLoader(test_dataset, batch_size=60)
    train_loader = DataLoader(train_dataset, batch_size=60)
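
    # Optional sanity check (the printed values are illustrative):
    # ENZYMES is a graph-classification benchmark with 600 graphs,
    # 6 classes, and 3 node features
    print(dataset)               # ENZYMES(600)
    print(dataset.num_classes)   # 6
    print(dataset.num_features)  # 3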
    
    # Build the model: three GraphConv + TopKPooling stages with global readouts
    class Net(torch.nn.Module):
        def __init__(self):
            super(Net, self).__init__()
    
            self.conv1 = GraphConv(dataset.num_features, 128)
            self.pool1 = TopKPooling(128, ratio=0.8)
            self.conv2 = GraphConv(128, 128)
            self.pool2 = TopKPooling(128, ratio=0.8)
            self.conv3 = GraphConv(128, 128)
            self.pool3 = TopKPooling(128, ratio=0.8)
    
            self.lin1 = torch.nn.Linear(256, 128)
            self.lin2 = torch.nn.Linear(128, 64)
            self.lin3 = torch.nn.Linear(64, dataset.num_classes)
    
        def forward(self, data):
            x, edge_index, batch = data.x, data.edge_index, data.batch
    
            # Stage 1: convolve, keep the top 80% of nodes, then read out the
            # graph state as the concat of global max- and mean-pooling
            x = F.relu(self.conv1(x, edge_index))
            x, edge_index, _, batch, _, _ = self.pool1(x, edge_index, None, batch)
            x1 = torch.cat([gmp(x, batch), gap(x, batch)], dim=1)
    
            x = F.relu(self.conv2(x, edge_index))
            x, edge_index, _, batch, _, _ = self.pool2(x, edge_index, None, batch)
            x2 = torch.cat([gmp(x, batch), gap(x, batch)], dim=1)
    
            x = F.relu(self.conv3(x, edge_index))
            x, edge_index, _, batch, _, _ = self.pool3(x, edge_index, None, batch)
            x3 = torch.cat([gmp(x, batch), gap(x, batch)], dim=1)
    
            # Sum the three stage readouts into a single graph embedding
            x = x1 + x2 + x3
    
            x = F.relu(self.lin1(x))
            x = F.dropout(x, p=0.5, training=self.training)
            x = F.relu(self.lin2(x))
            x = F.log_softmax(self.lin3(x), dim=-1)
    
            return x
    
    # Training and evaluation
    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
    model = Net().to(device)
    optimizer = torch.optim.Adam(model.parameters(), lr=0.0005)
    
    
    def train(epoch):
        model.train()
    
        loss_all = 0
        for data in train_loader:
            data = data.to(device)
            optimizer.zero_grad()
            output = model(data)
            loss = F.nll_loss(output, data.y)
            loss.backward()
            loss_all += data.num_graphs * loss.item()
            optimizer.step()
        return loss_all / len(train_dataset)
    
    
    def test(loader):
        model.eval()

        correct = 0
        with torch.no_grad():  # gradients are not needed for evaluation
            for data in loader:
                data = data.to(device)
                pred = model(data).max(dim=1)[1]
                correct += pred.eq(data.y).sum().item()
        return correct / len(loader.dataset)
    
    for epoch in range(1, 201):
        loss = train(epoch)
        train_acc = test(train_loader)
        test_acc = test(test_loader)
        print('Epoch: {:03d}, Loss: {:.5f}, Train Acc: {:.5f}, Test Acc: {:.5f}'.
              format(epoch, loss, train_acc, test_acc))
    
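    After training, the model can score an individual graph; a minimal sketch (the batch vector, normally added by DataLoader, is built by hand for a batch of one):

    # Predict the class of one held-out graph
    model.eval()
    data = test_dataset[0].to(device)
    # All nodes belong to graph 0 in a batch of one
    data.batch = torch.zeros(data.num_nodes, dtype=torch.long, device=device)
    with torch.no_grad():
        pred = model(data).argmax(dim=-1)
    print(pred.item(), data.y.item())  # predicted vs. true class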

    Output:

    Epoch: 001, Loss: 1.80737, Train Acc: 0.16667, Test Acc: 0.16667
    Epoch: 002, Loss: 1.78834, Train Acc: 0.19259, Test Acc: 0.15000
    Epoch: 003, Loss: 1.77594, Train Acc: 0.25185, Test Acc: 0.21667
    Epoch: 004, Loss: 1.77198, Train Acc: 0.27593, Test Acc: 0.28333
    Epoch: 005, Loss: 1.75865, Train Acc: 0.27407, Test Acc: 0.26667
    Epoch: 006, Loss: 1.73226, Train Acc: 0.28333, Test Acc: 0.25000
    Epoch: 007, Loss: 1.73482, Train Acc: 0.26852, Test Acc: 0.25000
    Epoch: 008, Loss: 1.70825, Train Acc: 0.27407, Test Acc: 0.25000
    Epoch: 009, Loss: 1.70847, Train Acc: 0.31111, Test Acc: 0.25000
    Epoch: 010, Loss: 1.69101, Train Acc: 0.31111, Test Acc: 0.28333
    Epoch: 011, Loss: 1.67449, Train Acc: 0.32222, Test Acc: 0.36667
    ......
    Epoch: 195, Loss: 0.71042, Train Acc: 0.85185, Test Acc: 0.45000
    Epoch: 196, Loss: 0.68334, Train Acc: 0.84630, Test Acc: 0.41667
    Epoch: 197, Loss: 0.72414, Train Acc: 0.86481, Test Acc: 0.46667
    Epoch: 198, Loss: 0.68001, Train Acc: 0.86481, Test Acc: 0.48333
    Epoch: 199, Loss: 0.71797, Train Acc: 0.84259, Test Acc: 0.48333
    Epoch: 200, Loss: 0.72108, Train Acc: 0.84074, Test Acc: 0.45000
    
