Machine Learning (CNN): 识别手写数字

作者: 正在学习的Yuki | 来源:发表于2019-06-19 17:34 被阅读0次

机器学习入门 Machine Learning Tutorial I: 图像 Images

1. 手写数字识别 Handwritten Digits Recognition

附上对应英文，有助于日后更深层次的学习；若对内容有任何疑问，欢迎留言讨论 : )

内容简介

通过训练简单的CNN模型（Convolutional Neural Networks 卷积神经网络）来识别MNIST数据集中的手写数字，精确度高于99%。

环境准备

Python 3.6
pandas 0.24.2
tensorflow 1.13.1 / keras 2.2.4

1. 导入模块 (Import module)

import pandas as pd
from keras import datasets, layers, models # 或 from tensorflow.keras import datasets, layers, models

2. 下载及处理MNIST数据集 (Download and prepare the MNIST dataset)

数据来源：keras.datasets.minist
training data: 60000 images, 28x28 pixels for each image
testing data: 10000 images, 28x28 pixels for each image
图像示例：
后附数据视觉化(Data visualization)代码

image.png

下载数据集：

(train_images, train_labels), (test_images, test_labels) = datasets.mnist.load_data()

处理数据集：

train_images = train_images.reshape((60000, 28, 28, 1))
test_images = test_images.reshape((10000, 28, 28, 1))
train_images, test_images = train_images / 255.0, test_images / 255.0 # 将像素值标准化为至0-1区间 (原为0-255)

数据视觉化（如果你想看看图片）：

def visualize(X, label):
    # Convert train datset to (num_images, img_rows, img_cols) format
    X = X.reshape(X.shape[0], 28, 28)
    for i in range(0, 9):
        plt.subplot(330+(i+1))
        plt.imshow(X[i], cmap=plt.get_cmap('gray'))
        plt.title(label[i])
    plt.show()

visualize(train_images, train_labels)

3. 建立模型 (Build the model)

model = models.Sequential()
model.add(layers.Conv2D(32, (3, 3), activation='relu', input_shape=(28,28,1)))
model.add(layers.MaxPool2D(2, 2))
model.add(layers.Conv2D(64, (3, 3), activation='relu'))
model.add(layers.MaxPool2D(2, 2))
model.add(layers.Conv2D(64, (3, 3), activation='relu'))
model.add(layers.Flatten())
model.add(layers.Dense(64, activation='relu'))
model.add(layers.Dense(10, activation='softmax'))

4. 编译及训练模型 (Compile and train the model)

model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])
model.fit(train_images, train_labels, epochs=5)

5. 评估模型 (Evaluate the model)

test_loss, test_acc = model.evaluate(test_images, test_labels)
print("Test Accuracy: ", test_acc)

准确率 Accuracy: 99.16%

image.png

6. 输出预测结果 (Output the predictions)

predictions = model.predict_classes(test_images, verbose=0)
submissions=pd.DataFrame({"ImageId": list(range(1,len(predictions)+1)),
                         "Label": predictions})
submissions.to_csv("predictions_cnn.csv", index=False, header=True)

即可在当前文件夹中看到predictions_cnn.csv文件，内含预测结果。

学习自：https://www.tensorflow.org/beta/tutorials/images/intro_to_cnns

Machine Learning (CNN): 识别手写数字

机器学习入门 Machine Learning Tutorial I: 图像 Images

1. 手写数字识别 Handwritten Digits Recognition

内容简介

环境准备

1. 导入模块 (Import module)

2. 下载及处理MNIST数据集 (Download and prepare the MNIST dataset)

3. 建立模型 (Build the model)

4. 编译及训练模型 (Compile and train the model)

5. 评估模型 (Evaluate the model)

准确率 Accuracy: 99.16%

6. 输出预测结果 (Output the predictions)

相关文章

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读

人工智能/模式识别/机器学习精华专题

大数据，机器学习，人工智能

深度学习-推荐系统-CV-NLP

机器学习和人工智能入门

机器学习与计算机视觉