from keras.preprocessing.text import Tokenizer
from keras.preprocessing import sequence
text1 = "学习keras的Tokenizer"
text2 = "就是这么的简单"
texts = [text1, text2]
# num_words: the maximum number of words to keep in the vocabulary
# char_level: if True, every character is treated as a token
# oov_token: an out-of-vocabulary token used to replace words missing from the vocabulary
tokenizer = Tokenizer(num_words=5000, char_level=True, oov_token='UNK')
tokenizer.fit_on_texts(texts)
# how many times each word appeared across the training texts
print(tokenizer.word_counts)
# in how many documents each word appeared
print(tokenizer.word_docs)
# the total number of documents (texts) the tokenizer was fitted on
print(tokenizer.document_count)
# the word-to-index dictionary mapping
print(tokenizer.word_index)
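# A small sketch (not in the original): invert word_index to recover tokens
# from integer indices, which is handy for inspecting model inputs later.
index_word = {i: w for w, i in tokenizer.word_index.items()}
print(index_word)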
# mode: one of 'binary', 'count', 'tfidf', 'freq'; defaults to 'binary'
# returns a numpy array of shape (len(texts), num_words)
print(tokenizer.texts_to_matrix(texts))
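# Illustrative only: the other modes produce the same matrix shape, with
# 'count' giving raw term frequencies and 'tfidf' applying tf-idf weighting.
print(tokenizer.texts_to_matrix(texts, mode='count'))
print(tokenizer.texts_to_matrix(texts, mode='tfidf'))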
# a list of integer sequences, one per text
print(tokenizer.texts_to_sequences(texts))
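# A hedged example (hypothetical input string): because oov_token='UNK' was
# set above, characters never seen during fit_on_texts map to the index of
# 'UNK' instead of being silently dropped; seen characters keep their indices.
print(tokenizer.texts_to_sequences(["没见过的字"]))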
texts = tokenizer.texts_to_sequences(texts)
# pad or truncate every sequence to length 30, adding zeros / cutting at the end
texts = sequence.pad_sequences(texts, maxlen=30, padding='post', truncating='post')
print(texts)
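# Sketch of the reverse direction: sequences_to_texts maps the padded integer
# sequences back to tokens (padding zeros have no dictionary entry and are
# skipped; with char_level=True the recovered characters are space-joined).
print(tokenizer.sequences_to_texts(texts))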