Python jieba 去除停用词生成词云图

作者: zestloveheart | 来源:发表于2018-10-11 19:24 被阅读0次

Python jieba 去除停用词生成词云图
词云图
jieba分词使用报告
Python词云图生成
python:生成词云图
20220213memo#艽野尘梦#读书笔记
基于 CNN 的中文歌词文本情感分类
Python3 生成中文词云
我与Python相遇的每天_2020-5-28 词云图
[数据分析]基于人物登场率生成《倚天》词云图

读写文件

把待读取的文本存在info.txt中，content类型为str

with open('info.txt', 'r', encoding="UTF-8") as file1:  # with as操作读取文件很ok
    content = "".join(file1.readlines())

待写入文件为 output.txt，content_after为待写入字符串

with open('output.txt', 'w', encoding='utf-8') as file2:
    file2.write(content_after+"\n")

分词

# 调用jieba.cut
sentence_seged = jieba.cut(content)

去除停用词

建立停用词表
将停用词表放在stop.txt中，一行一个词

# stopwords为停用词list
stopwords = [line.strip() for line in open('stop.txt', 'r', encoding='utf-8').readlines()]

遍历去除停用词

outstr = '' # 待返回字符串

 for word in sentence_seged:
    if word not in stopwords:
        outstr += word + " "

生成词云图

images = Image.open("something.png") # 打开保存的图片
maskImages = np.array(images) # 并用numpy转换
wc = WordCloud(font_path="msyh.ttc", background_color="white", max_words=100, max_font_size=100).generate(content_after) # 生成词云图
wc.to_file('wordCloudPic.png')    # 保存到本地图片文件

网友评论

本文标题：Python jieba 去除停用词生成词云图

本文链接：https://www.haomeiwen.com/subject/ouhiaftx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

Python jieba 去除停用词生成词云图

读写文件

分词

去除停用词

生成词云图

相关文章

Python jieba 去除停用词生成词云图

词云图

jieba分词使用报告

Python词云图生成

python:生成词云图

20220213memo#艽野尘梦#读书笔记

基于 CNN 的中文歌词文本情感分类

Python3 生成中文词云

我与Python相遇的每天_2020-5-28 词云图

[数据分析]基于人物登场率生成《倚天》词云图

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读

Python jieba 去除停用词 生成词云图

读写文件

分词

去除停用词

生成词云图

相关文章

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读

Python jieba 去除停用词生成词云图