Python 最简单的OCR——paddleocr简要教程

作者: 程序员的隐秘角落 | 来源:发表于2021-11-29 11:26 被阅读0次

想要一个识别多种语言，但是准确率还有一定要求OCR，还想免费怎么办?

这里一分钟教会你paddleocr。

测试环境：Win10+Python3.7

环境搭载(控制台输入)：

设置百度的镜像源: pip config set global.index-url https://mirror.baidu.com/pypi/simple

pip3 install paddleocr

#pip install paddleocr==2.0.2

python -m pip install paddlepaddle -i https://mirror.baidu.com/pypi/simple

pip install pyUserInput

注意这里一定要以 pip3 安装。

在py文件同级目录下放一个o.png图像。

图像如下：

测试代码如下:

from paddleocr import PaddleOCR

ocr=PaddleOCR(use_angle_cls = True,use_gpu= False) #使用CPU预加载，不用GPU

text=ocr.ocr("o.png",cls=True) #打开图片文件

#打印所有文本信息

for t in text:

print(t[1][0])

运行结果输出如下：

总结：paddleocr 是一个十分好的开源项目，可以搭配OpenCv 根据自己实际项目的图像获取拉平处理，再OCR可以实现精准识别。

【问题列表】

（1）error: Microsoft Visual C++ 14.0 is required. Get it with "Microsoft Visual C++ Build Tools": http://landinghub.visualstudio.com/visual-cpp-build-tools

【解决】：python_Levenshtein‑0.12.0‑cp36‑cp36m‑win_amd64.whl

下载地址：https://www.lfd.uci.edu/~gohlke/pythonlibs/

pip install C:\Users\IMMOTOR\Desktop\python_Levenshtein-0.12.0-cp36-cp36m-win_amd64.whl

（2） Could not find a version that satisfies the requirement fasttext==0.9.1 (from paddleocr) (from versions: )No matching distribution found for fasttext==0.9.1 (from paddleocr)

【解决】：安装之，pip install fasttext==0.9.1

（3）升级jinja2:

pip install Jinja2 --upgrade

（4）问题解决：ModuleNotFoundError: No module named 'paddle'

【解决】：python -m pip install paddlepaddle -i https://mirror.baidu.com/pypi/simple

（4） xlrd.biffh.XLRDError: Excel xlsx file

pip uninstall xlrd

pip install xlrd==1.2.0

（5）paddleOCR识别速度过慢

python paddleocr 增加识别速度的方法

切记长度不要超过960px

1 模型压缩 enable_mkldnn=True,use_tensorrt=True,use_angle_cls=False

ocr = PaddleOCR(enable_mkldnn=True,use_tensorrt=True,use_angle_cls=False,use_gpu= False)

text=ocr.ocr(img_name) #打开图片文件

网友评论

本文标题：Python 最简单的OCR——paddleocr简要教程

本文链接：https://www.haomeiwen.com/subject/lrnkxrtx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

Python 最简单的OCR——paddleocr简要教程

【问题列表】

相关文章

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读