一心只想往前飞飞便千山和万水

编程 pytesseract使用

编程

未归档(1) 草稿本(1) 计算机视觉(5)

/ 注册

pytesseract使用

503 浏览 0 回复 2020-08-18

一心只想往前飞飞便千山和万水

+关注

pytesseract利用tesseract进行OCR文字识别。

依赖项

pillow文档及安装
pip install pillow
tesseract下载点这儿
tesseract下载并安装完后需要配置系统变量及tesseract变量。
```
 1. 配置系统变量    
```
```
 2. 配置tesseract变量
```
pytesseract文档及安装
pip install pytesseract

OCR使用

pytesseract使用

from PIL import Image
import pytesseract

pytesseract.pytesseract.tesseract_cmd = n'<full_path_to_your_tesseract_executable>'

# 转成文字
print(pytesseract.image_to_string(Image.open('test.png')))

# 指定语言
print(pytesseract.image_to_string(Image.open('test-european.jpg'), lang="fra'))

tesseract使用
tesseract 图像路径输出.txt

注意事项

安装时选择需要的语言，若不能自动安装，参考这篇文章或到这儿下载
语言包置于.\Tesseract-OCR\tessdata文件夹下

#python #OCR

举报

收藏 1

赞

评论加载中...