[python] 파이썬을 이용한 화면 글자 인식 pytesseract

개인 프로젝트 진행을 위해 화면 이미지 OCR을 해보고자 합니다.

적절한 라이브러리를 불러와야겠네요.

pytesseract를 사용하여 이미지의 정보를 불러오겠습니다.

Pytesseract 다운로드

https://github.com/madmaze/pytesseract

GitHub - madmaze/pytesseract: A Python wrapper for Google Tesseract

A Python wrapper for Google Tesseract. Contribute to madmaze/pytesseract development by creating an account on GitHub.

github.com

Install 방법

Windows
$ pip install pytesseract

Apple
$ brew install tesseract

Ubuntu
$ sudo apt install tesseract-ocr

pytesseract는 저장된 이미지에서 글자를 가져오는 기능을 수행합니다.

저장된 이미지에서 숫자를 가져오는 간단한 예시를 만들어 보겠습니다.

try:
	import Image
except ImportError:
	from PIL import Image
import pytesseract

print(pytesseract.image_to_string(Image.open('image.png')))

코드와 같은 폴더에 image.png라는 이름으로 숫자 이미지를 넣으면 정상적으로 인식됨을 볼수 있습니다.

최종적으로는 pyautogui와 pytesseract를 이용하여 정해진 시간에 특정 영역의 이미지를 capture하고 변수로 가져오는 동작까지 수행할수 있도록 만들어 보고자 합니다.

코딩의 재미는 하나씩 만들어가는 재미 아니겠습니까!!!

[python] 텔레그램 bot 만들기 (2)	2022.07.08
[python] openpyxl을 이용한 Excel 제어 (0)	2022.07.02
[Python] 크롤링을 위한 chrome driver 다운로드 (0)	2022.05.21
[python] 스마트폰 미러링 - scrcpy (0)	2022.04.29
[python] pip 설치시 에러문구 - 파일에 바이러스 ..... (0)	2022.04.29

Index Engineer