https://hub.docker.com/r/franky1/tesseract
https://github.com/Franky1/Tesseract-OCR-5-Docker
https://github.com/tesseract-ocr/tesseract
https://github.com/tesseract-ocr/tessdata
https://github.com/tesseract-ocr/tessdata_best
https://sourceforge.net/projects/vietocr/files/jTessBoxEditor/
https://tesseract-ocr.github.io/tessdoc/
docker run -d --rm --name tesseract -v /data/file/tessdata/tmp:/tmp -v /data/file/tessdata:/usr/local/share/tessdata/ -v /etc/localtime:/etc/localtime:ro -e TESSDATA_PREFIX='/usr/local/share/tessdata/' franky1/tesseract tesseract english.png output -l chi_sim
docker run -d --net=host --restart=always --name tesseract -v /data/file/tessdata:/tmp -v /data/file/tessdata:/usr/local/share/tessdata/ -v /etc/localtime:/etc/localtime:ro -e TESSDATA_PREFIX='/usr/local/share/tessdata/' franky1/tesseract tesseract english.png result -l chi_sim
-识别完后会生成result.txt文件
---Tesseract是一个开源的OCR(Optical Character Recognition,光学字符识别)引擎,可以识别多种格式的图像文件并将其转换成文本,目前已支持60多种语言(包括中文)
---
#英文
-l eng
#中文
-l chi_sim
https://blog.csdn.net/juzicode00/article/details/121538270
https://blog.csdn.net/qq_39569480/article/details/113883930
https://hub.docker.com/r/s8n02/nextcloud-tesseract-ocr
https://hub.docker.com/r/sa7ori/ocrdocker
http://www.htmltoo.com/