Skip to content

Instantly share code, notes, and snippets.

@davit312
Created August 24, 2024 14:06
Show Gist options
  • Save davit312/2c7079367bee267b8857d50bbe25bafc to your computer and use it in GitHub Desktop.
Save davit312/2c7079367bee267b8857d50bbe25bafc to your computer and use it in GitHub Desktop.
Local ORC with docker, (it includes Aremenian too)
# Run ORC on local host with docker
# Download https://github.com/tesseract-ocr/tessdata_best for best quality
# Use -l hye+eng+rus to read Armenian and other languages
# And download tessconfigs, it is a link in repo as @tessconfigs
IMAGE_FULL_PATH=
docker run -it --rm \
-v /path/to/tessdata_best:/usr/local/share/tessdata/ \
-v $IMAGE_FULL_PATH:/tmp/img \
jitesoft/tesseract-ocr -l hye /tmp/img stdout
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment