Update README.md

This commit is contained in:
Stephan Porada 2019-04-03 10:21:43 +02:00
parent 08d9e594bb
commit d3854bfdd0

View File

@ -49,6 +49,7 @@ Valid language codes are:
2. Re-enter the screen session to check the status of the running OCR job with `screen -r <container-name>`. (Try this if there is an error. `script -q -c "screen -r <container-name>" /dev/null`). 2. Re-enter the screen session to check the status of the running OCR job with `screen -r <container-name>`. (Try this if there is an error. `script -q -c "screen -r <container-name>" /dev/null`).
## Use prebuilt image ## Use prebuilt image
Download via regestry function with login or deploy token.
## Add additional trained data for OCR of additional languages. ## Add additional traineddata for OCR of additional languages.
TBD Additional traineddata can be easily added to the docker file. Just append the needed data file URL after line 56 followin the same syntax. The standard traineddata for various languages can be found under https://github.com/tesseract-ocr/tesseract/wiki/Data-Files#data-files-for-version-400-november-29-2016. The URL for Afrikaans (afr) would be for example https://github.com/tesseract-ocr/tessdata/raw/4.00/afr.traineddata.