diff --git a/README.md b/README.md
index 699e5ba..2a0e4e5 100644
--- a/README.md
+++ b/README.md
@@ -49,6 +49,7 @@ Valid language codes are:
 2. Re-enter the screen session to check the status of the running OCR job with `screen -r `. (Try this if there is an error. `script -q -c "screen -r " /dev/null`).
 
 ## Use prebuilt image
+Download via registry function with login or deploy token.
 
-## Add additional trained data for OCR of additional languages.
-TBD
+## Add additional traineddata for OCR of additional languages.
+Additional traineddata can be easily added to the docker file. Just append the needed data file URL after line 56, following the same syntax. The standard traineddata for various languages can be found under https://github.com/tesseract-ocr/tesseract/wiki/Data-Files#data-files-for-version-400-november-29-2016. The URL for Afrikaans (afr), for example, would be https://github.com/tesseract-ocr/tessdata/raw/4.00/afr.traineddata.
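
The URL pattern described in the added README text can be sketched as a small shell helper. This is only an illustration: `traineddata_url` is a hypothetical function name, and it assumes that other language codes follow the same pattern as the Afrikaans example (`deu` is used here as an assumed second code). It prints the URLs only; how they are appended to the docker file depends on the syntax already used there, which this diff does not show.

```shell
#!/bin/sh
# Hypothetical helper: build the traineddata URL for a language code.
# Assumption: every code follows the pattern of the Afrikaans (afr) example.
traineddata_url() {
  printf 'https://github.com/tesseract-ocr/tessdata/raw/4.00/%s.traineddata\n' "$1"
}

# Print the URL for each language code of interest.
for code in afr deu; do
  traineddata_url "$code"
done
```

Each printed URL could then be added to the docker file in the same form as the existing entries.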