Update README.md

2026-06-12 10:15:44 +00:00 · 2019-04-03 10:21:43 +02:00
parent 08d9e594bb
commit d3854bfdd0
1 changed files with 3 additions and 2 deletions
@@ -49,6 +49,7 @@ Valid language codes are:
 2. Re-enter the screen session to check the status of the running OCR job with `screen -r <container-name>`. If this fails with an error, try `script -q -c "screen -r <container-name>" /dev/null` instead.
 ## Use prebuilt image
 Download via the registry function with a login or deploy token.
-## Add additional trained data for OCR of additional languages.
+## Add additional traineddata for OCR of additional languages.
-TBD
+Additional traineddata can easily be added to the Dockerfile: append the URL of the required data file after line 56, following the same syntax. The standard traineddata files for various languages are listed at https://github.com/tesseract-ocr/tesseract/wiki/Data-Files#data-files-for-version-400-november-29-2016. For example, the URL for Afrikaans (afr) is https://github.com/tesseract-ocr/tessdata/raw/4.00/afr.traineddata.
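
The URL pattern in the added README text can be sketched as a small shell helper. This is only an illustration: the function name `traineddata_url` is hypothetical, and the pattern is inferred from the single Afrikaans example, so it is an assumption that other language codes follow it. It does not reproduce the Dockerfile's own syntax around line 56, which this diff does not show.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repository): build the tessdata 4.00
# download URL for a Tesseract language code such as "afr".
# Assumption: every language file follows the pattern of the afr example.
traineddata_url() {
    printf 'https://github.com/tesseract-ocr/tessdata/raw/4.00/%s.traineddata\n' "$1"
}

# Prints the Afrikaans URL quoted in the README text.
traineddata_url afr
```

The resulting URL is what would be appended to the Dockerfile, in whatever form the existing entries there use.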