This commit is contained in:
Patrick Jentsch 2019-05-16 13:19:20 +02:00
parent 9536116cc2
commit 75dd73f383

View File

@ -32,9 +32,11 @@ docker pull gitlab.ub.uni-bielefeld.de:4567/sfb1288inf/ocr:latest
``` ```
mkdir -p /<mydatalocation>/files_for_ocr /<mydatalocation>/files_from_ocr mkdir -p /<mydatalocation>/files_for_ocr /<mydatalocation>/files_from_ocr
``` ```
2. Place your files inside the `/<mydatalocation>/files_for_ocr` directory. Files can either be 2. Place your files inside the `/<mydatalocation>/files_for_ocr` directory. Files can either be
multipage TIFF (.tiff, .tif) or PDF (.pdf) files. Files should all contain text multipage TIFF (.tiff, .tif) or PDF (.pdf) files. Files should all contain text
of the same language. of the same language.
3. Start the OCR process. 3. Start the OCR process.
``` ```
docker run \ docker run \
@ -48,6 +50,7 @@ docker run \
-l <languagecode> -l <languagecode>
``` ```
The specified below `sfb1288inf/ocr:latest` are described in the OCR arguments part. The specified below `sfb1288inf/ocr:latest` are described in the OCR arguments part.
4. Check your results in the `/<mydatalocation>/files_from_ocr` directory. 4. Check your results in the `/<mydatalocation>/files_from_ocr` directory.
### OCR arguments ### OCR arguments