This commit is contained in:
Patrick Jentsch 2019-05-16 13:19:20 +02:00
parent 9536116cc2
commit 75dd73f383

View File

@ -32,9 +32,11 @@ docker pull gitlab.ub.uni-bielefeld.de:4567/sfb1288inf/ocr:latest
```
mkdir -p /<mydatalocation>/files_for_ocr /<mydatalocation>/files_from_ocr
```
2. Place your files inside the `/<mydatalocation>/files_for_ocr` directory. Files can either be
multipage TIFF (.tiff, .tif) or PDF (.pdf) files. Files should all contain text
of the same language.
3. Start the OCR process.
```
docker run \
@ -48,6 +50,7 @@ docker run \
-l <languagecode>
```
The specified below `sfb1288inf/ocr:latest` are described in the OCR arguments part.
4. Check your results in the `/<mydatalocation>/files_from_ocr` directory.
### OCR arguments