mirror of
https://gitlab.ub.uni-bielefeld.de/sfb1288inf/ocr.git
synced 2024-12-26 05:34:17 +00:00
.gitlab-ci.yml | ||
Dockerfile | ||
hocrtotei | ||
ocr | ||
README.md |
Installation
Install additional packages
- Install
screen
. We will use this to execute commands in their own terminal session.
Build your own image
- Clone this repository and navigate into it.
- Build the image from the dockerfile.
docker build -t <image_name>:<tag> .
For example:docker build -t ocr_container:latest .
Alternatively build directly from git.
- Use the following command to build directly from gitLab.
docker build -t <image_name>:<tag> https://gitlab.ub.uni-bielefeld.de/sfb1288inf/ocr.git
.
Folder setup
- Create input and output folders for the OCR files.
mkdir -p /some/path/<container-name>/ocr/files_for_ocr /some/path/<image_name>/ocr/files_from_ocr
Run the container
- Run container from an image. <contianer_name> and /some/path are the same as mentioned in the step folder setup. We are creating two volumes based on the folder paths provided in the section Folder setup.
docker run \
--name <container-name> \
-dit \
-v /some/path/<container-name>/files_for_ocr:/root/files_for_ocr \
-v /some/path/<container-name>/files_from_ocr:/root/files_from_ocr \
<image_name>
Start an OCR job
- Place some files inside the folder files_for_ocr. Files can either be multipage tiffs or PDF files. One folder per file is needed. Files should all be of the same language.
- Start a screen session with
screen -dmS <container_name>
- Enter the screen session with
screen -r <container-name>
. (Try this if there is an error.script -q -c "screen -r <container-name>" /dev/null
). - Start the OCR process for all files placed in files_for_ocr with
docker exec -it <container-name> ocr -i files_for_ocr -o files_from_ocr -l <sprachcode>
.
Valid language codes are:
- deu (German)
- deu_frak (German Fraktur)
- eng (English)
- enm (Middle englisch)
- fra (French)
- frm (Middle french)
- por (Portuguese)
- spa (Spanish)
Exit an re-enter the current running OCR process
- You can leave the currently running OCR process by pressing
ctrl + a + d
and thus leaving the screen session. - Re-enter the screen session to check the status of the running OCR job with
screen -r <container-name>
. (Try this if there is an error.script -q -c "screen -r <container-name>" /dev/null
).
Use prebuilt image
Add additional trained data for OCR of additional languages.
TBD