diff --git a/README.md b/README.md index 85624d9..98834e5 100644 --- a/README.md +++ b/README.md @@ -14,10 +14,10 @@ This software implements a heavily parallelized pipeline to recognize text in PD 1. Install Docker and Python 3. 2. Clone this repository: `git clone https://gitlab.ub.uni-bielefeld.de/sfb1288inf/ocr.git` -2. Build the Docker image: `docker build -t gitlab.ub.uni-bielefeld.de:4567/sfb1288inf/ocr:v0.1.0 ocr` -2. Add the wrapper script (`wrapper/ocr` relative to this README file) to your `${PATH}`. -3. Create working directories for the pipeline: `mkdir -p //{input,models,output}`. -4. Place your Tesseract OCR model(s) inside `//models`. +3. Build the Docker image: `docker build -t gitlab.ub.uni-bielefeld.de:4567/sfb1288inf/ocr:v0.1.0 ocr` +4. Add the wrapper script (`wrapper/ocr` relative to this README file) to your `${PATH}`. +5. Create working directories for the pipeline: `mkdir -p //{input,models,output}`. +6. Place your Tesseract OCR model(s) inside `//models`. ## Use the Pipeline