Commit Graph

138 Commits

Author SHA1 Message Date
Patrick Jentsch
613bceb4ff Add new models 2021-02-23 11:11:50 +01:00
Patrick Jentsch
ca7df6d0ed First work on version 1.0.0 2021-02-19 13:04:03 +01:00
Patrick Jentsch
07635dcdfa Use "buster" instead of "10" in FROM 2020-10-08 23:17:48 +02:00
Patrick Jentsch
c0069d5453 Use new Dockerfile structure 2020-10-08 23:09:10 +02:00
Patrick Jentsch
e941f64ee4 test new ci config 2020-10-07 16:44:38 +02:00
Stephan Porada
cb68d6de2d One thread per page ocr patch 2020-10-07 13:46:22 +02:00
Patrick Jentsch
4b84488fe6 fix gitlab ci 2020-09-23 16:58:07 +02:00
Patrick Jentsch
7d52ad9f68 Update 2020-09-23 15:52:24 +02:00
Patrick Jentsch
ac4b5c2fd8 Add possibility to use an intermediate dir 2020-09-22 17:44:32 +02:00
Patrick Jentsch
6d90d43699 fix cleanup attempt 2020-09-21 15:36:03 +02:00
Patrick Jentsch
4bd0d3bb01 Use commit_sha for intermediate image 2020-09-21 15:02:04 +02:00
Patrick Jentsch
15061bfaaf add tag to clean stage 2020-09-21 15:00:09 +02:00
Patrick Jentsch
7cc8ebd666 compile tesseract in container 2020-09-21 14:46:03 +02:00
Patrick Jentsch
82285a8e6c better multithreading 2020-07-02 11:49:35 +02:00
Patrick Jentsch
7322a5bc7c More GhostScript, less dependencies! 2020-07-02 11:47:43 +02:00
Patrick Jentsch
2b63ba9e59 Remove unused dependencies and use ghostscript for image split 2020-07-01 11:03:34 +02:00
Patrick Jentsch
aee9628e5e fix pipeline 2020-06-23 15:19:27 +02:00
Stephan Porada
ec5b4eb521 Add PDF compression 2020-06-16 09:31:34 +02:00
Stephan Porada
b77ca5914f Set relative file paths in hocr 2020-06-10 11:48:58 +02:00
Stephan Porada
018939ae55 Add PoCo zips part 1 2020-06-09 16:58:22 +02:00
Patrick Jentsch
64fe706126 Keep uncompressed output files after zip jobs. 2020-05-13 09:11:01 +02:00
Patrick Jentsch
a75b32ca1d Bump versions 2020-04-06 09:21:52 +02:00
Patrick Jentsch
364e3d626d Fix zip creation 2020-04-04 15:37:21 +02:00
Patrick Jentsch
36a86887b0 Update OCR Pipeline 2020-04-03 17:35:30 +02:00
stephan
eb5ccf4e21 Add ocr to filenames 2020-02-18 10:16:24 +01:00
stephan
c1f5252633 Some cosmetics 2020-02-17 14:59:34 +01:00
stephan
880f0efcf9 Add zip fielname argument 2020-02-17 14:26:50 +01:00
Patrick Jentsch
6c4a642cb7 Add a switch for zip functionality 2020-02-03 15:00:27 +01:00
Patrick Jentsch
dfc05be7db add zip creation of results 2020-01-20 15:04:55 +01:00
Patrick Jentsch
3a4cc16e5b Update 2019-11-04 15:14:59 +01:00
Patrick Jentsch
8a4d006687 Update .gitlab-ci.yml 2019-09-16 15:39:02 +02:00
Patrick Jentsch
3e43c8eab5 Update .gitlab-ci.yml 2019-09-16 15:33:35 +02:00
Patrick Jentsch
f1d1434e1a Update .gitlab-ci.yml 2019-09-16 15:30:11 +02:00
Patrick Jentsch
62a435e8c2 Update .gitlab-ci.yml 2019-09-16 15:28:33 +02:00
Patrick Jentsch
088cf49b89 set charset again! 2019-09-12 11:30:52 +02:00
Patrick Jentsch
cebc53da03 Codestyle 2019-09-11 15:15:00 +02:00
Patrick Jentsch
1fd85d1b44 Change CI script. 2019-07-31 11:23:41 +02:00
Patrick Jentsch
fa4a798351 Use language models from repository. Remove workaround for the legacy German Fraktur model. 2019-07-31 11:13:55 +02:00
Patrick Jentsch
1a3d7175fe Remove comments 2019-06-11 14:18:46 +02:00
Patrick Jentsch
6f6d6e809e Update .gitlab-ci.yml 2019-06-04 12:18:31 +02:00
Patrick Jentsch
148e9e86e9 Use variable instead of headcoded string 2019-06-03 14:20:22 +02:00
Patrick Jentsch
f280b16b1b Make arguments optional 2019-06-03 14:18:16 +02:00
Patrick Jentsch
b5ba154f86 Update for unprivileged usage 2 2019-06-03 13:32:42 +02:00
Patrick Jentsch
a4b68bece7 Use more specific versions. 2019-06-02 21:45:11 +02:00
Patrick Jentsch
a433aea3e6 Fix 2019-06-02 21:41:33 +02:00
Patrick Jentsch
95adc4d804 Update for unprivileged usage. 2019-06-02 21:38:30 +02:00
Patrick Jentsch
f731634ba1 Update Dockerfile 2019-05-27 11:47:38 +02:00
Patrick Jentsch
f73a191314 Add wrapper and remove default arguments from Dockerfile 2019-05-21 12:29:26 +02:00
Patrick Jentsch
8a9ff27aaa Add usage hint. 2019-05-20 12:06:57 +02:00
Patrick Jentsch
8ca24c3a14 Merge branch 'master' of gitlab.ub.uni-bielefeld.de:sfb1288inf/ocr 2019-05-20 11:10:49 +02:00