Commit Graph

98 Commits

Author SHA1 Message Date
Patrick Jentsch
4dea95a108 Preliminary work 2021-07-13 16:31:53 +02:00
Patrick Jentsch
5139fd9727 Fix problem where encoding is not set 2021-06-22 12:46:01 +02:00
Patrick Jentsch
fd39246e4b Update file handling. Now md5 is correct 2021-05-18 10:26:03 +02:00
Patrick Jentsch
bd5d8ddedb Fix problems caused by wrong textwrap.wrap usage 2021-04-30 09:44:35 +02:00
Patrick Jentsch
f7b7da2b1f restrict memory usage for nlp tasks 2021-04-22 08:46:28 +02:00
Patrick Jentsch
2813d1a222 Fix long text processing 2021-04-22 08:43:34 +02:00
Patrick Jentsch
cd976692d6 Don't process files in subdirectories 2021-04-12 13:24:31 +02:00
Patrick Jentsch
4e7669d009 Return the returncode 2021-04-12 09:26:21 +02:00
Patrick Jentsch
8105edfd1b Add missing argument to wrapper script 2021-04-12 09:20:28 +02:00
Patrick Jentsch
72409bd12d Fix race condition 2021-03-26 14:48:38 +01:00
Patrick Jentsch
54f336e620 Fix permissions 2021-03-26 10:09:45 +01:00
Patrick Jentsch
3b570e5df1 more pipeline help tweaks 2021-03-26 10:02:14 +01:00
Patrick Jentsch
dc62755d12 Update README and pipeline help 2021-03-26 10:01:51 +01:00
Patrick Jentsch
aa1bfa259d Use JSON files for stand-off annotations. 2021-03-26 09:46:17 +01:00
Patrick Jentsch
d620c29f27 Fix version 1.0.0 2021-02-25 11:26:11 +01:00
Patrick Jentsch
2ced38504c Use "buster" instead of "10" in FROM 2020-10-08 23:17:58 +02:00
Patrick Jentsch
f02c0953bf Use new Dockerfile structure 2020-10-08 23:08:49 +02:00
Patrick Jentsch
5329446277 Update CI script 2020-10-07 17:09:09 +02:00
Patrick Jentsch
15e373db58 fix gitlab ci 2020-09-23 16:53:16 +02:00
Patrick Jentsch
8afdfb13b2 Use smaller models 2020-09-23 15:46:43 +02:00
Patrick Jentsch
1ed42f68ad Remove clean stage from stages 2020-09-23 15:27:31 +02:00
Patrick Jentsch
42583fea46 Update to newer Version 2020-09-23 15:26:53 +02:00
Patrick Jentsch
5bd0feda5c fix pipeline 2020-06-23 15:19:39 +02:00
Patrick Jentsch
5980a995e5 Add missing newline 2020-06-10 14:23:43 +02:00
Patrick Jentsch
fe7ab93513 Update nlp software metadata represantation 2020-06-10 13:14:34 +02:00
Stephan Porada
91708308bc Add model version number 2020-05-20 15:35:45 +02:00
Stephan Porada
887e814020 Fix 2020-05-20 15:01:52 +02:00
Stephan Porada
3fc6ebff4c Add stand off varaiant and metadata 2020-05-20 14:55:52 +02:00
Patrick Jentsch
bef51b7d81 Keep uncompressed output files after zip jobs. 2020-05-13 09:07:31 +02:00
Patrick Jentsch
68e86338d4 Bump versions 2020-04-06 09:21:38 +02:00
Patrick Jentsch
30d127f3af Fix zip creation 2020-04-04 15:37:12 +02:00
Patrick Jentsch
e061a7426d Update NLP Pipeline 2020-04-03 17:35:05 +02:00
stephan
41910afb79 Add nlp to filename 2020-02-18 10:17:24 +01:00
stephan
5d2fee029e Some cosmetics 2020-02-17 14:58:18 +01:00
stephan
6e87e0decd Add filename argument for zip results 2020-02-17 11:57:55 +01:00
Stephan Porada
79043f3dd7 Fix last errors 2020-02-12 14:25:08 +01:00
Stephan Porada
1a3e4a0a02 Fix check_encoding functionality 2020-02-12 14:16:36 +01:00
Stephan Porada
504861ae07 Update Dockerfile 2020-02-12 13:48:30 +01:00
Stephan Porada
88d03d4360 Add function to check the encoding of input text files. 2020-02-12 13:46:43 +01:00
Patrick Jentsch
6769be049a Escape text and lemma 2020-02-04 13:12:31 +01:00
Patrick Jentsch
ec2cf1dcff Fix zip switch integration 2020-02-03 15:26:04 +01:00
Patrick Jentsch
e4ef4835e5 Add a switch for zip functionality 2020-02-03 15:02:26 +01:00
Patrick Jentsch
5f20f9be40 Remove id xml attribute from output file 2020-01-27 15:59:32 +01:00
Patrick Jentsch
b0a402b3ac Add zip creation 2020-01-20 15:09:38 +01:00
Patrick Jentsch
543a1ba29a Bump version 2020-01-07 11:24:11 +01:00
Patrick Jentsch
d5a2d38c17 fix 2019-11-04 15:18:52 +01:00
Patrick Jentsch
4af9d9c899 Update 2019-11-04 15:15:41 +01:00
Patrick Jentsch
de8160a5b6 Update .gitlab-ci.yml 2019-09-19 09:25:29 +02:00
Patrick Jentsch
d564ed0464 Update .gitlab-ci.yml 2019-09-19 09:24:04 +02:00
Patrick Jentsch
abf6c430c3 Update .gitlab-ci.yml 2019-09-16 15:52:23 +02:00