Patrick Jentsch 5b7bc2a840 Update
2019-05-20 11:28:51 +02:00
.gitlab-ci.yml Update .gitlab-ci.yml 2019-03-13 18:33:11 +01:00
Dockerfile Update 2019-05-20 11:28:51 +02:00
nlp Update 2019-05-20 11:28:51 +02:00
README.md Update README.md 2019-03-27 09:41:44 +01:00
spacy_nlp Update 2019-05-20 11:28:51 +02:00
test.py Add linewrap function and test.py for fun. 2019-03-06 14:17:03 +01:00

Natural language processing

This repository provides all the code needed to build a container image for natural language processing with spaCy. If you don't want to build the image yourself, a prebuilt image is also available in the registry.

Build the image

user@machine:~$ cd <path-to-this-repository>
user@machine:~$ docker build -t gitlab.ub.uni-bielefeld.de:4567/sfb1288inf/nlp .

Start a container

user@machine:~$ docker run \
  --name nlp-container \
  -dit \
  -v <your-input-directory>:/root/files_for_nlp \
  -v <your-output-directory>:/root/files_from_nlp \
  gitlab.ub.uni-bielefeld.de:4567/sfb1288inf/nlp

Start a natural language processing run

user@machine:~$ docker exec -it nlp-container \
  nlp -i files_for_nlp -o files_from_nlp -l <language-code>

Where <language-code> needs to be one of the following:

  • de (German)
  • en (English)
  • es (Spanish)
  • fr (French)
  • pt (Portuguese)
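
Because only these five codes are listed, it can help to check the code before starting a run. The following is a minimal sketch, not part of this repository: the function names `is_supported_lang` and `run_nlp` are hypothetical, and it assumes the container was started as `nlp-container` with the mounts shown above.

```shell
#!/bin/sh
# Hypothetical helper: succeed only for the language codes listed above.
is_supported_lang() {
  case "$1" in
    de|en|es|fr|pt) return 0 ;;
    *) return 1 ;;
  esac
}

# Hypothetical wrapper around the documented `docker exec` call.
# Assumes a running container named nlp-container (see "Start a container").
run_nlp() {
  if ! is_supported_lang "$1"; then
    echo "unsupported language code: $1" >&2
    return 1
  fi
  docker exec -it nlp-container \
    nlp -i files_for_nlp -o files_from_nlp -l "$1"
}
```

After sourcing the file, `run_nlp de` would start a German run, while an unlisted code such as `run_nlp it` is rejected before Docker is invoked.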