Go to file
2019-03-27 09:40:22 +01:00
.gitlab-ci.yml Update .gitlab-ci.yml 2019-03-13 18:33:11 +01:00
Dockerfile Update Dockerfile 2019-03-11 23:37:50 +01:00
nlp min instead of max 2019-03-06 15:56:37 +01:00
README.md README.md hinzufügen 2019-03-27 09:40:22 +01:00
spacy_nlp Remove path from text id. 2019-03-06 18:31:18 +01:00
test.py Add linewrap function and test.py for fun. 2019-03-06 14:17:03 +01:00

Natural language processing

This repository provides all code that is needed to build a container image for natural language processing utilising spaCy. In case you don't want to build the image by yourself, there is also a prebuild image that can be used in the registry.

Build the image

user@machine:~$ cd <path-to-this-repository>
user@machine:~$ docker build -t gitlab.ub.uni-bielefeld.de:4567/sfb1288inf/nlp .

Starting a container

user@machine:~$ docker run \
  --name nlp-container \
  -dit \
  -v <your-input-directory>:/root/files_for_nlp \
  -v <your-output-directory>:/root/files_from_nlp \
  gitlab.ub.uni-bielefeld.de:4567/sfb1288inf/nlp

Start a natural language processing run

user@machine:~$ docker exec -it nlp-container \
  nlp -i files_for_nlp -o files_from_nlp -l <language-code>

Where needs to be one of the following:

  • de (Deutsch)
  • en (Englisch)
  • es (Spanish)
  • fr (Französisch)
  • pt (Portugisisch)