nopaque

nopaque bundles various tools and services that give humanities scholars access to Digital Humanities (DH) methods and support their individual research processes. With nopaque, researchers can run Optical Character Recognition (OCR) on digitized sources. The resulting text files can then serve as the basis for Natural Language Processing (NLP), which automatically adds various linguistic annotations to the texts. In the web application, the NLP-processed data can be compiled into corpora and analyzed with complex search queries through an information retrieval system. The range of functions of the web application will be extended successively according to the needs of the researchers.

Prerequisites and requirements

  1. Install Docker for your system, following the official instructions.
  2. Install docker-compose, following the official instructions.
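
Before continuing, it can help to confirm that both tools are actually available on the PATH. The following is a minimal sketch; the helper name check_tool is hypothetical and not part of nopaque.

```shell
#!/bin/sh
# check_tool is a hypothetical helper (not part of nopaque): it reports
# whether a command is available on the PATH.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "$1: found"
  else
    echo "$1: missing"
    return 1
  fi
}

# Check both prerequisites and count how many are missing.
missing=0
for tool in docker docker-compose; do
  check_tool "$tool" || missing=$((missing + 1))
done
echo "missing prerequisites: $missing"
```

If the last line reports a number other than 0, revisit the installation steps above before moving on.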

Configuration and startup

Create Docker swarm

The generated computational workload is handled by a Docker swarm. A swarm is a group of machines that are running Docker and joined into a cluster. It consists of two kinds of members: manager nodes and worker nodes. The swarm setup process is best described in the Docker documentation.
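
As a rough orientation, a small swarm could be set up along the following lines. This is only a sketch under assumptions: the function names are hypothetical wrappers around the standard Docker CLI, and 192.0.2.10 is a placeholder address for the first manager node. The Docker documentation remains the authoritative guide.

```shell
#!/bin/sh
# Assumption: MANAGER_IP is the address of the first manager node.
# 192.0.2.10 is a documentation placeholder; replace it with your own.
MANAGER_IP="${MANAGER_IP:-192.0.2.10}"

# Run on the machine that should become the first manager node.
swarm_init() {
  docker swarm init --advertise-addr "$MANAGER_IP"
}

# Run on the manager: prints the complete "docker swarm join" command,
# including the token, that each worker node has to execute.
swarm_print_worker_join() {
  docker swarm join-token worker
}

# Run on the manager: lists all nodes that are part of the swarm.
swarm_list_nodes() {
  docker node ls
}
```

Defining the functions does not change anything yet. On the manager, call swarm_init followed by swarm_print_worker_join, run the printed command on every worker, and then use swarm_list_nodes to confirm that all nodes have joined.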

Create network storage

A shared network space is necessary so that all swarm members have access to all the data. To achieve this, a Samba share is used.

# Example: Create a Samba share via Docker
# More details can be found under https://hub.docker.com/r/dperson/samba/
username@hostname:~$ sudo mkdir -p /srv/samba/nopaque
username@hostname:~$ docker run \
                       --name nopaque_storage \
                       -v /srv/samba/nopaque:/srv/samba/nopaque \
                       -p 139:139 \
                       -p 445:445 \
                       dperson/samba \
                         -p -s "nopaque;/srv/samba/nopaque;no;no;no;nopaque" -u "nopaque;nopaque"

# Mount the Samba share on all swarm nodes (managers and workers)
username@hostname:~$ sudo mkdir /mnt/nopaque
username@hostname:~$ sudo mount --types cifs --options gid=${USER},password=nopaque,uid=${USER},user=nopaque,vers=3.0 //<SAMBA-SERVER-IP>/nopaque /mnt/nopaque

Download, configure and build nopaque

# Clone the nopaque repository
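username@hostname:~$ git clone https://gitlab.ub.uni-bielefeld.de/sfb1288inf/nopaque.git
# Change into the cloned repository; the following commands are run from there
username@hostname:~$ cd nopaque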
username@hostname:~$ cp .env.tpl .env
# Fill out the variables within this file.
username@hostname:~$ <YOUR EDITOR> .env
username@hostname:~$ touch docker-compose.override.yml
# Tweak the docker-compose.override.yml to satisfy your needs. (You can find examples in docker-compose.<example>.yml)
username@hostname:~$ <YOUR EDITOR> docker-compose.override.yml
# Build docker images
username@hostname:~$ docker-compose build

Start your instance

# Create log files
username@hostname:~$ touch nopaque.log nopaqued.log
# For background execution, add the -d flag. To scale the app, add --scale web=<NUM-INSTANCES>.
username@hostname:~$ docker-compose up
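
The -d and --scale options mentioned in the comment above can be combined. The following sketch wraps the relevant docker-compose calls in hypothetical helper functions; the function names and the default of two web instances are assumptions chosen for illustration.

```shell
#!/bin/sh
# Assumption: two web instances are wanted; adjust NUM_INSTANCES as needed.
NUM_INSTANCES="${NUM_INSTANCES:-2}"

# Start all services in the background with the requested number of
# web instances.
start_detached() {
  docker-compose up -d --scale "web=${NUM_INSTANCES}"
}

# Follow the log output of the web service while it runs in the background.
follow_web_logs() {
  docker-compose logs -f web
}

# Stop the instance and remove its containers.
stop_instance() {
  docker-compose down
}
```

Run these from the repository directory, for example start_detached followed by follow_web_logs. Use stop_instance to shut the instance down again.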