Go to file
2023-02-15 11:37:09 +01:00
.vscode Add .vscode config files 2023-01-25 16:03:28 +01:00
app Merge branch 'public-corpus' into development 2023-02-15 11:32:44 +01:00
docker-compose Move docker-compose example files in subdirectory 2022-07-15 12:22:19 +02:00
logs Add log folder via dummy file to repo 2022-02-08 15:41:56 +01:00
migrations Add missing migrationscript messages 2023-02-06 11:48:01 +01:00
tests Fix some bugs in tests 2021-11-15 14:10:28 +01:00
.dockerignore Update Dockerfile logic 2022-11-11 14:59:14 +01:00
.env.tpl Update Flask Hashids 2023-01-20 08:58:41 +01:00
.flaskenv Simplify Config setup and move some functions to dedicated files 2021-09-15 12:31:53 +02:00
.gitignore Update Dockerfile logic 2022-11-11 14:59:14 +01:00
boot.sh Update Dockerfile logic 2022-11-11 14:59:14 +01:00
config.py Update Flask Hashids 2023-01-20 08:58:41 +01:00
db.env.tpl Next big config update. Check the .env.tpl and db.env.tpl 2020-10-21 13:07:10 +02:00
docker-compose.yml Add new config option. DOCKER_NETWORK_NAME 2022-10-12 15:32:47 +02:00
Dockerfile Downgrade Python version to match Python current Ubuntu LTS package version 2023-01-25 16:03:09 +01:00
LICENSE Big Corpus analysis update 2021-11-16 15:23:57 +01:00
nopaque.py Add m2m corpus-follower table 2023-01-23 11:50:12 +01:00
README.md Move docker-compose example files in subdirectory 2022-07-15 12:22:19 +02:00
requirements.txt Bump Flask-Hashids version 2023-02-15 10:42:55 +01:00

nopaque

nopaque bundles various tools and services that provide humanities scholars with DH methods and thus can support their various individual research processes. Using nopaque, researchers can subject digitized sources to Optical Character Recognition (OCR). The resulting text files can then be used as a data basis for Natural Language Processing (NLP). The texts are automatically subjected to various linguistic annotations. The data processed via NLP can then be summarized in the web application as corpora and analyzed by means of an information retrieval system through complex search queries. The range of functions of the web application will be successively extended according to the needs of the researchers.

Prerequisites and requirements

  1. Install docker for your system. Following the official instructions.
  2. Install docker-compose. Following the official instructions.

Configuration and startup

Create Docker swarm

The generated computational workload is handled by a Docker swarm. A swarm is a group of machines that are running Docker and joined into a cluster. It consists out of two different kinds of members, manager and worker nodes. The swarm setup process is described best in the Docker documentation.

Create network storage

A shared network space is necessary so that all swarm members have access to all the data. To achieve this a samba share can be used.

You can create a samba share by using this Docker image.

# Mount the Samba share on all swarm nodes (managers and workers)
username@hostname:~$ sudo mkdir /mnt/nopaque
username@hostname:~$ sudo mount --types cifs --options gid=${USER},password=nopaque,uid=${USER},user=nopaque,vers=3.0 //<SAMBA-SERVER-IP>/<SAMBA-SHARE-NAME> /mnt/nopaque

Download, configure and build nopaque

# Clone the nopaque repository
username@hostname:~$ git clone https://gitlab.ub.uni-bielefeld.de/sfb1288inf/nopaque.git
# Create data directories
username@hostname:~$ mkdir data/{db,logs,mq}
username@hostname:~$ cp db.env.tpl db.env
username@hostname:~$ cp .env.tpl .env
# Fill out the variables within these files.
username@hostname:~$ <YOUR EDITOR> db.env
username@hostname:~$ <YOUR EDITOR> .env
# Create docker-compose.override.yml file
username@hostname:~$ touch docker-compose.override.yml
# Tweak the docker-compose.override.yml to satisfy your needs. (You can find examples inside the docker-compose directory)
username@hostname:~$ <YOUR EDITOR> docker-compose.override.yml
# Build docker images
username@hostname:~$ docker-compose build

Start your instance

# For background execution add the -d flag
docker-compose up
# To scale your app use the following command after starting it normally
docker-compose \
  -f docker-compose.yml \
  -f docker-compose.override.yml \
  -f docker-compose.scale.yml \
  up \
    -d \
    --no-recreate \
    --scale nopaque=<NUM_INSTANCES>