diff --git a/README.md b/README.md
index 180eeb6..bafffb7 100644
--- a/README.md
+++ b/README.md
@@ -8,7 +8,10 @@ The data can be downloaded here: https://uni-bielefeld.sciebo.de/s/9p55VIn9OLmNq
 Size: around 70GB
-Structure:
+Structure:
+Note that there are currently **two** versions of all the data available.
+Version 1.0 data contains the data described and used for the original master thesis. The master thesis can be viewed here.
+Version 1.1 data contains newly calculated ngrams based on new, officially released XML protocols. Some fixes have also been introduced.
 ```
 .