diff --git a/README.md b/README.md
index 27b8b78..d65fd56 100644
--- a/README.md
+++ b/README.md
@@ -12,9 +12,9 @@ The data can be downloaded here: https://uni-bielefeld.sciebo.de/s/9p55VIn9OLmNq
 
 Note that there are currently **two** versions of all the data available.
 
-Version _1.0\_data_ contains the data described and used for the original master thesis. The master thesis can be viewed here. Protocols for the periods 15, 16 and 17 are erroneous.
+Version _1.0\_data_ contains the data described and used for the original master thesis. The master thesis can be viewed here. Protocols for the periods 15, 16 and 17 are erroneous. Therefore the ngrams for those periods are erroneous as well.
 
-Version _1.1\_data_ contains new calculated ngrams based on new officlally released xml protocols. Also some fixes regarding the markup have been introduced. The erroneous protocols have been fixed officially from the Bundesregierung.
+Version _1.1\_data_ contains the new officially released and corrected xml protocols. The protocols have been corrected by the Bundesregierung. Ngrams have been calculated again on the basis of the new protocols. Also some fixes regarding the markup have been introduced.
 
 ```
 .