diff --git a/README.md b/README.md index 07ce439..bafe61e 100644 --- a/README.md +++ b/README.md @@ -12,7 +12,7 @@ The data can be downloaded here: https://uni-bielefeld.sciebo.de/s/I93l9QNZKLUTv Note that there are currently **two** versions of all the data available. -Version _1.0\_data_ contains the data described and used for the original master thesis. The master thesis can be viewed here. Protocols for the periods 15, 16 and 17 are erroneous. Therfore the ngrams for those periods are erroneous as well. +Version _1.0\_data_ contains the data described and used for the original master thesis. The master thesis can be viewed [here](https://gitlab.ub.uni-bielefeld.de/sporada/bundesdata_web_app/raw/24641c2959796659d428514c9cdd3782d4248da0/2019-02-04_Stephan_Porada_Masterthesis_semi.pdf?inline=false). Protocols for the periods 15, 16 and 17 are erroneous. Therfore the ngrams for those periods are erroneous as well. Version _1.1\_data_ contains the new officlally released and corrected xml protocols. The protocols have been corrected by the Bundesregierung. Ngrams will be calculated again on the basis of the new protocols in the near future. Also some fixes regarding the markup will be introduced.