Update README.md

Stephan Porada 2020-07-28 10:28:22 +02:00
parent 9a39ccc7e0
commit 74e243c6ad

@@ -4,7 +4,7 @@ This is just a repository providing the link to the data used and created by the
Please read the description of that project to understand what kind of data this is. The project is part of a master's thesis, which can be read [here](https://gitlab.ub.uni-bielefeld.de/sporada/bundesdata_web_app/raw/24641c2959796659d428514c9cdd3782d4248da0/2019-02-04_Stephan_Porada_Masterthesis_semi.pdf?inline=false).
-The data can be downloaded here: https://uni-bielefeld.sciebo.de/s/9p55VIn9OLmNqa9
+The data can be downloaded here: https://uni-bielefeld.sciebo.de/s/I93l9QNZKLUTv3S
**Size**: around 70GB
@@ -14,7 +14,7 @@ Note that there are currently **two** versions of all the data available.
Version _1.0\_data_ contains the data described and used for the original master's thesis. The master's thesis can be viewed here. Protocols for the periods 15, 16 and 17 are erroneous. Therefore the ngrams for those periods are erroneous as well.
-Version _1.1\_data_ contains the new officially released and corrected xml protocols. The protocols have been corrected by the Bundesregierung. Ngrams have been calculated again on the basis of the new protocols. Also some fixes regarding the markup have been introduced.
+Version _1.1\_data_ contains the new officially released and corrected xml protocols. The protocols have been corrected by the Bundesregierung. Ngrams will be calculated again on the basis of the new protocols in the near future. Also some fixes regarding the markup will be introduced.
``` ```
. .