Merge branch 'master' of gitlab.ub.uni-bielefeld.de:sporada/bundesdata_markup_nlp_software

Stephan Porada 2019-06-26 12:43:13 +02:00
commit e0b987c0c5

@@ -7,7 +7,7 @@ which member of parliament hold what speech etc.
This software can mark every protocol from 1949 till 2017 automatically. The
software identifies speakers, their speeches, metadata etc. For detailed information
on why this software was made and how it works, read the corresponding master's thesis
-uploaded [here](#) (It is written in German though).
+uploaded [here](https://gitlab.ub.uni-bielefeld.de/sporada/bundesdata_web_app/raw/24641c2959796659d428514c9cdd3782d4248da0/2019-02-04_Stephan_Porada_Masterthesis_semi.pdf?inline=false) (It is written in German though).
Besides the markup the software can also calculate ngrams for all automatically
marked protocols either from lemmatized or just tokenized text with or without