From cd172f11a2a5e1fb2bfbce8b372ac3e65d4349fa Mon Sep 17 00:00:00 2001
From: Stephan Porada
Date: Tue, 26 Feb 2019 19:54:42 +0100
Subject: [PATCH] Update README.md

---
 README.md | 12 ++++++------
 1 file changed, 6 insertions(+), 6 deletions(-)

diff --git a/README.md b/README.md
index 9fbdb7c..d322a39 100644
--- a/README.md
+++ b/README.md
@@ -16,7 +16,7 @@ Structure:
 │   ├── backup_raw_xml ### Zip files of all original protocols.
 │   ├── current_official_protocols_xml ### Example file of the new official markup.
 │   ├── development_data_xml ### Set of original xml protocols used for development.
-│   ├── faulty_raw_xml ### All original protocolls with errors. The Bundesregierung should have fixed those by now. The Software mentioned above used these faulty ones though because the new ones were not available back then.
+│   ├── faulty_raw_xml ### All original protocols with errors. The Bundesregierung should have fixed those by now. The Software mentioned above used these faulty ones though because the new ones were not available back then.
 │   │   ├── 15_Wahlperiode_2002-2005
 │   │   ├── 16_Wahlperiode_2005-2009
 │   │   └── 17_Wahlperiode_2009-2013
@@ -41,7 +41,7 @@ Structure:
 ├── outputs ### These are the files an data produced using the software from https://gitlab.ub.uni-bielefeld.de/sporada/bundesdata_markup_nlp_software
 │   ├── markup ### Contains all automatically marked protocols.
 │   │   ├── dev_data ### Automatically marked dev_data protocols.
-│   │   │   ├── beautiful_xml ### Final output: humanreadable automatically marked protocolls.
+│   │   │   ├── beautiful_xml ### Final output: humanreadable automatically marked protocols.
 │   │   │   ├── clear_speech_markup ### Tmp data
 │   │   │   ├── complex_markup ### Tmp data
 │   │   │   ├── new_metadata ### Tmp data
@@ -66,14 +66,14 @@ Structure:
 │   │   │   ├── 17_Wahlperiode_2009-2013_faulty
 │   │   │   └── 18_Wahlperiode_2013-2017
 │   │   └── test_data ### Automatically marked test_data protocols.
-│   │   ├── beautiful_xml ### Final output: humanreadable automatically marked protocolls.
+│   │   ├── beautiful_xml ### Final output: humanreadable automatically marked protocols.
 │   │   ├── clear_speech_markup ### Tmp data
 │   │   ├── complex_markup ### Tmp data
 │   │   ├── new_metadata ### Tmp data
 │   │   └── simple_xml ### Tmp data
-│   └── nlp ### All data created from the automatically marked protocolls.
-│   └── full_periods ### Contains data created from all protocolls.
-│   ├── n-grams ### N-Gramm data based on protocolls (sibling of this folder).
+│   └── nlp ### All data created from the automatically marked protocols.
+│   └── full_periods ### Contains data created from all protocols.
+│   ├── n-grams ### N-Gramm data based on protocols (sibling of this folder).
 │   │   ├── lm_ns_speaker ### N-grams from lemmatized protocols without stop words counted by speaker.
 │   │   │   ├── 1_grams
 │   │   │   ├── 2_grams