2020-02-10 12:25:15 +00:00
|
|
|
|
{% extends "nopaque.html.j2" %}
|
|
|
|
|
|
2020-02-10 15:06:08 +00:00
|
|
|
|
{% set full_width = False %}
|
2020-02-10 12:25:15 +00:00
|
|
|
|
{% set roadmap = False %}
|
|
|
|
|
|
|
|
|
|
{% block page_content %}
|
2020-02-10 15:06:08 +00:00
|
|
|
|
<style>
|
|
|
|
|
input::placeholder {
|
|
|
|
|
color: black;
|
|
|
|
|
font-style: italic;
|
|
|
|
|
}
|
|
|
|
|
</style>
|
|
|
|
|
|
|
|
|
|
<div class="col s9">
|
2020-02-10 12:25:15 +00:00
|
|
|
|
<div class="card">
|
|
|
|
|
<div class="card-content">
|
|
|
|
|
<span class="card-title"><i class="material-icons left">burst_mode</i>Setup files</span>
|
|
|
|
|
<p>
|
|
|
|
|
Häufig liegen Datenbestände in verschiedenen Formaten und verstreut
|
|
|
|
|
vor. Da eine Verarbeitung via nopaque ein einheitliches Datenformat
|
|
|
|
|
vorsieht, wird dieser Dienst zur Verfügung gestellt, um etwaig
|
|
|
|
|
anfallende Konvertierungsprozesse durchzuführen.
|
|
|
|
|
</p>
|
2020-02-10 15:06:08 +00:00
|
|
|
|
<div class="row">
|
|
|
|
|
<div class="col s9">
|
|
|
|
|
<div class="file-field input-field">
|
|
|
|
|
<div class="btn">
|
|
|
|
|
<span>File</span>
|
|
|
|
|
<input type="file" multiple>
|
|
|
|
|
</div>
|
|
|
|
|
<div class="file-path-wrapper">
|
|
|
|
|
<input class="file-path validate" type="text" placeholder="Bilder, Fotos, Scans…">
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
<div class="col s3 right-align">
|
|
|
|
|
<p> </p>
|
|
|
|
|
<button class="btn waves-effect waves-light"type="submit">Submit<i class="material-icons right">send</i></button>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
2020-02-10 12:25:15 +00:00
|
|
|
|
<blockquote>Umgesetzt mit <i>ImageMagick</i></blockquote>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
|
|
|
|
|
<div class="col s3">
|
|
|
|
|
<div class="card">
|
|
|
|
|
<div class="card-content">
|
|
|
|
|
<span class="card-title">Ausgabe</span>
|
|
|
|
|
<p>Aus den Eingaben zusammengesetzte Multipage-TIFF-Dateien.</p>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
|
|
|
|
|
<div class="col s12"></div>
|
|
|
|
|
|
2020-02-10 15:06:08 +00:00
|
|
|
|
<div class="col s9">
|
2020-02-10 12:25:15 +00:00
|
|
|
|
<div class="card">
|
|
|
|
|
<div class="card-content">
|
|
|
|
|
<span class="card-title"><i class="material-icons left">find_in_page</i>Optical Character Recognition</span>
|
|
|
|
|
<p>
|
2020-02-10 15:06:08 +00:00
|
|
|
|
Durch optische Analysemethoden werden aus Bilddaten, wie Fotos oder
|
|
|
|
|
Scans, Textdaten erzeugt. Erst dieser Vorverarbeitungsschritt
|
2020-02-10 12:25:15 +00:00
|
|
|
|
ermöglicht eine weitere computergestützte Verarbeitung von Dokumenten.
|
|
|
|
|
</p>
|
2020-02-10 15:06:08 +00:00
|
|
|
|
<div class="row">
|
|
|
|
|
<div class="col s9">
|
|
|
|
|
<div class="file-field input-field">
|
|
|
|
|
<div class="btn">
|
|
|
|
|
<span>File</span>
|
|
|
|
|
<input type="file" multiple>
|
|
|
|
|
</div>
|
|
|
|
|
<div class="file-path-wrapper">
|
|
|
|
|
<input class="file-path validate" type="text" placeholder="Multipage-TIFF- oder PDF-Dateien">
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
<div class="col s3 right-align">
|
|
|
|
|
<p> </p>
|
|
|
|
|
<button class="btn waves-effect waves-light"type="submit">Submit<i class="material-icons right">send</i></button>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
2020-02-10 12:25:15 +00:00
|
|
|
|
<blockquote>Umgesetzt mit <i>Tesseract OCR</i></blockquote>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
|
|
|
|
|
<div class="col s3">
|
|
|
|
|
<div class="card">
|
|
|
|
|
<div class="card-content">
|
|
|
|
|
<span class="card-title">Ausgabe</span>
|
|
|
|
|
<p>
|
|
|
|
|
Textdateien, PDF-Dateien und TEI P5 konformen XML-Dateien.
|
|
|
|
|
</p>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
|
|
|
|
|
<div class="col s12"></div>
|
|
|
|
|
|
2020-02-10 15:06:08 +00:00
|
|
|
|
<div class="col s9">
|
2020-02-10 12:25:15 +00:00
|
|
|
|
<div class="card">
|
|
|
|
|
<div class="card-content">
|
|
|
|
|
<span class="card-title"><i class="material-icons left">format_textdirection_l_to_r</i>Natural Language Processing</span>
|
|
|
|
|
<p>
|
|
|
|
|
Mit Hilfe computergestützter linguistischer
|
|
|
|
|
Datenverarbeitungsmethoden (Tokenisierung, Lemmatisierung,
|
|
|
|
|
Part-of-speech-Tagging und Eigennamenerkennung) werden Textdateien
|
|
|
|
|
mit weiteren Informationen angereichert.
|
|
|
|
|
</p>
|
2020-02-10 15:06:08 +00:00
|
|
|
|
<div class="row">
|
|
|
|
|
<div class="col s9">
|
|
|
|
|
<div class="file-field input-field">
|
|
|
|
|
<div class="btn">
|
|
|
|
|
<span>File</span>
|
|
|
|
|
<input type="file" multiple>
|
|
|
|
|
</div>
|
|
|
|
|
<div class="file-path-wrapper">
|
|
|
|
|
<input class="file-path validate" type="text" placeholder="Textdateien">
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
<div class="col s3 right-align">
|
|
|
|
|
<p> </p>
|
|
|
|
|
<button class="btn waves-effect waves-light"type="submit">Submit<i class="material-icons right">send</i></button>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
2020-02-10 12:25:15 +00:00
|
|
|
|
<blockquote>Umgesetzt mit <i>spaCy</i></blockquote>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
|
|
|
|
|
<div class="col s3">
|
|
|
|
|
<div class="card">
|
|
|
|
|
<div class="card-content">
|
|
|
|
|
<span class="card-title">Ausgabe</span>
|
2020-02-10 15:06:08 +00:00
|
|
|
|
<p>Korpusdateien im <i>verticalized text</i>-Format (XML-Dialekt).</p>
|
2020-02-10 12:25:15 +00:00
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
|
|
|
|
|
<div class="col s12"></div>
|
|
|
|
|
|
2020-02-10 15:06:08 +00:00
|
|
|
|
<div class="col s9">
|
2020-02-10 12:25:15 +00:00
|
|
|
|
<div class="card">
|
|
|
|
|
<div class="card-content">
|
|
|
|
|
<span class="card-title"><i class="material-icons left">search</i>Corpus Analysis</span>
|
|
|
|
|
<p>
|
|
|
|
|
Mittels CQP Query Language als Abfragesprache können komplexe
|
|
|
|
|
Suchanfragen unter Zuhilfenahme von Metadaten und NLP-Auszeichnungen
|
|
|
|
|
ausgeführt werden. Ergebnisse können als Text oder in abstrakter
|
|
|
|
|
Darstellung ausgewertet werden.
|
|
|
|
|
</p>
|
2020-02-10 15:06:08 +00:00
|
|
|
|
<div class="input-field">
|
|
|
|
|
<i class="material-icons prefix">search</i>
|
|
|
|
|
<input class="search" placeholder='"fox" "jumps" "over" []* "dog"' type="search"></input>
|
|
|
|
|
</div>
|
|
|
|
|
<p>
|
2020-02-10 15:07:26 +00:00
|
|
|
|
<i class="material-icons left" style="padding-left: 10px;">subdirectory_arrow_right</i>
|
2020-02-10 15:06:08 +00:00
|
|
|
|
<span class="chip">The</span> <span class="chip">quick</span>
|
|
|
|
|
<span class="chip">brown</span> <span class="chip">fox</span>
|
|
|
|
|
<span class="chip">jumps</span> <span class="chip">over</span>
|
|
|
|
|
<span class="chip">the</span> <span class="chip">lazy</span>
|
|
|
|
|
<span class="chip">dog</span> <span class="chip">.</span>
|
|
|
|
|
</p>
|
2020-02-10 12:25:15 +00:00
|
|
|
|
<blockquote>Umgesetzt mit <i>IMS Open Corpus Workbench</i></blockquote>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
|
|
|
|
|
<div class="col s3">
|
|
|
|
|
<div class="card">
|
|
|
|
|
<div class="card-content">
|
|
|
|
|
<span class="card-title">Ausgabe</span>
|
|
|
|
|
<p>Export der Ergebnisse in CSV, Excel, JSON und HTML.</p>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
</div>
|
|
|
|
|
{% endblock %}
|