{% extends "nopaque.html.j2" %} {% set full_width = False %} {% set roadmap = True %} {% block page_content %}
Tokenization

Splits a text into sentences and words. This step is required for all further processing.

 
Lemmatization

Reduces the inflected forms of a word to its base form (lemma).

 
Part-of-speech tagging

Assigns each word and punctuation mark to a part of speech, based on its definition and its context.

 
Named entity recognition

Identifies words that denote an entity, such as company names and personal names.

Submit a job

{{ add_job_form.hidden_tag() }}
{{ add_job_form.title(data_length='32') }} {{ add_job_form.title.label }} {% for error in add_job_form.title.errors %} {{ error }} {% endfor %}
{{ add_job_form.language() }} {{ add_job_form.language.label }} {% for error in add_job_form.language.errors %} {{ error }} {% endfor %}
{{ add_job_form.version() }} {{ add_job_form.version.label }} {% for error in add_job_form.version.errors %} {{ error }} {% endfor %}
{{ add_job_form.files.label.text }} {{ add_job_form.files(accept='text/plain') }}
{% for error in add_job_form.files.errors %} {{ error }} {% endfor %}
{{ add_job_form.description(data_length='255') }} {{ add_job_form.description.label }} {% for error in add_job_form.description.errors %} {{ error }} {% endfor %}
Check Encoding

Enable this switch if your input files were not created with the nopaque OCR service, or if you do not know whether your text files are UTF-8 encoded. We will then try to detect the correct encoding of your texts automatically before processing them.

{% endblock %}