{% extends "nopaque.html.j2" %} {% import 'materialize/wtf.html.j2' as wtf %} {% from '_colors.html.j2' import colors %} {% set scheme_primary_color = colors.nlp_darken %} {% set scheme_secondary_color = colors.nlp %} {% block nav_content %} {% include 'services/_breadcrumbs.html.j2' %} {% endblock nav_content %} {% block main_attribs %} class="nlp-color lighten"{% endblock main_attribs %} {% block page_content %}

{{ title }}

 

 

layersTokenization

Your text is split up into sentences and words, so called tokens, which can then be analyzed.

layersLemmatization

All inflected forms of a word are grouped together so that it can be analyzed as a single item.

layersPart-of-speech Tagging

In accordance with its definition and context, each word is marked up as corresponding to a particular part of speech.

layersNamed-Entity Recognition

Named entities are located and classified into specific categories like persons or locations.

Submit a job

{{ form.hidden_tag() }}
{{ wtf.render_field(form.title, data_length='32', material_icon='title') }}
{{ wtf.render_field(form.description, data_length='255', material_icon='description') }}
{{ wtf.render_field(form.files, accept='text/plain', placeholder='Choose your .txt files') }}
{{ wtf.render_field(form.language, material_icon='language') }}
{{ wtf.render_field(form.version, material_icon='apps') }}
Preprocessing

{{ form.check_encoding.label.text }}

If the input files are not created with the nopaque OCR service or you do not know if your text files are UTF-8 encoded, check this switch. We will try to automatically determine the right encoding for your texts to process them.

{{ wtf.render_field(form.submit, material_icon='send') }}
{% endblock %}