{% extends "base.html.j2" %} {% from "services/_breadcrumbs.html.j2" import breadcrumbs with context %} {% import "materialize/wtf.html.j2" as wtf %} {% block main_attribs %} class="service-scheme" data-service="tesseract-ocr-pipeline"{% endblock main_attribs %} {% block page_content %}

{{ title }}

 

 

layersOCR

In this process, nopaque converts your image data – like photos or scans – into text data. This step enables you to proceed with the computational analysis of your documents.

Submit a job

{{ form.hidden_tag() }}
{{ wtf.render_field(form.title, material_icon='title') }}
{{ wtf.render_field(form.description, material_icon='description') }}
{{ wtf.render_field(form.pdf, accept='application/pdf', placeholder='Choose a PDF file') }}
language {{ form.model() }} {{ form.model.label }} help_outline new_label {% for error in form.model.errors %} {{ error }} {% endfor %}
{{ wtf.render_field(form.version, material_icon='apps') }}
Preprocessing
{% if 'disabled' not in form.binarization.render_kw or not form.binarization.render_kw['disabled'] %}

{{ form.binarization.label.text }}

Based on a brightness threshold pixels are converted into either black or white. It is useful to reduce noise in images. (longer duration)

{% if 'disabled' not in form.ocropus_nlbin_threshold.render_kw or not form.ocropus_nlbin_threshold.render_kw['disabled'] %}

Intensity (between 0 and 1)

{{ form.ocropus_nlbin_threshold() }}

{% endif %} {% endif %}
{{ wtf.render_field(form.submit, material_icon='send') }}
{% endblock page_content %} {% block modals %} {{ super() }} {% endblock modals %} {% block scripts %} {{ super() }} {% endblock scripts %}