{% extends "base.html.j2" %} {% from "services/_breadcrumbs.html.j2" import breadcrumbs with context %} {% import "materialize/wtf.html.j2" as wtf %} {% block main_attribs %} class="service-scheme" data-service="tesseract-ocr-pipeline"{% endblock main_attribs %} {% block page_content %}

{{ title }}

 

 

layersOCR

In this process, nopaque converts your image data – like photos or scans – into text data. This step enables you to proceed with the computational analysis of your documents.

Submit a job

{{ form.hidden_tag() }}
{{ wtf.render_field(form.title, material_icon='title') }}
{{ wtf.render_field(form.description, material_icon='description') }}
{{ wtf.render_field(form.pdf, accept='application/pdf', placeholder='Choose a PDF file') }}
language {{ form.model() }} {{ form.model.label }} More details about models {% for error in form.model.errors %} {{ error }} {% endfor %}
{{ wtf.render_field(form.version, material_icon='apps') }}
Preprocessing

{{ form.binarization.label.text }}

Based on a brightness threshold pixels are converted into either black or white. It is useful to reduce noise in images. (longer duration)

 

 

Page range

 

 

Page rotation

 

 

Page split

{{ wtf.render_field(form.submit, material_icon='send') }}
{% endblock page_content %} {% block modals %} {{ super() }} {% endblock modals %}