{% extends "base.html.j2" %} {% from "services/_breadcrumbs.html.j2" import breadcrumbs with context %} {% import "materialize/wtf.html.j2" as wtf %} {% block main_attribs %} class="service-scheme" data-service="tesseract-ocr"{% endblock main_attribs %} {% block page_content %}

{{ title }}

 

 

layersOCR

In this process, nopaque converts your image data – like photos or scans – into text data. This step enables you to proceed with the computational analysis of your documents.

Submit a job

{{ form.hidden_tag() }}
{{ wtf.render_field(form.title, data_length='32', material_icon='title') }}
{{ wtf.render_field(form.description, data_length='255', material_icon='description') }}
{{ wtf.render_field(form.files, accept='application/pdf', placeholder='Choose your .pdf files') }}
{{ wtf.render_field(form.model, material_icon='language') }}
{{ wtf.render_field(form.version, material_icon='apps') }}
Preprocessing

{{ form.binarization.label.text }}

Based on a brightness threshold pixels are converted into either black or white. It is useful to reduce noise in images. (longer duration)

 

 

Page range

 

 

Page rotation

 

 

Page split

{{ wtf.render_field(form.submit, material_icon='send') }}
{% endblock page_content %} {% block modals %} {{ super() }} {% endblock modals %} {% block scripts %} {{ super() }} {% endblock scripts %}