Skip to main content

Tesseract

Description

OCR: Tesseract plugin step detects and extracts text from an image to a readable text type. Supported image types: BMP, PNG, JPG, JPEG. Compatibility: Tesseract version 4.0.0.

Prerequisites:

Configurations

No.Field NameDescription
1Step NameName of the step. This name has to be unique in a single workflow.
Input Fields
1Data Folder PathSpecify the Tesseract data folder path or click the Browse button to browse the folder path (data folder path is mentioned in the prerequisites).
The data type is String. This field is mandatory.
2Button: BrowseClicking on this button brings up the dialog to browse the Tesseract data folder path.
3File PathSpecify the path of the input image file to extract readable text. Alternately browse the file path.
Note: Supported image types are BMP, PNG, JPG, JPEG
The data type is String. This field is mandatory.
4Button: BrowseClicking on this button brings up the dialog to browse the image File path.
5Language CodeSpecify Language. (e.g. eng for English, hin for Hindi, urd for Urdu). Multiple languages can be passed. Add ‘+’ sign to extract multi-language output.
For language code refer URL: https://muthu.co/all-tesseract-ocr-options/
Default value is: eng. The data type is String.
6Page Segment ModeSelect Page Segmentation Mode required as per the input file type. Allowed values are 0-13. The data type is String.
Please refer table below for a list of Page Segmentation Mode with a description.
Output Field
1Output TextSpecify an output field to hold converted text on successful plugin execution. The default value is OutputText.
Sr.No.Page Segment ModeDescription
10Orientation and script detection (OSD) only.
21Automatic page segmentation with OSD.
32Automatic page segmentation, but no OSD, or OCR.
43Fully automatic page segmentation, but no OSD. (Default)
54Assume a single column of text of variable sizes.
65Assume a single uniform block of vertically aligned text.
76Assume a single uniform block of text.
87Treat the image as a single text line.
98Treat the image as a single word.
109Treat the image as a single word in a circle.
1110Treat the image as a single character.
1211Sparse text. Find as much text as possible in no particular order.
1312Sparse text with OSD.
1413Raw line. Treat the image as a single text line, bypassing hacks that are Tesseract-specific.