Transformer-Based OCR

Tags: ocr pipeline recognition document

June 8th 2022

View original

As you probably already know, Optical Character Recognition (OCR) is the electronic conversion of images of typed, handwritten, or printed text into machine-encoded text. The source can be a scanned document, a photo of a document, or a subtitle text imposed on an image. OCR converts such sources into machine-readable text.

10 Mind-Blowing Facts About Vortex Cl…
Best Baby toys 0 â€“ 6 months
Top Reasons to Switch to an Electric …
Appleâ€™s MM1 Large Language Model Bl…
TP-Link Tapo C100 Home Security Wi-Fi…

Let’s understand how an Ocr Pipeline works before we dig deeper into Transformer Based OCR.

A typical OCR pipeline consists of two modules.
1. A Text Detection Module
2. A Text Recognition Module

Read the complete article

This post first appeared on How To Stop Manual Data Entry And OCR From Stalling Your Intelligent Automation Strategy, please read the originial post: here