As you probably already know, Optical Character Recognition (OCR) is the electronic conversion of images of typed, handwritten, or printed text into machine-encoded text. The source can be a scanned document, a photo of a document, or subtitle text superimposed on an image.
Let’s understand how an OCR pipeline works before we dig deeper into transformer-based OCR.
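A typical OCR pipeline consists of two modules:
1. A text detection module, which locates the regions of the image that contain text
2. A text recognition module, which transcribes each detected region into characters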
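To make the two-stage structure concrete, here is a minimal Python sketch of how the modules might be chained. Everything in it is an assumption made for illustration: the names `run_ocr_pipeline`, `toy_detector`, and `toy_recognizer` are hypothetical, the image is a plain list of grayscale pixel rows, and the toy detector and recognizer are trivial stand-ins rather than real models. In a real system each stage would be a trained model.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Image = List[List[int]]          # grayscale image as rows of pixel values (0-255)
Box = Tuple[int, int, int, int]  # (top, left, bottom, right); bottom/right exclusive


@dataclass
class OcrResult:
    box: Box   # where the text was found
    text: str  # what the recognizer read there


def crop(image: Image, box: Box) -> Image:
    """Cut the region described by `box` out of the image."""
    top, left, bottom, right = box
    return [row[left:right] for row in image[top:bottom]]


def run_ocr_pipeline(
    image: Image,
    detect_text: Callable[[Image], List[Box]],
    recognize_text: Callable[[Image], str],
) -> List[OcrResult]:
    """Stage 1 finds text regions; stage 2 transcribes each cropped region."""
    results = []
    for box in detect_text(image):
        region = crop(image, box)
        results.append(OcrResult(box=box, text=recognize_text(region)))
    return results


def toy_detector(image: Image) -> List[Box]:
    """Hypothetical stand-in: one full-width box per run of rows with dark pixels."""
    boxes: List[Box] = []
    width = len(image[0]) if image else 0
    start = None
    for y, row in enumerate(image):
        has_ink = any(pixel < 128 for pixel in row)
        if has_ink and start is None:
            start = y                            # a text line begins
        elif not has_ink and start is not None:
            boxes.append((start, 0, y, width))   # the text line ended
            start = None
    if start is not None:                        # text runs to the last row
        boxes.append((start, 0, len(image), width))
    return boxes


def toy_recognizer(region: Image) -> str:
    """Hypothetical stand-in: reports the ink count instead of reading characters."""
    dark = sum(pixel < 128 for row in region for pixel in row)
    return f"<{dark} dark pixels>"


if __name__ == "__main__":
    page = [
        [255, 255, 255],
        [0, 255, 0],
        [255, 255, 255],
        [0, 0, 0],
        [0, 255, 255],
    ]
    for result in run_ocr_pipeline(page, toy_detector, toy_recognizer):
        print(result.box, result.text)
```

Passing the two stages in as callables keeps them independent, so either one can be replaced (for example, by a transformer-based recognizer) without changing the rest of the pipeline.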
This post first appeared on How To Stop Manual Data Entry And OCR From Stalling Your Intelligent Automation Strategy; please read the original post: here