OCR
Optical Character Recognition
OCR (Optical Character Recognition) converts images of text — scanned documents, photos, PDFs — into machine-readable characters. It was the first step in automating document handling, letting systems "read" a scanned bill of lading or invoice.
Classic OCR reads characters from fixed positions and struggles when layouts change. Modern Document AI goes further, understanding what each field means and validating it — but OCR remains the underlying step that turns pixels into text.
Also known as
Optical Character Recognition
Related terms
Where this matters at WHIZTEC
Frequently asked
Is OCR the same as Document AI?
No. OCR turns an image into text. Document AI adds understanding — knowing what an HBL number or HS code is, extracting it from any layout, and validating it against your data.