Document OCR

GitHub Repository: https://github.com/junwai7159/Document-OCR

About this Project

This project is part of the SJTU ICE4309 - Image Processing & Content Analysis course.

We implemeted an 3-stage Optical Character Recognition (OCR) framework for converting in-the-wild documents to digitally readable and recognizable text.

Architecture of Document OCR

document_ocr

First Stage: Preprocessing

The images undergo preprocessing, including edge detection, contour detection, perspective transformation and binarization to further enhance the image.