Optical Character Recognition Engine to extract Food-items and Prices from Grocery Receipt Images via Templating and Dictionary-Traversal Technique
@inproceedings{sohani2018optical,
author = {Ullah, Rafi and Sohani, Ali and Ali, Faraz and Rai, Athaul},
year = {2018},
month = {03},
pages = {},
title = {Optical Character Recognition Engine to extract Food-items and Prices from Grocery Receipt Images via Templating and Dictionary-Traversal Technique}
}
Receipt detection | Receipt localization | Receipt normalization | Text line segmentation | Optical character recognition | Semantic analysis |
---|---|---|---|---|---|
❌ | ✔️ | ✔️ | ❌ | ❗ | ✔️ |
- Image Background Removal
-
Otsu’s Image binarization
-
Image and Text De-skewing
-
Image Resizing
- Tesseract OCR Library
-
Fields extracted:
- item names,
- item quantities,
- item prices,
-
Section-level extraction
-
Our system then bifurcates the image into sections (like Price, Item-Names, Quantity are dealt separately from one another) according to given template layouts
-
We have used the stored templates of stores in database. And while testing the image, we retrieved that specific store template. Store template have the (x, y) coordinate points, width and height information.
-
-
We had the location of items, prices and quantity in the template. We used this knowledge to retrieve image’s specific portion.
-
Using above technique we retrieved the image portions/sections using template information independent of requested image size, whether greater or lesser than template image.
-
Context sensitive spelling correction is a technique of correcting OCR results by matching them with dictionaries of stores.
-
store specific and grocery related dictionary
-
Words such as tax, total, subtotal, discount etc are included in that corpus. These words are excluded at the very first stage from OCR result.