A Unified Toolkit for Deep Learning Based Document Image Analysis
-
Updated
Aug 15, 2024 - Python
A Unified Toolkit for Deep Learning Based Document Image Analysis
Novalad offers a unified, centralized platform enabling organizations to extract meaningful data and perform advanced processing at high speed.
PdfDet aims to simplify PDF layout detect tasks for users.
Extracting structured text from GI Bill index cards for JDoc 2023 paper
Layout Parser notebook Implementation & Re-trained model for Image detection and extraction
A lightweight Python library for metadata-rich document chunking in Retrieval-Augmented Generation (RAG) workflows. It leverages Azure AI Document Intelligence to enhance chunking by retaining hierarchical structure, page numbers, and bounding boxes for seamless integration with PDF viewers.
Yolo & Layout Parser & Detectron2
Add a description, image, and links to the layout-parser topic page so that developers can more easily learn about it.
To associate your repository with the layout-parser topic, visit your repo's landing page and select "manage topics."