Add unified academic agent(notes-parser + summariser), OCR-enabled file parsing, and PDF export endpoints #2

Abhyuday-06 · 2025-09-15T18:20:21Z

Description

Added AcademicAgent that can parse notes and create summaries in one step. Also supports parse_only and summarize_only.
New API routes:
- Parse: POST /parse, POST /parse-file
- Summarize: POST /summarize, POST /summarize-file
- PDF export: POST /parse/export-pdf, POST /parse-file/export-pdf, POST /summarize/export-pdf, POST /summarize-file/export-pdf
File support: PDF, DOCX, PPTX, TXT (and MD for summarize). For scanned PDFs, OCR uses Tesseract + pdf2image. Optional OpenCV/Pillow preprocessing helps with deskew and noise.
The agent uses Ollama (Mistral 7B). It also picks out keywords, concepts, study questions, and key points from summaries.
Added structured logging, CORS, clear errors, file size and content length checks, and safe temp-file cleanup.
Health check verifies Ollama. Root route shows basic API info. Startup/shutdown logs added.
PDF reports generated via pdf_exporter.
OpenAI client is set up for future use if we add another model provider.

Major Features: - Advanced OCR pipeline for handwritten/scanned documents - Multi-strategy OCR with quality scoring - Image preprocessing with OpenCV and PIL - AI-powered text correction using LLM - Full extracted text debugging output - Improved file handling and validation Configuration: - Increased file size limit to 100MB - Enhanced OCR settings and preprocessing options - Better error handling and logging File Management: - Added .gitignore with comprehensive rules - Created local_files directory for uploads - Removed unused test files for cleaner codebase - Added support for PPTX files Performance: - Optimized prompts for better extraction quality - Reduced token usage for faster processing - Enhanced plausibility checks for handwritten content

- Remove .env file containing API keys and secrets - Remove all __pycache__ directories and .pyc files - Keep only essential source code and documentation - These files are properly ignored in .gitignore

- Merged parsing and summarizing into single AcademicAgent - Simplified API endpoints: /parse, /parse-file, /summarize, /summarize-file - Added comprehensive PDF export functionality with formatted reports - Moved legacy files to legacy/ folder for backup - Removed context limits for better full-content processing - Enhanced summarization prompts to use bullet point format - Added reportlab and markdown dependencies for PDF generation - Maintained backward compatibility with existing functionality New Features: - /parse/export-pdf and /parse-file/export-pdf for parsing results - /summarize/export-pdf and /summarize-file/export-pdf for summary results - Professional PDF formatting with metadata, tables, and proper styling - Enhanced OCR support and file processing capabilities Breaking Changes: - Removed multiple endpoint variations - Consolidated agents into single unified system

- Updated to reflect new unified AcademicAgent architecture - Added comprehensive documentation for PDF export functionality - Simplified API endpoint documentation - Added new usage examples with PDF generation - Documented architectural changes and legacy support - Updated project structure to show new organization - Added 'What's New' section highlighting September 2025 updates - Enhanced configuration section with PDF export settings - Updated acknowledgments to include ReportLab

Abhyuday-06 added 7 commits August 9, 2025 16:28

Initial commit

8291569

Initial commit

7e12004

Updated README.md

b8dee57

Remove sensitive files and cache directories

2c72789

- Remove .env file containing API keys and secrets - Remove all __pycache__ directories and .pyc files - Keep only essential source code and documentation - These files are properly ignored in .gitignore

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add unified academic agent(notes-parser + summariser), OCR-enabled file parsing, and PDF export endpoints #2

Add unified academic agent(notes-parser + summariser), OCR-enabled file parsing, and PDF export endpoints #2

Uh oh!

Abhyuday-06 commented Sep 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add unified academic agent(notes-parser + summariser), OCR-enabled file parsing, and PDF export endpoints #2

Are you sure you want to change the base?

Add unified academic agent(notes-parser + summariser), OCR-enabled file parsing, and PDF export endpoints #2

Uh oh!

Conversation

Abhyuday-06 commented Sep 15, 2025

Description

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant