Skip to content

Conversation

@Abhyuday-06
Copy link

Description

  • Added AcademicAgent that can parse notes and create summaries in one step. Also supports parse_only and summarize_only.
  • New API routes:
    • Parse: POST /parse, POST /parse-file
    • Summarize: POST /summarize, POST /summarize-file
    • PDF export: POST /parse/export-pdf, POST /parse-file/export-pdf, POST /summarize/export-pdf, POST /summarize-file/export-pdf
  • File support: PDF, DOCX, PPTX, TXT (and MD for summarize). For scanned PDFs, OCR uses Tesseract + pdf2image. Optional OpenCV/Pillow preprocessing helps with deskew and noise.
  • The agent uses Ollama (Mistral 7B). It also picks out keywords, concepts, study questions, and key points from summaries.
  • Added structured logging, CORS, clear errors, file size and content length checks, and safe temp-file cleanup.
  • Health check verifies Ollama. Root route shows basic API info. Startup/shutdown logs added.
  • PDF reports generated via pdf_exporter.
  • OpenAI client is set up for future use if we add another model provider.

Major Features:
- Advanced OCR pipeline for handwritten/scanned documents
- Multi-strategy OCR with quality scoring
- Image preprocessing with OpenCV and PIL
- AI-powered text correction using LLM
- Full extracted text debugging output
- Improved file handling and validation

Configuration:
- Increased file size limit to 100MB
- Enhanced OCR settings and preprocessing options
- Better error handling and logging

File Management:
- Added .gitignore with comprehensive rules
- Created local_files directory for uploads
- Removed unused test files for cleaner codebase
- Added support for PPTX files

Performance:
- Optimized prompts for better extraction quality
- Reduced token usage for faster processing
- Enhanced plausibility checks for handwritten content
- Remove .env file containing API keys and secrets
- Remove all __pycache__ directories and .pyc files
- Keep only essential source code and documentation
- These files are properly ignored in .gitignore
- Merged parsing and summarizing into single AcademicAgent
- Simplified API endpoints: /parse, /parse-file, /summarize, /summarize-file
- Added comprehensive PDF export functionality with formatted reports
- Moved legacy files to legacy/ folder for backup
- Removed context limits for better full-content processing
- Enhanced summarization prompts to use bullet point format
- Added reportlab and markdown dependencies for PDF generation
- Maintained backward compatibility with existing functionality

New Features:
- /parse/export-pdf and /parse-file/export-pdf for parsing results
- /summarize/export-pdf and /summarize-file/export-pdf for summary results
- Professional PDF formatting with metadata, tables, and proper styling
- Enhanced OCR support and file processing capabilities

Breaking Changes:
- Removed multiple endpoint variations
- Consolidated agents into single unified system
- Updated to reflect new unified AcademicAgent architecture
- Added comprehensive documentation for PDF export functionality
- Simplified API endpoint documentation
- Added new usage examples with PDF generation
- Documented architectural changes and legacy support
- Updated project structure to show new organization
- Added 'What's New' section highlighting September 2025 updates
- Enhanced configuration section with PDF export settings
- Updated acknowledgments to include ReportLab
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant