🚀 Transform months of research into minutes of insight
English | 中文
ARIA is an automated research assistant framework for scientific data analysis, visualization, and report generation.
aria/
├── .claude/commands/ # Claude AI command files
│ ├── academic/ # Academic workflow commands
│ ├── git/ # Git operation commands
│ └── python/ # Python environment commands
├── data/
│ ├── raw/ # Original data
│ ├── processed/ # Preprocessed data
│ └── output/ # Experiment outputs
│ ├── models/ # Trained models
│ ├── results/ # Experimental results
│ ├── figures/ # Visualization charts
│ └── logs/ # Execution logs
├── docs/ # Project documentation
├── src/ # Source code
├── scripts/ # Execution scripts
├── GETTING_STARTED.md # Quick start guide
├── INSTALL.md # Installation guide
└── README.md # This file
- Place raw data in the
data/raw/folder - Manually create
docs/01-basic-information.mdto describe project background, objectives, and data overview
- Raw Data Analysis: Use
@raw-data-analysis.mdcommand to analyze raw data, generatingdocs/02-raw-data-analysis.md - Data Preprocessing: Use
@preprocess.mdcommand to design preprocessing plan (docs/03-preprocess-plan.md), execute preprocessing, analyze processed data (docs/04-processed-data-analysis.md)
- Research Plan: Use
@research-plan.mdcommand to develop research plan including feature engineering, model selection, evaluation metrics, generatingdocs/05-research-plan.md
- Code Development: Use
@code-implementation.mdcommand to implement research plan, creating necessary Python modules - Generate
docs/06-implementation-docs.md(implementation documentation) anddocs/07-execution-instructions.md(execution guide) - Code quality check: Use mypy and ruff to ensure code quality
- Run Experiments: Use
@run-experiments.mdcommand to execute experiment scripts - Outputs saved to corresponding subfolders in
output/directory
- Results Analysis: Use
@experiment-analysis.mdcommand to analyze experiment outputs - Generate individual analyses in
docs/08-experiment-results/directory - Generate
docs/09-experiment-report.mdcomprehensive experiment report
- Academic Paper: Use
@research-report.mdcommand to generate high-impact journal format paper - Generate
docs/10-manuscript.md(main text),docs/10-manuscript-supplement.md(supplementary materials),docs/10-cover-letter.md(cover letter)
- Gradio Interface: If models are trained, use
@gradio-app.mdcommand to create model inference interface - Generate
docs/11-model-deployment-guide.mddeployment guide
All academic command files are located in .claude/commands/academic/ directory:
@raw-data-analysis.md- Analyze raw data@preprocess.md- Data preprocessing@research-plan.md- Research plan design@code-implementation.md- Code implementation@run-experiments.md- Experiment execution@experiment-analysis.md- Results analysis@research-report.md- Academic paper generation@gradio-app.md- Model deployment interface
Python command files are located in .claude/commands/python/ directory:
@setup-environment.md- Automated environment setup- Checks and installs Git, Python 3.12+, and UV package manager
- Sets up project dependencies with
uv sync - Includes Tsinghua mirror fallback for network issues
- Provides complete environment verification
Git command files are located in .claude/commands/git/ directory:
@git-commit.md- Intelligent Git commits- Automatically creates structured commit messages
- When there are many modified files, automatically commits in batches (max 10 files per commit)
- Generates meaningful commit descriptions based on code changes
Complete example projects demonstrating ARIA workflow across different research tasks:
- 🏠 aria-example-buston - Boston housing price prediction (regression) | OpenML Dataset
- 💎 aria-example-diamonds - Diamond price prediction (regression) | OpenML Dataset
- 🐛 aria-example-kc1 - Software defect prediction (classification) | OpenML Dataset
- 🧩 aria-example-sat11 - SAT solver performance prediction (regression) | OpenML Dataset
All datasets are from OpenML, an open machine learning platform. Each project includes complete documentation, production-ready code, trained models, and academic manuscripts.
New to ARIA? Start here: Getting Started Guide 📖
The complete guide covers:
- Installing Git and AI code editor (Cursor/VSCode/Lingma IDE)
- Automated environment setup with
@setup-environment.md - Step-by-step workflow from data to paper
- Troubleshooting and best practices
- Setup Environment: Use
@setup-environment.mdto configure Python and dependencies - Prepare Data: Place raw data in
data/raw/ - Create Project Description: Write
docs/01-basic-information.md - Execute Workflow: Use Claude AI to execute academic commands in sequence
- Version Control: Use
@git-commit.mdfor intelligent commits
📚 Full Documentation: See GETTING_STARTED.md and INSTALL.md
Using UV for Python dependency management:
uv add <package-name> # Add new dependency
uv sync # Synchronize dependencies- Type checking:
mypy src/ - Code style:
ruff check src/ - Code formatting:
ruff format src/
- 📊 Data-Driven: Complete workflow from raw data to academic paper
- 🤖 AI-Assisted: Claude AI commands automate each research phase
- 📝 Academic Standards: Generate Nature/Science standard papers
- 🎯 Quality Assurance: Integrated code quality checking tools
- 🚀 Model Deployment: Support for Gradio interface rapid deployment
- 📦 Version Control: Intelligent Git commit management
This project is dual-licensed:
- GNU AGPL-3.0 - for open source use (personal, research, education, open source projects)
- Commercial License - for proprietary/commercial use (contact for licensing)
See the LICENSE file for details.
Contributions are welcome! Please feel free to submit Issues and Pull Requests.
For questions, suggestions, or collaboration opportunities:
- GitHub Issues: Create an issue
- Email: [email protected]
- Project: ARIA on GitHub
