This project implements an AI-driven assessment framework to analyze GitHub repositories for AI/ML and data science projects using Large Language Models (LLMs) and rule-based techniques. It evaluates key software quality dimensions to ensure adherence to best practices and industry standards.
See this article for more information on the Assessment framework.
Each repository is assessed across five critical dimensions:
- Documentation – Comprehensive, well-structured, and user-centric guidance for seamless adoption.
- Repository Architecture – Intuitive, standardized organization that enhances collaboration and scalability.
- Environment & Dependencies – Robust dependency management ensuring reproducibility and seamless execution.
- License & Legal – Clear governance of intellectual property, permissions, and compliance.
- Code Quality – Enforced coding standards ensuring efficiency, maintainability, and technical integrity.
This framework is structured into three progressive tiers:
- Essential – Establishes foundational repository standards and ensures minimal compliance.
- Professional – Implements structured, maintainable, and scalable repository practices.
- Elite – Adopts best-in-class methodologies, including advanced documentation, automation, and security enforcement.
This project is powered by:
- Python (>=3.12) – Core language.
- LangChain (>=0.3.20) – LLM-driven framework.
For full dependencies, see pyproject.toml.
This framework is intended for:

- AI/ML Practitioners: Researchers and developers sharing technical projects with the community.
- Engineering Teams: Teams and organizations establishing quality standards for repository management.
This repository follows a well-organized directory structure designed for clarity, maintainability, and extensibility. Each folder serves a specific purpose within the assessment framework:

- `data/`: Stores all input repositories and output assessment results, providing clear separation between source data and analysis results.
  - `inputs/`: Contains cloned GitHub repositories that will be analyzed by the assessment tool.
  - `outputs/`: Stores assessment reports, JSON results, and other artifacts generated during evaluation.
- `src/`: Houses all source code for the repository assessment tool, organized into logical modules.
  - `config/`: Contains configuration files that control the behavior of the assessment tool.
    - `scoring/`: Stores YAML definition files for all scoring criteria across the five assessment categories.
  - `directory_scorer/`: Implements algorithms for analyzing directory file contents.
  - `utils/`: Provides utility functions for repository management, validation, and general helper functions.
- `tests/`: Contains all unit tests and test fixtures to ensure code quality and reliability.
```
root/
├── data/                              # Data directory for inputs and outputs
│   ├── inputs/                        # Directory for storing cloned repositories
│   └── outputs/                       # Directory for assessment results
│
├── src/                               # Source code
│   ├── config/                        # Configuration files and scoring criteria
│   │   ├── scoring/                   # YAML files defining criteria for each category
│   │   │   ├── code_quality_criteria.yaml
│   │   │   ├── dependancies_criteria.yaml
│   │   │   ├── documentation_criteria.yaml
│   │   │   ├── license_criteria.yaml
│   │   │   └── structure_criteria.yaml
│   │   ├── config.json                # Configuration settings (URLs, workers)
│   │   ├── logic_based_scoring.py     # Logic-based scoring functions
│   │   ├── paths.py                   # Path definitions for the project
│   │   ├── prompts.yaml               # LLM prompts for assessment
│   │   └── tracked_files.yaml         # File tracking configuration
│   │
│   ├── directory_scorer/              # Directory and file content scoring
│   │   ├── content_based_scorer.py    # File content evaluation
│   │   └── tree.py                    # Directory tree management
│   │
│   ├── utils/                         # Utility functions
│   │   ├── general.py                 # General utility functions
│   │   ├── llm.py                     # LLM integration
│   │   ├── project_validators.py      # Repository validation functions
│   │   └── repository.py              # Repository management functions
│   │
│   ├── generators.py                  # Criteria generation functions
│   ├── logger.py                      # Logging configuration
│   ├── main.py                        # Main entry point
│   ├── output_parsers.py              # Output formatting and parsing
│   └── report.py                      # Report generation
│
├── tests/                             # Test directory
│   ├── conftest.py                    # Pytest configuration
│   └── test_project_validators.py     # Tests for project validators
│
├── .env.example                       # Example environment variables
├── .gitignore                         # Git ignore file
├── LICENSE                            # MIT License
├── Makefile                           # Build commands
├── README.md                          # Project documentation
├── pyproject.toml                     # Project metadata and dependencies
├── pytest.ini                         # Pytest configuration
└── uv.lock                            # Dependency lock file
```
- Python 3.12 or higher
- `uv` package for package management
- OpenAI API key
This project leverages `uv` ([official documentation](https://docs.astral.sh/uv/)) for efficient package and dependency management, ensuring consistent environments and a straightforward setup.
1. **Clone the Repository**

   Begin by cloning the repository to your local environment:

   ```bash
   git clone https://github.com/readytensor/rt-repo-assessment.git
   cd rt-repo-assessment
   ```

2. **Install `uv`**

   Run the following command to install `uv`:

   ```bash
   pip install uv
   ```

3. **Install Python & Dependencies**

   Set up Python and install all dependencies in a virtual environment using:

   ```bash
   make install
   ```

4. **Adding New Packages**

   To add a package to the project, use the following command:

   ```bash
   uv add <package>
   ```

5. **Create `.env`**

   Create a `.env` file in the root directory and place your OpenAI, Groq, Gemini, or OpenRouter API key in it. See `.env.example` for reference.

6. **Run the Assessment**

   Execute the repository assessment tool using:

   ```bash
   python main.py
   ```

7. **Configure Repository URLs & Max Workers**

   Configurations are specified in `src/config/config.json`. Modify `config.json` to specify repository URLs and other parameters:

   ```json
   {
     "from_inputs_directory": false,
     "project_name": "my-first-ai-project",
     "urls": ["https://github.com/repo_name"],
     "max_workers": 12
   }
   ```

   - `from_inputs_directory`: If set to `true`, the tool runs on a local folder in the `data/inputs` directory and the `urls` parameter is ignored. If set to `false`, the tool downloads the remote repositories specified in `urls` and `project_name` is ignored.
   - `project_name`: Specifies the name of the project when `from_inputs_directory` is `true`.
   - `urls`: A list of repository URLs to download when `from_inputs_directory` is `false`.
   - `max_workers`: The maximum number of workers to use for processing.

8. **View Assessment Results**

   The assessment results can be found in the `data/outputs/repo_name/report.md` file. The report includes an overall summary, a category breakdown, and detailed per-criterion results, for example:
**Overall Summary**

- Total Criteria: *total number of criteria*
- Criteria Met: *percentage of criteria met*
**Category Breakdown**

| Category | Criteria Met | Total Criteria | Percentage |
|---|---|---|---|
| Essential | # Essential Criteria Met | Total Essential Criteria | % |
| Professional | # Professional Criteria Met | Total Professional Criteria | % |
| Elite | # Elite Criteria Met | Total Elite Criteria | % |
**Sample Detailed Results**

| Category | Criterion | Status | Explanation |
|---|---|---|---|
| Code Quality | Modular Code Organization | ✅ | This criterion is satisfied in the project by file 'utils.py'. The code is organized into functions and a class, which is a good practice. |
| Documentation | README Completeness | ✅ | The README provides all necessary information for understanding, setting up, and using the project effectively. |
| Repository Structure | Clear Folder Organization | ✅ | The repository follows a logical structure with clear divisions for source code, documentation, and other assets. |
| Environment & Dependencies | Test Directory Structure | ❌ | The project directory does not contain a dedicated 'tests/' directory or any other structure that organizes tests in a way that mirrors the main code structure. There is no indication of any testing files or directories present. |
| License and Legal | License Presence | ✅ | The project directory includes a recognized license file named 'LICENSE.txt' in the root directory, which explicitly states the terms of use, modification, and distribution. |
This project analyzes GitHub repositories by fetching data from the provided repository links. The expected data sources include:
- GitHub Repository Links: A list of GitHub repository URLs that the framework will analyze.
The evaluation data consists of:
- README Files: To evaluate documentation quality.
- Repository Metadata: To assess repository structure, dependencies, and license information.
- Code Files: For evaluating code quality and enforcing coding standards.
Ensure your GitHub repositories are public, or that you have the necessary access rights for private repositories. The project will automatically fetch the data from these repositories when provided with their URLs.
For example:

```json
{
  "repository_urls": [
    "https://github.com/readytensor/rt-repo-assessment",
    "https://github.com/another-user/sample-repo"
  ]
}
```

The project uses `pytest` for unit testing. Tests can be run using the `make test` command.
The configuration is managed through the config.json and scoring files.
As detailed in the usage section, the config.json defines key parameters like repository URLs and the maximum number of concurrent workers.
Scoring is based on five key areas: Documentation, Repository Structure, Environment & Dependencies, License & Legal, and Code Quality. Each criterion is assessed at three levels: Essential, Professional, and Elite.
The assessment framework employs three types of scoring methods, and each criterion's YAML definition selects one (see the sketch after this list):
- Logic-based Scoring: Relies on predefined functions that apply specific validation rules and tests.
- File Content-based Scoring: Uses LLMs to evaluate the content of files against defined criteria.
- Metadata-based Scoring: Uses LLMs on repository metadata such as directory structure and file presence.
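As an illustration of how criteria might be routed to these methods, here is a hypothetical dispatch sketch driven by each criterion's `based_on` field. The handler functions and the default for criteria without an explicit `based_on` key are assumptions, not the project's actual implementation:

```python
import yaml  # PyYAML


# Hypothetical handlers; the real implementations live in the project's scoring modules.
def run_logic_based_scoring(criterion: dict, metadata: dict) -> dict: ...
def run_file_content_scoring(criterion: dict, metadata: dict) -> dict: ...
def run_metadata_scoring(criterion: dict, metadata: dict) -> dict: ...


def score_criteria(criteria_path: str, metadata: dict) -> dict:
    """Load a criteria YAML file and route each criterion to a scoring method."""
    with open(criteria_path) as f:
        criteria = yaml.safe_load(f)

    results = {}
    for key, criterion in criteria.items():
        based_on = criterion.get("based_on", "metadata")  # assumed default
        if based_on == "custom_logic":
            results[key] = run_logic_based_scoring(criterion, metadata)
        elif based_on == "file_content":
            results[key] = run_file_content_scoring(criterion, metadata)
        else:
            results[key] = run_metadata_scoring(criterion, metadata)
    return results
```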
```yaml
script_length:
  name: Script Length Control
  description: Individual scripts/modules have reasonable length (< 500 lines)
  essential: true
  professional: true
  elite: true
  based_on: custom_logic
  args:
    max_script_length: 500
```

Logic-based scoring uses custom functions to validate criteria such as script length against configurable thresholds.
Here's how a custom logic function processes the metadata to evaluate a criterion:

```python
from typing import Any, Dict


@scoring_function  # decorator provided by the project's logic-based scoring module
def script_length(metadata: Dict[str, Any], max_script_length: int) -> Dict[str, Any]:
    """
    Check if scripts in the repository exceed the maximum allowed length.
    """
    script_lengths = metadata["script_lengths"]

    # Collect every script that exceeds the configured threshold.
    long_scripts = [
        script for script, length in script_lengths.items() if length > max_script_length
    ]

    if long_scripts:
        score = 0
        explanation = f"The following scripts are longer than {max_script_length} lines: {long_scripts}"
    else:
        score = 1
        explanation = f"All scripts are less than {max_script_length} lines."

    return {
        "score": score,
        "explanation": explanation,
    }
```

This function analyzes the script lengths in the metadata and assigns a score of 1 if all scripts meet the length requirement, or 0 if any script exceeds it.
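For example, calling it on a small, invented metadata sample (and assuming the `@scoring_function` decorator preserves the call signature) would flag the overlong script:

```python
sample_metadata = {
    "script_lengths": {
        "src/main.py": 178,
        "src/utils/repository.py": 612,  # exceeds the 500-line threshold
    }
}

result = script_length(sample_metadata, max_script_length=500)
print(result["score"])        # 0
print(result["explanation"])  # names src/utils/repository.py as too long
```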
```yaml
functions_and_classes:
  name: Modular Code Organization
  description: Code organized into functions/methods rather than monolithic scripts
  essential: true
  professional: true
  elite: true
  include_extensions:
    - .py
    - .ipynb
  aggregation: OR
  based_on: file_content
```

File content-based scoring examines the actual content of code files, checking for proper modularization and organization.
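A minimal sketch of what such a check might look like with LangChain (the prompt wording and model choice here are assumptions; the project's actual prompts are defined in `src/config/prompts.yaml`):

```python
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)  # illustrative model choice


def evaluate_file_content(criterion_description: str, file_content: str) -> str:
    """Ask the LLM whether a single file satisfies a criterion."""
    prompt = (
        f"Criterion: {criterion_description}\n\n"
        f"File content:\n{file_content}\n\n"
        "Does this file satisfy the criterion? Answer PASS or FAIL with a brief explanation."
    )
    return llm.invoke(prompt).content
```

With `aggregation: OR`, a criterion like the one above would presumably be satisfied if any of the evaluated `.py` or `.ipynb` files passes the check.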
```yaml
organized_notebooks:
  name: Organized Notebooks
  description: If Jupyter notebooks are present, they are organized in a logical manner
  essential: false
  professional: true
  elite: true
  instructions: If the repository does not contain any notebooks, this criterion should be scored 1.
```

Metadata-based scoring evaluates repository structure and organization patterns rather than specific file contents.
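In the same spirit, a metadata-based check might pass the directory structure and the criterion's `instructions` field to the LLM. This is again a sketch, not the project's actual prompt:

```python
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)  # illustrative model choice


def evaluate_metadata(criterion: dict, directory_structure: str) -> str:
    """Ask the LLM to score a criterion from repository metadata alone."""
    prompt = (
        f"Criterion: {criterion['description']}\n"
        f"Special instructions: {criterion.get('instructions', 'None')}\n\n"
        f"Repository directory structure:\n{directory_structure}\n\n"
        "Score 1 if the criterion is satisfied, 0 otherwise, with a brief explanation."
    )
    return llm.invoke(prompt).content
```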
The assessment framework extracts repository metadata, which is then used for evaluation:

```python
# Example of repository metadata structure
metadata = {
    # File existence checks
    "readme_exists": True,
    "requirements_txt_exists": True,
    "pyproject_toml_exists": False,
    "setup_py_exists": True,
    "license_file_exists": True,
    "gitignore_file_exists": True,
    "ignored_files_exist": True,
    # Script length information
    "script_lengths": {
        "src/main.py": 178,
        "src/utils/repository.py": 309,
        "src/utils/project_validators.py": 171,
    },
    # Repository directory structure as a formatted string
    "directory_structure": """
repo_name
├── src
│   ├── utils
│   │   ├── repository.py
│   │   └── project_validators.py
│   └── main.py
├── README.md
├── LICENSE
└── requirements.txt
""",
}
```

This metadata provides a comprehensive overview of the repository structure and key characteristics, enabling both rule-based and LLM-based evaluation against the defined criteria.
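A rough sketch of how such metadata could be gathered with the standard library (the checked file names and the line-counting approach are assumptions; the project's actual extraction logic lives in `src/utils/`):

```python
from pathlib import Path


def extract_metadata(repo_root: str) -> dict:
    """Collect simple file-existence flags and per-script line counts."""
    root = Path(repo_root)
    return {
        "readme_exists": (root / "README.md").exists(),
        "license_file_exists": any(root.glob("LICENSE*")),
        "pyproject_toml_exists": (root / "pyproject.toml").exists(),
        # Line counts for every Python script in the repository
        "script_lengths": {
            str(p.relative_to(root)): len(
                p.read_text(encoding="utf-8", errors="ignore").splitlines()
            )
            for p in root.rglob("*.py")
        },
    }
```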
We welcome contributions to improve the framework. Please refer to the guidelines below for how you can contribute:
- Fork the repository.
- Create a new branch for your changes.
- Submit a pull request with a detailed description of your modifications.
This project is licensed under the MIT License.
- Ready Tensor, Inc. (2025). Best Practices for AI Project Code Repositories.
For any inquiries, feel free to reach out to [email protected]