Skip to content

inmve/free-ai-coding

Repository files navigation

Last updated: November 17, 2025 • PRs/issues welcome

Languages: EspañolPortuguês中文Français日本語हिन्दीTürkçe

AI Coding Tools: Where Pro-Grade Models Are Actually Free

Many AI coding tools claim to be "free," but access to pro-grade models usually runs out fast, then you're downgraded. Each tool uses different limits (credits, tokens, requests), making comparison difficult. This list puts them side by side to show what you actually get for free.

TL;DR — Free Tiers for Pro‑Grade AI Coding

(tools with higher limits listed first)

Tool Pro‑grade models Free tier limit Credit card
Qwen Code Qwen3-Coder-480B 2,000 requests/day No
Rovo Dev CLI Claude Sonnet 4 5M tokens/day (beta) No
Gemini CLI Gemini 2.5 Pro 100 requests/day No
Kilo Code Claude Opus/Sonnet, Gemini 2.5 Pro, GPT‑4.1 Up to $25 signup credits (one‑time) Yes
Warp GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro 150 credits/month (first 2 months), then 75/month No
Trae Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, GPT‑4.1, GPT‑4o, Gemini 2.5 Pro 10 fast + 50 slow requests/month No
Amazon Q Developer Claude Sonnet 4 50 agentic requests/month Yes
GitHub Copilot GPT‑4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1 50 chat requests + 2,000 completions/month No
Windsurf OpenAI, Anthropic, Google, xAI 25 credits/month Yes
Jules Gemini 2.5 Pro 15 tasks/day No
AWS Kiro Claude 4 Sonnet, Claude 3.7 Sonnet 50 credits/month No
Qoder Qwen3-Coder-480B, Claude, GPT, Gemini Free tier + 2-week Pro trial (1,000 credits) No

Qualifying Pro‑Grade Models

Only models achieving >60% on SWE-bench Verified qualify as pro-grade for real-world coding tasks. Below is the current list

Model SWE-bench Verified Provider
Claude Sonnet 4.5 77.2% (82.0% w/ parallel) Anthropic
GPT-5 74.9% OpenAI
Claude Opus 4.1 74.5% Anthropic
Claude Sonnet 4 72.7% (80.2% w/ parallel) Anthropic
GPT-5 mini 71.0% OpenAI
Qwen3-Coder-480B 69.6% (interactive) / 67.0% (single) Alibaba
Gemini 2.5 Pro 63.2% Google

Contributing

If you spot an error, missing source link, or have updated quota/model information, please open an issue or pull request with a source. New tool contributions are welcomed! See CONTRIBUTING.md for detailed guidelines.

Disclaimer

No affiliation with any vendor. All trademarks belong to their owners. Information is for research; accuracy not guaranteed; limits/pricing change frequently.

Contents

1. AI-coding Tools with Free Access to Pro-Grade Models

(ordered from most generous to least)

Qwen3-Coder-480B access

  • 2,000 requests/day free tier via Qwen OAuth
  • 60 requests/minute rate limit
  • Command-line AI workflow tool (adapted from Gemini CLI)
  • One-click browser authentication
  • No credit card required

Links: GitHub | Documentation


Claude Sonnet 4 access during beta

  • 5M tokens/day free tier (20M on first day only)
  • Claude Sonnet 4 model (confirmed via testing)
  • No credit card required during beta
  • Token limits reset at midnight UTC
  • Note: Upgrade to Jira Standard/Premium/Enterprise for 20M tokens/day

Links: Documentation | Token Limits


Gemini 2.5 Pro access

  • 100 requests/day limit for Gemini 2.5 Pro
  • 250 requests/day limit for Gemini 2.5 Flash
  • No credit card required
  • Google models only
  • Switches to paid rates after free quota

Links: Rate Limits | Pricing


Claude Opus/Sonnet, Gemini 2.5 Pro, GPT-4.1 access

  • Up to $25 signup credits (one-time bonus)
  • Open source VS Code extension
  • Pay-as-you-go with no markup on model pricing
  • Credit card required to claim full bonus credits
  • Supports bringing your own API keys

Links: GitHub | Documentation | Pricing


GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro access

  • 150 AI credits/month (first 2 months), then 75 AI credits/month
  • Multiple providers (OpenAI GPT‑5, Claude Opus 4.1, Claude Sonnet 4, Gemini 2.5 Pro)
  • No credit card required for basic signup
  • New pricing structure announced Oct 30, 2025: Single Build plan ($20/mo) with 1,500 credits

Links: Pricing


Claude Sonnet 4 access

  • 50 agentic requests/month limit (multi-turn conversations)
  • Latest Claude models (AWS-hosted)
  • Credit card required
  • Must upgrade to Pro for continued access
  • Perpetual free tier

Links: Pricing


Agent Mode with GPT‑4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1

  • 50 chat requests + 2,000 completions/month limit
  • Agent Mode with autonomous multi-step coding
  • Multiple providers (GPT-4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1)
  • No credit card required
  • Limited to basic features after quota

Links: Plans Details | Agent Mode


Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, Claude 3.5 Sonnet, GPT‑4.1, GPT‑4o, Gemini 2.5 Pro access

  • 10 fast requests + 50 slow requests/month for premium models
  • 1,000 slow requests/month for advanced models
  • 5,000 auto-completions/month
  • VS Code-based IDE with AI integration
  • Multiple premium models including Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, GPT‑4.1
  • No credit card required for free tier
  • Pro Plan: $10/mo (600 fast + unlimited slow requests)

Links: Pricing | Documentation


OpenAI, Anthropic, Google, xAI model access

  • 25 prompt credits/month limit
  • Multiple providers (OpenAI, Claude, Gemini, xAI)
  • Credit card required
  • Can purchase add-on credits to continue

Links: Pricing


Gemini 2.5 Pro access

  • 15 tasks/day free tier
  • 3 concurrent tasks
  • Gemini 2.5 Pro model
  • Gmail account required (18+ years)
  • Task limits reset on rolling 24-hour window
  • No credit card required
  • Pro tier ($19.99/mo): 100 tasks/day (5x limits)

Links: Usage Limits | Documentation


Claude 4 Sonnet, Claude 3.7 Sonnet access

  • 50 credits/month (Free tier)
  • Claude 4 Sonnet and Claude 3.7 Sonnet models (AWS-hosted)
  • No credit card required
  • 14-day welcome bonus: 500 credits
  • Paid tiers: Pro ($20/mo - 1,000 credits), Pro+ ($40/mo - 2,000 credits), Power ($200/mo - 10,000 credits)

Links: Pricing | Introduction Blog


Qwen3-Coder-480B, Claude, GPT, Gemini models

  • Free tier: Unlimited completions/edits + limited chat/agent requests + 2-week Pro trial (1,000 credits)
  • AI-powered IDE from Alibaba
  • Available for Windows and macOS
  • Primarily uses Qwen3-Coder-480B (Alibaba's flagship coding model)
  • Also supports Claude, GPT-4, Gemini models
  • Agent Mode and Quest Mode for autonomous coding
  • No credit card required (free tier)
  • Paid tiers: Pro ($20/mo - 2,000 credits), Pro+ ($60/mo - 6,000 credits)

Links: Homepage | Pricing


Limits change fast. If you see a mistake, a newer quota/model, or want to add a new tool, open an issue or PR with a source. See CONTRIBUTING.md for guidelines.


2. API Providers for AI Coding Tools

(ordered from most generous to least)

These services provide API access to coding-optimized models that integrate with popular AI coding tools like Cursor, Continue.dev, Cline, and others. They don't provide standalone coding tools but offer the AI backend for existing tools.

Qwen3-Coder-480B via OpenRouter

  • 50 requests/day free tier (1,000/day if purchased $10+ credits)
  • Additional free models: Qwen3-30B-A3B, Qwen3-235B-A22B, Gemini Flash
  • OpenAI-compatible API for all major IDEs
  • No credit card required for free models
  • 20 requests/minute rate limit for free tier
  • Works with Continue.dev, Cline, Cursor, etc.

Links: Free Models | Qwen3-Coder API


Qwen3-235B and Llama 3.1 access

  • Free tier: 1M tokens/day
  • No credit card required
  • Rate limit: 30 requests/minute, 8,192 token context
  • Models: Qwen3-235B, Llama 3.1 70B (Note: Qwen3-Coder-480B deprecated Nov 5, 2025)
  • OpenAI-compatible API (works with Cursor, Continue.dev, Cline, RooCode, etc.)
  • Ultra-fast inference: 2,000 tokens/second (40x faster than typical providers)
  • Paid tiers: Developer ($10+ self-serve), Enterprise (custom pricing)

Links: Pricing | API Docs | Integration Guides


3. Tools with Paid Tiers with Pro-Grade Models

Jira Standard ($7.53/user/mo): 20M tokens/day Jira Premium ($15.25/user/mo): 20M tokens/day Jira Enterprise (custom): 20M tokens/day

  • 4x increase from free tier (5M → 20M tokens/day)
  • Same Claude-based model as free tier
  • Token limits reset at midnight UTC

Links: Documentation | Token Limits | Jira Pricing


Pro ($17/mo with annual): Sonnet 4 access with more usage than free tier Max (From $100/mo): Opus 4.1 + Sonnet 4 access (5x or 20x more usage than Pro)

  • Usage limits reset weekly
  • 5-hour rolling window limits apply
  • Priority access during high traffic (Max tier)

Links: Pricing


Pro ($19/mo): Increased limits for agentic requests

  • Usage may be adjusted based on regional factors and usage patterns

Links: Pricing


Build ($20/mo): 1,500 AI credits/month

  • Reload Credits available (up to 50% cheaper than old overage rates, roll over for 12 months)
  • Bring Your Own API Key (BYOK) option available
  • New pricing effective immediately for new customers (Oct 30, 2025)
  • Existing monthly subscribers transition on first renewal after Dec 1, 2025
  • Enterprise tier: Custom pricing

Links: Pricing


Pro ($10/mo): 300 premium requests + unlimited completions/month Pro+ ($39/mo): 1,500 premium requests + unlimited completions/month Business ($19/user/mo): 300 premium requests + unlimited completions/user/month Enterprise ($39/user/mo): 1,000 premium requests + unlimited completions/user/month

  • Access to multiple models (GPT-4.1, Claude Opus 3.5, Gemini 2.0 Flash, Grok Code Fast 1)
  • Overage billing available at $0.04/request

Links: Plans Details


Pro ($10/mo): 600 fast requests + unlimited slow requests for premium models

  • Unlimited slow requests for advanced models
  • Zero rate limits and faster access to premium models
  • Extra packages available: $3-$12 for additional fast requests
  • Multiple premium models: Claude 4 Sonnet (Beta), Claude 3.7 Sonnet, Claude 3.5 Sonnet, Gemini 2.5 Pro, GPT‑4.1, GPT‑4o
  • VS Code-based IDE with full AI integration
  • First month available for $3

Links: Pricing | Documentation


Pro ($15/mo): 500 prompt credits/month Teams ($30/user/mo): 500 prompt credits/user/month Enterprise ($60+/user/mo): 1,000 prompt credits/user/month

Links: Pricing


Pro ($25/mo): 150 credits/month (5 daily credits) Teams ($30/mo): Higher limits (undisclosed)

Links: Messaging Limits


$20/mo: 10M tokens/month $200/mo: 120M tokens/month

Links: Token Documentation


Hobby (Free): Limited Agent requests + Limited Tab completions + 1-week Pro trial Pro ($20/mo or $16/mo annually): Extended Agent limits + Unlimited Tab completions + Background Agents + Maximum context windows Pro+ ($60/mo): 3x usage on all OpenAI, Claude, Gemini models Ultra ($200/mo): 20x usage on all OpenAI, Claude, Gemini models Teams ($40/user/mo): Pro features + team management

  • One-week Pro trial available (free tier)
  • Free tier: GPT-4.1 only (50 requests/month premium), plus free models (Cursor Small, Deepseek v3, Gemini 2.5 Flash, GPT-4o mini, Grok 3 Mini Beta)
  • Paid tiers: Access to OpenAI, Claude, Gemini models
  • Note: Claude models removed from free tier ~June 2025
  • AI-powered code editor with autonomous coding capabilities

Links: Pricing


Free with ChatGPT Plus ($20/mo): GPT-5 access for coding tasks Pay-as-you-go: Use with OpenAI API key Free OSS mode: Access to open-source models only (via --oss flag)

  • Lightweight coding agent running locally
  • Interactive terminal UI with sandbox mode
  • Cross-platform support: macOS 12+, Ubuntu 20.04+, Windows 11 via WSL2
  • Experimental project under active development

Links: GitHub Repo


Pro ($10/mo): Unlimited usage with advanced context awareness

  • Claude 3.5 Sonnet, GPT-4o access
  • Enhanced context window and personalization Teams ($12/user/mo): Pro features + team management Enterprise (Custom): On-premise deployment, custom models

Links: Pricing


Pro ($12/mo): Enhanced AI completions and chat Enterprise ($39/user/mo): Multiple LLMs, private deployment

  • Models: Claude 3.5 Sonnet, GPT-4o, Llama 3.3 70B, proprietary models
  • 600+ programming languages supported
  • On-premises and air-gapped deployment options
  • Bring your own fine-tuned models

Links: Pricing


AI Pro ($15/mo): Increased cloud quota + unlimited local models AI Ultimate ($25/mo): Maximum cloud quota + advanced features

  • Free tier: Unlimited code completion + local models + limited cloud quota
  • 30-day Pro trial included
  • All Products Pack includes AI Pro
  • Offline mode with local models via Ollama/LM Studio

Links: AI Pricing


Pro ($19.99/mo via Google AI Pro): 100 tasks/day

  • 5x higher limits than free tier (15 tasks/day → 100 tasks/day)
  • 5x concurrent tasks (3 → 15 concurrent)
  • Higher access to latest models

Ultra (via Google AI Ultra): 300 tasks/day

  • 20x higher limits than free tier
  • 60 concurrent tasks
  • Priority access to latest models
  • Gmail account required (18+ years)

Links: Usage Limits | Google AI Plans


Pro ($10/mo): 1M token context window + chat credits

  • Alternative: $99/year
  • Chat interface with GPT-4o, Claude 3.5 Sonnet, GPT-4 Team ($10/user/mo): Pro features + team management
  • Note: Merged with Cursor IDE in November 2024

Links: Pricing


Know better pricing or limits? Share a link in an issue or PR to help keep this updated. See CONTRIBUTING.md for guidelines.


4. Tools with Free Access to Basic Models

(unspecified/basic models)

Unspecified models

  • 1M tokens/month limit
  • Specific model not publicly specified
  • Credit card required

Links: Token Documentation


Unspecified models

  • 5 daily credits, max 30 per month (free)
  • Models not publicly enumerated
  • Credit card required

Links: Messaging Limits


Proprietary models (not frontier)

  • GPT-5 access requires v0 Premium subscription
  • $5 in credits/month limit
  • Uses proprietary models with varied routing
  • Credit card required

Links: Updated Pricing Blog


Unlimited free usage of basic AI coding assistance

  • Individual plan: Free forever with unlimited code completions, AI chat, commands
  • 70+ programming languages supported
  • IDE integrations: VS Code, JetBrains, Vim/Neovim, Jupyter
  • No credit card required
  • Limited context awareness (expanded in paid tiers)
  • Base model only (Llama 3.1 70B), pro-graded models require subscription

Links: Pricing | Documentation


Free tier with limited features

  • Basic AI code completions and chat (limited)
  • Local processing available
  • Context heavily limited in free tier
  • Performance dialed down to save resources
  • 600+ programming languages supported

Links: Pricing


AI Free tier included with IDEs

  • Unlimited code completion and local model support
  • Limited quota for cloud-based features
  • 30-day AI Pro trial
  • Chat, code generation, commit messages with local models

Links: AI Features


Free tier with basic features

  • Basic code suggestions
  • 7-day data retention limit
  • Credit card required for registration
  • 1M token context window (impressive for free tier)

Links: Pricing


Free open-source extension with flexible model support

  • Free VS Code and JetBrains extension
  • Full support for local models via Ollama, LM Studio
  • Solo tier: Private/team/public visibility options
  • Supports 200+ models (requires your own API keys for cloud models)
  • Community hub for custom AI assistants
  • No vendor lock-in or usage limits for local models

Links: GitHub | Model Hub


Know the official limits or models? Share a link in an issue or PR to update the information. See CONTRIBUTING.md for guidelines.


5. Local Models

Running open-weight frontier models locally provides unlimited coding assistance without API costs or usage limits. Popular tools for local deployment include Cline (VS Code extension with Plan/Act modes and MCP support), Aider (command-line assistant with built-in Git integration), and Continue.dev (open-source VS Code extension supporting 200+ models). All work seamlessly with Ollama to run frontier models like Devstral (24B parameters, optimized for agentic coding), Qwen3-Coder, DeepSeek Coder V2, Codestral, and GLM-4.5.

Note: Frontier models require substantial RAM/VRAM. In particular, for Qwen3‑Coder‑480B the Ollama‑friendly GGUF is ~150GB, and practical local inference can require ~150GB of unified memory (RAM+VRAM), which makes it hard on typical laptops; the 30B quant commonly needs ~18GB. See the Unsloth Qwen3‑Coder local guide for details (docs) and Simon Willison's article on running GLM‑4.5 AIR on his laptop to build Space Invaders for a practical example.


Comparison Notes

  • Goal: Compare AI coding tools by their access to pro-grade models and free tier limits.
  • What qualifies a model as "pro-grade"? Models must achieve ≥60% on SWE-bench Verified, demonstrating real-world software engineering capability. Current qualifying models: GPT-5 (74.9%), Claude Opus 4.1 (74.5%), Claude Sonnet 4 (72.7%), GPT-5 mini (71.0%), Qwen3-Coder-480B (69.6%), and Gemini 2.5 Pro (63.2%).
  • Different limit types: Tools use various quota systems - requests, tokens, credits, chats - making direct comparison challenging. Check documentation for specifics.
  • Real-world usage: Actual consumption varies dramatically based on coding style, task complexity, and tool implementation.

Related Resources