Grafana Observability Stack for AI Infrastructure

This stack provides infrastructure observability that complements Langfuse for complete AI system visibility. While Langfuse tracks LLM interactions, this stack monitors the distributed systems, service mesh, and infrastructure beneath them.

Why This + Langfuse = Complete Observability

Layer	Langfuse Provides	This Stack Provides
LLM	Prompts, completions, token usage	-
Application	LLM call traces, evaluations	Service dependencies, distributed traces
Infrastructure	-	Container metrics, network I/O, resource usage
Data	-	GraphRAG performance, cache efficiency
Automation	-	Backup health, git hook triggers

Key Integration: Both systems share trace IDs via OTLP, enabling end-to-end debugging from LLM call to infrastructure.
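As a rough illustration of how a shared trace ID can be followed across both systems, the sketch below pulls the trace ID out of a W3C `traceparent` header value (the default propagation format in OpenTelemetry) and builds a Tempo lookup URL from it. The function names and the `TEMPO_URL` variable are hypothetical, and it assumes Tempo's HTTP API is reachable at the hostname listed in the Services table.

```shell
#!/bin/sh
# Hypothetical helpers, not part of this repository.

# Extract the 32-character trace ID from a W3C traceparent value.
# Header layout: version-traceid-parentid-flags
trace_id_from_traceparent() {
  tp="$1"
  rest="${tp#*-}"     # drop the version field
  id="${rest%%-*}"    # keep everything up to the next dash
  case "$id" in
    *[!0-9a-f]*|"") return 1 ;;   # reject empty or non-hex IDs
  esac
  [ "${#id}" -eq 32 ] || return 1
  printf '%s\n' "$id"
}

# Build the Tempo trace-by-ID URL (assumes the API is served at TEMPO_URL).
tempo_trace_url() {
  printf '%s/api/traces/%s\n' "${TEMPO_URL:-http://tempo.local}" "$1"
}
```

For example, `tempo_trace_url "$(trace_id_from_traceparent "$header")"` yields a URL that can be passed to `curl -s` to fetch the infrastructure-side spans for a trace ID copied from Langfuse.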

Quick Start (3 Commands)

# 1. Start the stack
docker compose -f docker-compose.grafana.yml up -d

# 2. Verify health
curl -s 'http://prometheus.local:9090/api/v1/query?query=up' | jq '.data.result[].metric.job'

# 3. Open Grafana
open http://grafana.local  # Login: admin/admin
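Containers can take a few seconds to become ready after `docker compose up -d` returns, so a small retry loop before opening the browser may help. This is a minimal sketch: `wait_for` and `grafana_ready` are hypothetical helper names, and `GRAFANA_URL` is an assumed override for the hostname shown above. Grafana's `/api/health` endpoint returns HTTP 200 once the server is up.

```shell
#!/bin/sh
# Hypothetical helpers, not part of this repository.

# wait_for ATTEMPTS DELAY CMD...: run CMD until it succeeds or attempts run out.
wait_for() {
  attempts="$1"; delay="$2"; shift 2
  i=1
  while [ "$i" -le "$attempts" ]; do
    if "$@" >/dev/null 2>&1; then
      return 0
    fi
    # Sleep only between attempts, not after the last one
    if [ "$i" -lt "$attempts" ]; then
      sleep "$delay"
    fi
    i=$((i + 1))
  done
  return 1
}

# Probe Grafana's health endpoint; -f makes curl fail on HTTP errors.
grafana_ready() {
  curl -sf "${GRAFANA_URL:-http://grafana.local}/api/health"
}
```

With these defined, `wait_for 30 2 grafana_ready && open http://grafana.local` waits up to about a minute before opening the dashboard.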

What Makes This Unique

Service Dependency Mapping - See what calls what in your AI architecture
Distributed Transaction Tracing - Follow requests across MCP servers, databases, and services
Memory Loop Detection - GraphRAG-specific patterns not visible in LLM traces
Infrastructure Correlation - Link slow AI responses to resource constraints
Automated Backup System - Git-driven configuration management with health monitoring

Documentation

🚀 Getting Started

Quick Start Guide - 5-minute setup with Langfuse integration
Trace Correlation - Link Langfuse and Tempo traces
MCP Instrumentation - OpenTelemetry for MCP servers

📚 References

Operations Guide - Visual patterns and troubleshooting
Integration Examples - Real-world scenarios
Learn More - External resources and documentation

Services

Service	URL	Purpose
Grafana	http://grafana.local	Visualization dashboard
Prometheus	http://prometheus.local	Metrics storage
Tempo	http://tempo.local	Distributed tracing
Loki	http://loki.local	Log aggregation
Alloy	http://alloy.local	OTLP collector
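Each of these services exposes its own health or readiness path, so a per-service probe can complement the Prometheus-based check. The sketch below maps each service to its standard endpoint; `ready_url` and `check_all` are hypothetical names, and it assumes the hostnames in the table answer on the default HTTP port (add explicit ports if your setup needs them).

```shell
#!/bin/sh
# Hypothetical helpers, not part of this repository.

# ready_url SERVICE: print the readiness URL for one service from the table.
ready_url() {
  case "$1" in
    grafana)    echo "http://grafana.local/api/health" ;;
    prometheus) echo "http://prometheus.local/-/ready" ;;
    tempo)      echo "http://tempo.local/ready" ;;
    loki)       echo "http://loki.local/ready" ;;
    alloy)      echo "http://alloy.local/-/ready" ;;
    *)          return 1 ;;   # unknown service name
  esac
}

# check_all: probe every service and print one status line per service.
check_all() {
  for svc in grafana prometheus tempo loki alloy; do
    if curl -sf -o /dev/null --max-time 5 "$(ready_url "$svc")"; then
      echo "$svc: ready"
    else
      echo "$svc: NOT ready"
    fi
  done
}
```

Running `check_all` after startup prints one line per service, which makes it easier to see which container is still starting or misconfigured.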

Quick Health Check

# Check all services are running (lists any down jobs, otherwise reports success)
curl -s 'http://prometheus.local:9090/api/v1/query?query=up' | \
  jq -r '[.data.result[] | select(.value[1]=="0") | .metric.job]
         | if length == 0 then "✅ All exporters up" else "Down: " + join(", ") end'

# Check current AI operations load (per-second rate scaled to calls per minute)
curl -sG 'http://prometheus.local:9090/api/v1/query' \
  --data-urlencode 'query=sum(rate(mcp_tool_invocations_total[1m])) * 60' | \
  jq -r '.data.result[0].value[1] // 0' | \
  xargs printf "Tool calls/min: %.0f\n"

devops-adeel/grafana-orbstack
