🐦‍⬛ chough

pronounced "chuff" /tʃʌf/ — a fast, memory-efficient ASR CLI using Parakeet TDT 0.6b V3 via sherpa-onnx with chunked processing.

Features

⚡ Fast: 7-20x realtime transcription
🧠 Memory-efficient: Processes audio in chunks
📦 Any format: If ffmpeg supports it, chough supports it
🎯 No setup: Auto-downloads models on first run
📝 Multiple formats: text, json, vtt
💻 CPU only: No GPU required
🌐 Server mode: HTTP API for batch processing

Supported Languages

Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish, Russian, Ukrainian

Requirements

ffmpeg - for audio/video support

Installation

Arch Linux (AUR)

paru -S chough-bin

macOS (Homebrew)

brew install --cask hyperpuncher/tap/chough

Windows (Winget)

winget install chough

Binary releases

Download from GitHub Releases.

Build from source

go install github.com/hyperpuncher/chough/cmd/chough@latest

Skill

npx skills add hyperpuncher/dotagents --skill chough

CLI Usage

# Basic transcription (text to stdout)
chough audio.mp3

# Pipe audio from stdin
cat audio.mp3 | chough

# Video files work too - extracts audio automatically
chough -f vtt -o subtitles.vtt lecture.mp4

# JSON with timestamps
chough -f json podcast.mp3 > transcript.json

# Smaller chunks for lower memory usage
chough -c 30 long-interview.wav

# Use remote server mode (requires CHOUGH_URL)
CHOUGH_URL=http://localhost:8080 chough --remote audio.mp3

Flags

Flag	Description	Default
`-c, --chunk-size`	Chunk size in seconds	60
`-f, --format`	Output format: text, json, vtt	text
`-o, --output`	Output file	stdout
`-r, --remote`	Transcribe via CHOUGH_URL server	-
`--version`	Show version	-
`-h, --help`	Show help	-

Server Mode

Run chough as an HTTP server for API access. The server keeps the model loaded in memory (~1.6GB), eliminating the ~1.5s startup time per request.

# Start server
chough --server --port 8080

# With custom settings
chough --server --host 0.0.0.0 --port 8080 --workers 2

API Endpoints

Method	Endpoint	Description
POST	`/transcribe`	Transcribe audio (file upload, URL, or base64)
GET	`/health`	Health check with queue status

API Examples

# Upload file
curl -X POST http://localhost:8080/transcribe \
  -F "file=@audio.mp3" \
  -F "format=json" \
  -F "chunk_size=60"

# Transcribe from URL
curl -X POST http://localhost:8080/transcribe \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com/audio.mp3", "format": "vtt"}'

# Base64 audio
curl -X POST http://localhost:8080/transcribe \
  -H "Content-Type: application/json" \
  -d '{"base64": "...", "format": "text"}'

# Health check
curl http://localhost:8080/health

Server Flags

Flag	Description	Default
`--server`	Run in server mode	-
`--host`	Server host	0.0.0.0
`--port`	Server port	8080
`--workers`	Concurrent workers	2
`--max-upload`	Max upload size (MB)	1024

Docker

# Build and run
docker build -t chough .
docker run -d -p 8080:8080 chough

# Or use docker-compose
docker-compose up -d

Environment

CHOUGH_MODEL: Path to model directory (optional, auto-downloaded if not set)
CHOUGH_URL: Remote server URL for --remote mode (must start with http:// or https://)

Model

Default: Parakeet TDT 0.6b V3 (sherpa-onnx-nemo-parakeet-tdt-0.6b-v3-int8.tar.bz2)

Models are automatically downloaded to $XDG_CACHE_HOME/chough/models (~650MB).

How it works

Splits audio into 60s chunks (configurable)
Loads ONNX model once (~1.5s)
Processes chunks sequentially
Outputs results

Performance

Benchmark on 1-minute audio file (AMD Ryzen 5 5600X, 6 cores):

Tool	Model	Time	Relative	Realtime Factor	Memory
chough	Parakeet TDT 0.6b V3	4.3s	13.2x	14.1x	1.6GB
whisper-ctranslate2	medium	27.8s	2.0x	2.2x	1.7GB
whisper	turbo	56.6s	1.0x	1.1x	5.3GB

chough is ~6-13x faster than other tools.

Speed by audio length

Duration	Time	Speed
15s	2.0s	7.4x realtime
1min	4.3s	14.1x realtime
5min	16.2s	18.5x realtime
30min	90.2s	19.9x realtime

Run your own benchmarks: just benchmark <audio-file>

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
.github/workflows		.github/workflows
benchmarks		benchmarks
cmd		cmd
internal		internal
scripts		scripts
.gitignore		.gitignore
.goreleaser.yaml		.goreleaser.yaml
Dockerfile		Dockerfile
Dockerfile.goreleaser		Dockerfile.goreleaser
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
go.mod		go.mod
go.sum		go.sum
justfile		justfile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🐦‍⬛ chough

Features

Supported Languages

Requirements

Installation

Arch Linux (AUR)

macOS (Homebrew)

Windows (Winget)

Binary releases

Build from source

Skill

CLI Usage

Flags

Server Mode

API Endpoints

API Examples

Server Flags

Docker

Environment

Model

How it works

Performance

Speed by audio length

License

About

Uh oh!

Releases 20

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🐦‍⬛ chough

Features

Supported Languages

Requirements

Installation

Arch Linux (AUR)

macOS (Homebrew)

Windows (Winget)

Binary releases

Build from source

Skill

CLI Usage

Flags

Server Mode

API Endpoints

API Examples

Server Flags

Docker

Environment

Model

How it works

Performance

Speed by audio length

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 20

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages