fix: _get_offset_tokenizer immune to global fastokens patch (concurrent-pool race) by S1ro1 · Pull Request #86 · PrimeIntellect-ai/renderers

S1ro1 · 2026-06-14T14:06:17Z

Problem

load_tokenizer(use_fastokens=True) installs a process-global fastokens monkeypatch for the duration of each pool-slot's from_pretrained, holding _FASTOKENS_PATCH_LOCK only around the patch/unpatch calls — not around the load itself. Under the concurrent renderer pool (create_renderer_pool fans loads across a thread pool), _get_offset_tokenizer's "vanilla" reload (_load_tokenizer_via_auto → AutoTokenizer.from_pretrained) can run inside another slot's open patch window, come back with the offset-less fastokens shim, and get cached. From then on every offset attribution raises NotImplementedError: fastokens does not track character offsets (surfaced downstream as a 502 from the interception server).

Reproduced: a bare AutoTokenizer.from_pretrained issued during an open patch window returns an offset-less tokenizer.

Fix

In _get_offset_tokenizer, after the first load, verify offsets; if missing, reload with fastokens force-unpatched under _FASTOKENS_PATCH_LOCK (restoring the prior patch state) and re-probe before caching — so a non-offset tokenizer is never returned or cached. Verified: a reload landing in a patch window now returns an offset-capable tokenizer.

Note

Fix `_get_offset_tokenizer` to retry with fastokens patch disabled under a lock

Adds _has_offsets(tok) helper in renderers/base.py that verifies true offset support by checking is_fast and attempting tokenization with return_offsets_mapping=True.
Replaces the previous is_fast check with _has_offsets for both initial load and post-reload validation.
If the initial tokenizer fails _has_offsets, retries under _FASTOKENS_PATCH_LOCK by temporarily unpatching fastokens, reloading the tokenizer, then restoring the patch — suppressing stdout during patch/unpatch.
Only caches a tokenizer that passes _has_offsets; raises RuntimeError otherwise.
Risk: the temporary unpatch is process-global, so concurrent tokenizer loads during the retry window may see inconsistent patch state despite the lock.

^{Macroscope summarized 057f008.}

Note

Medium Risk
Touches tokenizer loading and global fastokens patch coordination on a hot path for hand-coded renderers; incorrect locking or restore logic could regress attribution or pool startup, but scope is narrow to _get_offset_tokenizer.

Overview
Fixes a concurrent renderer pool race where _get_offset_tokenizer could load and cache a fastokens shim tokenizer (no offset_mapping) if AutoTokenizer.from_pretrained ran while another slot had the process-global fastokens patch active—breaking hand-coded renderer body/scaffold attribution downstream.

_get_offset_tokenizer now uses a _has_offsets probe (fast tokenizer + successful return_offsets_mapping=True) instead of relying on is_fast alone. After the first vanilla load, if offsets are missing it reloads under _FASTOKENS_PATCH_LOCK with fastokens temporarily unpatch (restoring prior patch state), then refuses to cache until offsets are verified; otherwise it raises a clearer RuntimeError.

^{Reviewed by Cursor Bugbot for commit 057f008. Bugbot is set up for automated code reviews on this repo. Configure here.}

load_tokenizer toggles a process-global fastokens monkeypatch per pool-slot load, holding _FASTOKENS_PATCH_LOCK only around the patch/unpatch calls, not the load. Under the concurrent renderer pool, _get_offset_tokenizer's 'vanilla' reload could race an open patch window, get an offset-less fastokens-backed tokenizer, and cache it — permanently breaking offset attribution (renderers using attribute_text_segments then raise 'fastokens does not track character offsets'). Reload with the patch forced off under _FASTOKENS_PATCH_LOCK and re-probe before caching, so a poisoned (non-offset) tokenizer is never returned or cached. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

macroscopeapp · 2026-06-14T21:33:23Z

Approvability

Verdict: Approved

This is a well-scoped bug fix for a race condition in tokenizer loading. The changes add defensive detection and retry logic using existing lock infrastructure, with clear intent and limited scope to a single function.

^{You can customize Macroscope's approvability policy. Learn more.}

S1ro1 mentioned this pull request Jun 14, 2026

feat(v1): encode router-replay routed_experts into transport PrimeIntellect-ai/prime-rl#2808

Merged

style: ruff format (blank line before nested def)

057f008

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

mikasenghaas marked this pull request as ready for review June 14, 2026 21:32

mikasenghaas requested a review from hallerite June 14, 2026 21:32

macroscopeapp Bot approved these changes Jun 14, 2026

View reviewed changes

hallerite approved these changes Jun 14, 2026

View reviewed changes

hallerite merged commit 67568f7 into main Jun 14, 2026
11 checks passed

hallerite deleted the fix/offset-tokenizer-fastokens-race branch June 14, 2026 23:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: _get_offset_tokenizer immune to global fastokens patch (concurrent-pool race)#86

fix: _get_offset_tokenizer immune to global fastokens patch (concurrent-pool race)#86
hallerite merged 2 commits into
mainfrom
fix/offset-tokenizer-fastokens-race

S1ro1 commented Jun 14, 2026 •

edited by macroscopeapp Bot

Loading

Uh oh!

macroscopeapp Bot commented Jun 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

S1ro1 commented Jun 14, 2026 • edited by macroscopeapp Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Fix

Fix _get_offset_tokenizer to retry with fastokens patch disabled under a lock

Uh oh!

macroscopeapp Bot commented Jun 14, 2026

Approvability

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

S1ro1 commented Jun 14, 2026 •

edited by macroscopeapp Bot

Loading

Fix `_get_offset_tokenizer` to retry with fastokens patch disabled under a lock