feat(reward): sentinel-based exchange-count bypass for cron episodes by chiefmojo · Pull Request #1848 · MemTensor/MemOS

chiefmojo · 2026-06-01T01:53:02Z

Summary

Adds a config-driven bypass so cron-initiated episodes can be scored even when they fall below the minExchangesForCompletion threshold.

Problem

Scheduled agent jobs produce single-exchange episodes (the cron prompt + agent response). With the default minExchangesForCompletion: 1 floor these score fine, but tighter settings — or multi-turn flows that cron fires as one composite turn — hit the "too few exchanges" skip condition. The cron session produces real work worth scoring but the exchange-count gate silently discards it.

Solution

RewardConfig.cronSentinels: string[] — an array of substrings. If any user message in the episode starts with one of these strings, the exchange-count check (gate 1) is bypassed. All other quality gates (content length, triviality, tool-heavy ratio) remain active.

The default is an empty array ([]), so behavior is unchanged for existing deployments. Operators who run scheduled agent jobs add their sentinel string to config.yaml:

algorithm:
  reward:
    cronSentinels: ["[IMPORTANT: You are running as a scheduled cron job"]

Design notes

Only gate 1 (exchange count) is bypassed — content and triviality gates still apply.
Sentinel match is prefix-based (message.startsWith(sentinel)).
The bypass is transparent in logs: skipped-by-cron episodes show a distinct log tag.

Test plan

Episode with 1 exchange and matching sentinel: passes exchange-count gate, other gates still apply
Episode with 1 exchange and no matching sentinel: still skipped by exchange-count gate (default behavior)
Empty cronSentinels array: no change to existing behavior
Trivial cron episode (matching sentinel but trivial content): still skipped by content gate

🤖 Generated with Claude Code

Cron jobs always produce exactly 1 user↔agent exchange — the task prompt plus one reply — so minExchangesForCompletion: 2 zero-scores every cron episode before content is even evaluated. This starves L2 induction of signal after the bridge stabilises. Adds `cronSentinels` to RewardConfig (schema, defaults, types). When the first user turn starts with a sentinel prefix, check 1 (exchange count) is skipped; content/triviality checks still apply. Default sentinel covers the Hermes cron prompt. The `snapshot.meta?.initialUserText` fallback handles episodes scored during recovery when turns aren't materialised. If the field is absent the episode falls back to the old skip behaviour — no false positives. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

docs(memos-local-plugin): clarify install path and stale dir names (MemTensor#1540) The README's 'Quick start' section told users to use install.sh instead of npm install, but the warning was buried and users still tried 'npm install -g @memtensor/memos-local-plugin' first. The reporter in MemTensor#1540 encountered this on a Hermes deployment. This change: - Promotes the 'do not run npm install -g' notice to a prominent IMPORTANT callout explaining why global install is wrong (no agent-home deploy, no config.yaml, no bridge/viewer) and that the tarball intentionally ships built artifacts only. - Adds a Troubleshooting subsection covering the two specific symptoms in the bug report: the 'package not found' misread, and the stale web/ and site/ directory names (web/ is now viewer/, site/ was removed by commit 26e7e3d). - Mentions install.ps1 for Windows alongside install.sh. - CHANGELOG: record the docs fix and reference MemTensor#1540. Documentation-only change; no code or runtime behavior touched. Co-authored-by: MemOS AutoDev <autodev@memtensor.ai> Co-authored-by: Matthew <heimixiaozhuang@zju.edu.cn>

…_() got an unexpected keyword a (MemTensor#1889) fix: remove invalid chunker parameter from SystemParser test instantiation - SystemParser.__init__() signature changed to (embedder, llm=None) - Test was still passing chunker=None causing TypeError - Fixes all 5 failing tests in test_system_parser.py Fixes MemTensor#1888 Co-authored-by: MemOS AutoDev <autodev@memos.ai> Co-authored-by: Matthew <heimixiaozhuang@zju.edu.cn>

…tributeError when given None (MemTensor#1884) * test: add comprehensive tests for clean_json_response (issue MemTensor#1525) - Add test suite in tests/mem_os/test_format_utils.py - Cover None input ValueError with diagnostic message - Cover markdown removal, whitespace stripping, edge cases - Verify fix for AttributeError when LLM returns None * style: format clean_json_response tests --------- Co-authored-by: MemOS AutoDev <autodev@memos.ai> Co-authored-by: Matthew <heimixiaozhuang@zju.edu.cn>

…date_cube_access — fails for ev (MemTensor#1903) fix: validate current user not target in share_cube_with_user (MemTensor#1901) share_cube_with_user(cube_id, target_user_id) called _validate_cube_access(cube_id, target_user_id), but the validator signature is (user_id, cube_id). The cube_id therefore landed in the user_id slot and _validate_user_exists raised "User '<cube_id>' does not exist or is inactive" for every well-formed call, making the API unusable. The in-code comment "Validate current user has access to this cube" already documented the correct intent: the sharing user (self.user_id) must have access to the cube being shared, not the target. Switch the call to self._validate_cube_access(self.user_id, cube_id). The target user's existence is independently checked on the next line via validate_user(target_user_id), so that path is unchanged. Add regression tests in tests/mem_os/test_memos_core.py that pin down: - validate_user_cube_access is consulted with (self.user_id, cube_id), - add_user_to_cube is called with (target_user_id, cube_id) on success, - a missing target raises "Target user '<id>' does not exist". Closes MemTensor#1901 Co-authored-by: MemOS AutoDev Bot <autodev@memtensor.local> Co-authored-by: Matthew <heimixiaozhuang@zju.edu.cn>

Memtensor-AI · 2026-07-02T12:29:46Z

Automated Test Results: PASSED

Cloud test-engine rerun against dev-v2.0.22 completed successfully.

Run: tr-dfe46f67-699 on cloud test-engine 10012
memos_local_plugin/unit: 6 passed, 0 failed, 0 skipped

Manual code review is still required before merge.

Memtensor-AI changed the base branch from main to dev-20260604-v2.0.19 June 10, 2026 15:41

Memtensor-AI and others added 5 commits June 14, 2026 17:24

Merge branch 'dev-20260604-v2.0.19' into pr/cron-episode-scoring-bypass

3596ef9

Memtensor-AI changed the base branch from dev-20260604-v2.0.19 to dev-v2.0.22 July 1, 2026 13:16

CarltonXiang deleted the branch MemTensor:main July 3, 2026 07:25

CarltonXiang closed this Jul 3, 2026

syzsunshine219 reopened this Jul 3, 2026

syzsunshine219 added the needs-audit Requires manual audit before merge label Jul 3, 2026

syzsunshine219 changed the base branch from dev-v2.0.22 to main July 3, 2026 08:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(reward): sentinel-based exchange-count bypass for cron episodes#1848

feat(reward): sentinel-based exchange-count bypass for cron episodes#1848
chiefmojo wants to merge 6 commits into
MemTensor:mainfrom
chiefmojo:pr/cron-episode-scoring-bypass

chiefmojo commented Jun 1, 2026

Uh oh!

Memtensor-AI commented Jul 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

chiefmojo commented Jun 1, 2026

Summary

Problem

Solution

Design notes

Test plan

Uh oh!

Memtensor-AI commented Jul 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants