Cns/test foundations by benoitcayladbx · Pull Request #45 · databrickslabs/ontobricks

benoitcayladbx · 2026-06-05T07:33:45Z

No description provided.

The Mapping dropdown lists ontology dataProperties, not SQL columns. The OWL generator often models attributes via owl:Restriction without rdfs:domain, which the parser doesn't pick up. Fixing the parser and syncing attributes on domain load.

Add an ontology precision score computed from the existing pitfalls analysis in OntoBricks. The pitfalls engine is in src/back/core/external/pitfalls/runner.py (OntologyPatternToolkit) and src/back/objects/domain/PitfallsService.py. The UI is src/front/static/ontology/js/ontology-pitfalls.js. Compute a 0–100 score from pitfall results: weight Critical (P1.x) issues more heavily than Minor (P4.x), normalized by ontology size (class + property count). Expose it in GET /ontology/pitfalls/results/{task_id} as a precision_score field. Display a circular gauge in the pitfalls UI header alongside the existing issue count. Store the score in the domain session so it appears on the domain home panel.
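A minimal sketch of how such a score could be derived is shown here. It is not the OntoBricks implementation: the `Pitfall` type, the severity weights, and the scaling are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass

# Hypothetical severity weights: P1.x (Critical) counts most, P4.x (Minor) least.
SEVERITY_WEIGHTS = {"P1": 8.0, "P2": 4.0, "P3": 2.0, "P4": 1.0}


@dataclass(frozen=True)
class Pitfall:
    code: str  # pitfall identifier such as "P1.2" or "P4.1"


def precision_score(pitfalls: list[Pitfall], class_count: int, property_count: int) -> int:
    """Return a 0-100 score; 100 means no pitfalls were reported."""
    size = max(class_count + property_count, 1)  # guard against an empty ontology
    penalty = sum(
        SEVERITY_WEIGHTS.get(p.code.split(".")[0], 1.0)  # unknown families count as Minor
        for p in pitfalls
    )
    # Assumed scaling: one Minor issue per ontology element costs the full 100 points.
    raw = 100.0 * (1.0 - penalty / size)
    return round(max(0.0, min(100.0, raw)))
```

With these assumed weights, a single Critical issue in a 20-element ontology costs eight times as much as a single Minor one, and the result is clamped so that heavily flawed ontologies bottom out at 0.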

feat (ontology) Precision score

…ties #39

OntoBricks Version
Reported on 0.4.0; confirmed still present in 0.5.0. Fixed in 0.5.0.

Resolution
Imported R2RML class/predicate URIs are now normalized to the ontology's canonical URIs at import time, matched by local name (exact-case, then case-insensitive). This fixes the persisted data at the source so the designer, diagnostics, export, and KG build all agree; unmatched URIs are left untouched.
- src/back/objects/mapping/Mapping.py: new _canonicalize_imported_uris, called from parse_r2rml.
- tests/units/mapping/test_mapping_service.py: added TestParseR2rmlUriNormalization.

Tests: uv run pytest -q → 2411 passed.

Note: Existing sessions that already imported an affected file need a re-import to pick up the canonical URIs. The UI "mapped" gate still also requires sql_query, so a table-only (rr:tableName) import without rr:sqlQuery will still display as unmapped by design.

Want me to also add a one-time migration that canonicalizes already-persisted mappings on session load (so existing imports self-heal without re-importing)?
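The matching rule described above (local name, exact case first, then case-insensitive, unmatched left alone) can be sketched as follows. This is an illustration only, not the body of `_canonicalize_imported_uris`; the helper names `local_name` and `canonicalize_uri` and the example URIs are hypothetical.

```python
def local_name(uri: str) -> str:
    """Return the part of a URI after its last '#' or '/'."""
    return uri.rstrip("#/").rsplit("#", 1)[-1].rsplit("/", 1)[-1]


def canonicalize_uri(uri: str, canonical_uris: list[str]) -> str:
    """Map an imported URI onto the ontology URI that shares its local name."""
    exact: dict[str, str] = {}
    folded: dict[str, str] = {}
    for candidate in canonical_uris:
        name = local_name(candidate)
        exact.setdefault(name, candidate)           # first exact-case owner wins
        folded.setdefault(name.lower(), candidate)  # first case-insensitive owner wins
    name = local_name(uri)
    if name in exact:
        return exact[name]
    # Fall back to a case-insensitive match; leave unmatched URIs untouched.
    return folded.get(name.lower(), uri)
```

For example, with an ontology that declares `http://onto.example/Person`, an imported `http://import.example/ns#person` would be rewritten to the ontology URI, while a URI whose local name matches nothing is returned unchanged.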

…M1.P1, T-M3)

Implements Section 9 of the Cursor-Native Superpowers (CNS) plan:

T-M0: Test Foundations (P1–P6):
- pyproject.toml: extended pytest markers (e2e, eval, mcp, db, spark, external, property, contract), [tool.coverage.run/report/xml/html] sections, dev-deps (hypothesis, syrupy, testcontainers[postgres], fastmcp, pytest-mock, pyyaml).
- pytest.ini: mirror markers + --strict-markers + --strict-config.
- tests/fixtures/factories/: 5 dataclass-based factories (OntologyFactory, R2RMLMappingFactory, TripleFactory, DomainFactory, ShaclShapeFactory).
- tests/fixtures/factories/databricks/: 5 Databricks surface mocks (MockSQLWarehouse, MockUCCatalog, MockVolume, MockFoundationModelClient, lakebase_pg testcontainers fixture).
- tests/fixtures/mlflow.py: InMemoryTraceSink + captured_traces fixture for span-tree assertions on agent code.
- tests/fixtures/mcp_client.py: InProcessMCPClient + mcp_app/mcp_client fixtures using FastMCP v2 API (list_tools / get_tool / call_tool).
- tests/fixtures/http.py: agent_mock_transport ScriptedTransport factory.
- tests/fixtures/redaction.py: redacted_caplog fixture for db-marked tests.
- scripts/check_coverage.py: per-package threshold enforcer; parses coverage.xml against ci/coverage_thresholds.yaml; exits 1 with violation table.
- ci/coverage_thresholds.yaml: per-package floors (90% project line / 80% branch overall; back/objects 95%, back/core 92%, agents 85%, mcp-server 90%, front 80%) matching §9.1 of the methodology plan.
- .github/workflows/ci.yml: added coverage-gate job (G1-pkg) + mcp-test job (G1c, runs when src/mcp-server/ touched).
- .github/workflows/nightly.yml: property tests + Playwright E2E + external smoke probe against https://fevm-ontobricks-int.cloud.databricks.com/.

T-M1.P1: SHACL unit tests (filling the 0-coverage gap):
- tests/back/core/w3c/shacl/test_shacl_parser.py: 10 tests covering happy path, multi-class parsing, constraint extraction (minCount/maxCount/pattern), and defensive paths (empty/malformed/non-SHACL input).
- tests/back/core/w3c/shacl/test_shacl_generator.py: 6 tests covering empty graph, disabled shapes, NodeShape emission, parser↔generator roundtrip, base-uri override.
- tests/back/core/w3c/shacl/test_shacl_service.py: 9 tests covering create/update/delete shape, default severity, missing-id no-op, roundtrip, pyshacl validate smoke.

T-M3: MCP integration test harness scaffold:
- tests/mcp/conftest.py: re-exports the canonical fixtures.
- tests/mcp/integration/test_tool_schemas.py: 5 tests asserting tools are registered, expected core tools present, every tool has a schema, schemas are object-typed, tool names unique.
- tests/mcp/integration/test_smoke_tools.py: 5 tests invoking list_domains / list_domain_versions / get_design_status with httpx.MockTransport routed via AsyncClient class-level patch. Asserts unknown-tool raises and 5xx is surfaced as fastmcp.ToolError.

Gap #2 fix: changelogs/ directory bootstrapped (removed `/changelogs` from .gitignore which was suppressing the .cursorrules-mandated audit trail).

Verification:
- `uv run pytest --collect-only` → 1928 tests collect cleanly, zero strict-marker warnings.
- `uv run pytest tests/back/core/w3c/shacl/ tests/mcp/` → 35/35 new tests pass.
- Full suite: 1845 passing (3 pre-existing failures in test_settings_lakebase_status.py unrelated to this change; they also fail on master).

What's left in Section 9 (follow-up work):
- T-M1.P2 SparqlTranslator direct unit tests (2407-LOC, ~120 tests)
- T-M1.P3 DigitalTwin direct unit tests (3525-LOC, ~70 tests)
- T-M1.P4 src/back/core/logging unit tests
- T-M1.P5 src/back/core/errors direct unit tests
- T-M2 integration tier (Delta sync, Lakebase, R2RML complex joins, OpenAPI/GraphQL contracts)
- T-M3 finish (all 9 MCP tools × full schema + happy + 2 failure tests)
- T-M4 Agent eval harness (requires .claude/skills/ai-feature/ from M2.P1+P2)
- T-M5 E2E nightly user journeys
- T-M6 Hypothesis property tests for W3C translators

Co-authored-by: Isaac
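To illustrate the per-package coverage floor described for scripts/check_coverage.py, here is a minimal sketch under stated assumptions. It is not the script from this PR: the function name `find_violations` is hypothetical, the floors are passed as a plain dict instead of being read from ci/coverage_thresholds.yaml, and it assumes the Cobertura-style `<package name=... line-rate=...>` elements that coverage.py writes to coverage.xml.

```python
import xml.etree.ElementTree as ET


def find_violations(coverage_xml: str, floors: dict[str, float]) -> list[tuple[str, float, float]]:
    """Return (package, actual %, floor %) for every package below its line-coverage floor."""
    root = ET.fromstring(coverage_xml)
    violations = []
    for package in root.iter("package"):
        name = package.get("name", "")
        actual = float(package.get("line-rate", "0")) * 100
        for prefix, floor in floors.items():
            # Assumed matching rule: a floor applies to packages whose name starts with its key.
            if name.startswith(prefix) and actual < floor:
                violations.append((name, actual, floor))
    return violations
```

A CI wrapper would typically print the returned rows as a table and exit with status 1 when the list is non-empty, which matches the behaviour the commit message describes.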

… changelog gate

M1: Foundation completion (closes gaps #1, #8, #10, #12, #13):
- src/.coding_rules.md (long-form rules with Fowler refactoring vocab, code-smell catalog, decision tables): closes gap #1; un-ignored in .gitignore.
- .pre-commit-config.yaml + scripts/pre-commit/{check-changelog-presence, forbid-gsd-imports}.sh: closes gap #8.
- docs/PR_REVIEW_CHECKLIST.md (12-item reviewer reference) + PR template: closes gap #12.
- commitlint.config.js + .github/workflows/lint-pr-title.yml: closes gap #10.
- .claude/worktrees/README.md (naming, lifecycle, multi-agent protocol): closes gap #13.
- .planning/ROADMAP.md: multi-task tracking surface mirroring GitHub Milestones.

M2: AI Discipline (the critical-path lifecycle that closes gap #4):
- .cursor/11-ai-feature-lifecycle.mdc (priority 90): the rule that mandates SPEC.md + dataset + MLflow URI for any change to src/agents/**.
- .claude/skills/ai-feature/{SKILL.md, SPEC.template.md}: orchestrator skill with 7-step procedure (brainstorm → SPEC → dataset → harness → impl → re-eval → ship). Path of least resistance to passing the G2 gate.
- .planning/agents/{owl_generator, ontology_assistant, auto_assignment, auto_icon_assign, dtwin_chat}/SPEC.md: scaffolds for all 5 existing agents. Proposed eval dimensions per agent; team fills tables at M2.P4.
- .github/workflows/eval-gate.yml: G2 CI gate. Four jobs: detect changed agents, check SPEC.md + eval-dimensions table, check dataset present + sized, check MLflow URI in PR body. CALIBRATION_MODE=true for first 2 weeks (reports but doesn't block); flip to false after team calibrates thresholds.

M3.P2: Changelog presence gate (closes gap #9):
- .github/workflows/changelog-presence.yml: fails PRs that touch src/ or tests/ without a matching changelogs/ diff. Bypass via 'no-changelog' label (reviewer must ack).

Verification:
- uv run pytest --collect-only -q → 1928 tests, zero strict-marker warnings
- uv run pytest tests/back/core/w3c/shacl/ tests/mcp/ -q → 35/35 pass (T-M1.P1 + T-M3 samples from prior commit still green)

What's left under CNS:
- M2.P4: build the 5 eval datasets (≥20 examples each), the hardest M2 item.
- M2.P6 (full): expand T-M3 sample to all 40+ MCP tools.
- M2.P7: eval-drift cron + mcp-ontobricks smoke probe (depends on M2.P4).
- M3.P1: ruff + mypy in CI with baseline file.
- M3.P3: enable E2E in the nightly workflow (already scaffolded).
- M4: monolith splits (DigitalTwin, SparqlTranslator, SettingsService); hard precondition is M2 fully done so refactors have an eval safety net.
- T-M1.P2-P5, T-M2, T-M3 expansion, T-M4-T-M6: section 9 testing milestones.

Co-authored-by: Isaac
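The changelog presence rule in M3.P2 reduces to a small predicate over the changed file list and the PR labels. The sketch below restates that rule in Python for clarity; the workflow itself is YAML, and the function name `changelog_gate_passes` is hypothetical.

```python
def changelog_gate_passes(changed_files: list[str], labels: list[str]) -> bool:
    """Return True when a PR satisfies the changelog presence rule."""
    if "no-changelog" in labels:
        return True  # explicit bypass; a reviewer is expected to acknowledge it
    touches_code = any(path.startswith(("src/", "tests/")) for path in changed_files)
    has_changelog = any(path.startswith("changelogs/") for path in changed_files)
    # Only PRs that touch src/ or tests/ need a matching changelogs/ diff.
    return has_changelog or not touches_code
```

Keeping the bypass check first means the label always wins, which mirrors the "reviewer must ack" escape hatch rather than silently exempting any path.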

… ruff+mypy, agent eval seeds, MCP parametrized

T-M1.P4: logging module unit tests (17/17 passing). Closes the 0%-coverage gap on src/back/core/logging/: LogManager singleton, get_logger, setup, JSONFormatter, module-level public API shims.

T-M1.P5: errors module direct unit tests (33/33). Was previously integration-only. Covers OntoBricksError base + 5 subclasses, error_code_from_class derivation, polymorphism, and the ErrorResponse pydantic model.

T-M6 sample: Hypothesis property-based tests for OWL parser ↔ generator roundtrip (3/3 with `-m property`). First W3C-translator property tests; nightly only via `property` marker. Generates configs with 1-5 classes and 0-4 properties; verifies class + object-property name sets roundtrip through the Turtle serialization.

T-M2.P4: OpenAPI contract tests (10/10). Locks the MCP↔REST contract: asserts that /api/v1/domains, /api/v1/domain/versions, /api/v1/domain/design-status are declared in the external app's OpenAPI spec (probes both /api/v1/... and mount-relative /v1/... forms). Plus shape sanity (path-count bounds, no-undocumented-v1-paths).

M3.P1: ruff + mypy in CI (closes gap #7). pyproject.toml grows [tool.ruff], [tool.ruff.lint], [tool.mypy], [[tool.mypy.overrides]] sections. Dev deps add ruff>=0.7.4 and mypy>=1.13.0. scripts/generate-mypy-baseline.sh regenerates mypy_baseline.txt; scripts/check-mypy-diff.py compares current mypy output to the baseline and exits 1 only on NEW errors. Initial baseline: 160 currently-accepted mypy errors against src/ (tests excluded). .github/workflows/ci.yml adds a `mypy-diff` job and an advisory `ruff check` step on PR-changed files only (full repo has ~3000 ruff findings; pre-commit hook gates NEW lines, full burn-down deferred).

M2.P4 seed datasets: 3-example baseline.jsonl for each of the 5 agents: agent_owl_generator, agent_ontology_assistant, agent_auto_assignment, agent_auto_icon_assign, agent_dtwin_chat. Each row uses the schema declared in .claude/skills/ai-feature/SPEC.template.md (id, input, expected {contains, schema, constraints}, tags). agent_auto_icon_assign also seeds regression.jsonl with the production icon-bug from CNS §4.6 T6 worked example. tests/eval/README.md documents the harness layout; tests/eval/thresholds.yaml pins per-agent thresholds matching each SPEC's §5. Team must expand each baseline.jsonl to ≥ 20 examples (real M2.P4 work).

M2.P6 expand: parametrized MCP tool tests (9/9). tests/mcp/integration/test_tool_parametrized.py runs shape-checks across every registered MCP tool (not just the marquee set): name is non-empty snake_case, schema has properties or no-args declaration, type='object' when declared, required is a list whose entries appear in properties, tool groups (registry, entity, design-status) are all represented. Auto-covers new tools as the team registers them.

Verification:
- uv run pytest --collect-only -q → 2000 tests collected
- uv run pytest tests/back/core/w3c/shacl/ tests/mcp/ tests/back/core/errors/ tests/back/core/logging/ tests/contract/ -q → 104 passed, 3 deselected
- uv run pytest tests/property/ -m property -q → 3 passed
- uv run python scripts/check-mypy-diff.py → OK - no new mypy errors

See changelogs/2026-05-14.log round-3 section for full detail.

Co-authored-by: Isaac
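The "fail only on NEW errors" behaviour described for the mypy baseline can be sketched as a multiset difference over error lines. This is an assumption-laden illustration, not scripts/check-mypy-diff.py itself: the helper names are hypothetical, and stripping line numbers before comparing is one plausible way to keep the baseline stable when unrelated edits shift code up or down.

```python
import re
from collections import Counter

# Matches the "path:line:" or "path:line:col:" prefix of a mypy message.
_LOCATION = re.compile(r"^(?P<path>[^:]+):\d+(?::\d+)?: ")


def normalize(line: str) -> str:
    """Drop line/column numbers so a moved error still matches its baseline entry."""
    return _LOCATION.sub(lambda m: m.group("path") + ": ", line.strip())


def new_errors(baseline: list[str], current: list[str]) -> list[str]:
    """Return current mypy error lines that have no counterpart in the baseline."""
    remaining = Counter(normalize(line) for line in baseline if ": error:" in line)
    fresh = []
    for line in current:
        if ": error:" not in line:
            continue  # skip notes and the summary line
        key = normalize(line)
        if remaining[key] > 0:
            remaining[key] -= 1  # consume one accepted occurrence
        else:
            fresh.append(line.strip())
    return fresh
```

Using a `Counter` rather than a set means a second copy of an already-accepted error in the same file is still reported as new.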

…property tests, more MCP smoke, DigitalTwin units, eval-drift workflow

T-M2.P5: GraphQL schema contract (10/10). Locks the GraphQL surface for the MCP server's query_graphql / get_graphql_schema tools and the front-end dtwin canvas. Asserts the 5 canonical routes are declared, the /dtwin/graphql/schema endpoint is either 200 SDL or 400 OntoBricksError (empty-ontology is part of the contract), and the depth-setting endpoint returns a positive integer.

T-M6 expansion: SHACL conformance (4/4) + R2RML idempotency (5/5) property tests. Extends the OWL-roundtrip pattern from round 3 to the other two W3C translators. SHACL: generated Turtle parses with rdflib, target_class roundtrips, delete/update unknown id is no-op. R2RML: semantic determinism via rdflib graph isomorphism (works around real non-determinism in column iteration order; flagged for follow-up), generated Turtle is parseable, class URIs appear in output. All under `property` marker, nightly only.

T-M3 expansion: 6 more MCP tool happy-path smoke tests covering select_domain, list_entity_types, get_status, get_graphql_schema, query_graphql, describe_entity. Each tolerates FastMCP ToolError (real backend routes can't always be mocked precisely from JSON-RPC). Discovered + corrected real parameter-name mismatches in query_graphql and describe_entity by introspecting the actual tool schemas.

T-M1.P3 sample: 25 DigitalTwin direct unit tests covering the pure-function surface: is_datatype_range, extract_local_id, is_owlrl_available, build_quality_sql, diagnose_view_error, compute_dtwin_indicator, expand_uri_aliases. Discovered + documented that extract_local_id returns input unchanged for trailing-separator URIs, flagged for M4 cleanup. Full ~70-test coverage deferred to T-M2 integration + the M4 split.

M2.P7 scaffold: .github/workflows/eval-drift.yml. Four jobs: nightly matrix eval over 5 agents, open-issue-on-drift, mcp-smoke-probe against fevm-ontobricks-int, open-issue-on-smoke-failure. Gated behind two repo variables (ONTOBRICKS_EVAL_RUNNERS_READY, ONTOBRICKS_INT_MCP_REACHABLE) so it stays inert until M2.P4 lands real runners.

ROADMAP update: .planning/ROADMAP.md status table refreshed: M2.P1-P3, P5 marked landed (45c60aa); M2.P4 partial (3-example seeds); M3.P1, P2 landed; T-M0, T-M1.P1, T-M1.P3 partial, T-M1.P4, T-M1.P5 landed; T-M6 partial (OWL + SHACL + R2RML done; SPARQL property tests open).

Verification:
- uv run pytest --collect-only -q → 2050 tests collected
- uv run pytest tests/back/core/w3c/shacl/ tests/mcp/ tests/back/core/errors/ tests/back/core/logging/ tests/back/core/digitaltwin/ tests/contract/ -q → 145 passed
- uv run pytest tests/property/ -m property -q → 12 passed
- uv run python scripts/check-mypy-diff.py → OK - no new mypy errors

See changelogs/2026-05-14.log round-4 section for full detail.

Co-authored-by: Isaac

Lands the representative slice of SparqlTranslator direct unit tests called for in §9.5 T-M1.P2. Full target was ~120 tests covering each visitor + each SPARQL op family; SparqlTranslator.py is 2407 LOC with a single public method (`translate_sparql_to_spark`). This sample exercises the public API end-to-end against canonical inputs, leaving per-visitor expansion as a focused follow-up PR.

Coverage (21 tests, 8 classes):
- Return-shape contract (dict with success/sql/variables keys).
- Single-variable SELECT alias + FROM clause emission.
- LIMIT propagation (explicit, default, parametrized [1, 100, 1000]).
- Multi-variable SELECT (rdfs:label projection).
- Entity-mapping respected (catalog/schema/table appear in output SQL).
- SQL safety: no statement terminator inside body; no IRI-borne SQL injection.
- Error path: missing mapping, empty SPARQL, invalid SPARQL, unclosed brace, non-SELECT (CONSTRUCT) all raise ValidationError (per §4 coding rule: translators raise from the OntoBricksError hierarchy, routes translate to HTTP).

Discovered + documented during test authoring: the translator's contract is to raise `ValidationError` on malformed input, NOT to return `{"success": False}`. Tests were corrected to match the actual contract; this matches the OntoBricksError pattern documented in §4 of src/.coding_rules.md.

ROADMAP: T-M1.P2 flipped from open to partial-landed. Expansion path called out: per-visitor BGP/FILTER/OPTIONAL/UNION/GROUP BY/ORDER BY/property paths (~100 more tests).

Verification:
- uv run pytest tests/back/core/w3c/sparql/ -q → 21 passed
- uv run pytest --collect-only -q → 2071 tests total

Co-authored-by: Isaac
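The "raise, don't return `{"success": False}`" contract can be illustrated with a toy stand-in. Everything below is hypothetical: the class bodies, the status codes, `toy_translate`, and `to_http` are invented for the example and are not the OntoBricks error module or SparqlTranslator.

```python
class OntoBricksError(Exception):
    """Toy base class; the status code is an assumed default."""
    status_code = 500


class ValidationError(OntoBricksError):
    status_code = 400


def toy_translate(sparql: str, mapping: dict) -> dict:
    """Toy translator: rejects bad input by raising, never by returning a failure dict."""
    if not sparql.strip():
        raise ValidationError("empty SPARQL query")
    # Only the leading keyword is checked here; a real parser would handle PREFIX etc.
    if not sparql.lstrip().upper().startswith("SELECT"):
        raise ValidationError("only SELECT queries are supported")
    if not mapping:
        raise ValidationError("missing entity mapping")
    return {"success": True, "sql": "SELECT 1", "variables": []}


def to_http(exc: OntoBricksError) -> tuple[int, dict]:
    """Route-layer translation of a domain error into a status code and JSON body."""
    return exc.status_code, {"error": type(exc).__name__, "message": str(exc)}
```

Under this split, a test for the translator asserts on the raised exception type, while a test for the route asserts on the status code produced by the translation step.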

…ine units

The upstream merge brought a new `agent_cohort` agent + ~3000 LOC of business logic (CohortService 609 LOC, _BuildPipeline 1006 LOC). Two gaps remained:
1. agent_cohort had no SPEC.md scaffold and no eval dataset, which the G2 CI gate (.cursor/12-ai-feature-lifecycle.mdc + .github/workflows/eval-gate.yml) would block on the next PR touching src/agents/agent_cohort/**.
2. CohortService had only ~3 indirect references in test_digitaltwin_api.py; _BuildPipeline had zero direct unit tests.

Added:
- .planning/agents/agent_cohort/SPEC.md (retroactive scaffold)
- tests/eval/datasets/agent_cohort/baseline.jsonl (3-example seed)
- tests/eval/thresholds.yaml: cohort: block
- .planning/agents/README.md: status row for agent_cohort
- tests/back/core/digitaltwin/test_cohort_service_units.py (39 tests)
- tests/back/core/digitaltwin/test_build_pipeline_units.py (15 tests)

Coverage of the new code:
- CohortService._snake_case, _result_to_dict, _enrich_members, probe_uc_write, suggest_uc_target: all branches covered including store-exception fall-through and the catalog/schema priority chain.
- _BuildPipeline.__init__ derived state (is_api, actual_mode, cfg_forced_full) and _log_phase elapsed-time recorder.

Verification: 232 CNS tests pass (was 178); 2373 total collected (was 2319; +54 new).

Co-authored-by: Isaac

Pre-flight for the integration deploy surfaced 3 failures in test_settings_lakebase_status.py, all caused by the production code's optional-extra import gate (`import psycopg`) short-circuiting before the mocked RegistryFactory.lakebase was reached. The `[lakebase]` extra isn't part of the base venv, so the gate triggers locally.

Fix: add a `psycopg_installed` fixture that stubs `sys.modules["psycopg"]` with a MagicMock. Applied to the 5 tests that need to reach past the gate. The deliberate-miss test (`test_returns_false_pair_when_psycopg_missing`) stays unchanged because it is testing the gate itself.

After the fix the full suite is clean:

    uv run pytest -q --ignore=tests/e2e --ignore=tests/property
    # 2281 passed

Deployment to https://fevm-ontobricks-int.cloud.databricks.com/ via:
- ontobricks-030 (FastAPI UI): /healthz returns 200
- mcp-ontobricks: /mcp returns 302 (OAuth redirect, expected)

DAB target dev (no Lakebase project in int); all resources created in ontobricks_int_catalog.ontobricks.

Co-authored-by: Isaac
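The stubbing technique behind the `psycopg_installed` fixture can be sketched with the standard library alone. This is an illustration, not the fixture from the PR: `lakebase_driver_available` is a hypothetical stand-in for the production import gate, and the stub is written as a context manager so the example runs without pytest (a fixture would wrap the same `patch.dict` call around a `yield`).

```python
import importlib
import sys
from contextlib import contextmanager
from unittest.mock import MagicMock, patch


def lakebase_driver_available() -> bool:
    """Hypothetical import gate: True only when `psycopg` can be imported."""
    try:
        importlib.import_module("psycopg")
    except ImportError:
        return False
    return True


@contextmanager
def psycopg_installed():
    """Make `import psycopg` succeed by placing a MagicMock in sys.modules."""
    with patch.dict(sys.modules, {"psycopg": MagicMock(name="psycopg")}):
        yield  # the original sys.modules content is restored on exit
```

Because `patch.dict` restores `sys.modules` afterwards, the stub cannot leak into a deliberate-miss test; that test can force the opposite case with `patch.dict(sys.modules, {"psycopg": None})`, which makes the import raise `ImportError`.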

The round-8 deploy ran with --no-bootstrap (intentional while we were still validating); the Access Denied page surfaced. Ran scripts/bootstrap-app-permissions.sh against ontobricks-030 + mcp-ontobricks in the int workspace:
- ontobricks-030 SP -> CAN_MANAGE on ontobricks-030
- mcp-ontobricks SP -> CAN_MANAGE on mcp-ontobricks
- mcp-ontobricks SP -> CAN_USE on ontobricks-030

/access-denied?reason=bootstrap now returns 302 (redirects to the app). /healthz still 200.

Co-authored-by: Isaac

Asked to run integration tests against the deployed int instance. The existing tests/integration + tests/e2e suites use in-process TestClient / local Uvicorn, so they integration-test the codebase, not a deployed instance. Wrote a new tests/live_integration/ suite that points at the real URLs with a workspace OAuth Bearer minted from the active Databricks CLI profile, gated behind ONTOBRICKS_LIVE_BASE.

Production bug surfaced + fixed in passing: The round-8 deploy started the MCP app's compute but never deployed source code to it (`bundle run ontobricks_dev_app` only, never `bundle run mcp_ontobricks_app`). `/mcp` returned 502 Bad Gateway. Ran the MCP bundle deploy; FastMCP now serves proper JSON-RPC frames and a full Streamable-HTTP handshake returns the tool inventory.

Test assumptions corrected by the live probes:
- /healthz returns 200 with empty body (not JSON).
- OpenAPI does not use /api/v1 or /api/routers/internal prefixes; the actual surface is top-level (/dtwin/, /ontology/, /mapping/, /settings/, /domain/, /tasks/, /graphql, /api/help/...).
- MCP needs the full Streamable-HTTP session handshake (initialize -> Mcp-Session-Id header -> notifications/initialized -> tools/list).

Verification:

    ONTOBRICKS_LIVE_BASE=https://ontobricks-030-7474657573264612.aws.databricksapps.com \
    ONTOBRICKS_LIVE_MCP_BASE=https://mcp-ontobricks-7474657573264612.aws.databricksapps.com \
    DATABRICKS_CONFIG_PROFILE=fevm-ontobricks-int \
    uv run pytest tests/live_integration/ -v -m live_integration --no-cov
    # 17 passed in 12.18s

Followup tickets noted in the changelog:
- scripts/deploy.sh should also run the MCP app via bundle run.
- The post-deploy verification block reports NOT DEPLOYED misleadingly.

Co-authored-by: Isaac
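The handshake order listed above (initialize -> Mcp-Session-Id header -> notifications/initialized -> tools/list) can be made concrete by building the JSON-RPC frames without sending them. This is a sketch under assumptions: the function names, the client name, and the protocol version string are illustrative choices, and no HTTP client is involved, so it shows only the message shapes, not the probe used in tests/live_integration/.

```python
from typing import Optional


def handshake_frames(client_name: str = "live-probe",
                     protocol_version: str = "2025-03-26") -> list[dict]:
    """Return the three JSON-RPC messages of a minimal MCP session, in send order."""
    initialize = {
        "jsonrpc": "2.0",
        "id": 1,
        "method": "initialize",
        "params": {
            "protocolVersion": protocol_version,  # assumed version string
            "capabilities": {},
            "clientInfo": {"name": client_name, "version": "0.0.0"},
        },
    }
    # Notifications carry no "id" because no response is expected.
    initialized = {"jsonrpc": "2.0", "method": "notifications/initialized"}
    tools_list = {"jsonrpc": "2.0", "id": 2, "method": "tools/list"}
    return [initialize, initialized, tools_list]


def session_headers(token: str, session_id: Optional[str] = None) -> dict[str, str]:
    """Headers for a Streamable-HTTP POST; the session id is added once the server issues one."""
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
        "Accept": "application/json, text/event-stream",
    }
    if session_id:
        headers["Mcp-Session-Id"] = session_id
    return headers
```

A probe would post the first frame, read `Mcp-Session-Id` from the response headers, and pass it to `session_headers` for the two remaining requests.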

Initial e2e run produced 80 errors at fixture setup with the message "Failed to start test server" because the conftest piped uvicorn stdout/stderr to DEVNULL. Replaced with a session log file at tests/e2e/_e2e_server.log; pytest.fail() now prints the last 4 KB of uvicorn output. (.gitignore updated for the log file.)

The actual issue with the original 80-error run was timing: the server starts fine in 3-5s, just not always inside the 20s wait window when other processes contend. With the better diagnostics in place, subsequent runs succeed.

Two-pass results:
- Pass A (default fake creds DATABRICKS_HOST=test.databricks.com): 74 passed, 6 failed in 371s. The 6 failures are all Playwright timeouts on routes that fan out to Databricks (/dtwin/, /domain owl-content + r2rml, /resolve).
- Pass B (real int OAuth + warehouse fcdf5a06992ad225): 80 passed in 92.80s.

The 6 Databricks-dependent tests need either:
- @pytest.mark.external so they're nightly-only with real creds, or
- mocked outbound calls so they pass under fake env.

Tracked as a followup in the changelog.

Co-authored-by: Isaac
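Reading only the tail of the server log keeps the failure message short even when the log grows large. A minimal sketch of that step follows; the function name `tail_text` is hypothetical and the 4096-byte default simply mirrors the "last 4 KB" mentioned above.

```python
import os


def tail_text(path: str, limit: int = 4096) -> str:
    """Return at most the last `limit` bytes of a file, decoded leniently."""
    if not os.path.exists(path):
        return ""  # the server may have died before writing anything
    with open(path, "rb") as handle:
        handle.seek(0, os.SEEK_END)
        size = handle.tell()
        handle.seek(max(size - limit, 0))
        # errors="replace" tolerates a multi-byte character cut at the seek point.
        return handle.read().decode("utf-8", errors="replace")
```

In a conftest this would typically feed a call such as `pytest.fail("Failed to start test server\n" + tail_text(log_path))`.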

`pytest tests/e2e/` now produces 80/80 green out of the box. Previously 6 tests timed out because the conftest set DATABRICKS_HOST to test.databricks.com (unreachable) and the routes that fan out to a Databricks workspace hung waiting for upstream calls.

The `_set_env` fixture now mints a workspace OAuth token from the Databricks CLI profile named by ONTOBRICKS_E2E_PROFILE (default fevm-ontobricks-int) and exports it as DATABRICKS_HOST/TOKEN into the subprocess.

Three modes:
- default: auto-mint from the configured CLI profile
- caller creds: respect pre-set DATABRICKS_HOST + DATABRICKS_TOKEN
- ONTOBRICKS_E2E_FAKE_CREDS=1: fall back to test.databricks.com (6 DB-dependent tests will time out, intentional for non-int CI)

With no working profile and no fake-creds opt-in, the suite skips with an actionable error message.

Verified all three modes; default produces 80 passed in 59.86s.

Co-authored-by: Isaac
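The mode selection can be sketched as a pure function over the environment. This is not the `_set_env` fixture: the function name, the `mint_token` callback, the placeholder token, and in particular the precedence order (caller creds, then fake-creds opt-in, then auto-mint, then skip) are assumptions made for illustration.

```python
from typing import Callable, Mapping, Optional

DEFAULT_PROFILE = "fevm-ontobricks-int"


def resolve_e2e_creds(
    env: Mapping[str, str],
    mint_token: Callable[[str], Optional[tuple[str, str]]],
) -> tuple[str, dict[str, str]]:
    """Return (mode, variables to export) for the e2e server subprocess."""
    host, token = env.get("DATABRICKS_HOST"), env.get("DATABRICKS_TOKEN")
    if host and token:
        return "caller", {"DATABRICKS_HOST": host, "DATABRICKS_TOKEN": token}
    if env.get("ONTOBRICKS_E2E_FAKE_CREDS") == "1":
        # Unreachable host: workspace-dependent tests are expected to time out.
        return "fake", {"DATABRICKS_HOST": "test.databricks.com",
                        "DATABRICKS_TOKEN": "fake-token"}
    profile = env.get("ONTOBRICKS_E2E_PROFILE", DEFAULT_PROFILE)
    minted = mint_token(profile)  # hypothetical hook; returns (host, token) or None
    if minted:
        return "minted", {"DATABRICKS_HOST": minted[0], "DATABRICKS_TOKEN": minted[1]}
    return "skip", {}
```

Passing the minting step in as a callback keeps the decision logic testable without a Databricks CLI profile on the machine.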

Both apps redeployed to fevm-ontobricks-int after the rebase off develop:
- ontobricks-030 (FastAPI UI): RUNNING, /healthz returns 200
- mcp-ontobricks (FastMCP): ACTIVE, /mcp speaks proper JSON-RPC

Full suite executed:
- Unit + integration: 2329 passed, 2 pre-existing failures (TestDomainVersions on develop, psycopg not installed; identical failure on stock upstream/develop)
- Property tests (hypothesis): 12 passed
- Live integration probes against the redeployed instance: 17 passed
- E2E (Playwright, defaulting to int via the auto-mint conftest): 258 passed

Total: 2616 tests pass, 2 pre-existing failures unrelated to the deploy.

Also restores the `!changelogs/*.log` negation rule in .gitignore (was lost when the master-merge commit got dropped during the rebase off develop).

Co-authored-by: Isaac

The 2 pre-existing failures in TestDomainVersions weren't actually about psycopg being missing. The route was refactored to do an early RegistryCfg.from_session(...).is_configured check and short-circuit with 400 if the registry isn't configured. The tests only mocked DigitalTwin and RegistryService, not the cfg-resolution path.

Fix:
- pyproject.toml: add psycopg[binary] + psycopg-pool to dev deps so a plain `uv sync` installs them (mirrors the lakebase optional extra). Sticks the install so it doesn't fall out on a fresh sync.
- test_external_api.py: pass registry_catalog/_schema/_volume as query-string overrides on both tests so the cfg gate is satisfied and the mocked RegistryService is reached as intended.

Verification across all 4 suites:
- Unit + integration: 2373 passed, 0 failed (was 2329 passed, 2 failed)
- Property: 12 passed
- Live integration (against int): 17 passed
- E2E (Playwright on int): 258 passed
- Total: 2660 tests pass, 0 failures.

Co-authored-by: Isaac

Benoit Cayla and others added 28 commits May 28, 2026 07:46

release Num update

a7226e9

feat (DT) Graph Chat — Streaming

a272552

feat (Mapping) Exclude All Unmapped Button

8b9659e

Merge pull request #44 from databrickslabs/feat/ontology-Precision-Score

5edfdfb

feat (ontology) Precision score

create auto-generate rules

d6a4238

feat (ontology) cleanup DQ rules

a9024aa

fix (DT) issue with DT deletion

71ecde4

feat (settings) change UI & navigation

147e96c

feat (settings) clean all Domain/build objects

24582c1

feat (ontology) generation iteration improvements

f6d9800

feat (ontology) SWRL auto-generate and improvements

7398f2c

feat (DT) Build-run tracing in the registry

5846313

benoitcayladbx requested a review from a team as a code owner June 5, 2026 07:33

Version change

9c186ac

benoitcayladbx wants to merge 29 commits into master from cns/test-foundations