feat: better scorers #106

c-ehrlich · 2025-10-24T00:58:26Z

No description provided.

pkg-pr-new · 2025-10-24T00:59:44Z

npm i https://pkg.pr.new/axiomhq/ai/axiom@106

commit: 7668499

Copilot

Pull Request Overview

This PR refactors the scorer system to improve type inference and ergonomics. The key changes separate scorer definitions from usage requirements, introduce automatic type inference from function arguments, and simplify the API by removing the need for explicit type parameters in most cases.

Introduces ScorerLike for flexible scorer consumption and Scorer for strict scorer definitions with a name property
Implements automatic type inference in createScorer based on the callback's argument types
Updates scorer factory to attach the name as a property instead of embedding it in the returned score

Reviewed Changes

Copilot reviewed 11 out of 12 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
packages/ai/src/evals/scorers.ts	Separates `Score` and `ScoreWithName` types, introduces `ScorerLike` vs `Scorer` distinction with `TExtra` support
packages/ai/src/evals/scorer.factory.ts	Refactors `createScorer` to infer types from arguments and attach name as function property
packages/ai/test/evals/scorer.types.test.ts	Adds comprehensive type inference tests for the new scorer system
packages/ai/src/evals/eval.types.ts	Updates type references and strengthens `OutputOf` type constraints
packages/ai/src/evals/eval.ts	Updates to use `ScorerLike` and extract scorer name from function property
packages/ai/src/evals/builder.ts	Adds TODO comment about unused function
packages/ai/README.md	Documents Node version requirement for evals
examples/example-evals-nextjs/test/feature.eval.ts	Migrates to new `Scorer` factory pattern
examples/example-evals-nextjs/src/lib/scorers.ts	Updates scorers to use typed arguments and removes optional expected handling
examples/example-evals-nextjs/src/lib/capabilities/classify-ticket/evaluations/ticket-classification.eval.ts	Adds wrapped autoevals scorer example
.github/workflows/ci.yaml	Reorders CI steps to run format check before build

Files not reviewed (1)

pnpm-lock.yaml: Language not supported

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-10-28T13:23:20Z

packages/ai/src/evals/scorer.factory.ts

+    return res;
+  };
+
+  const scorer: any = (args: TArgs) => {


Using any type annotation bypasses type safety. Consider using the inferred return type or a more specific type annotation for the scorer function.

Suggested change

const scorer: any = (args: TArgs) => {

const scorer = (args: TArgs) => {

packages/ai/src/evals/eval.ts

Copilot · 2025-10-28T13:23:21Z

packages/ai/src/evals/eval.ts

                input: data.input,
                output,
-                expected: data.expected,
+                expected: data.expected as any,


Using as any type assertion bypasses type checking. This could hide type mismatches between the data's expected value and what the scorer expects.

Suggested change

expected: data.expected as any,

expected: data.expected,

examples/example-evals-nextjs/test/feature.eval.ts

c-ehrlich added 7 commits October 23, 2025 20:22

better scorers

2439db7

improve scorer types and create scorer type tests

409c88a

better inference

ec950a8

better expected types

97f8704

types

0ffe4e7

Merge branch 'main' into better-scorers

bf3970e

add autoevals scorer

d255ac8

c-ehrlich added 5 commits October 24, 2025 11:02

undefined scorer params are unknown instead of never

2d7d9e7

fix another type issue

a9aa24c

note minimum node version in readme

183ec9c

build later

57982b7

need to build a bit earlier actually

697b9d2

c-ehrlich marked this pull request as ready for review October 28, 2025 13:21

Copilot AI review requested due to automatic review settings October 28, 2025 13:21

Copilot AI reviewed Oct 28, 2025

View reviewed changes

c-ehrlich added 4 commits October 28, 2025 20:33

dont need this

90893ff

dont need this export

7640d7c

better scorer name handling

421210b

remove as any here

7668499

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: better scorers #106

feat: better scorers #106

Uh oh!

c-ehrlich commented Oct 24, 2025

Uh oh!

pkg-pr-new bot commented Oct 24, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Oct 28, 2025

Uh oh!

Uh oh!

Copilot AI Oct 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	const scorer: any = (args: TArgs) => {
	const scorer = (args: TArgs) => {

Uh oh!

feat: better scorers #106

Are you sure you want to change the base?

feat: better scorers #106

Uh oh!

Conversation

c-ehrlich commented Oct 24, 2025

Uh oh!

pkg-pr-new bot commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

pkg-pr-new bot commented Oct 24, 2025 •

edited

Loading