Add llm specification for automated scoring #1
base: main
Conversation
@@ -0,0 +1,7 @@
[[examples]]
answer = "Reversing word order and reversing characters both have O(n) complexity, but character reversal requires more operations per word, making it slightly less efficient in practice."
points = "{ \"R1\": 1, \"R2\": 1 }"
Is there any way to simplify this for the task designer, i.e. use TOML syntax instead of a string? Escaping things like this is a bit tedious.
Also: are these weights? Because the points below are 0.5 and 0.5 for R1 and R2.
Sure, I will try to change it to TOML syntax. Also yes, you are right: it should be 0.5 per rubric for the examples. These are points, not weights. Thanks!
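A minimal sketch of how the entry could look with native TOML syntax and the corrected 0.5 points, based on the discussion above (the inline-table form for `points` is an assumption, not the final spec):

```toml
[[examples]]
answer = "Reversing word order and reversing characters both have O(n) complexity, but character reversal requires more operations per word, making it slightly less efficient in practice."
# Inline table instead of an escaped JSON string; 0.5 points per rubric
# as corrected above (sketch, not the merged spec).
points = { R1 = 0.5, R2 = 0.5 }
```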
@@ -0,0 +1,9 @@
[[rubrics]]
id = "R1"
Is it even necessary to have rubric IDs, or could they be parsed in the order they appear?
The IDs help the model understand the examples. Without them, it would be harder to tell which rubric was correctly solved for each example, especially when there are more than two rubrics.
This is also useful for avoiding duplication of the rubrics: the model has the rubric text, but might not reliably infer the ordering from a prompt as unstructured and fairly large as ours. Also, as an educator it's easier to give them a clear ID like "asymptotically_equivalent" instead of an abstract "R2", which then makes it easier to create examples for the LLM. I will change the R1, R2 IDs to something along those lines, as sketched below.
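A hedged sketch of a rubric with a descriptive ID and a matching example entry (the `asymptotically_equivalent` name comes from the comment above; the exact fields and the inline-table `points` form are assumptions):

```toml
[[rubrics]]
# Descriptive ID the examples can reference, instead of an abstract "R2".
id = "asymptotically_equivalent"

[[examples]]
# Illustrative answer text, not taken from the PR.
answer = "Both approaches are O(n), so they are asymptotically equivalent."
# Points keyed by the descriptive rubric ID (sketch).
points = { asymptotically_equivalent = 0.5 }
```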
This PR contains an example exercise 04 with LLM-specific flags for automated scoring.