Hack for evaluating Claude 3 on LegalBench #6259

Triggered via pull request: May 1, 2024 20:12
Status: Success
Total duration: 12m 39s
test.yml

on: pull_request
Matrix: Run HELM with minimal dependencies only
Matrix: Run all tests