Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update autoevals manifest #112

Merged
merged 1 commit into from
Jan 23, 2025
Merged

Update autoevals manifest #112

merged 1 commit into from
Jan 23, 2025

Conversation

ankrgyl
Copy link
Contributor

@ankrgyl ankrgyl commented Jan 23, 2025

The RAGAS scorers require extra params.

Copy link

github-actions bot commented Jan 23, 2025

Braintrust eval report

Autoevals (update-manfiest-1737668540)

Score Average Improvements Regressions
NumericDiff 61.8% (-12pp) 8 🟢 39 🔴
Start 1737668540.53s - -
End 1737668544.67s - -
Duration 3.88s (+2.31s) - 100 🔴
Llm_duration 2.44s - -
Prompt_tokens 277.86tok (-1.37tok) 44 🟢 -
Completion_tokens 17.84tok (-0.27tok) 22 🟢 21 🔴
Total_tokens 295.7tok (-1.65tok) 62 🟢 18 🔴
Estimated_cost 0$ - -

@ankrgyl ankrgyl requested review from manugoyal and edenh January 23, 2025 22:13
@ankrgyl ankrgyl merged commit dbb7167 into main Jan 23, 2025
8 checks passed
Copy link

github-actions bot commented Jan 23, 2025

Braintrust eval report

Autoevals (main-1737670685)

Score Average Improvements Regressions
NumericDiff 72.4% (+10pp) 26 🟢 1 🔴
Start 1737670685.45s - -
End 1737670687.02s - -
Duration 1.56s (-2.31s) 100 🟢 18 🔴
Llm_duration 1.58s (-0.86s) 10 🟢 8 🔴
Prompt_tokens 279.04tok (+1.17tok) - 42 🔴
Completion_tokens 18tok (+0.33tok) - 1 🔴
Total_tokens 297.05tok (+1.5tok) - 43 🔴
Estimated_cost 0$ (+0$) - -

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants