Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add gh actions #72

Merged
merged 25 commits into from
Sep 25, 2024
Merged

add gh actions #72

merged 25 commits into from
Sep 25, 2024

Conversation

filip-michalsky
Copy link
Collaborator

why

need to run evals before merging PRs

what changed

added GH actions script

test plan

it tests itself.

@filip-michalsky
Copy link
Collaborator Author

Also, removing instructor from dependencies.

@filip-michalsky filip-michalsky changed the title [WIP - do not merge] add gh actions [WIP] add gh actions Sep 24, 2024
Copy link
Contributor

github-actions bot commented Sep 24, 2024

Braintrust eval report

stagehand (HEAD-1727219469)

Score Average Improvements Regressions
Exact match 83.3% - -
Duration 89.82s - -

@filip-michalsky filip-michalsky changed the title [WIP] add gh actions add gh actions Sep 24, 2024
@filip-michalsky
Copy link
Collaborator Author

this one should be good to go. I gave up on adding the Braintrust Eval GH to the CI script, way too much friction

@filip-michalsky filip-michalsky merged commit ceaaed2 into main Sep 25, 2024
1 check passed
@filip-michalsky filip-michalsky deleted the fm/gh-actions-evals branch September 25, 2024 19:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants