v0.2.1

@danmcp danmcp released this 23 Sep 14:10
· 82 commits to main since this release
53d6abf

What's Changed

  • update README by @sallyom in #108
  • Use single answer file and model list (backport #110) by @mergify in #112
  • mergify: add mergify configuration by @nathan-weinberg in #114
  • Bump step-security/harden-runner from 2.8.1 to 2.9.1 by @dependabot in #94
  • ci: move E2E runner from github to AWS by @nathan-weinberg in #118
  • docs: add initial release strategy doc and CHANGELOG by @nathan-weinberg in #91
  • CI: Fix working directories to be relative by @danmcp in #120
  • Bump actions/setup-python from 5.1.1 to 5.2.0 by @dependabot in #119
  • Bump actions/checkout from 4.1.6 to 4.1.7 by @dependabot in #116
  • build(deps): bump pypa/gh-action-pypi-publish from 1.9.0 to 1.10.0 by @dependabot in #122
  • ci: add AWS tags to show github ref and PR num for all jobs by @nathan-weinberg in #123
  • Bump rojopolis/spellcheck-github-actions from 0.38.0 to 0.41.0 by @dependabot in #96
  • build(deps): bump pypa/gh-action-pypi-publish from 1.10.0 to 1.10.1 by @dependabot in #124
  • build(deps): bump hynek/build-and-inspect-python-package from 2.6.0 to 2.9.0 by @dependabot in #125
  • build(deps): bump DavidAnson/markdownlint-cli2-action from 16.0.0 to 17.0.0 by @dependabot in #126
  • build(deps): bump step-security/harden-runner from 2.9.1 to 2.10.1 by @dependabot in #127
  • Add comment to make it clear how the code is working by @danmcp in #105
  • Allow for external serving to be used with mmlu by @danmcp in #99
  • Better path and string handling by @danmcp in #106
  • Improve logging by @danmcp in #111
  • Cleanup usage of load model answers by @danmcp in #115
  • add option to pass 'api_key' to gen_answers, judge_answers by @sallyom in #128
  • e2e: only run PR job if certain files are changed by @nathan-weinberg in #131
  • Allow max_workers to be passed in after evaluator is created by @danmcp in #107
  • Remove fastchat dependency by @danmcp in #98

Full Changelog: v0.2.0...v0.2.1