Bench export #1147

Merged
ATheorell merged 11 commits into gpt-engineer-org:main on May 12, 2024

Conversation

@ATheorell (Collaborator) commented May 12, 2024

Adds a flag to export the benchmark results to a .yaml file. This is particularly useful for creating standardized scoreboard entries.
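For readers unfamiliar with the idea, here is a minimal sketch of what exporting benchmark results to a .yaml file could look like. It is not the code from this PR: the function names `to_yaml` and `export_results`, the result keys, and the file name are all hypothetical, and the tiny serializer only handles nested dicts of plain scalars. A real implementation would more likely rely on a YAML library.

```python
from pathlib import Path


def to_yaml(data, indent=0):
    """Serialize nested dicts of plain scalars to block-style YAML.

    Illustrative only: no quoting, no lists, no multi-line strings.
    """
    pad = "  " * indent
    lines = []
    for key, value in data.items():
        if isinstance(value, dict):
            # Nested mapping: emit the key, then the indented children.
            lines.append(f"{pad}{key}:")
            lines.append(to_yaml(value, indent + 1))
        else:
            lines.append(f"{pad}{key}: {value}")
    return "\n".join(lines)


def export_results(results, path):
    """Write the results mapping to `path` as YAML text."""
    Path(path).write_text(to_yaml(results) + "\n", encoding="utf-8")


if __name__ == "__main__":
    # Hypothetical result structure, not the schema used by the project.
    example = {
        "benchmark": "example_bench",
        "tasks": {"task_a": {"passed": 3, "total": 4}},
    }
    export_results(example, "results.yaml")
    print(Path("results.yaml").read_text(encoding="utf-8"))
```

Keeping the export as a plain mapping written to a text file means scoreboard entries from different runs can be diffed and reviewed like any other file in the repository.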

codecov bot commented May 12, 2024

Codecov Report

Attention: Patch coverage is 16.12903%, with 26 lines in your changes missing coverage. Please review.

Project coverage is 67.93%. Comparing base (5c43542) to head (5aec1eb).
Report is 26 commits behind head on main.

| Files | Patch % | Lines |
|---|---|---|
| gpt_engineer/benchmark/run.py | 0.00% | 9 Missing ⚠️ |
| gpt_engineer/benchmark/bench_config.py | 38.46% | 8 Missing ⚠️ |
| gpt_engineer/benchmark/__main__.py | 0.00% | 5 Missing ⚠️ |
| gpt_engineer/benchmark/types.py | 0.00% | 4 Missing ⚠️ |

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1147      +/-   ##
==========================================
- Coverage   70.51%   67.93%   -2.59%     
==========================================
  Files          31       32       +1     
  Lines        1733     1818      +85     
==========================================
+ Hits         1222     1235      +13     
- Misses        511      583      +72     

☔ View full report in Codecov by Sentry.

@ATheorell ATheorell merged commit 42d04c7 into gpt-engineer-org:main May 12, 2024
6 checks passed