Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integrate Auto-GPT with Auto-GPT-Benchmarks #4987

Merged
merged 12 commits into from
Jul 22, 2023
Merged

Conversation

waynehamadi
Copy link
Contributor

@waynehamadi waynehamadi commented Jul 15, 2023

Background

We want the master branch to be compatible with Auto-GPT-Benchmarks, so we always have the most up to date score in the benchmark.

Changes

Screenshot 2023-07-15 at 10 51 36 AM - regression_tests.json stores the regression tests - benchmarks.py connects Auto-GPT to the benchmark's interface. - config.json defines the entry_path, the workspace location and the time a challenge should run.

Documentation

Test Plan

PR Quality Checklist

  • My pull request is atomic and focuses on a single change.
  • I have thoroughly tested my changes with multiple different prompts.
  • I have considered potential risks and mitigations for my changes.
  • I have documented my changes clearly and comprehensively.
  • I have not snuck in any "extra" small tweaks changes.
  • I have run the following commands against my code to ensure it passes our linters:
    black .
    isort .
    mypy
    autoflake --remove-all-unused-imports --recursive --ignore-init-module-imports --ignore-pass-after-docstring autogpt tests --in-place

@waynehamadi waynehamadi marked this pull request as draft July 15, 2023 16:56
@netlify
Copy link

netlify bot commented Jul 15, 2023

Deploy Preview for auto-gpt-docs canceled.

Name Link
🔨 Latest commit e75b9d5
🔍 Latest deploy log https://app.netlify.com/sites/auto-gpt-docs/deploys/64bc3bd396545d000805389a

@waynehamadi waynehamadi force-pushed the benchmark-integration branch from 1830f56 to edbfbdb Compare July 15, 2023 17:07
@codecov
Copy link

codecov bot commented Jul 15, 2023

Codecov Report

Patch and project coverage have no change.

Comparison is base (e0bcde1) 51.00% compared to head (e75b9d5) 51.00%.

Additional details and impacted files
@@           Coverage Diff           @@
##           master    #4987   +/-   ##
=======================================
  Coverage   51.00%   51.00%           
=======================================
  Files         119      119           
  Lines        4968     4968           
  Branches      662      662           
=======================================
  Hits         2534     2534           
  Misses       2239     2239           
  Partials      195      195           

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

@waynehamadi waynehamadi force-pushed the benchmark-integration branch 3 times, most recently from cf30bbd to 8f6e811 Compare July 15, 2023 17:34
@waynehamadi waynehamadi marked this pull request as ready for review July 15, 2023 17:50
collijk
collijk previously approved these changes Jul 16, 2023
@github-actions github-actions bot added the conflicts Automatically applied to PRs with merge conflicts label Jul 20, 2023
@github-actions
Copy link
Contributor

This pull request has conflicts with the base branch, please resolve those so we can evaluate the pull request.

@waynehamadi waynehamadi force-pushed the benchmark-integration branch 3 times, most recently from 017ebdf to 0f9431a Compare July 22, 2023 20:19
@waynehamadi waynehamadi force-pushed the benchmark-integration branch from 0f9431a to 6e51216 Compare July 22, 2023 20:22
@github-actions github-actions bot removed the conflicts Automatically applied to PRs with merge conflicts label Jul 22, 2023
@github-actions
Copy link
Contributor

Conflicts have been resolved! 🎉 A maintainer will review the pull request shortly.

@waynehamadi waynehamadi force-pushed the benchmark-integration branch from 6e51216 to 02396d6 Compare July 22, 2023 20:25
@waynehamadi waynehamadi force-pushed the benchmark-integration branch from 02396d6 to e75b9d5 Compare July 22, 2023 20:28
@waynehamadi waynehamadi merged commit 4ada7d1 into master Jul 22, 2023
@waynehamadi waynehamadi deleted the benchmark-integration branch July 22, 2023 21:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

4 participants