Feat set token limits based on model #4498
Conversation
You changed AutoGPT's behaviour. The cassettes have been updated and will be merged to the submodule when this Pull Request gets merged.
Codecov Report

Patch coverage:

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #4498      +/-   ##
==========================================
- Coverage   69.70%   69.67%   -0.04%
==========================================
  Files          72       72
  Lines        3562     3558       -4
  Branches      569      569
==========================================
- Hits         2483     2479       -4
  Misses        890      890
  Partials      189      189

☔ View full report in Codecov by Sentry.
This pull request has conflicts with the base branch, please resolve those so we can evaluate the pull request.
Low priority is relative; it's one of the more frequent issues on Discord and GitHub: https://discord.com/channels/1092243196446249134/1092243196798582930/1115737448580907180
Looks good to me :)
Conflict is because
Hell yeah, nice one
Deployment failed with the following error:
Conflicts have been resolved! 🎉 A maintainer will review the pull request shortly.
We may still want to support changing this at runtime, so that the agent can exit gracefully with a corresponding error message and the LLM can adjust some variables at runtime: https://discord.com/channels/1092243196446249134/1092423060923101304/1115963736642035732
Thanks @Pwuts for getting this merged! I went away for a couple of days and was pleasantly surprised to see this merged lol
Background
Every day we see people come into the Discord explaining how they ran into a token limit, because most people are still on a GPT-3.0/3.5 key. The maintainers built the app to assume an 8000-token limit (i.e. a GPT-4 key), and at the moment this just causes friction because, in reality, most people do not have a GPT-4 key.
To meet in the middle, we respect the user's configured token limit but compare it to the maximum token limit of the chosen smart/fast LLM, and set the effective limit 15% lower than that maximum to ensure the prompt and packaging are accounted for.
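To illustrate the idea, here is a rough sketch of the clamping. The token table, the HEADROOM constant, and clamp_token_limit are made-up names for this example, not the identifiers used in the actual diff:

```python
# Hedged sketch: cap a user-configured token limit at ~85% of the model's maximum,
# leaving headroom for the prompt and response packaging.
# The model names and limits below are illustrative assumptions, not AutoGPT's real table.
MODEL_MAX_TOKENS = {
    "gpt-3.5-turbo": 4096,
    "gpt-4": 8192,
}

HEADROOM = 0.15  # reserve 15% of the context window


def clamp_token_limit(model: str, requested_limit: int) -> int:
    """Respect the user's limit, but never exceed 85% of what the model supports."""
    max_tokens = MODEL_MAX_TOKENS.get(model, 4096)
    ceiling = int(max_tokens * (1 - HEADROOM))
    return min(requested_limit, ceiling)


# A user on a GPT-3.5 key with the old 8000-token default no longer overruns the API:
print(clamp_token_limit("gpt-3.5-turbo", 8000))  # -> 3481
```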
Changes
Use the model's max_token_limits for both the fast and smart LLM (see the sketch below)
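Roughly, this could be wired into the config like the following. The Config field names mirror AutoGPT's fast/smart settings, but this is an illustrative sketch under those assumptions, not the exact change in this PR:

```python
from dataclasses import dataclass

# Illustrative per-model maxima; assumed for this sketch rather than taken from the codebase.
MODEL_MAX_TOKENS = {"gpt-3.5-turbo": 4096, "gpt-4": 8192}


@dataclass
class Config:
    fast_llm_model: str = "gpt-3.5-turbo"
    smart_llm_model: str = "gpt-4"
    fast_token_limit: int = 8000   # defaults that assume a GPT-4 key
    smart_token_limit: int = 8000


def apply_model_token_limits(config: Config) -> Config:
    """Clamp both configured limits to 85% of their model's maximum."""
    for model_attr, limit_attr in (
        ("fast_llm_model", "fast_token_limit"),
        ("smart_llm_model", "smart_token_limit"),
    ):
        max_tokens = MODEL_MAX_TOKENS.get(getattr(config, model_attr), 4096)
        setattr(config, limit_attr, min(getattr(config, limit_attr), int(max_tokens * 0.85)))
    return config


print(apply_model_token_limits(Config()))
# Config(fast_llm_model='gpt-3.5-turbo', smart_llm_model='gpt-4',
#        fast_token_limit=3481, smart_token_limit=6963)
```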
Test Plan
I updated the failing tests.
PR Quality Checklist