Incorporating current summary length in token count calculation for the update_running_summary function #4670
Background
This PR addresses an oversight in my previous PR #4652, which introduced a batch processing approach within the update_running_summary function. The goal of that approach is to prevent the total token count from exceeding the model's maximum limit when new events are processed. While the method accounted for the token length of the new events and the summarization prompt, it failed to account for the token length of the current summary itself.
Changes
In the modified version, a summary_tlength variable has been introduced in the update_running_summary function. This variable holds the token length of the current summary, so that the combined token count of all three components, 1) the current summary, 2) the new events, and 3) the summarization prompt, never exceeds the maximum token limit imposed by the model.
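For concreteness, here is a minimal sketch of the corrected check, not the repository's exact code: the `count_string_tokens` and `summarize_batch` helpers, the model name, and the token limit are all illustrative assumptions.

```python
import tiktoken

MODEL = "gpt-3.5-turbo"  # assumed model; the real code reads this from config
MAX_TOKENS = 4096        # assumed token limit for that model

def count_string_tokens(text: str, model: str = MODEL) -> int:
    """Count the tokens in `text` for the given model (hypothetical helper)."""
    return len(tiktoken.encoding_for_model(model).encode(text))

def summarize_batch(summary: str, batch: list[str]) -> str:
    """Placeholder for the LLM call that folds `batch` into `summary`."""
    return summary + " " + " ".join(batch)  # stand-in; the real code calls the LLM

def update_running_summary(current_summary: str, new_events: list[str],
                           prompt_template: str) -> str:
    summary_tlength = count_string_tokens(current_summary)  # the newly counted term
    prompt_tlength = count_string_tokens(prompt_template)
    batch: list[str] = []
    batch_tlength = 0
    for event in new_events:
        event_tlength = count_string_tokens(event)
        # All three components are now budgeted: summary + batch + prompt.
        if summary_tlength + batch_tlength + event_tlength + prompt_tlength > MAX_TOKENS:
            if batch:  # fold the accumulated batch into the summary first
                current_summary = summarize_batch(current_summary, batch)
                summary_tlength = count_string_tokens(current_summary)
            batch, batch_tlength = [event], event_tlength
        else:
            batch.append(event)
            batch_tlength += event_tlength
    if batch:  # flush any remaining events
        current_summary = summarize_batch(current_summary, batch)
    return current_summary
```

Previously, only the batch and prompt lengths entered this comparison, so a long running summary could push the final request past the model's limit even when each batch individually fit.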
Test Plan
Reran the existing test suite; all tests passed.
PR Quality Checklist