reward system for autogpt #2446

horazius · 2023-04-18T21:33:02Z

Duplicates

I have searched the existing issues

Summary 💡

Introduce a reward system so that AutoGPT can also learn. You should be able to see all actions in a history. Possibly even be able to call up parameters and then evaluate them with positive "p" and "n" negative. This evaluation file could then be shared as an extension among different users or in the end even played back to openAi for learning.

Examples 🌈

No response

Motivation 🔦

To improve the software by crowd learning. The basics of ChatGPT

Androbin · 2023-04-19T00:00:42Z

Please note that GPT-4 can only do in-context learning, as the API does not currently support fine-tuning the model.

Boostrix · 2023-05-09T06:28:58Z

this is more important than people may think, you need this for any sort of fitness function / training purposes - regardless of whether the LLM supports this or not, the reward system could also be executed locally to self-optimize: #3868 (comment)

And all actions/commands have a certain cost associated with it.
So in general, all actions/commands need to expose their costs so that a local reward function can optimize for those:

built-in diagnostics / self-test via dedicated command #4042

Androbin · 2023-05-22T17:01:21Z

I'm all for on-policy reinforcement learning, see the paper
LETI: Learning to Generate from Textual Interactions
https://arxiv.org/abs/2305.10314

github-actions · 2023-09-06T21:05:42Z

This issue has automatically been marked as stale because it has not had any activity in the last 50 days. You can unstale it by commenting or removing the label. Otherwise, this issue will be closed in 10 days.

github-actions · 2023-09-17T01:52:18Z

This issue was closed automatically because it has been stale for 10 days with no activity.

ntindle added the potential plugin This may fit better into our plugin system. label Apr 21, 2023

This was referenced May 9, 2023

built-in diagnostics / self-test via dedicated command #4042

Closed

Revise budget manager #4040

Merged

Command Base Class Interface #3824

Merged

[DRAFT] psutil integration, based on various RFEs on github #4123

Closed

WIP test commands #3399

Closed

github-actions bot added the Stale label Sep 6, 2023

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Sep 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reward system for autogpt #2446

reward system for autogpt #2446

horazius commented Apr 18, 2023

Androbin commented Apr 19, 2023

Boostrix commented May 9, 2023 •

edited

Loading

Androbin commented May 22, 2023

github-actions bot commented Sep 6, 2023

github-actions bot commented Sep 17, 2023

reward system for autogpt #2446

reward system for autogpt #2446

Comments

horazius commented Apr 18, 2023

Duplicates

Summary 💡

Examples 🌈

Motivation 🔦

Androbin commented Apr 19, 2023

Boostrix commented May 9, 2023 • edited Loading

Androbin commented May 22, 2023

github-actions bot commented Sep 6, 2023

github-actions bot commented Sep 17, 2023

Boostrix commented May 9, 2023 •

edited

Loading