Sharing this here doesn't seem like an infohazard at this point. This is all over my YouTube feed anyway.
Description from the authors:
Auto-GPT is an experimental open-source application showcasing the capabilities of the GPT-4 language model. This program, driven by GPT-4, autonomously develops and manages businesses to increase net worth. As one of the first examples of GPT-4 running fully autonomously, Auto-GPT pushes the boundaries of what is possible with AI.
I wanted to additionally call out this in their read me:
💀 Continuous Mode ⚠️
Run the AI without user authorisation, 100% automated. Continuous mode is not recommended. It is potentially dangerous and may cause your AI to run forever or carry out actions you would not usually authorise. Use at your own risk.
- Run the
main.py
Python script in your terminal:python scripts/main.py --continuous
- To exit the program, press Ctrl + C
Nice! Super nice. Super safe and super good.
I don't know what Steve would say, but I know that some folks from DeepMind and Stanford have recently used an LLM to create rewards to train another LLM to do specific tasks, like negotiation. which I think is exactly what you've described. It seems to work really well.
Reward Design with Language Models