New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Adding a mechanism for self-improvement #353

Open

mczhuge wants to merge 15 commits into geekan:main from mczhuge:main

mczhuge commented Sep 21, 2023 •

edited

Summary of Updates

1️⃣ Internal Feedback Action:
Added the Feedback action in metagpt/actions/internal_feedback.py to allow internal evaluation among agents (currently supporting the handover process).

2️⃣ Action Functions Enhancement:
Introduced two new methods, _add_action_at_head() and _add_action_at_tail(), for improved clarity and ease of defining actions.

3️⃣ Long-Term Memory Utilization:
Commenced utilization of the previously underutilized longterm_memory, currently supporting storage in a JSON file. Future improvements will involve leveraging existing FAISS support.

4️⃣ Reflective Mechanism:
Incorporated the reflect.py module within metagpt/learn.

5️⃣ Initialization with Feedback:
Implemented feedback-driven constraints initialization during role setup, marking the initial phase of our journey towards self-improvement.

These updates address several urgent issues, including effective long-term memory utilization, internal evaluation mechanisms, and the integration of historical data into the self-improvement mechanism. And there is still ample room for improvement beyond these updates. 😄

Quick usage: python startup.py "Write a cli snake game" --self_improvement True

Note: Unit tests are pending implementation due to time constraints and will be added in the near days.

mczhuge added 15 commits

September 20, 2023 17:22


          update

8d15d06


          debug with role.py

858c8a8


          build the initial system of self-improvement.

23d36db


          fix the bug

74f4b37


          unify the name

1a0e17c


          update

27a1064


          update

3f0cd25


          update

933827a


          feedback before engineering

be3f274


          update

c05928b


          finish feedback

ba463b2


          finish the feedback system


          update

c1cac58


          clean code

657f037


          update the startup.py

9c6a515

Collaborator

stellaHSR commented Sep 22, 2023

Excellent! It seems that the HANDOVER_FILE is necessary. Could you kindly provide it as well?

stellaHSR added the enhancement label

geekan reviewed

View reviewed changes

metagpt/actions/internal_feedback.py Show resolved Hide resolved

metagpt/actions/internal_feedback.py

+                  "QaEngineer": {"name": "Edward", "next": None, "prev": "Engineer"},
+              }
+              def print_with_color(text, color="red"):

Owner

geekan Sep 26, 2023

place it in utils

metagpt/actions/internal_feedback.py

		print(f"{color_codes[color]} {text} {color_codes['reset']}")

Owner

geekan Sep 26, 2023

PEP8

metagpt/actions/internal_feedback.py

+                          """
+                  async def run(self, handover_msg, *args, **kwargs) -> ActionOutput:
+                      import re

Owner

geekan Sep 26, 2023

import it in file header

metagpt/actions/internal_feedback.py

+                      import re
+                      #prev_role = handover_msg[0].to_dict()["role"]
+                      #prev_msg = handover_msg[0].to_dict()["content"]
+                      if  isinstance(handover_msg, list):

Owner

geekan Sep 26, 2023

dup space

metagpt/software_company.py

@@ @@ -38,6 +38,22 @@ def invest(self, investment: float): @@
                       CONFIG.max_budget = investment
                       logger.info(f'Investment: ${investment}.')
+                  def improvement(self, initial=False, roles=None):
+                      handover_file = CONFIG.handover_file
+                      if initial:

Owner

geekan Sep 26, 2023

Would it be better to do this in CONFIG?

metagpt/software_company.py

+                          msgs = self.environment.memory.get_by_action(Feedback)
+                          if isinstance(msgs, list):
+                              for msg in msgs:
+                                  logger.info(f"{msg.role}'s feedback: {msg.content}")

Owner

geekan Sep 26, 2023

This actually prints the feedback result, which is different from the function name. Should we optimize the function name?

metagpt/learn/reflect.py

+              Now, rewrite your "{role}" constraints in 30 words:
+              """
+              # def print_with_color(text, color="red"):

Owner

geekan Sep 26, 2023

Remove useless comments

metagpt/learn/reflect.py

+              #     }
+              #     print(f"{color_codes[color]}  {text} {color_codes['reset']}")
+              class Reflect():

Owner

geekan Sep 26, 2023

class Reflect instead of class Reflect()

metagpt/learn/reflect.py

+              class Reflect():
+                  def from_feedback(role, constraints):
+                      chat = OpenAIGPTAPI()

Owner

geekan Sep 26, 2023

use LLM instead

stellaHSR reviewed

View reviewed changes

startup.py

                   if run_tests:
                       # developing features: run tests on the spot and identify bugs
                       # (bug fixing capability comes soon!)
                       company.hire([QaEngineer()])
-                  company.invest(investment)
                   if self_improvement:

Collaborator

stellaHSR Sep 26, 2023

you need add this first, before hire()

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment