Slowness with latest update, and brain agent requires tool name with question #63

Open
satishthumar opened this issue Apr 22, 2024 · 4 comments

@satishthumar

Since the latest updates, responses have become noticeably slower. Our bot is built on this code with two tools and uses GPT-4 with 100 PTUs of capacity, yet both RAG search and SQL queries take 30-40 seconds to respond. We have also found that the absence of specific tool parameters leads to inadequate responses, which was not an issue before the recent updates.

@pablomarin
Collaborator

Can you be more specific about the "absence of specific tool parameters"?
Are you referring to return_direct=True?

@satishthumar
Author

satishthumar commented Apr 23, 2024

In the prompt we have to specify the tool name (i.e. doc or sql) before the question. The agent does not work out on its own which tool to route a question to based on its knowledge of the tools.

We are using apps/backend/botservice (bot.py and utils.py), and nothing there sets return_direct=True.

    doc_search = DocSearchAgent(llm=llm, indexes=doc_indexes,
                                k=6, reranker_th=1,
                                sas_token=os.environ['BLOB_SAS_TOKEN'],
                                name="doc",
                                description="useful when the questions includes the term: doc",
                                callback_manager=cb_manager, verbose=False, streaming=True)

    sql_search = SQLSearchAgent(llm=llm, k=30, callback_manager=cb_manager,
                                name="sql",
                                description="useful when the questions includes the term: sql",
                                verbose=False, streaming=True)

    tools = [doc_search, sql_search]

Let me know if you need any more information. Also, we are unable to get a streaming response even though we set streaming=True.

Thank you for looking into this so quickly.

@pablomarin
Collaborator

pablomarin commented Apr 23, 2024 via email

@satishthumar
Author

Thank you for the information. We will evaluate LangServe for streaming. One more question: how can we add a feedback loop (like/dislike) for the bot's responses?
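
To make the feedback question concrete, here is a minimal sketch of what a like/dislike loop could record. It is not part of this repo: `FeedbackStore`, `record`, `summary`, and the `resp-…` ids are hypothetical names, and the in-memory dict stands in for whatever database the bot would actually use.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class FeedbackStore:
    """Hypothetical in-memory store; a real bot would persist votes."""

    # Maps a response id to the latest vote for it: "like" or "dislike".
    votes: dict = field(default_factory=dict)

    def record(self, response_id: str, vote: str) -> None:
        if vote not in ("like", "dislike"):
            raise ValueError(f"unknown vote: {vote!r}")
        # A later vote on the same response overwrites the earlier one.
        self.votes[response_id] = vote

    def summary(self) -> Counter:
        # Count how many responses currently carry each vote.
        return Counter(self.votes.values())


store = FeedbackStore()
store.record("resp-1", "like")
store.record("resp-2", "dislike")
store.record("resp-2", "like")  # the user changed their mind
print(store.summary())
```

Each bot reply would need a stable id attached so the Like/Dislike buttons can report back against it.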

Do you have any suggestions for speeding up DocSearchAgent? It is taking 30-40 seconds even with 100 PTUs.
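
To narrow down where the 30-40 seconds goes, each stage could be timed separately. The sketch below is only an illustration under stated assumptions: `timed`, `fake_retrieval`, and `fake_llm_call` are hypothetical names, and the two sleeps stand in for the real search and model calls.

```python
import time
from contextlib import contextmanager

# Accumulated wall-clock seconds per stage name.
timings = {}


@contextmanager
def timed(stage):
    """Add the elapsed time of the wrapped block to timings[stage]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[stage] = timings.get(stage, 0.0) + time.perf_counter() - start


def fake_retrieval():
    # Placeholder for the document search step (assumption).
    time.sleep(0.02)


def fake_llm_call():
    # Placeholder for the GPT-4 call (assumption).
    time.sleep(0.01)


with timed("retrieval"):
    fake_retrieval()
with timed("llm"):
    fake_llm_call()

# Print the slowest stage first.
for stage, seconds in sorted(timings.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{stage}: {seconds:.3f}s")
```

Wrapping the real search, reranking, and LLM calls the same way would show which stage dominates the response time.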
