Mark user message as required #520

alxmiron · 2023-04-02T22:54:41Z

I found a conflict with maxResponseTokens @transitive-bullshit

Problem 1

From one side, buildMessages() now has logic to limit prompt length, because we include "history messages" and wanna know when to stop adding them. From another hand, prompt length defines a length of response, because total sub should not exceed maxModelTokens. The we have the following:

I don't send parentMessageIds in my chatbot, so my users send short prompts, but get responses only of 1000 tokens length, because it's a default value
I set maxResponseTokens: 4096 as max allowed value, expect maximum length responses
But the current logic filters out user prompt, so we send just systemMessage to ChatGPT that has no sense and is confusing to user
I can reduce maxResponseTokens, but don't know for how much because I want to use full capacity of 4096 tokens but don't know what user prompt length is

Solution

If user passed a prompt and it exceeds the limits, we should throw exception to him, instead of skipping the userPrompt and sending systemMessage only. Then let's mark systemMessage and userPromp as "required" messages, which means that if they are passed the take prioritized tokens space. And if they exceed limits - we throw exception.

Now the flow looks like this:

Validate systemMessage and userPromp, based on maxModelTokens - 1 value, ignoring maxResponseTokens. Where 1 - minimum space for response. If user enters maxModelTokens - 1 tokens length prompt, he will get 1 token in
response. If he enters prompt bigger than maxModelTokens - 1, there will be an exception
Validate historyPrompts, based on maxModelTokens - maxResponseTokens value
Send calculated size for response, based on numTokens and maxResponseTokens

alxmiron · 2023-04-02T23:07:26Z

src/chatgpt-api.ts

        break
      }

-      messages = nextMessages
-      numTokens = nextNumTokensEstimate
-
      if (!isValidPrompt) {


This condition almost never happened, because we had if (prompt && !isValidPrompt) { break } before. I removed that one

alxmiron · 2023-04-02T23:13:22Z

src/chatgpt-api.ts

@@ -400,21 +400,29 @@ export class ChatGPTAPI {
          }
        }, [] as string[])
        .join('\n\n')


Problem 2

I found that our calculations of prompt length in tokens have a mistake.
It's -6 tokens diff when we have 1 systemMessage in the list (our estimation is 6 tokens less than real one from OpenAI)
It's -8 tokens diff when we have 2 messages in the list, regardless of the text in userPrompt
For other lengths it's bigger
But _getTokenCount() works correctly, because it calculates the length of response equally as OpenAI does.
It's something we prompt string building. We need to check it (not in this PR)
Why do we add Instructions:, User: prefixes there? I didn't find any docs about it

alxmiron added 2 commits April 3, 2023 01:47

Mark user message as required. Fix used tokens estimation

159a239

Remove hardcoded tokens estimation diff

c81ee20

alxmiron commented Apr 2, 2023

View reviewed changes

alxmiron changed the title ~~Mark user message as required. Fix used tokens estimation~~ Mark user message as required Apr 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mark user message as required #520

Mark user message as required #520

alxmiron commented Apr 2, 2023 •

edited

alxmiron Apr 2, 2023 •

edited

alxmiron Apr 2, 2023

Mark user message as required #520

Are you sure you want to change the base?

Mark user message as required #520

Conversation

alxmiron commented Apr 2, 2023 • edited

Problem 1

Solution

alxmiron Apr 2, 2023 • edited

Choose a reason for hiding this comment

alxmiron Apr 2, 2023

Choose a reason for hiding this comment

Problem 2

alxmiron commented Apr 2, 2023 •

edited

alxmiron Apr 2, 2023 •

edited