Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WARNING:root:Warning: model not found. Using cl100k_base encoding. #38

Open
moresearch opened this issue Apr 11, 2024 · 1 comment
Open

Comments

@moresearch
Copy link

I am getting WARNING:root:Warning: model not found. Using cl100k_base encoding. when using non openAI models such as claude-2. I checked the code and it seems to be a tiktoken issue.

Am I getting wrong cost??

@areibman
Copy link
Contributor

areibman commented Apr 11, 2024

Potentially, yes. I believe Claude 1 uses cl100k, but Claude 2 and Claude 3 I'm not so sure.

Here is the only info I've found on Claude 3: https://github.com/javirandor/anthropic-tokenizer

And other Anthropic models:
https://github.com/anthropics/anthropic-sdk-python/blob/bd866a1e837febd05b6f5cea322f1c42e29f2d01/src/anthropic/tokenizer.json

For stuff like this, TokenCost will likely be a close but not exact price estimate.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants