Skip to content

Latest commit

 

History

History
18 lines (18 loc) · 13.4 KB

MODELS.md

File metadata and controls

18 lines (18 loc) · 13.4 KB
model name and link container name open-source? size GPU usage max tokens (prompt + response) licence description
BLOOMZ 7B transformers-lm-bloomz7b yes 7.1B 33GB 2,048 tokens bigscience-bloom-rail-1.0, commercial use allowed An open-source multilingual instruction-based large language model (46 languages). NB: free of charge. This model is up and running on our servers and can be used for free.
GPT-J 6B transformers-lm-gptj yes 6B 25GB 2,048 tokens Apache 2.0 , commercial use allowed An open-source English-only large language model which is NOT fine-tuned for instruction following and NOT capable of code generation. NB: free of charge. This model is up and running on our servers and can be used for free.
GPT-3.5 openai-api-davinci3 no supposedly, 175B - (cannot be run locally) 4,097 tokens available under subscription plan, commercial use allowed A multilingual instruction-based large language model which is capable of code generation. Unlike ChatGPT, not optimized for chat. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
ChatGPT openai-api-chatgpt no supposedly, 175B - (cannot be run locally) 4,096 tokens available under subscription plan, commercial use allowed Based on gpt-3.5-turbo -- the most capable of the entire GPT-3/GPT-3.5 models family. Optimized for chat. Able to understand and generate code. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
Open-Assistant Pythia 12B transformers-lm-oasst12b yes 12B 29GB (half-precision) 5,120 tokens Apache 2.0 , commercial use allowed An open-source English-only instruction-based large language model which is NOT good at answering math and coding questions. NB: free of charge. This model is up and running on our servers and can be used for free.
Vicuna 13B transformers-lm-vicuna13b yes, but only for non-commercial use 13B 29GB (half-precision) 2,048 tokens Non-commercial license An instruction-based large language model fine-tuned on LLaMa that achieves more than 90%* quality of OpenAI ChatGPT and Google Bard. The model performs best in English and is NOT good at answering math, reasoning, and coding questions. NB-1: Free of charge. This model is up and running on our servers and can be used for free. NB-2: cannot be used for commercial purposes due to license restriction.
GPT-4 openai-api-gpt4 no supposedly, 175B - (cannot be run locally) 8,192 tokens available under subscription plan, commercial use allowed A multilingual instruction-based large language model which is capable of code generation and other complex tasks. More capable than any GPT-3.5 model, able to do more complex tasks, and optimized for chat. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
GPT-4 32K openai-api-gpt4-32k no supposedly, 175B - (cannot be run locally) 32,768 tokens available under subscription plan, commercial use allowed A multilingual instruction-based large language model which is capable of code generation and other complex tasks. Same capabilities as the base gpt-4 mode but with 4x the context length. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
GPT-4 turbo openai-api-gpt4-turbo no supposedly, 175B - (cannot be run locally) 128,000 tokens available under subscription plan, commercial use allowed A multilingual instruction-based large language model which is capable of code generation and other complex tasks. The model is also 3X cheaper for input tokens and 2X cheaper for output tokens compared to the original GPT-4 model. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
GPT-JT 6B transformers-lm-gptjt yes 6B 14GB (half-precision) 2,048 tokens Apache 2.0 , commercial use is allowed An open-source English-only large language model which was fine-tuned for instruction following but is NOT capable of code generation. NB: free of charge. This model is up and running on our servers and can be used for free.
ChatGPT 16k openai-api-chatgpt-16k no supposedly, 175B - (cannot be run locally) 16,384 tokens available under subscription plan, commercial use allowed Same capabilities as the standard gpt-3.5-turbo model but with 4 times the context. NB: paid. You must provide your OpenAI API key to use the model. Your OpenAI account will be charged according to your usage.
Anthropic Claude-v1 anthropic-api-claude-v1 no supposedly, 52B - (cannot be run locally) 9,000 tokens available under subscription plan, commercial use allowed The largest model, ideal for a wide range of more complex tasks. NB: paid. You must provide your Anthropic API key to use the model. Your Anthropic API account will be charged according to your usage.
Anthropic Claude Instant v1 anthropic-api-claude-instant-v1 no (paid access via API) supposedly, 52B - (cannot be run locally) 9,000 tokens available under subscription plan, commercial use allowed A smaller model with far lower latency, sampling at roughly 40 words/sec! Its output quality is somewhat lower than the latest claude-1 model, particularly for complex tasks. However, it is much less expensive and blazing fast. NB: paid. You must provide your Anthropic API key to use the model. Your Anthropic API account will be charged according to your usage.
Russian XGLM 4.5B (private weights) transformers-lm-ruxglm no 4.5B 15GB 2,048 tokens Not available yet A private large language model for the Russian language which was fine-tuned for instruction following by Dmitry Kosenko in Summer 2023. This model is up and running on our servers and can be used for free.
ruGPT-3.5-13B transformers-lm-rugpt35 yes 13B 35GB (half-precision) 2,048 tokens MIT A large language model for the Russian language which was used for trainig GigaChat. This model is up and running on our servers and can be used for free.
Mistral 7B 128k Tokens transformers-lm-mistral-7b-128k yes 7B 20GB (half-precision) 128,000 tokens Apache 2.0 , commercial use is allowed An open-source English-only large language model which was fine-tuned for instruction following but is NOT capable of code generation. NB: free of charge. This model is up and running on our servers and can be used for free.