genAI-cohere-testing

Notebooks for testing Cohere's Command-series LLMs.

Length Testing

Testing Command and Command-R compliance with prompt-enforced length control of outputs.

Methods

A list of five synonym phrases for maximum limitation restrictions (i.e. phrases that restricted Command’s output to x characters or less) was developed based on general use popularity and is as follows:

['at most', 'no more than', 'less than', 'maximum', 'under']

Then the following template was used for all prompts:

What is gravity? Your explanation should be ____ 150 words.

where ____ is replaced by each individual limitation phrasing.

Results

API calls are sent to Command (1000 completions) and Command-R (100 completions) models using default preamble and temperature parameters. The resulting histograms of word counts are shown below, with highlighted bins corresponding to the acceptable completions.

Command: 1000 completions	Command-R: 100 completions

Further mini-tests of 100 completions each were also done on lists of five synonym phrases for minimum, approximate, and exact limitation retrictions on Command and Command-R models. Results can be found in their corresponding folders.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Length_Testing		Length_Testing
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Length_Testing

Length_Testing

LICENSE

LICENSE

README.md

README.md

Repository files navigation

genAI-cohere-testing

Length Testing

Methods

Results

About

Releases

Packages

Languages

License

paxnea/genAI-cohere-testing

Folders and files

Latest commit

History

Repository files navigation

genAI-cohere-testing

Length Testing

Methods

Results

About

Topics

Resources

License

Stars

Watchers

Forks

Languages