
Constitutional AI: Harmlessness from AI Feedback #154

Open · wants to merge 96 commits into base: main
Conversation

gleibovich-nvidia (Collaborator)

Draft for the implementation of Constitutional AI: Harmlessness from AI Feedback.

Note that a few minor pieces are still missing regarding the exact model and datasets to be used, but the main building blocks are in place.

@github-actions github-actions bot added the documentation Improvements or additions to documentation label Apr 15, 2024
@gleibovich-nvidia gleibovich-nvidia marked this pull request as draft April 15, 2024 14:33
snisimov and others added 11 commits April 17, 2024 19:12
+ replace 'ultrachat' with 'sft_datablend_v1'

snisimov and others added 10 commits April 18, 2024 15:53
docs/user-guide/CAI.rst: review thread (outdated, resolved)
docs/user-guide/index.rst: review thread (outdated, resolved)
jgerh (Collaborator) left a comment


I completed the copyedits to the rst files identified in Constitutional AI: Harmlessness from AI Feedback #154 and submitted them for your review. Please let me know if you have any questions.

terrykong (Collaborator)

@gleibovich-nvidia Please review @jgerh's copyedits.

gleibovich-nvidia (Collaborator, Author)

> I completed the copyedits to the rst files identified in Constitutional AI: Harmlessness from AI Feedback #154 and submitted them for your review. Please let me know if you have any questions.

Thanks a lot!
I have accepted and incorporated most of the suggestions. There are a couple of comments I'm not sure about, to which I've replied directly.

docs/user-guide/CAI.rst: review thread (outdated, resolved)
snisimov and others added 4 commits May 16, 2024 09:51
…pt from name)
…_inference.py'
gleibovich-nvidia (Collaborator, Author) commented May 16, 2024

@terrykong FYI, we found an issue with the converted tokenizer used within megatron_gpt_eval.py: the BOS token of Mistral's prompt template is not handled correctly when used in a conversation rather than in a single prompt/reply. This is especially important for few-shot prompting, where the model observes an already-in-progress conversation (which should follow the template correctly). megatron_gpt_eval.py does not seem to be a good fit here as an API inference service for a chat with several rounds of human/assistant interaction, as it does not natively support using HF's tokenizer.

To resolve this, commits 65a9c3b and 2286522 add a lightweight inference service that overrides the tokenizer to use the HF tokenizer instead, which tokenizes a conversation correctly.
NeMo's documentation also mentions using the HF tokenizer here.
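
For reference, the override amounts to roughly the following; this is a minimal sketch, not the actual code from those commits, and the helper name is hypothetical:

```python
# Sketch (assumed, not the PR's actual code): tokenize conversation prompts
# with the HF tokenizer instead of the one in the megatron checkpoint.
from transformers import AutoTokenizer

# Assumption: the HF checkpoint the NeMo model was converted from.
hf_tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")

def encode_conversation(prompt: str) -> list[int]:
    # add_special_tokens=False: the prompt template already embeds <s>/</s>,
    # and the HF tokenizer maps each of them to a single special-token id
    # instead of splitting <s> into '<', 's', '>'.
    return hf_tokenizer.encode(prompt, add_special_tokens=False)
```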

snisimov and others added 2 commits May 16, 2024 13:44
…the inference service

terrykong (Collaborator)

> we found an issue with the converted tokenizer used within megatron_gpt_eval.py: the BOS token of Mistral's prompt template is not handled correctly

Is my understanding correct that the tokenizer within the megatron checkpoint does not tokenize the BOS as you expect?

Why not change the prompt template to use the BOS token that the tokenizer expects? I feel like maintaining a light fork of megatron_gpt_eval.py should be a last resort.

jgerh (Collaborator) commented May 16, 2024

@gleibovich-nvidia Gal, I reviewed your comments, responded to your questions, and completed my review.

gleibovich-nvidia (Collaborator, Author)

> we found an issue with the converted tokenizer used within megatron_gpt_eval.py: the BOS token of Mistral's prompt template is not handled correctly
>
> Is my understanding correct that the tokenizer within the megatron checkpoint does not tokenize the BOS as you expect?
>
> Why not change the prompt template to use the BOS token that the tokenizer expects? I feel like maintaining a light fork of megatron_gpt_eval.py should be a last resort.

Yes, the BOS for Mistral models is `<s>`, and the tokenizer in the megatron checkpoint tokenizes it into three separate tokens (`<`, `s`, `>`) instead of a single `<s>` token.
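
For illustration, a quick way to check the expected behavior (a hypothetical reproduction, assuming the mistralai/Mistral-7B-v0.1 HF checkpoint):

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")
ids = tok.encode("<s>", add_special_tokens=False)
print(tok.convert_ids_to_tokens(ids))
# With the HF tokenizer: ['<s>'], a single BOS token.
# With the converted tokenizer described above, the same string instead
# comes out as three tokens: '<', 's', '>'.
```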

We prompt a trained instruct model (Mistral-7B-v0.1) for critiques and revisions. The model expects this specific prompt template, given the way it was trained and the data it observed during SFT.
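
(For context, the template in question is the standard Mistral instruct format; the sketch below is illustrative, showing why each `<s>`/`</s>` must map to a single token for a few-shot conversation to be encoded correctly.)

```python
# Illustrative few-shot prompt in Mistral's instruct format (assumed here);
# every <s> and </s> must be tokenized as one special token, or the model
# sees a malformed template mid-conversation.
few_shot_prompt = (
    "<s>[INST] first user message [/INST] assistant reply</s>"
    "[INST] second user message [/INST]"
)
```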

While I do agree it is not ideal to maintain code adapted from NeMo, I would like to stress that:

  1. megatron_gpt_eval.py is merely an example of how to run inference with NeMo. AFAIU, it wasn't developed to be a comprehensive solution for LLM inference, just an example of how it can be done for specific use cases, so it doesn't cover every combination of use cases potential users might require. This PR, on the other hand, requires functionality that is missing from megatron_gpt_eval.py: getting a prompt tokenized correctly to match Mistral's expected prompt template.
  2. There are other examples within NeMo-Aligner where missing upstream functionality required some code adaptation and duplication, e.g. https://github.com/NVIDIA/NeMo-Aligner/blob/main/nemo_aligner/utils/train_utils.py#L16 and https://github.com/NVIDIA/NeMo-Aligner/blob/main/nemo_aligner/data/nlp/samplers.py#L28.

There might be another way to work around this issue, which I can try on Sunday morning. Let's see how that goes.
If all else fails, we can stick with this proposed workaround for now; I will add a note highlighting that this code is adapted from NeMo, so we can remove the extra functionality once the issue is resolved in NeMo.

gleibovich-nvidia (Collaborator, Author)

> @gleibovich-nvidia Gal, I reviewed your comments, responded to your questions, and completed my review.

Thanks a lot @jgerh!

jgerh previously approved these changes May 17, 2024
jgerh (Collaborator) left a comment


Completed reviews and updates from Gal Leibovich.

terrykong (Collaborator)

> Yes, the BOS for Mistral models is `<s>`, and the tokenizer in the megatron checkpoint tokenizes it into three separate tokens (`<`, `s`, `>`) instead of a single `<s>` token.
>
> We prompt a trained instruct model (Mistral-7B-v0.1) for critiques and revisions. The model expects this specific prompt template, given the way it was trained and the data it observed during SFT.

I feel like I'm missing something, because shouldn't convert_mistral_7b_hf_to_nemo.py add the tokenizer that the Hugging Face checkpoint was trained with? If for some reason you need a different one, I feel like that functionality should just be added to convert_mistral_7b_hf_to_nemo.py.

> megatron_gpt_eval.py is merely an example of how to run inference with NeMo. AFAIU, it wasn't developed to be a comprehensive solution for LLM inference, just an example of how it can be done for specific use cases, so it doesn't cover every combination of use cases potential users might require. This PR, on the other hand, requires functionality that is missing from megatron_gpt_eval.py: getting a prompt tokenized correctly to match Mistral's expected prompt template.

It's true that one script may not be comprehensive, but this example seems to just be about a tokenizer being different, and I feel like that's not a big enough reason to fork. Tokenizers are usually part of the package deal for models, since swapping tokenizers can result in garbage when the token ids get remapped, so I prefer to address this in the model packaging/conversion rather than at inference time, where swapping the tokenizer is not so much a feature as an avenue for user error.

> There are other examples within NeMo-Aligner where missing upstream functionality required some code adaptation and duplication, e.g. https://github.com/NVIDIA/NeMo-Aligner/blob/main/nemo_aligner/utils/train_utils.py#L16 and https://github.com/NVIDIA/NeMo-Aligner/blob/main/nemo_aligner/data/nlp/samplers.py#L28.

Sure, but those are exceptions, not examples to follow. Also, the second one even says it should be deleted.

In general, forking puts the burden on us to maintain the functionality and correctness. I would rather we focus on the algorithms here and push for NeMo/Megatron to incorporate our features when possible. If we ask and they say "no" or "not anytime soon", then I see that as a good reason to fork.

snisimov and others added 4 commits May 20, 2024 10:17
…ate_sl_cai_dataset.py')
…ate_rl_cai_dataset.py)
…same index to both 'chosen' and 'rejected' responses)