[BUG] The bug about the options of the Megatron-core, transformer-impl and flash-attention. #778

Baibaifan · 2024-04-12T08:24:07Z

Describe the bug
Open --use-mcore-models and --use-flash-attn, set --transformer-impl local, and do not use flash-attention.

To Reproduce
N/A

Expected behavior
N/A

Stack trace/logs
N/A

Environment (please complete the following information):

Megatron-LM commit ID : ba77325

Proposed fix
N/A

Additional context
N/A

The text was updated successfully, but these errors were encountered:

ethanhe42 · 2024-04-13T02:54:46Z

when you use --use-mcore-models,, you cannot use local. --use-flash-attn decides whether to use the OSS flash attention implmentation or cudnn implmementation.

Baibaifan · 2024-04-15T03:17:28Z

when you use --use-mcore-models,, you cannot use local. --use-flash-attn decides whether to use the OSS flash attention implmentation or cudnn implmementation.

hi @ethanhe42 ,I understand the process you mentioned, but currently there is a task warning in the configuration options, which is not very user-friendly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] The bug about the options of the Megatron-core, transformer-impl and flash-attention. #778

[BUG] The bug about the options of the Megatron-core, transformer-impl and flash-attention. #778

Baibaifan commented Apr 12, 2024

ethanhe42 commented Apr 13, 2024

Baibaifan commented Apr 15, 2024

[BUG] The bug about the options of the Megatron-core, transformer-impl and flash-attention. #778

[BUG] The bug about the options of the Megatron-core, transformer-impl and flash-attention. #778

Comments

Baibaifan commented Apr 12, 2024

ethanhe42 commented Apr 13, 2024

Baibaifan commented Apr 15, 2024