Enable device map #30870

darshana1406 · 2024-05-16T19:10:42Z

What does this PR do?

Fixes #30858

Added _no_split_modules = ["VideoLlavaVisionAttention"] to src/transformers/models/video_llava/modeling_video_llava.py
Works on a 4 GPU NVIDIA GeForce RTX 2080 Ti setup.

Who can review?

@zucchini-nlp @amyeroberts

amyeroberts · 2024-05-16T19:33:23Z

Thanks for adding this feature @darshana1406!

Could you run the following tests in a multi-gpu environment and share the terminal output?

pytest tests/models/video_llava/test_modeling_ video_llava.py -vv -k "offload or parallelism"

zucchini-nlp · 2024-05-17T07:01:00Z

Awesome, thanks a lot for adding device map support!

darshana1406 · 2024-05-17T07:46:46Z

@amyeroberts Here is the terminal output.

(videocon) darshana.s@gnode084:~/transformers$ python -m pytest tests/models/video_llava/test_modeling_video_llava.py -vv -k "offload or parallelism"
============================================================= test session starts =============================================================
platform linux -- Python 3.10.14, pytest-8.2.0, pluggy-1.5.0 -- /home2/darshana.s/miniconda3/envs/videocon/bin/python
cachedir: .pytest_cache
rootdir: /home2/darshana.s/transformers
configfile: pyproject.toml
plugins: anyio-4.3.0
collected 125 items / 121 deselected / 4 selected                                                                                             

tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_cpu_offload <- tests/test_modeling_common.py PASSED [ 25%]
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_disk_offload_bin <- tests/test_modeling_common.py PASSED [ 50%]
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_disk_offload_safetensors <- tests/test_modeling_common.py PASSED [ 75%]
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_model_parallelism <- tests/test_modeling_common.py PASSED [100%]

============================================================== warnings summary ===============================================================
../miniconda3/envs/videocon/lib/python3.10/site-packages/_pytest/config/__init__.py:1448
  /home2/darshana.s/miniconda3/envs/videocon/lib/python3.10/site-packages/_pytest/config/__init__.py:1448: PytestConfigWarning: Unknown config option: doctest_glob
  
    self._warn_or_fail_if_strict(f"Unknown config option: {key}\n")

tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_cpu_offload
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_cpu_offload
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_disk_offload_bin
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_disk_offload_bin
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_disk_offload_safetensors
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_model_parallelism
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_model_parallelism
  /home2/darshana.s/miniconda3/envs/videocon/lib/python3.10/site-packages/accelerate/utils/modeling.py:1142: DeprecationWarning: The 'warn' method is deprecated, use 'warning' instead
    logger.warn(

tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_cpu_offload
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_disk_offload_bin
tests/models/video_llava/test_modeling_video_llava.py::VideoLlavaForConditionalGenerationModelTest::test_disk_offload_safetensors
  /home2/darshana.s/miniconda3/envs/videocon/lib/python3.10/site-packages/accelerate/utils/modeling.py:1363: UserWarning: Current model requires 262176 bytes of buffer for offloaded layers, which seems does not fit any GPU's remaining memory. If you are experiencing a OOM later, please consider using offload_buffers=True.
    warnings.warn(

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
=============================================== 4 passed, 121 deselected, 11 warnings in 25.69s ===============================================

amyeroberts

Thanks for adding this really useful feature @darshana1406! ❤️

darshana1406 · 2024-05-17T12:49:25Z

Thank you for guiding me! @zucchini-nlp @amyeroberts

* added_no_split_modules * added LlavaNextVisionAttention to _no_split_modules

darshana1406 added 2 commits May 17, 2024 00:12

added_no_split_modules

b49c365

added LlavaNextVisionAttention to _no_split_modules

f4bf56e

amyeroberts approved these changes May 17, 2024

View reviewed changes

amyeroberts merged commit 3802e78 into huggingface:main May 17, 2024
18 checks passed

itazap pushed a commit that referenced this pull request May 24, 2024

Enable device map (#30870)

f830beb

* added_no_split_modules * added LlavaNextVisionAttention to _no_split_modules

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable device map #30870

Enable device map #30870

darshana1406 commented May 16, 2024

amyeroberts commented May 16, 2024

zucchini-nlp commented May 17, 2024

darshana1406 commented May 17, 2024

amyeroberts left a comment

darshana1406 commented May 17, 2024

Enable device map #30870

Enable device map #30870

Conversation

darshana1406 commented May 16, 2024

What does this PR do?

Who can review?

amyeroberts commented May 16, 2024

zucchini-nlp commented May 17, 2024

darshana1406 commented May 17, 2024

amyeroberts left a comment

Choose a reason for hiding this comment

darshana1406 commented May 17, 2024