Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Qwen-7B-Chat AttributeError: 'LlamaSplitFuseInferStateInfo' object has no attribute 'logn_values' #239

Open
1 task
exceedzhang opened this issue Dec 2, 2023 · 4 comments
Labels
bug Something isn't working

Comments

@exceedzhang
Copy link

exceedzhang commented Dec 2, 2023

Before you submit an issue, please search for existing issues to avoid duplicates.

Issue description:
AttributeError: 'LlamaSplitFuseInferStateInfo' object has no attribute 'logn_values'

Please provide a clear and concise description of your issue.

Steps to reproduce:
python -m lightllm.server.api_server --model_dir /root/autodl-tmp/Qwen-7B-Chat --tp 1 --trust_remote_code --splitfuse_mode

Please list the steps to reproduce the issue, such as:

Expected behavior:

Please describe what you expected to happen.

Error logging:

If applicable, please copy and paste the error message or stack trace here. Use code blocks for better readability.

Environment:

Please provide information about your environment, such as:

  • Using container

  • OS: (Ubuntu 14.04, CentOS7)

  • GPU info:

    • nvidia-smi (e.g. NVIDIA-SMI 525.116.04 Driver Version: 525.116.04 CUDA Version: 12.0)
    • Graphics cards: (4090x1)
  • Python: (e.g. CPython3.10)

  • LightLLm: (git commit-hash)

  • openai-triton: pip show triton
    Name: triton
    Version: 2.1.0
    Summary: A language and compiler for custom Deep Learning operations
    Home-page: https://github.com/openai/triton/
    Author: Philippe Tillet
    Author-email: phil@openai.com
    License:
    Location: /root/miniconda3/lib/python3.10/site-packages
    Requires: filelock
    Required-by: lightllm, torch

Additional context:

image

Language:

lightllm run Qwen-7B-Chat use splitfuse_mode not worked.

@exceedzhang exceedzhang added the bug Something isn't working label Dec 2, 2023
@exceedzhang exceedzhang changed the title [BUG] Qwen-7B-Chat [BUG] Qwen-7B-Chat AttributeError: 'LlamaSplitFuseInferStateInfo' object has no attribute 'logn_values' Dec 2, 2023
@hiworldwzj
Copy link
Collaborator

@exceedzhang splitfuse mode is in test. so this mode only support llama and llama2 now.

@hiworldwzj
Copy link
Collaborator

we will try to support other mode types soon.

@ObliviousDonkey
Copy link

does it support qwen VL?

@hiworldwzj
Copy link
Collaborator

@ObliviousDonkey qwen VL will be supported soon.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants