BEAM search 70B llama on tinybox #3921

Open
chenyuxyz opened this issue Mar 25, 2024 · 1 comment

@chenyuxyz
Collaborator

Llama 70B uses almost all of the VRAM, so the new buffers that beam search allocates fail with an out-of-resource error.

It would be nice if we could search live, with some flags to control buffer allocation. If not, we can do a regular run that logs the kernels, then use a separate script to loop through the logged kernels and beam search them into the cache.
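A minimal sketch of the second option (log during a regular run, then search offline), only to make the flow concrete. Everything here is an assumption for illustration: `log_kernel`, `load_unique_kernels`, `search_into_cache`, the JSON-lines log format, and the dict cache are hypothetical names and do not correspond to tinygrad's API. The `search_fn` argument stands in for whatever actually performs the beam search on one kernel.

```python
import json
from typing import Callable, Dict, Iterable, List


def log_kernel(log_path: str, kernel_key: str) -> None:
    """Append one kernel description to a JSON-lines log during the regular run.

    `kernel_key` is a hypothetical serialized form of the kernel.
    """
    with open(log_path, "a") as f:
        f.write(json.dumps({"kernel": kernel_key}) + "\n")


def load_unique_kernels(log_path: str) -> List[str]:
    """Read the log back, keeping each kernel once, in first-seen order."""
    seen: Dict[str, None] = {}
    with open(log_path) as f:
        for line in f:
            if line.strip():
                seen.setdefault(json.loads(line)["kernel"], None)
    return list(seen)


def search_into_cache(
    kernels: Iterable[str],
    search_fn: Callable[[str], str],
    cache: Dict[str, str],
) -> int:
    """Run the (placeholder) search on every kernel not already cached.

    Returns the number of kernels that were actually searched.
    """
    searched = 0
    for kernel in kernels:
        if kernel in cache:
            continue  # already searched in an earlier pass
        cache[kernel] = search_fn(kernel)
        searched += 1
    return searched


if __name__ == "__main__":
    import os
    import tempfile

    log_path = os.path.join(tempfile.mkdtemp(), "kernels.jsonl")
    # Step 1: the regular (non-BEAM) run logs every kernel it launches.
    for key in ["matmul_a", "softmax_b", "matmul_a"]:
        log_kernel(log_path, key)
    # Step 2: a separate script searches each unique kernel one at a time.
    cache: Dict[str, str] = {}
    count = search_into_cache(
        load_unique_kernels(log_path), lambda k: "searched:" + k, cache
    )
    print(count, cache)
```

Because the offline pass handles one kernel at a time without the model weights resident, it would only need memory for that kernel's own buffers, which is the point of separating the two steps.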

chenyuxyz added this to the llama2 70B FP16 10 tok/s milestone Mar 25, 2024
@chenyuxyz
Collaborator Author

I see a weird error while trying to beam search llama 34B:

DEBUG=2 BEAM=2 PYTHONPATH=. python3 examples/llama.py --gen code --size '34B' --temperature 0 --shard 6 --count 10 --prompt "from tinygrad.tensor import Tensor" --timing

   3.08s:       377.80 us          7/  20           86  683    4    2   16    4
HW Exception by GPU node-4 (Agent handle: 0x55d987988820) reason :GPU Hang
HW Exception by GPU node-4 (Agent handle: 0x559da9432ce0) reason :GPU Hang
HW Exception by GPU node-4 (Agent handle: 0x55d9a6070cb0) reason :GPU Hang
CLDevice: got 1 platforms and 0 devices
/usr/lib/python3.10/multiprocessing/resource_tracker.py:224: UserWarning: resource_tracker: There appear to be 6 leaked semaphore objects to clean up at shutdown
  warnings.warn('resource_tracker: There appear to be %d '
[1]    112769 IOT instruction (core dumped)  PARALLEL=1 DEBUG=2 BEAM=2 PYTHONPATH=. python3 examples/llama.py --gen code

Does the line "CLDevice: got 1 platforms and 0 devices" mean we are initializing a GPU device?
