
Using a simple model for torchtune setup testing in PRs? #978

Open
mikekgfb opened this issue May 14, 2024 · 1 comment

Comments

@mikekgfb

I'm trying to run torchtune as an on-PR test for torchchat to ensure ongoing compatibility. Alas, llama3 can't be used because on-PR tests can't use HF tokens. Can you please suggest an alternative setup, or enable stories15M from Andrej's tinyllamas collection as a stand-in?

https://github.com/pytorch/torchchat/actions/runs/9071903213/job/24926494708

  + tune download stories15M --output-dir ./Meta-Llama-3-8B
  Ignoring files matching the following patterns: *.safetensors
  usage: tune download <repo-id> [OPTIONS]
  tune download: error: Repository 'stories15M' not found on the Hugging Face Hub.

cc: @byjlw

@ebsmothers
Contributor

Hi @mikekgfb, one suggestion is to use one of the checkpoints here, which you can get via curl. These are more or less random weights, though, so if you want to test generation quality specifically you may want to go another route.
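Adapting that suggestion into a CI step might look like the sketch below. The URL is a placeholder standing in for the small-checkpoint link from the comment, and the output directory matches the one in the failing log above; this only exercises setup, since the weights are random:

```shell
# Hedged sketch: replace `tune download` (which needs a Hugging Face token,
# unavailable in on-PR CI) with a plain curl fetch of a small checkpoint.
# CKPT_URL is a placeholder; substitute the real small-checkpoint link.
CKPT_URL="https://example.com/small-ckpt.pt"
OUT_DIR=./Meta-Llama-3-8B
mkdir -p "$OUT_DIR"
# -L follows redirects; -f turns HTTP errors into a non-zero exit so CI flags a bad link.
curl -Lf "$CKPT_URL" -o "$OUT_DIR/model.pt" || echo "placeholder URL; swap in the real checkpoint link"
```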
