Need support for loading models that only contain .pt weights
Feature request
Need support for loading models that only contain .pt weights.

Motivation
I quantized a Mixtral 8x7B model using HQQ, which produces a qmodel.pt file. However, I am unable to load the weights in LoRAX because it expects either .safetensors or .bin weights.

Your contribution
I haven't studied the source enough to submit a PR, but from a cursory reading of the code, changes would need to be made in the hub.py file, specifically:
lorax/server/lorax_server/utils/sources/hub.py
Lines 68 to 78 in cc2e0a9
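A change along these lines could extend the weight-file discovery to fall back to .pt when neither .safetensors nor .bin files are present. The sketch below is a hypothetical illustration, not the actual LoRAX code: the function name, the priority order, and the use of a simple extension filter are all assumptions.

```python
from pathlib import Path

# Hypothetical sketch (not the actual hub.py logic): discover weight files
# by extension, preferring .safetensors, then .bin, then falling back to .pt.
# The priority order and function name are assumptions for illustration.
WEIGHT_EXTENSIONS = [".safetensors", ".bin", ".pt"]

def discover_weight_files(filenames):
    """Return the filenames matching the highest-priority extension present."""
    for ext in WEIGHT_EXTENSIONS:
        matches = [f for f in filenames if Path(f).suffix == ext]
        if matches:
            return matches
    raise FileNotFoundError(
        f"no weight files found with extensions {WEIGHT_EXTENSIONS}"
    )

# A repo containing only an HQQ checkpoint would then resolve to the .pt file:
print(discover_weight_files(["qmodel.pt", "config.json"]))  # → ['qmodel.pt']
```

With a fallback like this, repositories that ship only a qmodel.pt would be picked up, while existing .safetensors/.bin repositories keep their current behavior.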
I would also like to be able to load the base model from a local path rather than from the hub (as explained in issue #347).