
8-bit quantization with PyTorch 2.0 #89

Open
eschaffn opened this issue May 17, 2023 · 3 comments

Comments

@eschaffn

Hey there!

Is it possible to do post-training quantization with Parseq? I'm looking for ways to speed up inference time. I tried training a parseq-tiny model but lost about 13% absolute validation accuracy.

I'm new to quantization and am unsure about the types of models it benefits or which type of quantization to use.

Thanks for any suggestions!
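For reference, the simplest post-training option in PyTorch is dynamic quantization, which needs no calibration data. A minimal sketch, assuming a generic nn.Module stands in for a loaded PARSeq checkpoint (the stand-in model below is hypothetical, not the actual architecture):

```python
import torch
import torch.nn as nn

# Tiny stand-in model; real usage would load a PARSeq checkpoint instead.
model = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 32))
model.eval()

# Dynamic quantization stores Linear weights as int8 and quantizes
# activations on the fly at inference time.
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

x = torch.randn(1, 64)
with torch.inference_mode():
    out = quantized(x)
print(out.shape)  # torch.Size([1, 32])
```

Dynamic quantization mainly helps Linear-heavy models on CPU; whether it recovers the accuracy lost by switching to parseq-tiny would need to be measured.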

@baudm
Owner

baudm commented Jun 9, 2023

I'm sorry but I'm also new to quantization and model deployments in general.

Another route is to use a bigger model, "sparsify" it, then prune the unused connections to optimize inference time.
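The sparsify-then-prune route above can be sketched with PyTorch's built-in magnitude pruning; the single Linear layer here is a hypothetical stand-in, not the PARSeq model itself:

```python
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

# Stand-in layer; in practice this would be applied to the trained model.
model = nn.Linear(256, 256)

# Zero out the 50% smallest-magnitude weights (unstructured L1 pruning).
prune.l1_unstructured(model, name="weight", amount=0.5)

# Make the pruning permanent: drops the mask and reparametrization,
# leaving a plain weight tensor with zeros baked in.
prune.remove(model, "weight")

sparsity = (model.weight == 0).float().mean().item()
print(f"weight sparsity: {sparsity:.2f}")
```

Note that unstructured sparsity only speeds up inference on runtimes that exploit sparse kernels; structured pruning (whole channels or heads) is usually needed for wall-clock gains on standard hardware.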

@dat080399

> Hey there!
>
> Is it possible to do post-training quantization with Parseq? I'm looking for ways to speed up inference time. I tried training a parseq-tiny model but lost about 13% absolute validation accuracy.
>
> I'm new to quantization and am unsure about the types of models it benefits or which type of quantization to use.
>
> Thanks for any suggestions!

Did you manage to speed up inference time? And did you use post-training quantization?

@VikasOjha666

VikasOjha666 commented Nov 25, 2023

I have added quantization support in a separate fork of this repo, which can be found here: https://github.com/VikasOjha666/parseq

By default, the model is trained with quantization-aware training, which helps preserve accuracy when the model is later quantized.
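For context, eager-mode quantization-aware training in PyTorch follows a prepare/train/convert pattern. A minimal sketch under the assumption of a generic model (the linked fork may wire this up differently; TinyNet and the training loop below are placeholders):

```python
import torch
import torch.nn as nn

class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        # Quant/DeQuant stubs mark where tensors enter and leave int8.
        self.quant = torch.quantization.QuantStub()
        self.fc = nn.Linear(16, 4)
        self.dequant = torch.quantization.DeQuantStub()

    def forward(self, x):
        return self.dequant(self.fc(self.quant(x)))

model = TinyNet()
model.train()
model.qconfig = torch.quantization.get_default_qat_qconfig("fbgemm")
torch.quantization.prepare_qat(model, inplace=True)

# Normal training loop runs with fake-quant observers active, so the
# weights learn to tolerate quantization error.
opt = torch.optim.SGD(model.parameters(), lr=0.01)
for _ in range(3):
    x, y = torch.randn(8, 16), torch.randn(8, 4)
    loss = nn.functional.mse_loss(model(x), y)
    opt.zero_grad()
    loss.backward()
    opt.step()

# After training, convert to a true int8 model for inference.
model.eval()
quantized = torch.quantization.convert(model)
print(quantized(torch.randn(1, 16)).shape)  # torch.Size([1, 4])
```

Because the fake-quant observers run during training, QAT typically loses much less accuracy than post-training quantization at the cost of a full training run.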
