-
IST Austria
- Vienna
- efrantar.github.io
- @elias_frantar
Block or Report
Block or report efrantar
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned
-
IST-DASLab/gptq
IST-DASLab/gptq PublicCode for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
-
IST-DASLab/sparsegpt
IST-DASLab/sparsegpt PublicCode for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
-
IST-DASLab/marlin
IST-DASLab/marlin PublicFP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.
-
IST-DASLab/qmoe
IST-DASLab/qmoe PublicCode for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
-
IST-DASLab/OBC
IST-DASLab/OBC PublicCode for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".
-
rob-twophase
rob-twophase PublicThe ultimate Rubik's Cube solving algorithm for high-speed axial robots.
If the problem persists, check the GitHub status page or contact support.