Reinforcement learning ∩ LLMs, Generative models, Artificial intelligence
- San Francisco, CA
- alecwangcq.github.io
Highlights
- Pro
Pinned
- EigenDamage-Pytorch (Public): Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934
- KFAC-Pytorch (Public): PyTorch implementation of KFAC and E-KFAC (Natural Gradient).
- f-divergence-dpo (Public): Direct preference optimization with f-divergences.
  Python 6
- gd-zhang/Weight-Decay (Public): Regularization, Neural Network Training Dynamics
  Python 14
- ssydasheng/Neural-Kernel-Network (Public): Code for "Differentiable Compositional Kernel Learning for Gaussian Processes" https://arxiv.org/abs/1806.04326