You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Sure, trimming involves removing nodes and connections in the network while minimizing accuracy loss. There is also an inference performance gain in both speed and hardware requirements.
I don't know of any right now, this is just a placeholder for people to fill in if they are aware of such options.
Here is an example of a performance increase from this pruning process: https://github.com/mlcommons/inference_results_v3.0/tree/main/open/NeuralMagic
The text was updated successfully, but these errors were encountered: