We can add back the FlashAttention 1 (FA1) implementation from huggingface/text-generation-inference#624 when a Volta or Turing compute capability is detected. Supporting both versions may bloat the Docker image somewhat, but this seems to be a common user pain point that we should definitely address.
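A rough sketch of what the detection could look like, not the actual implementation. The function names and the `"fa1"`/`"fa2"`/`"none"` labels are made up for illustration; the only real API used is `torch.cuda.get_device_capability()`, which returns a `(major, minor)` tuple. It assumes Volta reports compute capability 7.0 (or 7.2), Turing reports 7.5, and anything at 8.0 or above keeps the current path.

```python
from typing import Tuple


def select_attention_backend(major: int, minor: int) -> str:
    """Map a CUDA compute capability to a hypothetical backend label.

    Assumption for illustration: 8.0+ keeps the existing path ("fa2"),
    Volta (7.0, 7.2) and Turing (7.5) fall back to "fa1" as proposed above,
    and anything older gets no flash attention at all.
    """
    if major >= 8:
        return "fa2"
    if major == 7 and minor in (0, 2, 5):
        return "fa1"
    return "none"


def detect_attention_backend() -> str:
    """Query the current GPU, if any, and pick a backend label."""
    try:
        import torch  # imported lazily so the selector works without torch
    except ImportError:
        return "none"
    if not torch.cuda.is_available():
        return "none"
    capability: Tuple[int, int] = torch.cuda.get_device_capability()
    return select_attention_backend(*capability)
```

Keeping the capability check separate from the GPU query makes the fallback logic easy to unit test on machines without a GPU.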