Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Supports for s4 and s8 GEMM on Python? #1463

Closed
mlpen opened this issue Apr 8, 2024 · 3 comments
Closed

Supports for s4 and s8 GEMM on Python? #1463

mlpen opened this issue Apr 8, 2024 · 3 comments
Labels
feature request New feature or request inactive-30d question Question

Comments

@mlpen
Copy link

mlpen commented Apr 8, 2024

Hi,

There are supports for using cutlass on Python https://github.com/NVIDIA/cutlass/blob/main/python/README.md, so I am wondering if there is a plan to support s4 and s8 GEMM on Python.
If not, is there any resource that I can use if I want to implement them by myself?

Thanks!

Tasks

No tasks being tracked yet.
@jackkosaian
Copy link
Contributor

The CUTLASS Python interface does support s8 GEMMs. Unit tests that show examples of using these are here and here.

The CUTLASS Python interface does not currently support s4. You can see this issue for some details related to this: #756

@mlpen
Copy link
Author

mlpen commented Apr 8, 2024

Thank you so much for the prompt response! This is helpful.

Copy link

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

@mnicely mnicely added the feature request New feature or request label May 20, 2024
@mnicely mnicely closed this as completed May 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request inactive-30d question Question
Projects
None yet
Development

No branches or pull requests

3 participants