-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Where is the 4bit attention API? #52
Comments
Hello. The 4-bit API will be released in the near future. |
looking forward the 4-int kernel |
We are currently exploring some new techniques that can further improve the accuracy of 4 bit kernel. |
awesome! looking forward to! do you have high level timeline that when 4 bit kernel or algorithm reference code might be ready? |
hi @jason-huang03 @jt-zhang any updates on 4bit kernel? |
I read the SageAttention2 tech report and I'm really thirsty for trying 4bit kernel. However I cannot find any API introduction in your README. Is it still under testing and not released?
The text was updated successfully, but these errors were encountered: