-
Notifications
You must be signed in to change notification settings - Fork 228
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Kraft Metadata Cluster down due commit recordbatch exceed limit #2057
Comments
logs on local node
|
It caused by SSO compaction force split. I think we should limit the number of Streams for an Object in DeltaWALUploadTask to less than 10,000 to avoid exceeding the limits during subsequent SSO Compaction. A possible approach is: |
Pls assign that to me |
Done, thanks @hzh0425 , and cc @lifepuzzlefun |
feel free to go |
Version & Environment
master
What went wrong?
raft no available leader the leader elected and step down due commit fail.
What should have happened instead?
How to reproduce the issue?
create 5w+ partition and delete them at 100 concurrency and only have 5 node in cluster.
stop one node and start it.
so one node have big partition number. once trigger upload their may have a lot of StreamObject and Stream to upload
I think when commitSSO this will happen. and this may cause the whole cluster not function.
Additional information
Please attach any relevant logs, backtraces, or metric charts.
The text was updated successfully, but these errors were encountered: