Indexer Causes Unclean Shutdown Race Condition #1387
Comments
Hey @crypto-services, have you tried a more recent version of Bor?
The issue still appears to be present on v1.5.3.
Hi @crypto-services, apologies for the delay. I tried to reproduce this issue locally and on one of our internal nodes but couldn't. It seems there is a blocking call in the indexing service that doesn't honour the interrupt, so it stalls until the process is forcefully killed, leading to an unclean shutdown. To debug this further, it would be great if you could help me with the following things.
Thanks!
OK, sure thing. I'll notify the team and get them to capture it when possible.
Hey @crypto-services, did you prune ancient data by any chance on the node showing this issue?
@manav2401 Yeah, they've all been either pruned (or resynced at some point with a pruned snapshot). Additionally they've all had
Hi @crypto-services, this issue has been fixed in #1411. It will be available in a beta branch soon. For now, to avoid this issue, please set the
System information
Bor client version: 0.5.1
Heimdall client version: 1.0.10
OS & Version: Ubuntu 22.04/24.04
Environment: Polygon Mainnet
Type of node: All
Overview of the problem
When attempting to shut down the Bor service (SIGINT), the process appears unable to exit if the indexer was running at the time. The SIGINT then times out and triggers a SIGKILL, which either loses the recent state or, worse, corrupts the database.
Reproduction Steps
Happens often when stopping the Bor service with SIGINT.
Logs / Traces / Output / Error Messages
The node is not able to reach step
Writing cached state to disk
and is eventually killed. This does not happen when the indexer isn't running at the moment of shutdown.