Allow assigning resources to nodes with finite negative scores #3802

kgaillot · 2025-01-15T15:51:01Z

No description provided.

... in pcmk__node_available(), to make it easier to add new conditions. I went through the callers to see if any others should reject guest nodes with unrunnable guests, and it appears not.

... now that the "alive" and "usable" checks are separated

... to isolate the code more logically

Nothing uses it yet

This allows resources to be active in cases where they would previously be stopped. Fixes T335

Now that resources may be assigned to nodes with finite negative scores, two scheduler regression tests have improvements: In systemhealthp2, the stonith resource may now start, whereas before it was left stopped. There are two nodes, hs21c (with health status yellow, equivalent to -100 preference) and hs21d (which is unseen, so unclean and offline). There was no good reason to leave stonith stopped. In node-maintenance-1, rsc1 stays active where it is, whereas before it was stopped. rsc1 is started on node1 (where it has a -1 location preference), and rsc2 is started on node2 (where it has a -1 location preference), and node2 is unmanaged. Since node2 is unmanaged, rsc2 can't be moved away from it, and rsc1 has nowhere to move to even though it has a negative preference for its current node. Previously, rsc1 would be stopped even though it couldn't be recovered elsewhere.

All existing callers retain the same behavior

kgaillot · 2025-01-15T15:52:39Z

@nrwahl2 , could you review this? Thanks

This was something I had started but had to put aside for higher priority stuff. I went back and finished it but it needs careful consideration and review. It's a pretty big behavioral change but it can be considered a fix.

Previously, clone children were sorted in pcmk__clone_create_probe(). However, the sort was only necessary when probing anonymous clones on nodes that wouldn't have an active instance, so now they are sorted only in that case, for a slight efficiency gain.

kgaillot · 2025-01-15T16:34:52Z

The last commit is unrelated. I only glanced over the regression test changes but they appear to just be sorting differences as expected. Since the clone children are in a list, not a hash table, I expect the unsorted order to remain consistent so that regression test results don't vary, but if that does happen, this is the likely culprit.

kgaillot added 9 commits January 14, 2025 17:33

Refactor: scheduler: convert boolean arguments to flags

ae46153

... in pcmk__node_available(), to make it easier to add new conditions. I went through the callers to see if any others should reject guest nodes with unrunnable guests, and it appears not.

Refactor: scheduler: use pcmk__node_available() in more places

68181f4

... now that the "alive" and "usable" checks are separated

Refactor: scheduler: make guest check a node availablity option

8aca980

... to isolate the code more logically

Refactor: scheduler: make positive score check a node availablity option

fa3345b

... to isolate the code more logically

Refactor: scheduler: add option for disallowing banned nodes

247609b

Nothing uses it yet

Fix: scheduler: allow assigning to nodes with finite negative scores

d48abd1

This allows resources to be active in cases where they would previously be stopped. Fixes T335

Refactor: scheduler: allow pcmk__any_node_available() to take flags

c475296

All existing callers retain the same behavior

Doc: scheduler: add TODO

72e5f98

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow assigning resources to nodes with finite negative scores #3802

Allow assigning resources to nodes with finite negative scores #3802

kgaillot commented Jan 15, 2025

kgaillot commented Jan 15, 2025

kgaillot commented Jan 15, 2025

Allow assigning resources to nodes with finite negative scores #3802

Are you sure you want to change the base?

Allow assigning resources to nodes with finite negative scores #3802

Conversation

kgaillot commented Jan 15, 2025

kgaillot commented Jan 15, 2025

kgaillot commented Jan 15, 2025