open-nomad/scheduler
Tim Gross 8ff5ea1bee
CSI: no early return when feasibility check fails on eligible nodes (#13274)
As a performance optimization in the scheduler, feasibility checks
that apply to an entire class are only checked once for all nodes of
that class. Other feasibility checks are "available" checks because
they rely on more ephemeral characteristics and don't contribute to
the hash for the node class. This currently includes only CSI.

We have a separate fast path for "available" checks when the node has
already been marked eligible on the basis of class. This fast path has
a bug where it returns early rather than continuing the loop. This
causes the entire task group to be rejected.

Fix the bug by not returning early in the fast path and instead jump
to the top of the loop like all the other code paths in this method.
Includes a new test exercising topology at whole-scheduler level and a
fix for an existing test that should've caught this previously.
2022-06-07 13:31:10 -04:00
..
benchmarks
annotate.go
annotate_test.go ci: swap ci parallelization for unconstrained gomaxprocs 2022-03-15 12:58:52 -05:00
context.go
context_test.go ci: swap ci parallelization for unconstrained gomaxprocs 2022-03-15 12:58:52 -05:00
device.go
device_test.go ci: swap ci parallelization for unconstrained gomaxprocs 2022-03-15 12:58:52 -05:00
feasible.go CSI: no early return when feasibility check fails on eligible nodes (#13274) 2022-06-07 13:31:10 -04:00
feasible_test.go ci: swap ci parallelization for unconstrained gomaxprocs 2022-03-15 12:58:52 -05:00
generic_sched.go system_scheduler: support disconnected clients (#12555) 2022-04-15 09:31:32 -04:00
generic_sched_test.go CSI: no early return when feasibility check fails on eligible nodes (#13274) 2022-06-07 13:31:10 -04:00
preemption.go
preemption_test.go ci: swap ci parallelization for unconstrained gomaxprocs 2022-03-15 12:58:52 -05:00
propertyset.go
rank.go
rank_test.go
reconcile.go reconciler: Handle canaries when client disconnects (#12539) 2022-04-21 10:05:58 -04:00
reconcile_test.go reconciler: Handle canaries when client disconnects (#12539) 2022-04-21 10:05:58 -04:00
reconcile_util.go core: reschedule evicted batch job when resources become available 2022-06-02 14:04:13 -05:00
reconcile_util_test.go core: reschedule evicted batch job when resources become available 2022-06-02 14:04:13 -05:00
scheduler.go disconnected clients: ensure servers meet minimum required version (#12202) 2022-04-05 17:12:23 -04:00
scheduler_oss.go
scheduler_sysbatch_test.go ci: swap ci parallelization for unconstrained gomaxprocs 2022-03-15 12:58:52 -05:00
scheduler_system.go system_scheduler: support disconnected clients (#12555) 2022-04-15 09:31:32 -04:00
scheduler_system_test.go system_scheduler: support disconnected clients (#12555) 2022-04-15 09:31:32 -04:00
select.go
select_test.go ci: swap ci parallelization for unconstrained gomaxprocs 2022-03-15 12:58:52 -05:00
spread.go
spread_test.go ci: swap ci parallelization for unconstrained gomaxprocs 2022-03-15 12:58:52 -05:00
stack.go
stack_oss.go
stack_test.go ci: swap ci parallelization for unconstrained gomaxprocs 2022-03-15 12:58:52 -05:00
testing.go disconnected clients: ensure servers meet minimum required version (#12202) 2022-04-05 17:12:23 -04:00
util.go scheduler: volume updates should always be destructive (#13008) 2022-05-13 11:34:04 -04:00
util_test.go scheduler: volume updates should always be destructive (#13008) 2022-05-13 11:34:04 -04:00