]> xenbits.xensource.com Git - xen.git/commitdiff
xen/sched: fix race in RTDS scheduler
authorJuergen Gross <jgross@suse.com>
Mon, 31 Oct 2022 12:22:54 +0000 (13:22 +0100)
committerJan Beulich <jbeulich@suse.com>
Mon, 31 Oct 2022 12:22:54 +0000 (13:22 +0100)
When a domain gets paused the unit runnable state can change to "not
runnable" without the scheduling lock being involved. This means that
a specific scheduler isn't involved in this change of runnable state.

In the RTDS scheduler this can result in an inconsistency in case a
unit is losing its "runnable" capability while the RTDS scheduler's
scheduling function is active. RTDS will remove the unit from the run
queue, but doesn't do so for the replenish queue, leading to hitting
an ASSERT() in replq_insert() later when the domain is unpaused again.

Fix that by removing the unit from the replenish queue as well in this
case.

Fixes: 7c7b407e7772 ("xen/sched: introduce unit_runnable_state()")
Signed-off-by: Juergen Gross <jgross@suse.com>
Acked-by: Dario Faggioli <dfaggioli@suse.com>
master commit: 73c62927f64ecb48f27d06176befdf76b879f340
master date: 2022-10-21 12:32:23 +0200

xen/common/sched/rt.c

index c24cd2ac320046c03d5796bf9b46d33548388085..ec2ca1bebc26b4f58c9be8339b42c03fa9e95789 100644 (file)
@@ -1087,6 +1087,7 @@ rt_schedule(const struct scheduler *ops, struct sched_unit *currunit,
         else if ( !unit_runnable_state(snext->unit) )
         {
             q_remove(snext);
+            replq_remove(ops, snext);
             snext = rt_unit(sched_idle_unit(sched_cpu));
         }