Skip to content

Commit 3785c7d

Browse files
Yujun DongIngo Molnar
Yujun Dong
authored and
Ingo Molnar
committed
cpuidle, sched: Use smp_mb__after_atomic() in current_clr_polling()
In architectures that use the polling bit, current_clr_polling() employs smp_mb() to ensure that the clearing of the polling bit is visible to other cores before checking TIF_NEED_RESCHED. However, smp_mb() can be costly. Given that clear_bit() is an atomic operation, replacing smp_mb() with smp_mb__after_atomic() is appropriate. Many architectures implement smp_mb__after_atomic() as a lighter-weight barrier compared to smp_mb(), leading to performance improvements. For instance, on x86, smp_mb__after_atomic() is a no-op. This change eliminates a smp_mb() instruction in the cpuidle wake-up path, saving several CPU cycles and thereby reducing wake-up latency. Architectures that do not use the polling bit will retain the original smp_mb() behavior to ensure that existing dependencies remain unaffected. Signed-off-by: Yujun Dong <[email protected]> Signed-off-by: Ingo Molnar <[email protected]> Link: https://lore.kernel.org/r/[email protected]
1 parent b521730 commit 3785c7d

File tree

1 file changed

+16
-7
lines changed

1 file changed

+16
-7
lines changed

include/linux/sched/idle.h

Lines changed: 16 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -79,6 +79,21 @@ static __always_inline bool __must_check current_clr_polling_and_test(void)
7979
return unlikely(tif_need_resched());
8080
}
8181

82+
static __always_inline void current_clr_polling(void)
83+
{
84+
__current_clr_polling();
85+
86+
/*
87+
* Ensure we check TIF_NEED_RESCHED after we clear the polling bit.
88+
* Once the bit is cleared, we'll get IPIs with every new
89+
* TIF_NEED_RESCHED and the IPI handler, scheduler_ipi(), will also
90+
* fold.
91+
*/
92+
smp_mb__after_atomic(); /* paired with resched_curr() */
93+
94+
preempt_fold_need_resched();
95+
}
96+
8297
#else
8398
static inline void __current_set_polling(void) { }
8499
static inline void __current_clr_polling(void) { }
@@ -91,21 +106,15 @@ static inline bool __must_check current_clr_polling_and_test(void)
91106
{
92107
return unlikely(tif_need_resched());
93108
}
94-
#endif
95109

96110
static __always_inline void current_clr_polling(void)
97111
{
98112
__current_clr_polling();
99113

100-
/*
101-
* Ensure we check TIF_NEED_RESCHED after we clear the polling bit.
102-
* Once the bit is cleared, we'll get IPIs with every new
103-
* TIF_NEED_RESCHED and the IPI handler, scheduler_ipi(), will also
104-
* fold.
105-
*/
106114
smp_mb(); /* paired with resched_curr() */
107115

108116
preempt_fold_need_resched();
109117
}
118+
#endif
110119

111120
#endif /* _LINUX_SCHED_IDLE_H */

0 commit comments

Comments
 (0)