Centos-kernel-stream-9/kernel
Patrick Talbert f4f8151fe5 Merge: workqueue: Do not warn when cancelling WQ_MEM_RECLAIM work from !WQ_MEM_RECLAIM worker
MR: https://gitlab.com/redhat/centos-stream/src/kernel/centos-stream-9/-/merge_requests/6197

JIRA: https://issues.redhat.com/browse/RHEL-74107
CVE: CVE-2024-57888

commit de35994ecd2dd6148ab5a6c5050a1670a04dec77
Author: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
Date:   Thu, 19 Dec 2024 09:30:30 +0000

    workqueue: Do not warn when cancelling WQ_MEM_RECLAIM work from !WQ_MEM_RECLAIM worker

    After commit
    746ae46c1113 ("drm/sched: Mark scheduler work queues with WQ_MEM_RECLAIM")
    amdgpu started seeing the following warning:

     [ ] workqueue: WQ_MEM_RECLAIM sdma0:drm_sched_run_job_work [gpu_sched] is flushing !WQ_MEM_RECLAIM events:amdgpu_device_delay_enable_gfx_off [amdgpu]
    ...
     [ ] Workqueue: sdma0 drm_sched_run_job_work [gpu_sched]
    ...
     [ ] Call Trace:
     [ ]  <TASK>
    ...
     [ ]  ? check_flush_dependency+0xf5/0x110
    ...
     [ ]  cancel_delayed_work_sync+0x6e/0x80
     [ ]  amdgpu_gfx_off_ctrl+0xab/0x140 [amdgpu]
     [ ]  amdgpu_ring_alloc+0x40/0x50 [amdgpu]
     [ ]  amdgpu_ib_schedule+0xf4/0x810 [amdgpu]
     [ ]  ? drm_sched_run_job_work+0x22c/0x430 [gpu_sched]
     [ ]  amdgpu_job_run+0xaa/0x1f0 [amdgpu]
     [ ]  drm_sched_run_job_work+0x257/0x430 [gpu_sched]
     [ ]  process_one_work+0x217/0x720
    ...
     [ ]  </TASK>

    The intent of the verifcation done in check_flush_depedency is to ensure
    forward progress during memory reclaim, by flagging cases when either a
    memory reclaim process, or a memory reclaim work item is flushed from a
    context not marked as memory reclaim safe.

    This is correct when flushing, but when called from the
    cancel(_delayed)_work_sync() paths it is a false positive because work is
    either already running, or will not be running at all. Therefore
    cancelling it is safe and we can relax the warning criteria by letting the
    helper know of the calling context.

    Signed-off-by: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
    Fixes: fca839c00a ("workqueue: warn if memory reclaim tries to flush !WQ_MEM_RECLAIM workqueue")
    References: 746ae46c1113 ("drm/sched: Mark scheduler work queues with WQ_MEM_RECLAIM")
    Cc: Tejun Heo <tj@kernel.org>
    Cc: Peter Zijlstra <peterz@infradead.org>
    Cc: Lai Jiangshan <jiangshanlai@gmail.com>
    Cc: Alex Deucher <alexander.deucher@amd.com>
    Cc: Christian König <christian.koenig@amd.com
    Cc: Matthew Brost <matthew.brost@intel.com>
    Cc: <stable@vger.kernel.org> # v4.5+
    Signed-off-by: Tejun Heo <tj@kernel.org>

Signed-off-by: Waiman Long <longman@redhat.com>

Approved-by: Phil Auld <pauld@redhat.com>
Approved-by: Herton R. Krzesinski <herton@redhat.com>
Approved-by: CKI KWF Bot <cki-ci-bot+kwf-gitlab-com@redhat.com>

Merged-by: Patrick Talbert <ptalbert@redhat.com>
2025-02-17 12:00:34 -05:00
..
bpf Merge: BPF 6.12 Rebase 2025-01-27 15:24:25 +01:00
cgroup Merge: cgroup: Extend cgroup.stat as a replacement of /proc/cgroups for v2 2024-11-25 13:17:36 +00:00
configs mm/slab: rename CONFIG_SLAB to CONFIG_SLAB_DEPRECATED 2024-09-05 20:35:59 -04:00
debug
dma mm, treewide: rename MAX_ORDER to MAX_PAGE_ORDER 2024-12-09 12:24:17 -05:00
entry
events lib/buildid: rename build_id_parse() into build_id_parse_nofault() 2025-01-21 11:27:05 +01:00
futex fault-inject: improve build for CONFIG_FAULT_INJECTION=n 2024-12-17 22:59:23 +01:00
gcov
irq kthread: add kthread_stop_put 2024-09-20 12:26:34 -07:00
kcsan
livepatch livepatch: Add "replace" sysfs attribute 2024-10-15 09:54:10 -04:00
locking rtmutex: Drop rt_mutex::wait_lock before scheduling 2024-12-18 17:06:50 +01:00
module module, bpf: Store BTF base pointer in struct module 2024-11-26 15:55:10 +01:00
power PM: hibernate: Use kernel_can_power_off() 2025-01-31 19:28:19 -05:00
printk printk: nbcon: Fix illegal RCU usage on thread wakeup 2025-01-06 10:33:52 -05:00
rcu rcu/kvfree: Fix data-race in __mod_timer / kvfree_call_rcu 2025-01-08 17:48:13 -05:00
sched Merge: Sched: Post 6.12 fixes (9.6) 2025-01-27 15:24:22 +01:00
time clocksource: Use migrate_disable() to avoid calling get_random_u32() in atomic context 2025-02-04 13:21:04 -05:00
trace Merge: BPF 6.12 Rebase 2025-01-27 15:24:25 +01:00
.gitignore
Kconfig.freezer
Kconfig.hz
Kconfig.kexec crash: clean up kdump related config items 2024-12-23 09:35:35 +08:00
Kconfig.locks
Kconfig.preempt
Makefile Merge: RHEL9.6 drm backport dependencies 2025-01-06 08:26:14 +00:00
acct.c
async.c
audit.c Merge: audit: Send netlink ACK before setting connection in auditd_set 2024-08-16 14:22:20 +00:00
audit.h
audit_fsnotify.c
audit_tree.c audit: Annotate struct audit_chunk with __counted_by 2024-07-04 14:52:57 -03:00
audit_watch.c
auditfilter.c audit: remove unnecessary assignment in audit_dupe_lsm_field() 2024-07-04 14:53:06 -03:00
auditsc.c fs: port xattr to mnt_idmap 2024-10-16 10:45:21 +08:00
backtracetest.c backtracetest: Convert from tasklet to BH workqueue 2024-06-18 17:26:55 -03:00
bounds.c bounds: Use the right number of bits for power-of-two CONFIG_NR_CPUS 2024-05-20 16:15:24 -04:00
capability.c fs: port i_{g,u}id_into_vfs{g,u}id() to mnt_idmap 2024-10-16 11:02:01 +08:00
cfi.c cfi: Remove CONFIG_CFI_CLANG_SHADOW 2024-06-17 14:17:20 -04:00
compat.c
configs.c
context_tracking.c
cpu.c Merge: Update arch/{x86,powerpc,arm64}/mm to v6.6 2024-11-12 08:02:20 +00:00
cpu_pm.c
crash_core.c Document/kexec: generalize crash hotplug description 2024-12-23 09:35:36 +08:00
crash_reserve.c crash: add prefix for crash dumping messages 2024-12-23 09:35:36 +08:00
cred.c
delayacct.c delayacct: track delays from IRQ/SOFTIRQ 2024-07-15 11:12:08 -04:00
dma.c
elfcorehdr.c crash: remove dependency of FA_DUMP on CRASH_DUMP 2024-12-23 09:35:35 +08:00
exec_domain.c
exit.c lazy tlb: introduce lazy tlb mm refcount helper functions 2024-11-04 09:14:17 -05:00
exit.h exit: add internal include file with helpers 2024-07-02 09:45:34 -04:00
extable.c
fail_function.c
fork.c Merge: CVE-2024-50271: ucounts: Split rlimit and ucount values and max values 2025-02-03 10:00:41 -05:00
freezer.c
gen_kheaders.sh
groups.c
hung_task.c
iomem.c mm: move is_ioremap_addr() into new header file 2024-10-01 11:17:52 -04:00
irq_work.c
jump_label.c
kallsyms.c kallsyms: Fix kallsyms_selftest failure 2024-06-17 14:17:28 -04:00
kallsyms_internal.h kallsyms: Reduce the memory occupied by kallsyms_seqs_of_names[] 2024-06-17 14:17:21 -04:00
kallsyms_selftest.c kallsyms: Fix kallsyms_selftest failure 2024-06-17 14:17:28 -04:00
kallsyms_selftest.h kallsyms: Add self-test facility 2024-06-17 14:17:22 -04:00
kcmp.c
kcov.c
kexec.c crash: add a new kexec flag for hotplug support 2024-12-23 09:35:36 +08:00
kexec_core.c crash: split crash dumping code out from kexec_core.c 2024-12-23 09:35:35 +08:00
kexec_elf.c
kexec_file.c kexec_file: fix elfcorehdr digest exclusion when CONFIG_CRASH_HOTPLUG=y 2024-12-23 09:35:36 +08:00
kexec_internal.h crash: remove dependency of FA_DUMP on CRASH_DUMP 2024-12-23 09:35:35 +08:00
kheaders.c
kprobes.c fprobe: Pass return address to the handlers 2024-10-25 09:07:32 +02:00
ksyms_common.c kallsyms: make kallsyms_show_value() as generic function 2024-06-17 14:17:28 -04:00
ksysfs.c crash: split crash dumping code out from kexec_core.c 2024-12-23 09:35:35 +08:00
kthread.c kthread: unpark only parked kthread 2024-11-25 12:31:48 +00:00
latencytop.c
module_signature.c
notifier.c
nsproxy.c
padata.c padata: use integer wrap around to prevent deadlock on seq_nr overflow 2024-11-13 15:23:14 +00:00
panic.c mm: remove arguments of show_mem() 2024-10-01 11:17:31 -04:00
params.c params: Introduce the param_unknown_fn type 2024-06-17 14:17:29 -04:00
pid.c
pid_namespace.c Revert "rcu-tasks: Fix synchronize_rcu_tasks() VS zap_pid_ns_processes()" 2024-08-26 10:57:50 -04:00
pid_sysctl.h
profile.c
ptrace.c mm: make __access_remote_vm() static 2024-12-09 12:22:45 -05:00
range.c
reboot.c kernel/reboot: Add SYS_OFF_MODE_RESTART_PREPARE mode 2024-08-22 11:21:33 -04:00
regset.c
relay.c
resource.c resource: fix region_intersects() vs add_memory_driver_managed() 2024-10-29 10:10:23 -06:00
resource_kunit.c
rh_messages.c
rh_messages.h [redhat] rh_messages.h: driver and device updates 2025-02-04 16:04:03 -05:00
rh_shadowman.c
rseq.c
scftorture.c
scs.c
seccomp.c
signal.c signal: restore the override_rlimit logic 2024-12-20 15:33:02 +00:00
smp.c
smpboot.c kthread: add kthread_stop_put 2024-09-20 12:26:34 -07:00
smpboot.h
softirq.c softirq: Allow raising SCHED_SOFTIRQ from SMP-call-function on RT kernel 2024-12-11 19:37:36 +00:00
stackleak.c
stacktrace.c
static_call.c
static_call_inline.c Merge: static_call: Replace pointless WARN_ON() in static_call_module_notify() 2024-11-15 21:09:46 +00:00
stop_machine.c
sys.c Merge: CVE-2024-50271: ucounts: Split rlimit and ucount values and max values 2025-02-03 10:00:41 -05:00
sys_ni.c x86/shstk: Introduce map_shadow_stack syscall 2024-10-01 11:17:15 -04:00
sysctl-test.c
sysctl.c mm: hugetlb: move hugeltb sysctls to its own file 2024-07-16 09:29:59 -04:00
task_work.c task_work: Add TWA_NMI_CURRENT as an additional notify mode. 2024-10-14 13:02:40 +02:00
taskstats.c
test_kprobes.c
torture.c
tracepoint.c
tsacct.c
ucount.c signal: restore the override_rlimit logic 2024-12-20 15:33:02 +00:00
uid16.c
uid16.h
umh.c
up.c
user-return-notifier.c
user.c
user_namespace.c Merge: CVE-2024-50271: ucounts: Split rlimit and ucount values and max values 2025-02-03 10:00:41 -05:00
usermode_driver.c
utsname.c
utsname_sysctl.c
vmcore_info.c crash: split vmcoreinfo exporting code out from crash_core.c 2024-12-23 09:35:35 +08:00
watch_queue.c
watchdog.c
watchdog_hld.c
workqueue.c workqueue: Do not warn when cancelling WQ_MEM_RECLAIM work from !WQ_MEM_RECLAIM worker 2025-01-17 09:22:12 -05:00
workqueue_internal.h