Changelog in Linux kernel 6.1.177

9p: avoid putting oldfid in p9_client_walk() error path

Author: Yizhou Zhao <[email protected]>
Date:   Thu May 28 13:39:16 2026 +0800

    9p: avoid putting oldfid in p9_client_walk() error path
    
    commit 1a3860d46e3eb47dbd60339783cdad7904486b9f upstream.
    
    When p9_client_walk() is called with clone set to false, fid aliases
    oldfid. If the walk subsequently fails after the request has been sent,
    the error path jumps to clunk_fid, which currently calls p9_fid_put(fid)
    unconditionally.
    
    This drops a reference to oldfid even though ownership of oldfid remains
    with the caller. If this is the last reference, oldfid can be clunked and
    destroyed while the caller still expects it to be valid. A later use or
    put of oldfid can then trigger a use-after-free or refcount underflow.
    
    Fix this by only putting fid in the clunk_fid error path when it does not
    alias oldfid, matching the existing guard in the error path below.
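
    The ownership rule described above can be sketched in plain userspace C.
    This is a minimal model, not the 9p client code: struct model_fid, its
    count field and every model_* function are hypothetical stand-ins, and
    the -1 return value is only a placeholder for the real error code.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for struct p9_fid: only the refcount matters here. */
struct model_fid {
	int count;
};

static void model_fid_put(struct model_fid *fid)
{
	assert(fid->count > 0);	/* a put on a dead fid would be the bug */
	fid->count--;
}

/*
 * Model of a walk whose request fails after it has been sent. With
 * clone == false the working fid aliases oldfid, so the error path must
 * leave it alone: the caller still owns that reference.
 */
static int model_walk_fail(struct model_fid *oldfid, struct model_fid *newfid,
			   bool clone)
{
	struct model_fid *fid = clone ? newfid : oldfid;

	/* ... request sent, server reports an error ... */

	/* clunk_fid: drop only a fid that this function owns */
	if (fid != oldfid)
		model_fid_put(fid);
	return -1;
}

/* Returns oldfid's refcount after a failed walk; the caller holds one ref. */
int model_oldfid_count_after_failed_walk(bool clone)
{
	struct model_fid oldfid = { .count = 1 };
	struct model_fid newfid = { .count = 1 };

	model_walk_fail(&oldfid, &newfid, clone);
	return oldfid.count;
}
```

    In this model the caller's reference survives a failed walk in both the
    cloning and the non-cloning case.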
    
    This can be triggered when a multi-component walk is split into multiple
    p9_client_walk() calls and a later non-cloning walk fails. A reproducer
    and refcount warning logs are available on request.
    
    Fixes: b48dbb998d70 ("9p fid refcount: add p9_fid_get/put wrappers")
    Cc: [email protected]
    Reported-by: Yuxiang Yang <[email protected]>
    Reported-by: Ao Wang <[email protected]>
    Reported-by: Xuewei Feng <[email protected]>
    Reported-by: Qi Li <[email protected]>
    Reported-by: Ke Xu <[email protected]>
    Assisted-by: GLM 5.1
    Signed-off-by: Yizhou Zhao <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Dominique Martinet <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

af_unix: Reject SIOCATMARK on non-stream sockets

Author: Jiexun Wang <[email protected]>
Date:   Wed Jun 24 15:16:48 2026 +0000

    af_unix: Reject SIOCATMARK on non-stream sockets
    
    commit d119775f2bad827edc28071c061fdd4a91f889a5 upstream.
    
    SIOCATMARK reports whether the receive queue is at the urgent mark for
    MSG_OOB.
    
    In AF_UNIX, MSG_OOB is supported only for SOCK_STREAM sockets.
    SOCK_DGRAM and SOCK_SEQPACKET reject MSG_OOB in sendmsg() and recvmsg(),
    so they should not support SIOCATMARK either.
    
    Return -EOPNOTSUPP for non-stream sockets before checking the receive
    queue.
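
    A minimal sketch of the ordering described above, assuming a simplified
    handler: model_unix_siocatmark() and its head_is_oob parameter are
    illustrative names, not the af_unix ioctl code.

```c
#include <errno.h>
#include <stdbool.h>
#include <sys/socket.h>

/*
 * Model of the SIOCATMARK branch. sock_type is SOCK_STREAM, SOCK_DGRAM or
 * SOCK_SEQPACKET; head_is_oob stands in for "the entry at the head of the
 * receive queue is the OOB one".
 */
int model_unix_siocatmark(int sock_type, bool head_is_oob, int *answ)
{
	if (sock_type != SOCK_STREAM)
		return -EOPNOTSUPP;	/* rejected before the queue is examined */

	*answ = head_is_oob ? 1 : 0;
	return 0;
}
```

    Because the type check comes first, the answer pointer is never touched
    for datagram or seqpacket sockets.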
    
    Fixes: 314001f0bf92 ("af_unix: Add OOB support")
    Cc: [email protected]
    Reported-by: Yuan Tan <[email protected]>
    Reported-by: Yifan Wu <[email protected]>
    Reported-by: Juefei Pu <[email protected]>
    Reported-by: Xin Liu <[email protected]>
    Suggested-by: Kuniyuki Iwashima <[email protected]>
    Signed-off-by: Jiexun Wang <[email protected]>
    Signed-off-by: Ren Wei <[email protected]>
    Reviewed-by: Kuniyuki Iwashima <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Alexander Martyniuk <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

agp/amd64: Fix broken error propagation in agp_amd64_probe()

Author: Mingyu Wang <[email protected]>
Date:   Mon May 4 15:48:23 2026 +0800

    agp/amd64: Fix broken error propagation in agp_amd64_probe()
    
    commit b08472db93b1ccff84a7adec5779d47f0e9d3a30 upstream.
    
    A NULL pointer dereference was observed in the AMD64 AGP driver when
    running in a virtualized environment (e.g. qemu/kvm) without a physical
    AMD northbridge. The crash occurs in amd64_fetch_size() when attempting
    to dereference the pointer returned by node_to_amd_nb(0).
    
    The root cause of this crash is broken error propagation in
    agp_amd64_probe(): When no AMD northbridges are found, cache_nbs()
    correctly returns -ENODEV. However, the probe function erroneously
    checks the return value against exactly -1, rather than < 0.
    
    As a result, the hardware absence error is masked, allowing the driver
    to improperly proceed with initialization. It eventually calls
    agp_add_bridge(), which invokes amd64_fetch_size(). Since the hardware
    does not exist, node_to_amd_nb(0) returns NULL, leading to a General
    Protection Fault (GPF) when accessing its ->misc member.
    
    Fix the issue by correcting the error check in agp_amd64_probe() to
    abort properly when cache_nbs() returns any negative error code. This
    prevents the driver from erroneously proceeding without hardware, thereby
    avoiding the subsequent NULL pointer dereference at its source.
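
    The difference between the two checks can be shown with a small model.
    The model_* functions below are hypothetical; in particular, whether the
    real probe function returns the original error code or a fixed one is
    not stated here and is an assumption of this sketch.

```c
#include <errno.h>

/* Stand-in for cache_nbs() on a machine with or without a northbridge. */
static int model_cache_nbs(int have_northbridge)
{
	return have_northbridge ? 0 : -ENODEV;
}

/* Old check: only the literal value -1 is treated as a failure. */
int model_probe_old(int have_northbridge)
{
	if (model_cache_nbs(have_northbridge) == -1)
		return -ENODEV;
	return 0;	/* continues towards agp_add_bridge() */
}

/* Fixed check: any negative error code aborts the probe. */
int model_probe_fixed(int have_northbridge)
{
	int ret = model_cache_nbs(have_northbridge);

	if (ret < 0)
		return ret;
	return 0;
}
```

    Since -ENODEV is not -1, the old check lets the probe continue even
    though no hardware was found.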
    
    Fixes: a32073bffc65 ("[PATCH] x86_64: Clean and enhance up K8 northbridge access code")
    Signed-off-by: Mingyu Wang <[email protected]>
    Signed-off-by: Lukas Wunner <[email protected]>
    Reviewed-by: Lukas Wunner <[email protected]>
    Cc: [email protected] # v2.6.18+
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

apparmor: fix use-after-free in rawdata dedup loop

Author: Ruslan Valiyev <[email protected]>
Date:   Tue May 26 00:04:46 2026 +0200

    apparmor: fix use-after-free in rawdata dedup loop
    
    commit 6f060496d03e4dc560a40f73770bd08335cb7a27 upstream.
    
    aa_replace_profiles() walks ns->rawdata_list to dedup the incoming
    policy blob against entries already attached to existing profiles.
    Per the kernel-doc on struct aa_loaddata, list membership does not
    hold a reference: profiles hold pcount, and when the last pcount
    drops, do_ploaddata_rmfs() is queued on a workqueue that takes
    ns->lock and removes the entry. Between dropping the last pcount
    and the workqueue running, an entry remains on the list with
    pcount == 0.
    
    aa_get_profile_loaddata() is an unconditional kref_get() on
    pcount, so when the dedup loop hits such an entry, refcount
    hardening reports
    
      refcount_t: addition on 0; use-after-free.
    
    inside aa_replace_profiles(), and the poisoned counter then
    trips "saturated" and "underflow" warnings on the subsequent
    uses of the same loaddata.
    
    Before commit a0b7091c4de4 ("apparmor: fix race on rawdata
    dereference") the dedup path used a get_unless_zero-style helper
    on a single counter, so the existing "if (tmp)" guard was
    meaningful. The split-refcount refactor introduced
    aa_get_profile_loaddata(), which has plain kref_get() semantics,
    and the guard quietly became a no-op.
    
    Introduce aa_get_profile_loaddata_not0(), matching the existing
    _not0 convention used by aa_get_profile_not0(), and use it for
    the rawdata_list dedup lookup so dying entries are skipped.
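
    The "_not0" semantics can be sketched with C11 atomics. This is a
    userspace illustration only: model_get_not0() and model_count_live() are
    hypothetical names and do not reproduce the kref-based apparmor helper.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* "not0" get: take a reference only while at least one is still held. */
bool model_get_not0(atomic_int *count)
{
	int old = atomic_load(count);

	while (old != 0) {
		/* on failure, old is reloaded with the current value */
		if (atomic_compare_exchange_weak(count, &old, old + 1))
			return true;
	}
	return false;	/* entry is dying: the caller must skip it */
}

/* Returns how many entries a dedup-style scan could safely reuse. */
int model_count_live(atomic_int *pcounts, int n)
{
	int live = 0;

	for (int i = 0; i < n; i++)
		if (model_get_not0(&pcounts[i]))
			live++;
	return live;
}
```

    An entry whose count has already reached zero is left untouched instead
    of being revived by an unconditional increment.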
    
    Reproduced on x86_64 with v7.1-rc5 in QEMU+KVM running Ubuntu
    24.04 + stress-ng 0.17.06:
    
      stress-ng --apparmor 1 --klog-check --timeout 60s
    
    Without this patch the three refcount_t warnings fire within a
    few seconds. With it the same 60 s run is clean. Coverage is a
    smoke-test only; a longer soak with CONFIG_KASAN, CONFIG_KCSAN
    and CONFIG_PROVE_LOCKING would be welcome from anyone with the
    cycles.
    
    Fixes: a0b7091c4de4 ("apparmor: fix race on rawdata dereference")
    Reported-by: Colin Ian King <[email protected]>
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=221513
    Cc: [email protected]
    Signed-off-by: Ruslan Valiyev <[email protected]>
    Signed-off-by: John Johansen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

apparmor: mediate the implicit connect of TCP fast open sendmsg

Author: Bryam Vargas <[email protected]>
Date:   Mon Jun 22 15:57:38 2026 -0500

    apparmor: mediate the implicit connect of TCP fast open sendmsg
    
    commit 4d587cd8a72155089a627130bbd4716ec0856e21 upstream.
    
    sendmsg()/sendto() with MSG_FASTOPEN is a combination of connect(2) and
    write(2): it opens the connection in the SYN. apparmor_socket_sendmsg()
    only checks AA_MAY_SEND, so a profile that grants send but denies connect
    lets a confined task open an outbound TCP/MPTCP connection that connect(2)
    would have refused, bypassing connect mediation.
    
    Mediate the implicit connect when MSG_FASTOPEN is set and a destination
    is supplied. Add it to apparmor_socket_sendmsg() (not the shared
    aa_sock_msg_perm() helper, which recvmsg also uses) and call aa_sk_perm()
    directly, mirroring the selinux and tomoyo fixes. sk_is_tcp() does not
    cover MPTCP fast open, so the SOCK_STREAM/IPPROTO_MPTCP arm is explicit.
    
    Fixes: cf60af03ca4e ("net-tcp: Fast Open client - sendmsg(MSG_FASTOPEN)")
    Cc: [email protected]
    Signed-off-by: Bryam Vargas <[email protected]>
    Signed-off-by: John Johansen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ARM: allow __do_kernel_fault() to report execution of memory faults

Author: Russell King (Oracle) <[email protected]>
Date:   Mon Jun 22 12:26:32 2026 +0200

    ARM: allow __do_kernel_fault() to report execution of memory faults
    
    commit 40b466db1dffb41f0529035c59c5739636d0e5b8 upstream
    
    Allow __do_kernel_fault() to detect the execution of memory, so we can
    provide the same fault message as do_page_fault() would do. This is
    required when we split the kernel address fault handling from the
    main do_page_fault() code path.
    
    Reviewed-by: Xie Yuanbin <[email protected]>
    Tested-by: Xie Yuanbin <[email protected]>
    Signed-off-by: Russell King (Oracle) <[email protected]>
    Signed-off-by: Sebastian Andrzej Siewior <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ARM: fix branch predictor hardening

Author: Russell King (Oracle) <[email protected]>
Date:   Mon Jun 22 12:26:34 2026 +0200

    ARM: fix branch predictor hardening
    
    commit fd2dee1c6e2256f726ba33fd3083a7be0efc80d3 upstream.
    
    __do_user_fault() may be called with indeterminate interrupt enable
    state, which means we may be preemptible at this point. This causes
    problems when calling harden_branch_predictor(). For example, when
    called from a data abort, do_alignment_fault()->do_bad_area().
    
    Move harden_branch_predictor() out of __do_user_fault() and into the
    calling contexts.
    
    Moving it into do_kernel_address_page_fault(), we can be sure that
    interrupts will be disabled here.
    
    Converting do_translation_fault() to use do_kernel_address_page_fault()
    rather than do_bad_area() means that we keep branch predictor handling
    for translation faults. Interrupts will also be disabled at this call
    site.
    
    do_sect_fault() needs special handling, so detect user mode accesses
    to kernel-addresses, and add an explicit call to branch predictor
    hardening.
    
    Finally, add branch predictor hardening to do_alignment() for the
    faulting case (user mode accessing kernel addresses) before interrupts
    are enabled.
    
    This should cover all cases where harden_branch_predictor() is called,
    ensuring that it always has interrupts disabled, also ensuring that
    it is called early in each call path.
    
    Reviewed-by: Xie Yuanbin <[email protected]>
    Tested-by: Xie Yuanbin <[email protected]>
    Signed-off-by: Russell King (Oracle) <[email protected]>
    Signed-off-by: Sebastian Andrzej Siewior <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ARM: fix hash_name() fault

Author: Russell King (Oracle) <[email protected]>
Date:   Mon Jun 22 12:26:33 2026 +0200

    ARM: fix hash_name() fault
    
    commit 7733bc7d299d682f2723dc38fc7f370b9bf973e9 upstream.
    
    Zizhi Wo reports:
    
    "During the execution of hash_name()->load_unaligned_zeropad(), a
     potential memory access beyond the PAGE boundary may occur. For
     example, when the filename length is near the PAGE_SIZE boundary.
     This triggers a page fault, which leads to a call to
     do_page_fault()->mmap_read_trylock(). If we can't acquire the lock,
     we have to fall back to the mmap_read_lock() path, which calls
     might_sleep(). This breaks RCU semantics because path lookup occurs
     under an RCU read-side critical section."
    
    This is seen with CONFIG_DEBUG_ATOMIC_SLEEP=y and CONFIG_KFENCE=y.
    
    Kernel addresses (with the exception of the vectors/kuser helper
    page) do not have VMAs associated with them. If the vectors/kuser
    helper page faults, then there are two possibilities:
    
    1. if the fault happened while in kernel mode, then we're basically
       dead, because the CPU won't be able to vector through this page
       to handle the fault.
    2. if the fault happened while in user mode, that means the page was
       protected from user access, and we want to fault anyway.
    
    Thus, we can handle kernel addresses from any context entirely
    separately without going anywhere near the mmap lock. This gives us
    an entirely non-sleeping path for all kernel mode kernel address
    faults.
    
    As we handle the kernel address faults before interrupts are enabled,
    this change has the side effect of improving the branch predictor
    hardening, but does not completely solve the issue.
    
    Reported-by: Zizhi Wo <[email protected]>
    Reported-by: Xie Yuanbin <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Reviewed-by: Xie Yuanbin <[email protected]>
    Tested-by: Xie Yuanbin <[email protected]>
    Signed-off-by: Russell King (Oracle) <[email protected]>
    Signed-off-by: Sebastian Andrzej Siewior <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ARM: group is_permission_fault() with is_translation_fault()

Author: Russell King (Oracle) <[email protected]>
Date:   Mon Jun 22 12:26:31 2026 +0200

    ARM: group is_permission_fault() with is_translation_fault()
    
    commit dea20281ac88226615761c570c8ff7adc18e6ac2 upstream.
    
    Group is_permission_fault() with is_translation_fault(), which is
    needed to use is_permission_fault() in __do_kernel_fault(). As
    this is static inline, there is no need for this to be under
    CONFIG_MMU.
    
    Signed-off-by: Russell King (Oracle) <[email protected]>
    Signed-off-by: Sebastian Andrzej Siewior <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: bla: annotate lasttime access with READ/WRITE_ONCE

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:07 2026 +0200

    batman-adv: bla: annotate lasttime access with READ/WRITE_ONCE
    
    commit 98b0fb191c878a64cbaebfe231d96d57576acf8c upstream.
    
    The lasttime field for claim, backbone_gw, and loopdetect tracks the
    jiffies value of the most recent activity and is used to detect timeouts.
    These accesses are not consistently protected by a lock, so
    READ_ONCE/WRITE_ONCE must be used to prevent data races caused by compiler
    optimizations.
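
    A userspace sketch of the annotation pattern, assuming simplified
    macros: MODEL_READ_ONCE/MODEL_WRITE_ONCE are reduced imitations of the
    kernel macros (using the GCC/Clang __typeof__ extension), and struct
    model_claim is a hypothetical stand-in for an entry with a lasttime
    stamp.

```c
#include <stdbool.h>

/* Simplified userspace versions; the kernel macros do more checking. */
#define MODEL_READ_ONCE(x)	(*(const volatile __typeof__(x) *)&(x))
#define MODEL_WRITE_ONCE(x, v)	(*(volatile __typeof__(x) *)&(x) = (v))

/* Hypothetical stand-in for an entry carrying a lasttime stamp. */
struct model_claim {
	unsigned long lasttime;	/* tick count of the most recent activity */
};

void model_touch(struct model_claim *claim, unsigned long now)
{
	MODEL_WRITE_ONCE(claim->lasttime, now);
}

/* Wrap-safe timeout test in the style of time_after(). */
bool model_timed_out(struct model_claim *claim, unsigned long now,
		     unsigned long timeout)
{
	/* exactly one load; the compiler may not re-read or tear it */
	unsigned long last = MODEL_READ_ONCE(claim->lasttime);

	return (long)(now - (last + timeout)) > 0;
}
```

    The volatile access forces a single load or store per use, which is the
    property the lockless timeout checks rely on.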
    
    Cc: [email protected]
    Fixes: 23721387c409 ("batman-adv: add basic bridge loop avoidance code")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: dat: prevent false sharing between VLANs

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:21 2026 +0200

    batman-adv: dat: prevent false sharing between VLANs
    
    commit 20d7658b74169f86d4ac01b9185b3eadddf71f28 upstream.
    
    The local hash of DAT entries is supposed to be VLAN (VID) aware. But
    the adding to the hash and the search in the hash were not checking the VID
    information of the hash entries. The entries would therefore only be
    correctly separated when batadv_hash_dat() didn't select the same buckets
    for different VIDs.
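
    A minimal sketch of a VID-aware bucket lookup. struct model_dat_entry
    and the model_* functions are hypothetical and reduced to the two fields
    that matter for the comparison; they are not the batman-adv hash code.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, reduced form of a DAT entry: IPv4 address plus VLAN id. */
struct model_dat_entry {
	uint32_t ip;
	uint16_t vid;
};

/* Two entries match only if both the address and the VID agree. */
static bool model_dat_match(const struct model_dat_entry *e,
			    uint32_t ip, uint16_t vid)
{
	return e->ip == ip && e->vid == vid;
}

/* Linear scan of one bucket; returns the index of the match or -1. */
int model_dat_bucket_find(const struct model_dat_entry *bucket, size_t n,
			  uint32_t ip, uint16_t vid)
{
	for (size_t i = 0; i < n; i++)
		if (model_dat_match(&bucket[i], ip, vid))
			return (int)i;
	return -1;
}
```

    With the VID in the comparison, two VLANs that use the same IP address
    stay separate even when they land in the same bucket.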
    
    Cc: [email protected]
    Fixes: be1db4f6615b ("batman-adv: make the Distributed ARP Table vlan aware")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: ensure bcast is writable before modifying TTL

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:10 2026 +0200

    batman-adv: ensure bcast is writable before modifying TTL
    
    commit 4cd6d3a4b96a8576f1fed8f9f9f17c2dc2978e0c upstream.
    
    Before batman-adv is allowed to write to an skb, it either has to have its
    own copy of the skb or use skb_cow() to ensure that the data part is not
    shared.
    
    The old implementation used a shared queue and created copies before
    attempting to write to it. But with the new implementation, the broadcast
    packet is already modified when it gets received, potentially writing to
    shared buffers in this process.
    
    Adding a skb_cow() right before this operation avoids this and can at the
    same time prepare it for the modifications required to rebroadcast the
    packet.
    
    Cc: [email protected]
    Fixes: 3f69339068f9 ("batman-adv: bcast: queue per interface, if needed")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: fix (m|b)cast csum after decrementing TTL

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:11 2026 +0200

    batman-adv: fix (m|b)cast csum after decrementing TTL
    
    commit e728bbdf32660c8f32b8f5e8d09427a2c131ad60 upstream.
    
    The broadcast and multicast packets can be received at the same time by the
    local system and forwarded to other nodes. Both are simply decrementing the
    TTL at the beginning of the receive path - independent of chosen paths
    (receive/forward). But such a modification of the data conflicts with the
    hw csum. This is not a problem when the packet is directly forwarded but
    can cause errors in the local receive path.
    
    Such a problem can then trigger a "hw csum failure". The receiver path must
    therefore ensure that the csum is fixed for each modification of the
    payload before batadv_interface_rx() is reached.
    
    Since all batman-adv packet types with a ttl have it as u8 at offset 2, a
    helper can be used for all of them. But it is only used at the moment for
    batadv_bcast_packet and batadv_mcast_packet because they are the only ones
    which deliver the packet locally but unconditionally modify the TTL.
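
    Why a TTL change needs a matching csum update can be illustrated with a
    plain 16-bit ones' complement sum. This sketch uses hypothetical model_*
    functions and generic incremental-update arithmetic; it is not the
    kernel's csum helper API, and the packet bytes used with it are
    arbitrary.

```c
#include <stddef.h>
#include <stdint.h>

/* Fold a 32-bit accumulator into a 16-bit ones' complement sum. */
static uint16_t model_fold(uint32_t sum)
{
	while (sum >> 16)
		sum = (sum & 0xffff) + (sum >> 16);
	return (uint16_t)sum;
}

/* Ones' complement sum over big-endian 16-bit words; len must be even. */
uint16_t model_csum(const uint8_t *data, size_t len)
{
	uint32_t sum = 0;

	for (size_t i = 0; i + 1 < len; i += 2)
		sum += (uint32_t)(data[i] << 8 | data[i + 1]);
	return model_fold(sum);
}

/*
 * Decrement the TTL byte at offset 2 and patch the sum incrementally:
 * subtract the old 16-bit word (by adding its complement), add the new one.
 */
uint16_t model_dec_ttl_fix_csum(uint8_t *pkt, uint16_t csum)
{
	uint16_t old_word = (uint16_t)(pkt[2] << 8 | pkt[3]);
	uint16_t new_word;

	pkt[2]--;
	new_word = (uint16_t)(pkt[2] << 8 | pkt[3]);
	return model_fold((uint32_t)csum + (uint16_t)~old_word + new_word);
}
```

    The patched sum matches a full recomputation over the modified bytes,
    whereas the stale sum would no longer describe the data.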
    
    Cc: [email protected]
    Fixes: 3f69339068f9 ("batman-adv: bcast: queue per interface, if needed")
    Fixes: 07afe1ba288c ("batman-adv: mcast: implement multicast packet reception and forwarding")
    [ Context, Drop change for non-existing mcast handling ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: frag: avoid underflow of TTL

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:13 2026 +0200

    batman-adv: frag: avoid underflow of TTL
    
    commit 493d9d2528e1a09b090e4b37f0f553def7bd5ce9 upstream.
    
    Packets with a TTL use it to limit the number of times the packet can
    be forwarded. But for batadv_frag_packet, the TTL was only ever reduced
    and never evaluated. It could even underflow without any effect.
    
    Check the TTL in batadv_frag_skb_fwd() before attempting to prepare it for
    forwarding. This keeps it in sync with the not fragmented unicast packet.
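
    A sketch of such a check, assuming a threshold of "less than 2" modelled
    on the unicast path; model_ttl_allow_forward() is a hypothetical helper,
    not the function added by this commit.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Returns true and decrements *ttl if the packet may be forwarded once more;
 * returns false and leaves *ttl untouched if the hop budget is used up.
 */
bool model_ttl_allow_forward(uint8_t *ttl)
{
	if (*ttl < 2)
		return false;	/* would reach 0, or wrap from 0 to 255 */

	(*ttl)--;
	return true;
}
```

    Evaluating the TTL before the decrement means the u8 can never wrap
    around to a large value.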
    
    Cc: [email protected]
    Fixes: 610bfc6bc99b ("batman-adv: Receive fragmented packets and merge")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: frag: ensure fragment is writable before modifying TTL

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:12 2026 +0200

    batman-adv: frag: ensure fragment is writable before modifying TTL
    
    commit b7293c6e8c15b2db77809b25cf8389e35331b27a upstream.
    
    Before batman-adv is allowed to write to an skb, it either has to have its
    own copy of the skb or use skb_cow() to ensure that the data part is not
    shared. But batadv_frag_skb_fwd() modifies the TTL even when it is shared.
    
    Adding a skb_cow() right before this operation avoids this and can at the
    same time prepare it for the modifications required to forward the
    fragment.
    
    Cc: [email protected]
    Fixes: 610bfc6bc99b ("batman-adv: Receive fragmented packets and merge")
    [ Context ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: prevent ELP transmission interval underflow

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:08 2026 +0200

    batman-adv: prevent ELP transmission interval underflow
    
    commit 5e50d4b8ae3ea622122d3c6a38d7f6fe68dfddca upstream.
    
    batadv_v_elp_start_timer() enqueues a delayed work. The time when it starts
    is randomly chosen between (elp_interval - BATADV_JITTER) and
    (elp_interval + BATADV_JITTER). The configured elp_interval must therefore
    be greater than or equal to BATADV_JITTER to avoid an underflow of the
    unsigned integer. If this happened, a "fast" ELP interval would turn into
    a "day long" delay.
    
    At the same time, it must not be larger than the maximum value the variable
    can store.
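
    The wrap-around can be reproduced with 32-bit unsigned arithmetic. The
    jitter value of 20 and the exact upper bound are assumptions of this
    sketch, and the model_* names are hypothetical.

```c
#include <stdbool.h>
#include <stdint.h>

#define MODEL_JITTER 20u	/* assumed value, in milliseconds */

/* Lower bound of the randomized delay, computed in unsigned 32-bit. */
uint32_t model_elp_min_delay(uint32_t elp_interval)
{
	return elp_interval - MODEL_JITTER;	/* wraps if interval < jitter */
}

/* Accept only intervals for which neither bound leaves the u32 range. */
bool model_elp_interval_valid(uint32_t elp_interval)
{
	return elp_interval >= MODEL_JITTER &&
	       elp_interval <= UINT32_MAX - MODEL_JITTER;
}
```

    For example, an interval of 10 ms yields a lower bound of 4294967286 ms
    in this model, which is why such values have to be rejected up front.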
    
    Cc: [email protected]
    Fixes: a10800829040 ("batman-adv: Add elp_interval hardif genl configuration")
    [ Context ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: add only finished tp_vars to lists

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:06 2026 +0200

    batman-adv: tp_meter: add only finished tp_vars to lists
    
    commit 15ccbf685222274f5add1387af58c2a41a95f81e upstream.
    
    When the receiver variables (aka "session") are initialized, then they are
    added to the list of sessions before the timer is set up. An RCU-protected
    reader could therefore find the entry and run mod_setup before
    batadv_tp_init_recv() finished the timer initialization.
    
    The same is true for batadv_tp_start(), which must first initialize the
    finish_work and the test_length to avoid a similar problem.
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: annotate last_recv_time access with READ/WRITE_ONCE

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:16 2026 +0200

    batman-adv: tp_meter: annotate last_recv_time access with READ/WRITE_ONCE
    
    commit d67c728f07fca2ee6ffdc6dd4421cf2e8691f4d1 upstream.
    
    The last_recv_time field for batadv_tp_receiver tracks the jiffies value of
    the most recent activity and is used to detect timeouts. These accesses are
    not consistently protected by a lock, so READ_ONCE/WRITE_ONCE must be used
    to prevent data races caused by compiler optimizations.
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: avoid divide-by-zero for dec_cwnd

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:03 2026 +0200

    batman-adv: tp_meter: avoid divide-by-zero for dec_cwnd
    
    commit 33ccd52f3cc9ed46ce395199f89aa3234dc83314 upstream.
    
    The cwnd is always MSS <= cwnd <= 0x20000000. But the calculation in
    batadv_tp_update_cwnd() assumes unsigned 32-bit arithmetic.
    
        ((mss * 8) ** 2) / (cwnd * 8)
    
    In case cwnd is actually 0x20000000, it will be shifted left by 3 bits and
    end up at 0x100000000 or U32_MAX + 1. It will therefore wrap around
    and be 0 - resulting in:
    
        ((mss * 8) ** 2) / 0
    
    This is of course invalid and cannot be calculated. The calculation
    must be simplified to avoid this overflow:
    
       (mss ** 2) * 8 / cwnd
    
    It will keep the precision enhancement from the scaling (by 8) but avoid
    the overflow in the divisor.
    
    In theory, there could still be an overflow in the dividend. The mss is at
    the moment fixed to BATADV_TP_PLEN in batadv_tp_recv_ack() - so it is not an
    imminent problem. But allowing mss to use the whole u32 range would
    mean that the dividend can use up to 67 bits. To keep this calculation safe for
    32 bit arithmetic, mss must never use more than floor((32 - 3) / 2) bits -
    or in other words: must never be larger than 16383.
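
    The arithmetic above can be checked with a few lines of C. The model_*
    functions are hypothetical and only mirror the two formulas quoted in
    this message, using 32-bit unsigned arithmetic.

```c
#include <assert.h>
#include <stdint.h>

/* Old divisor: cwnd scaled by 8 in 32-bit arithmetic. */
uint32_t model_old_divisor(uint32_t cwnd)
{
	return cwnd << 3;	/* 0x20000000 << 3 wraps to 0 */
}

/* Simplified form (mss ** 2) * 8 / cwnd; safe for mss <= 16383. */
uint32_t model_dec_cwnd_step(uint32_t mss, uint32_t cwnd)
{
	assert(cwnd != 0);	/* cwnd is documented as always >= MSS */
	return (mss * mss * 8u) / cwnd;
}
```

    With mss at its stated limit of 16383 the dividend is 2147221512, which
    still fits in 32 bits, and the divisor is no longer scaled at all.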
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: avoid window underflow

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:02 2026 +0200

    batman-adv: tp_meter: avoid window underflow
    
    commit 765947b81fb54b6ebb0bc1cfe55c0fa399e002b8 upstream.
    
    In batadv_tp_avail(), win_left is calculated with 32-bit unsigned
    arithmetic: win_left = win_limit - tp_vars->last_sent;
    
    During Fast Recovery, cwnd is inflated and last_sent advances rapidly. When
    Fast Recovery ends, cwnd drops abruptly back to ss_threshold. If the newly
    shrunk win_limit is less than last_sent, the unsigned subtraction will
    underflow, wrapping to a massive positive value. Instead of returning that
    the window is full (unavailable), it returns that the sender can continue
    sending.
    
    To handle this situation, the window's end sequence number (win_limit)
    has to be compared with the last sent sequence number. If it is before
    the last sent sequence number, then more acks
    are needed before the transmission can be started again.
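
    The underflow and the guard can be sketched side by side. The function
    and parameter names below are hypothetical, and the wrap-aware
    comparison helper is an assumption about how "before" is evaluated.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* True if sequence number a lies before b, allowing for u32 wrap-around. */
static bool model_win_seq_before(uint32_t a, uint32_t b)
{
	return (uint32_t)(a - b) > UINT32_C(0x80000000);
}

/* Old behaviour: plain unsigned subtraction. */
bool model_tp_avail_old(uint32_t last_acked, uint32_t cwnd,
			uint32_t last_sent, size_t payload_len)
{
	uint32_t win_limit = last_acked + cwnd;
	uint32_t win_left = win_limit - last_sent;	/* may wrap */

	return win_left >= payload_len;
}

/* Guarded behaviour: a window that ends before last_sent counts as full. */
bool model_tp_avail_fixed(uint32_t last_acked, uint32_t cwnd,
			  uint32_t last_sent, size_t payload_len)
{
	uint32_t win_limit = last_acked + cwnd;

	if (model_win_seq_before(win_limit, last_sent))
		return false;	/* more acks are needed first */

	return (uint32_t)(win_limit - last_sent) >= payload_len;
}
```

    With last_acked = 1000, cwnd = 500 and last_sent = 2000, the old form
    computes a window of 4294966796 bytes and keeps sending, while the
    guarded form reports the window as full.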
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: fix fast recovery precondition

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:04 2026 +0200

    batman-adv: tp_meter: fix fast recovery precondition
    
    commit 2b0d08f08ed3b2174f05c43089ec65f3543a025b upstream.
    
    The fast recovery precondition checks if the recover (initialized to
    BATADV_TP_FIRST_SEQ) is bigger than the received ack. But since recover is
    only updated when this check is successful, it will never enter the fast
    recovery mode.
    
    According to RFC6582 Section 3.2 step 2, the check should actually be
    different:
    
    > When the third duplicate ACK is received, the TCP sender first
    > checks the value of recover to see if the Cumulative
    > Acknowledgment field covers more than recover
    
    The precondition must therefore check if recover is smaller than the
    received ack - basically swapping the operands of the current check.
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: handle overlapping packets

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:18 2026 +0200

    batman-adv: tp_meter: handle overlapping packets
    
    commit cbde75c38b21f022891525078622587ad557b7c1 upstream.
    
    If the size of the packets would change during the transmission, it could
    happen that some retries of packets are overlapping. In this case, precise
    comparisons of sequence numbers by the receiver would be wrong. It is then
    necessary to check if the start sequence number to the end sequence number
    ("seqno + length") would contain a new range.
    
    If this is the case then this is enough to accept this packet. In all other
    cases, the packet still has to be dropped (and not acked).
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    [ Switch to pre-splitted tp_vars structure names ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: handle seqno wrap-around for fast recovery detection

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:05 2026 +0200

    batman-adv: tp_meter: handle seqno wrap-around for fast recovery detection
    
    commit f54c85ed42a1b27a516cf2a4728f5a612b799e07 upstream.
    
    The recover variable and the last_sent sequence number are initialized on
    purpose as a really high value which will wrap-around after the first 2000
    bytes. The fast recovery precondition must therefore not use simple integer
    comparisons but use helpers which are aware of the sequence number
    wrap-arounds.
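
    A sketch of why a plain comparison fails here. MODEL_FIRST_SEQ is an
    assumed initial value chosen to wrap shortly after 2000 bytes, and the
    model_* helpers are hypothetical; they only illustrate a wrap-aware
    "before" test on u32 sequence numbers.

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumed initial value: very high, so it wraps shortly after 2000 bytes. */
#define MODEL_FIRST_SEQ (UINT32_MAX - 2000u)

/* True if a lies before b in sequence space, allowing for u32 wrap-around. */
bool model_seq_before32(uint32_t a, uint32_t b)
{
	return (uint32_t)(a - b) > UINT32_C(0x80000000);
}

/* Naive precondition: breaks once ack has wrapped past zero. */
bool model_covers_recover_naive(uint32_t recover, uint32_t ack)
{
	return recover < ack;
}

/* Wrap-aware precondition: "ack covers more than recover". */
bool model_covers_recover(uint32_t recover, uint32_t ack)
{
	return model_seq_before32(recover, ack);
}
```

    An ack that is 3000 bytes past the initial value has wrapped to 999, so
    the naive comparison treats it as older than recover while the
    wrap-aware one does not.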
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: initialize dec_cwnd explicitly

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:01 2026 +0200

    batman-adv: tp_meter: initialize dec_cwnd explicitly
    
    commit febfb1b86224489535312296ecfa3d4bf467f339 upstream.
    
    When batadv_tp_update_cwnd() is called, dec_cwnd is increased. But dec_cwnd
    is only initialized (to 0) when a duplicate Ack was received or when cwnd
    is below the ss_threshold.
    
    Just initialize dec_cwnd during the initialization to avoid any potential
    access of uninitialized data.
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: initialize dup_acks explicitly

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:00 2026 +0200

    batman-adv: tp_meter: initialize dup_acks explicitly
    
    commit b2b68b32a715e0328662801576974aa37b942b00 upstream.
    
    When an ack with a sequence number equal to the last_acked is received, the
    dup_acks counter is increased to decide whether fast retransmit should be
    performed. Only when the sequence numbers are not equal is dup_acks
    set to the initial value (0).
    
    But if the initial packet would have the sequence number
    BATADV_TP_FIRST_SEQ, dup_acks would not be initialized and atomic_inc would
    operate on an undefined starting value. It is therefore required to have it
    explicitly initialized during the start of the sender session.
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: initialize last_recv_time during init

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:09 2026 +0200

    batman-adv: tp_meter: initialize last_recv_time during init
    
    commit 811cb00fa8cdc3f0a7f6eefc000a6888367c8c8f upstream.
    
    The last_recv_time is the most important indicator for a receiver session
    to figure out whether a session timed out or not. But this information was
    only initialized after the session was added to the tp_receiver_list and
    after the timer was started.
    
    In the worst case, the timer (function) could have tried to access this
    information before the actual initialization was reached. Like the rest of the
    variables of the tp_meter receiver session, this field has to be filled out
    before any other (parallel running) context has the chance to access it.
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    [ Context ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: keep unacked list in ascending ordered [+ + +]

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:10:59 2026 +0200

    batman-adv: tp_meter: keep unacked list in ascending ordered
    
    commit 5aa8651527ea0b610e7a09fb3b8204c1398b9525 upstream.
    
    When batadv_tp_handle_out_of_order inserts a new entry in the list of
    unacked (out of order) packets, it searches from the entry with the newest
    sequence number towards oldest sequence number. If an entry is found which
    is older than the new entry, the new entry has to be added after the
    found one to keep the ascending order.
    
    For this operation, however, list_add_tail() was used, and this function
    adds an entry _before_ another one. As a result, the list would contain a
    lot of swapped sequence numbers. The consumer of this list
    (batadv_tp_ack_unordered()) would then fail to correctly ack packets.
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
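
Illustration (not part of the upstream patch): a minimal userspace sketch of
the insertion rule described above. The list type and the names add_after(),
add_before(), insert_sorted() and is_ascending() are hypothetical stand-ins;
add_after() mirrors the semantics of the kernel's list_add() and add_before()
those of list_add_tail(). Sequence number wrap-around is ignored here.

```c
#include <stddef.h>

/* Minimal circular doubly linked list, modelled on the kernel's list_head. */
struct node {
	struct node *prev, *next;
	unsigned int seqno; /* unused for the list head */
};

void list_init(struct node *head)
{
	head->prev = head;
	head->next = head;
}

/* Link new_node between two neighbours that are currently adjacent. */
void link_between(struct node *new_node, struct node *prev, struct node *next)
{
	next->prev = new_node;
	new_node->next = next;
	new_node->prev = prev;
	prev->next = new_node;
}

/* Same semantics as list_add(): insert directly AFTER pos. */
void add_after(struct node *new_node, struct node *pos)
{
	link_between(new_node, pos, pos->next);
}

/* Same semantics as list_add_tail(): insert directly BEFORE pos. */
void add_before(struct node *new_node, struct node *pos)
{
	link_between(new_node, pos->prev, pos);
}

/*
 * Search from the newest (tail) entry towards the oldest one. Stop at the
 * first entry that is older than the new one, or at the head if there is
 * none, and insert AFTER that position to keep the ascending order.
 */
void insert_sorted(struct node *head, struct node *new_node)
{
	struct node *pos;

	for (pos = head->prev; pos != head; pos = pos->prev) {
		if (pos->seqno < new_node->seqno)
			break;
	}
	add_after(new_node, pos);
}

/* Returns 1 if the seqnos never decrease from head->next to head->prev. */
int is_ascending(const struct node *head)
{
	const struct node *pos;

	for (pos = head->next; pos != head && pos->next != head; pos = pos->next) {
		if (pos->seqno > pos->next->seqno)
			return 0;
	}
	return 1;
}
```

Replacing add_after() with add_before() inside insert_sorted() places the new
entry in front of the older one it was compared against, which is the kind of
swapped ordering the commit message describes.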

batman-adv: tp_meter: prevent parallel modifications of last_recv [+ + +]

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:17 2026 +0200

    batman-adv: tp_meter: prevent parallel modifications of last_recv
    
    commit 6dde0cfcb36e4d5b3de35b75696937478441eed4 upstream.
    
    When last_recv is updated to store the last received sequence number, it is
    assumed that nothing modifies it in parallel while:
    
    * check for outdated packets is done
    * out of order check is performed (and packets are stored in out-of-order
      queue)
    * the out-of-order queue is searched for closed gaps
    * sequence number for next ack is calculated
    
    Nothing of that was actually protected. It could therefore happen that the
    last_recv was updated multiple times in parallel and the final sequence
    number was calculated with deltas which had no connection to the sequence
    number they were added to.
    
    Lock this whole region with the same lock which was already used to protect
    the unacked (out-of-order) list.
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    [ Switch to pre-splitted tp_vars structure names ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tp_meter: restrict number of unacked list entries [+ + +]

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:15 2026 +0200

    batman-adv: tp_meter: restrict number of unacked list entries
    
    commit e7c775110e1858e5a7471a23a9c9658c0af9df89 upstream.
    
    When the unacked_list is unbounded, an attacker could send messages with
    small lengths and appropriate seqnos + gaps to force the receiver to
    allocate more and more unacked_list entries. In the end, this either causes
    an out-of-memory situation or increases the management overhead for the
    (large) list so much that significant portions of CPU cycles are wasted in
    searching through the list.
    
    When limiting the list to a specific number, it is important to still
    correctly add a new entry to the list. But if the list became larger than
    the limit, the last entry of the list (with the highest seqno) must be
    dropped to still allow the earlier seqnos to finish and therefore to
    continue the process. Otherwise, the process might get stuck with too high
    seqnos which are not handled by batadv_tp_ack_unordered().
    
    Cc: [email protected]
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    [ Switch to pre-splitted tp_vars structure names ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
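
Illustration (not part of the upstream patch): a minimal sketch of the
"insert first, then drop the highest seqno" policy described above, using a
sorted array instead of a linked list. MAX_UNACKED, struct unacked and
unacked_insert() are hypothetical names, and the limit of 4 is chosen only to
keep the example short.

```c
#include <stddef.h>

#define MAX_UNACKED 4 /* hypothetical limit, for the example only */

struct unacked {
	/* one spare slot so an entry can be inserted before trimming */
	unsigned int seqno[MAX_UNACKED + 1];
	size_t len;
};

/*
 * Insert seqno in ascending order. If the list then exceeds the limit,
 * drop the last entry (the one with the highest seqno) so that earlier
 * seqnos can still be completed.
 * Returns 1 if an entry was dropped, 0 otherwise.
 */
int unacked_insert(struct unacked *u, unsigned int seqno)
{
	size_t i = u->len;

	/* shift larger entries one slot to the right */
	while (i > 0 && u->seqno[i - 1] > seqno) {
		u->seqno[i] = u->seqno[i - 1];
		i--;
	}
	u->seqno[i] = seqno;
	u->len++;

	if (u->len > MAX_UNACKED) {
		u->len--; /* forget the entry with the highest seqno */
		return 1;
	}

	return 0;
}
```

Note that the dropped entry can be the one that was just inserted: when the
list is full and the new seqno is the highest, the list is left unchanged.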

batman-adv: tt: don't merge change entries with different VIDs [+ + +]

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:19 2026 +0200

    batman-adv: tt: don't merge change entries with different VIDs
    
    commit f08e06c2d5c3e2434e7c773f2213f4a7dce6bc1e upstream.
    
    batadv_tt_local_event() merges/cancels events for the same client which
    would conflict or be duplicates. The matching of the queued events only
    compares the MAC address - the VLAN ID stored in each event is ignored.
    
    If a MAC now appears on multiple VIDs, the two ADD change events (for
    VID 1 and VID 2) would be merged into a single event. The remote can
    therefore not calculate the correct TT table and desyncs. A full
    translation table exchange is required to recover from this state.
    
    A check of VID is therefore necessary to avoid such wrong merges/cancels.
    
    Cc: [email protected]
    Fixes: c018ad3de61a ("batman-adv: add the VLAN ID attribute to the TT entry")
    [ Context ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
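
Illustration (not part of the upstream patch): a minimal sketch of the
difference between matching queued events by MAC only and by MAC + VID.
struct tt_change, same_mac_only() and same_client() are simplified,
hypothetical stand-ins and do not reproduce the batman-adv structures.

```c
#include <stdint.h>
#include <string.h>

#define ETH_ALEN 6

/* Simplified change event: only the fields relevant for matching. */
struct tt_change {
	uint8_t addr[ETH_ALEN];
	uint16_t vid;
};

/* Matching as described for the old behaviour: the VID is ignored. */
int same_mac_only(const struct tt_change *a, const struct tt_change *b)
{
	return memcmp(a->addr, b->addr, ETH_ALEN) == 0;
}

/* Matching with the additional VID check: a client is MAC + VID. */
int same_client(const struct tt_change *a, const struct tt_change *b)
{
	return same_mac_only(a, b) && a->vid == b->vid;
}
```

With the MAC-only comparison, two ADD events for the same MAC on VID 1 and
VID 2 look like duplicates; with the combined comparison they stay separate.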

batman-adv: tt: prevent TVLV entry number overflow [+ + +]

Author: Sven Eckelmann <[email protected]>
Date:   Fri May 29 21:45:29 2026 +0200

    batman-adv: tt: prevent TVLV entry number overflow
    
    commit 99d9958fa10fb684b2a8e2c48a8d704122721420 upstream.
    
    The helpers to prepare the buffers for the local and global TT based
    replies are trying to sum up all TT entries which can be found for each
    VLAN. In theory, this sum can be too big for a u16 and therefore overflow.
    A too small buffer would then be allocated for the TVLV.
    
    The too small buffer will be handled gracefully by
    batadv_tt_tvlv_generate() and is not causing a buffer overflow - just a
    truncated reply. But this overflow shouldn't have happened in the first
    place and the too small buffer should never have been allocated when an
    overflow was detected.
    
    Cc: [email protected]
    Fixes: 7ea7b4a14275 ("batman-adv: make the TT CRC logic VLAN specific")
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
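
Illustration (not part of the upstream patch): a minimal sketch of summing
per-VLAN entry counts into a 16 bit total while detecting the overflow
instead of silently truncating. sum_entries_u16() is a hypothetical helper;
how the actual fix detects the overflow is not stated in the commit message.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Sum the per-VLAN entry counts into *total.
 * Returns 0 on success, -1 if the sum does not fit into 16 bits
 * (in which case *total is left untouched).
 */
int sum_entries_u16(const unsigned int *per_vlan, size_t n, uint16_t *total)
{
	uint32_t sum = 0;
	size_t i;

	for (i = 0; i < n; i++) {
		/* sum never exceeds UINT16_MAX, so this cannot wrap */
		if (per_vlan[i] > UINT16_MAX - sum)
			return -1;
		sum += per_vlan[i];
	}

	*total = (uint16_t)sum;
	return 0;
}
```

For example, 40000 + 30000 entries truncate to 4464 when stored in a 16 bit
variable, which would lead to a buffer sized for far fewer entries than exist.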

batman-adv: tt: track roam count per VID [+ + +]

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:20 2026 +0200

    batman-adv: tt: track roam count per VID
    
    commit 12407d5f61c2653a64f2ff4b22f3c267f8420ef1 upstream.
    
    batadv_tt_check_roam_count() is supposed to track roaming of a TT entry.
    But TT entries are for a MAC + VID. The VID was completely missed and thus
    leads to incorrect detection of ROAM counts when a client MAC exists in
    multiple VLANs.
    
    Cc: [email protected]
    Fixes: c018ad3de61a ("batman-adv: add the VLAN ID attribute to the TT entry")
    [ Context ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

batman-adv: tvlv: avoid race of cifsnotfound handler state [+ + +]

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:23 2026 +0200

    batman-adv: tvlv: avoid race of cifsnotfound handler state
    
    commit edb557b2ba38fea2c5eb710cf366c797e187218c upstream.
    
    TVLV handlers can have the flag BATADV_TVLV_HANDLER_OGM_CIFNOTFND set to
    signal that the OGM handler should be called (with NULL for data) when the
    specific TVLV container was not found in the OGM. This is used by:
    
    * DAT
    * GW
    * Multicast (OGM + Tracker)
    
    The state whether the handler was executed was stored in the struct
    batadv_tvlv_handler. But the TVLV processing is started without any lock.
    Multiple parallel contexts processing TVLVs would therefore overwrite each
    other's BATADV_TVLV_HANDLER_OGM_CALLED flag in the shared
    batadv_tvlv_handler.
    
    Drop the shared BATADV_TVLV_HANDLER_OGM_CALLED flag and instead determine,
    per TVLV buffer, whether a matching container was present by scanning the
    packet's buffer.
    
    Cc: [email protected]
    Fixes: ef26157747d4 ("batman-adv: tvlv - basic infrastructure")
    [ Context ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
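
Illustration (not part of the upstream patch): a minimal sketch of deciding
per buffer whether a container is present, without writing to any shared
state. It assumes a 4 byte container header of type (1 byte), version
(1 byte) and big-endian payload length (2 bytes); tvlv_container_present()
is a hypothetical name.

```c
#include <stddef.h>
#include <stdint.h>

#define TVLV_HDR_LEN 4 /* assumed: type, version, 16 bit big-endian length */

/*
 * Scan a buffer of concatenated containers and report whether one with the
 * given type and version exists. The function only reads its arguments, so
 * parallel callers working on different buffers cannot disturb each other.
 * Returns 1 if found, 0 otherwise (including on a truncated container).
 */
int tvlv_container_present(const uint8_t *buf, size_t buf_len,
			   uint8_t type, uint8_t version)
{
	size_t off = 0;

	while (buf_len - off >= TVLV_HDR_LEN) {
		uint8_t cont_type = buf[off];
		uint8_t cont_version = buf[off + 1];
		size_t cont_len = ((size_t)buf[off + 2] << 8) | buf[off + 3];

		off += TVLV_HDR_LEN;

		/* payload must fit into the remaining buffer */
		if (cont_len > buf_len - off)
			break;

		if (cont_type == type && cont_version == version)
			return 1;

		off += cont_len;
	}

	return 0;
}
```

A caller could use the negative result to invoke the "container not found"
handler for exactly this buffer.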

batman-adv: tvlv: enforce 2-byte alignment [+ + +]

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:22 2026 +0200

    batman-adv: tvlv: enforce 2-byte alignment
    
    commit 32a6799255525d6ea4da0f7e9e0e521ad9560a46 upstream.
    
    The fields of an aggregated OGM(v2) are accessed assuming (at least) 2-byte
    alignment, so a following OGM must start at an even offset. As the header
    length is even, an odd tvlv_len would misalign it and trigger unaligned
    accesses on strict-alignment architectures.
    
    Such a misaligned TVLV/OGM/OGMv2 is not created by a normal participant in
    the mesh. Therefore, reject such malformed packets.
    
    Cc: [email protected]
    Fixes: ef26157747d4 ("batman-adv: tvlv - basic infrastructure")
    [ Drop change for non-existing mcast handling ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
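
Illustration (not part of the upstream patch): a minimal sketch of why an odd
tvlv_len misaligns the following OGM in an aggregate. OGM_HDR_LEN is a
hypothetical even header length, and next_ogm_offset() is a hypothetical
helper.

```c
#include <stddef.h>
#include <stdint.h>

#define OGM_HDR_LEN 24 /* hypothetical even header length */

/* An even tvlv_len keeps the next OGM at an even offset. */
int tvlv_len_is_valid(uint16_t tvlv_len)
{
	return (tvlv_len & 1) == 0;
}

/*
 * Offset of the next OGM inside an aggregate that starts at an even
 * offset, or -1 if the packet must be rejected as malformed.
 */
long next_ogm_offset(size_t offset, uint16_t tvlv_len)
{
	if (!tvlv_len_is_valid(tvlv_len))
		return -1;

	return (long)(offset + OGM_HDR_LEN + tvlv_len);
}
```

Since even + even + even stays even, every accepted OGM in the aggregate
starts 2-byte aligned as long as the first one does.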

batman-adv: v: prevent OGM aggregation on disabled hardif [+ + +]

Author: Sven Eckelmann <[email protected]>
Date:   Fri Jun 26 18:11:14 2026 +0200

    batman-adv: v: prevent OGM aggregation on disabled hardif
    
    commit d11c00b95b2a3b3934007fc003dccc6fdcc061ad upstream.
    
    When an interface gets disabled, the worker is correctly disabled by
    batadv_hardif_disable_interface() -> ... -> batadv_v_ogm_iface_disable().
    In this process, the skb aggr_list is also freed.
    
    But batadv_v_ogm_send_meshif() can still queue new skbs (via
    batadv_v_ogm_queue_on_if()) to the aggr_list. This will only stop after all
    cores can no longer find the RCU protected list of hard interfaces. These
    queued skbs will never be freed or consumed by batadv_v_ogm_aggr_work.
    
    The batadv_v_ogm_iface_disable() function must block
    batadv_v_ogm_queue_on_if() to avoid leaking skbs.
    
    Cc: [email protected]
    Fixes: f89255a02f1d ("batman-adv: BATMAN_V: introduce per hard-iface OGMv2 queues")
    [ Context ]
    Signed-off-by: Sven Eckelmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

bpf: use kvfree() for replaced sysctl write buffer [+ + +]

Author: Dawei Feng <[email protected]>
Date:   Wed Jun 3 18:53:16 2026 +0800

    bpf: use kvfree() for replaced sysctl write buffer
    
    commit 4c21b5927d4364bfe7365f2700da5fea0ed0d004 upstream.
    
    proc_sys_call_handler() allocates its temporary sysctl buffer with
    kvzalloc() and passes it to __cgroup_bpf_run_filter_sysctl(). Since
    kvzalloc() may fall back to vmalloc() for large allocations, freeing
    that buffer with kfree() is wrong and can corrupt memory.
    
    Use kvfree() to safely handle both kmalloc and kvzalloc()/vmalloc
    allocations.
    
    The bug was first flagged by an experimental analysis tool we are
    developing for kernel memory-management bugs while analyzing
    v6.13-rc1. The tool is still under development and is not yet publicly
    available. Manual inspection confirms that the bug is still
    present in v7.1-rc5.
    
    Reproduced the bug based on v7.1-rc4 in a QEMU x86_64 guest booted with
    KASAN and CONFIG_FAILSLAB enabled. To exercise the replacement path, the
    test tree also included the accompanying fix for the stale ret == 1
    check in __cgroup_bpf_run_filter_sysctl(). The reproducer confines
    failslab injections to the proc_sys_call_handler() range, uses
    stacktrace-depth=32, and injects fail-nth=1 while writing 8191 bytes to
    /proc/sys/kernel/domainname from a task in the target cgroup. Under
    that setup, fail-nth=1 triggered the fault:
    
      BUG: unable to handle page fault for address: ffffeb0200024d48
      #PF: supervisor read access in kernel mode
      #PF: error_code(0x0000) - not-present page
      PGD 0 P4D 0
      Oops: Oops: 0000  SMP KASAN NOPTI
      CPU: 2 UID: 0 PID: 209 Comm: repro_proc_sys_ Not tainted 7.1.0-rc4-00686-g97625979a5d4  PREEMPT(lazy)
      Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.15.0-1 04/01/2014
      RIP: 0010:kfree+0x6e/0x510
      ...
      Call Trace:
       <TASK>
       ? __cgroup_bpf_run_filter_sysctl+0x626/0xc30
       __cgroup_bpf_run_filter_sysctl+0x74d/0xc30
       ? __pfx___cgroup_bpf_run_filter_sysctl+0x10/0x10
       ? srso_return_thunk+0x5/0x5f
       ? __kvmalloc_node_noprof+0x345/0x870
       ? proc_sys_call_handler+0x250/0x480
       ? srso_return_thunk+0x5/0x5f
       proc_sys_call_handler+0x3a2/0x480
       ? __pfx_proc_sys_call_handler+0x10/0x10
       ? srso_return_thunk+0x5/0x5f
       ? selinux_file_permission+0x39f/0x500
       ? srso_return_thunk+0x5/0x5f
       ? lock_is_held_type+0x9e/0x120
       vfs_write+0x98e/0x1000
       ...
       </TASK>
    
    With this fix applied on top of the same test setup, rerunning the
    reproducer with fail-nth=1 yields no corresponding Oops reports.
    
    Fixes: 4508943794ef ("proc: use kvzalloc for our kernel buffer")
    Cc: [email protected]
    
    Reviewed-by: Emil Tsalapatis <[email protected]>
    Reviewed-by: Jiayuan Chen <[email protected]>
    Acked-by: Yonghong Song <[email protected]>
    Signed-off-by: Zilin Guan <[email protected]>
    Signed-off-by: Dawei Feng <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Alexei Starovoitov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
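
Illustration (not part of the upstream patch): a userspace toy model of why
the release function has to match every backend an allocator may have used.
All names (model_kvmalloc(), model_kvfree(), is_arena_addr()) are
hypothetical. The model picks the backend by size, which is a simplification:
the commit message only says that kvzalloc() may fall back to vmalloc() for
large allocations.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

#define SMALL_LIMIT 4096        /* toy threshold between the two backends */
#define ARENA_SIZE (64 * 1024)  /* stands in for the vmalloc address range */

static unsigned char arena[ARENA_SIZE];
static size_t arena_used;

/* Address range check, playing the role of "is this a vmalloc address?" */
int is_arena_addr(const void *p)
{
	uintptr_t addr = (uintptr_t)p;
	uintptr_t base = (uintptr_t)arena;

	return addr >= base && addr < base + ARENA_SIZE;
}

/* Small requests come from malloc(), large ones from the arena. */
void *model_kvmalloc(size_t size)
{
	void *p;

	if (size <= SMALL_LIMIT)
		return malloc(size);

	if (size > ARENA_SIZE - arena_used)
		return NULL;

	p = arena + arena_used;
	arena_used += size;
	return p;
}

/*
 * Release a buffer from either backend. Passing an arena pointer straight
 * to free() would be the analogue of calling kfree() on vmalloc memory.
 * Returns 1 for the arena path and 0 for the malloc path.
 */
int model_kvfree(void *p)
{
	if (is_arena_addr(p))
		return 1; /* bump arena: nothing to hand back in this toy */

	free(p);
	return 0;
}
```

The caller does not need to know which backend served the request, which is
the property kvfree() provides for buffers obtained from kvzalloc().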

crypto: qat - remove unused character device and IOCTLs [+ + +]

Author: Giovanni Cabiddu <[email protected]>
Date:   Thu Jun 25 13:56:04 2026 -0400

    crypto: qat - remove unused character device and IOCTLs
    
    [ Upstream commit d237230728c567297f2f98b425d63156ab2ed17f ]
    
    The QAT driver exposes a character device (qat_adf_ctl) with IOCTLs
    for device configuration, start, stop, status query and enumeration.
    These IOCTLs are not part of any public uAPI header and have no known
    in-tree or out-of-tree users. Device lifecycle is already managed via
    sysfs.
    
    The ioctl interface also increases the attack surface and is the
    subject of a number of bug reports.
    
    Remove the character device, the IOCTL definitions, and the related
    data structures (adf_dev_status_info, adf_user_cfg_key_val,
    adf_user_cfg_section, adf_user_cfg_ctl_data). Drop the now-unused
    adf_cfg_user.h header and strip adf_ctl_drv.c down to the minimal
    module_init/module_exit hooks for workqueue, AER, and crypto/compression
    algorithm registration.
    
    Clean up leftover dead code that was only reachable from the removed
    IOCTL paths: adf_cfg_del_all(), adf_devmgr_verify_id(),
    adf_devmgr_get_num_dev(), adf_devmgr_get_dev_by_id(),
    adf_get_vf_real_id() and the unused ADF_CFG macros.
    
    Additionally, drop the entry associated with QAT IOCTLs in
    ioctl-number.rst.
    
    Cc: [email protected]
    Fixes: d8cba25d2c68 ("crypto: qat - Intel(R) QAT driver framework")
    Reported-by: Zhi Wang <[email protected]>
    Reported-by: Bin Yu <[email protected]>
    Reported-by: MingYu Wang <[email protected]>
    Closes: https://lore.kernel.org/all/[email protected]/
    Link: https://lore.kernel.org/all/[email protected]/
    Link: https://lore.kernel.org/all/[email protected]/
    Link: https://lore.kernel.org/all/[email protected]/
    Signed-off-by: Giovanni Cabiddu <[email protected]>
    Reviewed-by: Ahsan Atta <[email protected]>
    Signed-off-by: Herbert Xu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

crypto: qat - Replace kzalloc() + copy_from_user() with memdup_user() [+ + +]

Author: Thorsten Blum <[email protected]>
Date:   Thu Jun 25 13:56:02 2026 -0400

    crypto: qat - Replace kzalloc() + copy_from_user() with memdup_user()
    
    [ Upstream commit 1e26339703e2afd397037defa798682b2b93dcc0 ]
    
    Replace kzalloc() followed by copy_from_user() with memdup_user() to
    improve and simplify adf_ctl_alloc_resources(). memdup_user() returns
    either -ENOMEM or -EFAULT (instead of -EIO) if an error occurs.
    
    Remove the unnecessary device id initialization, since memdup_user()
    (like copy_from_user()) immediately overwrites it.
    
    No functional changes intended other than returning the more idiomatic
    error code -EFAULT.
    
    Signed-off-by: Thorsten Blum <[email protected]>
    Signed-off-by: Herbert Xu <[email protected]>
    Stable-dep-of: d237230728c5 ("crypto: qat - remove unused character device and IOCTLs")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

crypto: qat - Return pointer directly in adf_ctl_alloc_resources [+ + +]

Author: Herbert Xu <[email protected]>
Date:   Thu Jun 25 13:56:03 2026 -0400

    crypto: qat - Return pointer directly in adf_ctl_alloc_resources
    
    [ Upstream commit 5ce9891ea928208a915411ce8227f8c3e37e5ad9 ]
    
    Returning values through arguments is confusing and that has
    upset the compiler with the recent change to memdup_user:
    
    ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c: In function ‘adf_ctl_ioctl’:
    ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c:308:26: warning: ‘ctl_data’ may be used uninitialized [-Wmaybe-uninitialized]
      308 |                  ctl_data->device_id);
          |                          ^~
    ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c:294:39: note: ‘ctl_data’ was declared here
      294 |         struct adf_user_cfg_ctl_data *ctl_data;
          |                                       ^~~~~~~~
    In function ‘adf_ctl_ioctl_dev_stop’,
        inlined from ‘adf_ctl_ioctl’ at ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c:386:9:
    ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c:273:48: warning: ‘ctl_data’ may be used uninitialized [-Wmaybe-uninitialized]
      273 |         ret = adf_ctl_is_device_in_use(ctl_data->device_id);
          |                                        ~~~~~~~~^~~~~~~~~~~
    ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c: In function ‘adf_ctl_ioctl’:
    ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c:261:39: note: ‘ctl_data’ was declared here
      261 |         struct adf_user_cfg_ctl_data *ctl_data;
          |                                       ^~~~~~~~
    In function ‘adf_ctl_ioctl_dev_config’,
        inlined from ‘adf_ctl_ioctl’ at ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c:382:9:
    ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c:192:54: warning: ‘ctl_data’ may be used uninitialized [-Wmaybe-uninitialized]
      192 |         accel_dev = adf_devmgr_get_dev_by_id(ctl_data->device_id);
          |                                              ~~~~~~~~^~~~~~~~~~~
    ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c: In function ‘adf_ctl_ioctl’:
    ../drivers/crypto/intel/qat/qat_common/adf_ctl_drv.c:185:39: note: ‘ctl_data’ was declared here
      185 |         struct adf_user_cfg_ctl_data *ctl_data;
          |                                       ^~~~~~~~
    
    Fix this by returning the pointer directly.
    
    Signed-off-by: Herbert Xu <[email protected]>
    Reviewed-by: Thorsten Blum <[email protected]>
    Acked-by: Giovanni Cabiddu <[email protected]>
    Signed-off-by: Herbert Xu <[email protected]>
    Stable-dep-of: d237230728c5 ("crypto: qat - remove unused character device and IOCTLs")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

debugobjects,locking: Annotate debug_object_fill_pool() wait type violation [+ + +]

Author: Peter Zijlstra <[email protected]>
Date:   Mon Jun 22 11:55:49 2026 +0200

    debugobjects,locking: Annotate debug_object_fill_pool() wait type violation
    
    commit 0cce06ba859a515bd06224085d3addb870608b6d upstream.
    
    There is an explicit wait-type violation in debug_object_fill_pool()
    for PREEMPT_RT=n kernels which allows them to more easily fill the
    object pool and reduce the chance of allocation failures.
    
    Lockdep's wait-type checks are designed to check the PREEMPT_RT
    locking rules even for PREEMPT_RT=n kernels and object to this, so
    create a lockdep annotation to allow this to stand.
    
    Specifically, create a 'lock' type that overrides the inner wait-type
    while it is held -- allowing one to temporarily raise it, such that
    the violation is hidden.
    
    Reported-by: Vlastimil Babka <[email protected]>
    Reported-by: Qi Zheng <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Tested-by: Qi Zheng <[email protected]>
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: Thomas Gleixner <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

debugobjects: Allow to refill the pool before SYSTEM_SCHEDULING [+ + +]

Author: Sebastian Andrzej Siewior <[email protected]>
Date:   Mon Jun 22 11:55:53 2026 +0200

    debugobjects: Allow to refill the pool before SYSTEM_SCHEDULING
    
    commit 06e0ae988f6e3499785c407429953ade19c1096b upstream.
    
    The pool of free objects is refilled on several occasions such as object
    initialisation. On PREEMPT_RT refilling is limited to preemptible
    sections due to sleeping locks used by the memory allocator. The system
    boots with disabled interrupts so the pool can not be refilled.
    
    If too many objects are initialized and the pool gets empty then
    debugobjects disables itself.
    
    Refilling can also happen early in the boot with disabled interrupts as
    long as the scheduler is not operational. If the scheduler can not
    preempt a task then a sleeping lock can not be contended.
    
    Additionally allow refilling the pool if the scheduler is not
    operational.
    
    Signed-off-by: Sebastian Andrzej Siewior <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

debugobjects: Do not fill_pool() if pi_blocked_on [+ + +]

Author: Helen Koike <[email protected]>
Date:   Mon Jun 22 11:56:02 2026 +0200

    debugobjects: Do not fill_pool() if pi_blocked_on
    
    commit 5f41161059fd0f1bbf18c90f3180e38cc45a14eb upstream.
    
    On RT enabled kernels, fill_pool() ends up calling rtlock_lock(), which
    asserts if current::pi_blocked_on is set, because a task can obviously only
    block on one lock as otherwise the priority inheritance chain gets
    corrupted.
    
    Prevent this by expanding the conditional to take current::pi_blocked_on
    into account.
    
    Fixes: 4bedcc28469a ("debugobjects: Make them PREEMPT_RT aware")
    Reported-by: [email protected]
    Signed-off-by: Helen Koike <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=b8ca586b9fc235f0c0df
    Signed-off-by: Sasha Levin <[email protected]>

debugobjects: Dont call fill_pool() in early boot hardirq context [+ + +]

Author: Waiman Long <[email protected]>
Date:   Mon Jun 22 11:56:07 2026 +0200

    debugobjects: Dont call fill_pool() in early boot hardirq context
    
    commit 0d046ae106255cba5eb83b23f78ee93f3620247d upstream.
    
    When booting a debug PREEMPT_RT kernel on an ARM64 system, an "inconsistent
    {HARDIRQ-ON-W} -> {IN-HARDIRQ-W} usage" lockdep warning message was
    reported to the console.
    
    During early boot, interrupts are enabled before the scheduler is
    enabled. In this window (before SYSTEM_SCHEDULING is set) interrupts can
    fire and the hard interrupt context handler can attempt to fill the pool.
    
    This can lead to a deadlock when the interrupt hits a region which holds a
    lock that is required to be taken in the allocation path.
    
    Add a new can_fill_pool() helper, reorder the exception rule, and forbid
    this scenario by excluding allocations from hard interrupt context.
    
    Fixes: 06e0ae988f6e ("debugobjects: Allow to refill the pool before SYSTEM_SCHEDULING")
    Suggested-by: Sebastian Andrzej Siewior <[email protected]>
    Suggested-by: Thomas Gleixner <[email protected]>
    Signed-off-by: Waiman Long <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Reviewed-by: Sebastian Andrzej Siewior <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

debugobjects: Use LD_WAIT_CONFIG instead of LD_WAIT_SLEEP [+ + +]

Author: Sebastian Andrzej Siewior <[email protected]>
Date:   Mon Jun 22 11:55:57 2026 +0200

    debugobjects: Use LD_WAIT_CONFIG instead of LD_WAIT_SLEEP
    
    commit 37de2dbc318ee10577c1c2704de5a803e75e55a2 upstream.
    
    fill_pool_map is used to suppress nesting violations caused by acquiring
    a spinlock_t (from within the memory allocator) while holding a
    raw_spinlock_t. The used annotation is wrong.
    
    LD_WAIT_SLEEP is for always sleeping lock types such as mutex_t.
    LD_WAIT_CONFIG is for lock types which sleep on PREEMPT_RT and spin
    otherwise, such as spinlock_t.
    
    Use LD_WAIT_CONFIG as override.
    
    Signed-off-by: Sebastian Andrzej Siewior <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

dlm: prevent NPD when writing a positive value to event_done [+ + +]

Author: Thadeu Lima de Souza Cascardo <[email protected]>
Date:   Mon Feb 10 13:16:22 2025 -0600

    dlm: prevent NPD when writing a positive value to event_done
    
    commit 8e2bad543eca5c25cd02cbc63d72557934d45f13 upstream.
    
    do_uevent returns the value written to event_done. In case it is a
    positive value, new_lockspace would undo all the work, and lockspace
    would not be set. __dlm_new_lockspace, however, would treat that
    positive value as a success due to commit 8511a2728ab8 ("dlm: fix use
    count with multiple joins").
    
    Down the line, device_create_lockspace would pass that NULL lockspace to
    dlm_find_lockspace_local, leading to a NULL pointer dereference.
    
    Treating such positive values as successes prevents the problem. Given
    this has been broken for so long, this is unlikely to break userspace
    expectations.
    
    Fixes: 8511a2728ab8 ("dlm: fix use count with multiple joins")
    Signed-off-by: Thadeu Lima de Souza Cascardo <[email protected]>
    Signed-off-by: David Teigland <[email protected]>
    Signed-off-by: Nazar Kalashnikov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Documentation: ioctl-number: Extend "Include File" column width [+ + +]

Author: Bagas Sanjaya <[email protected]>
Date:   Thu Jun 25 13:56:01 2026 -0400

    Documentation: ioctl-number: Extend "Include File" column width
    
    [ Upstream commit 15afd5def819e4df2a29cef6fcfa6ae7ba167c0f ]
    
    Extend width of "Include File" column to fit full path to
    papr-physical-attestation.h in later commit.
    
    Reviewed-by: Haren Myneni <[email protected]>
    Signed-off-by: Bagas Sanjaya <[email protected]>
    Acked-by: Madhavan Srinivasan <[email protected]>
    Signed-off-by: Jonathan Corbet <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Stable-dep-of: d237230728c5 ("crypto: qat - remove unused character device and IOCTLs")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Drivers: hv: vmbus: Improve the logic of reserving fb_mmio on Gen2 VMs [+ + +]

Author: Dexuan Cui <[email protected]>
Date:   Tue Jun 16 18:46:46 2026 -0400

    Drivers: hv: vmbus: Improve the logic of reserving fb_mmio on Gen2 VMs
    
    [ Upstream commit 016a25e4b0df4d77e7c258edee4aaf982e4ee809 ]
    
    If vmbus_reserve_fb() in the kdump/kexec kernel fails to properly reserve
    the framebuffer MMIO range (which is below 4GB) due to a Gen2 VM's
    screen.lfb_base being zero [1], there is an MMIO conflict between the
    drivers hyperv-drm and pci-hyperv: when the driver pci-hyperv's
    hv_allocate_config_window() calls vmbus_allocate_mmio() to get an
    MMIO range, typically it gets a 32-bit MMIO range that overlaps with the
    framebuffer MMIO range, and later hv_pci_enter_d0() fails with an
    error message "PCI Pass-through VSP failed D0 Entry with status" since
    the host thinks that PCI devices must not use MMIO space that the
    host has assigned to the framebuffer.
    
    This is especially an issue if pci-hyperv is built-in and hyperv-drm is
    built as a module. Consequently, the kdump/kexec kernel fails to detect
    PCI devices via pci-hyperv, and may fail to mount the root file system,
    which may reside on an NVMe disk. The issue described here has existed
    for SR-IOV VF NICs since day one of the pci-hyperv driver, and has been
    worked around on x64 when possible. With the recent introduction of
    ARM64 VMs that boot from NVMe, there is no workaround, so we need a
    formal fix.
    
    On Gen2 VMs, if the screen.lfb_base is 0 in the kdump/kexec kernel [1],
    fall back to the low MMIO base, which should be equal to the framebuffer
    MMIO base [2] (the statement is true according to my testing on x64
    Windows Server 2016, and on x64 and ARM64 Windows Server 2025 and on
    Azure. I checked with the Hyper-V team and they said the statement should
    continue to be true for Gen2 VMs). In the first kernel, screen.lfb_base
    is not 0; if the user specifies a very high resolution, it's not enough
    to only reserve 8MB: let's always reserve half of the space below 4GB,
    but cap the reservation to 128MB, which is the required framebuffer size
    of the highest resolution 7680*4320 supported by Hyper-V.
    
    While at it, fix the comparison "end > VTPM_BASE_ADDRESS" by changing
    the > to >=. Here the 'end' is an inclusive end (typically, it's
    0xFFFF_FFFF for the low MMIO range).
    
    Note: vmbus_reserve_fb() now also reserves an MMIO range at the beginning
    of the low MMIO range on CVMs, which have no framebuffers (the
    'screen.lfb_base' in vmbus_reserve_fb() is 0 for CVMs), just in case the
    host might treat the beginning of the low MMIO range specially [3]. BTW,
    the OpenHCL kernel is not affected by the change, because that kernel
    boots with DeviceTree rather than ACPI (so vmbus_reserve_fb() won't run
    there), and there is no framebuffer device for that kernel.
    
    Note: normally Gen1 VMs don't have the MMIO conflict issue because the
    framebuffer MMIO range (which is hardcoded to base=4GB-128MB and
    size=64MB for Gen1 VMs by the host) is always reported via the legacy PCI
    graphics device's BAR, so the kdump/kexec kernel can reserve the 64MB
    MMIO range; however, if the VM is configured to use a very high resolution
    and the required framebuffer size exceeds 64MB (AFAIK, in practice, this
    isn't a typical configuration by users), the hyperv-drm driver may need to
    allocate an MMIO range above 4GB and change the framebuffer MMIO location
    to the allocated MMIO range -- in this case, there can still be issues [4]
    which can't be easily fixed: any possible affected Gen1 users would have
    to use a resolution whose framebuffer size is <= 64MB, or switch to Gen2
    VMs.
    
    [1] https://lore.kernel.org/all/SA1PR21MB692176C1BC53BFC9EAE5CF8EBF51A@SA1PR21MB6921.namprd21.prod.outlook.com/
    [2] https://lore.kernel.org/all/SA1PR21MB69218F955B62DFF62E3E88D2BF222@SA1PR21MB6921.namprd21.prod.outlook.com/
    [3] https://lore.kernel.org/all/SN6PR02MB415726B17D5A6027CD1717E8D4342@SN6PR02MB4157.namprd02.prod.outlook.com/
    [4] https://lore.kernel.org/all/SA1PR21MB69213486F821CA5A2C793C81BF342@SA1PR21MB6921.namprd21.prod.outlook.com/
    
    Fixes: 4daace0d8ce8 ("PCI: hv: Add paravirtual PCI front-end for Microsoft Hyper-V VMs")
    CC: [email protected]
    Reviewed-by: Michael Kelley <[email protected]>
    Tested-by: Krister Johansen <[email protected]>
    Tested-by: Matthew Ruffell <[email protected]>
    Signed-off-by: Dexuan Cui <[email protected]>
    Signed-off-by: Wei Liu <[email protected]>
    [ changed `sysfb_primary_display.screen.lfb_base/lfb_size` reads to the global `screen_info.lfb_base/lfb_size` and dropped the `if (IS_ENABLED(CONFIG_SYSFB))` wrapper, de-indenting the block. ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
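
The sizing rule in the commit above (reserve half of the space, capped at 128MB) and the inclusive-end comparison can be sketched in plain C. This is a minimal userspace illustration, not the kernel code: the helper names are hypothetical, "the space below 4GB" is read here as one inclusive MMIO range, and 4 bytes per pixel is an assumption.

```c
#include <assert.h>
#include <stdint.h>

#define MiB (1024ULL * 1024ULL)
#define SKETCH_FB_RESERVE_CAP (128 * MiB)

/* Bytes for one frame, assuming 4 bytes per pixel (an assumption). */
static uint64_t sketch_fb_bytes(uint64_t width, uint64_t height)
{
	return width * height * 4;
}

/* Half of the inclusive range [start, end], capped at 128 MiB. */
static uint64_t sketch_fb_reserve_size(uint64_t start, uint64_t end)
{
	assert(end >= start);
	uint64_t half = (end - start + 1) / 2;

	return half < SKETCH_FB_RESERVE_CAP ? half : SKETCH_FB_RESERVE_CAP;
}

/* 'end' is inclusive, so a range ending exactly at addr still reaches it. */
static int sketch_inclusive_end_reaches(uint64_t end, uint64_t addr)
{
	return end >= addr;
}
```

Under these assumptions a 7680*4320 frame needs 132710400 bytes, which fits under the 128 MiB cap, and a range whose inclusive end equals the tested address is treated as reaching it, which a strict `>` would miss.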

drm/amd/display: Bound VBIOS record-chain walk loops [+ + +]

Author: Harry Wentland <[email protected]>
Date:   Tue May 12 15:24:22 2026 -0400

    drm/amd/display: Bound VBIOS record-chain walk loops
    
    [ Upstream commit ff287df16a1a58aca78b08d1f3ee09fc44da0351 ]
    
    [Why & How]
    All record-chain walk loops in bios_parser.c and bios_parser2.c use
    for(;;) and only terminate on a 0xFF record_type sentinel or zero
    record_size. A malformed VBIOS image missing the terminator record
    causes unbounded iteration at probe time, potentially hundreds of
    thousands of iterations with record_size=1. In the final iterations
    near the BIOS image boundary, struct casts beyond the 2-byte header
    validated by GET_IMAGE can also read out of bounds.
    
    Cap all 14 record-chain walk loops to BIOS_MAX_NUM_RECORD (256)
    iterations. atombios.h defines up to 22 distinct record types
    and atomfirmware.h has 13. Assuming an average of fewer than 10
    records per type (which is reasonable since most are
    connector-based), 256 is a generous upper bound.
    
    Fixes: 4562236b3bc0 ("drm/amd/dc: Add dc display driver (v2)")
    Assisted-by: Copilot:claude-opus-4.6 Mythos
    Reviewed-by: Alex Hung <[email protected]>
    Signed-off-by: Harry Wentland <[email protected]>
    Signed-off-by: Ray Wu <[email protected]>
    Tested-by: Daniel Wheeler <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 95700a3d660287ed657d6892f7be9ffc0e294a93)
    Cc: [email protected]
    Signed-off-by: Sasha Levin <[email protected]>
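
The bounded walk described in the commit above can be sketched as follows. This is a simplified userspace model, not the bios_parser code: the function name, the `visited` counter, and the flat byte-array image are hypothetical, and each record is assumed to start with a one-byte type followed by a one-byte size.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define SKETCH_MAX_NUM_RECORD 256  /* the cap named in the commit */
#define SKETCH_RECORD_END     0xFF /* sentinel record type */

/*
 * Walk a record chain starting at 'off'. Returns the offset of the first
 * record of type 'wanted', or -1. '*visited' (optional) receives the
 * number of record headers that were read.
 */
static long sketch_find_record(const uint8_t *img, size_t len, size_t off,
			       uint8_t wanted, unsigned *visited)
{
	unsigned n = 0;
	long found = -1;

	assert(img != NULL);
	for (int i = 0; i < SKETCH_MAX_NUM_RECORD; i++) {
		/* The two-byte header must lie inside the image. */
		if (off > len || len - off < 2)
			break;
		uint8_t type = img[off];
		uint8_t size = img[off + 1];

		n++;
		if (type == SKETCH_RECORD_END || size == 0)
			break;
		if (type == wanted) {
			found = (long)off;
			break;
		}
		off += size;
	}
	if (visited)
		*visited = n;
	return found;
}
```

With an image that lacks the terminator and uses a record size of 1, an unbounded `for (;;)` would keep stepping one byte at a time; the counted loop stops after 256 headers regardless of the image contents.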

drm/v3d: Skip CSD when it has zeroed workgroups [+ + +]

Author: Maíra Canal <[email protected]>
Date:   Tue Jun 2 14:50:15 2026 -0300

    drm/v3d: Skip CSD when it has zeroed workgroups
    
    [ Upstream commit 7f93fad5ea0affc9e1505dd0f7596c0fdb496213 ]
    
    A compute shader dispatch encodes its workgroup counts in the CFG0..CFG2
    registers. Kicking off a dispatch with a zero count in any of the three
    dimensions is invalid. First, the hardware will process 0 as 65536,
    while the user-space driver exposes a maximum of 65535. Second, a
    submission with a zeroed workgroup dimension should be a no-op.
    
    These zeroed counts can reach the dispatch path through an indirect CSD
    job, whose workgroup counts are only known once the indirect buffer is
    read and may legitimately be zero, but such a scenario should only
    result in a no-op.
    
    Overwrite the indirect CSD job workgroup counts with the indirect BO
    ones, even if they are zeroed, and don't submit the job to the hardware
    when any of the workgroup counts is zero, so the job completes immediately
    instead of running the shader.
    
    Cc: [email protected]
    Fixes: d223f98f0209 ("drm/v3d: Add support for compute shader dispatch.")
    Suggested-by: Jose Maria Casanova Crespo <[email protected]>
    Reviewed-by: Iago Toral Quiroga <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Maíra Canal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
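
The submission rule from the commit above reduces to a small predicate. The sketch below is illustrative only: the struct and function names are hypothetical and do not correspond to the v3d driver's types.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Workgroup counts for the three dispatch dimensions. */
struct sketch_csd_counts {
	uint32_t x, y, z;
};

/*
 * A dispatch is handed to the hardware only when every dimension is
 * non-zero; otherwise the job is treated as a no-op and completes
 * without running the shader.
 */
static bool sketch_csd_should_submit(const struct sketch_csd_counts *wg)
{
	assert(wg != NULL);
	return wg->x != 0 && wg->y != 0 && wg->z != 0;
}
```

A caller would copy the counts read from the indirect buffer into the job first, zeros included, and only then consult the predicate.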

drm/v3d: Store the active job inside the queue's state [+ + +]

Author: Maíra Canal <[email protected]>
Date:   Tue Aug 26 11:18:59 2025 -0300

    drm/v3d: Store the active job inside the queue's state
    
    [ Upstream commit 0d3768826d38c0ac740f8b45cd13346630535f2b ]
    
    Instead of storing the queue's active job in four different variables,
    store the active job inside the queue's state. This way, it's possible
    to access all active jobs using an index based on `enum v3d_queue`.
    
    Reviewed-by: Iago Toral Quiroga <[email protected]>
    Reviewed-by: Melissa Wen <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Maíra Canal <[email protected]>
    Stable-dep-of: 7f93fad5ea0a ("drm/v3d: Skip CSD when it has zeroed workgroups")
    Signed-off-by: Sasha Levin <[email protected]>

exfat: fix potential use-after-free in exfat_find_dir_entry() [+ + +]

Author: Michael Bommarito <[email protected]>
Date:   Wed Apr 22 11:58:44 2026 -0400

    exfat: fix potential use-after-free in exfat_find_dir_entry()
    
    commit 3f5f8ee9917cc2b9076ac533492d8a200edcabb8 upstream.
    
    In exfat_find_dir_entry(), the buffer_head obtained from
    exfat_get_dentry() is released with brelse(bh) before the fall-through
    TYPE_EXTEND branch reads the directory entry through ep (which points
    into bh->b_data):
    
            brelse(bh);
            if (entry_type == TYPE_EXTEND) {
                    ...
                    len = exfat_extract_uni_name(ep, entry_uniname);
                    ...
            }
    
    After brelse() drops our reference, nothing guarantees that the
    underlying page backing bh->b_data remains valid for the subsequent
    exfat_extract_uni_name() read. This is the same pattern fixed in
    commit fc961522ddbd ("exfat: Fix potential use after free in
    exfat_load_upcase_table()").
    
    Move brelse(bh) so it runs after ep is no longer dereferenced on
    each branch.
    
    Confirmed on QEMU x86_64 with CONFIG_KASAN=y + CONFIG_DEBUG_PAGEALLOC=y
    + CONFIG_PAGE_POISONING=y on linux-next, using a crafted exFAT image
    (long filename with same-hash collisions forcing the TYPE_EXTEND path).
    With a debug-only invalidate_bdev() inserted between brelse(bh) and
    the ep read to make the stale-deref window deterministic, the
    unpatched kernel faults:
    
      BUG: KASAN: use-after-free in exfat_find_dir_entry+0x133b/0x15a0
      BUG: unable to handle page fault for address: ffff88801a5fa0c2
      Oops: 0000 [#1] SMP DEBUG_PAGEALLOC KASAN NOPTI
      RIP: 0010:exfat_find_dir_entry+0x1188/0x15a0
    
    With this patch applied, the same instrumented harness completes
    cleanly under the same sanitizer stack. I have not reproduced a
    crash on an uninstrumented kernel under ordinary reclaim; the
    instrumented A/B establishes the lifetime violation and that the
    patch closes it, not an unaided triggerability claim.
    
    Fixes: ca06197382bd ("exfat: add directory operations")
    Cc: [email protected]
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: Michael Bommarito <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ext4: add bounds check for inline data length in ext4_read_inline_page [+ + +]

Author: Yuto Ohnuki <[email protected]>
Date:   Tue Jun 30 17:35:53 2026 +0100

    ext4: add bounds check for inline data length in ext4_read_inline_page
    
    [ Upstream commit 356227096eb66e41b23caf7045e6304877322edf ]
    
    ext4_read_inline_page() does not validate that the inline data length
    fits within a page before copying data. If the inline size exceeds
    PAGE_SIZE due to filesystem corruption, this could lead to a kernel
    memory write beyond the page boundary.
    
    Add a bounds check after computing len, returning -EFSCORRUPTED if the
    value exceeds PAGE_SIZE.
    
    The upstream commit replaced a BUG_ON(len > PAGE_SIZE) in
    ext4_read_inline_folio(). In 6.1 and earlier, the function is still named
    ext4_read_inline_page() and the BUG_ON was never present, so this patch
    adds the bounds check directly.
    
    Fixes: 46c7f254543d ("ext4: add read support for inline data")
    Signed-off-by: Yuto Ohnuki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
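
The check added by the commit above follows a common pattern: validate an on-disk length before using it as a copy size. A minimal userspace sketch, with hypothetical names, a fixed 4096-byte page, and the error value written out as 117 (the Linux EUCLEAN number that the kernel's EFSCORRUPTED maps to):

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define SKETCH_PAGE_SIZE    4096
#define SKETCH_EFSCORRUPTED 117 /* EUCLEAN on Linux */

/*
 * Copy 'inline_size' bytes of inline data into a page-sized buffer and
 * zero the rest. A size larger than the page is reported as corruption
 * instead of being copied.
 */
static int sketch_read_inline(unsigned char *page,
			      const unsigned char *inline_data,
			      size_t inline_size)
{
	assert(page != NULL && inline_data != NULL);
	if (inline_size > SKETCH_PAGE_SIZE)
		return -SKETCH_EFSCORRUPTED;

	memcpy(page, inline_data, inline_size);
	memset(page + inline_size, 0, SKETCH_PAGE_SIZE - inline_size);
	return 0;
}
```

Because the check runs before `memcpy()`, an oversized length never reaches the copy and the destination page is left untouched.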

f2fs: fix to round down start offset of fallocate for pin file [+ + +]

Author: Sunmin Jeong <[email protected]>
Date:   Mon Jun 22 14:28:17 2026 +0900

    f2fs: fix to round down start offset of fallocate for pin file
    
    commit 4275b59673eb60b02eec3997816c83f1f4b909c4 upstream.
    
    Currently, the length of fallocate for pin file is section-aligned to
    keep allocated sections from being selected as victims of GC. However,
    when the start offset of fallocate is not section-aligned, the
    allocated sections can't be fully utilized. This is because a
    new section is allocated by f2fs_allocate_pinning_section() after using
    blks_per_sec blocks regardless of the start offset. As a result, several
    unexpected dirty segments may be created, including blocks assigned to
    the pinned file.
    
    To address this issue, round down the start offset of fallocate to a
    section boundary.
    
    The scenario can be reproduced as follows:
    
    chunk=$(((2<<20)+4096)) # 2MB + 4KB
    touch test
    f2fs_io pinfile set test
    f2fs_io fallocate 0 0 $chunk test
    f2fs_io fallocate 0 $chunk $chunk test
    f2fs_io fallocate 0 $((chunk*2)) $chunk test
    f2fs_io fiemap 0 $((chunk*3)) test
    
    Fiemap: offset = 0 len = 12288
        logical addr.    physical addr.   length           flags
    0   0000000000000000 000000068c600000 0000000000400000 00001088
    1   0000000000400000 000000003d400000 0000000000001000 00001088
    2   0000000000401000 00000003eb200000 0000000000200000 00001088
    3   0000000000601000 00000005e4200000 0000000000001000 00001088
    4   0000000000602000 0000000605400000 0000000000200000 00001089
    
    Cc: [email protected]
    Fixes: f5a53edcf01e ("f2fs: support aligned pinned file")
    Reviewed-by: Yunji Kang <[email protected]>
    Reviewed-by: Yeongjin Gil <[email protected]>
    Reviewed-by: Sungjong Seo <[email protected]>
    Signed-off-by: Sunmin Jeong <[email protected]>
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
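
The rounding in the commit above is ordinary round-down arithmetic. The sketch below uses a hypothetical helper name and takes the section size as a parameter; the 2 MiB value used in the usage note is an assumption chosen to match the reproducer's chunk size, not a value read from the patch.

```c
#include <assert.h>
#include <stdint.h>

/* Round 'offset' down to the nearest multiple of 'section_bytes'. */
static uint64_t sketch_round_down_to_section(uint64_t offset,
					     uint64_t section_bytes)
{
	assert(section_bytes != 0);
	return offset - (offset % section_bytes);
}
```

With a 2 MiB section, the reproducer's second call starts at 2 MiB + 4 KiB (2101248), which rounds down to 2097152, and its third call starts at 4202496, which rounds down to 4194304, so each allocation begins on a section boundary.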

f2fs: validate ACL entry sizes in f2fs_acl_from_disk() [+ + +]

Author: Zhang Cen <[email protected]>
Date:   Mon Jun 15 15:19:54 2026 +0800

    f2fs: validate ACL entry sizes in f2fs_acl_from_disk()
    
    commit c4810ada31e80cbe4011467c4f3b1e93f94134f3 upstream.
    
    f2fs_acl_count() only validates the aggregate ACL xattr length. A
    malformed ACL can still place ACL_USER or ACL_GROUP in a slot that only
    contains struct f2fs_acl_entry_short bytes, and f2fs_acl_from_disk()
    then reads entry->e_id before verifying that a full entry fits.
    
    Require a short entry before reading e_tag and e_perm, and require a
    full entry before reading e_id for ACL_USER and ACL_GROUP. Return
    -EFSCORRUPTED from these new truncated-entry checks, while keeping the
    pre-existing -EINVAL paths unchanged.
    
    Validation reproduced this kernel report:
    KASAN slab-out-of-bounds in __f2fs_get_acl+0x6fb/0x7e0
    RIP: 0033:0x7f4b835ea7aa
    The buggy address belongs to the object at ffff888114589960 which belongs
    to the cache kmalloc-8 of size 8
    The buggy address is located 0 bytes to the right of allocated 8-byte
    region [ffff888114589960, ffff888114589968)
    Read of size 4
    Call trace:
      dump_stack_lvl+0x66/0xa0 (?:?)
      print_report+0xce/0x630 (?:?)
      __f2fs_get_acl+0x6fb/0x7e0 (fs/f2fs/acl.c:169)
      srso_alias_return_thunk+0x5/0xfbef5 (?:?)
      __virt_addr_valid+0x224/0x430 (?:?)
      kasan_report+0xe0/0x110 (?:?)
      __f2fs_get_acl+0x5/0x7e0 (fs/f2fs/acl.c:169)
      __get_acl+0x281/0x380 (?:?)
      vfs_get_acl+0x10b/0x190 (?:?)
      do_get_acl+0x2a/0x410 (?:?)
      do_get_acl+0x9/0x410 (?:?)
      do_getxattr+0xe8/0x260 (?:?)
      filename_getxattr+0xd1/0x140 (?:?)
      do_getname+0x2d/0x2d0 (?:?)
      path_getxattrat+0x16c/0x200 (?:?)
      lock_release+0xc8/0x290 (?:?)
      cgroup_update_frozen+0x9d/0x320 (?:?)
      lockdep_hardirqs_on_prepare+0xea/0x1a0 (?:?)
      trace_hardirqs_on+0x1a/0x170 (?:?)
      _raw_spin_unlock_irq+0x28/0x50 (?:?)
      do_syscall_64+0x115/0x6a0 (arch/x86/entry/syscall_64.c:87)
      entry_SYSCALL_64_after_hwframe+0x77/0x7f (?:?)
    
    Cc: [email protected]
    Fixes: af48b85b8cd3 ("f2fs: add xattr and acl functionalities")
    Assisted-by: Codex:gpt-5.5
    Signed-off-by: Zhang Cen <[email protected]>
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
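
The two-step size check from the commit above can be modelled on a raw byte buffer. This is a simplified sketch rather than f2fs_acl_from_disk() itself: the names are hypothetical, it walks bytes instead of a precomputed entry count, the ACL header is assumed to be already stripped, and the layout is assumed to be a 4-byte short entry (little-endian tag and perm) that grows to 8 bytes when an id follows.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

enum {
	SKETCH_ACL_USER_OBJ = 0x01, SKETCH_ACL_USER = 0x02,
	SKETCH_ACL_GROUP_OBJ = 0x04, SKETCH_ACL_GROUP = 0x08,
	SKETCH_ACL_MASK = 0x10, SKETCH_ACL_OTHER = 0x20,
};

#define SKETCH_SHORT_ENTRY  4   /* tag + perm */
#define SKETCH_FULL_ENTRY   8   /* tag + perm + id */
#define SKETCH_EINVAL       22
#define SKETCH_EFSCORRUPTED 117 /* EUCLEAN on Linux */

static uint16_t sketch_le16(const uint8_t *p)
{
	return (uint16_t)(p[0] | (p[1] << 8));
}

/* Returns the number of entries, or a negative error code. */
static int sketch_acl_check_entries(const uint8_t *buf, size_t len)
{
	size_t off = 0;
	int count = 0;

	assert(buf != NULL || len == 0);
	while (off < len) {
		/* A short entry must fit before the tag is read. */
		if (len - off < SKETCH_SHORT_ENTRY)
			return -SKETCH_EFSCORRUPTED;

		switch (sketch_le16(buf + off)) {
		case SKETCH_ACL_USER_OBJ:
		case SKETCH_ACL_GROUP_OBJ:
		case SKETCH_ACL_MASK:
		case SKETCH_ACL_OTHER:
			off += SKETCH_SHORT_ENTRY;
			break;
		case SKETCH_ACL_USER:
		case SKETCH_ACL_GROUP:
			/* A full entry must fit before the id is read. */
			if (len - off < SKETCH_FULL_ENTRY)
				return -SKETCH_EFSCORRUPTED;
			off += SKETCH_FULL_ENTRY;
			break;
		default:
			return -SKETCH_EINVAL; /* unknown tag */
		}
		count++;
	}
	return count;
}
```

An ACL_USER tag placed in the last four bytes of the buffer is the case from the KASAN report: the aggregate length looks plausible, but the id would be read from beyond the allocation.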

f2fs: validate compress cache inode only when enabled [+ + +]

Author: Wenjie Qi <[email protected]>
Date:   Thu May 21 11:16:18 2026 +0800

    f2fs: validate compress cache inode only when enabled
    
    commit 5073c66a96a9c23c0c2533ed4ed06e42f9021208 upstream.
    
    F2FS_COMPRESS_INO() uses NM_I(sbi)->max_nid as the synthetic inode
    number for the compressed page cache inode. That inode only exists when
    the compress_cache mount option is enabled.
    
    When compress_cache is disabled, max_nid is outside the valid inode
    range. A corrupted directory entry that points to ino == max_nid should
    therefore be rejected by f2fs_check_nid_range(). However, is_meta_ino()
    currently treats F2FS_COMPRESS_INO() as a meta inode unconditionally,
    so f2fs_iget() bypasses do_read_inode() and its nid range check, and
    instantiates a fake internal inode instead.
    
    Gate the compressed cache inode case on COMPRESS_CACHE, matching
    f2fs_init_compress_inode(). With compress_cache disabled, ino ==
    max_nid now follows the normal inode path and is rejected as an
    out-of-range nid.
    
    Cc: [email protected]
    Fixes: 6ce19aff0b8c ("f2fs: compress: add compress_inode to cache compressed blocks")
    Signed-off-by: Wenjie Qi <[email protected]>
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
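
The gating described in the commit above can be pictured as one extra condition in the meta-inode test. The sketch is a userspace model with a hypothetical struct and function; the field names stand in for the node inode, the meta inode, max_nid, and the compress_cache mount option.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the superblock info consulted by the check. */
struct sketch_sbi {
	uint32_t node_ino;
	uint32_t meta_ino;
	uint32_t max_nid;     /* doubles as the compress cache inode number */
	bool compress_cache;  /* mount option state */
};

/*
 * The compress cache inode counts as an internal inode only when the
 * feature is enabled; otherwise max_nid is just an out-of-range nid.
 */
static bool sketch_is_meta_ino(const struct sketch_sbi *sbi, uint32_t ino)
{
	assert(sbi != NULL);
	return ino == sbi->node_ino || ino == sbi->meta_ino ||
	       (sbi->compress_cache && ino == sbi->max_nid);
}
```

When the predicate returns false for max_nid, the lookup takes the normal inode path, where the nid range check can reject it.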

fbdev: Fix fb_new_modelist to prevent null-ptr-deref in fb_videomode_to_var [+ + +]

Author: Ian Bridges <[email protected]>
Date:   Wed Jun 24 23:13:12 2026 -0500

    fbdev: Fix fb_new_modelist to prevent null-ptr-deref in fb_videomode_to_var
    
    commit 7f08fc10fa3d3366dc3af723970bd03d7d6d10e3 upstream.
    
    info->var, a framebuffer's current mode, is expected to have a matching
    entry in info->modelist. var_to_display() relies on this and treats a
    failed fb_match_mode() as "This should not happen". fb_set_var() keeps it
    true by adding the mode to the list on every change, and
    do_register_framebuffer() does the same at registration.
    
    store_modes() replaces the modelist from userspace. fb_new_modelist()
    validates the new modes but does not check that info->var still has a
    match. It relies on fbcon_new_modelist() to re-point consoles, but that
    only handles consoles mapped to the framebuffer. With fbcon unbound there
    are none, so info->var is left describing a mode that is no longer in the
    list.
    
    A later console takeover runs var_to_display(), where fb_match_mode()
    returns NULL and leaves fb_display[i].mode NULL. fbcon_switch() passes it
    to display_to_var(), and fb_videomode_to_var() dereferences the NULL mode.
    
    Keep the current mode in the list in fb_new_modelist(), the same way
    fb_set_var() does.
    
    Cc: [email protected]
    Assisted-by: Claude:claude-opus-4-8
    Signed-off-by: Ian Bridges <[email protected]>
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fbdev: modedb: Fix misaligned fields in the 1920x1080-60 mode [+ + +]

Author: Steffen Persvold <[email protected]>
Date:   Fri Jun 12 18:40:41 2026 +0200

    fbdev: modedb: Fix misaligned fields in the 1920x1080-60 mode
    
    commit d894c48a57d78206e4df9c90d4acfaf39394806a upstream.
    
    The 1920x1080@60 modedb entry has one too many initializers before
    its sync field: a stray "0" occupies the sync slot, which shifts the
    remaining values by one field. The entry therefore decodes as
    sync = 0, vmode = FB_SYNC_HOR_HIGH_ACT | FB_SYNC_VERT_HIGH_ACT (0x3,
    i.e. FB_VMODE_INTERLACED | FB_VMODE_DOUBLE), and flag =
    FB_VMODE_NONINTERLACED, instead of the intended sync = positive H/V,
    vmode = non-interlaced.
    
    fb_find_mode() then returns a 1920x1080 mode flagged as interlaced +
    doublescan with active-low syncs. Drivers that honour var->vmode and
    var->sync when programming display timing enable doublescan and the
    wrong sync polarity, corrupting the output.
    
    Drop the stray initializer so sync and vmode hold their intended
    values (positive H/V sync, non-interlaced), matching the adjacent
    1920x1200 entry.
    
    Fixes: c8902258b2b8 ("fbdev: modedb: Add 1920x1080 at 60 Hz video mode")
    Cc: [email protected]
    Signed-off-by: Steffen Persvold <[email protected]>
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
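
The field shift described in the commit above is a property of positional initializers, which a reduced example makes visible. The struct below keeps only the trailing fields of a video mode and is hypothetical; the timing values 44 and 5 are placeholders, while the flag values follow the numbers quoted in the commit message (0x3 for both sync flags, 0 for non-interlaced).

```c
#include <assert.h>
#include <stdint.h>

#define SKETCH_SYNC_HOR_HIGH       1
#define SKETCH_SYNC_VERT_HIGH      2
#define SKETCH_VMODE_NONINTERLACED 0
#define SKETCH_VMODE_INTERLACED    1
#define SKETCH_VMODE_DOUBLE        2

/* The sync flags and the vmode flags share the same bit values. */
static_assert((SKETCH_SYNC_HOR_HIGH | SKETCH_SYNC_VERT_HIGH) ==
	      (SKETCH_VMODE_INTERLACED | SKETCH_VMODE_DOUBLE),
	      "a shifted sync value reads as interlaced + doublescan");

/* Only the last few fields of a mode entry, in declaration order. */
struct sketch_mode_tail {
	uint32_t hsync_len, vsync_len, sync, vmode, flag;
};

/* Stray 0 in the sync slot: every later value moves one field right. */
static const struct sketch_mode_tail sketch_buggy = {
	44, 5, 0,
	SKETCH_SYNC_HOR_HIGH | SKETCH_SYNC_VERT_HIGH,
	SKETCH_VMODE_NONINTERLACED,
};

/* Stray value dropped; 'flag' is written out explicitly as 0 here. */
static const struct sketch_mode_tail sketch_fixed = {
	44, 5,
	SKETCH_SYNC_HOR_HIGH | SKETCH_SYNC_VERT_HIGH,
	SKETCH_VMODE_NONINTERLACED,
	0,
};
```

Because the compiler accepts both initializers without complaint, the only symptom is the decoded mode: the buggy entry yields sync = 0 and vmode = 0x3, while the fixed entry yields sync = 0x3 and vmode = 0.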

fpga: region: fix use-after-free in child_regions_with_firmware() [+ + +]

Author: Wentao Liang <[email protected]>
Date:   Wed Apr 8 15:45:34 2026 +0000

    fpga: region: fix use-after-free in child_regions_with_firmware()
    
    commit 54f3c5643ec523a04b6ec0e7c19eb10f5ebebdd3 upstream.
    
    Move of_node_put(child_region) after the error print to avoid accessing
    freed memory when pr_err() references child_region.
    
    Fixes: 0fa20cdfcc1f ("fpga: fpga-region: device tree control for FPGA")
    Cc: [email protected]
    Signed-off-by: Wentao Liang <[email protected]>
    [ Yilun: Fix the Fixes tag ]
    Reviewed-by: Xu Yilun <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Xu Yilun <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fuse: limit FUSE_NOTIFY_RETRIEVE to uptodate folios [+ + +]

Author: Jann Horn <[email protected]>
Date:   Tue Jun 16 21:00:23 2026 +0200

    fuse: limit FUSE_NOTIFY_RETRIEVE to uptodate folios
    
    [ Upstream commit 4e3d1b2c48ca6c55f1e9ca7f8dccc76f120f276c ]
    
    FUSE_NOTIFY_RETRIEVE must be limited to uptodate folios; !uptodate folios
    can contain uninitialized data.
    Since FUSE_NOTIFY_RETRIEVE is intended to only return data that is already
    in the page cache and not wait for data from the FUSE daemon, treat
    !uptodate folios as if they weren't present.
    
    This only has security impact on systems that don't enable automatic
    zero-initialization of all page allocations via
    CONFIG_INIT_ON_ALLOC_DEFAULT_ON or init_on_alloc=1.
    
    Cc: [email protected]
    Fixes: 2d45ba381a74 ("fuse: add retrieve request")
    Signed-off-by: Jann Horn <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Acked-by: Miklos Szeredi <[email protected]>
    Signed-off-by: Christian Brauner (Amutable) <[email protected]>
    [adjusted for stable: page instead of folio]
    Signed-off-by: Jann Horn <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

fuse: re-lock request before replacing page cache folio [+ + +]

Author: Joanne Koong <[email protected]>
Date:   Tue Jun 23 15:49:37 2026 -0400

    fuse: re-lock request before replacing page cache folio
    
    [ Upstream commit a078484921052d0badd827fcc2770b5cfc1d4120 ]
    
    fuse_try_move_folio() unlocks the request on entry but does not
    re-lock it on the success path. This means fuse_chan_abort() can end the
    request and free the fuse_io_args (e.g. fuse_readpages_end()) while the
    subsequent copy chain logic after fuse_try_move_folio() accesses the
    fuse_io_args, leading to use-after-free issues.
    
    Fix this by calling lock_request() before replace_page_cache_folio().
    This ensures the request is locked on the success path which will
    prevent the fuse_io_args from being freed while the later copying logic
    runs, and also ensures that the ap->folios[i]->mapping is never null
    since ap->folios[i] will always point to the newfolio after
    replace_page_cache_folio().
    
    Fixes: ce534fb05292 ("fuse: allow splice to move pages")
    Cc: [email protected]
    Reported-by: Lei Lu <[email protected]>
    Signed-off-by: Joanne Koong <[email protected]>
    Signed-off-by: Miklos Szeredi <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hdlc_ppp: sync per-proto timers before freeing hdlc state [+ + +]

Author: Fan Wu <[email protected]>
Date:   Wed Jun 17 02:05:18 2026 +0000

    hdlc_ppp: sync per-proto timers before freeing hdlc state
    
    commit c78a4e41ab5ead6193ad8a2dd92e8906bae659fa upstream.
    
    Each PPP control protocol (LCP/IPCP/IPV6CP) embedded in struct ppp
    registers a timer via timer_setup(). That struct ppp is the
    hdlc->state allocation, which detach_hdlc_protocol() frees with kfree()
    in both teardown paths: unregister_hdlc_device() and the re-attach inside
    attach_hdlc_protocol().
    
    The ppp proto never registered a .detach callback, so
    detach_hdlc_protocol() performs no timer synchronization before the
    kfree(). The only cancel, timer_delete(&proto->timer) in ppp_cp_event(),
    is partial (it does not wait for a running callback) and only runs on the
    ->CLOSED transition; ppp_stop()/ppp_close() do not sync either. A
    ppp_timer callback already executing (blocked on ppp->lock) survives the
    kfree and then dereferences proto->state / ppp->lock in freed memory,
    leading to a use-after-free.
    
    Fix this by adding a .detach helper that calls timer_shutdown_sync() on
    every per-proto timer. detach_hdlc_protocol() invokes proto->detach(dev)
    before kfree(hdlc->state), so timer_shutdown_sync() now runs on both
    free paths. timer_shutdown_sync() is used instead of
    timer_delete_sync() because the
    keepalive path re-arms the timer through add_timer()/mod_timer() and
    shutdown blocks any re-activation during teardown.
    
    Initialize the per-protocol timers in ppp_ioctl() when the protocol is
    attached, and remove the now-redundant timer_setup() from ppp_start(), so
    that the timers are initialized exactly once at attach time and
    ppp_timer_release() never operates on uninitialized timer_list
    structures. attach_hdlc_protocol() uses kmalloc() (not kzalloc), so
    struct ppp's protos[i].timer is uninitialized garbage until the first
    timer_setup(); without this init-at-attach, attaching the PPP protocol
    without ever bringing the device up would leave timer_shutdown_sync()
    operating on uninitialized memory in .detach. Moving the init out of
    ppp_start() (which only runs on NETDEV_UP) into the attach path makes the
    initialization unconditional and avoids initializing the same timer_list
    twice.
    
    This bug was found by static analysis.
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Cc: [email protected]
    Signed-off-by: Fan Wu <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hv: utils: handle and propagate errors in kvp_register [+ + +]

Author: Thorsten Blum <[email protected]>
Date:   Tue Jun 16 13:24:09 2026 -0400

    hv: utils: handle and propagate errors in kvp_register
    
    [ Upstream commit 3fcf923302a8f5c0dc3af3d2ca2657cb5fae4297 ]
    
    Make kvp_register() return an error code instead of silently ignoring
    failures, and propagate the error from kvp_handle_handshake() instead of
    returning success.
    
    This propagates both kzalloc_obj() and hvutil_transport_send() failures
    to kvp_handle_handshake() and thus to kvp_on_msg().
    
    Fixes: 245ba56a52a3 ("Staging: hv: Implement key/value pair (KVP)")
    Cc: [email protected]
    Signed-off-by: Thorsten Blum <[email protected]>
    Reviewed-by: Long Li <[email protected]>
    Signed-off-by: Wei Liu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

i2c: stub: Reject I2C block transfers with invalid length [+ + +]

Author: Weiming Shi <[email protected]>
Date:   Wed Apr 15 01:23:39 2026 +0800

    i2c: stub: Reject I2C block transfers with invalid length
    
    commit 6036b5067a8199ba7a2dc7b377d4b9dd276d5f9e upstream.
    
    The I2C_SMBUS_I2C_BLOCK_DATA case in stub_xfer() uses data->block[0]
    as the transfer length. The existing check only clamps it to avoid
    overrunning the chip->words[256] register array, but does not validate
    it against I2C_SMBUS_BLOCK_MAX (32), which is the limit of the union
    i2c_smbus_data.block buffer (34 bytes total). The driver is a
    development/test tool (CONFIG_I2C_STUB=m, not built by default)
    that must be loaded with a chip_addr= parameter.
    
    A local user with access to /dev/i2c-* can issue an I2C_SMBUS ioctl
    with I2C_SMBUS_I2C_BLOCK_DATA and data->block[0] > 32, causing
    stub_xfer() to read or write past the end of the union
    i2c_smbus_data.block buffer:
    
     BUG: KASAN: stack-out-of-bounds in stub_xfer (drivers/i2c/i2c-stub.c:223)
     Read of size 1 at addr ffff88800abcfd92 by task exploit/81
     Call Trace:
      <TASK>
      stub_xfer (drivers/i2c/i2c-stub.c:223)
      __i2c_smbus_xfer (drivers/i2c/i2c-core-smbus.c:593)
      i2c_smbus_xfer (drivers/i2c/i2c-core-smbus.c:536)
      i2cdev_ioctl_smbus (drivers/i2c/i2c-dev.c:391)
      i2cdev_ioctl (drivers/i2c/i2c-dev.c:478)
      __x64_sys_ioctl (fs/ioctl.c:583)
      do_syscall_64 (arch/x86/entry/syscall_64.c:94)
      entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:130)
      </TASK>
    
    The bug exists because i2c-stub implements .smbus_xfer directly,
    bypassing the I2C_SMBUS_BLOCK_MAX validation in
    i2c_smbus_xfer_emulated(). The I2C_SMBUS_BLOCK_DATA case in the same
    function correctly validates against I2C_SMBUS_BLOCK_MAX, but the
    I2C_SMBUS_I2C_BLOCK_DATA case does not.
    
    Fix by rejecting transfers with data->block[0] == 0 or
    data->block[0] > I2C_SMBUS_BLOCK_MAX with -EINVAL, consistent with
    both the I2C_SMBUS_BLOCK_DATA case in the same function and the
    I2C_SMBUS_I2C_BLOCK_DATA validation in i2c_smbus_xfer_emulated().
    
    Fixes: 4710317891e4 ("i2c-stub: Implement I2C block support")
    Reported-by: Xiang Mei <[email protected]>
    Signed-off-by: Weiming Shi <[email protected]>
    Reviewed-by: Jean Delvare <[email protected]>
    Signed-off-by: Wolfram Sang <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
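
The length validation from the commit above can be sketched against a 34-byte block buffer. This is a userspace model, not stub_xfer(): the names are hypothetical, the chip registers are simplified to 256 single bytes, and the clamp against the end of the register array is included as an assumption about the pre-existing behaviour the message mentions.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define SKETCH_BLOCK_MAX 32
#define SKETCH_EINVAL    22

/* block[0] holds the length; the buffer is 34 bytes in total. */
struct sketch_smbus_data {
	uint8_t block[SKETCH_BLOCK_MAX + 2];
};

/* Read data->block[0] bytes starting at register 'command'. */
static int sketch_i2c_block_read(struct sketch_smbus_data *data,
				 const uint8_t regs[256], uint8_t command)
{
	assert(data != NULL && regs != NULL);
	uint8_t len = data->block[0];

	/* New check: reject lengths the block buffer cannot hold. */
	if (len == 0 || len > SKETCH_BLOCK_MAX)
		return -SKETCH_EINVAL;

	/* Clamp so the read stays inside the 256 registers. */
	if (len > 256 - command)
		len = (uint8_t)(256 - command);

	for (uint8_t i = 0; i < len; i++)
		data->block[1 + i] = regs[command + i];
	data->block[0] = len;
	return 0;
}
```

Without the first check, a length of 33 or more would pass the register clamp for most `command` values and the loop would write past `block[33]`.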

ip6_vti: set netns_immutable on the fallback device. [+ + +]

Author: Eric Dumazet <[email protected]>
Date:   Mon Jun 8 15:59:18 2026 +0000

    ip6_vti: set netns_immutable on the fallback device.
    
    [ Upstream commit d289d5307762d1838aaece22c6b6fcad9e8865f9 ]
    
    john1988 and Noam Rathaus reported that vti6_init_net() does not set the
    netns_immutable flag on the per-netns fallback tunnel device (ip6_vti0).
    
    Other similar tunnel drivers (like ip6_tunnel, sit, ip6_gre, and ip_tunnel)
    correctly set this flag during their fallback device initialization to
    prevent them from being moved to another network namespace.
    
    Fixes: 61220ab34948 ("vti6: Enable namespace changing")
    Reported-by: Noam Rathaus <[email protected]>
    Signed-off-by: Eric Dumazet <[email protected]>
    Cc: Steffen Klassert <[email protected]>
    Reviewed-by: Nicolas Dichtel <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    [Salvatore Bonaccorso: Backport for version without 0c493da86374 ("net:
    rename netns_local to netns_immutable") in v6.15-rc1 and without
    05c1280a2bcf ("netdev_features: convert NETIF_F_NETNS_LOCAL to
    dev->netns_local") in v6.12-rc1 and use NETIF_F_NETNS_LOCAL device
    feature.]
    Signed-off-by: Salvatore Bonaccorso <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ipv6: account for fraggap on the paged allocation path [+ + +]

Author: Wongi Lee <[email protected]>
Date:   Tue Jun 30 11:08:14 2026 -0400

    ipv6: account for fraggap on the paged allocation path
    
    [ Upstream commit 736b380e28d0480c7bc3e022f1950f31fe53a7c5 ]
    
    In __ip6_append_data(), when the paged-allocation branch is taken
    (MSG_MORE / NETIF_F_SG / large fraglen), alloclen and pagedlen are
    computed as
    
            alloclen = fragheaderlen + transhdrlen;
            pagedlen = datalen - transhdrlen;
    
    datalen already includes fraggap (datalen = length + fraggap). When
    fraggap is non-zero, this is not the first skb and transhdrlen is zero.
    The fraggap bytes carried over from the previous skb are copied just past
    the fragment headers in the new skb's linear area. The linear area is
    therefore undersized by fraggap bytes while pagedlen is overstated by the
    same amount, and the copy writes past skb->end into the trailing
    skb_shared_info.
    
    An unprivileged user can trigger this via a UDPv6 socket using
    MSG_MORE together with MSG_SPLICE_PAGES.
    
    The bad accounting was introduced by commit 773ba4fe9104 ("ipv6:
    avoid partial copy for zc"). Before commit ce650a166335 ("udp6: Fix
    __ip6_append_data()'s handling of MSG_SPLICE_PAGES"), the negative
    copy value caused -EINVAL to be returned. That later commit allowed
    MSG_SPLICE_PAGES to proceed in this case, making the corruption
    triggerable.
    
    The non-paged branch sets alloclen to fraglen, which already accounts
    for fraggap because datalen does. Bring the paged branch in line by
    adding fraggap to alloclen and subtracting it from pagedlen.
    
    After this adjustment, copy no longer collapses to -fraggap on the
    paged path, so remove the stale comment describing that old arithmetic.
    Since a negative copy is no longer expected for a valid MSG_SPLICE_PAGES
    case, remove the MSG_SPLICE_PAGES exception from the negative copy check.
    
    Fixes: 773ba4fe9104 ("ipv6: avoid partial copy for zc")
    Signed-off-by: Jungwoo Lee <[email protected]>
    Signed-off-by: Wongi Lee <[email protected]>
    Reviewed-by: Ido Schimmel <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
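
The accounting change in the commit above is easiest to follow with numbers. The sketch below reproduces only the two assignments quoted in the message and their adjusted form; the struct and function names are hypothetical, and the sample values in the usage note are made up for illustration.

```c
#include <assert.h>

/* Split of one fragment into linear (alloclen) and paged bytes. */
struct sketch_split {
	int alloclen;
	int pagedlen;
};

/* Paged branch before the fix: fraggap is not accounted for. */
static struct sketch_split sketch_paged_split_old(int fragheaderlen,
						  int transhdrlen, int datalen)
{
	return (struct sketch_split){
		.alloclen = fragheaderlen + transhdrlen,
		.pagedlen = datalen - transhdrlen,
	};
}

/* Paged branch after the fix: fraggap moves from paged to linear. */
static struct sketch_split sketch_paged_split_new(int fragheaderlen,
						  int transhdrlen, int datalen,
						  int fraggap)
{
	assert(fraggap >= 0 && datalen >= fraggap);
	return (struct sketch_split){
		.alloclen = fragheaderlen + transhdrlen + fraggap,
		.pagedlen = datalen - transhdrlen - fraggap,
	};
}
```

Take fragheaderlen = 48, transhdrlen = 0, length = 1000 and fraggap = 8, so datalen = 1008. The old split gives alloclen = 48 and pagedlen = 1008, leaving no linear room for the 8 carried-over bytes; the new split gives alloclen = 56 and pagedlen = 1000, and the total stays at 1056 in both cases.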

irqchip/imgpdc: Fix resource leak, add missing chained handler cleanup on remove [+ + +]

Author: Qingshuang Fu <[email protected]>
Date:   Thu Jun 18 10:13:52 2026 +0800

    irqchip/imgpdc: Fix resource leak, add missing chained handler cleanup on remove
    
    commit 37738fdf2ab1e504d1c63ce5bc0aeb6452d8f057 upstream.
    
    The driver allocates domain generic chips using
    irq_alloc_domain_generic_chips() during probe and sets up chained
    handlers using irq_set_chained_handler_and_data(). However, on driver
    removal, the generic chips are not freed and the chained handlers are
    not removed.
    
    The generic chips remain on the global gc_list and may later be accessed by
    generic interrupt chip suspend, resume, or shutdown callbacks after the
    driver has been removed, potentially resulting in a use-after-free and
    kernel crash.
    
    The chained handlers that were installed in probe for peripheral and
    syswake interrupts are also left dangling, which can lead to spurious
    interrupts accessing freed memory.
    
    Fix these issues by:
    
      - Setting IRQ_DOMAIN_FLAG_DESTROY_GC flag in domain->flags, so the
        core code automatically removes generic chips when irq_domain_remove()
        is called
    
      - Clearing all chained handlers with NULL in pdc_intc_remove()
    
    Fixes: b6ef9161e43a ("irq-imgpdc: add ImgTec PDC irqchip driver")
    Signed-off-by: Qingshuang Fu <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KEYS: fix overflow in keyctl_pkey_params_get_2() [+ + +]

Author: Jarkko Sakkinen <[email protected]>
Date:   Mon Jun 1 23:11:54 2026 +0300

    KEYS: fix overflow in keyctl_pkey_params_get_2()
    
    commit cb481e59ea6cae3b7796ac1d7a22b6b24c3f3c0b upstream.
    
    The length of the internal output buffer is calculated incorrectly, which
    can result in an overflow when a too-small buffer is provided.
    
    Fix the bug by allocating the internal output buffer with the maximum
    length of the cryptographic primitive instead of the caller-provided size.
    
    Link: https://lore.kernel.org/keyrings/[email protected]/
    Cc: [email protected] # v4.20+
    Fixes: 00d60fd3b932 ("KEYS: Provide keyctls to drive the new key type ops for asymmetric keys [ver #2]")
    Reported-by: Alessandro Groppo <[email protected]>
    Tested-by: Alessandro Groppo <[email protected]>
    Signed-off-by: Jarkko Sakkinen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

keys: Pin request_key_auth payload in instantiate paths [+ + +]

Author: Shaomin Chen <[email protected]>
Date:   Wed Jun 10 13:10:05 2026 +0300

    keys: Pin request_key_auth payload in instantiate paths
    
    commit fd15b457a86939c38aa12116adabd8ff686c5e51 upstream.
    
    A: request_key()       B: KEYCTL_INSTANTIATE_IOV
    ================       =========================
    
    create auth key
    store rka in auth key
    wait for helper
                           get auth key
                           load rka from auth key
                           copy user payload
                           sleep on #PF
    
    helper completed
    detach and free rka
    destroy auth key
                           wake up
                           use rka->target_key
                           **USE-AFTER-FREE**
    
    Give request_key_auth payloads a refcount.  Take a payload reference while
    authkey->sem stabilizes the payload and revocation state.  Hold that
    reference across the instantiate and reject paths.  Drop the auth key
    owning reference from revoke and destroy.
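
    A minimal userspace sketch of the pinning pattern, assuming C11 atomics
    in place of the kernel's refcount API.  struct rka_payload and the
    rka_alloc()/rka_get()/rka_put() helpers are hypothetical stand-ins, not
    the names used by the patch.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdlib.h>

/* Hypothetical stand-in for the request_key_auth payload. */
struct rka_payload {
	atomic_int refs;
	int target_key;
};

/* Allocate a payload holding the single reference owned by the auth key. */
static struct rka_payload *rka_alloc(int target_key)
{
	struct rka_payload *rka = malloc(sizeof(*rka));

	if (!rka)
		return NULL;
	atomic_init(&rka->refs, 1);
	rka->target_key = target_key;
	return rka;
}

/* Pin the payload; the caller must already hold a valid reference. */
static struct rka_payload *rka_get(struct rka_payload *rka)
{
	int old = atomic_fetch_add(&rka->refs, 1);

	assert(old > 0);
	(void)old;
	return rka;
}

/* Drop a reference; returns 1 if this call freed the payload. */
static int rka_put(struct rka_payload *rka)
{
	int old = atomic_fetch_sub(&rka->refs, 1);

	assert(old > 0);
	if (old == 1) {
		free(rka);
		return 1;
	}
	return 0;
}
```

    In the scenario above, thread B would call rka_get() before sleeping, so
    the rka_put() performed when the auth key is destroyed leaves the payload
    alive until B drops its own reference.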
    
    [jarkko: Replaced the first two paragraphs of text with an actual
     concurrency scenario.]
    Cc: [email protected] # v5.10+
    Fixes: b5f545c880a2 ("[PATCH] keys: Permit running process to instantiate keys")
    Reported-by: Shaomin Chen <[email protected]>
    Closes: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Shaomin Chen <[email protected]>
    Signed-off-by: Jarkko Sakkinen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: fix out-of-bounds read in smb_check_perm_dacl() [+ + +]

Author: Hem Parekh <[email protected]>
Date:   Tue Jun 2 16:56:46 2026 -0700

    ksmbd: fix out-of-bounds read in smb_check_perm_dacl()
    
    commit 1ef06004ed4bd6d3ed8c840d9d1a376b66d4935b upstream.
    
    The permission-check ACE walk in smb_check_perm_dacl() validates the ACE
    header size and caps sid.num_subauth at SID_MAX_SUB_AUTHORITIES, but it
    never checks that ace->size is actually large enough to contain
    num_subauth sub-authorities before compare_sids() dereferences them.
    
    CIFS_SID_BASE_SIZE covers the SID header up to but excluding the
    sub_auth[] array, and offsetof(struct smb_ace, sid) is the ACE header,
    so the existing guards only guarantee the 8-byte SID base, i.e. zero
    sub-authorities. compare_sids() then reads ace->sid.sub_auth[i] for
    i < min(local_sid->num_subauth, ace->sid.num_subauth). The local
    comparison SIDs (sid_everyone, sid_unix_NFS_mode, and the id_to_sid()
    result) always have at least one sub-authority, and an attacker controls
    the ACE revision and authority bytes (which lie within the in-bounds SID
    base), so they can match one of those SIDs and force the sub_auth read.
    
    A crafted ACE with size == 16 and num_subauth >= 1 placed at the tail of
    the security descriptor therefore causes a heap out-of-bounds read of up
    to SID_MAX_SUB_AUTHORITIES * sizeof(__le32) bytes past the pntsd
    allocation. The security descriptor is loaded by ksmbd_vfs_get_sd_xattr()
    into a buffer sized exactly to the on-disk data (kzalloc(sd_size) in
    ndr_decode_v4_ntacl()), so the read lands past the allocation. The
    malformed descriptor can be stored verbatim via SMB2_SET_INFO (the DACL
    is not normalised before being written to the security.NTACL xattr) and
    the read fires on a subsequent SMB2_CREATE access check, making this
    reachable by an authenticated client on a share that uses ACL xattrs.
    
    Add the missing num_subauth-versus-ace_size check, mirroring the
    identical guards already present in the sibling parsers parse_dacl() and
    smb_inherit_dacl().
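
    The shape of the missing check can be sketched as follows.  The struct
    and function names are hypothetical; the 8-byte ACE header and 8-byte SID
    base follow from the sizes quoted above, and the limit of 15
    sub-authorities is an assumption about SID_MAX_SUB_AUTHORITIES.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define SKETCH_MAX_SUB_AUTH	15	/* assumed SID_MAX_SUB_AUTHORITIES */
#define SKETCH_ACE_HDR_SIZE	8	/* bytes before the SID in an ACE */
#define SKETCH_SID_BASE_SIZE	8	/* SID bytes before sub_auth[] */

/* Hypothetical view of the two ACE fields that matter for the check. */
struct ace_view {
	uint16_t size;		/* total ACE size as claimed by the ACE */
	uint8_t num_subauth;	/* number of sub-authorities in the SID */
};

/* Return 1 if the ACE is large enough to hold all claimed sub-authorities. */
static int ace_sid_fits(const struct ace_view *ace)
{
	size_t need;

	assert(ace != NULL);
	if (ace->num_subauth > SKETCH_MAX_SUB_AUTH)
		return 0;
	need = SKETCH_ACE_HDR_SIZE + SKETCH_SID_BASE_SIZE +
	       sizeof(uint32_t) * (size_t)ace->num_subauth;
	return ace->size >= need;
}
```

    A crafted ACE with size == 16 and num_subauth == 1 fails this check,
    because one sub-authority needs at least 20 bytes.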
    
    Fixes: d07b26f39246 ("ksmbd: require minimum ACE size in smb_check_perm_dacl()")
    Cc: [email protected]
    Signed-off-by: Hem Parekh <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: reject non-VALID session in compound request branch [+ + +]

Author: Gil Portnoy <[email protected]>
Date:   Thu Jun 11 22:59:19 2026 +0900

    ksmbd: reject non-VALID session in compound request branch
    
    commit 609ca17d869d04ba249e32cdcbf13c0b1c66f43c upstream.
    
    smb2_check_user_session() takes a shortcut for any operation that is not
    the first in a COMPOUND request: it reuses work->sess (the session bound by
    the first operation) and validates only the SessionId, then returns
    "valid". It never re-checks work->sess->state == SMB2_SESSION_VALID, and a
    SessionId of 0xFFFFFFFFFFFFFFFF (ULLONG_MAX, the MS-SMB2 related-operation
    value) skips even the id comparison. The standalone path
    (ksmbd_session_lookup_all() plus the SESSION_SETUP state machine) does
    enforce the VALID state; the compound branch bypasses all of it.
    
    A SESSION_SETUP carrying only an NTLM Type-1 (NtLmNegotiate) blob publishes
    a fresh SMB2_SESSION_IN_PROGRESS session whose sess->user is still NULL
    (->user is assigned later, by ntlm_authenticate()). Used as operation 1 of
    a COMPOUND with operation 2 = TREE_CONNECT (related, SessionId=ULLONG_MAX,
    \\host\IPC$), the tree-connect then runs on that IN_PROGRESS session and
    reaches ksmbd_ipc_tree_connect_request(), which dereferences
    user_name(sess->user) with sess->user == NULL (transport_ipc.c:687/701/704)
    -> remote NULL-pointer dereference and a kernel Oops that wedges the ksmbd
    worker for all clients.
    
    Reject any non-first compound operation that lands on a session which is
    not SMB2_SESSION_VALID, mirroring the validity the standalone lookup path
    enforces. SESSION_SETUP itself legitimately runs on an IN_PROGRESS session,
    but it is never carried as a non-first compound operation, so multi-leg
    authentication is unaffected by this check.
    
    Fixes: 5005bcb42191 ("ksmbd: validate session id and tree id in the compound request")
    Cc: [email protected]
    Signed-off-by: Gil Portnoy <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: nVMX: Add a helper to get highest pending from Posted Interrupt vector [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Fri Jun 19 16:31:05 2026 -0400

    KVM: nVMX: Add a helper to get highest pending from Posted Interrupt vector
    
    commit d83c36d822be44db4bad0c43bea99c8908f54117 upstream.
    
    Add a helper to retrieve the highest pending vector given a Posted
    Interrupt descriptor.  While the actual operation is straightforward, it's
    surprisingly easy to mess up, e.g. if one tries to reuse lapic.c's
    find_highest_vector(), which doesn't work with PID.PIR due to the APIC's
    IRR and ISR component registers being physically discontiguous (they're
    4-byte registers aligned at 16-byte intervals).
    
    To make PIR handling more consistent with respect to IRR and ISR handling,
    return -1 to indicate "no interrupt pending".
    
    Cc: [email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    [ Nicholas Dudar: backport to 6.1.y. 6.1.y defines struct pi_desc in
      posted_intr.h and predates the move to <asm/posted_intr.h>, so the helper
      and the <linux/find.h> include go in posted_intr.h. ]
    Signed-off-by: Nicholas Dudar <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

KVM: nVMX: Check for pending posted interrupts when looking for nested events [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Fri Jun 19 16:31:06 2026 -0400

    KVM: nVMX: Check for pending posted interrupts when looking for nested events
    
    commit 27c4fa42b11af780d49ce704f7fa67b3c2544df4 upstream.
    
    Check for pending (and notified!) posted interrupts when checking if L2
    has a pending wake event, as a fully posted/notified virtual interrupt is a
    valid wake event for HLT.
    
    Note that KVM must check vmx->nested.pi_pending to avoid prematurely
    waking L2, e.g. even if KVM sees a non-zero PID.PIR and PID.ON=1, the
    virtual interrupt won't actually be recognized until a notification IRQ is
    received by the vCPU or the vCPU does (nested) VM-Enter.
    
    Fixes: 26844fee6ade ("KVM: x86: never write to memory from kvm_vcpu_check_block()")
    Cc: [email protected]
    Cc: Maxim Levitsky <[email protected]>
    Reported-by: Jim Mattson <[email protected]>
    Closes: https://lore.kernel.org/all/[email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    [ Nicholas Dudar: backport to 6.1.y. Prerequisite for the next patch, which
      folds its check into the vmx_has_nested_events() body this patch builds.
      Applies cleanly. The for_injection path still returns preemption_timer ||
      mtf, as the previous 6.1.y body did. ]
    Signed-off-by: Nicholas Dudar <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

KVM: nVMX: Fold requested virtual interrupt check into has_nested_events() [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Fri Jun 19 16:31:07 2026 -0400

    KVM: nVMX: Fold requested virtual interrupt check into has_nested_events()
    
    commit 321ef62b0c5f6f57bb8500a2ca5986052675abbf upstream.
    
    Check for a Requested Virtual Interrupt, i.e. a virtual interrupt that is
    pending delivery, in vmx_has_nested_events() and drop the one-off
    kvm_x86_ops.guest_apic_has_interrupt() hook.
    
    In addition to dropping a superfluous hook, this fixes a bug where KVM
    would incorrectly treat virtual interrupts _for L2_ as always enabled due
    to kvm_arch_interrupt_allowed(), by way of vmx_interrupt_blocked(),
    treating IRQs as enabled if L2 is active and vmcs12 is configured to exit
    on IRQs, i.e. KVM would treat a virtual interrupt for L2 as a valid wake
    event based on L1's IRQ blocking status.
    
    Cc: [email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    [ Nicholas Dudar: backport to 6.1.y. 6.1.y predates the vmx main.c /
      x86_ops.h split, so drop .guest_apic_has_interrupt from vmx_x86_ops in
      vmx.c rather than vt_x86_ops in main.c. The function is static in vmx.c, so
      upstream's x86_ops.h prototype removal does not apply. 6.1.y keeps the
      current hwapic_isr_update signature. ]
    Signed-off-by: Nicholas Dudar <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

KVM: SVM: Fix page overflow in sev_dbg_crypt() for ENCRYPT path [+ + +]

Author: Ashutosh Desai <[email protected]>
Date:   Fri May 1 13:35:32 2026 -0700

    KVM: SVM: Fix page overflow in sev_dbg_crypt() for ENCRYPT path
    
    commit 78ee2d50185a037b3d2452a97f3dad69c3f7f389 upstream.
    
    In sev_dbg_crypt(), the per-iteration transfer length is bounded by
    the source page offset (PAGE_SIZE - s_off) but not by the destination
    page offset (PAGE_SIZE - d_off).  When d_off > s_off, the encrypt
    path (__sev_dbg_encrypt_user) performs a read-modify-write using a
    single-page intermediate buffer (dst_tpage):
    
      1. __sev_dbg_decrypt() expands the size to round_up(len + (d_off & 15), 16)
         before issuing the PSP command.  If len + (d_off & 15) > PAGE_SIZE,
         the PSP writes beyond the end of the 4096-byte dst_tpage allocation.
    
      2. The subsequent memcpy()/copy_from_user() into
         page_address(dst_tpage) + (d_off & 15) of 'len' bytes overflows
         by up to 15 bytes under the same condition.
    
    Trigger example: with s_off = 0, d_off = 1 and debug.len = PAGE_SIZE,
    the PSP is instructed to write round_up(4097, 16) = 4112 bytes to
    a 4096-byte buffer.
    
    Fix by also bounding len by (PAGE_SIZE - d_off), the same check that
    sev_send_update_data() already performs for its single-page guest
    region.
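
    The length arithmetic can be checked with a small sketch.  The helper
    names are hypothetical, and a 4096-byte page is assumed.

```c
#include <assert.h>

#define SKETCH_PAGE_SIZE 4096u

/* Round x up to the next multiple of 16. */
static unsigned int round_up16(unsigned int x)
{
	return (x + 15u) & ~15u;
}

/* Per-iteration length before the fix: bounded by the source page only. */
static unsigned int chunk_len_old(unsigned int s_off, unsigned int remaining)
{
	unsigned int len = SKETCH_PAGE_SIZE - s_off;

	assert(s_off < SKETCH_PAGE_SIZE);
	return len < remaining ? len : remaining;
}

/* Per-iteration length after the fix: also bounded by the destination page. */
static unsigned int chunk_len_new(unsigned int s_off, unsigned int d_off,
				  unsigned int remaining)
{
	unsigned int len = chunk_len_old(s_off, remaining);
	unsigned int d_room = SKETCH_PAGE_SIZE - d_off;

	assert(d_off < SKETCH_PAGE_SIZE);
	return len < d_room ? len : d_room;
}

/* Size written into the single-page intermediate buffer. */
static unsigned int tpage_write_size(unsigned int len, unsigned int d_off)
{
	return round_up16(len + (d_off & 15u));
}
```

    For s_off = 0, d_off = 1 and a 4096-byte request, the old bound yields a
    4112-byte write, while the new bound caps the chunk at 4095 bytes and the
    write at exactly 4096 bytes.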
    
     ==================================================================
     BUG: KASAN: slab-use-after-free in sev_dbg_crypt+0x993/0xd10 [kvm_amd]
     Write of size 4095 at addr ff110062293bb009 by task sev_dbg_test/228214
    
     CPU: 96 UID: 0 PID: 228214 Comm: sev_dbg_test Tainted: G     U  W           7.0.0-smp--5ce9b0c48211-dbg #156 PREEMPTLAZY
     Tainted: [U]=USER, [W]=WARN
     Hardware name: Google Astoria/astoria, BIOS 0.20250817.1-0 08/25/2025
     Call Trace:
      <TASK>
      dump_stack_lvl+0x54/0x70
      print_report+0xbc/0x260
      kasan_report+0xa2/0xd0
      kasan_check_range+0x25f/0x2c0
      __asan_memcpy+0x40/0x70
      sev_dbg_crypt+0x993/0xd10 [kvm_amd]
      sev_mem_enc_ioctl+0x33c/0x450 [kvm_amd]
      kvm_vm_ioctl+0x65d/0x6d0 [kvm]
      __se_sys_ioctl+0xb2/0x100
      do_syscall_64+0xe8/0x870
      entry_SYSCALL_64_after_hwframe+0x4b/0x53
      </TASK>
    
     The buggy address belongs to the physical page:
     page: refcount:1 mapcount:0 mapping:0000000000000000 index:0x7fe72b6a0 pfn:0x62293bb
     memcg:ff11000112827d82
     flags: 0x1400000000000000(node=1|zone=1)
     raw: 1400000000000000 0000000000000000 dead000000000122 0000000000000000
     raw: 00000007fe72b6a0 0000000000000000 00000001ffffffff ff11000112827d82
     page dumped because: kasan: bad access detected
    
     Memory state around the buggy address:
      ff110062293bbf00: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
      ff110062293bbf80: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
     >ff110062293bc000: fa fb fb fb fb fb fb fb fc fc fc fc fc fc fc fc
                        ^
      ff110062293bc080: fa fb fb fb fb fb fb fb fc fc fc fc fc fc fc fc
      ff110062293bc100: fa fb fb fb fb fb fb fb fc fc fc fc fc fc fc fc
     ==================================================================
     Disabling lock debugging due to kernel taint
    
    Fixes: 24f41fb23a39 ("KVM: SVM: Add support for SEV DEBUG_DECRYPT command")
    Fixes: 7d1594f5d94b ("KVM: SVM: Add support for SEV DEBUG_ENCRYPT command")
    Cc: [email protected]
    Signed-off-by: Ashutosh Desai <[email protected]>
    [sean: add sample KASAN splat, Fixes, and stable@]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: VMX: Make vmread_error_trampoline() uncallable from C code [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Wed Jun 17 13:51:00 2026 +0300

    KVM: VMX: Make vmread_error_trampoline() uncallable from C code
    
    [ Upstream commit 0b5e7a16a0a79a3742f0df9e45bca46f01b40e6a ]
    
    Declare vmread_error_trampoline() as an opaque symbol so that it cannot
    be called from C code, at least not without some serious fudging.  The
    trampoline always passes parameters on the stack so that the inline
    VMREAD sequence doesn't need to clobber registers.  regparm(0) was
    originally added to document the stack behavior, but it ended up being
    confusing because regparm(0) is a nop for 64-bit targets.
    
    Opportunistically wrap the trampoline and its declaration in #ifdeffery
    to make it even harder to invoke incorrectly, to document why it exists,
    and so that it's not left behind if/when CONFIG_CC_HAS_ASM_GOTO_OUTPUT
    is true for all supported toolchains.
    
    No functional change intended.
    
    Cc: Uros Bizjak <[email protected]>
    Signed-off-by: Sean Christopherson <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    (cherry picked from commit 0b5e7a16a0a79a3742f0df9e45bca46f01b40e6a)
    Signed-off-by: Hanne-Lotta Mäenpää <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

KVM: x86/mmu: Ensure hugepage is in by slot before checking max mapping level [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Fri Jun 26 13:24:37 2026 +0200

    KVM: x86/mmu: Ensure hugepage is in by slot before checking max mapping level
    
    commit ef057cbf825e03b63f6edf5980f96abf3c53089d upstream.
    
    When recovering hugepages in the shadow MMU, verify that the base gfn of
    the shadow page is actually contained within the target memslot, *before*
    querying the max mapping level given the shadow page's gfn.  Failure to
    pre-check the validity of the gfn can lead to an out-of-bounds access to
    the slot's lpage_info (which typically manifests as a host #PF because the
    lpage_info is vmalloc'd) if the guest creates a hugepage mapping (in its
    PTEs) that extends "below" the bounds of a memslot.
    
    When faulting in memory for a guest, and the size of the guest mapping is
    greater than KVM's (current) max mapping, then KVM will create a "direct"
    shadow page (direct in that there are no gPTEs to shadow, and so the target
    gfn is a direct calculation given the base gfn of the shadow page).  The
    hugepage recovery flow looks for such direct shadow pages, as forcing 4KiB
    mappings when dirty logging generates the guest > host mapping size case.
    When the 4KiB restriction is lifted, then KVM can replace the shadow page
    with a hugepage.
    
    But if KVM originally used a smaller mapping than the guest because the
    range of memory covered by the guest hugepage exceeds the bounds of a
    memslot, then KVM will link a direct shadow page with a gfn that is outside
    the bounds of the memslot being used to fault in memory.  The rmap entry
    added for the leaf mapping is correct and within bounds, but the gfn of the
    leaf SPTE's parent shadow page will be out of bounds.
    
      BUG: unable to handle page fault for address: ffffc90000806ffc
      #PF: supervisor read access in kernel mode
      #PF: error_code(0x0000) - not-present page
      PGD 100000067 P4D 100000067 PUD 1002a7067 PMD 10612f067 PTE 0
      Oops: Oops: 0000 [#1] SMP
      CPU: 13 UID: 1000 PID: 757 Comm: mmu_stress_test Not tainted 7.1.0-rc1-48ce1e26eace-x86_pir_to_irr_comments-vm #341 PREEMPT
      Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 0.0.0 02/06/2015
      RIP: 0010:kvm_mmu_max_mapping_level+0x79/0x2b0 [kvm]
      Call Trace:
       <TASK>
       kvm_mmu_recover_huge_pages+0x21b/0x320 [kvm]
       kvm_set_memslot+0x1ee/0x590 [kvm]
       kvm_set_memory_region.part.0+0x3a1/0x4d0 [kvm]
       kvm_vm_ioctl+0x9bf/0x15d0 [kvm]
       __x64_sys_ioctl+0x8a/0xd0
       do_syscall_64+0xb7/0xbb0
       entry_SYSCALL_64_after_hwframe+0x4b/0x53
      RIP: 0033:0x7f21c0f1a9bf
       </TASK>
    
    Don't bother pre-checking the bounds of the potential hugepage, i.e. don't
    check that e.g. sp->gfn + KVM_PAGES_PER_HPAGE(sp->role.level + 1) is also
    within the memslot, as the checks performed by kvm_mmu_max_mapping_level()
    are a superset of the basic bounds checks.  I.e. pre-checking the full
    range would be a dubious micro-optimization.
    
    Fixes: 9eba50f8d7fc ("KVM: x86/mmu: Consult max mapping level when zapping collapsible SPTEs")
    Cc: [email protected]
    Cc: David Matlack <[email protected]>
    Cc: James Houghton <[email protected]>
    Cc: Alexander Bulekov <[email protected]>
    Cc: Fred Griffoul <[email protected]>
    Cc: Alexander Graf <[email protected]>
    Cc: David Woodhouse <[email protected]>
    Cc: Filippo Sironi <[email protected]>
    Cc: Ivan Orlov <[email protected]>
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Paolo Bonzini <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

KVM: x86: Fix shadow paging use-after-free due to unexpected role [+ + +]

Author: Paolo Bonzini <[email protected]>
Date:   Fri Jun 26 13:24:36 2026 +0200

    KVM: x86: Fix shadow paging use-after-free due to unexpected role
    
    commit 81ccda30b4e83d8f5cc4fd50503c44e3a33abfeb upstream.
    
    Commit 0cb2af2ea66ad ("KVM: x86: Fix shadow paging use-after-free due
    to unexpected GFN") fixed a shadow paging mismatch between stored and
    computed GFNs; the bug could be triggered by changing a PDE mapping from
    outside the guest, and then deleting a memslot.  The rmap_remove()
    call would miss entries created after the PDE change because the GFN
    of the leaf SPTE does not match the GFN of the struct kvm_mmu_page.
    
    A similar hole however remains if the modified PDE points to a non-leaf
    page.  In this case the gfn can be made to match, but the role does not
    match: the original large 2MB page creates a kvm_mmu_page with direct=1,
    while the new 4KB mapping needs a kvm_mmu_page with direct=0.  However,
    kvm_mmu_get_child_sp() does not compare the role, and therefore reuses
    the page.
    
    The next step is installing a leaf (4KB) SPTE on the new path which
    records an rmap entry under the gfn resolved by the walk.  But when
    that child is zapped its parent kvm_mmu_page has direct=1 and
    kvm_mmu_page_get_gfn() computes the gfn for the 4KB page as
    sp->gfn + index instead of using sp->shadowed_translation[] (or sp->gfns[]
    in older kernels).  It therefore fails to remove the recorded entry.
    
    When the memslot is dropped the shadow page is freed but the rmap
    entry survives, as in the scenario that was already fixed.  Code that
    later walks that gfn (dirty logging, MMU notifier invalidation, and
    so on) dereferences an sptep that lies in the freed page, causing the
    use-after-free.
    
    Fixes: 2032a93d66fa ("KVM: MMU: Don't allocate gfns page for direct mmu pages")
    Reported-by: Hyunwoo Kim <[email protected]>
    Signed-off-by: Paolo Bonzini <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Linux: Linux 6.1.177 [+ + +]

Author: Greg Kroah-Hartman <[email protected]>
Date:   Sat Jul 4 13:41:33 2026 +0200

    Linux 6.1.177
    
    Link: https://lore.kernel.org/r/[email protected]
    Tested-by: Brett A C Sheffield <[email protected]>
    Tested-by: Peter Schneider <[email protected]>
    Tested-by: Francesco Dolcini <[email protected]>
    Tested-by: Miguel Ojeda <[email protected]>
    Tested-by: Ron Economos <[email protected]>
    Tested-by: Shung-Hsi Yu <[email protected]>
    Tested-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

locking/rtmutex: Skip remove_waiter() when waiter is not enqueued [+ + +]

Author: Davidlohr Bueso <[email protected]>
Date:   Wed Jun 17 08:46:07 2026 -0400

    locking/rtmutex: Skip remove_waiter() when waiter is not enqueued
    
    [ Upstream commit 40a25d59e85b3c8709ac2424d44f65610467871e ]
    
    syzbot triggered the following splat in remove_waiter() via
    FUTEX_CMP_REQUEUE_PI:
    
      KASAN: null-ptr-deref in range [0x0000000000000a88-0x0000000000000a8f]
       class_raw_spinlock_constructor
       remove_waiter+0x159/0x1200 kernel/locking/rtmutex.c:1561
       rt_mutex_start_proxy_lock+0x103/0x120
       futex_requeue+0x10e4/0x20d0
       __x64_sys_futex+0x34f/0x4d0
    
    task_blocks_on_rt_mutex() does not arm the waiter upon deadlock detection,
    leaving waiter->task NULL; 3bfdc63936dd ("rtmutex: Use waiter::task instead
    of current in remove_waiter()") made this fatal.
    
    Furthermore, rt_mutex_start_proxy_lock() should not be calling into remove_waiter()
    upon successfully grabbing the rtmutex. 1a1fb985f2e2 ("futex: Handle early deadlock
    return correctly") moved the remove_waiter() call out of __rt_mutex_start_proxy_lock()
    (where 'ret' was only ever 0 or < 0) into the wrapper. Tighten this check to
    account for try_to_take_rt_mutex().
    
    Fixes: 3bfdc63936dd ("rtmutex: Use waiter::task instead of current in remove_waiter()")
    Reported-by: [email protected]
    Signed-off-by: Davidlohr Bueso <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Cc: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]/
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mac802154: llsec: add skb_cow_data() before in-place crypto [+ + +]

Author: Doruk Tan Ozturk <[email protected]>
Date:   Tue May 26 20:37:26 2026 +0200

    mac802154: llsec: add skb_cow_data() before in-place crypto
    
    commit 84a04eb5b210643bd67aab81ff805d32f62aa865 upstream.
    
    llsec_do_encrypt_unauth(), llsec_do_encrypt_auth(),
    llsec_do_decrypt_unauth(), and llsec_do_decrypt_auth() all perform
    in-place cryptographic transformations on skb data.  They build a
    scatterlist with sg_init_one() pointing into the skb's linear data area
    and then pass the same scatterlist as both src and dst to the crypto API
    (e.g. crypto_skcipher_encrypt/decrypt, crypto_aead_encrypt/decrypt).
    
    On the RX path, __ieee802154_rx_handle_packet() clones the received skb
    before handing it to each subscriber via ieee802154_subif_frame().  The
    cloned skb shares the same underlying data buffer via reference
    counting.  When llsec_do_decrypt() subsequently modifies this shared
    buffer in place, it corrupts data that other clones -- potentially
    belonging to other sockets or subsystems -- still reference.
    
    On the TX path, similar data sharing can occur when an skb's head has
    been cloned (skb_cloned() returns true).
    
    The fix is to call skb_cow_data() before performing any in-place crypto
    operation.  skb_cow_data() ensures that the skb's data area is not
    shared: if the skb head is cloned or the data spans multiple fragments,
    it copies the data into a private buffer that can be safely modified in
    place.  This is the same pattern used by:
    
      - ESP (net/ipv4/esp4.c, net/ipv6/esp6.c)
      - MACsec (drivers/net/macsec.c)
      - WireGuard (drivers/net/wireguard/receive.c)
      - TIPC (net/tipc/crypto.c)
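
    The copy-before-modify idea can be modelled in userspace.  This is a toy
    sketch: the structs and helpers are hypothetical, the XOR stands in for
    the cipher, and the plain integer refcount is not thread-safe.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical refcounted data buffer shared between packet clones. */
struct shared_buf {
	int refs;
	size_t len;
	unsigned char data[64];
};

struct pkt {
	struct shared_buf *buf;
};

/* Clone a packet: both packets now point at the same buffer. */
static struct pkt pkt_clone(struct pkt *p)
{
	assert(p && p->buf);
	p->buf->refs++;
	return *p;
}

/* Give p a private buffer if it is shared; returns 0 on success. */
static int pkt_make_private(struct pkt *p)
{
	struct shared_buf *copy;

	assert(p && p->buf);
	if (p->buf->refs == 1)
		return 0;
	copy = malloc(sizeof(*copy));
	if (!copy)
		return -1;
	memcpy(copy, p->buf, sizeof(*copy));
	copy->refs = 1;
	p->buf->refs--;
	p->buf = copy;
	return 0;
}

/* Toy in-place transform that un-shares the buffer before writing. */
static int pkt_xor_in_place(struct pkt *p, unsigned char key)
{
	if (pkt_make_private(p))
		return -1;
	for (size_t i = 0; i < p->buf->len; i++)
		p->buf->data[i] ^= key;
	return 0;
}
```

    Transforming one clone leaves the bytes seen by the other clone
    untouched, which is the property the llsec paths were missing.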
    
    Without this guard, in-place crypto on shared skb data leads to:
      - Silent data corruption of other skb clones
      - Use-after-free when the crypto API scatterwalk writes through a
        page that has already been freed by another clone's kfree_skb()
      - Kernel crashes under concurrent 802.15.4 traffic with security
        enabled (KASAN/KMSAN reports slab-use-after-free)
    
    Found by 0sec (https://0sec.ai) using automated source analysis.
    
    Fixes: 4c14a2fb5d14 ("mac802154: add llsec decryption method")
    Fixes: 03556e4d0dbb ("mac802154: add llsec encryption method")
    Cc: [email protected]
    Reported-by: Doruk Tan Ozturk <[email protected]>
    Closes: https://lore.kernel.org/linux-wpan/[email protected]/
    Reviewed-by: Alexander Lobakin <[email protected]>
    Signed-off-by: Doruk Tan Ozturk <[email protected]>
    Link: https://lore.kernel.org/[email protected]
    Signed-off-by: Stefan Schmidt <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: vidtv: fix NULL pointer dereference in vidtv_mux_push_si [+ + +]

Author: Ruslan Valiyev <[email protected]>
Date:   Tue Mar 17 17:05:44 2026 +0000

    media: vidtv: fix NULL pointer dereference in vidtv_mux_push_si
    
    commit 7d8bf3d8f91073f4db347ed3aa6302b56107499c upstream.
    
    syzbot reported a general protection fault in
    vidtv_psi_ts_psi_write_into [1].
    
    vidtv_mux_get_pid_ctx() can return NULL, but vidtv_mux_push_si() does
    not check for this before dereferencing the returned pointer to access
    the continuity counter. This leads to a general protection fault when
    accessing a near-NULL address.
    
    The root cause is that vidtv_mux_pid_ctx_init() does not check the
    return value of vidtv_mux_create_pid_ctx_once() for PMT section PIDs.
    If the allocation fails, the PID context is never created, but init
    returns success. The subsequent vidtv_mux_push_si() call then gets
    NULL from vidtv_mux_get_pid_ctx() and crashes.
    
    Fix both the root cause (add error check in vidtv_mux_pid_ctx_init
    for PMT PIDs) and add defensive NULL checks in vidtv_mux_push_si for
    all vidtv_mux_get_pid_ctx() calls.
    
    [1]
    Oops: general protection fault, probably for non-canonical address 0xdffffc0000000000: 0000 [#1] SMP KASAN PTI
    KASAN: null-ptr-deref in range [0x0000000000000000-0x0000000000000007]
    Workqueue: events vidtv_mux_tick
    RIP: 0010:vidtv_psi_ts_psi_write_into+0x54a/0xbc0 drivers/media/test-drivers/vidtv/vidtv_psi.c:197
    Call Trace:
     <TASK>
     vidtv_psi_table_header_write_into drivers/media/test-drivers/vidtv/vidtv_psi.c:799 [inline]
     vidtv_psi_pmt_write_into+0x3b2/0xa70 drivers/media/test-drivers/vidtv/vidtv_psi.c:1231
     vidtv_mux_push_si+0x932/0xe80 drivers/media/test-drivers/vidtv/vidtv_mux.c:196
     vidtv_mux_tick+0xe9b/0x1480 drivers/media/test-drivers/vidtv/vidtv_mux.c:408
    
    Fixes: f90cf6079bf67 ("media: vidtv: add a bridge driver")
    Cc: [email protected]
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=814c351d094f4f1a1b86
    Signed-off-by: Ruslan Valiyev <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

MIPS: DEC: Prevent initial console buffer from landing in XKPHYS [+ + +]

Author: Maciej W. Rozycki <[email protected]>
Date:   Wed May 6 23:42:27 2026 +0100

    MIPS: DEC: Prevent initial console buffer from landing in XKPHYS
    
    commit 7fb13fd35110ebe95eb053faf79d018f51144d85 upstream.
    
    In 64-bit configurations calling the initial console output handler from
    a kernel thread other than the initial one will result in a situation
    where the stack has been placed in the XKPHYS 64-bit memory segment and
    consequently so has been the buffer allocated there that is used as the
    argument corresponding to the `%s' output conversion specifier for the
    firmware's printf() entry point.
    
    This 64-bit address will then be truncated by 32-bit firmware, resulting
    in an attempt to access the wrong memory location, which in turn will
    cause all kinds of unpredictable behaviour, such as a kernel crash:
    
      Console: colour dummy device 160x64
      Calibrating delay loop... 49.36 BogoMIPS (lpj=192512)
      pid_max: default: 32768 minimum: 301
      CPU 0 Unable to handle kernel paging request at virtual address 000000000203bd00, epc == ffffffffbfc08364, ra == ffffffffbfc08800
      Oops[#1]:
      CPU: 0 PID: 0 Comm: swapper Not tainted 5.18.0-rc2-00254-gfb649bda6f56-dirty #121
      $ 0   : 0000000000000000 0000000000000001 0000000000000023 ffffffff80684ba0
      $ 4   : 000000000203bd00 ffffffffbfc0f3b4 ffffffffffffffff 0000000000000073
      $ 8   : 0a303d7469000000 0000000000000000 0000000000000073 ffffffffbfc0f473
      $12   : 0000000000000002 0000000000000000 ffffffff80684c1c 0000000000000000
      $16   : 0000000000000000 ffffffff80596dc9 0000000000000000 ffffffffbfc09240
      $20   : ffffffff80684c40 ffffffffbfc0f400 000000000000002d 000000000000002b
      $24   : ffffffffffffffbf 000000000203bd00
      $28   : ffffffff805f0000 ffffffff80684b58 0000000000000030 ffffffffbfc08800
      Hi    : 0000000000000000
      Lo    : 0000000000000aa8
      epc   : ffffffffbfc08364 0xffffffffbfc08364
      ra    : ffffffffbfc08800 0xffffffffbfc08800
      Status: 140120e2        KX SX UX KERNEL EXL
      Cause : 00000008 (ExcCode 02)
      BadVA : 000000000203bd00
      PrId  : 00000430 (R4000SC)
      Modules linked in:
      Process swapper (pid: 0, threadinfo=(____ptrval____), task=(____ptrval____), tls=0000000000000000)
      Stack : 0000000000000000 0000000000000000 0000000000000000 0000004d0000004d
              80684cc0806a2a40 80596dc80000004d 8061000000000000 bfc0850c80684c38
              0000000000000000 000000000203bd00 0000000000000000 0000000000000000
              0000000000000000 00000000bfc0f3b4 0000000000000000 0000000000000000
              0000000000000000 0000000000000000 0000000000000000 0000000000000000
              0000000000000000 0000000000000000 0000000000000000 0000000000000000
              0000002500000000 0000000000000000 0000000000000000 802c1a7400000000
              0203bd0080596dc8 0203bd4d69000000 6c61632000000018 5f746567646e6172
              6c616320625f6d6f 5f736e5f6d6f7266 206361323778302b 303d74696e726320
              806a0a38806b0000 806a0a38806b0000 00000000806b0000 80683c58806b0000
              ...
      Call Trace:
    
      Code: a082ffff  03e00008  00601021 <80820000> 00001821  10400005  24840001  80820000  24630001
    
      ---[ end trace 0000000000000000 ]---
      Kernel panic - not syncing: Fatal exception in interrupt
    
      KN04 V2.1k    (PC: 0xa0026768, SP: 0x806848e8)
      >>
    
    In this case the pointer in $4 was truncated from 0x980000000203bd00 to
    0x000000000203bd00.
    
    This may happen when no final console driver has been enabled in the
    configuration, so that the initial console continues being used late
    into bootstrap, or with an upcoming change that will switch the zs
    driver to use a platform device, which in turn will make the console
    handover happen only after other kernel threads have already been
    started.
    
    Fix the issue by making the buffer static and initdata, and therefore
    placed in the CKSEG0 32-bit compatibility segment, observing that the
    console output handler is called with the console lock held, implying
    no need for this code to be reentrant.  Add an assertion to verify the
    buffer actually has been placed in a compatibility segment.
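
The truncation described above can be modelled in plain userspace C. This is
only a sketch: the `model_*` names are hypothetical and are not taken from
the patch, and the addresses are the ones shown in the oops log. It assumes
that the firmware keeps the low 32 bits of the pointer and that the CPU
sign-extends them back to 64 bits.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* What a 32-bit caller effectively sees: the low 32 bits, sign-extended. */
static uint64_t model_firmware_view(uint64_t addr)
{
    uint32_t lo = (uint32_t)addr;

    return (lo & 0x80000000u) ? (0xffffffff00000000ULL | lo) : lo;
}

/* An address is 32-bit compatible if it survives that round trip. */
static bool model_is_compat_addr(uint64_t addr)
{
    return model_firmware_view(addr) == addr;
}

/* Hypothetical stand-in for the assertion the commit message mentions. */
static uint64_t model_checked_buffer_addr(uint64_t buf_addr)
{
    assert(model_is_compat_addr(buf_addr));
    return model_firmware_view(buf_addr);
}
```

In this model an XKPHYS address such as 0x980000000203bd00 fails the check,
while a CKSEG0 address such as 0xffffffff80684ba0 passes unchanged.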
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Signed-off-by: Maciej W. Rozycki <[email protected]>
    Cc: [email protected] # v2.6.12+
    Signed-off-by: Thomas Bogendoerfer <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

misc: fastrpc: Add dma_mask to fastrpc_channel_ctx [+ + +]

Author: Abel Vesa <[email protected]>
Date:   Tue Jun 16 15:05:45 2026 -0400

    misc: fastrpc: Add dma_mask to fastrpc_channel_ctx
    
    [ Upstream commit 9bde43a0e2f469961e18d0a3496a9a74379c22bf ]
    
    dma_set_mask_and_coherent() only updates the mask that the device's
    dma_mask pointer points to. Add a dma_mask to the channel ctx and set
    the device dma_mask to point to it; otherwise dma_set_mask() will
    return an error and dma_set_coherent_mask() will be skipped too.
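
A minimal userspace model of the behaviour described above, assuming that
setting the streaming mask fails when the device has no dma_mask storage and
that the coherent mask is only set after the streaming mask succeeds. The
`model_*` types and functions are hypothetical stand-ins, not the kernel API.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

struct model_device {
    uint64_t *dma_mask;          /* pointer to mask storage, may be NULL */
    uint64_t coherent_dma_mask;
};

struct model_channel_ctx {
    struct model_device dev;
    uint64_t dma_mask;           /* storage the device pointer is aimed at */
};

static int model_set_mask(struct model_device *dev, uint64_t mask)
{
    assert(dev != NULL);
    if (!dev->dma_mask)
        return -EIO;             /* nowhere to store the mask */
    *dev->dma_mask = mask;
    return 0;
}

static int model_set_mask_and_coherent(struct model_device *dev, uint64_t mask)
{
    int rc = model_set_mask(dev, mask);

    if (rc == 0)
        dev->coherent_dma_mask = mask;  /* skipped when the first step fails */
    return rc;
}
```

Pointing `dev.dma_mask` at the context's own `dma_mask` field before calling
the setter is what lets both masks take effect in this model.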
    
    Co-developed-by: Srinivas Kandagatla <[email protected]>
    Signed-off-by: Abel Vesa <[email protected]>
    Signed-off-by: Srinivas Kandagatla <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Stable-dep-of: 5401fb4fe10f ("misc: fastrpc: Fix NULL pointer dereference in rpmsg callback")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

misc: fastrpc: Fix NULL pointer dereference in rpmsg callback [+ + +]

Author: Mukesh Ojha <[email protected]>
Date:   Tue Jun 16 15:05:46 2026 -0400

    misc: fastrpc: Fix NULL pointer dereference in rpmsg callback
    
    [ Upstream commit 5401fb4fe10fac6134c308495df18ed74aebb9c4 ]
    
    A NULL pointer dereference was observed on Hawi at boot when the DSP
    sends a glink message before fastrpc_rpmsg_probe() has completed
    initialization:
    
      Unable to handle kernel NULL pointer dereference at virtual address 0000000000000178
      pc : _raw_spin_lock_irqsave+0x34/0x8c
      lr : fastrpc_rpmsg_callback+0x3c/0xcc [fastrpc]
      ...
      Call trace:
       _raw_spin_lock_irqsave+0x34/0x8c (P)
       fastrpc_rpmsg_callback+0x3c/0xcc [fastrpc]
       qcom_glink_native_rx+0x538/0x6a4
       qcom_glink_smem_intr+0x14/0x24 [qcom_glink_smem]
    
    The faulting address 0x178 corresponds to the lock variable inside
    struct fastrpc_channel_ctx, confirming that cctx is NULL when
    fastrpc_rpmsg_callback() attempts to take the spinlock.
    
    There are two issues here. First, dev_set_drvdata() is called before
    spin_lock_init() and idr_init(), leaving a window where the callback
    can retrieve a valid cctx pointer but operate on an uninitialized
    spinlock. Second, the rpmsg channel becomes live as soon as the driver
    is bound, so fastrpc_rpmsg_callback() can fire before dev_set_drvdata()
    is called at all, resulting in dev_get_drvdata() returning NULL.
    
    Fix both issues by moving all cctx initialization ahead of
    dev_set_drvdata() so the structure is fully initialized before it
    becomes visible to the callback, and add a NULL check in
    fastrpc_rpmsg_callback() as a guard against any remaining window.
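
The ordering rule can be sketched as follows. This is a single-threaded
model with hypothetical `model_*` names; the error code returned for the
NULL case is an assumption, and real code also needs the memory ordering
that the driver core provides.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

struct model_cctx {
    bool lock_ready;             /* stands in for spin_lock_init() having run */
    int messages;
};

struct model_rpdev {
    struct model_cctx *drvdata;  /* what dev_get_drvdata() would return */
};

/* Callback: tolerate being invoked before probe has published the context. */
static int model_callback(struct model_rpdev *rpdev)
{
    struct model_cctx *cctx = rpdev->drvdata;

    if (!cctx)
        return -ENODEV;          /* assumed error code, for illustration only */
    assert(cctx->lock_ready);    /* must never see a half-initialized context */
    cctx->messages++;
    return 0;
}

/* Probe: finish all initialization first, publish the pointer last. */
static void model_probe(struct model_rpdev *rpdev, struct model_cctx *cctx)
{
    cctx->lock_ready = true;
    cctx->messages = 0;
    rpdev->drvdata = cctx;
}
```

An early callback is rejected, and a callback after probe always finds a
fully initialized context.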
    
    Fixes: f6f9279f2bf0 ("misc: fastrpc: Add Qualcomm fastrpc basic driver model")
    Cc: [email protected]
    Signed-off-by: Mukesh Ojha <[email protected]>
    Reviewed-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Srinivas Kandagatla <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/mglru: skip special VMAs in lru_gen_look_around() [+ + +]

Author: Yu Zhao <[email protected]>
Date:   Thu Jun 25 23:27:51 2026 +0200

    mm/mglru: skip special VMAs in lru_gen_look_around()
    
    [ Upstream commit c28ac3c7eb945fee6e20f47d576af68fdff1392a ]
    
    Special VMAs like VM_PFNMAP can contain anon pages from COW.  There isn't
    much profit in doing lookaround on them.  Besides, they can trigger the
    pte_special() warning in get_pte_pfn().
    
    Skip them in lru_gen_look_around().
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 018ee47f1489 ("mm: multi-gen LRU: exploit locality in rmap")
    Signed-off-by: Yu Zhao <[email protected]>
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/[email protected]/
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    [fix conflicts with variable declarations and vma pointer usage]
    Signed-off-by: Jakov Novak <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

mptcp: fix missing wakeups in edge scenarios [+ + +]

Author: Paolo Abeni <[email protected]>
Date:   Tue Jun 16 12:18:58 2026 -0400

    mptcp: fix missing wakeups in edge scenarios
    
    [ Upstream commit 9d8d28738f24b75616d6ca7a27cb4aed88520343 ]
    
    mptcp_recvmsg() can fill the MPTCP socket receive queue via
    mptcp_move_skbs(), but currently does not try to wake up any listener,
    because the same process is going to check the receive queue soon.
    
    When multiple threads are reading from the same fd, this can cause a
    stall. Add the missing wakeup.
    
    Fixes: 6771bfd9ee24 ("mptcp: update mptcp ack sequence from work queue")
    Cc: [email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Reviewed-by: Matthieu Baerts (NGI0) <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20260602-net-mptcp-misc-fixes-7-1-rc7-v2-1-856831229976@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mptcp: pm: fix extra_subflows underflow on userspace PM subflow creation [+ + +]

Author: Tao Cui <[email protected]>
Date:   Tue Jun 16 12:18:46 2026 -0400

    mptcp: pm: fix extra_subflows underflow on userspace PM subflow creation
    
    [ Upstream commit 14e9fea30b68fc75b2b3d97396a7e6adb544bd2a ]
    
    The userspace PM increments extra_subflows after __mptcp_subflow_connect()
    succeeds, but __mptcp_subflow_connect() calls mptcp_pm_close_subflow()
    on failure to roll back the pre-increment done by the kernel PM's fill_*()
    helpers. Because the userspace PM hasn't incremented yet at that point,
    this decrement is spurious and causes extra_subflows to underflow.
    
    Fix it by aligning the userspace PM with the kernel PM: increment
    extra_subflows before calling __mptcp_subflow_connect(), so the existing
    error path in subflow.c correctly rolls it back on failure. Also simplify
    the error handling by taking pm.lock only when needed for cleanup.
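
The accounting mismatch can be illustrated with a small counter model. The
names are hypothetical and the 8-bit counter width is an assumption made so
that the wrap-around is easy to see; locking is left out.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

struct model_pm {
    uint8_t extra_subflows;
};

/* Models the connect helper: on failure it rolls back one increment. */
static int model_subflow_connect(struct model_pm *pm, bool fail)
{
    assert(pm != NULL);
    if (fail) {
        pm->extra_subflows--;
        return -1;
    }
    return 0;
}

/* Old ordering: increment only after success, so a failure underflows. */
static int model_create_old(struct model_pm *pm, bool fail)
{
    int err = model_subflow_connect(pm, fail);

    if (!err)
        pm->extra_subflows++;
    return err;
}

/* Fixed ordering: increment first, so the rollback on failure is balanced. */
static int model_create_fixed(struct model_pm *pm, bool fail)
{
    pm->extra_subflows++;
    return model_subflow_connect(pm, fail);
}
```

With the old ordering a single failed connect wraps the counter from 0 to
255; with the fixed ordering it returns to 0.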
    
    Fixes: 77e4b94a3de6 ("mptcp: update userspace pm infos")
    Cc: [email protected]
    Signed-off-by: Tao Cui <[email protected]>
    Reviewed-by: Matthieu Baerts (NGI0) <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20260602-net-mptcp-misc-fixes-7-1-rc7-v2-5-856831229976@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net/sched: act_pedit: check static offsets a priori [+ + +]

Author: Pedro Tammela <[email protected]>
Date:   Thu Jun 18 11:42:48 2026 +0800

    net/sched: act_pedit: check static offsets a priori
    
    [ Upstream commit e1201bc781c28766720e78a5e099ffa568be4d74 ]
    
    Static key offsets should always be on 32 bit boundaries. Validate
    static offsets at create/update time and restrict the datapath
    validation to runtime offsets only.
    
    iproute2 already errors out if a given offset and data size cannot be
    packed to a 32 bit boundary. This change makes sure that users who
    create/update pedit instances directly via netlink also get an error,
    instead of finding out only when packets are traversing.
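
A sketch of such a configuration-time check, assuming that a key with a zero
`offmask` is a static key and that a non-zero `offmask` marks an offset that
is computed at runtime. The structure and function names are hypothetical
and only mirror the two fields the check needs.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

struct model_pedit_key {
    uint32_t off;       /* byte offset of the edit */
    uint32_t offmask;   /* non-zero: offset is adjusted at runtime */
};

/* Reject static keys whose offset is not a multiple of 4 bytes. */
static int model_check_static_offsets(const struct model_pedit_key *keys,
                                      size_t nkeys)
{
    assert(keys != NULL || nkeys == 0);
    for (size_t i = 0; i < nkeys; i++) {
        if (!keys[i].offmask && keys[i].off % 4)
            return -EINVAL;
    }
    return 0;
}
```

Keys with a runtime offset pass this check unchanged and are still validated
per packet.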
    
    Reviewed-by: Jamal Hadi Salim <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Signed-off-by: Pedro Tammela <[email protected]>
    Signed-off-by: David S. Miller <[email protected]>
    (cherry picked from commit e1201bc781c28766720e78a5e099ffa568be4d74)
    Signed-off-by: Wentao Guan <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/sched: act_pedit: free pedit keys on bail from offset check [+ + +]

Author: Pedro Tammela <[email protected]>
Date:   Thu Jun 18 11:48:48 2026 +0800

    net/sched: act_pedit: free pedit keys on bail from offset check
    
    [ Upstream commit 1b483d9f5805c7e3d628d4995e97f4311fcb82eb ]
    
    Ido Schimmel reports a memleak on a syzkaller instance:
       BUG: memory leak
       unreferenced object 0xffff88803d45e400 (size 1024):
         comm "syz-executor292", pid 563, jiffies 4295025223 (age 51.781s)
         hex dump (first 32 bytes):
           28 bd 70 00 fb db df 25 02 00 14 1f ff 02 00 02  (.p....%........
           00 32 00 00 1f 00 00 00 ac 14 14 3e 08 00 07 00  .2.........>....
         backtrace:
           [<ffffffff81bd0f2c>] kmemleak_alloc_recursive include/linux/kmemleak.h:42 [inline]
           [<ffffffff81bd0f2c>] slab_post_alloc_hook mm/slab.h:772 [inline]
           [<ffffffff81bd0f2c>] slab_alloc_node mm/slub.c:3452 [inline]
           [<ffffffff81bd0f2c>] __kmem_cache_alloc_node+0x25c/0x320 mm/slub.c:3491
           [<ffffffff81a865d9>] __do_kmalloc_node mm/slab_common.c:966 [inline]
           [<ffffffff81a865d9>] __kmalloc+0x59/0x1a0 mm/slab_common.c:980
           [<ffffffff83aa85c3>] kmalloc include/linux/slab.h:584 [inline]
           [<ffffffff83aa85c3>] tcf_pedit_init+0x793/0x1ae0 net/sched/act_pedit.c:245
           [<ffffffff83a90623>] tcf_action_init_1+0x453/0x6e0 net/sched/act_api.c:1394
           [<ffffffff83a90e58>] tcf_action_init+0x5a8/0x950 net/sched/act_api.c:1459
           [<ffffffff83a96258>] tcf_action_add+0x118/0x4e0 net/sched/act_api.c:1985
           [<ffffffff83a96997>] tc_ctl_action+0x377/0x490 net/sched/act_api.c:2044
           [<ffffffff83920a8d>] rtnetlink_rcv_msg+0x46d/0xd70 net/core/rtnetlink.c:6395
           [<ffffffff83b24305>] netlink_rcv_skb+0x185/0x490 net/netlink/af_netlink.c:2575
           [<ffffffff83901806>] rtnetlink_rcv+0x26/0x30 net/core/rtnetlink.c:6413
           [<ffffffff83b21cae>] netlink_unicast_kernel net/netlink/af_netlink.c:1339 [inline]
           [<ffffffff83b21cae>] netlink_unicast+0x5be/0x8a0 net/netlink/af_netlink.c:1365
           [<ffffffff83b2293f>] netlink_sendmsg+0x9af/0xed0 net/netlink/af_netlink.c:1942
           [<ffffffff8380c39f>] sock_sendmsg_nosec net/socket.c:724 [inline]
           [<ffffffff8380c39f>] sock_sendmsg net/socket.c:747 [inline]
           [<ffffffff8380c39f>] ____sys_sendmsg+0x3ef/0xaa0 net/socket.c:2503
           [<ffffffff838156d2>] ___sys_sendmsg+0x122/0x1c0 net/socket.c:2557
           [<ffffffff8381594f>] __sys_sendmsg+0x11f/0x200 net/socket.c:2586
           [<ffffffff83815ab0>] __do_sys_sendmsg net/socket.c:2595 [inline]
           [<ffffffff83815ab0>] __se_sys_sendmsg net/socket.c:2593 [inline]
           [<ffffffff83815ab0>] __x64_sys_sendmsg+0x80/0xc0 net/socket.c:2593
    
    The recently added static offset check did not free the key buffer when
    bailing out on error.
    
    Fixes: e1201bc781c2 ("net/sched: act_pedit: check static offsets a priori")
    Reported-by: Ido Schimmel <[email protected]>
    Signed-off-by: Pedro Tammela <[email protected]>
    Reviewed-by: Ido Schimmel <[email protected]>
    Tested-by: Ido Schimmel <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    (cherry picked from commit 1b483d9f5805c7e3d628d4995e97f4311fcb82eb)
    Signed-off-by: Wentao Guan <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/sched: act_pedit: rate limit datapath messages [+ + +]

Author: Pedro Tammela <[email protected]>
Date:   Thu Jun 18 11:42:49 2026 +0800

    net/sched: act_pedit: rate limit datapath messages
    
    [ Upstream commit e3c9673e2f6e1b3aa4bb87c570336e10f364c28a ]
    
    Unbounded info messages in the pedit datapath can flood the printk
    ring buffer quite easily depending on the action created.
    As these messages are informational, usually printing some, not all,
    is enough to bring attention to the real issue.
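
The idea of printing some messages but not all can be sketched as a simple
interval/burst limiter. This is an illustrative model with hypothetical
names and abstract time units, not the kernel's ratelimit implementation.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

struct model_ratelimit {
    long interval;   /* window length */
    int burst;       /* messages allowed per window */
    long begin;      /* start of the current window */
    int printed;     /* messages emitted in the current window */
    int missed;      /* messages suppressed so far */
};

/* Returns true if a message may be printed at time 'now'. */
static bool model_ratelimit_ok(struct model_ratelimit *rs, long now)
{
    assert(rs != NULL);
    if (now - rs->begin >= rs->interval) {
        rs->begin = now;         /* start a new window */
        rs->printed = 0;
    }
    if (rs->printed < rs->burst) {
        rs->printed++;
        return true;
    }
    rs->missed++;
    return false;
}
```

A datapath that calls this before each message emits at most `burst`
messages per window, however many packets hit the same condition.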
    
    Reviewed-by: Jamal Hadi Salim <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Signed-off-by: Pedro Tammela <[email protected]>
    Signed-off-by: David S. Miller <[email protected]>
    (cherry picked from commit e3c9673e2f6e1b3aa4bb87c570336e10f364c28a)
    Signed-off-by: Wentao Guan <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/sched: fix pedit partial COW leading to page cache corruption [+ + +]

Author: Rajat Gupta <[email protected]>
Date:   Thu Jun 18 11:42:50 2026 +0800

    net/sched: fix pedit partial COW leading to page cache corruption
    
    [ Upstream commit 899ee91156e57784090c5565e4f31bd7dbffbc5a ]
    
    tcf_pedit_act() computes the COW range for skb_ensure_writable()
    once before the key loop using tcfp_off_max_hint, but the hint does
    not account for the runtime header offset added by typed keys. This
    can leave part of the write region un-COW'd.
    
    Fix by moving skb_ensure_writable() inside the per-key loop where
    the actual write offset is known, and add overflow checking on the
    offset arithmetic. For negative offsets (e.g. Ethernet header edits
    at ingress), use skb_cow() to COW the headroom instead. Guard
    offset_valid() against INT_MIN, where negation is undefined.
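
A sketch of the two arithmetic guards mentioned above, under the assumption
that a positive offset is bounded by the packet length and a negative one by
the available headroom. The `model_*` functions are hypothetical, and
`__builtin_add_overflow()` is a GCC/Clang builtin used here in place of the
kernel's overflow helpers.

```c
#include <assert.h>
#include <limits.h>
#include <stdbool.h>
#include <stddef.h>

/* len: bytes of packet data; headroom: bytes available before the data. */
static bool model_offset_valid(unsigned int len, unsigned int headroom,
                               int offset)
{
    if (offset == INT_MIN)
        return false;            /* -INT_MIN is not representable */
    if (offset > 0 && (unsigned int)offset > len)
        return false;
    if (offset < 0 && (unsigned int)-offset > headroom)
        return false;
    return true;
}

/* Add a static and a runtime offset, reporting overflow instead of wrapping. */
static bool model_add_offsets(int a, int b, int *sum)
{
    assert(sum != NULL);
    return !__builtin_add_overflow(a, b, sum);
}
```

Only an offset that passes both checks would be handed to the step that
makes the affected bytes writable.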
    
    Fixes: 8b796475fd78 ("net/sched: act_pedit: really ensure the skb is writable")
    Reported-by: Yiming Qian <[email protected]>
    Reported-by: Keenan Dong <[email protected]>
    Reported-by: Han Guidong <[email protected]>
    Reported-by: Zhang Cen <[email protected]>
    Reviewed-by: Han Guidong <[email protected]>
    Tested-by: Han Guidong <[email protected]>
    Reviewed-by: Davide Caratti <[email protected]>
    Tested-by: Davide Caratti <[email protected]>
    Reviewed-by: Toke Høiland-Jørgensen <[email protected]>
    Tested-by: Toke Høiland-Jørgensen <[email protected]>
    Reviewed-by: Victor Nogueira <[email protected]>
    Tested-by: Victor Nogueira <[email protected]>
    Acked-by: Jamal Hadi Salim <[email protected]>
    Signed-off-by: Rajat Gupta <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    [rename include file from linux/unaligned.h to asm/unaligned.h]
    Conflicts:
            include/net/tc_act/tc_pedit.h
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Wentao Guan <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: annotate data-races around sk->sk_{data_ready,write_space} [+ + +]

Author: Eric Dumazet <[email protected]>
Date:   Wed Feb 25 13:15:47 2026 +0000

    net: annotate data-races around sk->sk_{data_ready,write_space}
    
    [ Upstream commit 2ef2b20cf4e04ac8a6ba68493f8780776ff84300 ]
    
    skmsg (and probably other layers) are changing these pointers
    while other cpus might read them concurrently.
    
    Add corresponding READ_ONCE()/WRITE_ONCE() annotations
    for UDP, TCP and AF_UNIX.
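
The annotation pattern can be sketched in userspace with volatile accesses.
The `MODEL_*` macros below are simplified stand-ins written for this
example (they rely on the GCC/Clang `__typeof__` extension) and the socket
structure is hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Force exactly one load or store of x, so the compiler cannot tear,
 * repeat, or cache the access. */
#define MODEL_READ_ONCE(x)     (*(volatile __typeof__(x) *)&(x))
#define MODEL_WRITE_ONCE(x, v) (*(volatile __typeof__(x) *)&(x) = (v))

struct model_sock {
    void (*sk_data_ready)(struct model_sock *sk);
    int default_calls;
    int skmsg_calls;
};

static void model_default_ready(struct model_sock *sk) { sk->default_calls++; }
static void model_skmsg_ready(struct model_sock *sk)   { sk->skmsg_calls++; }

/* Writer side: a layer such as skmsg swaps the callback. */
static void model_install(struct model_sock *sk,
                          void (*cb)(struct model_sock *))
{
    MODEL_WRITE_ONCE(sk->sk_data_ready, cb);
}

/* Reader side: load the pointer once, then call through the local copy. */
static void model_notify(struct model_sock *sk)
{
    void (*ready)(struct model_sock *) = MODEL_READ_ONCE(sk->sk_data_ready);

    assert(ready != NULL);
    ready(sk);
}
```

Calling through the local copy means the reader uses either the old or the
new callback, never a pointer that was re-read halfway through.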
    
    Fixes: 604326b41a6f ("bpf, sockmap: convert to generic sk_msg interface")
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/netdev/[email protected]/
    Signed-off-by: Eric Dumazet <[email protected]>
    Cc: Daniel Borkmann <[email protected]>
    Cc: John Fastabend <[email protected]>
    Cc: Jakub Sitnicki <[email protected]>
    Cc: Willem de Bruijn <[email protected]>
    Reviewed-by: Kuniyuki Iwashima <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: phonet: free phonet_device after RCU grace period [+ + +]

Author: Santosh Kalluri <[email protected]>
Date:   Wed Jun 17 10:45:04 2026 -0400

    net: phonet: free phonet_device after RCU grace period
    
    [ Upstream commit 71de0177b28da751f407581a4515cf4d762f6296 ]
    
    phonet_device_destroy() removes a phonet_device from the per-net device
    list with list_del_rcu(), but frees it immediately. RCU readers walking
    the same list can still hold a pointer to the object after it has been
    removed, leading to a slab-use-after-free.
    
    Use kfree_rcu(), matching the lifetime rule already used by
    phonet_address_del() for the same object type.
    
    Fixes: eeb74a9d45f7 ("Phonet: convert devices list to RCU")
    Cc: [email protected]
    Signed-off-by: Santosh Kalluri <[email protected]>
    Acked-by: Rémi Denis-Courmont <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: qualcomm: rmnet: fix endpoint use-after-free in rmnet_dellink() [+ + +]

Author: Weiming Shi <[email protected]>
Date:   Thu May 14 05:25:12 2026 -0700

    net: qualcomm: rmnet: fix endpoint use-after-free in rmnet_dellink()
    
    commit d00c953a8f69921f484b629801766da68f27f658 upstream.
    
    rmnet_dellink() removes the endpoint from the hash table with
    hlist_del_init_rcu() and then immediately frees it with kfree(). However,
    RCU readers on the receive path (rmnet_rx_handler ->
    __rmnet_map_ingress_handler) may still hold a reference to the endpoint and
    dereference ep->egress_dev after the memory has been freed. The endpoint is
    a kmalloc-32 object, and the stale read at offset 8 corresponds to the
    egress_dev pointer.
    
      BUG: unable to handle page fault for address: ffffffffde942eef
      Oops: 0002 [#1] SMP NOPTI
      CPU: 1 UID: 0 PID: 137 Comm: poc_write Not tainted 7.0.0+ #4 PREEMPTLAZY
      RIP: 0010:rmnet_vnd_rx_fixup (rmnet_vnd.c:27)
      Call Trace:
       <TASK>
       __rmnet_map_ingress_handler (rmnet_handlers.c:48 rmnet_handlers.c:101)
       rmnet_rx_handler (rmnet_handlers.c:129 rmnet_handlers.c:235)
       __netif_receive_skb_core.constprop.0 (net/core/dev.c:6096)
       __netif_receive_skb_one_core (net/core/dev.c:6208)
       netif_receive_skb (net/core/dev.c:6467)
       tun_get_user (drivers/net/tun.c:1955)
       tun_chr_write_iter (drivers/net/tun.c:2003)
       vfs_write (fs/read_write.c:688)
       ksys_write (fs/read_write.c:740)
       </TASK>
    
    Add an rcu_head field to struct rmnet_endpoint and replace kfree() with
    kfree_rcu() so the endpoint memory remains valid through the RCU grace
    period. Also remove the rmnet_vnd_dellink() call and inline only the
    nr_rmnet_devs decrement, since rmnet_vnd_dellink() would set
    ep->egress_dev to NULL during the grace period, creating a data race
    with lockless readers.
    
    Fixes: ceed73a2cf4a ("drivers: net: ethernet: qualcomm: rmnet: Initial implementation")
    Reported-by: Xiang Mei <[email protected]>
    Signed-off-by: Weiming Shi <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: skmsg: preserve sg.copy across SG transforms [+ + +]

Author: Yiming Qian <[email protected]>
Date:   Wed Jun 10 06:21:36 2026 +0000

    net: skmsg: preserve sg.copy across SG transforms
    
    commit 406e8a651a7b854c41fecd5117bb282b3a6c2c6b upstream.
    
    The sk_msg sg.copy bitmap is part of the scatterlist entry ownership
    state. A set bit tells sk_msg_compute_data_pointers() not to expose the
    entry through writable BPF ctx->data. This protects entries backed by
    pages that are not private to the sk_msg, such as splice-backed file
    page-cache pages.
    
    Several sk_msg transform paths move, copy, split, or compact
    msg->sg.data[] entries without moving the matching sg.copy bit. This can
    make an externally backed entry arrive at a new slot with a clear copy
    bit. A later SK_MSG verdict can then expose sg_virt(sge) as writable
    ctx->data and BPF stores can modify the original page cache.
    
    Keep sg.copy synchronized with sg.data[] whenever entries are
    transferred, shifted, split, or copied into a new sk_msg. Clear the bit
    when an entry is replaced by a newly allocated private page or freed.
    This covers the BPF pull/push/pop helpers, sk_msg_shift_left/right(),
    sk_msg_xfer(), and tls_split_open_record(), including the partial tail
    entry created during TLS open-record splitting.
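
Keeping a per-entry bit in step with the entry it describes can be sketched
with a flat array and a bitmap. The real sk_msg scatterlist is a ring with
more state; the names and the linear layout here are simplifying
assumptions.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MODEL_MAX_ELEMS 8

struct model_sg {
    int data[MODEL_MAX_ELEMS];   /* stands in for msg->sg.data[] */
    unsigned long copy;          /* bit i set: entry i is not private */
};

static bool model_test_copy(const struct model_sg *sg, size_t i)
{
    return (sg->copy >> i) & 1UL;
}

static void model_assign_copy(struct model_sg *sg, size_t i, bool v)
{
    if (v)
        sg->copy |= 1UL << i;
    else
        sg->copy &= ~(1UL << i);
}

/* Move entry src into slot dst and carry its copy bit along. */
static void model_move_entry(struct model_sg *sg, size_t dst, size_t src)
{
    assert(dst < MODEL_MAX_ELEMS && src < MODEL_MAX_ELEMS);
    sg->data[dst] = sg->data[src];
    model_assign_copy(sg, dst, model_test_copy(sg, src));
}

/* Drop the entry at idx by shifting the following entries left by one. */
static void model_shift_left(struct model_sg *sg, size_t idx, size_t count)
{
    assert(count >= 1 && count <= MODEL_MAX_ELEMS && idx < count);
    for (size_t i = idx; i + 1 < count; i++)
        model_move_entry(sg, i, i + 1);
    sg->data[count - 1] = 0;
    model_assign_copy(sg, count - 1, false);   /* vacated slot is clean */
}
```

After a shift, the bit that marked the externally backed entry is found at
that entry's new index, and the vacated slot no longer carries it.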
    
    Fixes: d3b18ad31f93 ("tls: add bpf support to sk_msg handling")
    Cc: [email protected]
    Reported-by: Yiming Qian <[email protected]>
    Reported-by: Keenan Dong <[email protected]>
    Signed-off-by: Yiming Qian <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

netfilter: nf_tables: always increment set element count [+ + +]

Author: Florian Westphal <[email protected]>
Date:   Fri Jun 19 02:28:48 2026 -0700

    netfilter: nf_tables: always increment set element count
    
    [ Upstream commit d4b7f29eb85c93893bc27388b37709efbc3c9a0e ]
    
    At this time, set->nelems counter only increments when the set has
    a maximum size.
    
    All set elements decrement the counter unconditionally, which is
    confusing.
    
    Increment the counter unconditionally to make this symmetrical.
    This would also allow changing the set maximum size after set creation
    in a later patch.
    
    Signed-off-by: Florian Westphal <[email protected]>
    [ Shivani: Modified to apply on 6.1.y ]
    Signed-off-by: Shivani Agarwal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nf_tables: always walk all pending catchall elements [+ + +]

Author: Florian Westphal <[email protected]>
Date:   Thu Mar 5 21:32:00 2026 +0100

    netfilter: nf_tables: always walk all pending catchall elements
    
    commit 7cb9a23d7ae40a702577d3d8bacb7026f04ac2a9 upstream.
    
    During transaction processing we might have more than one catchall element:
    1 live catchall element and 1 pending element that is coming as part of the
    new batch.
    
    If the map holding the catchall elements is also going away, it is
    required to toggle all catchall elements and not just the first viable
    candidate.
    
    Otherwise, we get:
     WARNING: ./include/net/netfilter/nf_tables.h:1281 at nft_data_release+0xb7/0xe0 [nf_tables], CPU#2: nft/1404
     RIP: 0010:nft_data_release+0xb7/0xe0 [nf_tables]
     [..]
     __nft_set_elem_destroy+0x106/0x380 [nf_tables]
     nf_tables_abort_release+0x348/0x8d0 [nf_tables]
     nf_tables_abort+0xcf2/0x3ac0 [nf_tables]
     nfnetlink_rcv_batch+0x9c9/0x20e0 [..]
    
    Fixes: 628bd3e49cba ("netfilter: nf_tables: drop map element references from preparation phase")
    Reported-by: Yiming Qian <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    [ Shivani: Modified to apply on v6.6.y-v6.1.y ]
    Signed-off-by: Shivani Agarwal <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

netfilter: nf_tables: fix set size with rbtree backend [+ + +]

Author: Pablo Neira Ayuso <[email protected]>
Date:   Fri Jun 19 02:28:49 2026 -0700

    netfilter: nf_tables: fix set size with rbtree backend
    
    [ Upstream commit 8d738c1869f611955d91d8d0fd0012d9ef207201 ]
    
    The existing rbtree implementation uses singleton elements to represent
    ranges, however, userspace provides a set size according to the number
    of ranges in the set.
    
    Adjust the set size provided by userspace to the number of singleton
    elements in the kernel by multiplying the number of ranges by two.
    
    Check whether the no-match all-zero element is already in the set; in
    that case, release one slot in the set size.
    
    Fixes: 0ed6389c483d ("netfilter: nf_tables: rename set implementations")
    Signed-off-by: Pablo Neira Ayuso <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    [ Shivani: Modified to apply on 6.1.y ]
    Signed-off-by: Shivani Agarwal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nf_tables: unconditionally bump set->nelems before insertion [+ + +]

Author: Pablo Neira Ayuso <[email protected]>
Date:   Fri Jun 19 02:28:50 2026 -0700

    netfilter: nf_tables: unconditionally bump set->nelems before insertion
    
    [ Upstream commit def602e498a4f951da95c95b1b8ce8ae68aa733a ]
    
    If the set is full, a new element gets published and then removed
    without waiting for the RCU grace period, while an RCU reader may
    already be walking over it.
    
    To address this issue, add the element transaction even if the set is
    full, but toggle the set_full flag to report -ENFILE so the abort path
    safely unwinds the set to its previous state.
    
    As for element updates, decrement set->nelems to restore it.
    
    A simpler fix is to call synchronize_rcu() in the error path.
    However, with a large batch adding elements to an already maxed-out
    set, this could cause a noticeable slowdown of such batches.
    
    Fixes: 35d0ac9070ef ("netfilter: nf_tables: fix set->nelems counting with no NLM_F_EXCL")
    Reported-by: Inseo An <[email protected]>
    Signed-off-by: Pablo Neira Ayuso <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    [ Minor conflict resolved. ]
    Signed-off-by: Li hongliang <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    [ Shivani: Modified to apply on 6.1.y ]
    Signed-off-by: Shivani Agarwal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nfsd: check get_user() return when reading princhashlen [+ + +]

Author: Dominik Woźniak <[email protected]>
Date:   Thu May 21 17:46:56 2026 +0200

    nfsd: check get_user() return when reading princhashlen
    
    commit e186fa1c057f5eccb22afb1e83e34c0627085868 upstream.
    
    In __cld_pipe_inprogress_downcall(), the get_user() that reads
    princhashlen from the userspace cld_msg_v2 buffer does not check its
    return value. A failing copy leaves princhashlen with uninitialised
    stack contents, which are then used to drive memdup_user() and stored
    as princhash.len on the resulting reclaim record. The other get_user()
    calls in this function all check the return; only this one is missed,
    which is most likely a copy-paste oversight from when v2 upcalls were
    introduced.
    
    Mirror the existing pattern used a few lines above for namelen.
    namecopy is declared with __free(kfree) so the early return cleans up
    the already-allocated buffer automatically.
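
The return-value check can be sketched with a userspace stand-in for the
copy helper. All names here are hypothetical, the 8-bit length type is an
assumption, and the model says nothing about what the real get_user()
leaves in the destination on failure.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/* Stand-in for a user copy: 0 on success, -EFAULT when the copy fails. */
static int model_get_user(uint8_t *dst, const uint8_t *src)
{
    if (!src)
        return -EFAULT;
    *dst = *src;
    return 0;
}

/* Only hand the length to the caller when the copy actually succeeded. */
static int model_read_princhashlen(const uint8_t *user_len, uint8_t *out)
{
    uint8_t len;

    assert(out != NULL);
    if (model_get_user(&len, user_len))
        return -EFAULT;
    *out = len;
    return 0;
}
```

On failure the caller's value is left untouched and the error is
propagated, so a later allocation is never sized from an unchecked length.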
    
    Fixes: 6ee95d1c8991 ("nfsd: add support for upcall version 2")
    Cc: [email protected]
    Signed-off-by: Dominik Woźniak <[email protected]>
    Reviewed-by: Jeff Layton <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nfsd: fix posix_acl leak on SETACL decode failure [+ + +]

Author: Jeff Layton <[email protected]>
Date:   Thu May 21 13:51:43 2026 -0400

    nfsd: fix posix_acl leak on SETACL decode failure
    
    commit 0853ac544c590880d797b04daa33fcb72b6be0e1 upstream.
    
    nfsaclsvc_decode_setaclargs() and nfs3svc_decode_setaclargs() each
    call nfs_stream_decode_acl() twice, first for NFS_ACL and then for
    NFS_DFACL.  Each successful call transfers ownership of a freshly
    allocated posix_acl into argp->acl_access or argp->acl_default.  If
    the first call succeeds but the second fails, the decoder returns
    false and argp->acl_access is left dangling.
    
    ACLPROC2_SETACL.pc_release was wired to nfssvc_release_attrstat and
    ACLPROC3_SETACL.pc_release was wired to nfs3svc_release_fhandle.
    Both only call fh_put() and have no knowledge of the ACL fields on
    argp.  The posix_acl_release() pairs sat at the out: labels inside
    nfsacld_proc_setacl() and nfsd3_proc_setacl(), but svc_process()
    skips pc_func when pc_decode returns false, so that cleanup is
    unreachable on decode failure:
    
        svc_process_common()
          pc_decode()                  /* decode_setaclargs: false */
          /* pc_func skipped */
          pc_release()                 /* fh_put only -- ACLs leaked */
    
    The orphaned posix_acl is leaked for the lifetime of the server.
    
    Fix by adding nfsaclsvc_release_setacl() and nfs3svc_release_setacl(),
    which release both argp->acl_access and argp->acl_default in addition
    to fh_put(), and wiring them as pc_release for their respective SETACL
    procedures.  pc_release runs on every path svc_process() takes after
    decode, including decode failure, so the posix_acl_release() pairs are
    removed from the proc functions' out: labels to keep ownership in one
    place.  This matches the existing release_getacl() pattern used by
    the sibling GETACL procedures.
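    
    The ownership rule can be sketched in userspace C. This is a minimal
    model, not the NFSD code: struct acl, struct setacl_args, live_acls and
    the function names are hypothetical stand-ins for posix_acl, argp and the
    pc_decode/pc_release hooks.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical stand-ins for the kernel types; not the real NFSD structures. */
struct acl { int unused; };

struct setacl_args {
	struct acl *acl_access;
	struct acl *acl_default;
};

static int live_acls; /* ACLs allocated and not yet released */

static struct acl *acl_alloc(void)
{
	struct acl *a = calloc(1, sizeof(*a));

	if (a)
		live_acls++;
	return a;
}

static void acl_release(struct acl *a)
{
	if (!a)
		return;
	live_acls--;
	free(a);
}

/* Decode step: the first ACL always decodes, the second fails on request. */
static bool decode_setaclargs(struct setacl_args *argp, bool fail_second)
{
	argp->acl_access = acl_alloc();
	if (!argp->acl_access)
		return false;
	if (fail_second)
		return false; /* argp already owns acl_access at this point */
	argp->acl_default = acl_alloc();
	return argp->acl_default != NULL;
}

/* Release hook that knows about both ACL fields. */
static void release_setacl(struct setacl_args *argp)
{
	acl_release(argp->acl_access);
	acl_release(argp->acl_default);
	argp->acl_access = NULL;
	argp->acl_default = NULL;
}

/* Dispatch order from the diagram above: decode, optional proc, release. */
static int process_setacl(bool fail_second)
{
	struct setacl_args args = { 0 };
	bool ok = decode_setaclargs(&args, fail_second);

	/* The proc function would run here, and only when ok is true. */
	release_setacl(&args);
	return ok ? 0 : -1;
}
```

    Because the release hook runs whether or not decode succeeded, the model
    leaves live_acls at zero on both the success and the failure path.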
    
    Fixes: a257cdd0e217 ("[PATCH] NFSD: Add server support for NFSv3 ACLs.")
    Cc: [email protected]
    Assisted-by: kres:claude-opus-4-7
    Signed-off-by: Jeff Layton <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

NFSD: Fix SECINFO_NO_NAME decode error cleanup [+ + +]

Author: Guannan Wang <[email protected]>
Date:   Thu May 21 16:03:32 2026 +0800

    NFSD: Fix SECINFO_NO_NAME decode error cleanup
    
    commit 9e18e83b8846a5c3fe13fc8a464b4865d33996c6 upstream.
    
    nfsd4_decode_secinfo_no_name() currently initializes sin_exp after
    decoding sin_style. If the XDR stream is truncated, the decoder returns
    nfserr_bad_xdr before sin_exp is initialized.
    
    Since commit 3fdc54646234 ("NFSD: Reduce amount of struct
    nfsd4_compoundargs that needs clearing"), the inline iops array is not
    cleared between RPC calls. A failed SECINFO_NO_NAME decode can therefore
    leave sin_exp holding stale union contents from a previous operation.
    
    The error response path still invokes nfsd4_secinfo_no_name_release(),
    which calls exp_put() on a non-NULL sin_exp.
    
    Initialize sin_exp before the first failable decode step, matching
    nfsd4_decode_secinfo().
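    
    A minimal userspace sketch of the ordering, assuming simplified
    stand-in types (struct export, struct secinfo_no_name and both functions
    are hypothetical, not the NFSD definitions):

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-ins; the real structures live in the NFSD code. */
struct export { int refs; };

struct secinfo_no_name {
	unsigned int sin_style;
	struct export *sin_exp;
};

/* Decoder: clear the pointer before the first step that can fail. */
static bool decode_secinfo_no_name(struct secinfo_no_name *sin,
				   const unsigned char *buf, size_t len)
{
	sin->sin_exp = NULL; /* a failure below now leaves no stale pointer */
	if (len < sizeof(sin->sin_style))
		return false; /* truncated stream */
	memcpy(&sin->sin_style, buf, sizeof(sin->sin_style));
	return true;
}

/* Release hook: drops a reference only when the pointer is non-NULL. */
static int release_secinfo_no_name(struct secinfo_no_name *sin)
{
	if (!sin->sin_exp)
		return 0; /* nothing to put */
	sin->sin_exp->refs--;
	return 1;
}
```

    With the initialization first, a truncated buffer leaves sin_exp NULL
    and the release hook does not touch the leftover export.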
    
    Fixes: 3fdc54646234 ("NFSD: Reduce amount of struct nfsd4_compoundargs that needs clearing")
    Cc: [email protected]
    Signed-off-by: Guannan Wang <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

NFSv4/pNFS: reject zero-length r_addr in nfs4_decode_mp_ds_addr [+ + +]

Author: Michael Bommarito <[email protected]>
Date:   Wed May 27 12:30:35 2026 -0400

    NFSv4/pNFS: reject zero-length r_addr in nfs4_decode_mp_ds_addr
    
    commit 41fe0f7b84f0cb822ae10ab08592996a592b2a25 upstream.
    
    nfs4_decode_mp_ds_addr() decodes the r_netid and r_addr opaques of a
    netaddr4 from a GETDEVICEINFO multipath-DS body, then immediately
    calls strrchr(buf, '.') to locate the port separator. Both decodes
    use xdr_stream_decode_string_dup(), and the current code checks only
    "nlen < 0" / "rlen < 0" before dereferencing the returned string.
    
    When the on-wire opaque has length zero, xdr_stream_decode_opaque_inline()
    returns 0 and xdr_stream_decode_string_dup() falls through to its
    "*str = NULL; return ret" tail, leaving buf NULL with a return value
    of 0. The "< 0" check does not catch this, and the next line is
    strrchr(NULL, '.'), a kernel NULL pointer dereference reachable from
    any pNFS-flexfile client mounted against a malicious or compromised
    metadata server.
    
    Reject the zero-length cases explicitly so the decoder fails with
    -EBADMSG (treated as a malformed GETDEVICEINFO body) instead of
    panicking the client.
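    
    The failure mode and the guard can be modelled in userspace C. This is
    only a sketch: decode_string_dup() and find_port_separator() are
    hypothetical helpers that imitate the "length 0, string NULL" behaviour
    described above, and the -EINVAL for a missing '.' is an assumption.

```c
#include <assert.h>
#include <errno.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical model of a decode-and-duplicate helper: returns the length
 * and leaves *str NULL when the opaque is empty.
 */
static long decode_string_dup(const char *wire, size_t wire_len, char **str)
{
	*str = NULL;
	if (wire_len == 0)
		return 0;
	*str = malloc(wire_len + 1);
	if (!*str)
		return -ENOMEM;
	memcpy(*str, wire, wire_len);
	(*str)[wire_len] = '\0';
	return (long)wire_len;
}

/* Returns the offset of the last '.' in r_addr, or a negative errno. */
static long find_port_separator(const char *wire, size_t wire_len)
{
	char *buf;
	char *dot;
	long off;
	long rlen = decode_string_dup(wire, wire_len, &buf);

	if (rlen < 0)
		return rlen;
	if (rlen == 0)
		return -EBADMSG; /* buf is NULL here and must not reach strrchr() */

	dot = strrchr(buf, '.');
	off = dot ? (long)(dot - buf) : -EINVAL;
	free(buf);
	return off;
}
```

    An empty r_addr now yields -EBADMSG, while a universal address such as
    "10.0.0.1.8.1" still resolves to the offset of its last dot.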
    
    Cc: [email protected]
    Fixes: 6b7f3cf96364 ("nfs41: pull decode_ds_addr from file layout to generic pnfs")
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: Michael Bommarito <[email protected]>
    Signed-off-by: Anna Schumaker <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

NTB: epf: Avoid pci_iounmap() with offset when PEER_SPAD and CONFIG share BAR [+ + +]

Author: Koichiro Den <[email protected]>
Date:   Wed Mar 4 11:05:27 2026 +0900

    NTB: epf: Avoid pci_iounmap() with offset when PEER_SPAD and CONFIG share BAR
    
    commit d876153680e3d721d385e554def919bce3d18c74 upstream.
    
    When BAR_PEER_SPAD and BAR_CONFIG share one PCI BAR, the module teardown
    path ends up calling pci_iounmap() on the same iomem with some offset,
    which is unnecessary and triggers a kernel warning like the following:
    
      Trying to vunmap() nonexistent vm area (0000000069a5ffe8)
      WARNING: mm/vmalloc.c:3470 at vunmap+0x58/0x68, CPU#5: modprobe/2937
      [...]
      Call trace:
       vunmap+0x58/0x68 (P)
       iounmap+0x34/0x48
       pci_iounmap+0x2c/0x40
       ntb_epf_pci_remove+0x44/0x80 [ntb_hw_epf]
       pci_device_remove+0x48/0xf8
       device_remove+0x50/0x88
       device_release_driver_internal+0x1c8/0x228
       driver_detach+0x50/0xb0
       bus_remove_driver+0x74/0x100
       driver_unregister+0x34/0x68
       pci_unregister_driver+0x34/0xa0
       ntb_epf_pci_driver_exit+0x14/0xfe0 [ntb_hw_epf]
      [...]
    
    Fix it by unmapping only when PEER_SPAD and CONFIG use different BARs.
    
    Cc: [email protected]
    Fixes: e75d5ae8ab88 ("NTB: epf: Allow more flexibility in the memory BAR map method")
    Reviewed-by: Frank Li <[email protected]>
    Signed-off-by: Koichiro Den <[email protected]>
    Reviewed-by: Dave Jiang <[email protected]>
    Signed-off-by: Jon Mason <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ntfs3: reject direct userspace writes to reserved $LX* xattrs [+ + +]

Author: Konstantin Komarov <[email protected]>
Date:   Wed Jun 10 12:31:01 2026 +0200

    ntfs3: reject direct userspace writes to reserved $LX* xattrs
    
    commit 5b08dccecf825cbf905f348bc6ccb497507e28e2 upstream.
    
    NTFS3 uses $LXUID, $LXGID, $LXMOD and $LXDEV as internal WSL
    permission metadata and reloads them into i_uid, i_gid and i_mode
    from ntfs_get_wsl_perm().
    
    Because the empty-prefix xattr handler also lets file owners call
    setxattr() on these names directly, an unprivileged writer on a
    writable ntfs3 mount can plant root ownership and S_ISUID on their own
    file and gain euid 0 after inode reload.
    
    Reject direct userspace writes to the reserved $LX* names. Internal
    ntfs3 metadata updates are unchanged because ntfs_save_wsl_perm()
    writes them via ntfs_set_ea() directly.
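    
    A rough sketch of such a name gate in userspace C. The function names,
    the exact-match comparison and the -EPERM return value are assumptions
    made for illustration; only the four reserved names come from the text
    above.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* The four reserved names listed in the commit message. */
static const char *const reserved_lx_names[] = {
	"$LXUID", "$LXGID", "$LXMOD", "$LXDEV",
};

/* Exact, case-sensitive match (an assumption of this sketch). */
static bool is_reserved_lx_name(const char *name)
{
	size_t i;
	size_t n = sizeof(reserved_lx_names) / sizeof(reserved_lx_names[0]);

	for (i = 0; i < n; i++) {
		if (strcmp(name, reserved_lx_names[i]) == 0)
			return true;
	}
	return false;
}

/*
 * Hypothetical gate: from_userspace models a direct setxattr() call, as
 * opposed to an internal metadata update. The errno is an assumption.
 */
static int check_xattr_write(const char *name, bool from_userspace)
{
	if (from_userspace && is_reserved_lx_name(name))
		return -EPERM;
	return 0;
}
```

    Internal updates pass from_userspace as false and are unaffected, which
    mirrors the split between setxattr() and ntfs_save_wsl_perm().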
    
    Signed-off-by: Zhen Yan <[email protected]>
    [[email protected]: added an additional check for non privileged users]
    Signed-off-by: Konstantin Komarov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ocfs2: reject oversized group bitmap descriptors [+ + +]

Author: Zhang Cen <[email protected]>
Date:   Sun May 24 19:12:48 2026 +0800

    ocfs2: reject oversized group bitmap descriptors
    
    commit 9bd541e09dffff27e5bec0f9f45b0228173a5375 upstream.
    
    ocfs2_validate_gd_parent() only bounds bg_bits against the parent
    allocator's chain geometry.  A malicious descriptor can still claim a
    bg_size/bg_bits pair that exceeds the bitmap bytes that physically fit in
    the group descriptor block, so later bitmap scans and bit updates can run
    past bg_bitmap.
    
    Add a physical-cap check based on ocfs2_group_bitmap_size() for the parent
    allocator type and reject descriptors whose bg_size or bg_bits exceed that
    capacity.  Keep the existing chain geometry check so both the on-disk
    bitmap layout and the allocator metadata must agree before the descriptor
    is used.
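    
    The physical-cap check amounts to two comparisons. The sketch below is a
    simplified userspace model: struct group_desc and gd_fits_block() are
    hypothetical, and max_bitmap_bytes stands in for whatever
    ocfs2_group_bitmap_size() returns for the parent allocator type.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical, simplified descriptor holding only the two fields named above. */
struct group_desc {
	uint16_t bg_size; /* bitmap size in bytes */
	uint16_t bg_bits; /* number of valid bits in the bitmap */
};

/*
 * Reject descriptors whose claimed bitmap cannot physically fit.
 * max_bitmap_bytes is the number of bitmap bytes the block can hold.
 */
static bool gd_fits_block(const struct group_desc *gd, uint32_t max_bitmap_bytes)
{
	if (gd->bg_size > max_bitmap_bytes)
		return false;
	if ((uint64_t)gd->bg_bits > 8ull * max_bitmap_bytes)
		return false;
	return true;
}
```

    A descriptor that passes this test can still fail the chain geometry
    check; the two checks are meant to be applied together.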
    
    Validation reproduced this kernel report:
    KASAN use-after-free in _find_next_bit+0x7f/0xc0
    Read of size 8
    Call trace:
      dump_stack_lvl+0x66/0xa0 (?:?)
      print_report+0xd0/0x630 (?:?)
      _find_next_bit+0x7f/0xc0 (?:?)
      srso_alias_return_thunk+0x5/0xfbef5 (?:?)
      __virt_addr_valid+0x188/0x2f0 (?:?)
      kasan_report+0xe4/0x120 (?:?)
      ocfs2_find_max_contig_free_bits+0x35/0x70 (fs/ocfs2/suballoc.c:1375)
      ocfs2_block_group_set_bits+0x472/0x4b0 (fs/ocfs2/suballoc.c:1457)
      ocfs2_cluster_group_search+0x16b/0x440 (fs/ocfs2/suballoc.c:86)
      ocfs2_bg_discontig_fix_result+0x1ef/0x230 (fs/ocfs2/suballoc.c:1786)
      ocfs2_search_chain+0x8f8/0x10a0 (fs/ocfs2/suballoc.c:1886)
      get_page_from_freelist+0x70e/0x2370 (?:?)
      lock_release+0xc6/0x290 (?:?)
      do_raw_spin_unlock+0x9a/0x100 (?:?)
      kasan_unpoison+0x27/0x60 (?:?)
      __bfs+0x147/0x240 (?:?)
      get_page_from_freelist+0x83d/0x2370 (?:?)
      ocfs2_claim_suballoc_bits+0x38c/0xe70 (fs/ocfs2/suballoc.c:96)
      sched_domains_numa_masks_clear+0x70/0xd0 (?:?)
      check_irq_usage+0xe8/0xb70 (?:?)
      __ocfs2_claim_clusters+0x18d/0x4c0 (fs/ocfs2/suballoc.c:2497)
      check_path+0x24/0x50 (?:?)
      rcu_is_watching+0x20/0x50 (?:?)
      check_prev_add+0xfd/0xd00 (?:?)
      ocfs2_add_clusters_in_btree+0x17d/0x810 (fs/ocfs2/suballoc.c:?)
      __folio_batch_add_and_move+0x1f5/0x3d0 (?:?)
      ocfs2_add_inode_data+0xd9/0x120 (fs/ocfs2/suballoc.c:?)
      filemap_add_folio+0x105/0x1f0 (?:?)
      ocfs2_write_begin_nolock+0x29f7/0x2f80 (fs/ocfs2/suballoc.c:3043)
      ocfs2_read_inode_block+0xb5/0x110 (fs/ocfs2/suballoc.c:?)
      down_write+0xf5/0x180 (?:?)
      ocfs2_write_begin+0x180/0x240 (fs/ocfs2/suballoc.c:?)
      __mark_inode_dirty+0x758/0x9a0 (?:?)
      inode_to_bdi+0x41/0x90 (?:?)
      balance_dirty_pages_ratelimited_flags+0xf8/0x1d0 (?:?)
      generic_perform_write+0x252/0x440 (?:?)
      mnt_put_write_access_file+0x16/0x70 (?:?)
      file_update_time_flags+0xe4/0x200 (?:?)
      ocfs2_file_write_iter+0x80a/0x1320 (fs/ocfs2/suballoc.c:?)
      lock_acquire+0x184/0x2f0 (?:?)
      ksys_write+0xd2/0x170 (?:?)
      apparmor_file_permission+0xf5/0x310 (?:?)
      read_zero+0x8d/0x140 (?:?)
      lock_is_held_type+0x8f/0x100 (?:?)
    
    Link: https://lore.kernel.org/[email protected]
    Fixes: ccd979bdbce9 ("[PATCH] OCFS2: The Second Oracle Cluster Filesystem")
    Assisted-by: Codex:gpt-5.5
    Signed-off-by: Zhang Cen <[email protected]>
    Reviewed-by: Joseph Qi <[email protected]>
    Cc: Mark Fasheh <[email protected]>
    Cc: Joel Becker <[email protected]>
    Cc: Junxiao Bi <[email protected]>
    Cc: Changwei Ge <[email protected]>
    Cc: Jun Piao <[email protected]>
    Cc: Heming Zhao <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

perf bench: Avoid NDEBUG warning [+ + +]

Author: Ian Rogers <[email protected]>
Date:   Thu Jun 25 13:32:21 2026 +0000

    perf bench: Avoid NDEBUG warning
    
    [ Upstream commit d1babea9c38282b58a6f822ab95027cba3165a42 ]
    
    With NDEBUG set, the asserts are compiled out. This yields
    "unused-but-set-variable" warnings. Move the affected variables behind
    NDEBUG to avoid the warning.
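    
    The pattern looks roughly like this. sum_checked() is a made-up example
    function, not code from perf bench; the point is only that a variable
    read solely by assert() is declared and updated under #ifndef NDEBUG.

```c
#include <assert.h>
#include <stddef.h>

/* Sums n values; the element count is cross-checked only in debug builds. */
static long sum_checked(const int *v, size_t n)
{
#ifndef NDEBUG
	size_t seen = 0; /* read only by assert(), so it exists only with asserts */
#endif
	long total = 0;
	size_t i;

	for (i = 0; i < n; i++) {
		total += v[i];
#ifndef NDEBUG
		seen++;
#endif
	}
	assert(seen == n); /* expands to nothing when NDEBUG is defined */
	return total;
}
```

    Building with -DNDEBUG removes both the variable and the assert, so the
    compiler has nothing left to warn about.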
    
    Signed-off-by: Ian Rogers <[email protected]>
    Cc: Adrian Hunter <[email protected]>
    Cc: Alexander Shishkin <[email protected]>
    Cc: Ingo Molnar <[email protected]>
    Cc: Jiri Olsa <[email protected]>
    Cc: Mark Rutland <[email protected]>
    Cc: Namhyung Kim <[email protected]>
    Cc: Paolo Bonzini <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Cc: Sean Christopherson <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Arnaldo Carvalho de Melo <[email protected]>
    Stable-dep-of: 616b14b47a86 ("perf build: Conditionally define NDEBUG")
    Signed-off-by: Simon Liebold <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf block-range: Move debug code behind ifndef NDEBUG [+ + +]

Author: Ian Rogers <[email protected]>
Date:   Thu Jun 25 13:32:22 2026 +0000

    perf block-range: Move debug code behind ifndef NDEBUG
    
    [ Upstream commit 984a785f25e5b5db5fa673130b60dca6ca794406 ]
    
    Make good on a comment and avoid an unused-but-set-variable warning.
    
    Signed-off-by: Ian Rogers <[email protected]>
    Cc: Adrian Hunter <[email protected]>
    Cc: Alexander Shishkin <[email protected]>
    Cc: Ingo Molnar <[email protected]>
    Cc: Jiri Olsa <[email protected]>
    Cc: Mark Rutland <[email protected]>
    Cc: Namhyung Kim <[email protected]>
    Cc: Paolo Bonzini <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Cc: Sean Christopherson <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Arnaldo Carvalho de Melo <[email protected]>
    Stable-dep-of: 616b14b47a86 ("perf build: Conditionally define NDEBUG")
    Signed-off-by: Simon Liebold <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

phonet: Pass ifindex to fill_addr(). [+ + +]

Author: Kuniyuki Iwashima <[email protected]>
Date:   Wed Jun 17 10:45:02 2026 -0400

    phonet: Pass ifindex to fill_addr().
    
    [ Upstream commit 08a9572be36819b5d9011604edfa5db6c5062a7a ]
    
    We will convert addr_doit() and getaddr_dumpit() to RCU, both
    of which call fill_addr().
    
    The former will call phonet_address_notify() outside of RCU
    due to GFP_KERNEL, so dev will not be available in fill_addr().
    
    Let's pass ifindex directly to fill_addr().
    
    Signed-off-by: Kuniyuki Iwashima <[email protected]>
    Reviewed-by: Eric Dumazet <[email protected]>
    Signed-off-by: Paolo Abeni <[email protected]>
    Stable-dep-of: 71de0177b28d ("net: phonet: free phonet_device after RCU grace period")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

phonet: Pass net and ifindex to phonet_address_notify(). [+ + +]

Author: Kuniyuki Iwashima <[email protected]>
Date:   Wed Jun 17 10:45:03 2026 -0400

    phonet: Pass net and ifindex to phonet_address_notify().
    
    [ Upstream commit 68ed5c38b512b734caf3da1f87db4a99fcfe3002 ]
    
    Currently, phonet_address_notify() fetches netns and ifindex from dev.
    
    Once addr_doit() is converted to RCU, phonet_address_notify() will be
    called outside of RCU due to GFP_KERNEL, and dev will be unavailable
    there.
    
    Let's pass net and ifindex to phonet_address_notify().
    
    Signed-off-by: Kuniyuki Iwashima <[email protected]>
    Reviewed-by: Eric Dumazet <[email protected]>
    Signed-off-by: Paolo Abeni <[email protected]>
    Stable-dep-of: 71de0177b28d ("net: phonet: free phonet_device after RCU grace period")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

pNFS: Fix use-after-free in pnfs_update_layout() [+ + +]

Author: Wentao Liang <[email protected]>
Date:   Mon May 18 13:10:36 2026 +0000

    pNFS: Fix use-after-free in pnfs_update_layout()
    
    commit 13e198a90ca4050f4bee8a3f23680389a6563ccc upstream.
    
    When hitting the NFS_LAYOUT_RETURN branch in pnfs_update_layout(),
    the code calls pnfs_prepare_to_retry_layoutget(lo). If it succeeds,
    pnfs_put_layout_hdr(lo) is called before trace_pnfs_update_layout(),
    which still references 'lo'. This results in a use-after-free when the
    tracepoint accesses lo's fields.
    
    Fix this by moving the tracepoint call before pnfs_put_layout_hdr(lo).
    
    Fixes: 2c8d5fc37fe2 ("pNFS: Stricter ordering of layoutget and layoutreturn")
    Cc: [email protected]
    Signed-off-by: Wentao Liang <[email protected]>
    Signed-off-by: Anna Schumaker <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

power: reset: linkstation-poweroff: fix use-after-free in the linkstation_poweroff_init() [+ + +]

Author: Wentao Liang <[email protected]>
Date:   Tue Apr 7 07:30:25 2026 +0000

    power: reset: linkstation-poweroff: fix use-after-free in the linkstation_poweroff_init()
    
    commit 8eec545cde69e46e9a1d2b7d915ce4f5df85b3bd upstream.
    
    Move of_node_put(dn) after the of_match_node() call, which still needs
    the node pointer. The node reference is correctly released after use.
    
    Fixes: e2f471efe1d6 ("power: reset: linkstation-poweroff: prepare for new devices")
    Cc: [email protected]
    Signed-off-by: Wentao Liang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sebastian Reichel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

RDMA/bnxt_re: zero shared page before exposing to userspace [+ + +]

Author: Lord Ulf Henrik Holmberg <[email protected]>
Date:   Sat May 9 10:40:11 2026 +0200

    RDMA/bnxt_re: zero shared page before exposing to userspace
    
    commit f6b079629becfa977f9c51fe53ad2e6dcc55ef44 upstream.
    
    bnxt_re_alloc_ucontext() allocates uctx->shpg via
    __get_free_page(GFP_KERNEL). The buddy allocator does not zero pages
    without __GFP_ZERO, so the page contains stale kernel data from
    whatever object most recently freed it.
    
    The page is then mapped into userspace via vm_insert_page() under
    BNXT_RE_MMAP_SH_PAGE in bnxt_re_mmap(). The driver only ever writes
    4 bytes (a u32 AVID) at offset BNXT_RE_AVID_OFFT (0x10) inside
    bnxt_re_create_ah(); the remaining 4092 bytes of the page are exposed
    to userspace unsanitised, leaking kernel memory contents.
    
    Any user with access to /dev/infiniband/uverbsX on a host with a
    bnxt_re device (typically rdma group membership) can read this data
    via a single mmap() at pgoff 0 after IB_USER_VERBS_CMD_GET_CONTEXT.
    
    Other shared pages in the same file already use get_zeroed_page()
    correctly:
    
      drivers/infiniband/hw/bnxt_re/ib_verbs.c
          srq->uctx_srq_page = (void *)get_zeroed_page(GFP_KERNEL);
          cq->uctx_cq_page  = (void *)get_zeroed_page(GFP_KERNEL);
    
    uctx->shpg is the only outlier. Bring it in line with the existing
    convention by switching to get_zeroed_page().
    
    Fixes: 1ac5a4047975 ("RDMA/bnxt_re: Add bnxt_re RoCE driver")
    Signed-off-by: Lord Ulf Henrik Holmberg <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

regulator: core: fix locking in regulator_resolve_supply() error path [+ + +]

Author: André Draszik <[email protected]>
Date:   Fri Jan 9 08:38:38 2026 +0000

    regulator: core: fix locking in regulator_resolve_supply() error path
    
    commit 497330b203d2c59c5ff3fa4c34d14494d7203bc3 upstream.
    
    If late enabling of a supply regulator fails in
    regulator_resolve_supply(), the code currently triggers a lockdep
    warning:
    
        WARNING: drivers/regulator/core.c:2649 at _regulator_put+0x80/0xa0, CPU#6: kworker/u32:4/596
        ...
        Call trace:
         _regulator_put+0x80/0xa0 (P)
         regulator_resolve_supply+0x7cc/0xbe0
         regulator_register_resolve_supply+0x28/0xb8
    
    as the regulator_list_mutex must be held when calling _regulator_put().
    
    To solve this, simply switch to using regulator_put().
    
    While at it, we should also make sure that no concurrent access happens
    to our rdev while we clear out the supply pointer. Add appropriate
    locking to ensure that.
    
    While the code in question will be removed altogether in a follow-up
    commit, I believe it is still beneficial to have this corrected before
    removal for future reference.
    
    Fixes: 36a1f1b6ddc6 ("regulator: core: Fix memory leak in regulator_resolve_supply()")
    Fixes: 8e5356a73604 ("regulator: core: Clear the supply pointer if enabling fails")
    Signed-off-by: André Draszik <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Nazar Kalashnikov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Revert "ptp: add testptp mask test" [+ + +]

Author: Petr Machata <[email protected]>
Date:   Thu Jun 25 14:31:21 2026 +0200

    Revert "ptp: add testptp mask test"
    
    This reverts commit c1c50689799d0343598ab6ccb7209819bcef248d, which is
    commit 26285e689c6cd2cf3849568c83b2ebe53f467143 upstream.
    
    The reverted commit extends the selftest to test timestamp event queue mask
    manipulation in testptp. It exercises masks PTP_MASK_CLEAR_ALL and
    PTP_MASK_EN_SINGLE, introduced in commit c5a445b1e934 ("ptp: support event
    queue reader channel masks"), which is not on this stable branch. The test
    case thus cannot be built against this tree's own UAPI headers.
    
    The reverted commit was introduced to resolve a missing dependency of
    commit c6dc458227a3 ("testptp: Add option to open PHC in readonly mode"),
    which is 76868642e427 upstream. The only conflict between the two is the
    getopt string, and there is otherwise no direct dependency between the two.
    
    This patch therefore reverts the cited commit, hand-resolving the
    getopt string to include 'r' (as introduced by c6dc458227a3), but not
    'F' (introduced by c1c50689799d).
    
    Reported-by: Yong Wang <[email protected]>
    Signed-off-by: Petr Machata <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ring-buffer: Remove ring_buffer_read_prepare_sync() [+ + +]

Author: Bjoern Doebel <[email protected]>
Date:   Wed Jun 24 12:23:22 2026 +0000

    ring-buffer: Remove ring_buffer_read_prepare_sync()
    
    [ Upstream commit 119a5d573622ae90ba730d18acfae9bb75d77b9a ]
    
    When the ring buffer was first introduced, reading the non-consuming
    "trace" file required disabling the writing of the ring buffer. To make
    sure the writing was fully disabled before iterating the buffer with a
    non-consuming read, it would set the disable flag of the buffer and then
    call an RCU synchronization to make sure all the buffers were
    synchronized.
    
    The function ring_buffer_read_start() originally would initialize the
    iterator and call an RCU synchronization, but this was done for each
    individual per CPU buffer, so it would get called many times on a machine
    with many CPUs before the trace file could be read. The commit
    72c9ddfd4c5bf ("ring-buffer: Make non-consuming read less expensive with
    lots of cpus.") separated ring_buffer_read_start() into
    ring_buffer_read_prepare(), ring_buffer_read_prepare_sync() and then
    ring_buffer_read_start() to allow each of the per CPU buffers to be
    prepared, call ring_buffer_read_prepare_sync() once, and then call
    ring_buffer_read_start() for each of the CPUs, which made things much
    faster.
    
    The commit 1039221cc278 ("ring-buffer: Do not disable recording when there
    is an iterator") removed the requirement of disabling the recording of the
    ring buffer in order to iterate it, but it did not remove the
    synchronization that had been required to wait for all the buffers to
    have no more writers. It's now OK for the buffers to have
    writers and no synchronization is needed.
    
    Remove the synchronization and restore the ring buffer iterator
    interface to what it was before commit 72c9ddfd4c5bf was applied.
    
    Cc: Mathieu Desnoyers <[email protected]>
    Link: https://lore.kernel.org/[email protected]
    Reported-by: David Howells <[email protected]>
    Fixes: 1039221cc278 ("ring-buffer: Do not disable recording when there is an iterator")
    Tested-by: David Howells <[email protected]>
    Reviewed-by: Masami Hiramatsu (Google) <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Assisted-by: Kiro:claude-opus-4.8
    Signed-off-by: Bjoern Doebel <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

rpmsg: char: Add lock to avoid race when rpmsg device is released [+ + +]

Author: Deepak Kumar Singh <[email protected]>
Date:   Mon Sep 19 16:23:59 2022 +0530

    rpmsg: char: Add lock to avoid race when rpmsg device is released
    
    commit 17b88a2050e9d1f89a53562f2adb709a8959e763 upstream.
    
    When the remote host goes down, the glink char device channel is freed and
    the associated rpdev is destroyed through rpmsg_chrdev_eptdev_destroy().
    At the same time, user space apps can still try to open/poll the rpmsg
    char device, which results in calls to rpmsg_create_ept()/rpmsg_poll().
    These functions will try to reference the rpdev, which has already been
    freed through rpmsg_chrdev_eptdev_destroy().
    
    File operation functions and the device removal function must be protected
    with a lock. This patch takes the existing ept lock in the remove function
    as well.
    
    Signed-off-by: Deepak Kumar Singh <[email protected]>
    Signed-off-by: Bjorn Andersson <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Wen Yang <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

selftests/bpf: Check for timeout in perf_link test [+ + +]

Author: Ihor Solodrai <[email protected]>
Date:   Fri Jun 19 12:59:48 2026 +0800

    selftests/bpf: Check for timeout in perf_link test
    
    commit e6c209da7e0e9aaf955a7b59e91ed78c2b6c96fb upstream.
    
    Recently perf_link test started unreliably failing on libbpf CI:
      * https://github.com/libbpf/libbpf/actions/runs/11260672407/job/31312405473
      * https://github.com/libbpf/libbpf/actions/runs/11260992334/job/31315514626
      * https://github.com/libbpf/libbpf/actions/runs/11263162459/job/31320458251
    
    Part of the test is running a dummy loop for a while and then checking
    for a counter incremented by the test program.
    
    Instead of waiting for an arbitrary number of loop iterations once,
    check for the test counter in a loop and use get_time_ns() helper to
    enforce a 100ms timeout.
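    
    The polling scheme can be sketched as follows. now_ns() and
    wait_for_counter() are hypothetical names used for this illustration;
    now_ns() plays the role of the get_time_ns() helper and is assumed here
    to read CLOCK_MONOTONIC.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <time.h>

/* Monotonic time in nanoseconds (stand-in for the selftest helper). */
static uint64_t now_ns(void)
{
	struct timespec ts;

	clock_gettime(CLOCK_MONOTONIC, &ts);
	return (uint64_t)ts.tv_sec * 1000000000ull + (uint64_t)ts.tv_nsec;
}

/*
 * Busy-poll *counter until it reaches target or timeout_ns elapses.
 * Returns true if the target was reached, false on timeout.
 */
static bool wait_for_counter(const volatile int *counter, int target,
			     uint64_t timeout_ns)
{
	uint64_t start = now_ns();

	while (*counter < target) {
		if (now_ns() - start > timeout_ns)
			return false;
	}
	return true;
}
```

    A caller would pass 100000000ull for the 100ms budget and treat a false
    return as a test failure rather than relying on a fixed iteration count.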
    
    v1: https://lore.kernel.org/bpf/zuRd072x9tumn2iN4wDNs5av0nu5nekMNV4PkR-YwCT10eFFTrUtZBRkLWFbrcCe7guvLStGQlhibo8qWojCO7i2-NGajes5GYIyynexD-w=@pm.me/
    
    Signed-off-by: Ihor Solodrai <[email protected]>
    Signed-off-by: Andrii Nakryiko <[email protected]>
    Link: https://lore.kernel.org/bpf/[email protected]
    Signed-off-by: Shung-Hsi Yu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

selftests/bpf: Move get_time_ns to testing_helpers.h [+ + +]

Author: Jiri Olsa <[email protected]>
Date:   Fri Jun 19 12:59:47 2026 +0800

    selftests/bpf: Move get_time_ns to testing_helpers.h
    
    commit 3830d04a7401f277185e4439eb259c1b13e76788 upstream.
    
    We'd like to have a single copy of get_time_ns used by bench and test_progs,
    but we can't just include bench.h, because of conflicting 'struct env'
    objects.
    
    Move get_time_ns to testing_helpers.h, which is included by both the
    bench and test_progs objects.
    
    Signed-off-by: Jiri Olsa <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Alexei Starovoitov <[email protected]>
    Stable-dep-of: e6c209da7e0e ("selftests/bpf: Check for timeout in perf_link test")
    Signed-off-by: Shung-Hsi Yu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

selftests/bpf: move SYS() macro into the test_progs.h [+ + +]

Author: Hangbin Liu <[email protected]>
Date:   Thu Jun 18 09:40:31 2026 +0800

    selftests/bpf: move SYS() macro into the test_progs.h
    
    commit b61987d37cbee3c44e80304598c60b163553926b upstream.
    
    A lot of tests define their own SYS() macro to run commands via system()
    with a goto label. Let's move this macro to test_progs.h and add a
    configurable "goto_label" as the first arg.
    
    Suggested-by: Martin KaFai Lau <[email protected]>
    Signed-off-by: Hangbin Liu <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Martin KaFai Lau <[email protected]>
    Stable-dep-of: 967e8def1100 ("selftests/bpf: Fix bpf_nf selftest failure")
    [shung-hsi.yu: changes to several files are dropped because they don't exist
    yet: decap_sanity.c, fib_lookup.c, xdp_metadata.c, and xfrm_info.c. Additional
    changes are introduced to tc_redirect.c to patch SYS() macro usage not found
    in upstream. ]
    Signed-off-by: Shung-Hsi Yu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

serial: 8250_dw: unregister 8250 port if clk_notifier_register() fails [+ + +]

Author: Stepan Ionichev <[email protected]>
Date:   Thu Jun 25 10:06:00 2026 -0400

    serial: 8250_dw: unregister 8250 port if clk_notifier_register() fails
    
    [ Upstream commit 10fc708b4de7f86002d2d735a2dbf3b5b7f65692 ]
    
    dw8250_probe() registers the 8250 port via serial8250_register_8250_port()
    and then, if the device has a clock, registers a clock notifier. If
    clk_notifier_register() fails, probe returns the error but leaves the
    8250 port registered. The matching serial8250_unregister_port() lives
    in dw8250_remove(), which is not called when probe fails, so the port
    slot stays occupied until the device is rebound or the system is
    rebooted. The devm-allocated driver data is freed while the port still
    references it (via the saved private_data and serial_in/serial_out
    callbacks), so any access to that port slot before a rebind is a
    use-after-free hazard.
    
    Unregister the port on the clk_notifier_register() error path.
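    
    The unwinding can be modelled in a few lines of userspace C. All names
    below (register_port(), unregister_port(), register_notifier(), probe()
    and ports_registered) are hypothetical stand-ins for the 8250 and clock
    notifier calls, not the driver's code.

```c
#include <assert.h>
#include <stdbool.h>

static int ports_registered; /* number of port slots currently occupied */

/* Stand-in for port registration; returns a line number (always 0 here). */
static int register_port(void)
{
	ports_registered++;
	return 0;
}

static void unregister_port(int line)
{
	(void)line;
	ports_registered--;
}

/* Stand-in for notifier registration that can be made to fail. */
static int register_notifier(bool fail)
{
	return fail ? -1 : 0;
}

/* Probe with the error path undoing the earlier registration. */
static int probe(bool has_clk, bool notifier_fails)
{
	int line = register_port();
	int err;

	if (line < 0)
		return line;

	if (has_clk) {
		err = register_notifier(notifier_fails);
		if (err) {
			unregister_port(line); /* remove() will not run for us */
			return err;
		}
	}
	return 0;
}
```

    After a failed probe the model leaves no port registered, so nothing
    keeps pointing at driver data that is about to be freed.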
    
    Fixes: cc816969d7b5 ("serial: 8250_dw: Fix common clocks usage race condition")
    Cc: [email protected]
    Signed-off-by: Stepan Ionichev <[email protected]>
    Reviewed-by: Andy Shevchenko <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tipc: fix slab-use-after-free Read in tipc_aead_decrypt_done [+ + +]

Author: Doruk Tan Ozturk <[email protected]>
Date:   Wed Jun 17 09:58:18 2026 +0200

    tipc: fix slab-use-after-free Read in tipc_aead_decrypt_done
    
    commit bda3348872a2ef0d19f2df6aa8cb5025adce2f20 upstream.
    
    tipc_aead_decrypt() goes straight from tipc_bearer_hold(b) to
    crypto_aead_decrypt(req) without taking a reference on the netns, unlike
    the encrypt path. When crypto_aead_decrypt() is offloaded asynchronously
    (e.g. the SIMD aead wrapper queuing to cryptd), the cryptd worker runs
    tipc_aead_decrypt_done() later. If the bearer's netns is torn down in the
    meantime, cleanup_net() -> tipc_exit_net() -> tipc_crypto_stop() frees the
    per-netns tipc_crypto, and the completion then reads it:
    tipc_aead_decrypt_done() dereferences aead->crypto->stats and
    aead->crypto->net, and tipc_crypto_rcv_complete() dereferences
    aead->crypto->aead[] and the node table -- reading freed memory.
    
    Decoded KASAN splat (v7.1-rc7, CONFIG_KASAN_INLINE + TIPC + TIPC_CRYPTO):
    
      BUG: KASAN: slab-use-after-free in tipc_aead_decrypt_done (net/tipc/crypto.c:999)
      Read of size 8 at addr ffff8881056258a8 by task kworker/u16:2/51
      Workqueue: events_unbound
      Call Trace:
       tipc_aead_decrypt_done (net/tipc/crypto.c:999)
       process_one_work (kernel/workqueue.c:3314)
       worker_thread (kernel/workqueue.c:3397 kernel/workqueue.c:3478)
       kthread (kernel/kthread.c:436)
       ret_from_fork (arch/x86/kernel/process.c:158)
       ret_from_fork_asm (arch/x86/entry/entry_64.S:245)
    
      Allocated by task 169:
       __kasan_kmalloc (mm/kasan/common.c:398 mm/kasan/common.c:415)
       tipc_crypto_start (net/tipc/crypto.c:1502)
       tipc_init_net (net/tipc/core.c:72)
       ops_init (net/core/net_namespace.c:137)
       setup_net (net/core/net_namespace.c:446)
       copy_net_ns (net/core/net_namespace.c:579)
       create_new_namespaces (kernel/nsproxy.c:132)
       __x64_sys_unshare (kernel/fork.c:3316)
       do_syscall_64 (arch/x86/entry/syscall_64.c:63)
       entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:121)
    
      Freed by task 8:
       kfree (mm/slub.c:6566)
       tipc_exit_net (net/tipc/core.c:119)
       cleanup_net (net/core/net_namespace.c:704)
       process_one_work (kernel/workqueue.c:3314)
       kthread (kernel/kthread.c:436)
    
    This is the same class of bug that commit e279024617134 ("net/tipc: fix
    slab-use-after-free Read in tipc_aead_encrypt_done") fixed for the encrypt
    side. The encrypt path takes maybe_get_net(aead->crypto->net) before
    crypto_aead_encrypt() and drops it with put_net() on the synchronous
    return paths and in tipc_aead_encrypt_done(); the -EINPROGRESS/-EBUSY
    return keeps the reference for the async callback to release. The decrypt
    path was left without the equivalent guard.
    
    Mirror the encrypt-side fix on the decrypt path: take a net reference
    before crypto_aead_decrypt() (failing with -ENODEV and the matching
    bearer put if it cannot be acquired), keep it across the
    -EINPROGRESS/-EBUSY async return, and drop it with put_net() on the
    synchronous success/error return and at the end of
    tipc_aead_decrypt_done().
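
    The reference handoff described above can be modeled in user space.
    This is a minimal sketch under stated assumptions, not the kernel
    patch: struct ns, ns_get(), ns_put(), decrypt_submit() and
    decrypt_done() are hypothetical stand-ins for the netns,
    maybe_get_net(), put_net(), tipc_aead_decrypt() and
    tipc_aead_decrypt_done(), and the asynchronous engine is simulated by
    a flag.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>

/* Hypothetical stand-in for a refcounted namespace object. */
struct ns {
    int refs;   /* 0 means teardown has already begun */
};

/* Models maybe_get_net(): refuses once the count has reached zero. */
static bool ns_get(struct ns *ns)
{
    if (ns->refs == 0)
        return false;
    ns->refs++;
    return true;
}

/* Models put_net(). */
static void ns_put(struct ns *ns)
{
    assert(ns->refs > 0);
    ns->refs--;
}

/* Simulated crypto call: -EINPROGRESS when offloaded, 0 when synchronous. */
static int engine_decrypt(bool async)
{
    return async ? -EINPROGRESS : 0;
}

/* Submit side: pin the namespace before the operation can go async. */
static int decrypt_submit(struct ns *ns, bool async)
{
    int rc;

    if (!ns_get(ns))
        return -ENODEV;

    rc = engine_decrypt(async);
    if (rc == -EINPROGRESS || rc == -EBUSY)
        return rc;      /* the completion callback now owns the reference */

    ns_put(ns);         /* synchronous result: drop the reference here */
    return rc;
}

/* Completion side: runs later and releases the reference it inherited. */
static void decrypt_done(struct ns *ns)
{
    /* ... state owned by ns may be read safely up to this point ... */
    ns_put(ns);
}
```

    The point of the pattern is that exactly one party drops the
    reference: the submitter on a synchronous return, the callback
    otherwise.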
    
    Reproduced under KASAN on v7.1-rc7: a UDP bearer with a cluster key is
    flooded with crafted encrypted frames from an unknown peer (driving the
    cluster-key decrypt path) while the bearer's netns is repeatedly torn
    down. The completion must run asynchronously to outlive
    tipc_crypto_stop(); on x86 the stock aesni gcm(aes) now decrypts
    synchronously, so the async path was exercised via cryptd offload. The
    unguarded aead->crypto dereference in tipc_aead_decrypt_done() is the
    unpatched upstream path; tipc_aead_decrypt() still lacks
    maybe_get_net(aead->crypto->net), so the completion can outlive the free
    on any config where crypto_aead_decrypt() goes async.
    
    Found by 0sec automated security-research tooling (https://0sec.ai).
    
    Fixes: fc1b6d6de220 ("tipc: introduce TIPC encryption & authentication")
    Cc: [email protected]
    Signed-off-by: Doruk Tan Ozturk <[email protected]>
    Reviewed-by: Alexander Lobakin <[email protected]>
    Reviewed-by: Tung Nguyen <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

vc_screen: fix null-ptr-deref in vcs_notifier() during concurrent vcs_write [+ + +]

Author: Yi Yang <[email protected]>
Date:   Thu Jun 4 06:07:34 2026 +0000

    vc_screen: fix null-ptr-deref in vcs_notifier() during concurrent vcs_write
    
    commit a287620312dc6dcb9a093417a0e589bf30fcf38a upstream.
    
    A KASAN null-ptr-deref was observed in vcs_notifier():
    
    BUG: KASAN: null-ptr-deref in vcs_notifier+0x98/0x130
    Read of size 2 at addr (address missing from the captured log)
    
    The issue is a race condition in vcs_write(). When the console_lock is
    temporarily dropped (to copy data from userspace), the vc_data pointer
    obtained from vcs_vc() may become stale. After re-acquiring the lock,
    vcs_vc() is called again to re-validate the pointer. If the vc has been
    deallocated in the meantime, vcs_vc() returns NULL, and the while loop
    breaks (with written > 0). However, after the loop, vcs_scr_updated(vc)
    is still called with the now-NULL vc pointer, leading to a null pointer
    dereference in the notifier chain (vcs_notifier dereferences param->vc).
    
    Fix this by adding a NULL check for vc before calling vcs_scr_updated().
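
    The control flow can be illustrated with a small user-space model.
    This is only a sketch: struct console, console_lookup() and
    write_chunks() are hypothetical names standing in for struct vc_data,
    vcs_vc() and vcs_write(), and the exact form of the kernel's check is
    assumed rather than copied from the patch.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for the console object. */
struct console {
    int updates;    /* number of "screen updated" notifications */
};

/* Models the revalidating lookup: NULL once the console is gone. */
static struct console *console_lookup(struct console *slot, bool alive)
{
    return alive ? slot : NULL;
}

/*
 * Writes up to 'chunks' chunks. The console disappears once
 * 'dealloc_after' chunks have been written.
 */
static int write_chunks(struct console *slot, int chunks, int dealloc_after)
{
    struct console *c = NULL;
    int written = 0;

    while (written < chunks) {
        /* The lock is dropped for the copy and retaken: revalidate. */
        c = console_lookup(slot, written < dealloc_after);
        if (!c)
            break;      /* partial write; c is NULL from here on */
        written++;
    }

    /* The fix: only notify when the pointer is still valid. */
    if (written && c)
        c->updates++;

    return written;
}
```

    Without the "&& c" test, the partial-write case would dereference a
    NULL pointer after the loop.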
    
    Fixes: 8fb9ea65c9d1 ("vc_screen: reload load of struct vc_data pointer in vcs_write() to avoid UAF")
    Cc: [email protected]
    Signed-off-by: Yi Yang <[email protected]>
    Reviewed-by: Jiri Slaby <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

virtiofs: fix UAF on submount umount [+ + +]

Author: Miklos Szeredi <[email protected]>
Date:   Thu May 28 10:58:24 2026 +0200

    virtiofs: fix UAF on submount umount
    
    commit 06b41351779e9289e8785694ade9042ae85e41ea upstream.
    
    iput() called from fuse_release_end() can Oops if the super block has
    already been destroyed.  Normally this is prevented by waiting for
    num_waiting to go down to zero before commencing with super block shutdown.
    
    This only works, however, for the last submount instance, as the wait
    counter is per connection, not per superblock.
    
    Revert to using synchronous release requests for the auto_submounts case,
    which is virtiofs only at this time.
    
    Reported-by: Aurélien Bombo <[email protected]>
    Reported-by: Zhihao Cheng <[email protected]>
    Cc: Greg Kurz <[email protected]>
    Closes: https://github.com/kata-containers/kata-containers/issues/12589
    Fixes: 26e5c67deb2e ("fuse: fix livelock in synchronous file put from fuseblk workers")
    Cc: [email protected]
    Reviewed-by: Greg Kurz <[email protected]>
    Signed-off-by: Miklos Szeredi <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

wifi: ath11k: fix warning when unbinding [+ + +]

Author: Jose Ignacio Tornos Martinez <[email protected]>
Date:   Mon Apr 20 13:01:29 2026 +0200

    wifi: ath11k: fix warning when unbinding
    
    commit 8b7a26b6681922a38cd5a7829ace61f8e54df9b7 upstream.
    
    If there is an error during some initialization related to firmware,
    the buffers dp->tx_ring[i].tx_status are released.
    However, they are released again when the device is unbound (ath11k_pci),
    and we get:
    WARNING: CPU: 0 PID: 6231 at mm/slub.c:4368 free_large_kmalloc+0x57/0x90
    Call Trace:
    free_large_kmalloc
    ath11k_dp_free
    ath11k_core_deinit
    ath11k_pci_remove
    ...
    
    The issue is always reproducible from a VM because the MSI addressing
    initialization is failing.
    
    To fix the issue, set the buffers to NULL after releasing them, which
    avoids the double free.
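
    The free-then-NULL idiom can be sketched in user space as follows.
    The types and the dp_alloc()/dp_free() helpers are hypothetical and
    use malloc()/free() in place of the kernel allocator; they are not
    the ath11k code.

```c
#include <assert.h>
#include <stdlib.h>

#define NUM_RINGS 3

/* Hypothetical stand-ins for the per-ring status buffers. */
struct ring {
    void *status;
};

struct dp {
    struct ring tx_ring[NUM_RINGS];
};

/* Releases every status buffer; returns how many were actually freed. */
static int dp_free(struct dp *dp)
{
    int freed = 0;

    for (int i = 0; i < NUM_RINGS; i++) {
        if (dp->tx_ring[i].status)
            freed++;
        free(dp->tx_ring[i].status);    /* free(NULL) is a no-op */
        dp->tx_ring[i].status = NULL;   /* a second call finds nothing */
    }
    return freed;
}

/* Allocates one status buffer per ring; cleans up on failure. */
static int dp_alloc(struct dp *dp, size_t size)
{
    for (int i = 0; i < NUM_RINGS; i++)
        dp->tx_ring[i].status = NULL;

    for (int i = 0; i < NUM_RINGS; i++) {
        dp->tx_ring[i].status = malloc(size);
        if (!dp->tx_ring[i].status) {
            dp_free(dp);
            return -1;
        }
    }
    return 0;
}
```

    Because each pointer is cleared after release, calling dp_free() from
    both an error path and a later teardown path is harmless.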
    
    Fixes: d5c65159f289 ("ath11k: driver for Qualcomm IEEE 802.11ax devices")
    Cc: [email protected]
    Signed-off-by: Jose Ignacio Tornos Martinez <[email protected]>
    Reviewed-by: Baochen Qiang <[email protected]>
    Reviewed-by: Rameshkumar Sundaram <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jeff Johnson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

wifi: mt76: mt76x2u: Add support for ELECOM WDC-867SU3S [+ + +]

Author: Zenm Chen <[email protected]>
Date:   Tue Apr 7 23:44:30 2026 +0800

    wifi: mt76: mt76x2u: Add support for ELECOM WDC-867SU3S
    
    commit f4ce0664e9f0387873b181777891741c33e19465 upstream.
    
    Add the ID 056e:400a to the table to support an additional MT7612U
    adapter: ELECOM WDC-867SU3S.
    
    Compile tested only.
    
    Cc: [email protected] # 5.10.x
    Signed-off-by: Zenm Chen <[email protected]>
    Acked-by: Lorenzo Bianconi <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Felix Fietkau <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

wifi: rtlwifi: rtl8821ae: Fix C2H bit location in RX descriptor [+ + +]

Author: Bitterblue Smith <[email protected]>
Date:   Sat Apr 25 22:32:58 2026 +0300

    wifi: rtlwifi: rtl8821ae: Fix C2H bit location in RX descriptor
    
    commit 83d38df6929118c3f996b9e3351c2d5014073d87 upstream.
    
    Bit 28 of double word 2 in the RX descriptor indicates if the packet is
    a normal 802.11 frame, or a message from the wifi firmware to the
    driver (Card 2 Host).
    
    Commit f5678bfe1cdc ("rtlwifi: rtl8821ae: Replace local bit manipulation
    macros") mistakenly made the driver look for this bit in double word 1,
    causing packet loss and Bluetooth coexistence problems.
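
    The difference between the two bit locations can be shown with a
    short sketch. It assumes a little-endian descriptor made of 32-bit
    double words; desc_dword() and rx_is_c2h() are hypothetical helpers,
    not the driver's accessors.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define C2H_BIT (UINT32_C(1) << 28)

/* Reads little-endian double word 'index' from a raw descriptor. */
static uint32_t desc_dword(const uint8_t *desc, unsigned int index)
{
    const uint8_t *p = desc + 4 * index;

    return (uint32_t)p[0] |
           (uint32_t)p[1] << 8 |
           (uint32_t)p[2] << 16 |
           (uint32_t)p[3] << 24;
}

/* True for a firmware-to-driver message: bit 28 of double word 2. */
static bool rx_is_c2h(const uint8_t *desc)
{
    return (desc_dword(desc, 2) & C2H_BIT) != 0;
}
```

    Bit 28 of double word 2 lives in byte 11 of the descriptor (mask
    0x10). Reading double word 1 instead would test byte 7, which
    carries unrelated fields.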
    
    Fixes: f5678bfe1cdc ("rtlwifi: rtl8821ae: Replace local bit manipulation macros")
    Cc: <[email protected]>
    Signed-off-by: Bitterblue Smith <[email protected]>
    Acked-by: Ping-Ke Shih <[email protected]>
    Signed-off-by: Ping-Ke Shih <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>