scsi_eh_* process using idle CPU after upgrade to kernel 6.5

Bug #2048945 reported by Andrew Reis
18
This bug affects 3 people
Affects Status Importance Assigned to Milestone
linux
Confirmed
High
linux (Ubuntu)
Confirmed
Undecided
Unassigned
linux-aws-6.5 (Ubuntu)
Confirmed
Undecided
Unassigned
linux-hwe-6.5 (Ubuntu)
Confirmed
Undecided
Unassigned
linux-meta-hwe-6.5 (Ubuntu)
Confirmed
Undecided
Unassigned
linux-signed-hwe-6.5 (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

Possibly related to https://lkml.kernel<email address hidden>/T/

Getting alerts from monitoring tools about high average CPU usage. Machine reporting the following:

root@server:~# uptime
 17:04:34 up 14 days, 6:33, 2 users, load average: 15.66, 15.88, 15.94

root@server:~# top -d 1

top - 17:04:49 up 14 days, 6:33, 2 users, load average: 15.59, 15.85, 15.93
Tasks: 223 total, 1 running, 222 sleeping, 0 stopped, 0 zombie
%Cpu(s): 0.7 us, 1.4 sy, 0.0 ni, 49.3 id, 48.6 wa, 0.0 hi, 0.0 si, 0.0 st
MiB Mem : 7896.6 total, 1772.6 free, 1163.9 used, 4960.0 buff/cache
MiB Swap: 2048.0 total, 1784.6 free, 263.4 used. 6431.6avail Mem

    PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
     88 root 20 0 0 0 0 D 55.4 0.0 11275:46 scsi_eh_1

root@server:~# lsb_release -rd
Description: Ubuntu 22.04.3 LTS
Release: 22.04

root@server:~# apt-cache policy linux-image-virtual-hwe-22.04-edge
linux-image-virtual-hwe-22.04-edge:
  Installed: 6.5.0.14.14~22.04.6
  Candidate: 6.5.0.14.14~22.04.7
  Version table:
     6.5.0.14.14~22.04.7 500
        500 http://us.archive.ubuntu.com/ubuntu jammy-updates/main amd64 Packages
        500 http://security.ubuntu.com/ubuntu jammy-security/main amd64 Packages
 *** 6.5.0.14.14~22.04.6 100
        100 /var/lib/dpkg/status
     5.15.0.25.27 500
        500 http://us.archive.ubuntu.com/ubuntu jammy/main amd64 Packages

ProblemType: Bug
DistroRelease: Ubuntu 22.04
Package: linux-image-6.5.0-14-generic 6.5.0-14.14~22.04.1
ProcVersionSignature: Ubuntu 6.5.0-14.14~22.04.1-generic 6.5.3
Uname: Linux 6.5.0-14-generic x86_64
ApportVersion: 2.20.11-0ubuntu82.5
Architecture: amd64
CasperMD5CheckResult: unknown
Date: Wed Jan 10 17:02:50 2024
ProcEnviron:
 TERM=xterm-256color
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
SourcePackage: linux-signed-hwe-6.5
UpgradeStatus: Upgraded to jammy on 2023-05-02 (253 days ago)
---
ProblemType: Bug
ApportVersion: 2.20.11-0ubuntu82.5
Architecture: amd64
CasperMD5CheckResult: unknown
DistroRelease: Ubuntu 22.04
Package: linux-image-virtual-hwe-22.04-edge 6.5.0.14.14~22.04.6
PackageArchitecture: amd64
ProcEnviron:
 TERM=xterm-256color
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcVersionSignature: Ubuntu 6.5.0-14.14~22.04.1-generic 6.5.3
Tags: jammy third-party-packages
Uname: Linux 6.5.0-14-generic x86_64
UpgradeStatus: Upgraded to jammy on 2023-05-02 (253 days ago)
UserGroups: N/A
_MarkForUpload: True
---
ProblemType: Bug
AlsaDevices:
 total 0
 crw-rw---- 1 root audio 116, 1 Dec 27 10:31 seq
 crw-rw---- 1 root audio 116, 33 Dec 27 10:31 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
ApportVersion: 2.20.11-0ubuntu82.5
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
CRDA: N/A
CasperMD5CheckResult: unknown
DistroRelease: Ubuntu 22.04
HibernationDevice: #RESUME=UUID=9bf7612f-b07b-4adf-9760-0c14b8d7d6fc
IwConfig:
 lo no wireless extensions.

 eth0 no wireless extensions.
Lsusb: Error: command ['lsusb'] failed with exit code 1:
Lsusb-t:

Lsusb-v: Error: command ['lsusb', '-v'] failed with exit code 1:
MachineType: VMware, Inc. VMware Virtual Platform
Package: linux (not installed)
PciMultimedia:

ProcEnviron:
 TERM=xterm-256color
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcFB: 0 vmwgfxdrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.5.0-14-generic root=UUID=07ffde4f-b32f-41fe-81dd-4fe4b186dab4 ro
ProcVersionSignature: Ubuntu 6.5.0-14.14~22.04.1-generic 6.5.3
RelatedPackageVersions:
 linux-restricted-modules-6.5.0-14-generic N/A
 linux-backports-modules-6.5.0-14-generic N/A
 linux-firmware 20220329.git681281e4-0ubuntu3.23
RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
Tags: jammy
Uname: Linux 6.5.0-14-generic x86_64
UpgradeStatus: Upgraded to jammy on 2023-05-02 (253 days ago)
UserGroups: N/A
_MarkForUpload: True
dmi.bios.date: 11/12/2020
dmi.bios.release: 4.6
dmi.bios.vendor: Phoenix Technologies LTD
dmi.bios.version: 6.00
dmi.board.name: 440BX Desktop Reference Platform
dmi.board.vendor: Intel Corporation
dmi.board.version: None
dmi.chassis.asset.tag: No Asset Tag
dmi.chassis.type: 1
dmi.chassis.vendor: No Enclosure
dmi.chassis.version: N/A
dmi.ec.firmware.release: 0.0
dmi.modalias: dmi:bvnPhoenixTechnologiesLTD:bvr6.00:bd11/12/2020:br4.6:efr0.0:svnVMware,Inc.:pnVMwareVirtualPlatform:pvrNone:rvnIntelCorporation:rn440BXDesktopReferencePlatform:rvrNone:cvnNoEnclosure:ct1:cvrN/A:sku:
dmi.product.name: VMware Virtual Platform
dmi.product.version: None
dmi.sys.vendor: VMware, Inc.
---
ProblemType: Bug
ApportVersion: 2.20.11-0ubuntu82.5
Architecture: amd64
CasperMD5CheckResult: unknown
DistroRelease: Ubuntu 22.04
Package: linux-image-virtual-hwe-22.04-edge 6.5.0.17.17~22.04.9 [origin: unknown]
PackageArchitecture: amd64
ProcEnviron:
 TERM=xterm-256color
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcVersionSignature: Ubuntu 6.5.0-17.17~22.04.1-generic 6.5.8
Tags: jammy third-party-packages
Uname: Linux 6.5.0-17-generic x86_64
UnreportableReason: This does not seem to be an official Ubuntu package. Please retry after updating the indexes of available packages, if that does not work then remove related third party packages and try again.
UpgradeStatus: Upgraded to jammy on 2023-05-02 (266 days ago)
UserGroups: N/A
_MarkForUpload: True
---
ProblemType: Bug
ApportVersion: 2.20.11-0ubuntu82.5
Architecture: amd64
CasperMD5CheckResult: unknown
DistroRelease: Ubuntu 22.04
Package: linux-image-6.5.0-17-generic 6.5.0-17.17~22.04.1 [origin: unknown]
PackageArchitecture: amd64
ProcEnviron:
 TERM=xterm-256color
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcVersionSignature: Ubuntu 6.5.0-17.17~22.04.1-generic 6.5.8
Tags: jammy third-party-packages
Uname: Linux 6.5.0-17-generic x86_64
UnreportableReason: This does not seem to be an official Ubuntu package. Please retry after updating the indexes of available packages, if that does not work then remove related third party packages and try again.
UpgradeStatus: Upgraded to jammy on 2023-05-02 (266 days ago)
UserGroups: N/A
_MarkForUpload: True

Revision history for this message
In , laktak (laktak-linux-kernel-bugs) wrote :

Overview:

Several users report that after upgrading from 6.4.12 to 6.5 the process scsi_eh_1 will constantly consume >10% CPU resources. This happens most often in VMs.

Steps to Reproduce:

- Create a VM (e.g. VMware Fusion)
- Create an SCSI disk
- Connect a virtual CD ROM (IDE)
- Boot a 6.4.12 kernel
- Boot a 6.5 kernel

Actual Results:

- no issues for the 6.4.12 kernel
- scsi_eh consumes too much CPU with the 6.5 kernel

Expected Results:

scsi_eh should not consume significant resources.

Build Date & Hardware:

Linux arch 6.5.2-arch1-1 #1 SMP PREEMPT_DYNAMIC Wed, 06 Sep 2023 21:01:01 +0000 x86_64 GNU/Linux
inside a VMware Fusion VM

Additional Builds and Platforms:

Other users were able to reproduce the error on bare metal hardware.

Additional Information:

More details can be found in this thread:
https://bbs.archlinux.org/viewtopic.php?id=288723

The users loqs and leonshaw helped to narrow it down to this commit:

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=624885209f31eb9985bf51abe204ecbffe2fdeea

good: 6.4.0-rc1-1-00007-g152e52fb6ff1
bad: 6.4.0-rc1-1-00008-g624885209f31

Revision history for this message
In , bvanassche (bvanassche-linux-kernel-bugs) wrote :

On 9/15/23 12:33, <email address hidden> wrote:
> The users loqs and leonshaw helped to narrow it down to this commit:
>
>
> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=624885209f31eb9985bf51abe204ecbffe2fdeea

Damien, can you please take a look?

Thanks,

Bart.

Revision history for this message
In , loberman (loberman-linux-kernel-bugs) wrote :

On Fri, 2023-09-15 at 13:42 -0700, Bart Van Assche wrote:
> On 9/15/23 12:33, <email address hidden> wrote:
> > The users loqs and leonshaw helped to narrow it down to this
> > commit:
> >
> >
> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=624885209f31eb9985bf51abe204ecbffe2fdeea
>
> Damien, can you please take a look?
>
> Thanks,
>
> Bart.
>
I had a quick look at this and its not making sense. The only calls I
see are in the scan when the devices is added and in the re-scan.
It should not be consuming the scsi_eh thread unless some type of udev
events keeps happening.

Would be good to get some
cat /proc/<PID>/stack of the scsi_eh threads if they are constantly
consuming CPUY

I will try reproduce and try figure out what is going on here.

Thanks
Laurence

Revision history for this message
In , loberman (loberman-linux-kernel-bugs) wrote :

Not reproducible generically for me

[root@penguin8 ~]# uname -a
Linux penguin8 6.5.0+ #2 SMP PREEMPT_DYNAMIC

[root@penguin8 ~]# lsscsi
[0:0:0:0] disk ATA Samsung SSD 850 3B6Q /dev/sdb
[1:0:0:0] disk ATA Samsung SSD 850 3B6Q /dev/sda

USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
root 1649 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_0]
root 1651 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_1]
root 1653 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_2]
root 1655 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_3]
root 1668 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_4]
root 1670 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_5]
root 1672 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_6]
root 1674 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_7]
root 1866 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_8]
root 1887 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_9]

root 1649 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_0]
root 1651 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_1]
root 1653 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_2]
root 1655 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_3]
root 1668 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_4]
root 1670 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_5]
root 1672 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_6]
root 1674 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_7]
root 1866 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_8]
root 1887 0.0 0.0 0 0 ? S 16:58 0:00 [scsi_eh_9]

I Have no CDROm so I think its the virtual cdrom.
In VMware the CDROM will continuously get probed and log errors due to no media and every time that happens it will call the cdl stuff.

I will bring up a Virtual guest now, will take time as I will have to build upstream kernels.

Revision history for this message
In , Niklas.Cassel (niklas.cassel-linux-kernel-bugs) wrote :

On Fri, Sep 15, 2023 at 01:42:18PM -0700, Bart Van Assche wrote:
> On 9/15/23 12:33, <email address hidden> wrote:
> > The users loqs and leonshaw helped to narrow it down to this commit:
> >
> >
> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=624885209f31eb9985bf51abe204ecbffe2fdeea
>
> Damien, can you please take a look?
>

Hello Bart,

It seems like:
https://<email address hidden>/

Solves the problem.

From a quick look at the logs with extra log leves enabled:
https://pastebin.com/f2LQ8kQD
it appears that the MAINTENANCE_IN / MI_REPORT_SUPPORTED_OPERATION_CODES
command with a non-zero service action issued by scsi_cdl_check() fails,
and will be added to SCSI EH over and over.

Kind regards,
Niklas

Revision history for this message
In , dlemoal (dlemoal-linux-kernel-bugs) wrote :

On 9/16/23 07:01, Niklas Cassel wrote:
> On Fri, Sep 15, 2023 at 01:42:18PM -0700, Bart Van Assche wrote:
>> On 9/15/23 12:33, <email address hidden> wrote:
>>> The users loqs and leonshaw helped to narrow it down to this commit:
>>>
>>>
>>> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=624885209f31eb9985bf51abe204ecbffe2fdeea
>>
>> Damien, can you please take a look?
>>
>
> Hello Bart,
>
> It seems like:
>
> https://<email address hidden>/
>
> Solves the problem.
>
> From a quick look at the logs with extra log leves enabled:
> https://pastebin.com/f2LQ8kQD
> it appears that the MAINTENANCE_IN / MI_REPORT_SUPPORTED_OPERATION_CODES
> command with a non-zero service action issued by scsi_cdl_check() fails,
> and will be added to SCSI EH over and over.

The failure is due to the drive not liking this command. My patch avoids sending
that command, thus solves the issue with drives that choke on it. However, the
constant retry sound to me like a different bug... We should not retry that
command at all I think. Or maybe limit it to 3 retries.

>
>
> Kind regards,
> Niklas

Revision history for this message
In , dlemoal (dlemoal-linux-kernel-bugs) wrote :

On 9/16/23 07:01, Niklas Cassel wrote:
> On Fri, Sep 15, 2023 at 01:42:18PM -0700, Bart Van Assche wrote:
>> On 9/15/23 12:33, <email address hidden> wrote:
>>> The users loqs and leonshaw helped to narrow it down to this commit:
>>>
>>>
>>> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=624885209f31eb9985bf51abe204ecbffe2fdeea
>>
>> Damien, can you please take a look?
>>
>
> Hello Bart,
>
> It seems like:
>
> https://<email address hidden>/
>
> Solves the problem.
>
> From a quick look at the logs with extra log leves enabled:
> https://pastebin.com/f2LQ8kQD
> it appears that the MAINTENANCE_IN / MI_REPORT_SUPPORTED_OPERATION_CODES
> command with a non-zero service action issued by scsi_cdl_check() fails,
> and will be added to SCSI EH over and over.

Looks like the vmware emulated scsi cdrom (sr) does not like this command...
While SPC would allow cdroms to support CDL, I do not think we will ever see
that. So we could restrict CDL probe to block devices only. That still does not
explain why the constant retry. The MAINTENANCE_IN /
MI_REPORT_SUPPORTED_OPERATION_CODES failing is expected in most cases so it
should silently move on with cdl probe returning false. My patch is still needed
as some drives seem to hang on that command.

>
>
> Kind regards,
> Niklas

Revision history for this message
In , kernel (kernel-linux-kernel-bugs) wrote :
Download full text (5.4 KiB)

Noticed the same here when upgrading from 6.1.0-13-amd64 to 6.5.0-0.deb12.1-amd64 (both Debian kernels) earlier this month:

============================================================================
$ sar -f /var/log/sysstat/sa20231108
                CPU %user %nice %system %iowait %steal %idle
[...]
18:30:03 all 1.03 0.00 0.45 0.04 0.00 98.47
18:40:01 all 1.07 0.00 0.52 0.05 0.00 98.36
18:50:01 all 1.07 0.00 0.53 0.04 0.00 98.37
19:00:01 all 1.35 0.00 0.69 0.08 0.00 97.88
19:10:04 all 1.09 0.00 0.52 0.07 0.00 98.31
19:20:02 all 1.14 0.00 0.51 0.05 0.00 98.30
19:30:04 all 1.62 0.00 0.65 0.08 0.00 97.65
Average: all 1.06 0.00 0.50 0.06 0.00 98.38

19:32:27 LINUX RESTART (2 CPU)

19:40:03 CPU %user %nice %system %iowait %steal %idle
19:50:00 all 2.27 0.00 3.23 57.40 0.00 37.11
20:00:02 all 1.29 0.00 2.70 59.27 0.00 36.75
20:10:03 all 1.48 0.00 2.93 58.38 0.00 37.21
20:20:03 all 1.40 0.00 2.94 58.93 0.00 36.73
20:30:02 all 1.39 0.00 2.87 59.99 0.00 35.74
20:40:03 all 1.48 0.00 3.44 59.83 0.00 35.26
20:50:00 all 1.29 0.00 2.88 60.84 0.00 34.98
21:00:03 all 1.31 0.00 2.63 59.81 0.00 36.25
21:10:03 all 1.33 0.00 2.72 59.85 0.00 36.09
21:20:01 all 1.31 0.00 2.82 59.28 0.00 36.59
21:30:01 all 1.39 0.00 2.92 60.51 0.00 35.18
21:40:01 all 1.34 0.00 3.04 60.04 0.00 35.57
21:50:03 all 1.29 0.00 2.51 59.79 0.00 36.41
22:00:03 all 1.36 0.00 3.23 59.81 0.00 35.59
22:10:03 all 1.37 0.00 2.56 59.13 0.00 36.93
22:20:03 all 1.36 0.00 2.88 58.46 0.00 37.29
22:30:03 all 1.31 0.00 2.65 59.07 0.00 36.97
22:40:00 all 1.32 0.00 2.72 59.61 0.00 36.35
22:50:01 all 1.32 0.00 2.72 59.35 0.00 36.61
23:00:03 all 1.29 0.00 2.68 59.30 0.00 36.72
23:10:03 all 1.35 0.00 2.62 60.11 0.00 35.91
23:20:02 all 1.29 0.00 2.91 59.55 0.00 36.25
23:30:03 all 1.32 0.00 2.72 58.37 0.00 37.59
23:40:01 all 1.34 0.00 2.97 57.74 0.00 37.95
23:50:00 all 1.33 0.00 2.54 59.90 0.00 36.24
Average: all 1.38 0.00 2.83 59.37 0.00 36.41

$ last -n 3 reboot
reboot syste...

Read more...

Revision history for this message
In , Niklas.Cassel (niklas.cassel-linux-kernel-bugs) wrote :
Download full text (6.3 KiB)

On Mon, Nov 13, 2023 at 03:30:57AM +0000, <email address hidden> wrote:
> https://bugzilla.kernel.org/show_bug.cgi?id=217914
>
> Christian Kujau (<email address hidden>) changed:
>
> What |Removed |Added
> ----------------------------------------------------------------------------
> CC| |<email address hidden>
>
> --- Comment #7 from Christian Kujau (<email address hidden>) ---
> Noticed the same here when upgrading from 6.1.0-13-amd64 to
> 6.5.0-0.deb12.1-amd64 (both Debian kernels) earlier this month:
>
> ============================================================================
> $ sar -f /var/log/sysstat/sa20231108
> CPU %user %nice %system %iowait %steal
> %idle
> [...]
> 18:30:03 all 1.03 0.00 0.45 0.04 0.00
> 98.47
> 18:40:01 all 1.07 0.00 0.52 0.05 0.00
> 98.36
> 18:50:01 all 1.07 0.00 0.53 0.04 0.00
> 98.37
> 19:00:01 all 1.35 0.00 0.69 0.08 0.00
> 97.88
> 19:10:04 all 1.09 0.00 0.52 0.07 0.00
> 98.31
> 19:20:02 all 1.14 0.00 0.51 0.05 0.00
> 98.30
> 19:30:04 all 1.62 0.00 0.65 0.08 0.00
> 97.65
> Average: all 1.06 0.00 0.50 0.06 0.00
> 98.38
>
> 19:32:27 LINUX RESTART (2 CPU)
>
> 19:40:03 CPU %user %nice %system %iowait %steal
> %idle
> 19:50:00 all 2.27 0.00 3.23 57.40 0.00
> 37.11
> 20:00:02 all 1.29 0.00 2.70 59.27 0.00
> 36.75
> 20:10:03 all 1.48 0.00 2.93 58.38 0.00
> 37.21
> 20:20:03 all 1.40 0.00 2.94 58.93 0.00
> 36.73
> 20:30:02 all 1.39 0.00 2.87 59.99 0.00
> 35.74
> 20:40:03 all 1.48 0.00 3.44 59.83 0.00
> 35.26
> 20:50:00 all 1.29 0.00 2.88 60.84 0.00
> 34.98
> 21:00:03 all 1.31 0.00 2.63 59.81 0.00
> 36.25
> 21:10:03 all 1.33 0.00 2.72 59.85 0.00
> 36.09
> 21:20:01 all 1.31 0.00 2.82 59.28 0.00
> 36.59
> 21:30:01 all 1.39 0.00 2.92 60.51 0.00
> 35.18
> 21:40:01 all 1.34 0.00 3.04 60.04 0.00
> 35.57
> 21:50:03 all 1.29 0.00 2.51 59.79 0.00
> 36.41
> 22:00:03 all 1.36 0.00 3.23 59.81 0.00
> 35.59
> 22:10:03 all 1.37 0.00 2.56 59.13 0.00
> 36.93
> 22:20:03 all 1.36 0.00 2.88 58.46 0.00
> 37.29
> 22:30:03 all 1.31 0.00 2.65 59.07 0.00
> 36.97
> 22:40:00 all 1.32 0.00 2.72 59.61 0.00
> 36.35
> 22:50:01 all 1.32 0.00 2.72 59.35...

Read more...

Revision history for this message
In , kernel (kernel-linux-kernel-bugs) wrote :

Oh, great! I was only searching for the subject line, not the actual changes. So, I just have to wait for Debian (Backports) to move to v6.6 then :-) Thanks! This Bugzilla entry can be closed then, I assume.

Revision history for this message
In , pedretti.fabio (pedretti.fabio-linux-kernel-bugs) wrote :

"scsi: core: ata: Do no try to probe for CDL on old drives" -> it's also on 6.5.6.

Revision history for this message
Andrew Reis (areis422) wrote :
Revision history for this message
Andrew Reis (areis422) wrote : Dependencies.txt
tags: added: apport-collected third-party-packages
description: updated
Revision history for this message
Andrew Reis (areis422) wrote : ProcCpuinfoMinimal.txt
description: updated
Revision history for this message
Andrew Reis (areis422) wrote : CurrentDmesg.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote : Lspci.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote : Lspci-vt.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote : ProcCpuinfo.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote : ProcCpuinfoMinimal.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote : ProcInterrupts.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote : ProcModules.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote : UdevDb.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote : WifiSyslog.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote : acpidump.txt

apport information

Changed in kernel:
importance: Unknown → High
status: Unknown → Confirmed
Revision history for this message
Roxana Nicolescu (roxanan) wrote :

This should be solved in the lastest jammy:hwe-6.5 version. It's gonna be in -proposed these days.

Revision history for this message
Andrew Reis (areis422) wrote : Dependencies.txt

apport information

description: updated
Revision history for this message
Andrew Reis (areis422) wrote : ProcCpuinfoMinimal.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote :

Updated to 6.5.0-17 and the issue still persists.

user@host:~$ uptime
 11:26:35 up 1:15, 1 user, load average: 13.68, 13.67, 13.64

description: updated
Revision history for this message
Andrew Reis (areis422) wrote : Dependencies.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote : ProcCpuinfoMinimal.txt

apport information

Revision history for this message
Andrew Reis (areis422) wrote :

Any update on this?

Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux (Ubuntu):
status: New → Confirmed
Changed in linux-aws-6.5 (Ubuntu):
status: New → Confirmed
Changed in linux-hwe-6.5 (Ubuntu):
status: New → Confirmed
Changed in linux-meta-hwe-6.5 (Ubuntu):
status: New → Confirmed
Changed in linux-signed-hwe-6.5 (Ubuntu):
status: New → Confirmed
Revision history for this message
ku4eto (ku4eto) wrote :

Why is this still not getting backported here?

Revision history for this message
Andrew Reis (areis422) wrote :

@Roxanan,

When is the backport coming? I've tested the latest kernel from proposed and the issue still persists.

Revision history for this message
Andrew Reis (areis422) wrote :

Bump. Is anyone looking at this at all?

Revision history for this message
ku4eto (ku4eto) wrote (last edit ):

Issue seems to have been fixed, with the bug report not being mentioned in the changelog.
Tested on Ubuntu 22.04, with 6.5.0-28-generic, vSphere 6.0, ESXi 6.0, IDE CD/DVD as Client Device.
I guess this can be closed.

@areis422 Can you test and confirm on your side as well?

Revision history for this message
Andrew Reis (areis422) wrote : Re: [Bug 2048945] Re: scsi_eh_* process using idle CPU after upgrade to kernel 6.5
Download full text (10.2 KiB)

Confirmed. And it also looks like the fix made it into the 6.8 kernel for noble.

From: <email address hidden> <email address hidden> on behalf of ku4eto <email address hidden>
Date: Friday, May 3, 2024 at 15:20
To: Drew Reis <email address hidden>
Subject: [Bug 2048945] Re: scsi_eh_* process using idle CPU after upgrade to kernel 6.5
Issue seems to have been fixed, with the bug report not being mentioned in the changelog.
Tested on Ubuntu 22.04, with 6.5.0-28-generic, vSphere 6.5, ESXi 6.5, IDE CD/DVD as Client Device.
I guess this can be closed.

@areis422 Can you test and confirm on your side as well?

--
You received this bug notification because you are subscribed to the bug
report.
https://nam10.safelinks.protection.outlook.com/?url=https%3A%2F%2Fbugs.launchpad.net%2Fbugs%2F2048945&data=05%7C02%7Cdrew_reis%40gensler.com%7Caa2ee77f19b24bd7689908dc6bae78e8%7C94a74758f2ff413c9f705725701b8d02%7C0%7C0%7C638503644324869373%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=5SEDThTBhEDUa%2BPxcBjVPk9Y0Oh9x6R49qET%2FIqA%2BxA%3D&reserved=0<https://bugs.launchpad.net/bugs/2048945>

Title:
  scsi_eh_* process using idle CPU after upgrade to kernel 6.5

Status in linux:
  Confirmed
Status in linux package in Ubuntu:
  Confirmed
Status in linux-aws-6.5 package in Ubuntu:
  Confirmed
Status in linux-hwe-6.5 package in Ubuntu:
  Confirmed
Status in linux-meta-hwe-6.5 package in Ubuntu:
  Confirmed
Status in linux-signed-hwe-6.5 package in Ubuntu:
  Confirmed

Bug description:
  Possibly related to https://nam10.safelinks.protection.outlook.com/?url=https%3A%2F%2Flkml.kernel.org%2Flinux-&data=05%7C02%7Cdrew_reis%40gensler.com%7Caa2ee77f19b24bd7689908dc6bae78e8%7C94a74758f2ff413c9f705725701b8d02%7C0%7C0%7C638503644324879127%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C0%7C%7C%7C&sdata=PkMnglOHo3q9hgHMSOyDsJug1rM0HJJO2YW%2FgtRUtW0%3D&reserved=0<https://lkml.kernel.org/linux->
  <email address hidden>/T/

  Getting alerts from monitoring tools about high average CPU usage.
  Machine reporting the following:

  root@server:~# uptime
   17:04:34 up 14 days, 6:33, 2 users, load average: 15.66, 15.88, 15.94

  root@server:~# top -d 1

  top - 17:04:49 up 14 days, 6:33, 2 users, load average: 15.59, 15.85, 15.93
  Tasks: 223 total, 1 running, 222 sleeping, 0 stopped, 0 zombie
  %Cpu(s): 0.7 us, 1.4 sy, 0.0 ni, 49.3 id, 48.6 wa, 0.0 hi, 0.0 si, 0.0 st
  MiB Mem : 7896.6 total, 1772.6 free, 1163.9 used, 4960.0 buff/cache
  MiB Swap: 2048.0 total, 1784.6 free, 263.4 used. 6431.6avail Mem

      PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
       88 root 20 0 0 0 0 D 55.4 0.0 11275:46 scsi_eh_1

  root@server:~# lsb_release -rd
  Description: Ubuntu 22.04.3 LTS
  Release: 22.04

  root@server:~# apt-cache policy linux-image-virtual-hwe-22.04-edge
  linux-image-virtual-hwe-22.04-edge:
    Installed: 6.5.0.14.14~22.04.6
    Candidate: 6.5.0.14.14~22.04.7
    Version table:
       6.5.0.14.14~22.04.7 500
          5...

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.