Suspend and Hibernate cause ksoftirqd to use 100% of a cpu core on resume

Bug #567660 reported by Cam Cope
This bug affects 9 people
Affects: linux (Ubuntu)
Status: Triaged
Importance: Undecided
Assigned to: Unassigned
Nominated for Lucid by Cam Cope

Bug Description

After I resume from suspend or hibernate, I get lots of interrupts, and the computer runs slow and hot. I would guess the bug is in iwlagn, but I can't tell.

ProblemType: Bug
DistroRelease: Ubuntu 10.04
Package: linux-image-2.6.32-21-generic-pae 2.6.32-21.32
Regression: Yes
Reproducible: Yes
ProcVersionSignature: Ubuntu 2.6.32-21.32-generic-pae 2.6.32.11+drm33.2
Uname: Linux 2.6.32-21-generic-pae i686
NonfreeKernelModules: nvidia
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21.
Architecture: i386
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: cam 1505 F.... pulseaudio
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
 Card hw:0 'Intel'/'HDA Intel at 0xf3500000 irq 22'
   Mixer name : 'Realtek ALC662 rev1'
   Components : 'HDA:10ec0662,15580481,00100101 HDA:10573055,00305557,00100900'
   Controls : 30
   Simple ctrls : 18
Date: Tue Apr 20 22:09:48 2010
HibernationDevice: RESUME=UUID=182062cf-d5a7-42fa-aefe-2bc5807427e8
InstallationMedia: Ubuntu 10.04 "Lucid Lynx" - Beta i386 (20100318)
MachineType: CLEVO CO. M570TU
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-21-generic-pae root=UUID=257d413e-76af-4d9b-a406-0af0a1ece0c6 ro quiet splash
ProcEnviron:
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
RelatedPackageVersions: linux-firmware 1.34
RfKill:
 0: phy0: Wireless LAN
  Soft blocked: no
  Hard blocked: no
SourcePackage: linux
dmi.bios.date: 03/31/2009
dmi.bios.vendor: Phoenix Technologies LTD
dmi.bios.version: 1.00.16
dmi.board.asset.tag: Tag 12345
dmi.board.name: M570TU
dmi.board.vendor: CLEVO CO.
dmi.board.version: Not Applicable
dmi.chassis.asset.tag: No Asset Tag
dmi.chassis.type: 9
dmi.chassis.vendor: No Enclosure
dmi.chassis.version: N/A
dmi.modalias: dmi:bvnPhoenixTechnologiesLTD:bvr1.00.16:bd03/31/2009:svnCLEVOCO.:pnM570TU:pvrNotApplicable:rvnCLEVOCO.:rnM570TU:rvrNotApplicable:cvnNoEnclosure:ct9:cvrN/A:
dmi.product.name: M570TU
dmi.product.version: Not Applicable
dmi.sys.vendor: CLEVO CO.

Jeremy Foshee (jeremyfoshee) wrote :

Hi Cam,

If you could also please test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.

Thanks in advance.

    [This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]

tags: added: kernel-suspend
tags: added: kernel-hibernate
tags: added: kj-triage
Changed in linux (Ubuntu):
status: New → Incomplete
Eder Bastos (riskbreaker927) wrote :

This occurs on my machine as well, which is a desktop with an nForce 630i motherboard / MCP73 chipset. There is no wireless or Intel equipment in my computer at all, so I doubt it is an iwlagn problem.

Cam Cope (ccope) wrote :

This problem does not exist in the mainline kernel.

Cam Cope (ccope)
tags: removed: needs-upstream-testing
Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Changed in linux (Ubuntu):
status: Confirmed → Triaged
tags: added: kernel-power
Shura (shurymury) wrote :

I see the same behavior on my MacBook 5.1

Cam Cope (ccope) wrote :

My laptop has an nvidia 9800M GT, and the MacBook 5.1 also uses an nvidia chipset.

Shura (shurymury) wrote :

It is definitely related to suspend, although my experiments show that it does not happen every time. Worst of all, I am no longer able to suspend once ksoftirqd is consuming 100% of a CPU.

Tasks: 181 total, 3 running, 178 sleeping, 0 stopped, 0 zombie
Cpu(s): 17.7%us, 4.6%sy, 0.0%ni, 67.1%id, 2.3%wa, 0.1%hi, 8.1%si, 0.0%st
Mem: 3645260k total, 1899984k used, 1745276k free, 149636k buffers
Swap: 3999992k total, 0k used, 3999992k free, 1019436k cached

  PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
    4 root 20 0 0 0 0 R 101 0.0 12:04.57 ksoftirqd/0
 5746 root 20 0 145m 52m 21m S 6 1.5 1:21.82 Xorg
 5971 shura 20 0 211m 18m 10m S 2 0.5 0:01.82 gnome-terminal
 7770 shura 20 0 19212 1344 940 R 2 0.0 0:00.01 top
.....

Please let me know what kind of info I could provide.
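
One piece of information that may help here is which softirq class is actually spinning. The following is a minimal sketch, not something taken from this report: it assumes the running kernel exposes /proc/softirqs (a header row of CPUs followed by one row of per-CPU counters per softirq class), and the names parse_softirqs, softirq_deltas and sample are made up for illustration. It takes two snapshots and ranks the classes by how much their counters grew in between.

```python
import time


def parse_softirqs(text):
    """Parse /proc/softirqs content into {softirq_name: [count_per_cpu, ...]}."""
    counts = {}
    for line in text.splitlines()[1:]:  # first line is the CPU header row
        name, sep, rest = line.partition(":")
        if not sep:
            continue  # skip blank or unexpected lines
        counts[name.strip()] = [int(n) for n in rest.split()]
    return counts


def softirq_deltas(before, after):
    """Return (name, increase summed over all CPUs) pairs, largest first."""
    deltas = {
        name: sum(after[name]) - sum(before.get(name, [0]))
        for name in after
    }
    return sorted(deltas.items(), key=lambda item: item[1], reverse=True)


def sample(interval=5.0, path="/proc/softirqs"):
    """Take two snapshots `interval` seconds apart and return the deltas."""
    with open(path) as f:
        before = parse_softirqs(f.read())
    time.sleep(interval)
    with open(path) as f:
        after = parse_softirqs(f.read())
    return softirq_deltas(before, after)
```

Calling sample() shortly after a resume, while ksoftirqd is busy, and attaching the top few entries to the report would show whether the load comes from, say, TIMER, NET_RX or TASKLET work.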

Shura (shurymury) wrote :

I am also ready to run some experiments if I am told what to do.

Shura (shurymury) wrote :

FWIW, the problem does not manifest itself after upgrading to 2.6.32-22.

Dunno if it is fixed - time will show - but so far so good.

Georgi Petrov (gogosofta) wrote :

Is there any progress on this issue?
I've disabled the 'sleep on lid close' option on my F5SL laptop because of the same problem, but it's pretty annoying to have it running and consuming battery when I'm not using it...

Mark Fernandes (typist) wrote :

This issue has not been resolved as of:

#uname -a
Linux adler 2.6.32-24-generic #38-Ubuntu SMP Mon Jul 5 09:20:59 UTC 2010 x86_64 GNU/Linux

#top|head
top - 20:19:03 up 1:24, 3 users, load average: 1.03, 1.15, 0.92
Tasks: 184 total, 2 running, 182 sleeping, 0 stopped, 0 zombie
Cpu(s): 5.8%us, 2.6%sy, 0.1%ni, 75.1%id, 2.2%wa, 0.0%hi, 14.2%si, 0.0%st
Mem: 4055620k total, 1685048k used, 2370572k free, 89380k buffers
Swap: 3858388k total, 0k used, 3858388k free, 695080k cached

  PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
    4 root 20 0 0 0 0 R 98 0.0 17:54.91 ksoftirqd/0
    1 root 20 0 23696 1948 1268 S 0 0.0 0:00.95 init
    2 root 20 0 0 0 0 S 0 0.0 0:00.00 kthreadd

This is a regression: the bug was not present in Ubuntu 9.10 (Karmic).
Any help is much appreciated.

Georgi Petrov (gogosofta) wrote :

The status is "triaged", so eventually someone will get on it :)

Mark Fernandes (typist) wrote :

This issue has not been resolved as of:

#uname -a
Linux adler 2.6.32-24-generic #42-Ubuntu SMP Fri Aug 20 14:21:58 UTC 2010 x86_64 GNU/Linux

#top|head
top - 21:33:45 up 2:25, 2 users, load average: 1.16, 1.05, 0.70
Tasks: 180 total, 3 running, 177 sleeping, 0 stopped, 0 zombie
Cpu(s): 9.7%us, 2.1%sy, 0.0%ni, 82.9%id, 0.7%wa, 0.0%hi, 4.6%si, 0.0%st
Mem: 4055620k total, 1567716k used, 2487904k free, 94160k buffers
Swap: 3858388k total, 0k used, 3858388k free, 513732k cached

  PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
 4806 root 20 0 0 0 0 R 101 0.0 8:35.37 ksoftirqd/1
 1308 root 20 0 210m 103m 20m S 2 2.6 6:12.78 Xorg
 3162 mark 20 0 239m 47m 20m S 2 1.2 1:11.44 compiz

#lspci
00:00.0 Host bridge: Intel Corporation Mobile 4 Series Chipset Memory Controller Hub (rev 07)
00:01.0 PCI bridge: Intel Corporation Mobile 4 Series Chipset PCI Express Graphics Port (rev 07)
00:1a.0 USB Controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #4 (rev 03)
00:1a.1 USB Controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #5 (rev 03)
00:1a.2 USB Controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #6 (rev 03)
00:1a.7 USB Controller: Intel Corporation 82801I (ICH9 Family) USB2 EHCI Controller #2 (rev 03)
00:1b.0 Audio device: Intel Corporation 82801I (ICH9 Family) HD Audio Controller (rev 03)
00:1c.0 PCI bridge: Intel Corporation 82801I (ICH9 Family) PCI Express Port 1 (rev 03)
00:1c.1 PCI bridge: Intel Corporation 82801I (ICH9 Family) PCI Express Port 2 (rev 03)
00:1c.2 PCI bridge: Intel Corporation 82801I (ICH9 Family) PCI Express Port 3 (rev 03)
00:1c.4 PCI bridge: Intel Corporation 82801I (ICH9 Family) PCI Express Port 5 (rev 03)
00:1c.5 PCI bridge: Intel Corporation 82801I (ICH9 Family) PCI Express Port 6 (rev 03)
00:1d.0 USB Controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #1 (rev 03)
00:1d.1 USB Controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #2 (rev 03)
00:1d.2 USB Controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #3 (rev 03)
00:1d.7 USB Controller: Intel Corporation 82801I (ICH9 Family) USB2 EHCI Controller #1 (rev 03)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev 93)
00:1f.0 ISA bridge: Intel Corporation ICH9M LPC Interface Controller (rev 03)
00:1f.2 SATA controller: Intel Corporation ICH9M/M-E SATA AHCI Controller (rev 03)
00:1f.3 SMBus: Intel Corporation 82801I (ICH9 Family) SMBus Controller (rev 03)
01:00.0 VGA compatible controller: nVidia Corporation G92 [GeForce 9800M GTX] (rev a2)
02:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 02)
04:00.0 FireWire (IEEE 1394): JMicron Technology Corp. IEEE 1394 Host Controller
...


Georgi Petrov (gogosofta) wrote :

Not fixed as of 2.6.32-25 either.

Shafqat (priomsrb) wrote :

Hi, I was also getting this problem but now have a workaround. Before suspending, run:

sudo rmmod sr_mod

Then, when coming back from suspend, it should no longer hang at 50% or 100% CPU. You can also run this after resuming; it still works, but only after some time.

I don't know how or why this works; I read it on a website that I can no longer find. I should also mention that I am actually using Arch Linux with a 2.6.35 kernel.

Nathan Moore (nategoose) wrote :

I have this problem with hibernating as well. I am currently running:

Linux system76-pc 2.6.32-25-generic #44-Ubuntu SMP Fri Sep 17 20:05:27 UTC 2010 x86_64 GNU/Linux
Ubuntu 10.04

but I don't seem to have sr_mod loaded.

I've had variations of this problem through several upgrades. It seems to have started when I upgraded from 9.04 to 9.10 to 10.04 (9.10 ran so badly that I was going to do a fresh install, but decided to try 10.04 first). Previously, ksoftirqd/0 would go to 100% pretty quickly after resuming from hibernate, but now ksoftirqd/1 goes to 100% at some random-ish time after resuming from hibernate (within a few hours). It rarely seems to do this while I am actually at my laptop, and nothing odd seems to appear in kern.log or messages while it is happening.

If there is any additional information that may be helpful, please let me know.

Georgi Petrov (gogosofta) wrote :

... The above seem like voices crying in the wilderness.
I switched back to XP because of this bug (it leaves me without working suspend).
It's quite annoying to have to shut down and start up the laptop every time I stop or start using it.

Cam Cope (ccope) wrote :

I have a feeling this bug isn't just a single bug. After Shafqat's post, I started removing modules after coming back from suspend. I didn't have sr_mod, but after I unloaded my webcam modules my system calmed down again. This problem is probably a generic issue that happens when modules don't resume properly. The next time you resume from suspend, try unloading non-critical modules until your CPU usage drops back down, then report back the module name.
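
To narrow down which modules are worth trying, it may help to list the ones that nothing else is currently using. This is a rough sketch under stated assumptions: it relies on the usual /proc/modules layout (name, size, use count, dependents, state, address), and the helper name unused_modules is hypothetical. A use count of 0 only means rmmod is unlikely to refuse because the module is in use; it does not mean the module is non-critical, so that judgment still has to be made by hand.

```python
def unused_modules(proc_modules_text):
    """Return names of loaded modules whose use count is 0.

    Each /proc/modules line is expected to look like:
        name size use_count dependents state address
    """
    names = []
    for line in proc_modules_text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        name, use_count = fields[0], fields[2]
        if use_count == "0":
            names.append(name)
    return names
```

On a live system, the input can be read with open("/proc/modules").read(). Unloading the candidates one at a time while watching ksoftirqd in top should point to the module whose removal makes the CPU usage drop.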
