Comment 14 for bug 666828

Revision history for this message
Stuart Longland (redhatter) wrote :

Not sure what the official status is, but I'm getting this problem with a HP ProLiant box here running Ubuntu 12.04 LTS and linux-image-3.5.0-25-generic as well as the 3.2-series kernels.

For me the machine will run fine for about a day, then everything locks up. If I'm lucky, I can SSH in, but then `dmesg` hangs. `fold -w 80 /dev/vcs1` dumps the following output:

 message.
[158400.224328] INFO: task oned:3106 blocked for more than 120 seconds.
[158400.224337] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
 message.
[158400.224491] INFO: task mm_sched:3116 blocked for more than 120 seconds.
[158400.224501] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
 message.
[158520.224030] INFO: task rs:main Q:Reg:888 blocked for more than 120 seconds.
[158520.224046] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
 message.
[158520.224327] INFO: task oned:3106 blocked for more than 120 seconds.
[158520.224336] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
 message.
[158520.224503] INFO: task mm_sched:3116 blocked for more than 120 seconds.
[158520.224524] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
 message.
[158640.224028] INFO: task rs:main Q:Reg:888 blocked for more than 120 seconds.
[158640.224056] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
 message.
[158640.224362] INFO: task oned:3106 blocked for more than 120 seconds.
[158640.224382] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
 message.

The box previously worked reliably running VMWare ESXi. The link to the Red Hat bug database seems to require a log in to see the bug, so at time of writing, the bug is inaccessible to me and I'm unable to follow any advice there. (I get an error message "You are not authorized to access bug #615543.")

I'm trying a complete reload (I install using a PXE boot image) to see if a newer kernel has been released that fixes the issue -- also to clean up the last vestiges of an OpenNebula installation that's no longer in use.