kernel panic with 'rejecting i/o to offline device'
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Expired
|
Undecided
|
Unassigned |
Bug Description
Hello,
I'm experiencing kernel panics on 5 identical Ubuntu 10.04 servers. They are identical down to every bit of hardware and operating environment, They've been reliably running Debian and Ubuntu Linux releases for years. There's something in the latest Ubuntu 10.04 server builds that has introduced a regression. They are used for heavy but sporadic CPU-bound work.
Over the course of a day, the odds are that a singular server will experience a kernel. Over two days it's practically guaranteed.
The kernel panic says:
"sd 2:0:0:0 rejecting i/o to offline device"
sd is the system drive and is most certainly online. A hard reboot fixes the matter for the following day or so.
ProblemType: Bug
DistroRelease: Ubuntu 10.04
Package: linux-image-
Regression: Yes
Reproducible: No
ProcVersionSign
Uname: Linux 2.6.32-25-generic x86_64
Architecture: amd64
Date: Fri Oct 22 16:19:50 2010
Frequency: Once a day.
ProcEnviron:
LANG=en_GB.UTF-8
SHELL=/bin/bash
SourcePackage: linux
---
Architecture: amd64
DistroRelease: Ubuntu 10.04
NonfreeKernelMo
Package: linux (not installed)
ProcEnviron:
LANG=en_GB.UTF-8
SHELL=/bin/bash
ProcVersionSign
Regression: Yes
Reproducible: Yes
Tags: lucid filesystem regression-release needs-upstream-
Uname: Linux 2.6.32-33-generic x86_64
UserGroups:
tags: | added: apport-collected |
description: | updated |
Rebuilding these machines with identical software stacks but using EXT3 instead of EXT4 on the system drives has seemingly fixed this problem. As it remains then it appears there is a serious regression in the current EXT4 code.
Best,
Paul