Apps freeze because of BUG at fs/ext4/inode.c:2003

Bug #659358 reported by ooze
This bug affects 1 person
Affects: linux (Ubuntu)
Status: Invalid
Importance: Undecided
Assigned to: Unassigned

Bug Description

The system freezes at least once a day after hitting a kernel assertion at fs/ext4/inode.c:2003. Applications then start to freeze, probably because they are blocked indefinitely on I/O. The system stays alive but is barely usable, even for debugging. This may be a regression, since it started shortly after upgrading to maverick. However, the second hard drive in this system seems to be dying from an extreme number of load cycles, which may be what triggers the bug.

It seems to occur more often when the boinc daemon is running. This program uses all available CPU and performs many file updates and truncations.
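
The write-and-truncate pattern described above can be approximated with a small script. This is only a sketch of that kind of workload, not a confirmed reproducer for this bug: the function name `churn`, its parameters, and the append-then-truncate schedule are illustrative assumptions and do not model what BOINC actually does.

```python
import os
import tempfile


def churn(path, iterations=1000, chunk=4096):
    """Append to a file and periodically truncate it, syncing each step.

    Hypothetical workload sketch: returns the file size left on disk
    after the last iteration.
    """
    payload = b"x" * chunk
    with open(path, "wb") as f:
        for i in range(iterations):
            # Append one chunk and force it out to the disk.
            f.write(payload)
            f.flush()
            os.fsync(f.fileno())
            # On every second pass, cut the file back to half a chunk.
            if i % 2 == 1:
                f.truncate(chunk // 2)
                f.seek(0, os.SEEK_END)
                os.fsync(f.fileno())
    return os.path.getsize(path)


if __name__ == "__main__":
    # Run against a scratch directory; point dir= at the ext4 mount under test.
    with tempfile.TemporaryDirectory() as scratch:
        print(churn(os.path.join(scratch, "churn.dat")))
```

Running several copies in parallel on the affected filesystem would come closer to a CPU-saturating daemon, but whether that triggers the assertion is untested.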

I tested with the latest mainline kernel (2.6.36-999.201009291611) and a similar bug seems to occur (line 2030 instead of 2003?).

Attached is the kernel log at the time of the freeze. Part of it is corrupted, so I pressed SysRq-t a second time before rebooting.
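
To compare the assertion site across kernels (line 2003 versus 2030), the BUG locations can be pulled out of a saved log. The following is a minimal sketch, assuming the log uses the usual `kernel BUG at <file>:<line>!` message format; the name `find_bug_sites` is hypothetical, and corrupted parts of a log may simply not match.

```python
import re

# Assumed message format: "kernel BUG at fs/ext4/inode.c:2003!"
BUG_RE = re.compile(r"kernel BUG at (?P<file>\S+?):(?P<line>\d+)!")


def find_bug_sites(log_text):
    """Return (source_file, line_number) for each BUG message in the text."""
    return [
        (m.group("file"), int(m.group("line")))
        for m in BUG_RE.finditer(log_text)
    ]
```

Feeding it the text of each attached log gives one tuple per assertion hit, which makes it easy to see whether both kernels fail in the same source file.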

ProblemType: Bug
DistroRelease: Ubuntu 10.10
Package: linux-image-2.6.35-22-generic 2.6.35-22.34
Regression: Yes
Reproducible: No
ProcVersionSignature: Ubuntu 2.6.35-22.34-generic 2.6.35.4
Uname: Linux 2.6.35-22-generic x86_64
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.23.
AplayDevices:
 **** List of PLAYBACK Hardware Devices ****
 card 0: Intel [HDA Intel], device 0: ALC262 Analog [ALC262 Analog]
   Subdevices: 1/1
   Subdevice #0: subdevice #0
Architecture: amd64
ArecordDevices:
 **** List of CAPTURE Hardware Devices ****
 card 0: Intel [HDA Intel], device 0: ALC262 Analog [ALC262 Analog]
   Subdevices: 1/1
   Subdevice #0: subdevice #0
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: gauthierp 2375 F.... pulseaudio
CRDA: Error: command ['iw', 'reg', 'get'] failed with exit code 1: nl80211 not found.
Card0.Amixer.info:
 Card hw:0 'Intel'/'HDA Intel at 0xe0a00000 irq 46'
   Mixer name : 'Realtek ALC262'
   Components : 'HDA:10ec0262,103c280c,00100100'
   Controls : 31
   Simple ctrls : 19
CheckboxSubmission: 0378b0df62dc16d1ca4403d7647600ef
CheckboxSystem: 6ce041aeed0a2c17b3343b66d157175d
Date: Tue Oct 12 13:15:03 2010
Frequency: Once a day.
HibernationDevice: RESUME=UUID=d6dd96dc-8163-4e66-b272-ffcd8e09221f
InstallationMedia: Ubuntu 9.10 "Karmic Koala" - Release amd64 (20091027)
MachineType: Hewlett-Packard HP xw4400 Workstation
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.35-22-generic root=UUID=1f110f9b-6c53-489f-844d-81cdec9f9c1d ro quiet splash
ProcEnviron:
 PATH=(custom, user)
 LANG=fr_CA.utf8
 SHELL=/bin/bash
RelatedPackageVersions: linux-firmware 1.38
RfKill:
 0: hp-wwan: Wireless WAN
  Soft blocked: no
  Hard blocked: no
SourcePackage: linux
dmi.bios.date: 12/07/2006
dmi.bios.vendor: Hewlett-Packard
dmi.bios.version: 786D7 v02.02
dmi.board.name: 0A68h
dmi.board.vendor: Hewlett-Packard
dmi.chassis.asset.tag: 2UA708151K
dmi.chassis.type: 6
dmi.chassis.vendor: Hewlett-Packard
dmi.modalias: dmi:bvnHewlett-Packard:bvr786D7v02.02:bd12/07/2006:svnHewlett-Packard:pnHPxw4400Workstation:pvr:rvnHewlett-Packard:rn0A68h:rvr:cvnHewlett-Packard:ct6:cvr:
dmi.product.name: HP xw4400 Workstation
dmi.sys.vendor: Hewlett-Packard

ooze (zoe-gauthier) wrote :

A new attachment with the content grabbed from /proc/kmsg includes a similar kernel trace.

This system, which crashed at least once a day with kernel 2.6.35 (maverick), has been running 2.6.32 (lucid) continuously for the last few days and has not crashed once with the older kernel. I see there is a 2.6.36 kernel available in natty; would it be useful to test with it?

Brad Figg (brad-figg)
tags: added: acpi-namespace-lookup
tags: added: acpi-parameter
tags: added: acpi-parse-exec-fail
Brad Figg (brad-figg)
Changed in linux (Ubuntu):
status: New → Confirmed
ooze (zoe-gauthier) wrote :

The RAM in the affected system has been replaced and this fixed a few strange bugs. I am closing this bug because it is very old and may be impossible to reproduce.

Changed in linux (Ubuntu):
status: Confirmed → Invalid