task blocked for more than 120 seconds on server kernel
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
linux (Ubuntu) | Expired | Medium | Unassigned |
Bug Description
Hi,
this is about an Ubuntu server installation.
The server consists mainly of fast HDDs and two externally attached LTO-3 tape drives in a changer.
Its purpose is to sync with other servers and then write everything onto both tape drives in parallel overnight.
The following is our main problem:
[ 1081.590063] INFO: task mbuffer1:2589 blocked for more than 120 seconds.
[ 1081.590577] "echo 0 > /proc/sys/
[ 1081.591151] mbuffer1 D 0000000000000000 0 2589 2560 0x00000000
[ 1081.591162] ffff88080cee9c18 0000000000000082 0000000000015bc0 0000000000015bc0
[ 1081.591173] ffff8803f87ac890 ffff88080cee9fd8 0000000000015bc0 ffff8803f87ac4d0
[ 1081.591181] 0000000000015bc0 ffff88080cee9fd8 0000000000015bc0 ffff8803f87ac890
[ 1081.591189] Call Trace:
[ 1081.591208] [<ffffffff81558
[ 1081.591220] [<ffffffff812b4
[ 1081.591228] [<ffffffff81559
[ 1081.591238] [<ffffffff8138a
[ 1081.591246] [<ffffffff81557
[ 1081.591256] [<ffffffff8129d
[ 1081.591266] [<ffffffff8105a
[ 1081.591286] [<ffffffffa015c
[ 1081.591294] [<ffffffff81557
[ 1081.591305] [<ffffffffa015c
[ 1081.591315] [<ffffffffa0162
[ 1081.591324] [<ffffffff8105a
[ 1081.591334] [<ffffffff81143
[ 1081.591342] [<ffffffff81144
[ 1081.591351] [<ffffffff81012
[ 1081.591358] INFO: task mbuffer2:2608 blocked for more than 120 seconds.
[ 1081.591800] "echo 0 > /proc/sys/
[ 1081.592374] mbuffer2 D 0000000000000000 0 2608 2591 0x00000000
[ 1081.592383] ffff8800df895c18 0000000000000082 0000000000015bc0 0000000000015bc0
[ 1081.592392] ffff8803f87a9ab0 ffff8800df895fd8 0000000000015bc0 ffff8803f87a96f0
[ 1081.592400] 0000000000015bc0 ffff8800df895fd8 0000000000015bc0 ffff8803f87a9ab0
[ 1081.592408] Call Trace:
[ 1081.592417] [<ffffffff81558
[ 1081.592425] [<ffffffff812b4
[ 1081.592432] [<ffffffff81559
[ 1081.592439] [<ffffffff8138a
[ 1081.592448] [<ffffffff81557
[ 1081.592456] [<ffffffff8129d
[ 1081.592464] [<ffffffff8105a
[ 1081.592474] [<ffffffffa015c
[ 1081.592482] [<ffffffff81557
[ 1081.592492] [<ffffffffa015c
[ 1081.592502] [<ffffffffa0162
[ 1081.592510] [<ffffffff8105a
[ 1081.592518] [<ffffffff81143
[ 1081.592525] [<ffffffff81144
[ 1081.592533] [<ffffffff81012
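For anyone triaging a longer log, the hung-task reports above can be pulled out of a saved dmesg dump and counted per task. The following is a minimal sketch and not part of the original report; the function names `find_hung_tasks` and `count_by_task` are hypothetical, and the pattern assumes the exact "INFO: task NAME:PID blocked for more than N seconds" wording shown in the log above.

```python
import re
from collections import Counter

# Assumed line format, taken from the log in this report:
# [ 1081.590063] INFO: task mbuffer1:2589 blocked for more than 120 seconds.
HUNG_TASK = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+INFO: task (?P<name>\S+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def find_hung_tasks(log_text):
    """Return (timestamp, task name, pid, seconds) for each hung-task line."""
    reports = []
    for line in log_text.splitlines():
        m = HUNG_TASK.search(line)
        if m:
            reports.append(
                (float(m["ts"]), m["name"], int(m["pid"]), int(m["secs"]))
            )
    return reports


def count_by_task(reports):
    """Count how many hung-task reports each task name produced."""
    return Counter(name for _, name, _, _ in reports)


if __name__ == "__main__":
    # Two header lines copied from the log above; stack traces are ignored.
    sample = (
        "[ 1081.590063] INFO: task mbuffer1:2589 blocked for more than 120 seconds.\n"
        "[ 1081.591189] Call Trace:\n"
        "[ 1081.591358] INFO: task mbuffer2:2608 blocked for more than 120 seconds.\n"
    )
    for ts, name, pid, secs in find_hung_tasks(sample):
        print(f"{ts:.6f}s: {name} (pid {pid}) blocked > {secs}s")
```

Feeding the full dmesg output to `find_hung_tasks` and then to `count_by_task` would show whether both mbuffer processes stall equally often before the abort.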
After the fifth 120-second delay, the following aborts the backup:
[ 1818.980059] mptscsih: ioc1: attempting task abort! (sc=ffff880057b
[ 1818.980067] st 6:0:4:0: CDB: Write(6): 0a 00 04 00 00 00
[ 1829.300042] mptscsih: ioc1: WARNING - Issuing Reset from mptscsih_
[ 1831.280030] mptscsih: ioc1: task abort: SUCCESS (sc=ffff880057b
[ 1831.282296] mptscsih: ioc1: attempting task abort! (sc=ffff880057b
[ 1831.282302] st 6:0:5:0: CDB: Write(6): 0a 00 04 00 00 00
[ 1831.282321] mptscsih: ioc1: task abort: SUCCESS (sc=ffff880057b
[ 1831.284945] st0: Error 80000 (driver bt 0x0, host bt 0x8).
[ 1831.285106] st1: Error 80000 (driver bt 0x0, host bt 0x8).
[ 1831.490044] scsi target6:0:4: Beginning Domain Validation
[ 1831.637097] scsi target6:0:4: Ending Domain Validation
[ 1831.637208] scsi target6:0:4: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 64)
[ 1834.150032] scsi target6:0:5: Beginning Domain Validation
[ 1834.297533] scsi target6:0:5: Ending Domain Validation
[ 1834.297649] scsi target6:0:5: FAST-160 WIDE SCSI 320.0 MB/s DT IU RTI PCOMP (6.25 ns, offset 64)
[ 1910.340056] scsi target6:0:5: Beginning Domain Validation
[ 1910.729074] scsi target6:0:5: Ending Domain Validation
[ 1910.729194] scsi target6:0:5: FAST-160 WIDE SCSI 320.0 MB/s DT IU RTI PCOMP (6.25 ns, offset 64)
This is with the SAS-LSI driver manually updated to version:
# cat /sys/module/
4.24.00.00
I updated the driver because the one supplied with the kernel (observed with 2.6.32-23) loses connections to SATA drives.
This is a really serious bug for this server! It prevents it from doing backups.
Please also read Bug 494476
regards
Lars
ProblemType: Bug
DistroRelease: Ubuntu 10.04
Package: linux-image-
Regression: No
Reproducible: Yes
ProcVersionSign
Uname: Linux 2.6.32-25-server x86_64
AlsaDevices: Error: command ['ls', '-l', '/dev/snd/'] failed with exit code 2: ls: cannot access /dev/snd/: No such file or directory
AplayDevices: Error: [Errno 2] No such file or directory
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory
CurrentDmesg:
Date: Fri Oct 1 10:20:57 2010
MachineType: Supermicro H8DI3+
PciMultimedia:
ProcCmdLine: BOOT_IMAGE=
ProcEnviron:
LANG=de_DE.UTF-8
SHELL=/bin/bash
SourcePackage: linux
dmi.bios.date: 12/07/2009
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 1.0b
dmi.board.
dmi.board.name: H8DI3+
dmi.board.vendor: Supermicro
dmi.board.version: 1234567890
dmi.chassis.
dmi.chassis.type: 3
dmi.chassis.vendor: Supermicro
dmi.chassis.
dmi.modalias: dmi:bvnAmerican
dmi.product.name: H8DI3+
dmi.product.
dmi.sys.vendor: Supermicro
Hi Lars,
If you could also please test the latest upstream kernel available, that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds. Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.
Thanks in advance.
[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]