Comment 6 for bug 577785

Revision history for this message
alf@all.de (alf-all) wrote :

This drop is other than before. An other bug ?

Personalities : [raid6] [raid5] [raid4] [linear] [multipath] [raid0] [raid1] [raid10]
md4 : active raid5 sdk1[3](F) sdi1[4](F) sdj1[5](F)
      1953519872 blocks level 5, 64k chunk, algorithm 2 [3/0] [___]
md1 : active raid5 sdd1[3](F) sdg1[2] sde1[1]
      2930269824 blocks level 5, 64k chunk, algorithm 2 [3/2] [_UU]
md0 : active raid5 sdb1[1] sda1[0] sdc1[3](F)
      2930271872 blocks level 5, 64k chunk, algorithm 2 [3/2] [UU_]

At first the disks (md4) connected at SIL are complete gone (6:55:40), which not happend before:

04:00.0 Mass storage controller: Silicon Image, Inc. SiI 3132 Serial ATA Raid II Controller (rev 01)

May 19 06:55:40 downtown kernel: [117018.953812] ata9.00: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.954743] ata9.01: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.955641] ata9.02: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.956528] ata9.03: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.957418] ata9.04: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.958290] ata9.05: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.959155] ata9.06: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.960040] ata9.07: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.960916] ata9.08: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.961759] ata9.09: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.962564] ata9.10: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.963358] ata9.11: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.964120] ata9.12: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.964868] ata9.13: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.965593] ata9.14: failed to read SCR 1 (Emask=0x40)
May 19 06:55:40 downtown kernel: [117018.966289] ata9.15: exception Emask 0x4 SAct 0x0 SErr 0x0 action 0x6 frozen
May 19 06:55:40 downtown kernel: [117018.967044] ata9.00: exception Emask 0x100 SAct 0x0 SErr 0x0 action 0x6 frozen
May 19 06:55:40 downtown kernel: [117018.967723] ata9.01: exception Emask 0x100 SAct 0x1 SErr 0x0 action 0x6 frozen
May 19 06:55:40 downtown kernel: [117018.968353] ata9.01: failed command: READ FPDMA QUEUED
May 19 06:55:40 downtown kernel: [117018.968974] ata9.01: cmd 60/08:00:5f:00:56/00:00:72:00:00/40 tag 0 ncq 4096 in
May 19 06:55:40 downtown kernel: [117018.968975] res 40/00:ff:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
May 19 06:55:40 downtown kernel: [117018.970248] ata9.01: status: { DRDY }
May 19 06:55:40 downtown kernel: [117018.970862] ata9.02: exception Emask 0x100 SAct 0x0 SErr 0x0 action 0x6 frozen
...

About 2 hours later (9:17:26) the MCP55 raids also breaks:

00:0d.0 IDE interface: nVidia Corporation MCP55 IDE (rev a1)
00:0e.0 IDE interface: nVidia Corporation MCP55 SATA Controller (rev a2)
00:0e.1 IDE interface: nVidia Corporation MCP55 SATA Controller (rev a2)
00:0e.2 IDE interface: nVidia Corporation MCP55 SATA Controller (rev a2)

May 19 09:17:26 downtown kernel: [125524.980133] ata4: EH in SWNCQ mode,QC:qc_active 0x7 sactive 0x7
May 19 09:17:26 downtown kernel: [125524.980148] ata3: EH in SWNCQ mode,QC:qc_active 0x7 sactive 0x7
May 19 09:17:26 downtown kernel: [125524.980152] ata3: SWNCQ:qc_active 0x1 defer_bits 0x6 last_issue_tag 0x0
May 19 09:17:26 downtown kernel: [125524.980153] dhfis 0x1 dmafis 0x0 sdbfis 0x0
May 19 09:17:26 downtown kernel: [125524.980156] ata3: ATA_REG 0x40 ERR_REG 0x0
May 19 09:17:26 downtown kernel: [125524.980157] ata3: tag : dhfis dmafis sdbfis sacitve
May 19 09:17:26 downtown kernel: [125524.980160] ata3: tag 0x0: 1 0 0 1
May 19 09:17:26 downtown kernel: [125524.980171] ata3.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x6 frozen
May 19 09:17:26 downtown kernel: [125524.980174] ata3.00: failed command: READ FPDMA QUEUED
May 19 09:17:26 downtown kernel: [125524.980180] ata3.00: cmd 60/d8:00:bf:0b:5b/00:00:87:00:00/40 tag 0 ncq 110592 in
May 19 09:17:26 downtown kernel: [125524.980181] res 40/00:ff:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
May 19 09:17:26 downtown kernel: [125524.980184] ata3.00: status: { DRDY }
May 19 09:17:26 downtown kernel: [125524.980186] ata3.00: failed command: READ FPDMA QUEUED
May 19 09:17:26 downtown kernel: [125524.980191] ata3.00: cmd 60/28:08:97:0c:5b/00:00:87:00:00/40 tag 1 ncq 20480 in
May 19 09:17:26 downtown kernel: [125524.980192] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
May 19 09:17:26 downtown kernel: [125524.980194] ata3.00: status: { DRDY }
May 19 09:17:26 downtown kernel: [125524.980196] ata3.00: failed command: READ FPDMA QUEUED
May 19 09:17:26 downtown kernel: [125524.980201] ata3.00: cmd 60/80:10:3f:0d:5b/00:00:87:00:00/40 tag 2 ncq 65536 in
May 19 09:17:26 downtown kernel: [125524.980202] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
May 19 09:17:26 downtown kernel: [125524.980205] ata3.00: status: { DRDY }
May 19 09:17:26 downtown kernel: [125524.980212] ata3: hard resetting link
May 19 09:17:26 downtown kernel: [125524.980214] ata3: nv: skipping hardreset on occupied port
May 19 09:17:26 downtown kernel: [125525.021316] ata4: SWNCQ:qc_active 0x1 defer_bits 0x6 last_issue_tag 0x0
...