intel i945 GPU lockup, requires reboot to restore X

Bug #635379 reported by Steve Langasek
This bug affects 5 people
Affects: linux (Ubuntu)
Status: Fix Released
Importance: Undecided
Assigned to: Unassigned
Milestone: (none)

Bug Description

Twice in the past week, I've had an i915 GPU lockup with the maverick kernel. First with 2.6.35-16, now this latest with 2.6.35-20:

Sep 8 17:25:43 dario kernel: [440048.562567] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 8 17:25:43 dario kernel: [440048.562706] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 16037579 at 16037578)

This manifests as the X screen freezing for a few seconds, then going black, then X restarting in failsafe mode. Console works fine, but drm doesn't come back without a reboot.

I've noticed a rough but not definite correlation with toggling the video output selections and/or the lid status.
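For triage, the two numbers in the i915_do_wait_request line can be pulled out mechanically. The sketch below is only an illustration: the function name parse_wait_error is hypothetical, and reading "awaiting X at Y" as "requested sequence number X, last completed sequence number Y" is an assumption based on the message wording, not something stated in this report. The return value -5 matches -EIO in the Linux errno numbering.

```python
import re

# Matches the i915_do_wait_request error text quoted in the description.
WAIT_RE = re.compile(
    r"i915_do_wait_request returns (?P<ret>-?\d+) "
    r"\(awaiting (?P<awaiting>\d+) at (?P<current>\d+)\)"
)


def parse_wait_error(line):
    """Return (ret, awaiting, current) as ints, or None if the line does not match.

    Assumption: 'awaiting' is the requested sequence number and 'current'
    is the last one the GPU reported as completed.
    """
    m = WAIT_RE.search(line)
    if m is None:
        return None
    return int(m["ret"]), int(m["awaiting"]), int(m["current"])


sample = (
    "Sep 8 17:25:43 dario kernel: [440048.562706] [drm:i915_do_wait_request] "
    "*ERROR* i915_do_wait_request returns -5 (awaiting 16037579 at 16037578)"
)
print(parse_wait_error(sample))
```

Under that reading, the log above shows the GPU stalled one request short of the one being waited on.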

ProblemType: Bug
DistroRelease: Ubuntu 10.10
Package: linux-image-2.6.35-20-generic 2.6.35-20.29
Regression: Yes
Reproducible: No
ProcVersionSignature: Ubuntu 2.6.35-20.29-generic 2.6.35.4
Uname: Linux 2.6.35-20-generic x86_64
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.23.
Architecture: amd64
ArecordDevices:
 **** List of CAPTURE Hardware Devices ****
 card 0: Intel [HDA Intel], device 0: AD198x Analog [AD198x Analog]
   Subdevices: 1/1
   Subdevice #0: subdevice #0
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: vorlon 2274 F.... pulseaudio
Card0.Amixer.info:
 Card hw:0 'Intel'/'HDA Intel at 0xee240000 irq 47'
   Mixer name : 'Analog Devices AD1981'
   Components : 'HDA:11d41981,17aa2025,00100200'
   Controls : 20
   Simple ctrls : 11
Card29.Amixer.info:
 Card hw:29 'ThinkPadEC'/'ThinkPad Console Audio Control at EC reg 0x30, fw 79HT50WW-1.07'
   Mixer name : 'ThinkPad EC 79HT50WW-1.07'
   Components : ''
   Controls : 2
   Simple ctrls : 1
Card29.Amixer.values:
 Simple mixer control 'Console',0
   Capabilities: pvolume pvolume-joined pswitch pswitch-joined penum
   Playback channels: Mono
   Limits: Playback 0 - 14
   Mono: Playback 14 [100%] [off]
Date: Fri Sep 10 15:01:54 2010
Frequency: Once every few days.
HibernationDevice: RESUME=UUID=f108133c-6b9d-4d28-9058-0b3a0c5549b4
MachineType: LENOVO 6371CTO
PccardctlIdent:
 Socket 0:
   no product info available
PccardctlStatus:
 Socket 0:
   no card
ProcCmdLine: root=/dev/mapper/hostname-root ro quiet splash
ProcEnviron:
 PATH=(custom, user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
RelatedPackageVersions: linux-firmware 1.38
SourcePackage: linux
WpaSupplicantLog:

dmi.bios.date: 12/27/2006
dmi.bios.vendor: LENOVO
dmi.bios.version: 7IET23WW (1.04 )
dmi.board.name: 6371CTO
dmi.board.vendor: LENOVO
dmi.board.version: Not Available
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: Not Available
dmi.modalias: dmi:bvnLENOVO:bvr7IET23WW(1.04):bd12/27/2006:svnLENOVO:pn6371CTO:pvrThinkPadT60:rvnLENOVO:rn6371CTO:rvrNotAvailable:cvnLENOVO:ct10:cvrNotAvailable:
dmi.product.name: 6371CTO
dmi.product.version: ThinkPad T60
dmi.sys.vendor: LENOVO

Balage (jobbara-artalmatlan) wrote :

Same happens here on Kubuntu 10.10 almost every day. Kwin effects are enabled.

I'll attach a log sample from yesterday (egrep -i "i915|intel" kern.log | grep "Nov 23")
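The same two-stage filter (case-insensitive match on "i915|intel", then a date substring) could be reproduced in Python when grep is not at hand. This is a rough sketch; filter_log is a hypothetical helper name, and the sample lines are taken from elsewhere in this report purely to exercise it.

```python
import re


def filter_log(lines, date, pattern=r"i915|intel"):
    """Keep lines that match `pattern` case-insensitively and contain `date`."""
    rx = re.compile(pattern, re.IGNORECASE)
    return [ln for ln in lines if rx.search(ln) and date in ln]


lines = [
    "Sep 8 17:25:43 dario kernel: [440048.562567] [drm:i915_hangcheck_elapsed] "
    "*ERROR* Hangcheck timer elapsed... GPU hung",
    "Sep 8 17:25:45 dario kernel: wlan0: deauthenticating by local choice",
    "Sep 9 09:00:00 dario kernel: [drm:i915_hangcheck_elapsed] *ERROR* GPU hung",
]
for ln in filter_log(lines, "Sep 8"):
    print(ln)
```

To run it against a real file, pass an open file object (for example open("kern.log")) as the first argument, since iterating a file yields its lines.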

# uname -a
Linux gaia2 2.6.35-22-generic #35-Ubuntu SMP Sat Oct 16 20:45:36 UTC 2010 x86_64 GNU/Linux

Unfortunately Xorg.log was rotated out but I'll upload one from today anyway.

Let me know what else you need.

Balage (jobbara-artalmatlan) wrote :

This machine is a Lenovo T410.

YenTheFirst (yenthefirst) wrote :

I can confirm I've been getting a similar crash on Kubuntu Lucid, with 2.6.32-26-generic.
I seem to be able to reproduce it by opening an MP4 video file in Google Chrome.

[219984.676055] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[219984.676073] render error detected, EIR: 0x00000000
[219984.676314] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 35052852 at 35052845)
[219986.197125] wlan0: deauthenticating from 00:18:f8:ef:42:3d by local choice (reason=3)
[219991.918725] scim-launcher[2984]: segfault at 0 ip (null) sp bf999fdc error 4 in libstdc++.so.6.0.13[110000+e9000]

Brad Figg (brad-figg)
tags: added: acpi-bad-address
Brad Figg (brad-figg)
Changed in linux (Ubuntu):
status: New → Confirmed
tags: removed: regression-potential
Marcello Romani (marcello-romani) wrote :

I can confirm this happens almost daily on an HP system.

cat /etc/lsb-release
DISTRIB_ID=Ubuntu
DISTRIB_RELEASE=10.04
DISTRIB_CODENAME=lucid
DISTRIB_DESCRIPTION="Ubuntu 10.04.3 LTS"

uname -a
Linux <hostname_removed> 2.6.32-33-generic #72-Ubuntu SMP Fri Jul 29 21:07:13 UTC 2011 x86_64 GNU/Linux

Part of dmesg output:

[ 7395.714262] render error detected, EIR: 0x00000000
[ 7395.714292] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58002 at 47538)
[ 7396.962768] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7396.962774] render error detected, EIR: 0x00000000
[ 7396.962790] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58003 at 47538)
[ 7400.344180] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7400.344187] render error detected, EIR: 0x00000000
[ 7400.344214] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58008 at 47538)
[ 7401.970646] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7401.970652] render error detected, EIR: 0x00000000
[ 7401.970674] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58010 at 47538)
[ 7403.158014] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7403.158022] render error detected, EIR: 0x00000000
[ 7403.158039] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58011 at 47538)
[ 7406.541946] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7406.541952] render error detected, EIR: 0x00000000
[ 7406.542009] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58016 at 47538)
[ 7408.197114] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7408.197121] render error detected, EIR: 0x00000000
[ 7408.197146] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58018 at 47538)
[ 7409.394532] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7409.394538] render error detected, EIR: 0x00000000
[ 7409.394553] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58019 at 47538)
[ 7412.747840] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7412.747847] render error detected, EIR: 0x00000000
[ 7412.747874] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58024 at 47538)
[ 7413.497477] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7413.497484] render error detected, EIR: 0x00000000
[ 7413.497513] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58026 at 47538)
[ 7414.245224] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7414.245230] render error detected, EIR: 0x00000000
[ 7414.245254] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58028 at 47538)
[ 7416.030124] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 7416.030131] render error detected, EIR: 0x00000000
[ 7416.030148] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request retur...
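
In the excerpt above, the number after "at" stays at 47538 while the number after "awaiting" keeps climbing. A small script can check for that pattern in a longer log. This is a sketch under an assumption: that the "at" value is the last sequence number the GPU completed, so a value that never changes across errors would mean the GPU made no progress. The name summarize_waits is hypothetical.

```python
import re

# Captures the two numbers from "(awaiting X at Y)".
WAIT_RE = re.compile(r"\(awaiting (\d+) at (\d+)\)")


def summarize_waits(log_text):
    """Count wait errors and report whether the 'at' value ever changes."""
    pairs = [(int(a), int(c)) for a, c in WAIT_RE.findall(log_text)]
    currents = {c for _, c in pairs}
    return {
        "errors": len(pairs),
        "distinct_current": sorted(currents),
        # "stuck" only makes sense with at least two errors to compare.
        "stuck": len(pairs) > 1 and len(currents) == 1,
    }


excerpt = """\
[ 7395.714292] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58002 at 47538)
[ 7396.962790] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58003 at 47538)
[ 7400.344214] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -5 (awaiting 58008 at 47538)
"""
print(summarize_waits(excerpt))
```

Lines that do not contain the "(awaiting X at Y)" text, including the truncated last line of the excerpt, are simply ignored by the pattern.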

penalvch (penalvch) wrote :

Steve Langasek, thank you for reporting this and helping make Ubuntu better. This bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? Can you try with the latest development release of Ubuntu? ISO CD images are available from http://cdimage.ubuntu.com/releases/ .

If it remains an issue, could you run the following command from a Terminal (Applications->Accessories->Terminal)? It will automatically gather and attach updated debug information to this report.

apport-collect -p linux <replace-with-bug-number>

Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.

Thanks in advance.

Changed in linux (Ubuntu):
status: Confirmed → Incomplete
Steve Langasek (vorlon) wrote :

I haven't seen this bug with later Ubuntu releases, no.

Changed in linux (Ubuntu):
status: Incomplete → Fix Released