2.6.31-12.41 hangs with Radeon KMS

Bug #446421 reported by Nikolaj Sheller
32
This bug affects 5 people
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Won't Fix
High
Unassigned

Bug Description

First of all:
I cannot find out how to report bugs without using the command "ubuntu-bug".
As I need to have a running kernel to use this method I have run "$ubuntu-bug linux" on another kernel than the one I wish to report the bug on...

The kernel I have reported this bug on is running with "radeon.modeset=1" on 2.6.31-11.38 with a dual head ATI card.
The kernel I hangs on 2.6.31-12.41 with the same card and configuration.

A regression seems to have occurred between these kernels. No revealing logs were present. The last statement was:

First time:
Oct 8 09:45:50 nsh-ubuntu kernel: [ 20.776080] Unpin not necessary for ffff88013653f600 !

Second time:
Oct 8 09:49:27 nsh-ubuntu kernel: [ 21.604822] Unpin not necessary for ffff880135de4a00 !

The secondary screen goes dark, and the kernel freezes.

ProblemType: Bug
Architecture: amd64
ArecordDevices:
 **** List of CAPTURE Hardware Devices ****
 card 0: Intel [HDA Intel], device 0: AD198x Analog [AD198x Analog]
   Subdevices: 1/1
   Subdevice #0: subdevice #0
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: nsh 2011 F.... pulseaudio
                      nsh 6403 F.... NLNOTES.EXE
                      nsh 6481 F.... ntaskldr.EXE
 /dev/snd/pcmC0D0p: nsh 2011 F...m pulseaudio
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
 Card hw:0 'Intel'/'HDA Intel at 0xdfffc000 irq 16'
   Mixer name : 'Analog Devices AD1983'
   Components : 'HDA:11d41983,102801da,00100400'
   Controls : 26
   Simple ctrls : 15
Date: Thu Oct 8 17:08:29 2009
DistroRelease: Ubuntu 9.10
HibernationDevice: RESUME=UUID=c9ffa247-ba39-436e-a246-abd23cdce0d7
IwConfig:
 lo no wireless extensions.

 eth1 no wireless extensions.
MachineType: Dell Inc. OptiPlex 745
Package: linux-image-2.6.31-11-generic 2.6.31-11.38
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.31-11-generic root=UUID=e31b8a9b-cbd0-483b-82c2-679e3c471373 ro radeon.modeset=1 quiet splash
ProcEnviron:
 LANG=en_DK.UTF-8
 SHELL=/bin/bash
ProcVersionSignature: Ubuntu 2.6.31-11.38-generic
RelatedPackageVersions:
 linux-backports-modules-2.6.31-11-generic N/A
 linux-firmware 1.21
RfKill:

SourcePackage: linux
Uname: Linux 2.6.31-11-generic x86_64
dmi.bios.date: 05/21/2007
dmi.bios.vendor: Dell Inc.
dmi.bios.version: 2.3.1
dmi.board.name: 0TY565
dmi.board.vendor: Dell Inc.
dmi.chassis.type: 6
dmi.chassis.vendor: Dell Inc.
dmi.modalias: dmi:bvnDellInc.:bvr2.3.1:bd05/21/2007:svnDellInc.:pnOptiPlex745:pvr:rvnDellInc.:rn0TY565:rvr:cvnDellInc.:ct6:cvr:
dmi.product.name: OptiPlex 745
dmi.sys.vendor: Dell Inc.

Revision history for this message
Nikolaj Sheller (nikolajsheller) wrote :
Changed in linux (Ubuntu):
importance: Undecided → High
status: New → Triaged
tags: added: regression-potential
Revision history for this message
Andy Whitcroft (apw) wrote :

We are not expecting KMS to be stable for all ATI cards in 2.6.31 and not enabling it there by default for Karmic. That said if it worked on 2.6.31-11.38 then I am supprised it does not do so on .41 as I cannot see any radeon changes in the kernel between those two versions.

Can we try booting the .41 kernel without modeset enabled and without quiet to see if we get to see something prior to the panic. Also there was one tty related panic in .41 which was fixed in subsequent kernels. So does this occur with the latest available kernel .42 and later? Again if it occurs with modeset can we try booting without modeset and without quiet there.

Please report back here. Thanks.

Revision history for this message
Nikolaj Sheller (nikolajsheller) wrote :

I have now tried booting with the 2.6.31-12.41 kernel, and kernel has not yet hung.
I saw this happen twice in a row when I originally booted with 2.6.31-12.41...
Could this be due to a race condition that I encountered twice by chance?

I have not encountered this on 2.6.31-13.44 yet.

Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

Hi Nikolaj,

Just curious if this is still an issue with the final 2.6.31-14.48 Karmic kernel or even the most recently released 2.6.31-15.50 kernel update? Thanks.

Changed in linux (Ubuntu):
status: Triaged → Incomplete
Revision history for this message
Nikolaj Sheller (nikolajsheller) wrote :

I have experienced a couple of system hangs since 2009-10-13, but apport did not report these, and I did not have time to investigate.

I will report any additional hangs that may be relevant here, when I encounter them.

tags: added: regression-release
removed: regression-potential
tags: added: karmic
Revision history for this message
Nikolaj Sheller (nikolajsheller) wrote :

I have not seen any system hangs since kernel 2.6.31-13.44. I think it can be closed.

Revision history for this message
Nikolaj Sheller (nikolajsheller) wrote :
Download full text (7.0 KiB)

Oh well spoke too soon:

Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980058] ------------[ cut here ]------------
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980068] WARNING: at /build/buildd/linux-2.6.31/lib/kref.c:43 kref_get+0x2d/0x30()
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980070] Hardware name: OptiPlex 745
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980072] Modules linked in: binfmt_misc vboxnetflt vboxnetadp vboxdrv kvm_intel kvm snd_hda_codec_analog snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_dummy snd_seq_oss ppdev iptable_filter snd_seq_midi ip_tables x_tables gspca_zc3xx gspca_main videodev v4l1_compat v4l2_compat_ioctl32 pl2303 usbserial snd_rawmidi snd_seq_midi_event psmouse serio_raw parport_pc dell_wmi dcdbas lp parport snd_seq snd_timer snd_seq_device snd soundcore snd_page_alloc usbhid dm_raid45 xor fbcon tileblit font bitblit softcursor radeon ttm drm i2c_algo_bit floppy tg3 intel_agp
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980114] Pid: 10, comm: events/1 Not tainted 2.6.31-18-generic #55-Ubuntu
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980116] Call Trace:
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980121] [<ffffffff8105e828>] warn_slowpath_common+0x78/0xb0
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980125] [<ffffffff8105e86f>] warn_slowpath_null+0xf/0x20
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980127] [<ffffffff81279bed>] kref_get+0x2d/0x30
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980135] [<ffffffffa007fdf4>] ttm_bo_delayed_delete+0x94/0x180 [ttm]
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980141] [<ffffffffa007fee0>] ? ttm_bo_delayed_workqueue+0x0/0x40 [ttm]
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980146] [<ffffffffa007fefa>] ttm_bo_delayed_workqueue+0x1a/0x40 [ttm]
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980150] [<ffffffff81073865>] run_workqueue+0x95/0x170
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980153] [<ffffffff810739e4>] worker_thread+0xa4/0x120
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980156] [<ffffffff81078b90>] ? autoremove_wake_function+0x0/0x40
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980159] [<ffffffff81073940>] ? worker_thread+0x0/0x120
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980162] [<ffffffff810787a6>] kthread+0xa6/0xb0
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980165] [<ffffffff8101312a>] child_rip+0xa/0x20
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980168] [<ffffffff81078700>] ? kthread+0x0/0xb0
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980170] [<ffffffff81013120>] ? child_rip+0x0/0x20
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980172] ---[ end trace c31a867b7006f62a ]---
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980238] CPU 0
Jan 27 08:18:07 nsh-ubuntu kernel: [ 263.980240] Modules linked in: binfmt_misc vboxnetflt vboxnetadp vboxdrv kvm_intel kvm snd_hda_codec_analog snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_dummy snd_seq_oss ppdev iptable_filter snd_seq_midi ip_tables x_tables gspca_zc3xx gspca_main videodev v4l1_compat v4l2_compat_ioctl32 pl2303 usbserial snd_rawmidi snd_seq_midi_event psmouse serio_raw parport_pc dell_wmi dcdbas lp parport snd_seq snd_timer snd_seq_device snd soundcore snd...

Read more...

Revision history for this message
Flávio Etrusco (etrusco) wrote :

Is a karmic trace/debug still wanted? I'm having all kinds of (similar) problems with Lucid BTW...

Revision history for this message
Klaus Doblmann (moviemaniac) wrote :

There have been several reports on the forums (mostly on r600 cards IIRC) about random system hangs with ATI and KMS in lucid. Also I have been helping 10+ people with these problems.

The symptoms: The machine hangs all of a sudden up to multiple times a day. What seems to trigger this is very often playing a flash video or also playing other videos or playing games or sometimes doing nothing at all.
There's nothing whatsoever to be found in syslog or all the other logfiles I've looked at so this is a very hard one to catch.
However there is a fix for this problem: using a -34 series kernel (which, of course, is still in its rc stage right now). With these kernels the machines run without problems. Mine (ATI 2600XT) used to crach twice a day to every second day and it's been running stable for over a month now using the -34 kernel series. This also fixed all other machines.

So my take on this is that there's some sort of problem in lucid's DRM stack and judging from the amount of machines affected I'd say this is definitely a critical bug.

Revision history for this message
Klaus Doblmann (moviemaniac) wrote :

Thread on the ubuntuforums about this problem: http://ubuntuforums.org/showthread.php?t=1347430

Revision history for this message
Alex Ainscow (ubuntu-thealexsite) wrote :

I have a similar problem, but it is not fixed by moving to the -34 kernel, see bug 581284.

Revision history for this message
Flávio Etrusco (etrusco) wrote :

Anybody with a r600 series card (mine is a Mobility HD 4650) that doesn't hang up the system on shutdown?
@Alex, you report is marked 'private'.

Revision history for this message
Alex Ainscow (ubuntu-thealexsite) wrote :

Flavio... sorry for delayed reply

My bug report turned out to be unrelated.... my mistake. (its no longer private)

Revision history for this message
Brad Figg (brad-figg) wrote : Unsupported series, setting status to "Won't Fix".

This bug was filed against a series that is no longer supported and so is being marked as Won't Fix. If this issue still exists in a supported series, please file a new bug.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: Incomplete → Won't Fix
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.