BUG: unable to handle kernel paging request - EIP: [<f959ae41>] snd_ctl_poll (Followed by system lockup)

Bug #649141 reported by Bryce Harrington
10
This bug affects 2 people
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Won't Fix
Undecided
Unassigned
Maverick
Won't Fix
Undecided
Unassigned

Bug Description

OOPS on maverick with stock kernel.

I was listening to internet radio, it quit playing and I could not get the player to restart. Looked in dmesg and saw this oops report; I had been tailing dmesg in an xterm and roughly correlate when this oops happened to when the music quit.

[80384.879799] BUG: unable to handle kernel paging request at 66a9751c
[80384.879805] IP: [<f959ae41>] snd_ctl_poll+0x41/0x60 [snd]
[80384.879814] *pde = 00000000
[80384.879817] Oops: 0002 [#1] SMP
[80384.879821] last sysfs file: /sys/devices/system/cpu/cpu0/cpufreq/scaling_cur_freq
[80384.879824] Modules linked in: parport_pc ppdev snd_hda_codec_atihdmi snd_hda_codec_realtek binfmt_misc nfsd exportfs nfs lockd fscache nfs_acl auth_rpcgss sunrpc snd_hda_intel snd_hda_codec snd_hwdep snd_seq_midi snd_pcm snd_rawmidi snd_seq_midi_event snd_seq usblp snd_timer snd_seq_device tpm_tis tpm tpm_bios joydev snd soundcore snd_page_alloc lp parport raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear usbhid hid radeon ttm drm_kms_helper drm e1000e intel_agp agpgart i2c_algo_bit
[80384.879870]
[80384.879874] Pid: 1931, comm: pulseaudio Not tainted 2.6.35-22-generic #33-Ubuntu BB Name To be filled by O.E.M./Product Name To Be Filled By O.E.M.
[80384.879877] EIP: 0060:[<f959ae41>] EFLAGS: 00010246 CPU: 0
[80384.879883] EIP is at snd_ctl_poll+0x41/0x60 [snd]
[80384.879886] EAX: 00000041 EBX: f21e5100 ECX: f959ae00 EDX: f21e5130
[80384.879888] ESI: 00000000 EDI: 00000000 EBP: f2c81b9c ESP: f2c81b74
[80384.879891] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
[80384.879894] Process pulseaudio (pid: 1931, ti=f2c80000 task=f6948000 task.ti=f2c80000)
[80384.879896] Stack:
[80384.879897] 00000000 00000000 f2c81b9c f2c81b94 f21e5100 f21e5130 f959ae00 00000041
[80384.879904] <0> f2c81ea0 f2db2880 f2c81bf4 c0227b2b 00000000 c012ce18 f2c81bc0 00000000
[80384.879911] <0> f6948000 00000296 00000000 f2c81c0c f2c81e68 00000000 00000000 f2c81e68
[80384.879919] Call Trace:
[80384.879926] [<f959ae00>] ? snd_ctl_poll+0x0/0x60 [snd]
[80384.879932] [<c0227b2b>] ? do_poll+0xdb/0x230
[80384.879936] [<c012ce18>] ? default_spin_lock_flags+0x8/0x10
[80384.879940] [<c0228603>] ? do_sys_poll+0x133/0x1e0
[80384.879944] [<c0228390>] ? __pollwait+0x0/0xe0
[80384.879947] [<c0228470>] ? pollwake+0x0/0x60
[80384.879950] [<c0228470>] ? pollwake+0x0/0x60
[80384.879953] [<c0228470>] ? pollwake+0x0/0x60
[80384.879956] [<c0228470>] ? pollwake+0x0/0x60
[80384.879959] [<c0228470>] ? pollwake+0x0/0x60
[80384.879962] [<c0228470>] ? pollwake+0x0/0x60
[80384.879965] [<c0228470>] ? pollwake+0x0/0x60
[80384.879968] [<c0228470>] ? pollwake+0x0/0x60
[80384.879972] [<c0228470>] ? pollwake+0x0/0x60
[80384.879975] [<c0228470>] ? pollwake+0x0/0x60
[80384.879978] [<c0228470>] ? pollwake+0x0/0x60
[80384.879981] [<c0228470>] ? pollwake+0x0/0x60
[80384.879984] [<c0228470>] ? pollwake+0x0/0x60
[80384.879987] [<c0228470>] ? pollwake+0x0/0x60
[80384.879990] [<c0228470>] ? pollwake+0x0/0x60
[80384.879993] [<c0228470>] ? pollwake+0x0/0x60
[80384.879996] [<c0228470>] ? pollwake+0x0/0x60
[80384.879999] [<c0228470>] ? pollwake+0x0/0x60
[80384.880005] [<c04e767b>] ? sys_send+0x3b/0x40
[80384.880008] [<c02286e5>] ? sys_ppoll+0x35/0x120
[80384.880012] [<c05c90a4>] ? syscall_call+0x7/0xb
[80384.880017] [<c05c0000>] ? print_cpu_info+0x2/0x129
[80384.880019] Code: d6 8b 53 2c 85 d2 74 22 85 f6 8d 53 18 75 2b 8d 53 30 b8 41 00 00 00 3b 53 30 74 0e 8b 1c 24 8b 74 24 04 89 ec 5d c3 8d 76 00 60 <c0> 8b 1c 24 8b 74 24 00 89 ec 5d c3 8d 76 00 00 d2 74 d1 89 f1
[80384.880060] EIP: [<f959ae41>] snd_ctl_poll+0x41/0x60 [snd] SS:ESP 0068:f2c81b74
[80384.880068] CR2: 0000000066a9751c
[80384.880087] ---[ end trace 7be5302d1ad2383f ]---

[Background]
Since upgrading to maverick last week, I've noticed this machine has system lockups up to 2-3 times a day. So far, the freezes have only occurred while I was using the machine (not while it was idle), particularly when playing an OpenGL-utilizing game.

I've also periodically seen short (1-sec) display blanking, usually occurring only on one of the two monitors (and usually but not always the monitor that has graphics changes.) At least twice, I saw a repeated sequence of blankings, followed by a freeze. One of these two times I also noticed the audio change from music to making some odd (feedback-like) noises immediately prior to freezing.

My suspicion is that perhaps the OOPS, periodic freezes, and screen blanking behaviors are all related. Indeed, I also noticed 2-3 screen blankings prior to the sound going out, and have had several more blankings while filling out this bug report. (OTOH, I've seen blankings before with these monitors before upgrading, just not with such frequency.)

This particular hardware is a test box from Intel which I've been using this past year mainly for builds and testing, and had proven to be solid as a rock. However, I hadn't tested dual-head or audio prior to the upgrade so can't be 100% certain it's a regression, as opposed to just incomplete hardware support or something.

ProblemType: Bug
DistroRelease: Ubuntu 10.10
Package: linux-image-2.6.35-22-generic 2.6.35-22.33
Regression: Yes
Reproducible: No
ProcVersionSignature: Ubuntu 2.6.35-22.33-generic 2.6.35.4
Uname: Linux 2.6.35-22-generic i686
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.23.
Architecture: i386
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/by-path', '/dev/snd/controlC1', '/dev/snd/hwC1D0', '/dev/snd/pcmC1D3p', '/dev/snd/controlC0', '/dev/snd/hwC0D2', '/dev/snd/pcmC0D0c', '/dev/snd/pcmC0D0p', '/dev/snd/pcmC0D1c', '/dev/snd/pcmC0D1p', '/dev/snd/pcmC0D2c', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
 Card hw:0 'Intel'/'HDA Intel at 0xfeaf4000 irq 43'
   Mixer name : 'Realtek ALC889A'
   Components : 'HDA:10ec0885,10ec0885,00100103'
   Controls : 42
   Simple ctrls : 23
Card1.Amixer.info:
 Card hw:1 'HDMI'/'HDA ATI HDMI at 0xfebec000 irq 43'
   Mixer name : 'ATI R6xx HDMI'
   Components : 'HDA:1002aa01,00aa0100,00100100'
   Controls : 4
   Simple ctrls : 1
Card1.Amixer.values:
 Simple mixer control 'IEC958',0
   Capabilities: pswitch pswitch-joined penum
   Playback channels: Mono
   Mono: Playback [on]
Date: Mon Sep 27 10:22:12 2010
Frequency: Once a day.
HibernationDevice: RESUME=UUID=6014ea95-bee9-4b30-b6f4-e5bb0891951b
InstallationMedia: Ubuntu 10.04 "Lucid Lynx" - Alpha i386 (20100113)
IwConfig:
 lo no wireless extensions.

 eth0 no wireless extensions.
MachineType: System Manufacturer To Be Filled By O.E.M. Product Name To Be Filled By O.E.M.
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.35-22-generic root=UUID=cc52aa3c-96d7-4e1e-b2d7-83b10e23c0d7 ro quiet splash
ProcEnviron:
 LANGUAGE=en_US:en
 PATH=(custom, user)
 LANG=C
 SHELL=/bin/bash
RelatedPackageVersions: linux-firmware 1.38
RfKill:

SourcePackage: linux
dmi.bios.date: 05/30/2008
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: CGELIA47.86P
dmi.board.asset.tag: BB Asset To Be Filled By O.E.M.
dmi.board.name: BB Name To be filled by O.E.M.
dmi.board.vendor: BB Manufacturer To be filled by O.E.M.
dmi.board.version: BB Version To be filled by O.E.M.
dmi.chassis.asset.tag: Chasis Asset Tag To Be Filled By O.E.M.
dmi.chassis.type: 3
dmi.chassis.vendor: Chasis Manufacturer To Be Filled By O.E.M.
dmi.chassis.version: Chasis Version To Be Filled By O.E.M.
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvrCGELIA47.86P:bd05/30/2008:svnSystemManufacturerToBeFilledByO.E.M.:pnProductNameToBeFilledByO.E.M.:pvrSystemVersionToBeFilledByO.E.M.:rvnBBManufacturerTobefilledbyO.E.M.:rnBBNameTobefilledbyO.E.M.:rvrBBVersionTobefilledbyO.E.M.:cvnChasisManufacturerToBeFilledByO.E.M.:ct3:cvrChasisVersionToBeFilledByO.E.M.:
dmi.product.name: Product Name To Be Filled By O.E.M.
dmi.product.version: System Version To Be Filled By O.E.M.
dmi.sys.vendor: System Manufacturer To Be Filled By O.E.M.

Revision history for this message
Bryce Harrington (bryce) wrote :
Revision history for this message
Bryce Harrington (bryce) wrote :

Letting the system continue running after filing the bug report, I notice these two lines added to the end of dmesg:

[81531.800850] ACPI Warning: Incorrect checksum in table [iEIT] - 0x5C, should be 0x51 (20100428/tbutils-314)
[81531.801227] ACPI Warning: Incorrect checksum in table [iEIT] - 0x5C, should be 0x51 (20100428/tbutils-314)

description: updated
Bryce Harrington (bryce)
Changed in linux (Ubuntu Maverick):
milestone: none → maverick-updates
milestone: maverick-updates → none
Revision history for this message
Bryce Harrington (bryce) wrote :

bryce@blumonc:/var/log$ ps aux | grep pulseaudio
bryce 1931 1.1 0.0 0 0 ? Z<sl Sep26 15:36 [pulseaudio] <defunct>

Revision history for this message
Bryce Harrington (bryce) wrote :

Restarting pulseaudio immediately resulted in a system freeze.

Revision history for this message
Bryce Harrington (bryce) wrote :
Revision history for this message
Bryce Harrington (bryce) wrote :

Hmm, the first photo was unreadable. Basically it just shows that nothing was printed to any of the logs I was tailing. The second photo is clear and has all the relevant stuff in it.

Next I'm going to try to see if there's any relevant bios updates for the box from intel I'm missing...

Bryce Harrington (bryce)
summary: BUG: unable to handle kernel paging request - EIP: [<f959ae41>]
- snd_ctl_poll
+ snd_ctl_poll (Followed by system lockup)
tags: added: kernel-needs-review
Revision history for this message
Bryce Harrington (bryce) wrote :

Fwiw, I traced the screen blanking to a bent pin in my DVI cable. Hazards of being a regular tester of graphics hardware I guess. Anyway, haven't had that problem since swapping out the cable yesterday.

Also fwiw, haven't seen the system freeze since I reported it. Other than the cable, I haven't made any notable changes to the system (that I can recall).

Revision history for this message
Bryce Harrington (bryce) wrote :

Freeze occurred again...

Oct 3 17:12:26 blumonc kernel: [ 7249.505778] BUG: unable to handle kernel paging request at c023d410
Oct 3 17:12:26 blumonc kernel: [ 7249.505784] IP: [<ef9b0b73>] 0xef9b0b73
Oct 3 17:12:26 blumonc kernel: [ 7249.505788] *pde = 009a1067 *pte = 0023d161
Oct 3 17:12:26 blumonc kernel: [ 7249.505791] Oops: 0003 [#1] SMP
Oct 3 17:12:26 blumonc kernel: [ 7249.505794] last sysfs file: /sys/devices/system/cpu/cpu0/cpufreq/scaling_cur_freq
Oct 3 17:12:26 blumonc kernel: [ 7249.505796] Modules linked in: binfmt_misc parport_pc ppdev snd_hda_codec_atihdmi snd_hda_code\
c_realtek nfsd exportfs nfs lockd fscache nfs_acl auth_rpcgss usblp snd_seq_midi snd_hda_intel snd_rawmidi snd_hda_codec snd_hwde\
p snd_seq_midi_event snd_pcm snd_seq snd_seq_device snd_timer lp tpm_tis tpm joydev tpm_bios snd soundcore sunrpc snd_page_alloc \
parport raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear radeo\
n ttm drm_kms_helper drm usbhid hid intel_agp e1000e agpgart i2c_algo_bit
Oct 3 17:12:26 blumonc kernel: [ 7249.505829]
Oct 3 17:24:17 blumonc kernel: imklog 4.2.0, log source = /proc/kmsg started.

I'd freshly rebooted about 2 hours before after updating to latest maverick bits, and was playing Dwarf Fortress, a game that uses mesa fairly heavily (ironic because the interface is text like nethack). In fact, I had been playing the game last weekend when I noticed so many freezes. I haven't been playing it at all this week, and the system has been stable and freeze-free. So that sort of makes me think maybe the freeze is specific to DF's 3D/mesa usage. On the other hand, when it froze Monday I hadn't been playing it, so maybe not.

Brad Figg (brad-figg)
tags: added: acpi
tags: added: acpi-parse-exec-fail
tags: added: acpi-table-checksum
Brad Figg (brad-figg)
Changed in linux (Ubuntu):
status: New → Confirmed
tags: removed: regression-potential
Revision history for this message
Bryce Harrington (bryce) wrote :

Fwiw, I still see this bug from time to time, about once a week. It's possible it's something wrong with the machine's motherboard or cpu hardware.

Revision history for this message
Bryce Harrington (bryce) wrote :

Haven't seen this bug in a long long time. Since it looks like no one else reproduces it or are otherwise interested in it, will close it out at this point.

Changed in linux (Ubuntu):
status: Confirmed → Won't Fix
Changed in linux (Ubuntu Maverick):
status: New → Won't Fix
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.