BUG: Bad page map in process sleep pte:2000000000000000 pmd:126932067

Bug #659705 reported by randallw
This bug affects 3 people
Affects: linux (Ubuntu)
Status: Expired
Importance: Undecided
Assigned to: Unassigned
Milestone: (none)

Bug Description

Hi,

I have a workstation that has been running Hardy with no problems for 1.5 years. When trying to run with the newer versions of Ubuntu, the system hangs (hard lock, power button to reset) a few minutes after logging in. This is true for 10.04.1 (installed) and 10.10 (livecd).

The hang seems to go away if I disable "ACPI APIC support" in the BIOS; however, if I do that, only 1 of the 4 CPU cores is available, so this isn't really an acceptable workaround.

I have submitted this report with the system running 10.04.1 with the bios APIC option disabled so that it would not crash and I had time to generate the various files. Hence, readers of the files should not expect to see errors in the files. I did manage to capture a dmesg output before a crash, and I will post that separately.

This is a very frustrating problem. I have searched and found no exactly matching cases, although there seem to be some similar ones. I have run 10.04.1 (livecd) on a similar workstation with a similar ASUS mobo (a P5Q Deluxe, rather than the P5Q-E that is in this one) and found no problems. However, the BIOS on the other workstation is newer (v2301) than the latest one available for this one (v2101).

I realise that this might be difficult for the Ubuntu team to diagnose; however, I am willing to help in whatever way I can to get to the bottom of the problem.

Cheers,
Randall.

ProblemType: Bug
DistroRelease: Ubuntu 10.04
Package: linux-image-2.6.32-25-generic 2.6.32-25.44
Regression: Yes
Reproducible: No
ProcVersionSignature: Ubuntu 2.6.32-25.44-generic 2.6.32.21+drm33.7
Uname: Linux 2.6.32-25-generic x86_64
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21.
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: rwayth 1531 F.... pulseaudio
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
 Card hw:0 'Intel'/'HDA Intel at 0xf9ff8000 irq 3'
   Mixer name : 'Analog Devices AD1989B'
   Components : 'HDA:11d4989b,10438311,00100300'
   Controls : 47
   Simple ctrls : 26
Card1.Amixer.info:
 Card hw:1 'NVidia'/'HDA NVidia at 0xfbdfc000 irq 11'
   Mixer name : 'Nvidia ID d'
   Components : 'HDA:10de000d,10de0101,00100100'
   Controls : 0
   Simple ctrls : 0
Card1.Amixer.values:

Date: Wed Oct 13 14:48:40 2010
Frequency: Once a day.
HibernationDevice: RESUME=UUID=e8bd63f9-2518-4997-a729-f2b94c9a1267
InstallationMedia: Ubuntu 10.04 LTS "Lucid Lynx" - Release amd64 (20100429)
IwConfig:
 lo no wireless extensions.

 eth0 no wireless extensions.

 eth1 no wireless extensions.
MachineType: System manufacturer System Product Name
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-25-generic root=UUID=895e99ea-b603-477e-85ce-949e67118b91 ro quiet splash
ProcEnviron:
 LANG=en_AU.utf8
 SHELL=/bin/bash
RelatedPackageVersions: linux-firmware 1.34.1
RfKill:

SourcePackage: linux
dmi.bios.date: 04/06/2009
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 2101
dmi.board.asset.tag: To Be Filled By O.E.M.
dmi.board.name: P5Q-E
dmi.board.vendor: ASUSTeK Computer INC.
dmi.board.version: Rev 1.xx
dmi.chassis.asset.tag: Asset-1234567890
dmi.chassis.type: 3
dmi.chassis.vendor: Chassis Manufacture
dmi.chassis.version: Chassis Version
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr2101:bd04/06/2009:svnSystemmanufacturer:pnSystemProductName:pvrSystemVersion:rvnASUSTeKComputerINC.:rnP5Q-E:rvrRev1.xx:cvnChassisManufacture:ct3:cvrChassisVersion:
dmi.product.name: System Product Name
dmi.product.version: System Version
dmi.sys.vendor: System manufacturer

randallw (rwayth) wrote :

This is the output of dmesg when the APIC option in the bios is enabled (default) about 10 seconds before it crashed.

randallw (rwayth) wrote :

Here is the dmesg output for the mainline kernel 2.6.32-0206322410-generic. This also hangs, although it wasn't an immediate hard lockup. First the input from the usb keyboard seemed to die (mouse still worked). I tried switching to a text terminal to grab output, but it had hung by the time I got to a text prompt.

randallw (rwayth) wrote :

Here is the mainline kernel dmesg output with the APIC option disabled in the BIOS. It seems stable again (I am using it now), but again with only 1 core of the 4-core CPU.
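
A quick way to confirm how many cores the kernel actually brought up under each BIOS setting is to count the processor entries in /proc/cpuinfo. A minimal sketch, assuming a POSIX shell; count_cpus is a hypothetical helper name, and the optional file argument is only there so that a saved copy (such as an attached ProcCpuinfo.txt) can be checked as well:

```shell
# count_cpus [FILE]
# Print the number of logical CPUs listed in a cpuinfo-style file.
# FILE defaults to /proc/cpuinfo (hypothetical helper, not an Ubuntu tool).
count_cpus() {
    file=${1:-/proc/cpuinfo}
    # Each logical CPU contributes one line that starts with "processor".
    grep -c '^processor' "$file"
}

# Example (run on the affected machine):
#   count_cpus
#   count_cpus ProcCpuinfo.txt
```

On the workstation described here, the expectation would be 4 with APIC enabled and 1 with it disabled.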

randallw (rwayth) wrote :

This dmesg output contains some interesting errors; however, the system hung many seconds after the errors appeared in the log. In all cases I am running tail -f /var/log/messages, and nothing ever appears immediately before the system hangs, so the errors in this log might not have anything to do with the real problem.

randallw (rwayth) wrote :

An update: I have been experimenting with kernel command-line parameters for APIC. Adding "noapic" makes no difference: all CPUs still appear and it still crashes after a few minutes. Adding "nolapic" has the same effect as disabling APIC in the BIOS: the system is stable, but you only get 1 CPU. The best option seems to be "nolapic_timer", which gives what seems to be a stable system with all 4 CPUs showing.
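
One way to make such an option persistent on a GRUB 2 install is to append it to GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub and then regenerate the boot configuration. A minimal sketch, assuming GNU sed and grep and a simple parameter name with no regex metacharacters or slashes; add_kernel_param is a hypothetical helper, not an Ubuntu tool:

```shell
# add_kernel_param FILE PARAM
# Append PARAM to the GRUB_CMDLINE_LINUX_DEFAULT="..." line in FILE,
# unless it is already present as a whole word on that line.
# Assumes PARAM is a plain token such as nolapic_timer.
add_kernel_param() {
    file=$1
    param=$2
    # Already present as a separate word inside the quotes: nothing to do.
    if grep -Eq "^GRUB_CMDLINE_LINUX_DEFAULT=\"([^\"]* )?${param}( [^\"]*)?\"" "$file"; then
        return 0
    fi
    # Insert the parameter just before the closing quote of the value.
    sed -i -E "s/^(GRUB_CMDLINE_LINUX_DEFAULT=\"[^\"]*)\"/\1 ${param}\"/" "$file"
}

# Example (needs root; back up the file first):
#   cp /etc/default/grub /etc/default/grub.bak
#   add_kernel_param /etc/default/grub nolapic_timer
#   update-grub    # regenerate the boot menu, then reboot
```

The same option can be tried for a single boot first by editing the kernel line from the GRUB menu, which avoids changing any files until it is known to help.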

I'll add a few dmesg outputs with the various kernel options used.

jason.sidabras (jason-sidabras) wrote :

I have traced this down to only occurring while in battery mode on my Samsung X360. It occurs seemingly at random, but it is accompanied by a hard drive spindown. I have had this issue since 9.04.

Currently the only workaround that reduces the probability of a hard lockup is setting:

hdparm -S 0 /dev/sda

I have tried to simulate this by messing with -Y and such in hdparm, but I can't seem to hard lock it manually. The seemingly random lockup may coincide closely with the time I would get the 40% battery-life notification (the first "warning").

Hope this helps.

randallw (rwayth) wrote :

Hi Jason,
That is interesting. Just so we're 100% clear: your hard-lockup is still related to APIC bios/kernel options, right? So you can avoid the lockups like me by using the "nolapic_timer" kernel command-line option or by disabling APIC?

Does your laptop use some sort of CPU frequency scaling when in battery mode to power save? I find it surprising that the hard disk should have anything to do with it.

Cheers,
Randall

John Burkhart (jfburkhart) wrote :

Jason,

I have the same problem on a Samsung X360. It *was* only occurring in battery mode, but recently it has started occurring when I'm plugged in as well.
Linux niflino 2.6.35-24-generic #42-Ubuntu SMP Thu Dec 2 02:41:37 UTC 2010 x86_64 GNU/Linux

I've just set "nolapic_timer" in my /etc/default/grub file, but it certainly makes no difference on battery. I'll report back on how it behaves on mains power, as I really have no way to 'instigate' this crash.
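
An option added to /etc/default/grub only takes effect after update-grub has been run and the machine has been rebooted, so it may be worth confirming that the running kernel actually received it. A minimal sketch, assuming a POSIX shell; has_kernel_param is a hypothetical helper name, and its optional second argument exists only so the check can be run against a saved copy of the command line:

```shell
# has_kernel_param PARAM [FILE]
# Succeed if PARAM appears as a whole token on the kernel command line.
# FILE defaults to /proc/cmdline (hypothetical helper, not an Ubuntu tool).
has_kernel_param() {
    param=$1
    file=${2:-/proc/cmdline}
    # Put each token on its own line, then require an exact fixed-string
    # match, so that "nolapic" does not match "nolapic_timer".
    tr ' ' '\n' < "$file" | grep -qxF -- "$param"
}

# Example (run after rebooting):
#   if has_kernel_param nolapic_timer; then
#       echo "nolapic_timer is active"
#   else
#       echo "nolapic_timer is NOT on the running kernel's command line"
#   fi
```

If the option is missing, the lockups seen on battery would say nothing about whether "nolapic_timer" helps on this hardware.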

I've tried the hdparm setting as well, but these are SSD drives, is it even relevant?

--john

Brad Figg (brad-figg)
Changed in linux (Ubuntu):
status: New → Confirmed
Predrag (martincom) wrote :

Hi

I have a similar problem.

I have a problem with my Ubuntu 10.10, and I had a problem with the installation too. If I boot the live CD, the system freezes 10-15 seconds after I log in to the desktop. To install, I had to press F6 during the installation process and check: noacpi=off, noapic, nolapic, edd=on (I'm not sure whether all of these options need to be checked). After this I could install Ubuntu without freezes. After the installation I had the same problem: the system freezes after 10-15 seconds and I have to hard-restart it. I discovered that if I go to the recovery console and run root console > su predrag > startx, Ubuntu starts up without any problems. I don't think the problem is Compiz, because I have run Ubuntu both with and without it. In recovery console mode I have turned Compiz on and everything is fine.
I have installed Ubuntu on this PC many times and everything went well. The only thing I changed is my graphics card, from Nvidia to ATI. Maybe this is the problem? (I don't think so.)

Tell me what I should send to you and I will do it.

My PC configuration is in the attachment.

randallw (rwayth) wrote :

Hi Predrag,

Did you try your old graphics card? Did you try the nolapic_timer option?

penalvch (penalvch) wrote :

randallw, thank you for reporting this bug and helping make Ubuntu better. This bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? Can you try with the latest development release of Ubuntu? ISO CD images are available from http://cdimage.ubuntu.com/releases/ .

If it remains an issue, could you run the following command from a Terminal (Applications->Accessories->Terminal). It will automatically gather and attach updated debug information to this report.

apport-collect -p linux <replace-with-bug-number>

Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.

Thanks in advance.

tags: added: kernel-bug
summary: - hard lock (hang) after a few minutes with APIC enabled
+ BUG: Bad page map in process sleep pte:2000000000000000 pmd:126932067
Changed in linux (Ubuntu):
status: Confirmed → Incomplete
randallw (rwayth) wrote : AlsaDevices.txt

apport information

tags: added: apport-collected
description: updated
randallw (rwayth) wrote : AplayDevices.txt

apport information

randallw (rwayth) wrote : ArecordDevices.txt

apport information

randallw (rwayth) wrote : BootDmesg.txt

apport information

randallw (rwayth) wrote : Card0.Amixer.values.txt

apport information

randallw (rwayth) wrote : Card0.Codecs.codec.0.txt

apport information

randallw (rwayth) wrote : Card1.Codecs.codec.0.txt

apport information

randallw (rwayth) wrote : Card1.Codecs.codec.1.txt

apport information

randallw (rwayth) wrote : Card1.Codecs.codec.2.txt

apport information

randallw (rwayth) wrote : Card1.Codecs.codec.3.txt

apport information

randallw (rwayth) wrote : Lspci.txt

apport information

randallw (rwayth) wrote : Lsusb.txt

apport information

randallw (rwayth) wrote : PciMultimedia.txt

apport information

randallw (rwayth) wrote : ProcCpuinfo.txt

apport information

randallw (rwayth) wrote : ProcInterrupts.txt

apport information

randallw (rwayth) wrote : ProcModules.txt

apport information

randallw (rwayth) wrote : UdevDb.txt

apport information

randallw (rwayth) wrote : UdevLog.txt

apport information

randallw (rwayth) wrote : WifiSyslog.txt

apport information

randallw (rwayth) wrote :

Hi Christopher,
I tested my workstation with the 12.04 beta2 from the ubuntu releases website that you listed, booting the livecd image on a USB stick. I get identical behaviour on my system as with 10.04. If I boot from a USB livecd, then it boots, goes into Unity and then hangs after a few minutes. If I boot the livecd with the "nolapic_timer" kernel option, I get a stable system. Based on this, I have run the apport-collect command (again) and all the files are listed above.

The only way I am willing to test a mainline kernel is if you can point me to a livecd ISO so that I can boot from a USB drive. I can't easily install a non-lucid mainline kernel on this machine.

Regards,
Randall.

penalvch (penalvch)
description: updated
Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired