nfs4 hangs, unkillable, using lots of CPU time

Bug #668813 reported by gpk
12
This bug affects 2 people
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Expired
Medium
Unassigned

Bug Description

bzr diff
on a repository that is mounted via NFSv4 triggers a bug.
The result is that bzr hangs and is unkillable by any means, including kill -9. Another result is a NFS4 packet storm involving repeated failures of a WRITE request.

This seems to be a very specific corner case, as NFS4 works perfectly for everything else. Reading other bug reports and the bzr code, it seems to involve some kind of interaction between file locking and the ftruncate() system call.

Complete information can be found in bug https://bugs.launchpad.net/bugs/651806 .

To reproduce, unpack the attached tar file to make a directory called "bar" containing a .bzr repository.
Unpack this somewhere that is mounted over nfs4.
cd bar
bzr diff

enjoy!

ProblemType: Bug
DistroRelease: Ubuntu 10.10
Package: linux-image-2.6.35-23-generic 2.6.35-23.36
Regression: Yes
Reproducible: Yes
ProcVersionSignature: Ubuntu 2.6.35-23.36-generic 2.6.35.7
Uname: Linux 2.6.35-23-generic x86_64
NonfreeKernelModules: fglrx
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.23.
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: gpk 5100 F.... pulseaudio
 /dev/snd/pcmC0D0p: gpk 5100 F...m pulseaudio
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
 Card hw:0 'SB'/'HDA ATI SB at 0xfe024000 irq 16'
   Mixer name : 'Realtek ALC887'
   Components : 'HDA:10ec0887,1458a002,00100202'
   Controls : 38
   Simple ctrls : 21
Date: Sat Oct 30 18:05:38 2010
HibernationDevice: RESUME=UUID=6f657719-3b2e-4471-b419-bd2d396aeac2
InstallationMedia: Ubuntu 10.10 "Maverick Meerkat" - Beta amd64 (20100901.1)
IwConfig:
 lo no wireless extensions.

 eth0 no wireless extensions.

 vboxnet0 no wireless extensions.
MachineType: Gigabyte Technology Co., Ltd. GA-MA78LMT-S2
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.35-23-generic root=UUID=f52187b6-c1df-4ee2-9996-bfac2bc98c2d ro quiet splash
ProcEnviron:
 PATH=(custom, user)
 LANG=en_GB.utf8
 SHELL=/bin/bash
RelatedPackageVersions: linux-firmware 1.38
RfKill:

SourcePackage: linux
dmi.bios.date: 03/18/2010
dmi.bios.vendor: Award Software International, Inc.
dmi.bios.version: F6
dmi.board.name: GA-MA78LMT-S2
dmi.board.vendor: Gigabyte Technology Co., Ltd.
dmi.board.version: x.x
dmi.chassis.type: 3
dmi.chassis.vendor: Gigabyte Technology Co., Ltd.
dmi.modalias: dmi:bvnAwardSoftwareInternational,Inc.:bvrF6:bd03/18/2010:svnGigabyteTechnologyCo.,Ltd.:pnGA-MA78LMT-S2:pvr:rvnGigabyteTechnologyCo.,Ltd.:rnGA-MA78LMT-S2:rvrx.x:cvnGigabyteTechnologyCo.,Ltd.:ct3:cvr:
dmi.product.name: GA-MA78LMT-S2
dmi.sys.vendor: Gigabyte Technology Co., Ltd.

Revision history for this message
gpk (gpk-kochanski) wrote :
Revision history for this message
gpk (gpk-kochanski) wrote :

The server nfs options are (from /etc/exports)

/export 192.168.2.2(rw,fsid=0,root_squash,no_subtree_check,async) 127.0.0.1(rw,fsid=0,root_squash,no_subtree_check,async) 192.168.3.2(rw,fsid=0,root_squash,no_subtree_check,async)

and the client options are (from /etc/fstab)

desk.lan:/gpk /home/gpk/n nfs4 _netdev,defaults,soft 0 0

Revision history for this message
gpk (gpk-kochanski) wrote :
Brad Figg (brad-figg)
Changed in linux (Ubuntu):
status: New → Confirmed
Revision history for this message
gpk (gpk-kochanski) wrote :

Still happens in Maverick.
Linux desk 2.6.35-29-generic #51-Ubuntu SMP Fri Apr 15 17:12:35 UTC 2011 x86_64 GNU/Linux
It seems to have gone away for a while, but it is certainly back!

Revision history for this message
penalvch (penalvch) wrote :

gpk, thank you for reporting this and helping make Ubuntu better. Maverick reached EOL on April 10, 2012.
Please see this document for currently supported Ubuntu releases:
https://wiki.ubuntu.com/Releases

We were wondering if this is still an issue in a supported release? If so, can you try with the latest development release of Ubuntu? ISO CD images are available from http://cdimage.ubuntu.com/releases/ .

If it remains an issue, could you run the following command in a supported release from a Terminal (Applications->Accessories->Terminal). It will automatically gather and attach updated debug information to this report.

apport-collect -p linux <replace-with-bug-number>

Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please do not test the kernel in the daily folder, but the one all the way at the bottom. Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. As well, please comment on which kernel version specifically you tested.

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

If you are unable to test the mainline kernel, for example it will not boot, please add the tag: 'kernel-unable-to-test-upstream', and comment as to why specifically you were unable to test it.

Please let us know your results. Thanks in advance.

Helpful Bug Reporting Links:
https://help.ubuntu.com/community/ReportingBugs#Bug_Reporting_Etiquette
https://help.ubuntu.com/community/ReportingBugs#A3._Make_sure_the_bug_hasn.27t_already_been_reported
https://help.ubuntu.com/community/ReportingBugs#Adding_Apport_Debug_Information_to_an_Existing_Launchpad_Bug
https://help.ubuntu.com/community/ReportingBugs#Adding_Additional_Attachments_to_an_Existing_Launchpad_Bug

tags: removed: kill nfs4 nfsv4 packet storm
Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.