XHCI errors cause dock USB and network adapter dropouts

Bug #1663975 reported by leee
This bug affects 23 people
Affects: linux (Ubuntu)
Status: Confirmed
Importance: Medium
Assigned to: Unassigned

Bug Description

Machine: Dell XPS 15 (9560) - released 2017 January
Install: disk completely erased
         Ubuntu 16.04.1 installed over wireless with:
           - download updates while installing ubuntu
           - install third-party hardware
           - full disk crypto+luks
         Note 1: The installer did not detect the GTX 1050,
                 so we are using only the Intel integrated graphics.
         Note 2: Installation was done off the Dell TB16 dock to isolate
                 the wireless problems first.
First Boot: ran update/upgrade/dist-upgrade after first boot and rebooted

--------------------------------------------------------------------------------

After testing wireless problems and being able to reproduce them, machine was rebooted to the login screen, and then plugged into a Dell TB16 dock.

1. Turn down wireless networking.
2. Turn up wired networking.
3. Browse things until this happens, after which the wired interface no longer works and can sometimes be successfully `down/up`ed but at other times locks up and requires a reboot:

[ 598.238303] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
[ 598.238307] xhci_hcd 0000:0e:00.0: Looking for event-dma 0000000855403010 trb-start 00000008550aafe0 trb-end 00000008550aafe0 seg-start 00000008550aa000 seg-end 00000008550aaff0
[ 598.238634] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
[ 598.238639] xhci_hcd 0000:0e:00.0: Looking for event-dma 0000000855403020 trb-start 00000008550aafe0 trb-end 00000008550aafe0 seg-start 00000008550aa000 seg-end 00000008550aaff0
[ 598.239248] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
[ 598.239253] xhci_hcd 0000:0e:00.0: Looking for event-dma 0000000855403030 trb-start 00000008550aafe0 trb-end 00000008550aafe0 seg-start 00000008550aa000 seg-end 00000008550aaff0
[ 598.239571] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
[ 598.239576] xhci_hcd 0000:0e:00.0: Looking for event-dma 0000000855403040 trb-start 00000008550aafe0 trb-end 00000008550aafe0 seg-start 00000008550aa000 seg-end 00000008550aaff0
[ 598.239792] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
[ 598.239794] xhci_hcd 0000:0e:00.0: Looking for event-dma 0000000855403050 trb-start 00000008550aafe0 trb-end 00000008550aafe0 seg-start 00000008550aa000 seg-end 00000008550aaff0
[ 598.240186] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
[ 598.240191] xhci_hcd 0000:0e:00.0: Looking for event-dma 0000000855403060 trb-start 00000008550aafe0 trb-end 00000008550aafe0 seg-start 00000008550aa000 seg-end 00000008550aaff0
[ 598.240326] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
[ 598.240328] xhci_hcd 0000:0e:00.0: Looking for event-dma 0000000855403070 trb-start 00000008550aafe0 trb-end 00000008550aafe0 seg-start 00000008550aa000 seg-end 00000008550aaff0
[ 598.240698] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
[ 598.240703] xhci_hcd 0000:0e:00.0: Looking for event-dma 0000000855403080 trb-start 00000008550aafe0 trb-end 00000008550aafe0 seg-start 00000008550aa000 seg-end 00000008550aaff0
[ 605.601884] ------------[ cut here ]------------
[ 605.601890] WARNING: CPU: 1 PID: 0 at /build/linux-W6HB68/linux-4.4.0/net/sched/sch_generic.c:306 dev_watchdog+0x237/0x240()
[ 605.601892] NETDEV WATCHDOG: enxd481d70e6a39 (r8152): transmit queue 0 timed out
[ 605.601892] Modules linked in: hid_logitech_hidpp snd_usb_audio snd_usbmidi_lib hid_generic hid_logitech_dj cdc_ether usbnet r8152 mii ctr ccm arc4 rfcomm bnep nls_iso8859_1 i2c_designware_platform i2c_designware_core dell_wmi dcdbas ath10k_pci ath10k_core ath snd_hda_codec_hdmi mac80211 dell_led snd_hda_codec_realtek snd_hda_codec_generic cfg80211 rtsx_pci_ms memstick snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm x86_pkg_temp_thermal coretemp snd_seq_midi kvm_intel snd_seq_midi_event snd_rawmidi kvm snd_seq uvcvideo irqbypass snd_seq_device snd_timer videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 input_leds videobuf2_core snd joydev v4l2_common serio_raw videodev soundcore hid_multitouch media btusb idma64 virt_dma btrtl mei_me processor_thermal_device mei shpchp intel_lpss_pci intel_soc_dts_iosf
[ 605.601927] hci_uart btbcm btqca btintel bluetooth dell_smo8800 intel_hid intel_lpss_acpi intel_lpss tpm_crb acpi_als sparse_keymap int3403_thermal int3400_thermal int340x_thermal_zone acpi_thermal_rel kfifo_buf acpi_pad mac_hid industrialio parport_pc ppdev lp parport autofs4 drbg ansi_cprng algif_skcipher af_alg dm_crypt usbhid rtsx_pci_sdmmc crct10dif_pclmul crc32_pclmul ghash_clmulni_intel i915_bpo nouveau aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd mxm_wmi intel_ips ttm psmouse i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ahci nvme rtsx_pci drm libahci i2c_hid hid wmi pinctrl_sunrisepoint video pinctrl_intel fjes
[ 605.601956] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G W 4.4.0-62-generic #83-Ubuntu
[ 605.601959] Hardware name: Dell Inc. XPS 15 9560/05FFDN, BIOS 1.0.3 01/09/2017
[ 605.601962] 0000000000000286 2b53e360d81133f5 ffff88087f443d98 ffffffff813f7c63
[ 605.601964] ffff88087f443de0 ffffffff81d6afd8 ffff88087f443dd0 ffffffff810812d2
[ 605.601965] 0000000000000000 ffff880856c3c280 0000000000000001 ffff88085439c000
[ 605.601967] Call Trace:
[ 605.601968] <IRQ> [<ffffffff813f7c63>] dump_stack+0x63/0x90
[ 605.601974] [<ffffffff810812d2>] warn_slowpath_common+0x82/0xc0
[ 605.601975] [<ffffffff8108136c>] warn_slowpath_fmt+0x5c/0x80
[ 605.601978] [<ffffffff81753957>] dev_watchdog+0x237/0x240
[ 605.601980] [<ffffffff81753720>] ? qdisc_rcu_free+0x40/0x40
[ 605.601983] [<ffffffff810ecd55>] call_timer_fn+0x35/0x120
[ 605.601984] [<ffffffff81753720>] ? qdisc_rcu_free+0x40/0x40
[ 605.601986] [<ffffffff810ed70a>] run_timer_softirq+0x23a/0x2f0
[ 605.601988] [<ffffffff81085db1>] __do_softirq+0x101/0x290
[ 605.601990] [<ffffffff810860b3>] irq_exit+0xa3/0xb0
[ 605.601992] [<ffffffff8183b0a2>] smp_apic_timer_interrupt+0x42/0x50
[ 605.601995] [<ffffffff81839362>] apic_timer_interrupt+0x82/0x90
[ 605.601995] <EOI> [<ffffffff816cb5d1>] ? cpuidle_enter_state+0x111/0x2b0
[ 605.602000] [<ffffffff816cb7a7>] cpuidle_enter+0x17/0x20
[ 605.602002] [<ffffffff810c4522>] call_cpuidle+0x32/0x60
[ 605.602004] [<ffffffff816cb783>] ? cpuidle_select+0x13/0x20
[ 605.602006] [<ffffffff810c47e0>] cpu_startup_entry+0x290/0x350
[ 605.602008] [<ffffffff81051784>] start_secondary+0x154/0x190
[ 605.602010] ---[ end trace f9200c001343dda6 ]---
[ 605.602014] r8152 4-1.2:1.0 enxd481d70e6a39: Tx timeout
[ 605.643576] r8152 4-1.2:1.0 enxd481d70e6a39: Tx status -2
[ 605.656410] r8152 4-1.2:1.0 enxd481d70e6a39: Tx status -2
[ 605.669219] r8152 4-1.2:1.0 enxd481d70e6a39: Tx status -2
[ 605.679077] r8152 4-1.2:1.0 enxd481d70e6a39: Tx status -2
[ 608.319280] usb 4-1.2: reset SuperSpeed USB device number 4 using xhci_hcd
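The "Looking for event-dma" lines above can be read mechanically. The following is a minimal sketch, not part of the original report, that parses one such line and checks whether the event DMA address falls inside the ring segment the driver was inspecting. The function names are illustrative, and the regular expression assumes the message layout stays exactly as quoted in this log.

```python
import re

# Matches the address fields of the "Looking for event-dma" diagnostic line.
# All five fields are hexadecimal addresses (layout assumed from the log above).
LOOKING_RE = re.compile(
    r"event-dma (?P<event>[0-9a-f]+) "
    r"trb-start (?P<trb_start>[0-9a-f]+) "
    r"trb-end (?P<trb_end>[0-9a-f]+) "
    r"seg-start (?P<seg_start>[0-9a-f]+) "
    r"seg-end (?P<seg_end>[0-9a-f]+)"
)


def parse_looking_line(line):
    """Return the five addresses as ints, or None if the line does not match."""
    match = LOOKING_RE.search(line)
    if match is None:
        return None
    return {name: int(value, 16) for name, value in match.groupdict().items()}


def event_inside_segment(fields):
    """True if the event DMA address lies within [seg-start, seg-end]."""
    return fields["seg_start"] <= fields["event"] <= fields["seg_end"]


# First "Looking for" line from the log above.
sample = (
    "[ 598.238307] xhci_hcd 0000:0e:00.0: Looking for event-dma "
    "0000000855403010 trb-start 00000008550aafe0 trb-end 00000008550aafe0 "
    "seg-start 00000008550aa000 seg-end 00000008550aaff0"
)

fields = parse_looking_line(sample)
print(event_inside_segment(fields))  # False
```

For this sample the reported event address lies outside the segment bounds, which is consistent with the "not part of current TD" wording of the error.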

ProblemType: Bug
DistroRelease: Ubuntu 16.04
Package: linux-image-4.4.0-62-generic 4.4.0-62.83
ProcVersionSignature: Ubuntu 4.4.0-62.83-generic 4.4.40
Uname: Linux 4.4.0-62-generic x86_64
ApportVersion: 2.20.1-0ubuntu2.5
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC2: leee 2092 F.... pulseaudio
 /dev/snd/controlC0: leee 2092 F.... pulseaudio
CurrentDesktop: Unity
Date: Sun Feb 12 03:09:30 2017
HibernationDevice: RESUME=UUID=54594d9c-c63b-4990-8336-2d68892236a7
InstallationDate: Installed on 2017-02-12 (0 days ago)
InstallationMedia: Ubuntu 16.04.1 LTS "Xenial Xerus" - Release amd64 (20160719)
MachineType: Dell Inc. XPS 15 9560
ProcFB: 0 inteldrmfb
ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-4.4.0-62-generic.efi.signed root=/dev/mapper/ubuntu--vg-root ro quiet splash vt.handoff=7
RelatedPackageVersions:
 linux-restricted-modules-4.4.0-62-generic N/A
 linux-backports-modules-4.4.0-62-generic N/A
 linux-firmware 1.157.8
SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 01/09/2017
dmi.bios.vendor: Dell Inc.
dmi.bios.version: 1.0.3
dmi.board.name: 05FFDN
dmi.board.vendor: Dell Inc.
dmi.board.version: A00
dmi.chassis.type: 10
dmi.chassis.vendor: Dell Inc.
dmi.modalias: dmi:bvnDellInc.:bvr1.0.3:bd01/09/2017:svnDellInc.:pnXPS159560:pvr:rvnDellInc.:rn05FFDN:rvrA00:cvnDellInc.:ct10:cvr:
dmi.product.name: XPS 15 9560
dmi.sys.vendor: Dell Inc.

Brad Figg (brad-figg) wrote : Status changed to Confirmed

This change was made by a bot.

Changed in linux (Ubuntu):
status: New → Confirmed
Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v4.10 kernel[0].

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.10-rc8

Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
tags: added: kernel-da-key
leee (leee) wrote :

The same xhci error occurs on 4.9 and 4.10. It makes the network adapter fail or USB devices drop out and need re-plugging, exactly as on 4.4.

Joseph Salisbury (jsalisbury) wrote :

Did this issue start happening after an update/upgrade? Was there a kernel version where you were not having this particular problem? This will help determine if the problem you are seeing is the result of a regression, and when this regression was introduced. If this is a regression, we can perform a kernel bisect to identify the commit that introduced the problem.

leee (leee) wrote :

Hi - sorry for the delay in response. I wanted to wait until the final 4.10 was released, and then had some prior obligations I had to tend to until now.

I and colleagues with the same machine (which rules out a hardware problem) experienced this problem out of the box with 16.04.1 (which provides 4.4).

While on 16.04.1, I tested with 4.4, 4.9 and 4.10 (rc8), and on 16.04.2, I tested with 4.8 and 4.10 (final).

Every kernel I tested hits the exact same xhci_hcd error about `TRB DMA ptr`.

Michael (0x32) wrote :

Hi,

I also have the same problem :-(

I am using a Dell XPS 15 (9560) with Linux Mint 18.1 and kernel 4.8.0-39-generic. I installed intel-microcode and the nvidia-375 drivers from https://launchpad.net/~graphics-drivers/+archive/ubuntu/ppa but am running on the Intel GPU (at least the Intel profile is selected in NVIDIA PRIME ;-) )

Ethernet works for a few minutes but then crashes with the same xhci_hcd messages. Whenever I try to download a large file (a 4 GB ISO), I have never managed to finish the download before running into the xhci_hcd error. It also seems that, at the same time, the USB devices connected via the dock freeze.

The XPS is updated to the newest BIOS version (I believe 1.0.3) and the newest Thunderbolt firmware.

If I can do anything to help you guys, please let me know :-)

leee (leee)
Changed in linux (Ubuntu):
status: Incomplete → Confirmed
leee (leee) wrote :

A colleague with the same laptop told me that Windows does not experience any wired dock network dropouts.

To rule out any hardware being possibly faulty - I ran continuous network file transfers on Windows 10 for 24 hours and did not see any diminished performance, let alone any outright network failure.

Installing Ubuntu 16.04.2 again (with 4.8) and testing that as well as with 4.10 makes the problem reproducible:

- xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
- xhci_hcd 0000:0e:00.0: Looking for event-dma [16-digit-hexadecimal] trb-start [16-digit-hexadecimal] trb-end [16-digit-hexadecimal] seg-start [16-digit-hexadecimal] seg-end [16-digit-hexadecimal]

This, along with other users running into the same issue, points to the issue being _clearly_ software related unless I've missed something in my diagnosis.

What other data can I provide? I wish I could be of more help, but am not competent enough to know where to even begin to write a patch.
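To compare kernels or docks on something more concrete than "it happened again", the ERROR lines quoted above can be counted per controller and endpoint. This is a rough sketch, assuming the message text stays as quoted in this thread; `count_trb_errors` is an illustrative name, and the sample lines are taken from kernel logs posted in this report.

```python
import re
from collections import Counter

# Matches the xhci_hcd error line quoted in this report and captures the
# PCI address, endpoint index and completion code.
ERROR_RE = re.compile(
    r"xhci_hcd (?P<pci>[0-9a-f:.]+): ERROR Transfer event TRB DMA ptr "
    r"not part of current TD ep_index (?P<ep>\d+) comp_code (?P<code>\d+)"
)


def count_trb_errors(lines):
    """Count error lines, keyed by (pci_address, ep_index, comp_code)."""
    counts = Counter()
    for line in lines:
        match = ERROR_RE.search(line)
        if match:
            key = (match["pci"], int(match["ep"]), int(match["code"]))
            counts[key] += 1
    return counts


sample_lines = [
    "[ 598.238303] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr "
    "not part of current TD ep_index 2 comp_code 13",
    "[ 598.238307] xhci_hcd 0000:0e:00.0: Looking for event-dma "
    "0000000855403010 trb-start 00000008550aafe0 trb-end 00000008550aafe0 "
    "seg-start 00000008550aa000 seg-end 00000008550aaff0",
    "[ 598.238634] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr "
    "not part of current TD ep_index 2 comp_code 13",
]

for key, count in count_trb_errors(sample_lines).items():
    print(key, count)
```

The same function could be fed the lines of a saved `dmesg` capture or of /var/log/kern.log, one run per kernel under test.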

leee (leee) wrote :

While installing Dell-provided software and drivers for Windows, I became armed with additional information about the dock - it is a Realtek NIC connected to an ASMedia host controller.

With that in mind, searching for "realtek ep_index 2 comp_code 13" turns up the following thread, which shows this problem is known elsewhere too (look on the page for the posts made by Ansis Atteka <aatteka at nicira com>):

https://groups.google.com/forum/#!msg/linux.kernel/vdR3nK8k8zM/Vhbp1CjtBgAJ

Kaz Wolfe (kazwolfe) wrote :

Just for the sake of completeness, yet another error log: http://pastebin.com/z8U9usDY

4.8.0-41-generic #44~16.04.1-Ubuntu, HWE because reasons. Kernel is tainted (NVIDIA, VirtualBox), but this issue seems to exist anyways.

imperia (imperia777) wrote :

I believe https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1663975 is the same bug and it should be merged.

leee (leee) wrote :

You've commented in this bug, 1663975. Was that your intention?

imperia (imperia777) wrote :

Sorry, I meant bug: 1667750

I wanted to say that I believe 1663975 and 1667750 are the same bug.

Anthony Cunningham (db2boy) wrote :

I too was having the issue with ethernet on Ubuntu 16.04 at 4.8.0-51.

I also have the DA200 adapter:-

http://www.dell.com/en-us/shop/dell-adapter-usb-c-to-hdmi-vga-ethernet-usb-3-0-da200/apd/470-abqn/pc-accessories

If I plug this into the ethernet port on the TB16 and use the ethernet port on the DA200 my ethernet hasn't dropped out in the last hour. I've tested with copying large files from my NAS, from the net and NAS and net concurrently.

Let me know if I can provide further details.

ruffsl (roxfoxpox) wrote :

Can confirm that this is still an issue with kernel 4.11.3 for both the Dell TB15 and TB16 docks.

$ uname -a
Linux ubuntu 4.11.3-041103-generic #201705251233 SMP Thu May 25 16:34:52 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

$ tail /var/log/kern.log
May 29 00:35:40 ubuntu kernel: [ 1413.483482] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
May 29 00:35:40 ubuntu kernel: [ 1413.483493] xhci_hcd 0000:0e:00.0: Looking for event-dma 000000085bbde010 trb-start 00000007eee37fe0 trb-end 00000007eee37fe0 seg-start 00000007eee37000 seg-end 00000007eee37ff0
May 29 00:35:40 ubuntu kernel: [ 1413.483702] xhci_hcd 0000:0e:00.0: ERROR Transfer event TRB DMA ptr not part of current TD ep_index 2 comp_code 13
...

I wonder if the Microsoft docks that were recently added also exhibit these same issues:
https://github.com/torvalds/linux/commits/master/drivers/net/usb/r8152.c

Mario Limonciello (superm1) wrote :

This is an issue with the host controller. The vendor (ASMedia) has submitted a patch here that fixes the issue:
http://www.spinics.net/lists/linux-usb/msg157958.html
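One way to check whether the controller named in the logs (0000:0e:00.0) is an ASMedia part is to read its PCI vendor ID from sysfs. The following is a minimal sketch, assuming the usual Linux sysfs layout under /sys/bus/pci/devices and that 0x1b21 is ASMedia's PCI vendor ID. The helper names are illustrative and do not come from the patch or the thread.

```python
from pathlib import Path

# Assumption: 0x1b21 is the PCI vendor ID registered to ASMedia Technology.
ASMEDIA_VENDOR_ID = 0x1B21


def pci_vendor_id(address, sysfs_root="/sys/bus/pci/devices"):
    """Read a PCI device's vendor ID from sysfs; None if the device is absent."""
    vendor_file = Path(sysfs_root) / address / "vendor"
    try:
        # The file holds a string such as "0x1b21\n".
        return int(vendor_file.read_text().strip(), 16)
    except FileNotFoundError:
        return None


def is_asmedia(address, sysfs_root="/sys/bus/pci/devices"):
    """True if the device at the given PCI address reports the ASMedia vendor ID."""
    return pci_vendor_id(address, sysfs_root) == ASMEDIA_VENDOR_ID


if __name__ == "__main__":
    # PCI address taken from the xhci_hcd messages in this report.
    print(is_asmedia("0000:0e:00.0"))
```

The PCI address differs between machines and may change when the dock is re-plugged, so it is worth taking it from the current xhci_hcd log lines rather than hard-coding it.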

zach (dunkirk) wrote :

I too was having the issue with Ethernet on Linux Mint 18.1 at 4.4.0-100.

I have the USB3 Ethernet adapter:

https://www.amazon.com/gp/product/B00BBD7NFU

with USB3 adapter:

https://www.amazon.com/gp/product/B01GGKYXVE

If I plug this into the thunderbolt port on the TB16 and then use the USB3 Ethernet adapter, my Ethernet hasn't dropped out in the past few days. I've tested with copying large files.

Let me know if I can provide further details.

Luca Osvaldo Mastromatteo (lukycrociato) wrote :

I'm having a similar problem with USB network adapters on my XPS too, but it is the 9550 version.
It is not the exact same error, though.
