4.3.3-300 DMAR: DMAR:[DMA Write] Request device [8a:06.1] fault addr fc26e000

Nate Pearlstein npearl at sgi.com
Thu Jan 21 18:21:01 UTC 2016


I am seeing errors after upgrading from 4.2.8-300 to 4.3.3-300. I also see the same errors on newer f23 kernels from koji and on
4.5.0-0.rc0.git6.1.vanilla.knurd.1.


dmesg | egrep -i 'mlx|dmar'

[   17.816756] mlx4_core 0000:82:00.0: Mapped 1 chunks/256 KB at 120040000 for ICM
[   17.825330] mlx4_core 0000:8a:00.0: SRIOV, disabling HA mode for intf proto 0
[   17.825541] <mlx4_ib> mlx4_ib_add: counter index 0 for port 1 allocated 0
[   17.833869] <mlx4_ib> mlx4_ib_add: counter index 1 for port 2 allocated 0
[   17.906397] mlx4_core 0000:8a:00.0: Mapped 1 chunks/256 KB at 120040000 for ICM
[   17.911403] mlx4_core 0000:8a:00.0: mlx4_ib: multi-function enabled
[   17.925065] mlx4_core 0000:8a:00.0: mlx4_ib: initializing demux service for 128 qp1 clients
[   17.937459] mlx4_core 0000:8a:00.0: Mapped 1 chunks/256 KB at 128040000 for ICM
[   17.938766] mlx4_core 0000:8a:00.0: Mapped 1 chunks/256 KB at 1200c0000 for ICM
[   29.527780] mlx4_core 0000:8a:00.0: Mapped 1 chunks/256 KB at 128080000 for ICM
[   29.529083] mlx4_core 0000:8a:00.0: Mapped 1 chunks/256 KB at 120140000 for ICM
[   31.330799] DMAR: DRHD: handling fault status reg 2
[   31.330803] DMAR: DMAR:[DMA Write] Request device [8a:06.1] fault addr fc26e000
              DMAR:[fault reason 02] Present bit in context entry is clear
[   31.330865] DMAR: DRHD: handling fault status reg 102
[   31.330868] DMAR: DMAR:[DMA Read] Request device [8a:06.1] fault addr fc632000
              DMAR:[fault reason 02] Present bit in context entry is clear
[   31.530006] DMAR: DRHD: handling fault status reg 202
.
.
.
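
For anyone comparing similar output across kernels, here is a minimal sketch that tallies the DMAR fault lines per requesting device and DMA direction. The function name `summarize_dmar_faults` and the regular expression are illustrative assumptions based only on the two fault lines shown above; other kernel versions may format these messages differently.

```python
import re
from collections import Counter

# Pattern derived from the fault lines quoted in this report, e.g.
# "DMAR:[DMA Write] Request device [8a:06.1] fault addr fc26e000".
# This is an assumption about the format, not a kernel-defined grammar.
FAULT_RE = re.compile(
    r"DMAR:\[DMA (?P<op>Read|Write)\] "
    r"Request device \[(?P<dev>[0-9a-f]{2}:[0-9a-f]{2}\.[0-7])\] "
    r"fault addr (?P<addr>[0-9a-f]+)"
)

def summarize_dmar_faults(dmesg_text):
    """Count DMAR faults per (device, operation) pair (hypothetical helper)."""
    counts = Counter()
    for line in dmesg_text.splitlines():
        match = FAULT_RE.search(line)
        if match:
            counts[(match.group("dev"), match.group("op"))] += 1
    return counts

# Two fault lines copied from the log above.
sample = (
    "[   31.330803] DMAR: DMAR:[DMA Write] Request device [8a:06.1] fault addr fc26e000\n"
    "[   31.330868] DMAR: DMAR:[DMA Read] Request device [8a:06.1] fault addr fc632000\n"
)

for (dev, op), n in sorted(summarize_dmar_faults(sample).items()):
    print(dev, op, n)
```

Feeding it the full `dmesg` text instead of `sample` would show whether every fault comes from the same device.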

All previous f22 and f23 releases I’ve used were fine.

I have two IB cards, both with firmware version 2.9.1000:

82:00.0 InfiniBand: Mellanox Technologies MT26428 [ConnectX VPI PCIe 2.0 5GT/s - IB QDR / 10GigE] (rev b0)
8a:00.0 InfiniBand: Mellanox Technologies MT26428 [ConnectX VPI PCIe 2.0 5GT/s - IB QDR / 10GigE] (rev b0)

The first one has SR-IOV off; the second has SR-IOV on.

