Comment 13 for bug 1823836

Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2019-04-12 13:01 EDT-------
(In reply to comment #22)
> Ok, once my setup was fixed it worked fine.
> By any chance are your FW levels up to date?
> For your card that should be:
> ConnectX-5: 16.21.1000 and above.
> ConnectX-5 Ex: 16.21.1000 and above.
>
> You can run ibv_devinfo to get what you currently have.
> $ sudo ibv_devinfo
>
> If it is not your FW, we are back to my suggestion to file this with
> the infiniband/dpdk/mellanox folks to get their experience as well.
>

Our firmware level looks ok, higher than you suggested.

ConnectX-5 Firmware version:
# mstflint -d 0000:01:00.0 q
Image type: FS4
FW Version: 16.23.1020 <<<<<<<<<
FW Release Date: 10.7.2018
Product Version: 16.23.1020
Description: UID GuidsNumber
Base GUID: ec0d9a0300cab17c 4
Base MAC: ec0d9acab17c 4
Image VSD: N/A
Device VSD: N/A
PSID: IBM0000000020
Security Attributes: N/A

I have chased the -95 into the ib_uverbs modules, the error is returned from uverbs_request_finish(), called from ib_uverbs_ex_create_rwq_ind_table().

If I understand, the purpose of uverbs_request_finish() is to verify that the unused portion of some part of buffer passed in from userspace is filled with NULLs. The error indicates its not. If I remove the call to uverbs_request_finish() testpmd starts up fine, but I cant say its functioning beyond that.

I am thinking some incompatibility between libibverbs and the ib_uverbs. There have been recent changes with this kernel interface that may be new for the 5.0 kernel. I am trying to correlate the user space structures with the kernel, but I am struggling at the moment.

Thank you for for testing on x86, I agree its time for me to reach out to Mellanox.