Comment 10 for bug 1857074

Revision history for this message
Juerg Haefliger (juergh) wrote : Re: [Bug 1857074] Re: Cavium ThunderX CN88XX Panic : Unknown reason

On Mon, 13 Jan 2020 23:51:13 -0000
dann frazier <email address hidden> wrote:

> v4.14.151 upstream fails as well - but with a different symptom (see
> below). v4.14.150 seems fine, so I'll try and bisect between the two. Of
> course, that's really just a shot in the dark, as we know this issue is
> finicky..

The cause of that is:
cce360b54ce6 arm64: capabilities: Filter the entries based on a given mask

and the fixup for the above is:
d4af3c4b81f4 arm64: cpufeature: Enable Qualcomm Falkor/Kryo errata 1003

We don't have the second commit but it also doesn't apply since things
in cap handling changed between the two and we don't have those changes. It
seems cap handling is broken and we're seeing the problem that's mentioned in
d4af3c4b81f4.

> [ 34.896151] Unable to handle kernel paging request at virtual address 748214240b9fa200
> [ 34.908263] Mem abort info:
> [ 34.915217] Exception class = IABT (current EL), IL = 32 bits
> [ 34.925365] SET = 0, FnV = 0
> [ 34.932606] EA = 0, S1PTW = 0
> [ 34.939877] [748214240b9fa200] address between user and kernel address ranges
> [ 34.951247] Internal error: Oops: 86000004 [#1] SMP
> [ 34.960370] Modules linked in: nls_iso8859_1 sch_fq_codel thunderx_edac thunderx_zip cavium_rng_vf cavium_rng gpio_keys shpchp uio_pdrv_genirq ipmi_ssif ipmi_devintf uio ipmi_msghandler ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear nicvf nicpf ast i2c_algo_bit ttm drm_kms_helper aes_ce_blk syscopyarea sysfillrect aes_ce_cipher sysimgblt thunder_bgx crc32_ce fb_sys_fops crct10dif_ce ghash_ce drm sha2_ce sha256_arm64 sha1_ce ahci libahci i2c_thunderx thunder_xcv i2c_smbus mdio_thunder thunderx_mmc mdio_cavium aes_neon_bs aes_neon_blk crypto_simd cryptd aes_arm64
> [ 35.060004] Process apparmor_parser (pid: 1188, stack limit = 0xffff00001b948000)
> [ 35.071987] CPU: 16 PID: 1188 Comm: apparmor_parser Not tainted 4.14.151 #1
> [ 35.083436] Hardware name: Cavium ThunderX CRB/To be filled by O.E.M., BIOS 5.11 12/12/2012
> [ 35.096272] task: ffff801f68160f00 task.stack: ffff00001b948000
> [ 35.106632] PC is at 0x748214240b9fa200
> [ 35.114857] LR is at 0x748214240b9fa200
> [ 35.122977] pc : [<748214240b9fa200>] lr : [<748214240b9fa200>] pstate: 20400145
> [ 35.134654] sp : ffff00001b94bca0
> [ 35.142170] x29: ffff801f713f2020 x28: 0000000000000040
> [ 35.151657] x27: 0000000000000000 x26: ffff00001b94be00
> [ 35.161073] x25: ffff801f713f2020 x24: ffff00001b94bdd8
> [ 35.170410] x23: ffff801f713f1ea8 x22: ffff801f71bd8000
> [ 35.179678] x21: 0000000000000000 x20: ffff0000082352bc
> [ 35.185726] audit: type=1400 audit(1578951898.572:9): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/lib/snapd/snap-confine" pid=1185 comm="apparmor_parser"
> [ 35.188933] x19: ffff00001b94bcb0 x18: 0000ffff8916aa70
> [ 35.188938] x17: 0000000000000000 x16: 0000000000000000
> [ 35.188941] x15: 0000000000000000 x14: 0000000000000000
> [ 35.213254] audit: type=1400 audit(1578951898.572:10): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/lib/snapd/snap-confine//mount-namespace-capture-helper" pid=1185 comm="apparmor_parser"
> [ 35.222441] x13: 0000000000000000 x12: 0000000000000000
> [ 35.222445] x11: 0000000000000000 x10: 0000000000000001
> [ 35.222448] x9 : 0000000000000228 x8 : 0000000000000e4a
> [ 35.281468] audit: type=1400 audit(1578951898.668:11): apparmor="STATUS" operation="profile_load" profile="unconfined" name="lxc-container-default" pid=1199 comm="apparmor_parser"
> [ 35.285416] x7 : 0000000000000000 x6 : 000000000000000f
> [ 35.285420] x5 : 00000000000010c0 x4 : 0000801f72a36000
> [ 35.285424] x3 : ffff00001b94bbe0 x2 : 745014240b9fa200
> [ 35.343627] x1 : 0000000000000000 x0 : 00000000000067a1
> [ 35.352234] Call trace:
> [ 35.357875] Exception stack(0xffff00001b94bb60 to 0xffff00001b94bca0)
> [ 35.367508] bb60: 00000000000067a1 0000000000000000 745014240b9fa200 ffff00001b94bbe0
> [ 35.378536] bb80: 0000801f72a36000 00000000000010c0 000000000000000f 0000000000000000
> [ 35.389484] bba0: 0000000000000e4a 0000000000000228 0000000000000001 0000000000000000
> [ 35.400385] bbc0: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
> [ 35.411233] bbe0: 0000000000000000 0000000000000000 0000ffff8916aa70 ffff00001b94bcb0
> [ 35.422060] bc00: ffff0000082352bc 0000000000000000 ffff801f71bd8000 ffff801f713f1ea8
> [ 35.433212] bc20: ffff00001b94bdd8 ffff801f713f2020 ffff00001b94be00 0000000000000000
> [ 35.444158] bc40: 0000000000000040 ffff801f713f2020 748214240b9fa200 ffff00001b94bca0
> [ 35.455559] bc60: 748214240b9fa200 0000000020400145 ffff000008b02000 ffff801f68160f00
> [ 35.466803] bc80: 0000ffffffffffff 00000e4a1b94bdd8 ffff801f713f2020 748214240b9fa200
> [ 35.477595] [<748214240b9fa200>] 0x748214240b9fa200
> [ 35.485363] Code: bad PC value
> [ 35.491214] ---[ end trace e1671159f66f9f11 ]---
>
> ** Attachment added: "4.14.151.dmesg"
> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857074/+attachment/5319928/+files/4.14.151.dmesg
>