maas cli: rack-controller list-boot-images Connection to the other side was lost in a non-clean fashion

Bug #1841807 reported by John George
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
MAAS
Fix Released
Medium
Blake Rouse

Bug Description

Twice this week Solutions QA test runs have failed when using the maas cli rack-controller list-boot-images

2019-08-27-21:32:49 root ERROR Command failed: rack-controller list-boot-images ba6gn7
2019-08-27-21:32:49 root ERROR b'Connection to the other side was lost in a non-clean fashion.'

subprocess.CalledProcessError: Command '['maas', 'root', 'rack-controller', 'list-boot-images', 'rskk8a']' returned non-zero exit status 2.

All the detail and logs are available at the links below. Follow the artifacts link at the bottom of the page and look in fce_build*_console.out to find the Traceback.
https://solutions.qa.canonical.com/#/qa/testRun/efb03811-dd55-4f6e-8da2-16ad7e323285
https://solutions.qa.canonical.com/#/qa/testRun/c0ad2dec-3711-4eb2-855b-f6a3d8fd126a

Related branches

Changed in maas:
importance: Undecided → Medium
Changed in maas:
status: New → Triaged
Revision history for this message
Blake Rouse (blake-rouse) wrote :
Download full text (4.0 KiB)

This seems to be more of a network issue in your lab, digging through the logs I was able to determine this:

2019-08-27 21:33:02 provisioningserver.rpc.clusterservice: [info] Failure on ping dropping connection to event-loop: leafeon:pid=9545
2019-08-27 21:33:02 ClusterClient,client: [info] ClusterClient connection lost (HOST:IPv6Address(TCP, '::ffff:192.168.33.23', 53010) PEER:IPv6Address(TCP, '::ffff:192.168.33.22', 5251))
2019-08-27 21:33:02 provisioningserver.rpc.clusterservice: [info] Failure on ping dropping connection to event-loop: swoobat:pid=9969
2019-08-27 21:33:02 provisioningserver.rpc.clusterservice: [info] Failure on ping dropping connection to event-loop: swoobat:pid=9961
2019-08-27 21:33:02 ClusterClient,client: [info] ClusterClient connection lost (HOST:IPv6Address(TCP, '::ffff:192.168.33.23', 54470) PEER:IPv6Address(TCP, '::ffff:192.168.33.24', 5252))
2019-08-27 21:33:02 ClusterClient,client: [info] ClusterClient connection lost (HOST:IPv6Address(TCP, '::ffff:192.168.33.23', 52882) PEER:IPv6Address(TCP, '::ffff:192.168.33.24', 5251))
2019-08-27 21:33:03 provisioningserver.rpc.clusterservice: [info] Making connections to event-loops: leafeon:pid=9545, swoobat:pid=9961, swoobat:pid=9969
2019-08-27 21:33:05 sstreams: [info] maas:v2:download/maas:boot:ubuntu:amd64:hwe-16.04-edge:xenial: to_add=['20190822'] to_remove=[]
2019-08-27 21:33:09 sstreams: [info] maas:v2:download/maas:boot:ubuntu:amd64:hwe-16.04-lowlatency-edge:xenial: to_add=['20190822'] to_remove=[]
2019-08-27 21:33:13 sstreams: [info] maas:v2:download/maas:boot:ubuntu:amd64:hwe-16.04-lowlatency:xenial: to_add=['20190822'] to_remove=[]
2019-08-27 21:33:18 sstreams: [info] maas:v2:download/maas:boot:ubuntu:amd64:hwe-16.04:xenial: to_add=['20190822'] to_remove=[]
2019-08-27 21:33:18 Uninitialized: [info] ClusterClient connection established (HOST:IPv6Address(TCP, '::ffff:192.168.33.23', 53780) PEER:IPv6Address(TCP, '::ffff:192.168.33.24', 5251))
2019-08-27 21:33:18 Uninitialized: [info] ClusterClient connection established (HOST:IPv6Address(TCP, '::ffff:192.168.33.23', 53900) PEER:IPv6Address(TCP, '::ffff:192.168.33.22', 5251))
2019-08-27 21:33:18 Uninitialized: [info] ClusterClient connection established (HOST:IPv6Address(TCP, '::ffff:192.168.33.23', 55368) PEER:IPv6Address(TCP, '::ffff:192.168.33.24', 5252))
2019-08-27 21:33:18 provisioningserver.rpc.clusterservice: [info] Event-loop 'leafeon:pid=9545' authenticated.
2019-08-27 21:33:18 provisioningserver.rpc.clusterservice: [info] Event-loop 'swoobat:pid=9961' authenticated.
2019-08-27 21:33:18 provisioningserver.rpc.clusterservice: [info] Event-loop 'swoobat:pid=9969' authenticated.
2019-08-27 21:33:18 provisioningserver.rpc.clusterservice: [info] Rack controller 'ba6gn7' registered (via leafeon:pid=9545) with MAAS version 2.6.0-7802-g59416a869-0ubuntu1~18.04.1.
2019-08-27 21:33:19 provisioningserver.rpc.clusterservice: [info] Rack controller 'ba6gn7' registered (via swoobat:pid=9969) with MAAS version 2.6.0-7802-g59416a869-0ubuntu1~18.04.1.
2019-08-27 21:33:19 provisioningserver.rpc.clusterservice: [info] Rack controller 'ba6gn7' registered (via swoobat:pid=9961) with MAAS version 2.6.0-7802-g59416a869-0ubuntu1~1...

Read more...

Changed in maas:
assignee: nobody → Blake Rouse (blake-rouse)
milestone: none → 2.7.0alpha1
Changed in maas:
status: Triaged → Fix Committed
Revision history for this message
Jason Hobbs (jason-hobbs) wrote :

Can you please clarify your comment? Is this a network issue on our side or a bug in maas? I see there is a commit landed for it.

We're seeing this quite a bit on 2.6. This should be considered for backport to 2.6.

Changed in maas:
status: Fix Committed → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.