Comment 7 for bug 1743249

Revision history for this message
Jason Hobbs (jason-hobbs) wrote : Re: [Bug 1743249] Re: Failed Deployment after timeout trying to retrieve grub cfg

On Wed, Jan 17, 2018 at 11:11 AM, Andres Rodriguez
<email address hidden> wrote:
> Hi Jason,
>
> Can you confirm:
>
> 1. I'm guessing you are running MAAS in HA. Can you remind us of your HA configuration (3 regions, 2 racks? The racks connect to the regions via the VIP ?).

We have 3 region controllers and 2 active rack controllers (rack
controller packages installed on all 3 but only configured to be
active on 2).

> 2. From what rack controller did it boot?

It looks like from 10.244.40.32, but I expect you're more skilled at
interpreting the logs than I am.

> 3. Was the chosen region with high load?

We don't have load monitoring software installed on the MAAS servers.
The MAAS servers are 10 core 128GB of RAM gen9 HPE servers. They're
hosting MAAS, postgresql, and some VMs.

> 4. Could this be another HAProxy issue?

I don't know. It's timing out retrieving the grub cfg from the rack
controller - where would HAproxy come into play there? I don't know
enough about MAAS internals to answer. I know rack has to get some
info from the region controller for this, but I would expect this goes
through RPC, which does not go through HAproxy as far as I know.