Comment 36 for bug 2012596

Alan Baghumian (alanbach) wrote :

Hi Igor!

I believe this behavior requires a specific series of events to happen.

I have a 4-node cluster (2 Region + 2 Rack) and this happened to me last week. The primary Region controller's memory usage went out of control, causing an OOM crash.

I'm still not quite sure what led to it, but it happened during a custom image sync process.

This cluster has 4GB RAM on Region nodes and 2GB RAM on Rack nodes. It has been set up like this since MAAS 3.1, but this is the first time I have seen this memory leak issue.

Just to add to the context, I do have a 3-node HAProxy setup (pacemaker/corosync) in front of MAAS that also does the SSL termination, and all Rack and Region controllers have been configured in rackd.conf and regiond.conf to point to HAProxy's FQDN.

I'll poke around and see if I can reproduce this.

Best,
Alan