Comment 23 for bug 1469214

dann frazier (dannf) wrote : Re: [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault

On Tue, Jul 7, 2015 at 2:25 AM, Ming Lei <email address hidden> wrote:
> On Tue, Jul 7, 2015 at 11:16 AM, Ming Lei <email address hidden> wrote:
>> Looks like there are two kinds of translation fault from irqbalance:
>>
>> 1) happened in place_irq_in_node(), which can be reproduced with the vivid package
>>
>> 2) the 2nd one happened in glib2, which is built by myself, because
>> irqbalance can choose to use its own local glib if there isn't glib2 available,
>> and the glib2 does exist in my server in which I build irqbalance.
>
>
> Both of the above reports can be fixed by the following irqbalance commit:
>
> NUMA is not available fix
>
> https://github.com/Irqbalance/irqbalance/commit/a3c812eb6cd627cd3fae45b8345538558b86973c
>
> Looks like stress-ng can find not only kernel bugs, but also userspace
> issues, :-)

I was looking to upload a fix for wily, but I haven't been able to
reproduce it in order to verify the fix. I ran 'stress-ng --seq 0
-t 60 --syslog --metrics --times -v' overnight in a loop, but
irqbalance never crashed. How long should I expect this to take on
average? Does it usually crash in a single run?
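
For reference, a minimal sketch of how such a loop could count runs until
irqbalance disappears. The stress-ng command line is the one quoted above;
the helper names (irqbalance_alive, run_stress, runs_until_crash) are
hypothetical, and checking with 'pgrep -x irqbalance' is an assumption
about how to detect the crash. If an init system restarts irqbalance
automatically, this check would miss the crash and the logs would need to
be inspected instead.

```python
import subprocess
from typing import Callable, Optional

# Command line taken from the comment above.
STRESS_CMD = ["stress-ng", "--seq", "0", "-t", "60",
              "--syslog", "--metrics", "--times", "-v"]


def irqbalance_alive() -> bool:
    """Return True if a process named exactly 'irqbalance' is running."""
    # Assumption: pgrep is available; it exits 0 when at least one process matches.
    result = subprocess.run(["pgrep", "-x", "irqbalance"],
                            stdout=subprocess.DEVNULL)
    return result.returncode == 0


def run_stress() -> None:
    """Run one stress-ng pass; its exit status is deliberately ignored."""
    subprocess.run(STRESS_CMD, check=False)


def runs_until_crash(max_runs: int,
                     stress: Callable[[], None] = run_stress,
                     alive: Callable[[], bool] = irqbalance_alive) -> Optional[int]:
    """Repeat the stress pass up to max_runs times.

    Returns the 1-based number of the run after which irqbalance was no
    longer running, or None if it survived every run.
    """
    for n in range(1, max_runs + 1):
        stress()
        if not alive():
            return n
    return None
```

Calling runs_until_crash(1000) as root on the affected machine would report
after which run, if any, irqbalance went away. The stress and alive
parameters exist only so the loop can be tried out with stand-ins.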