Merge into bzr.dev : cleanup_testcases_by_collection

Reviewer	Date Requested	Status
Andrew Bennetts		Needs Fixing on 2010-11-25
Vincent Ladeuil	2010-10-20	Needs Fixing on 2010-11-11
Review via email: mp+38999@code.launchpad.net

Revision history for this message

Robert Collins (lifeless) wrote on 2010-08-15: Posted in a previous version of this proposal

#

So, I think this is more correct; fixing cycles is pretty important
for us in other ways : I like that we're finding latent issues here.

I'm not worried about the references to test cases in errors/failures
lists : they are a benefit (and the __dict__ stripping was a _REAL_
pain there in the past).

-Rob

Revision history for this message

Martin Packman (gz) wrote on 2010-08-15: Posted in a previous version of this proposal

#

> I'm not worried about the references to test cases in errors/failures
> lists : they are a benefit (and the __dict__ stripping was a _REAL_
> pain there in the past).

Indeed that mostly won't be an issue (and can sometimes be useful), my one worry is the occasional breaks-everything rev will then go back to swapping my machine to death. That's why the current implementation still wipes all uncollected tests, but I'm okay with the idea of removing that once everything else is sorted.

A few things I forgot to mention in the initial review comment.

The lock decorators now use this horribly obnoxious pattern:

    self.lock_read()
    try:
        result = unbound(self, *args, **kwargs)
    except:
        import sys
        exc_info = sys.exc_info()
        try:
            self.unlock()
        finally:
            try:
                raise exc_info[0], exc_info[1], exc_info[2]
            finally:
                del exc_info
    else:
        self.unlock()
        return result

If anyone can think of a sane way of spelling that, please say.

Cycles needing to be avoided will have to be documented if we go with this, and can be annoying to understand without decent knowledge of python internals. I've been using a rough reference tracking function to fix things, but it's not worth landing. Is there any package that can help contributors diagnose things like this?

Would there be any benefit in running all tests with gc disabled by default, and only enabling it for those that explicitly require it?

Is there a better way of telling if the test you just ran failed other than comparing failure counts on the result object?

Revision history for this message

Robert Collins (lifeless) wrote on 2010-08-15: Posted in a previous version of this proposal

#

> Is there a better way of telling if the test you just ran failed other than comparing failure counts on the result object?

Where do you need to know this?

Revision history for this message

Martin Packman (gz) wrote on 2010-08-15: Posted in a previous version of this proposal

#

> > Is there a better way of telling if the test you just ran failed other than
> comparing failure counts on the result object?
>
> Where do you need to know this?

It has to be outside all the normal methods unfortunately. Even stopTest is too early, as that's passed a reference to the test case, so clearly the caller is going to be keeping it alive. Instead I added the collection accounting in TestSuite.run which was already popping testcases from its internal list, now it uses weakref to check that they really expire after being run.

I've put up <lp:~gz/testtools/avoid_exc_info_cycles> with the testtools changes I needed.

Revision history for this message

Vincent Ladeuil (vila) wrote on 2010-08-19: Posted in a previous version of this proposal

#

Hmm, I appreciate the effort to reduce the memory footprint, but unfortunately this seems far from landable :-/

BZR_PLUGIN_PATH=-site ./bzr selftest --parallel=fork
bzr selftest: /home/vila/src/bzr/reviews/cleanup_testcases_by_collection_613247/bzr
/home/vila/src/bzr/reviews/cleanup_testcases_by_collection_613247/bzrlib
bzr-2.2b3 python-2.6.5 Linux-2.6.32-24-generic-x86_64-with-Ubuntu-10.04-lucid

----------------------------------------------------------------------
Ran 8 tests in 0.489s

OK
2 tests skipped

Huh ? Where are my tests gone ? (Strong suspicion around your modification of __iter__)

Ran 24856 tests in 1050.614s

FAILED (failures=2, errors=2, known_failure_count=48)

Including failures for bb.test_log so your fix is wrong there.

On the overall, with a superficial reading, the way you avoid ref cycles seem hard to understand
so very likely to be re-introduced. How about breaking the ref-cycles with addCleanup calls instead
(by deleting the offending attributes) ?

Now, a way to track the uncollectable test case sounds like a good way to progress but given the amount of work, could you instead devise a patch where the check depends on some option ?
Once we reach 0 we can toggle the default value for this option.

For the run above, I also get ~4000 Uncollected test cases, far too much to land this patch :-/

review: Needs Fixing

Revision history for this message

Martin Packman (gz) wrote on 2010-08-19: Posted in a previous version of this proposal

#

Thanks for looking at this Vincent.

> Huh ? Where are my tests gone ? (Strong suspicion around your modification of
> __iter__)

I suspect this is related to --parallel which I thought might be affected but didn't have any easy way of testing on my machine, I'll take another look at that code.

> Including failures for bb.test_log so your fix is wrong there.

Gah, how did I miss those...

> On the overall, with a superficial reading, the way you avoid ref cycles seem
> hard to understand
> so very likely to be re-introduced. How about breaking the ref-cycles with
> addCleanup calls instead
> (by deleting the offending attributes) ?

Yes, that it's hard to understand is a big concern. At least this patch, with retroactively changing a bunch of tests in subtle ways, is the worst part of it. It's much easier to adjust the test you're writing right now because you get told there's a cycle. So, moving in the direction of actually causing a failure once all existing issues are fixed would I think prevent regressions without being too taxing on contributors.

Note, that sometimes cycles do have to be avoided, because it's easier than trying to break them later, particularly with exc_info and closures where it's internal objects pointing at each other rather than public TestCase attributes.

> Now, a way to track the uncollectable test case sounds like a good way to
> progress but given the amount of work, could you instead devise a patch where
> the check depends on some option ?
> Once we reach 0 we can toggle the default value for this option.

Well, the "print junk to stdout" is my less intrusive option, but the aim is to get it to 0 before landing. However...

> For the run above, I also get ~4000 Uncollected test cases, far too much to
> land this patch :-/

This could well be just needing the testtools exc_info cycle cleanup as well, branch for that linked in previous comment.

...getting 4000 complaints from not having the testtools change probably does mean this needs to land quieter, making everyone patch testtools or be spammed isn't polite.