Merge lp:~jameinel/bzr/1.19-known-graph-sorted into lp:~bzr/bzr/trunk-old

Proposed by John A Meinel
Status: Merged
Merged at revision: not available
Proposed branch: lp:~jameinel/bzr/1.19-known-graph-sorted
Merge into: lp:~bzr/bzr/trunk-old
Diff against target: 1973 lines
To merge this branch: bzr merge lp:~jameinel/bzr/1.19-known-graph-sorted
Reviewers:
  Gary van der Merwe: Abstain
  Vincent Ladeuil: Approve
Review via email: mp+10293@code.launchpad.net
Revision history for this message
John A Meinel (jameinel) wrote :

This change finally brings the new ancestry extraction code up all the way so that 'bzr log' gets to use it.

It adds a member VersionedFiles.get_known_graph_ancestry(keys) which returns a KnownGraph instance.

It also implements 'merge_sort' in pyrex code on the KnownGraph object. At this point, KnownGraph.merge_sort() is quite fast, taking about 40ms to merge_sort all of bzr.dev and another 50ms to build up the KnownGraph object.

Combined with the improved extraction of ancestry, this brings "bzr log -n0 -r-10..-1" on bzr.dev from 2.5s down to about 1.0s for me.

As near as I can tell, the big win for doing merge_sort on the KnownGraph object is that you avoid a lot of dict lookup calls. 40ms for 25.6k keys is about 1.5us per key, and doing a dict lookup to get the parents costs 10ms overall (about 0.4us per key). It also brings the time for 'merge_sort' on the OOo tree down to 1.0s.

It also adds KnownGraph.topo_sort(), which turns out to only take around 10ms for all of bzr.dev (on top of the 50ms to build the KnownGraph data structure).

Because I already have an object model internally, I went ahead and exposed KnownGraph.merge_sort() as returning objects, rather than tuples. I think the api is going to be a lot easier to use, and none of my timings so far show an advantage to the tuple version. (It could be because the objects are compiled making getattr() faster...)
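To make the api change concrete, here is a minimal stand-alone sketch (this is not bzrlib's actual class, just an illustration of the tuple-vs-object difference described above; the `MergeSortNode` name and sample values are made up):

```python
class MergeSortNode(object):
    """Stand-in for bzrlib's _MergeSortNode: one entry of merge_sort()."""

    __slots__ = ('key', 'merge_depth', 'revno', 'end_of_merge')

    def __init__(self, key, merge_depth, revno, end_of_merge):
        self.key = key
        self.merge_depth = merge_depth
        self.revno = revno
        self.end_of_merge = end_of_merge


# Old api: a tuple the caller has to unpack positionally, including a
# sequence_number it usually just throws away.
seq_no, key, depth, revno, eom = (4, 'rev-2', 0, (2,), False)

# New api: named attributes at the call site, no sequence_number at all.
node = MergeSortNode('rev-2', 0, (2,), False)
```

With `__slots__` the objects stay compact, and attribute access reads better than remembering tuple positions at every call site.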

This changes tsort.topo_sort() to just be a thunk over to KnownGraph(parent_map).topo_sort(), as it is still faster than the fastest python implementation. (Though the pure python form is actually slower because of the overhead of building the KG object. I'm not worried, as it is still faster than our existing topo_sort implementation.)
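The counting scheme behind the KnownGraph-style topo_sort can be sketched in plain Python over a parent_map dict (an illustrative reimplementation, not bzrlib's code; names here are local to the example):

```python
def topo_sort(parent_map):
    """Topological sort of {key: (parent keys,)}: parents before children.

    Mirrors the described approach: start from the tails (nodes with no
    in-graph parents) and emit a child once all its parents are emitted.
    """
    children = {}     # key -> list of child keys
    num_parents = {}  # key -> count of parents that exist in the graph
    for key, parents in parent_map.items():
        num_parents.setdefault(key, 0)
        for parent in parents:
            if parent not in parent_map:
                continue  # ghost parent, not part of the graph
            num_parents[key] += 1
            children.setdefault(parent, []).append(key)
    # The tails seed the pending queue
    pending = [key for key, count in num_parents.items() if count == 0]
    order = []
    while pending:
        key = pending.pop()
        order.append(key)
        for child in children.get(key, []):
            num_parents[child] -= 1
            if num_parents[child] == 0:
                # All parents emitted, the child may now be emitted
                pending.append(child)
    if len(order) != len(parent_map):
        raise ValueError('graph contains a cycle: %r' % (parent_map,))
    return order


order = topo_sort({'A': (), 'B': ('A',), 'C': ('A',), 'D': ('B', 'C')})
```

Every node and edge is touched a constant number of times, which is why the compiled version of this loop ends up dominated by plain dict lookups.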

I was thinking of doing the same thing for 'merge_sort', but I took this opportunity to break the api: getting rid of "sequence_number", using an object model, ignoring 'mainline_revisions', etc. Note that only semi-deprecated code uses mainline_revisions anyway, and it really shouldn't anymore. (It dates from when we allowed Branch.revision_history() != lefthand_history.)

I also plan on updating KnownGraph with an 'add_node()' function, so that its implementation of 'merge_sort' can be used for annotate (which sometimes annotates the working tree). It shouldn't be hard to do.

Robert Collins (lifeless) wrote :

What impact will this have on things like my all-ubuntu repository (16K
unrelated branches in one repo) ?

-Rob

Vincent Ladeuil (vila) wrote :

>>>>> "robert" == Robert Collins writes:

    > What impact will this have on things like my
    > all-ubuntu repository (16K unrelated branches in one
    > repo) ?

You just tell us! :-)

But from the discussion with John, it should either improve
things (I'm 90% confident here)... or provide us with very nice
data! I, for one, would love to have such a repo around to play
with...

@John, how did you measure your progress ? Still using
time_graph.py ? Is it time to enhance it ?

John A Meinel (jameinel) wrote :


Vincent Ladeuil wrote:
>>>>>> "robert" == Robert Collins writes:
>
> > What impact will this have on things like my
> > all-ubuntu repository (16K unrelated branches in one
> > repo) ?

It should perform approximately the same or better.

The 'find_ancestors()' code doesn't grab arbitrary node => parent
mappings, only ones that are in the ancestry of the keys that were
requested.

As such it is the same as repeated get_parent_map() calls, just without
the repeats.

The merge_sort() code is simply the same algorithm, just 3-7x faster
(depending on whether you count the time to build the KnownGraph).

Again, all of this code has the same "look at only the ancestry
requested" that the current code has. So there shouldn't be a blowout
from having lots of unrelated history. It just may not be a whole lot
faster because the other history is 'in the way'.
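The "look at only the ancestry requested" behaviour can be sketched as a walk that never asks about keys outside the requested tips' ancestry (a hypothetical `get_parent_map` callable stands in for repeated Repository.get_parent_map calls; this is an illustration, not bzrlib's find_ancestry):

```python
def find_ancestry(get_parent_map, keys):
    """Collect the full ancestry of `keys`, and any keys found missing.

    Only keys reachable from `keys` are ever passed to get_parent_map,
    so unrelated branches in the same repository add no work.
    """
    parent_map = {}
    missing = set()
    pending = set(keys)
    while pending:
        found = get_parent_map(pending)
        missing.update(pending - set(found))
        parent_map.update(found)
        # Walk only to parents we have not looked up yet
        next_pending = set()
        for parents in found.values():
            next_pending.update(parents)
        next_pending.difference_update(parent_map)
        next_pending.difference_update(missing)
        pending = next_pending
    return parent_map, missing


# Two unrelated ancestries in one "repository"; we only ask about one.
graph = {'T': ('B',), 'B': ('A',), 'A': (), 'X': ('W',), 'W': ()}
calls = []

def get_parent_map(keys):
    calls.append(set(keys))
    return dict((k, graph[k]) for k in keys if k in graph)

ancestry, missing = find_ancestry(get_parent_map, ['T'])
```

The unrelated 'X'/'W' branch never appears in any request, which is the reason lots of unrelated history costs nothing beyond sharing the indices.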

>
> You just tell us! :-)
>
> But from the discussion with John, it should either improve
> things (I'm 90% confident here)... or provide us with very nice
> data ! I, for one, will love to have such a repo around to play
> with...
>
> @John, how did you measure your progress ? Still using
> time_graph.py ? Is it time to enhance it ?

No. I did have another helper here, but mostly this is tested with:

$ PYTHONPATH=../bzr/work TIMEIT -s "from bzrlib import branch,
repository, tsort, graph
b = branch.Branch.open('bzr-2a-extra/bzr.dev')
b.lock_read()
l_rev = b.last_revision()
p_map, missing = b.repository.revisions._index._graph_index.find_ancestry([(l_rev,)], 0)
b.unlock()
" "kg = graph.KnownGraph(p_map);
for n in kg.merge_sort((l_rev,)):
  n.key, n.revno, n.end_of_merge, n.merge_depth
"

Or just simply running:
  time bzr log -n0 -r -10..-1 >/dev/null
John
=:->

Vincent Ladeuil (vila) wrote :

Nice job !

In tsort.py, you can get rid of 'from collections import deque'.

Nothing else to ask for :) Nice work on the tests too.

review: Approve
Gary van der Merwe (garyvdm) wrote :

Note: This branch fixes Bug 350796.

review: Abstain
Gary van der Merwe (garyvdm) wrote :

> Note: This branch fixes Bug 350796.

Sorry - I was wrong about that. Bug 350796 was fixed by rev 4260.

Preview Diff

=== modified file 'NEWS'
--- NEWS 2009-08-18 20:05:30 +0000
+++ NEWS 2009-08-19 16:35:14 +0000
@@ -9,22 +9,6 @@
 In Development
 ##############
 
-Bug Fixes
-*********
-
-* Fix a test failure on karmic by making a locale test more robust.
-  (Vincent Ladeuil, #413514)
-
-Improvements
-************
-
-* A better description of the platform is shown in crash tracebacks, ``bzr
-  --version`` and ``bzr selftest``.
-  (Martin Pool, #409137)
-
-bzr 1.18
-########
-
 Compatibility Breaks
 ********************
 
@@ -83,6 +67,9 @@
   version-3 protocol, but it does cause test suite failures when testing
   downlevel protocol behaviour. (Robert Collins)
 
+* Fix a test failure on karmic by making a locale test more robust.
+  (Vincent Ladeuil, #413514)
+
 * Fixed "Pack ... already exists" error when running ``bzr pack`` on a
   fully packed 2a repository. (Andrew Bennetts, #382463)
 
@@ -109,12 +96,21 @@
 Improvements
 ************
 
+* A better description of the platform is shown in crash tracebacks, ``bzr
+  --version`` and ``bzr selftest``.
+  (Martin Pool, #409137)
+
 * Cross-format fetches (such as between 1.9-rich-root and 2a) via the
   smart server are more efficient now. They send inventory deltas rather
   than full inventories. The smart server has two new requests,
   ``Repository.get_stream_1.19`` and ``Repository.insert_stream_1.19`` to
   support this. (Andrew Bennetts, #374738, #385826)
 
+* Extracting the full ancestry and computing the ``merge_sort`` is now
+  significantly faster. This effects things like ``bzr log -n0``. (For
+  example, ``bzr log -r -10..-1 -n0 bzr.dev`` is 2.5s down to 1.0s.
+  (John Arbash Meinel)
+
 Documentation
 *************
 
@@ -136,15 +132,20 @@
   friendly StreamSource, which now automatically does the same
   transformations as InterDifferingSerializer. (Andrew Bennetts)
 
+* ``KnownGraph`` now has a ``.topo_sort`` and ``.merge_sort`` member which
+  are implemented in pyrex and significantly faster. This is exposed along
+  with ``CombinedGraphIndex.find_ancestry()`` as
+  ``VersionedFiles.get_known_graph_ancestry(keys)``.
+  (John Arbash Meinel)
+
 * RemoteBranch.open now honours ignore_fallbacks correctly on bzr-v2
   protocols. (Robert Collins)
 
 * The index code now has some specialized routines to extract the full
   ancestry of a key in a more efficient manner.
-  ``CombinedGraphIndex.find_ancestry()``. This is not fully exposed to the
-  higher levels yet, but has the potential to improve grabbing the full
-  ancestry tremendously. (Time to get ancestry for bzr.dev drops from 1.5s
-  down to 300ms. For OOo from 33s => 10.5s) (John Arbash Meinel)
+  ``CombinedGraphIndex.find_ancestry()``. (Time to get ancestry for
+  bzr.dev drops from 1.5s down to 300ms. For OOo from 33s => 10.5s) (John
+  Arbash Meinel)
 
 Testing
 *******
 
=== modified file 'bzrlib/_known_graph_py.py'
--- bzrlib/_known_graph_py.py 2009-07-08 20:58:10 +0000
+++ bzrlib/_known_graph_py.py 2009-08-19 16:35:14 +0000
@@ -18,6 +18,7 @@
 """
 
 from bzrlib import (
+    errors,
     revision,
     )
 
@@ -40,6 +41,18 @@
             self.parent_keys, self.child_keys)
 
 
+class _MergeSortNode(object):
+    """Information about a specific node in the merge graph."""
+
+    __slots__ = ('key', 'merge_depth', 'revno', 'end_of_merge')
+
+    def __init__(self, key, merge_depth, revno, end_of_merge):
+        self.key = key
+        self.merge_depth = merge_depth
+        self.revno = revno
+        self.end_of_merge = end_of_merge
+
+
 class KnownGraph(object):
     """This is a class which assumes we already know the full graph."""
 
@@ -171,3 +184,51 @@
             self._known_heads[heads_key] = heads
         return heads
 
+    def topo_sort(self):
+        """Return the nodes in topological order.
+
+        All parents must occur before all children.
+        """
+        for node in self._nodes.itervalues():
+            if node.gdfo is None:
+                raise errors.GraphCycleError(self._nodes)
+        pending = self._find_tails()
+        pending_pop = pending.pop
+        pending_append = pending.append
+
+        topo_order = []
+        topo_order_append = topo_order.append
+
+        num_seen_parents = dict.fromkeys(self._nodes, 0)
+        while pending:
+            node = pending_pop()
+            if node.parent_keys is not None:
+                # We don't include ghost parents
+                topo_order_append(node.key)
+            for child_key in node.child_keys:
+                child_node = self._nodes[child_key]
+                seen_parents = num_seen_parents[child_key] + 1
+                if seen_parents == len(child_node.parent_keys):
+                    # All parents have been processed, enqueue this child
+                    pending_append(child_node)
+                    # This has been queued up, stop tracking it
+                    del num_seen_parents[child_key]
+                else:
+                    num_seen_parents[child_key] = seen_parents
+        # We started from the parents, so we don't need to do anymore work
+        return topo_order
+
+    def merge_sort(self, tip_key):
+        """Compute the merge sorted graph output."""
+        from bzrlib import tsort
+        as_parent_map = dict((node.key, node.parent_keys)
+                             for node in self._nodes.itervalues()
+                             if node.parent_keys is not None)
+        # We intentionally always generate revnos and never force the
+        # mainline_revisions
+        # Strip the sequence_number that merge_sort generates
+        return [_MergeSortNode(key, merge_depth, revno, end_of_merge)
+                for _, key, merge_depth, revno, end_of_merge
+                in tsort.merge_sort(as_parent_map, tip_key,
+                                    mainline_revisions=None,
+                                    generate_revno=True)]
 
=== modified file 'bzrlib/_known_graph_pyx.pyx'
--- bzrlib/_known_graph_pyx.pyx 2009-07-14 16:10:32 +0000
+++ bzrlib/_known_graph_pyx.pyx 2009-08-19 16:35:14 +0000
@@ -44,8 +44,9 @@
 
     void Py_INCREF(object)
 
+import gc
 
-from bzrlib import revision
+from bzrlib import errors, revision
 
 cdef object NULL_REVISION
 NULL_REVISION = revision.NULL_REVISION
@@ -59,10 +60,9 @@
     cdef object children
     cdef public long gdfo
     cdef int seen
+    cdef object extra
 
     def __init__(self, key):
-        cdef int i
-
         self.key = key
         self.parents = None
 
@@ -70,6 +70,7 @@
         # Greatest distance from origin
         self.gdfo = -1
         self.seen = 0
+        self.extra = None
 
     property child_keys:
         def __get__(self):
@@ -115,9 +116,7 @@
         return <_KnownGraphNode>temp_node
 
 
-# TODO: slab allocate all _KnownGraphNode objects.
-#       We already know how many we are going to need, except for a couple of
-#       ghosts that could be allocated on demand.
+cdef class _MergeSorter
 
 cdef class KnownGraph:
     """This is a class which assumes we already know the full graph."""
@@ -136,6 +135,9 @@
         # Maps {sorted(revision_id, revision_id): heads}
         self._known_heads = {}
         self.do_cache = int(do_cache)
+        # TODO: consider disabling gc since we are allocating a lot of nodes
+        #       that won't be collectable anyway. real world testing has not
+        #       shown a specific impact, yet.
         self._initialize_nodes(parent_map)
         self._find_gdfo()
 
@@ -183,11 +185,16 @@
                 parent_keys = <object>temp_parent_keys
             num_parent_keys = len(parent_keys)
             node = self._get_or_create_node(key)
-            # We know how many parents, so we could pre allocate an exact sized
-            # tuple here
+            # We know how many parents, so we pre allocate the tuple
             parent_nodes = PyTuple_New(num_parent_keys)
-            # We use iter here, because parent_keys maybe be a list or tuple
             for pos2 from 0 <= pos2 < num_parent_keys:
+                # Note: it costs us 10ms out of 40ms to lookup all of these
+                #       parents, it doesn't seem to be an allocation overhead,
+                #       but rather a lookup overhead. There doesn't seem to be
+                #       a way around it, and that is one reason why
+                #       KnownGraphNode maintains a direct pointer to the parent
+                #       node.
+                # We use [] because parent_keys may be a tuple or list
                 parent_node = self._get_or_create_node(parent_keys[pos2])
                 # PyTuple_SET_ITEM will steal a reference, so INCREF first
                 Py_INCREF(parent_node)
@@ -335,3 +342,353 @@
         if self.do_cache:
             PyDict_SetItem(self._known_heads, heads_key, heads)
         return heads
+
+    def topo_sort(self):
+        """Return the nodes in topological order.
+
+        All parents must occur before all children.
+        """
+        # This is, for the most part, the same iteration order that we used
+        # for _find_gdfo, consider finding a way to remove the duplication
+        # In general, we find the 'tails' (nodes with no parents), and then
+        # walk to the children. For children that have all of their parents
+        # yielded, we queue up the child to be yielded as well.
+        cdef _KnownGraphNode node
+        cdef _KnownGraphNode child
+        cdef PyObject *temp
+        cdef Py_ssize_t pos
+        cdef int replace
+        cdef Py_ssize_t last_item
+
+        pending = self._find_tails()
+        if PyList_GET_SIZE(pending) == 0 and len(self._nodes) > 0:
+            raise errors.GraphCycleError(self._nodes)
+
+        topo_order = []
+
+        last_item = PyList_GET_SIZE(pending) - 1
+        while last_item >= 0:
+            # Avoid pop followed by push, instead, peek, and replace
+            # timing shows this is 930ms => 770ms for OOo
+            node = _get_list_node(pending, last_item)
+            last_item = last_item - 1
+            if node.parents is not None:
+                # We don't include ghost parents
+                PyList_Append(topo_order, node.key)
+            for pos from 0 <= pos < PyList_GET_SIZE(node.children):
+                child = _get_list_node(node.children, pos)
+                if child.gdfo == -1:
+                    # We know we have a graph cycle because a node has a parent
+                    # which we couldn't find
+                    raise errors.GraphCycleError(self._nodes)
+                child.seen = child.seen + 1
+                if child.seen == PyTuple_GET_SIZE(child.parents):
+                    # All parents of this child have been yielded, queue this
+                    # one to be yielded as well
+                    last_item = last_item + 1
+                    if last_item < PyList_GET_SIZE(pending):
+                        Py_INCREF(child) # SetItem steals a ref
+                        PyList_SetItem(pending, last_item, child)
+                    else:
+                        PyList_Append(pending, child)
+                    # We have queued this node, we don't need to track it
+                    # anymore
+                    child.seen = 0
+        # We started from the parents, so we don't need to do anymore work
+        return topo_order
+
+
+    def merge_sort(self, tip_key):
+        """Compute the merge sorted graph output."""
+        cdef _MergeSorter sorter
+
+        # TODO: consider disabling gc since we are allocating a lot of nodes
+        #       that won't be collectable anyway. real world testing has not
+        #       shown a specific impact, yet.
+        sorter = _MergeSorter(self, tip_key)
+        return sorter.topo_order()
+
+
+cdef class _MergeSortNode:
+    """Tracks information about a node during the merge_sort operation."""
+
+    # Public api
+    cdef public object key
+    cdef public long merge_depth
+    cdef public object end_of_merge # True/False Is this the end of the current merge
+
+    # Private api, used while computing the information
+    cdef _KnownGraphNode left_parent
+    cdef _KnownGraphNode left_pending_parent
+    cdef object pending_parents # list of _KnownGraphNode for non-left parents
+    cdef long _revno_first
+    cdef long _revno_second
+    cdef long _revno_last
+    # TODO: turn these into flag/bit fields rather than individual members
+    cdef int is_first_child # Is this the first child?
+    cdef int seen_by_child # A child node has seen this parent
+    cdef int completed # Fully Processed
+
+    def __init__(self, key):
+        self.key = key
+        self.merge_depth = -1
+        self.left_parent = None
+        self.left_pending_parent = None
+        self.pending_parents = None
+        self._revno_first = -1
+        self._revno_second = -1
+        self._revno_last = -1
+        self.is_first_child = 0
+        self.seen_by_child = 0
+        self.completed = 0
+
+    def __repr__(self):
+        return '%s(depth:%s rev:%s,%s,%s first:%s seen:%s)' % (self.__class__.__name__,
+            self.merge_depth,
+            self._revno_first, self._revno_second, self._revno_last,
+            self.is_first_child, self.seen_by_child)
+
+    cdef int has_pending_parents(self):
+        if self.left_pending_parent is not None or self.pending_parents:
+            return 1
+        return 0
+
+    cdef object _revno(self):
+        if self._revno_first == -1:
+            if self._revno_second != -1:
+                raise RuntimeError('Something wrong with: %s' % (self,))
+            return (self._revno_last,)
+        else:
+            return (self._revno_first, self._revno_second, self._revno_last)
+
+    property revno:
+        def __get__(self):
+            return self._revno()
+
+
+cdef class _MergeSorter:
+    """This class does the work of computing the merge_sort ordering.
+
+    We have some small advantages, in that we get all the extra information
+    that KnownGraph knows, like knowing the child lists, etc.
+    """
+
+    # Current performance numbers for merge_sort(bzr_dev_parent_map):
+    #  302ms tsort.merge_sort()
+    #   91ms graph.KnownGraph().merge_sort()
+    #   40ms kg.merge_sort()
+
+    cdef KnownGraph graph
+    cdef object _depth_first_stack  # list
+    cdef Py_ssize_t _last_stack_item # offset to last item on stack
+    # cdef object _ms_nodes # dict of key => _MergeSortNode
+    cdef object _revno_to_branch_count # {revno => num child branches}
+    cdef object _scheduled_nodes # List of nodes ready to be yielded
+
+    def __init__(self, known_graph, tip_key):
+        cdef _KnownGraphNode node
+
+        self.graph = known_graph
+        # self._ms_nodes = {}
+        self._revno_to_branch_count = {}
+        self._depth_first_stack = []
+        self._last_stack_item = -1
+        self._scheduled_nodes = []
+        if (tip_key is not None and tip_key != NULL_REVISION
+            and tip_key != (NULL_REVISION,)):
+            node = self.graph._nodes[tip_key]
+            self._get_ms_node(node)
+            self._push_node(node, 0)
+
+    cdef _MergeSortNode _get_ms_node(self, _KnownGraphNode node):
+        cdef PyObject *temp_node
+        cdef _MergeSortNode ms_node
+
+        if node.extra is None:
+            ms_node = _MergeSortNode(node.key)
+            node.extra = ms_node
+        else:
+            ms_node = <_MergeSortNode>node.extra
+        return ms_node
+
+    cdef _push_node(self, _KnownGraphNode node, long merge_depth):
+        cdef _KnownGraphNode parent_node
+        cdef _MergeSortNode ms_node, ms_parent_node
+        cdef Py_ssize_t pos
+
+        ms_node = self._get_ms_node(node)
+        ms_node.merge_depth = merge_depth
+        if PyTuple_GET_SIZE(node.parents) > 0:
+            parent_node = _get_parent(node.parents, 0)
+            ms_node.left_parent = parent_node
+            ms_node.left_pending_parent = parent_node
+        if PyTuple_GET_SIZE(node.parents) > 1:
+            ms_node.pending_parents = []
+            for pos from 1 <= pos < PyTuple_GET_SIZE(node.parents):
+                parent_node = _get_parent(node.parents, pos)
+                if parent_node.parents is None: # ghost
+                    continue
+                PyList_Append(ms_node.pending_parents, parent_node)
+
+        ms_node.is_first_child = 1
+        if ms_node.left_parent is not None:
+            ms_parent_node = self._get_ms_node(ms_node.left_parent)
+            if ms_parent_node.seen_by_child:
+                ms_node.is_first_child = 0
+            ms_parent_node.seen_by_child = 1
+        self._last_stack_item = self._last_stack_item + 1
+        if self._last_stack_item < PyList_GET_SIZE(self._depth_first_stack):
+            Py_INCREF(node) # SetItem steals a ref
+            PyList_SetItem(self._depth_first_stack, self._last_stack_item,
+                           node)
+        else:
+            PyList_Append(self._depth_first_stack, node)
+
+    cdef _pop_node(self):
+        cdef PyObject *temp
+        cdef _MergeSortNode ms_node, ms_parent_node, ms_prev_node
+        cdef _KnownGraphNode node, parent_node, prev_node
+
+        node = _get_list_node(self._depth_first_stack, self._last_stack_item)
+        ms_node = <_MergeSortNode>node.extra
+        self._last_stack_item = self._last_stack_item - 1
+        if ms_node.left_parent is not None:
+            # Assign the revision number from the left-hand parent
+            ms_parent_node = <_MergeSortNode>ms_node.left_parent.extra
+            if ms_node.is_first_child:
+                # First child just increments the final digit
+                ms_node._revno_first = ms_parent_node._revno_first
+                ms_node._revno_second = ms_parent_node._revno_second
+                ms_node._revno_last = ms_parent_node._revno_last + 1
+            else:
+                # Not the first child, make a new branch
+                #  (mainline_revno, branch_count, 1)
+                if ms_parent_node._revno_first == -1:
+                    # Mainline ancestor, the increment is on the last digit
+                    base_revno = ms_parent_node._revno_last
+                else:
+                    base_revno = ms_parent_node._revno_first
+                temp = PyDict_GetItem(self._revno_to_branch_count,
+                                      base_revno)
+                if temp == NULL:
+                    branch_count = 1
+                else:
+                    branch_count = (<object>temp) + 1
+                PyDict_SetItem(self._revno_to_branch_count, base_revno,
+                               branch_count)
+                ms_node._revno_first = base_revno
+                ms_node._revno_second = branch_count
+                ms_node._revno_last = 1
+        else:
+            temp = PyDict_GetItem(self._revno_to_branch_count, 0)
+            if temp == NULL:
+                # The first root node doesn't have a 3-digit revno
+                root_count = 0
+                ms_node._revno_first = -1
+                ms_node._revno_second = -1
+                ms_node._revno_last = 1
+            else:
+                root_count = (<object>temp) + 1
+                ms_node._revno_first = 0
+                ms_node._revno_second = root_count
+                ms_node._revno_last = 1
+            PyDict_SetItem(self._revno_to_branch_count, 0, root_count)
+        ms_node.completed = 1
+        if PyList_GET_SIZE(self._scheduled_nodes) == 0:
+            # The first scheduled node is always the end of merge
+            ms_node.end_of_merge = True
+        else:
+            prev_node = _get_list_node(self._scheduled_nodes,
+                                       PyList_GET_SIZE(self._scheduled_nodes) - 1)
+            ms_prev_node = <_MergeSortNode>prev_node.extra
+            if ms_prev_node.merge_depth < ms_node.merge_depth:
+                # The previously pushed node is to our left, so this is the end
+                # of this right-hand chain
+                ms_node.end_of_merge = True
+            elif (ms_prev_node.merge_depth == ms_node.merge_depth
+                  and prev_node not in node.parents):
+                # The next node is not a direct parent of this node
+                ms_node.end_of_merge = True
+            else:
+                ms_node.end_of_merge = False
+        PyList_Append(self._scheduled_nodes, node)
+
+    cdef _schedule_stack(self):
+        cdef _KnownGraphNode last_node, next_node
+        cdef _MergeSortNode ms_node, ms_last_node, ms_next_node
+        cdef long next_merge_depth
+        ordered = []
+        while self._last_stack_item >= 0:
+            # Peek at the last item on the stack
+            last_node = _get_list_node(self._depth_first_stack,
+                                       self._last_stack_item)
+            if last_node.gdfo == -1:
+                # if _find_gdfo skipped a node, that means there is a graph
+                # cycle, error out now
+                raise errors.GraphCycleError(self.graph._nodes)
+            ms_last_node = <_MergeSortNode>last_node.extra
+            if not ms_last_node.has_pending_parents():
+                # Processed all parents, pop this node
+                self._pop_node()
+                continue
+            while ms_last_node.has_pending_parents():
+                if ms_last_node.left_pending_parent is not None:
+                    # recurse depth first into the primary parent
+                    next_node = ms_last_node.left_pending_parent
+                    ms_last_node.left_pending_parent = None
+                else:
+                    # place any merges in right-to-left order for scheduling
+                    # which gives us left-to-right order after we reverse
+                    # the scheduled queue.
+                    # Note: This has the effect of allocating common-new
+                    #       revisions to the right-most subtree rather than the
+                    #       left most, which will display nicely (you get
+                    #       smaller trees at the top of the combined merge).
+                    next_node = ms_last_node.pending_parents.pop()
+                ms_next_node = self._get_ms_node(next_node)
+                if ms_next_node.completed:
+                    # this parent was completed by a child on the
+                    # call stack. skip it.
+                    continue
+                # otherwise transfer it from the source graph into the
+                # top of the current depth first search stack.
+
+                if next_node is ms_last_node.left_parent:
+                    next_merge_depth = ms_last_node.merge_depth
+                else:
+                    next_merge_depth = ms_last_node.merge_depth + 1
+                self._push_node(next_node, next_merge_depth)
+                # and do not continue processing parents until this 'call'
+                # has recursed.
+                break
+
+    cdef topo_order(self):
+        cdef _MergeSortNode ms_node
+        cdef _KnownGraphNode node
+        cdef Py_ssize_t pos
+        cdef PyObject *temp_key, *temp_node
+
+        # Note: allocating a _MergeSortNode and deallocating it for all nodes
+        #       costs approx 8.52ms (21%) of the total runtime
+        #       We might consider moving the attributes into the base
+        #       KnownGraph object.
+        self._schedule_stack()
+
+        # We've set up the basic schedule, now we can continue processing the
+        # output.
+        # Note: This final loop costs us 40.0ms => 28.8ms (11ms, 25%) on
+        #       bzr.dev, to convert the internal Object representation into a
+        #       Tuple representation...
+        #       2ms is walking the data and computing revno tuples
+        #       7ms is computing the return tuple
+        #       4ms is PyList_Append()
+        ordered = []
+        # output the result in reverse order, and separate the generated info
+        for pos from PyList_GET_SIZE(self._scheduled_nodes) > pos >= 0:
+            node = _get_list_node(self._scheduled_nodes, pos)
+            ms_node = <_MergeSortNode>node.extra
+            PyList_Append(ordered, ms_node)
+            node.extra = None
+        # Clear out the scheduled nodes now that we're done
+        self._scheduled_nodes = []
+        return ordered
 
=== modified file 'bzrlib/annotate.py'
--- bzrlib/annotate.py 2009-07-08 17:09:03 +0000
+++ bzrlib/annotate.py 2009-08-19 16:35:14 +0000
@@ -188,6 +188,10 @@
     # or something.
     last_revision = current_rev.revision_id
     # XXX: Partially Cloned from branch, uses the old_get_graph, eep.
+    # XXX: The main difficulty is that we need to inject a single new node
+    #      (current_rev) into the graph before it gets numbered, etc.
+    #      Once KnownGraph gets an 'add_node()' function, we can use
+    #      VF.get_known_graph_ancestry().
     graph = repository.get_graph()
     revision_graph = dict(((key, value) for key, value in
         graph.iter_ancestry(current_rev.parent_ids) if value is not None))
 
=== modified file 'bzrlib/branch.py'
--- bzrlib/branch.py 2009-08-17 06:22:18 +0000
+++ bzrlib/branch.py 2009-08-19 16:35:14 +0000
@@ -446,15 +446,11 @@
         # start_revision_id.
         if self._merge_sorted_revisions_cache is None:
             last_revision = self.last_revision()
-            graph = self.repository.get_graph()
-            parent_map = dict(((key, value) for key, value in
-                graph.iter_ancestry([last_revision]) if value is not None))
-            revision_graph = repository._strip_NULL_ghosts(parent_map)
-            revs = tsort.merge_sort(revision_graph, last_revision, None,
-                generate_revno=True)
-            # Drop the sequence # before caching
-            self._merge_sorted_revisions_cache = [r[1:] for r in revs]
-
+            last_key = (last_revision,)
+            known_graph = self.repository.revisions.get_known_graph_ancestry(
+                [last_key])
+            self._merge_sorted_revisions_cache = known_graph.merge_sort(
+                last_key)
         filtered = self._filter_merge_sorted_revisions(
             self._merge_sorted_revisions_cache, start_revision_id,
             stop_revision_id, stop_rule)
@@ -470,27 +466,34 @@
470 """Iterate over an inclusive range of sorted revisions."""466 """Iterate over an inclusive range of sorted revisions."""
471 rev_iter = iter(merge_sorted_revisions)467 rev_iter = iter(merge_sorted_revisions)
472 if start_revision_id is not None:468 if start_revision_id is not None:
473 for rev_id, depth, revno, end_of_merge in rev_iter:469 for node in rev_iter:
470 rev_id = node.key[-1]
474 if rev_id != start_revision_id:471 if rev_id != start_revision_id:
475 continue472 continue
476 else:473 else:
477 # The decision to include the start or not474 # The decision to include the start or not
478 # depends on the stop_rule if a stop is provided475 # depends on the stop_rule if a stop is provided
479 rev_iter = chain(476 # so pop this node back into the iterator
480 iter([(rev_id, depth, revno, end_of_merge)]),477 rev_iter = chain(iter([node]), rev_iter)
481 rev_iter)
482 break478 break
483 if stop_revision_id is None:479 if stop_revision_id is None:
484 for rev_id, depth, revno, end_of_merge in rev_iter:480 # Yield everything
485 yield rev_id, depth, revno, end_of_merge481 for node in rev_iter:
482 rev_id = node.key[-1]
483 yield (rev_id, node.merge_depth, node.revno,
484 node.end_of_merge)
486 elif stop_rule == 'exclude':485 elif stop_rule == 'exclude':
487 for rev_id, depth, revno, end_of_merge in rev_iter:486 for node in rev_iter:
487 rev_id = node.key[-1]
488 if rev_id == stop_revision_id:488 if rev_id == stop_revision_id:
489 return489 return
490 yield rev_id, depth, revno, end_of_merge490 yield (rev_id, node.merge_depth, node.revno,
491 node.end_of_merge)
491 elif stop_rule == 'include':492 elif stop_rule == 'include':
492 for rev_id, depth, revno, end_of_merge in rev_iter:493 for node in rev_iter:
493 yield rev_id, depth, revno, end_of_merge494 rev_id = node.key[-1]
495 yield (rev_id, node.merge_depth, node.revno,
496 node.end_of_merge)
494 if rev_id == stop_revision_id:497 if rev_id == stop_revision_id:
495 return498 return
496 elif stop_rule == 'with-merges':499 elif stop_rule == 'with-merges':
@@ -499,10 +502,12 @@
                 left_parent = stop_rev.parent_ids[0]
             else:
                 left_parent = _mod_revision.NULL_REVISION
-            for rev_id, depth, revno, end_of_merge in rev_iter:
+            for node in rev_iter:
+                rev_id = node.key[-1]
                 if rev_id == left_parent:
                     return
-                yield rev_id, depth, revno, end_of_merge
+                yield (rev_id, node.merge_depth, node.revno,
+                       node.end_of_merge)
         else:
             raise ValueError('invalid stop_rule %r' % stop_rule)
 
 
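The start-revision handling above pops the matched node back onto the iterator with `itertools.chain` so the stop_rule logic can decide later whether to include it. A standalone sketch of that pattern (plain strings stand in for merge_sort nodes; the helper name is illustrative, not bzrlib API):

```python
from itertools import chain

def skip_until(items, start):
    """Advance over `items` until `start`, keeping `start` itself available.

    Returns an iterator beginning at `start`, or an empty iterator if
    `start` never appears.
    """
    it = iter(items)
    for item in it:
        if item == start:
            # Push the matched item back so the caller decides whether
            # to include it, exactly like the chain() call in the patch.
            return chain(iter([item]), it)
    return iter([])

remaining = list(skip_until(['C', 'B', 'A'], 'B'))
# remaining == ['B', 'A']
```

The same trick avoids materializing the whole merge-sorted list just to find the start point.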
=== modified file 'bzrlib/graph.py'
--- bzrlib/graph.py 2009-08-04 04:36:34 +0000
+++ bzrlib/graph.py 2009-08-19 16:35:14 +0000
@@ -21,7 +21,6 @@
     errors,
     revision,
     trace,
-    tsort,
     )
 from bzrlib.symbol_versioning import deprecated_function, deprecated_in
 
@@ -926,6 +925,7 @@
         An ancestor may sort after a descendant if the relationship is not
         visible in the supplied list of revisions.
         """
+        from bzrlib import tsort
         sorter = tsort.TopoSorter(self.get_parent_map(revisions))
         return sorter.iter_topo_order()
 
 
=== modified file 'bzrlib/groupcompress.py'
--- bzrlib/groupcompress.py 2009-08-04 04:36:34 +0000
+++ bzrlib/groupcompress.py 2009-08-19 16:35:14 +0000
@@ -62,16 +62,15 @@
     # groupcompress ordering is approximately reverse topological,
     # properly grouped by file-id.
     per_prefix_map = {}
-    for item in parent_map.iteritems():
-        key = item[0]
+    for key, value in parent_map.iteritems():
         if isinstance(key, str) or len(key) == 1:
             prefix = ''
         else:
             prefix = key[0]
         try:
-            per_prefix_map[prefix].append(item)
+            per_prefix_map[prefix][key] = value
         except KeyError:
-            per_prefix_map[prefix] = [item]
+            per_prefix_map[prefix] = {key: value}
 
     present_keys = []
     for prefix in sorted(per_prefix_map):
@@ -1099,6 +1098,13 @@
         self._check_lines_not_unicode(lines)
         self._check_lines_are_lines(lines)
 
+    def get_known_graph_ancestry(self, keys):
+        """Get a KnownGraph instance with the ancestry of keys."""
+        parent_map, missing_keys = self._index._graph_index.find_ancestry(keys,
+            0)
+        kg = _mod_graph.KnownGraph(parent_map)
+        return kg
+
     def get_parent_map(self, keys):
         """Get a map of the graph parents of keys.
 
 
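The revised groupcompress loop above collects `parent_map` entries into one dict per key prefix rather than a list of items. The same grouping, sketched in isolation (`setdefault` replaces the patch's try/except KeyError shape, behavior unchanged):

```python
def group_by_prefix(parent_map):
    """Split a key->parents dict into one dict per key prefix.

    Single-element (or plain string) keys all share the '' prefix,
    mirroring the isinstance/len check in the patch.
    """
    per_prefix_map = {}
    for key, value in parent_map.items():
        if isinstance(key, str) or len(key) == 1:
            prefix = ''
        else:
            prefix = key[0]
        # Equivalent to the try/except KeyError in the patch.
        per_prefix_map.setdefault(prefix, {})[key] = value
    return per_prefix_map

grouped = group_by_prefix({('f1', 'r1'): (),
                           ('f1', 'r2'): (('f1', 'r1'),),
                           ('f2', 'r1'): ()})
# grouped has one sub-dict for 'f1' (two entries) and one for 'f2'
```

Keeping a dict per prefix lets later passes look entries up by key instead of scanning item lists.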
=== modified file 'bzrlib/index.py'
--- bzrlib/index.py 2009-08-13 19:56:26 +0000
+++ bzrlib/index.py 2009-08-19 16:35:14 +0000
@@ -333,6 +333,22 @@
         if combine_backing_indices is not None:
             self._combine_backing_indices = combine_backing_indices
 
+    def find_ancestry(self, keys, ref_list_num):
+        """See CombinedGraphIndex.find_ancestry()"""
+        pending = set(keys)
+        parent_map = {}
+        missing_keys = set()
+        while pending:
+            next_pending = set()
+            for _, key, value, ref_lists in self.iter_entries(pending):
+                parent_keys = ref_lists[ref_list_num]
+                parent_map[key] = parent_keys
+                next_pending.update([p for p in parent_keys if p not in
+                                     parent_map])
+            missing_keys.update(pending.difference(parent_map))
+            pending = next_pending
+        return parent_map, missing_keys
+
 
 class GraphIndex(object):
     """An index for data with embedded graphs.
 
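The `find_ancestry()` added above walks parent references breadth-first, recording any key that never resolves as missing. The core loop, sketched against a plain callable standing in for the index's `iter_entries()` lookup (names here are illustrative):

```python
def find_ancestry(get_parents, keys):
    """Breadth-first ancestry walk: return (parent_map, missing_keys).

    `get_parents` maps a key to its parent tuple, raising KeyError for
    absent keys - a stand-in for the index lookup in the patch.
    """
    pending = set(keys)
    parent_map = {}
    missing_keys = set()
    while pending:
        next_pending = set()
        for key in pending:
            try:
                parent_keys = get_parents(key)
            except KeyError:
                # Unresolved keys stay in `pending` and are flagged below.
                continue
            parent_map[key] = parent_keys
            next_pending.update(p for p in parent_keys
                                if p not in parent_map)
        # Anything requested this round but never found is missing.
        missing_keys.update(pending.difference(parent_map))
        pending = next_pending
    return parent_map, missing_keys

graph = {'C': ('A', 'B'), 'B': ('A',), 'A': ()}
pm, missing = find_ancestry(graph.__getitem__, ['C', 'ghost'])
# pm covers A, B and C; 'ghost' lands in missing
```

Batching each generation into one `pending` set is what lets the real implementation issue a single `iter_entries()` call per round instead of one lookup per key.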
=== modified file 'bzrlib/knit.py'
--- bzrlib/knit.py 2009-08-04 04:36:34 +0000
+++ bzrlib/knit.py 2009-08-19 16:35:14 +0000
@@ -1190,6 +1190,12 @@
         generator = _VFContentMapGenerator(self, [key])
         return generator._get_content(key)
 
+    def get_known_graph_ancestry(self, keys):
+        """Get a KnownGraph instance with the ancestry of keys."""
+        parent_map, missing_keys = self._index.find_ancestry(keys)
+        kg = _mod_graph.KnownGraph(parent_map)
+        return kg
+
     def get_parent_map(self, keys):
         """Get a map of the graph parents of keys.
 
@@ -2560,6 +2566,33 @@
         except KeyError:
             raise RevisionNotPresent(key, self)
 
+    def find_ancestry(self, keys):
+        """See CombinedGraphIndex.find_ancestry()"""
+        prefixes = set(key[:-1] for key in keys)
+        self._load_prefixes(prefixes)
+        result = {}
+        parent_map = {}
+        missing_keys = set()
+        pending_keys = list(keys)
+        # This assumes that keys will not reference parents in a different
+        # prefix, which is accurate so far.
+        while pending_keys:
+            key = pending_keys.pop()
+            if key in parent_map:
+                continue
+            prefix = key[:-1]
+            try:
+                suffix_parents = self._kndx_cache[prefix][0][key[-1]][4]
+            except KeyError:
+                missing_keys.add(key)
+            else:
+                parent_keys = tuple([prefix + (suffix,)
+                                     for suffix in suffix_parents])
+                parent_map[key] = parent_keys
+                pending_keys.extend([p for p in parent_keys
+                                     if p not in parent_map])
+        return parent_map, missing_keys
+
     def get_parent_map(self, keys):
         """Get a map of the parents of keys.
 
@@ -3049,6 +3082,10 @@
             options.append('no-eol')
         return options
 
+    def find_ancestry(self, keys):
+        """See CombinedGraphIndex.find_ancestry()"""
+        return self._graph_index.find_ancestry(keys, 0)
+
     def get_parent_map(self, keys):
         """Get a map of the parents of keys.
 
 
=== modified file 'bzrlib/missing.py'
--- bzrlib/missing.py 2009-03-23 14:59:43 +0000
+++ bzrlib/missing.py 2009-08-19 16:35:14 +0000
@@ -138,31 +138,13 @@
     if not ancestry: #Empty ancestry, no need to do any work
         return []
 
-    mainline_revs, rev_nos, start_rev_id, end_rev_id = log._get_mainline_revs(
-        branch, None, tip_revno)
-    if not mainline_revs:
-        return []
-
-    # This asks for all mainline revisions, which is size-of-history and
-    # should be addressed (but currently the only way to get correct
-    # revnos).
-
-    # mainline_revisions always includes an extra revision at the
-    # beginning, so don't request it.
-    parent_map = dict(((key, value) for key, value
-                       in graph.iter_ancestry(mainline_revs[1:])
-                       if value is not None))
-    # filter out ghosts; merge_sort errors on ghosts.
-    # XXX: is this needed here ? -- vila080910
-    rev_graph = _mod_repository._strip_NULL_ghosts(parent_map)
-    # XXX: what if rev_graph is empty now ? -- vila080910
-    merge_sorted_revisions = tsort.merge_sort(rev_graph, tip,
-                                              mainline_revs,
-                                              generate_revno=True)
+    merge_sorted_revisions = branch.iter_merge_sorted_revisions()
     # Now that we got the correct revnos, keep only the relevant
     # revisions.
     merge_sorted_revisions = [
-        (s, revid, n, d, e) for s, revid, n, d, e in merge_sorted_revisions
+        # log.reverse_by_depth expects seq_num to be present, but it is
+        # stripped by iter_merge_sorted_revisions()
+        (0, revid, n, d, e) for revid, n, d, e in merge_sorted_revisions
         if revid in ancestry]
     if not backward:
         merge_sorted_revisions = log.reverse_by_depth(merge_sorted_revisions)
 
=== modified file 'bzrlib/reconcile.py'
--- bzrlib/reconcile.py 2009-06-10 03:56:49 +0000
+++ bzrlib/reconcile.py 2009-08-19 16:35:14 +0000
@@ -33,7 +33,7 @@
     repofmt,
     )
 from bzrlib.trace import mutter, note
-from bzrlib.tsort import TopoSorter
+from bzrlib.tsort import topo_sort
 from bzrlib.versionedfile import AdapterFactory, FulltextContentFactory
 
 
@@ -247,8 +247,7 @@
 
         # we have topological order of revisions and non ghost parents ready.
         self._setup_steps(len(self._rev_graph))
-        revision_keys = [(rev_id,) for rev_id in
-            TopoSorter(self._rev_graph.items()).iter_topo_order()]
+        revision_keys = [(rev_id,) for rev_id in topo_sort(self._rev_graph)]
         stream = self._change_inv_parents(
             self.inventory.get_record_stream(revision_keys, 'unordered', True),
             self._new_inv_parents,
@@ -378,7 +377,7 @@
         new_inventories = self.repo._temp_inventories()
         # we have topological order of revisions and non ghost parents ready.
         graph = self.revisions.get_parent_map(self.revisions.keys())
-        revision_keys = list(TopoSorter(graph).iter_topo_order())
+        revision_keys = topo_sort(graph)
         revision_ids = [key[-1] for key in revision_keys]
         self._setup_steps(len(revision_keys))
         stream = self._change_inv_parents(
 
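`topo_sort(parent_map)` replaces the `TopoSorter(...).iter_topo_order()` dance in the hunks above. A minimal sketch of what a parents-first sort over such a dict does (an iterative depth-first variant for illustration only; bzrlib's version now thunks to KnownGraph and, unlike this sketch, raises GraphCycleError on cycles):

```python
def topo_sort(parent_map):
    """Return the keys of parent_map so that every parent that is itself
    present in parent_map precedes its children.

    No cycle detection - assumes an acyclic graph (a real DVCS ancestry).
    """
    result = []
    done = set()
    for start in parent_map:
        stack = [start]
        while stack:
            node = stack.pop()
            if node in done:
                continue
            # Parents still to emit; absent (ghost) parents are skipped.
            todo = [p for p in parent_map.get(node, ())
                    if p in parent_map and p not in done]
            if todo:
                # Revisit node after its parents have been emitted.
                stack.append(node)
                stack.extend(todo)
            else:
                done.add(node)
                result.append(node)
    return result

order = topo_sort({'A': (), 'B': ('A',), 'C': ('A', 'B')})
# every parent precedes its children, e.g. ['A', 'B', 'C']
```

As the proposal notes, the thunked version wins mostly by avoiding per-node dict lookups in Python, not by a different algorithm.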
=== modified file 'bzrlib/repofmt/weaverepo.py'
--- bzrlib/repofmt/weaverepo.py 2009-08-14 11:11:29 +0000
+++ bzrlib/repofmt/weaverepo.py 2009-08-19 16:35:14 +0000
@@ -28,6 +28,7 @@
 lazy_import(globals(), """
 from bzrlib import (
     xml5,
+    graph as _mod_graph,
     )
 """)
 from bzrlib import (
@@ -663,6 +664,13 @@
             result[key] = parents
         return result
 
+    def get_known_graph_ancestry(self, keys):
+        """Get a KnownGraph instance with the ancestry of keys."""
+        keys = self.keys()
+        parent_map = self.get_parent_map(keys)
+        kg = _mod_graph.KnownGraph(parent_map)
+        return kg
+
     def get_record_stream(self, keys, sort_order, include_delta_closure):
         for key in keys:
             text, parents = self._load_text_parents(key)
 
=== modified file 'bzrlib/repository.py'
--- bzrlib/repository.py 2009-08-17 23:15:55 +0000
+++ bzrlib/repository.py 2009-08-19 16:35:14 +0000
@@ -4351,7 +4351,7 @@
         phase = 'file'
         revs = search.get_keys()
         graph = self.from_repository.get_graph()
-        revs = list(graph.iter_topo_order(revs))
+        revs = tsort.topo_sort(graph.get_parent_map(revs))
         data_to_fetch = self.from_repository.item_keys_introduced_by(revs)
         text_keys = []
         for knit_kind, file_id, revisions in data_to_fetch:
 
=== modified file 'bzrlib/tests/__init__.py'
--- bzrlib/tests/__init__.py 2009-08-18 14:20:28 +0000
+++ bzrlib/tests/__init__.py 2009-08-19 16:35:15 +0000
@@ -3434,6 +3434,7 @@
         'bzrlib.tests.per_repository',
         'bzrlib.tests.per_repository_chk',
         'bzrlib.tests.per_repository_reference',
+        'bzrlib.tests.per_versionedfile',
         'bzrlib.tests.per_workingtree',
         'bzrlib.tests.test__annotator',
         'bzrlib.tests.test__chk_map',
@@ -3585,7 +3586,6 @@
         'bzrlib.tests.test_urlutils',
         'bzrlib.tests.test_version',
         'bzrlib.tests.test_version_info',
-        'bzrlib.tests.test_versionedfile',
         'bzrlib.tests.test_weave',
         'bzrlib.tests.test_whitebox',
         'bzrlib.tests.test_win32utils',
 
=== modified file 'bzrlib/tests/blackbox/test_ancestry.py'
--- bzrlib/tests/blackbox/test_ancestry.py 2009-03-23 14:59:43 +0000
+++ bzrlib/tests/blackbox/test_ancestry.py 2009-08-19 16:35:15 +0000
@@ -43,9 +43,15 @@
 
     def _check_ancestry(self, location='', result=None):
         out = self.run_bzr(['ancestry', location])[0]
-        if result is None:
+        if result is not None:
+            self.assertEqualDiff(result, out)
+        else:
+            # A2 and B1 can be in either order, because they are parallel, and
+            # thus their topological order is not defined
             result = "A1\nB1\nA2\nA3\n"
-        self.assertEqualDiff(out, result)
+            if result != out:
+                result = "A1\nA2\nB1\nA3\n"
+            self.assertEqualDiff(result, out)
 
     def test_ancestry(self):
         """Tests 'ancestry' command"""
 
=== renamed file 'bzrlib/tests/test_versionedfile.py' => 'bzrlib/tests/per_versionedfile.py'
--- bzrlib/tests/test_versionedfile.py 2009-08-04 04:36:34 +0000
+++ bzrlib/tests/per_versionedfile.py 2009-08-19 16:35:15 +0000
@@ -26,6 +26,7 @@
 
 from bzrlib import (
     errors,
+    graph as _mod_graph,
     groupcompress,
     knit as _mod_knit,
     osutils,
@@ -1737,6 +1738,25 @@
                 f.get_record_stream([key_b], 'unordered', True
                     ).next().get_bytes_as('fulltext'))
 
+    def test_get_known_graph_ancestry(self):
+        f = self.get_versionedfiles()
+        if not self.graph:
+            raise TestNotApplicable('ancestry info only relevant with graph.')
+        key_a = self.get_simple_key('a')
+        key_b = self.get_simple_key('b')
+        key_c = self.get_simple_key('c')
+        # A
+        # |\
+        # | B
+        # |/
+        # C
+        f.add_lines(key_a, [], ['\n'])
+        f.add_lines(key_b, [key_a], ['\n'])
+        f.add_lines(key_c, [key_a, key_b], ['\n'])
+        kg = f.get_known_graph_ancestry([key_c])
+        self.assertIsInstance(kg, _mod_graph.KnownGraph)
+        self.assertEqual([key_a, key_b, key_c], list(kg.topo_sort()))
+
     def test_get_record_stream_empty(self):
         """An empty stream can be requested without error."""
         f = self.get_versionedfiles()
 
=== modified file 'bzrlib/tests/test__known_graph.py'
--- bzrlib/tests/test__known_graph.py 2009-07-08 20:58:10 +0000
+++ bzrlib/tests/test__known_graph.py 2009-08-19 16:35:15 +0000
@@ -16,6 +16,8 @@
 
 """Tests for the python and pyrex extensions of KnownGraph"""
 
+import pprint
+
 from bzrlib import (
     errors,
     graph as _mod_graph,
@@ -30,13 +32,15 @@
30 """Parameterize tests for all versions of groupcompress."""32 """Parameterize tests for all versions of groupcompress."""
31 scenarios = [33 scenarios = [
32 ('python', {'module': _known_graph_py, 'do_cache': True}),34 ('python', {'module': _known_graph_py, 'do_cache': True}),
35 ]
36 caching_scenarios = [
33 ('python-nocache', {'module': _known_graph_py, 'do_cache': False}),37 ('python-nocache', {'module': _known_graph_py, 'do_cache': False}),
34 ]38 ]
35 suite = loader.suiteClass()39 suite = loader.suiteClass()
36 if CompiledKnownGraphFeature.available():40 if CompiledKnownGraphFeature.available():
37 from bzrlib import _known_graph_pyx41 from bzrlib import _known_graph_pyx
38 scenarios.append(('C', {'module': _known_graph_pyx, 'do_cache': True}))42 scenarios.append(('C', {'module': _known_graph_pyx, 'do_cache': True}))
39 scenarios.append(('C-nocache',43 caching_scenarios.append(('C-nocache',
40 {'module': _known_graph_pyx, 'do_cache': False}))44 {'module': _known_graph_pyx, 'do_cache': False}))
41 else:45 else:
42 # the compiled module isn't available, so we add a failing test46 # the compiled module isn't available, so we add a failing test
@@ -44,8 +48,14 @@
             def test_fail(self):
                 self.requireFeature(CompiledKnownGraphFeature)
         suite.addTest(loader.loadTestsFromTestCase(FailWithoutFeature))
-    result = tests.multiply_tests(standard_tests, scenarios, suite)
-    return result
+    # TestKnownGraphHeads needs to be permutated with and without caching.
+    # All other TestKnownGraph tests only need to be tested across module
+    heads_suite, other_suite = tests.split_suite_by_condition(
+        standard_tests, tests.condition_isinstance(TestKnownGraphHeads))
+    suite = tests.multiply_tests(other_suite, scenarios, suite)
+    suite = tests.multiply_tests(heads_suite, scenarios + caching_scenarios,
+                                 suite)
+    return suite
 
 
 class _CompiledKnownGraphFeature(tests.Feature):
@@ -73,14 +83,16 @@
 alt_merge = {'a': [], 'b': ['a'], 'c': ['b'], 'd': ['a', 'c']}
 
 
-class TestKnownGraph(tests.TestCase):
+class TestCaseWithKnownGraph(tests.TestCase):
 
     module = None # Set by load_tests
-    do_cache = None # Set by load_tests
 
     def make_known_graph(self, ancestry):
         return self.module.KnownGraph(ancestry, do_cache=self.do_cache)
 
+
+class TestKnownGraph(TestCaseWithKnownGraph):
+
     def assertGDFO(self, graph, rev, gdfo):
         node = graph._nodes[rev]
         self.assertEqual(gdfo, node.gdfo)
@@ -127,6 +139,11 @@
         self.assertGDFO(graph, 'a', 5)
         self.assertGDFO(graph, 'c', 5)
 
+
+class TestKnownGraphHeads(TestCaseWithKnownGraph):
+
+    do_cache = None # Set by load_tests
+
     def test_heads_null(self):
         graph = self.make_known_graph(test_graph.ancestry_1)
         self.assertEqual(set(['null:']), graph.heads(['null:']))
@@ -227,3 +244,513 @@
227 self.assertEqual(set(['c']), graph.heads(['c', 'b', 'd', 'g']))244 self.assertEqual(set(['c']), graph.heads(['c', 'b', 'd', 'g']))
228 self.assertEqual(set(['a', 'c']), graph.heads(['a', 'c', 'e', 'g']))245 self.assertEqual(set(['a', 'c']), graph.heads(['a', 'c', 'e', 'g']))
229 self.assertEqual(set(['a', 'c']), graph.heads(['a', 'c', 'f']))246 self.assertEqual(set(['a', 'c']), graph.heads(['a', 'c', 'f']))
247
248
249class TestKnownGraphTopoSort(TestCaseWithKnownGraph):
250
251 def assertTopoSortOrder(self, ancestry):
252 """Check topo_sort and iter_topo_order is genuinely topological order.
253
254 For every child in the graph, check if it comes after all of it's
255 parents.
256 """
257 graph = self.make_known_graph(ancestry)
258 sort_result = graph.topo_sort()
259 # We should have an entry in sort_result for every entry present in the
260 # graph.
261 self.assertEqual(len(ancestry), len(sort_result))
262 node_idx = dict((node, idx) for idx, node in enumerate(sort_result))
263 for node in sort_result:
264 parents = ancestry[node]
265 for parent in parents:
266 if parent not in ancestry:
267 # ghost
268 continue
269 if node_idx[node] <= node_idx[parent]:
270 self.fail("parent %s must come before child %s:\n%s"
271 % (parent, node, sort_result))
272
273 def test_topo_sort_empty(self):
274 """TopoSort empty list"""
275 self.assertTopoSortOrder({})
276
277 def test_topo_sort_easy(self):
278 """TopoSort list with one node"""
279 self.assertTopoSortOrder({0: []})
280
281 def test_topo_sort_cycle(self):
282 """TopoSort traps graph with cycles"""
283 g = self.make_known_graph({0: [1],
284 1: [0]})
285 self.assertRaises(errors.GraphCycleError, g.topo_sort)
286
287 def test_topo_sort_cycle_2(self):
288 """TopoSort traps graph with longer cycle"""
289 g = self.make_known_graph({0: [1],
290 1: [2],
291 2: [0]})
292 self.assertRaises(errors.GraphCycleError, g.topo_sort)
293
294 def test_topo_sort_cycle_with_tail(self):
295 """TopoSort traps graph with longer cycle"""
296 g = self.make_known_graph({0: [1],
297 1: [2],
298 2: [3, 4],
299 3: [0],
300 4: []})
301 self.assertRaises(errors.GraphCycleError, g.topo_sort)
302
303 def test_topo_sort_1(self):
304 """TopoSort simple nontrivial graph"""
305 self.assertTopoSortOrder({0: [3],
306 1: [4],
307 2: [1, 4],
308 3: [],
309 4: [0, 3]})
310
311 def test_topo_sort_partial(self):
312 """Topological sort with partial ordering.
313
314 Multiple correct orderings are possible, so test for
315 correctness, not for exact match on the resulting list.
316 """
317 self.assertTopoSortOrder({0: [],
318 1: [0],
319 2: [0],
320 3: [0],
321 4: [1, 2, 3],
322 5: [1, 2],
323 6: [1, 2],
324 7: [2, 3],
325 8: [0, 1, 4, 5, 6]})
326
327 def test_topo_sort_ghost_parent(self):
328 """Sort nodes, but don't include some parents in the output"""
329 self.assertTopoSortOrder({0: [1],
330 1: [2]})
331
332
333class TestKnownGraphMergeSort(TestCaseWithKnownGraph):
334
335 def assertSortAndIterate(self, ancestry, branch_tip, result_list):
336 """Check that merge based sorting and iter_topo_order on graph works."""
337 graph = self.make_known_graph(ancestry)
338 value = graph.merge_sort(branch_tip)
339 value = [(n.key, n.merge_depth, n.revno, n.end_of_merge)
340 for n in value]
341 if result_list != value:
342 self.assertEqualDiff(pprint.pformat(result_list),
343 pprint.pformat(value))
344
345 def test_merge_sort_empty(self):
346 # sorting of an emptygraph does not error
347 self.assertSortAndIterate({}, None, [])
348 self.assertSortAndIterate({}, NULL_REVISION, [])
349 self.assertSortAndIterate({}, (NULL_REVISION,), [])
350
351 def test_merge_sort_not_empty_no_tip(self):
352 # merge sorting of a branch starting with None should result
353 # in an empty list: no revisions are dragged in.
354 self.assertSortAndIterate({0: []}, None, [])
355 self.assertSortAndIterate({0: []}, NULL_REVISION, [])
356 self.assertSortAndIterate({0: []}, (NULL_REVISION,), [])
357
358 def test_merge_sort_one_revision(self):
359 # sorting with one revision as the tip returns the correct fields:
360 # sequence - 0, revision id, merge depth - 0, end_of_merge
361 self.assertSortAndIterate({'id': []},
362 'id',
363 [('id', 0, (1,), True)])
364
365 def test_sequence_numbers_increase_no_merges(self):
366 # emit a few revisions with no merges to check the sequence
367 # numbering works in trivial cases
368 self.assertSortAndIterate(
369 {'A': [],
370 'B': ['A'],
371 'C': ['B']},
372 'C',
373 [('C', 0, (3,), False),
374 ('B', 0, (2,), False),
375 ('A', 0, (1,), True),
376 ],
377 )
378
379 def test_sequence_numbers_increase_with_merges(self):
380 # test that sequence numbers increase across merges
381 self.assertSortAndIterate(
382 {'A': [],
383 'B': ['A'],
384 'C': ['A', 'B']},
385 'C',
386 [('C', 0, (2,), False),
387 ('B', 1, (1,1,1), True),
388 ('A', 0, (1,), True),
389 ],
390 )
391
392 def test_merge_sort_race(self):
393 # A
394 # |
395 # B-.
396 # |\ \
397 # | | C
398 # | |/
399 # | D
400 # |/
401 # F
402 graph = {'A': [],
403 'B': ['A'],
404 'C': ['B'],
405 'D': ['B', 'C'],
406 'F': ['B', 'D'],
407 }
408 self.assertSortAndIterate(graph, 'F',
409 [('F', 0, (3,), False),
410 ('D', 1, (2,2,1), False),
411 ('C', 2, (2,1,1), True),
412 ('B', 0, (2,), False),
413 ('A', 0, (1,), True),
414 ])
415 # A
416 # |
417 # B-.
418 # |\ \
419 # | X C
420 # | |/
421 # | D
422 # |/
423 # F
424 graph = {'A': [],
425 'B': ['A'],
426 'C': ['B'],
427 'X': ['B'],
428 'D': ['X', 'C'],
429 'F': ['B', 'D'],
430 }
431 self.assertSortAndIterate(graph, 'F',
432 [('F', 0, (3,), False),
433 ('D', 1, (2,1,2), False),
434 ('C', 2, (2,2,1), True),
435 ('X', 1, (2,1,1), True),
436 ('B', 0, (2,), False),
437 ('A', 0, (1,), True),
438 ])
439
440 def test_merge_depth_with_nested_merges(self):
441 # the merge depth marker should reflect the depth of the revision
442 # in terms of merges out from the mainline
443 # revid, depth, parents:
444 # A 0 [D, B]
445 # B 1 [C, F]
446 # C 1 [H]
447 # D 0 [H, E]
448 # E 1 [G, F]
449 # F 2 [G]
450 # G 1 [H]
451 # H 0
452 self.assertSortAndIterate(
453 {'A': ['D', 'B'],
454 'B': ['C', 'F'],
455 'C': ['H'],
456 'D': ['H', 'E'],
457 'E': ['G', 'F'],
458 'F': ['G'],
459 'G': ['H'],
460 'H': []
461 },
462 'A',
463 [('A', 0, (3,), False),
464 ('B', 1, (1,3,2), False),
465 ('C', 1, (1,3,1), True),
466 ('D', 0, (2,), False),
467 ('E', 1, (1,1,2), False),
468 ('F', 2, (1,2,1), True),
469 ('G', 1, (1,1,1), True),
470 ('H', 0, (1,), True),
471 ],
472 )
473
474 def test_dotted_revnos_with_simple_merges(self):
475 # A 1
476 # |\
477 # B C 2, 1.1.1
478 # | |\
479 # D E F 3, 1.1.2, 1.2.1
480 # |/ /|
481 # G H I 4, 1.2.2, 1.3.1
482 # |/ /
483 # J K 5, 1.3.2
484 # |/
485 # L 6
486 self.assertSortAndIterate(
487 {'A': [],
488 'B': ['A'],
489 'C': ['A'],
490 'D': ['B'],
491 'E': ['C'],
492 'F': ['C'],
493 'G': ['D', 'E'],
494 'H': ['F'],
495 'I': ['F'],
496 'J': ['G', 'H'],
497 'K': ['I'],
498 'L': ['J', 'K'],
499 },
500 'L',
501 [('L', 0, (6,), False),
502 ('K', 1, (1,3,2), False),
503 ('I', 1, (1,3,1), True),
504 ('J', 0, (5,), False),
505 ('H', 1, (1,2,2), False),
506 ('F', 1, (1,2,1), True),
507 ('G', 0, (4,), False),
508 ('E', 1, (1,1,2), False),
509 ('C', 1, (1,1,1), True),
510 ('D', 0, (3,), False),
511 ('B', 0, (2,), False),
512 ('A', 0, (1,), True),
513 ],
514 )
515 # Adding a shortcut from the first revision should not change any of
516 # the existing numbers
517 self.assertSortAndIterate(
518 {'A': [],
519 'B': ['A'],
520 'C': ['A'],
521 'D': ['B'],
522 'E': ['C'],
523 'F': ['C'],
524 'G': ['D', 'E'],
525 'H': ['F'],
526 'I': ['F'],
527 'J': ['G', 'H'],
528 'K': ['I'],
529 'L': ['J', 'K'],
530 'M': ['A'],
531 'N': ['L', 'M'],
532 },
533 'N',
534 [('N', 0, (7,), False),
535 ('M', 1, (1,4,1), True),
536 ('L', 0, (6,), False),
+             ('K', 1, (1,3,2), False),
+             ('I', 1, (1,3,1), True),
+             ('J', 0, (5,), False),
+             ('H', 1, (1,2,2), False),
+             ('F', 1, (1,2,1), True),
+             ('G', 0, (4,), False),
+             ('E', 1, (1,1,2), False),
+             ('C', 1, (1,1,1), True),
+             ('D', 0, (3,), False),
+             ('B', 0, (2,), False),
+             ('A', 0, (1,), True),
+             ],
+            )
+
+    def test_end_of_merge_not_last_revision_in_branch(self):
+        # within a branch only the last revision gets an
+        # end of merge marker.
+        self.assertSortAndIterate(
+            {'A': ['B'],
+             'B': [],
+            },
+            'A',
+            [('A', 0, (2,), False),
+             ('B', 0, (1,), True)
+            ],
+            )
+
+    def test_end_of_merge_multiple_revisions_merged_at_once(self):
+        # when multiple branches are merged at once, both of their
+        # branch-endpoints should be listed as end-of-merge.
+        # Also, the order of the multiple merges should be
+        # left-right shown top to bottom.
+        # * means end of merge
+        # A 0 [H, B, E]
+        # B 1 [D, C]
+        # C 2 [D] *
+        # D 1 [H] *
+        # E 1 [G, F]
+        # F 2 [G] *
+        # G 1 [H] *
+        # H 0 [] *
+        self.assertSortAndIterate(
+            {'A': ['H', 'B', 'E'],
+             'B': ['D', 'C'],
+             'C': ['D'],
+             'D': ['H'],
+             'E': ['G', 'F'],
+             'F': ['G'],
+             'G': ['H'],
+             'H': [],
+            },
+            'A',
+            [('A', 0, (2,), False),
+             ('B', 1, (1,3,2), False),
+             ('C', 2, (1,4,1), True),
+             ('D', 1, (1,3,1), True),
+             ('E', 1, (1,1,2), False),
+             ('F', 2, (1,2,1), True),
+             ('G', 1, (1,1,1), True),
+             ('H', 0, (1,), True),
+            ],
+            )
+
+    def test_parallel_root_sequence_numbers_increase_with_merges(self):
+        """When there are parallel roots, check their revnos."""
+        self.assertSortAndIterate(
+            {'A': [],
+             'B': [],
+             'C': ['A', 'B']},
+            'C',
+            [('C', 0, (2,), False),
+             ('B', 1, (0,1,1), True),
+             ('A', 0, (1,), True),
+            ],
+            )
+
+    def test_revnos_are_globally_assigned(self):
+        """revnos are assigned according to the revision they derive from."""
+        # in this test we setup a number of branches that all derive from
+        # the first revision, and then merge them one at a time, which
+        # should give the revisions as they merge numbers still deriving from
+        # the revision were based on.
+        # merge 3: J: ['G', 'I']
+        # branch 3:
+        # I: ['H']
+        # H: ['A']
+        # merge 2: G: ['D', 'F']
+        # branch 2:
+        # F: ['E']
+        # E: ['A']
+        # merge 1: D: ['A', 'C']
+        # branch 1:
+        # C: ['B']
+        # B: ['A']
+        # root: A: []
+        self.assertSortAndIterate(
+            {'J': ['G', 'I'],
+             'I': ['H',],
+             'H': ['A'],
+             'G': ['D', 'F'],
+             'F': ['E'],
+             'E': ['A'],
+             'D': ['A', 'C'],
+             'C': ['B'],
+             'B': ['A'],
+             'A': [],
+            },
+            'J',
+            [('J', 0, (4,), False),
+             ('I', 1, (1,3,2), False),
+             ('H', 1, (1,3,1), True),
+             ('G', 0, (3,), False),
+             ('F', 1, (1,2,2), False),
+             ('E', 1, (1,2,1), True),
+             ('D', 0, (2,), False),
+             ('C', 1, (1,1,2), False),
+             ('B', 1, (1,1,1), True),
+             ('A', 0, (1,), True),
+            ],
+            )
+
+    def test_roots_and_sub_branches_versus_ghosts(self):
+        """Extra roots and their mini branches use the same numbering.
+
+        All of them use the 0-node numbering.
+        """
+        # A D K
+        # | |\ |\
+        # B E F L M
+        # | |/ |/
+        # C G N
+        # |/ |\
+        # H I O P
+        # |/ |/
+        # J Q
+        # |.---'
+        # R
+        self.assertSortAndIterate(
+            {'A': [],
+             'B': ['A'],
+             'C': ['B'],
+             'D': [],
+             'E': ['D'],
+             'F': ['D'],
+             'G': ['E', 'F'],
+             'H': ['C', 'G'],
+             'I': [],
+             'J': ['H', 'I'],
+             'K': [],
+             'L': ['K'],
+             'M': ['K'],
+             'N': ['L', 'M'],
+             'O': ['N'],
+             'P': ['N'],
+             'Q': ['O', 'P'],
+             'R': ['J', 'Q'],
+            },
+            'R',
+            [('R', 0, (6,), False),
+             ('Q', 1, (0,4,5), False),
+             ('P', 2, (0,6,1), True),
+             ('O', 1, (0,4,4), False),
+             ('N', 1, (0,4,3), False),
+             ('M', 2, (0,5,1), True),
+             ('L', 1, (0,4,2), False),
+             ('K', 1, (0,4,1), True),
+             ('J', 0, (5,), False),
+             ('I', 1, (0,3,1), True),
+             ('H', 0, (4,), False),
+             ('G', 1, (0,1,3), False),
+             ('F', 2, (0,2,1), True),
+             ('E', 1, (0,1,2), False),
+             ('D', 1, (0,1,1), True),
+             ('C', 0, (3,), False),
+             ('B', 0, (2,), False),
+             ('A', 0, (1,), True),
+            ],
+            )
+
+    def test_ghost(self):
+        # merge_sort should be able to ignore ghosts
+        # A
+        # |
+        # B ghost
+        # |/
+        # C
+        self.assertSortAndIterate(
+            {'A': [],
+             'B': ['A'],
+             'C': ['B', 'ghost'],
+            },
+            'C',
+            [('C', 0, (3,), False),
+             ('B', 0, (2,), False),
+             ('A', 0, (1,), True),
+            ])
+
+    def test_graph_cycle(self):
+        # merge_sort should fail with a simple error when a graph cycle is
+        # encountered.
+        #
+        # A
+        # |,-.
+        # B |
+        # | |
+        # C ^
+        # | |
+        # D |
+        # |'-'
+        # E
+        self.assertRaises(errors.GraphCycleError,
+            self.assertSortAndIterate,
+            {'A': [],
+             'B': ['D'],
+             'C': ['B'],
+             'D': ['C'],
+             'E': ['D'],
+            },
+            'E',
+            [])
 
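An aside on the test_ghost case above: it documents that merge_sort silently ignores ghost parents (parents named in the graph but not present as nodes). Outside of merge_sort, the same effect can be had for a plain topological sort by pre-filtering the parent map. This is an illustrative sketch, not part of the diff; filter_ghosts is a hypothetical helper name:

```python
def filter_ghosts(parent_map):
    """Drop parent references that are not themselves nodes in the map.

    parent_map: dict of node -> list of parents. Returns a new dict in
    which every listed parent is guaranteed to be a key of the map.
    """
    return dict((node, [p for p in parents if p in parent_map])
                for node, parents in parent_map.items())
```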
=== modified file 'bzrlib/tests/test_tsort.py'
--- bzrlib/tests/test_tsort.py 2009-08-17 15:26:18 +0000
+++ bzrlib/tests/test_tsort.py 2009-08-19 16:35:15 +0000
@@ -17,6 +17,7 @@
 
 """Tests for topological sort."""
 
+import pprint
 
 from bzrlib.tests import TestCase
 from bzrlib.tsort import topo_sort, TopoSorter, MergeSorter, merge_sort
@@ -39,6 +40,23 @@
                           list,
                           TopoSorter(graph).iter_topo_order())
 
+    def assertSortAndIterateOrder(self, graph):
+        """Check topo_sort and iter_topo_order is genuinely topological order.
+
+        For every child in the graph, check if it comes after all of it's
+        parents.
+        """
+        sort_result = topo_sort(graph)
+        iter_result = list(TopoSorter(graph).iter_topo_order())
+        for (node, parents) in graph:
+            for parent in parents:
+                if sort_result.index(node) < sort_result.index(parent):
+                    self.fail("parent %s must come before child %s:\n%s"
+                              % (parent, node, sort_result))
+                if iter_result.index(node) < iter_result.index(parent):
+                    self.fail("parent %s must come before child %s:\n%s"
+                              % (parent, node, iter_result))
+
     def test_tsort_empty(self):
         """TopoSort empty list"""
         self.assertSortAndIterate([], [])
@@ -60,6 +78,15 @@
                                         1: [2],
                                         2: [0]}.items())
 
+    def test_topo_sort_cycle_with_tail(self):
+        """TopoSort traps graph with longer cycle"""
+        self.assertSortAndIterateRaise(GraphCycleError,
+                                       {0: [1],
+                                        1: [2],
+                                        2: [3, 4],
+                                        3: [0],
+                                        4: []}.items())
+
     def test_tsort_1(self):
         """TopoSort simple nontrivial graph"""
         self.assertSortAndIterate({0: [3],
@@ -72,10 +99,10 @@
     def test_tsort_partial(self):
         """Topological sort with partial ordering.
 
-        If the graph does not give an order between two nodes, they are
-        returned in lexicographical order.
+        Multiple correct orderings are possible, so test for
+        correctness, not for exact match on the resulting list.
         """
-        self.assertSortAndIterate(([(0, []),
+        self.assertSortAndIterateOrder([(0, []),
                                     (1, [0]),
                                     (2, [0]),
                                     (3, [0]),
@@ -83,8 +110,7 @@
                                     (5, [1, 2]),
                                     (6, [1, 2]),
                                     (7, [2, 3]),
-                                    (8, [0, 1, 4, 5, 6])]),
-                                   [0, 1, 2, 3, 4, 5, 6, 7, 8])
+                                    (8, [0, 1, 4, 5, 6])])
 
     def test_tsort_unincluded_parent(self):
         """Sort nodes, but don't include some parents in the output"""
@@ -102,12 +128,8 @@
             mainline_revisions=mainline_revisions,
             generate_revno=generate_revno)
         if result_list != value:
-            import pprint
             self.assertEqualDiff(pprint.pformat(result_list),
                                  pprint.pformat(value))
-        self.assertEquals(result_list,
-            merge_sort(graph, branch_tip, mainline_revisions=mainline_revisions,
-                generate_revno=generate_revno))
         self.assertEqual(result_list,
                          list(MergeSorter(
                               graph,
 
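The new assertSortAndIterateOrder helper above replaces exact-list assertions with a validity check, since a partially ordered graph admits several correct topological orders. The same check as a standalone sketch (is_topological_order is a hypothetical name; it uses a position map rather than the repeated list.index calls of the test helper):

```python
def is_topological_order(graph, order):
    """Return True if every node in order appears after all of its parents.

    graph: iterable of (node, parents) pairs.
    order: a candidate sorted list containing every node exactly once.
    """
    # Map each node to its position so the check is a dict lookup,
    # not an O(n) list scan per comparison.
    position = dict((node, idx) for idx, node in enumerate(order))
    for node, parents in graph:
        for parent in parents:
            # A parent sorted after its child breaks the ordering.
            if position[node] < position[parent]:
                return False
    return True
```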
=== modified file 'bzrlib/tsort.py'
--- bzrlib/tsort.py 2009-08-17 15:26:18 +0000
+++ bzrlib/tsort.py 2009-08-19 16:35:14 +0000
@@ -18,8 +18,11 @@
 """Topological sorting routines."""
 
 
-from bzrlib import errors
-import bzrlib.revision as _mod_revision
+from bzrlib import (
+    errors,
+    graph as _mod_graph,
+    revision as _mod_revision,
+    )
 
 
 __all__ = ["topo_sort", "TopoSorter", "merge_sort", "MergeSorter"]
@@ -30,12 +33,21 @@
 
     graph -- sequence of pairs of node->parents_list.
 
-    The result is a list of node names, such that all parents come before
-    their children.
+    The result is a list of node names, such that all parents come before their
+    children.
 
     node identifiers can be any hashable object, and are typically strings.
+
+    This function has the same purpose as the TopoSorter class, but uses a
+    different algorithm to sort the graph. That means that while both return a
+    list with parents before their child nodes, the exact ordering can be
+    different.
+
+    topo_sort is faster when the whole list is needed, while when iterating
+    over a part of the list, TopoSorter.iter_topo_order should be used.
     """
-    return TopoSorter(graph).sorted()
+    kg = _mod_graph.KnownGraph(dict(graph))
+    return kg.topo_sort()
 
 
 class TopoSorter(object):
@@ -60,22 +72,8 @@
         iteration or sorting may raise GraphCycleError if a cycle is present
         in the graph.
         """
-        # a dict of the graph.
+        # store a dict of the graph.
         self._graph = dict(graph)
-        self._visitable = set(self._graph)
-        ### if debugging:
-        # self._original_graph = dict(graph)
-
-        # this is a stack storing the depth first search into the graph.
-        self._node_name_stack = []
-        # at each level of 'recursion' we have to check each parent. This
-        # stack stores the parents we have not yet checked for the node at the
-        # matching depth in _node_name_stack
-        self._pending_parents_stack = []
-        # this is a set of the completed nodes for fast checking whether a
-        # parent in a node we are processing on the stack has already been
-        # emitted and thus can be skipped.
-        self._completed_node_names = set()
 
     def sorted(self):
         """Sort the graph and return as a list.
@@ -100,67 +98,64 @@
         After finishing iteration the sorter is empty and you cannot continue
         iteration.
         """
-        while self._graph:
+        graph = self._graph
+        visitable = set(graph)
+
+        # this is a stack storing the depth first search into the graph.
+        pending_node_stack = []
+        # at each level of 'recursion' we have to check each parent. This
+        # stack stores the parents we have not yet checked for the node at the
+        # matching depth in pending_node_stack
+        pending_parents_stack = []
+
+        # this is a set of the completed nodes for fast checking whether a
+        # parent in a node we are processing on the stack has already been
+        # emitted and thus can be skipped.
+        completed_node_names = set()
+
+        while graph:
             # now pick a random node in the source graph, and transfer it to the
-            # top of the depth first search stack.
-            node_name, parents = self._graph.popitem()
-            self._push_node(node_name, parents)
-            while self._node_name_stack:
-                # loop until this call completes.
-                parents_to_visit = self._pending_parents_stack[-1]
-                # if all parents are done, the revision is done
+            # top of the depth first search stack of pending nodes.
+            node_name, parents = graph.popitem()
+            pending_node_stack.append(node_name)
+            pending_parents_stack.append(list(parents))
+
+            # loop until pending_node_stack is empty
+            while pending_node_stack:
+                parents_to_visit = pending_parents_stack[-1]
+                # if there are no parents left, the revision is done
                 if not parents_to_visit:
                     # append the revision to the topo sorted list
-                    # all the nodes parents have been added to the output, now
-                    # we can add it to the output.
-                    yield self._pop_node()
+                    # all the nodes parents have been added to the output,
+                    # now we can add it to the output.
+                    popped_node = pending_node_stack.pop()
+                    pending_parents_stack.pop()
+                    completed_node_names.add(popped_node)
+                    yield popped_node
                 else:
-                    while self._pending_parents_stack[-1]:
-                        # recurse depth first into a single parent
-                        next_node_name = self._pending_parents_stack[-1].pop()
-                        if next_node_name in self._completed_node_names:
-                            # this parent was completed by a child on the
-                            # call stack. skip it.
-                            continue
-                        if next_node_name not in self._visitable:
-                            continue
-                        # otherwise transfer it from the source graph into the
-                        # top of the current depth first search stack.
-                        try:
-                            parents = self._graph.pop(next_node_name)
-                        except KeyError:
-                            # if the next node is not in the source graph it has
-                            # already been popped from it and placed into the
-                            # current search stack (but not completed or we would
-                            # have hit the continue 4 lines up.
-                            # this indicates a cycle.
-                            raise errors.GraphCycleError(self._node_name_stack)
-                        self._push_node(next_node_name, parents)
-                        # and do not continue processing parents until this 'call'
-                        # has recursed.
-                        break
-
-    def _push_node(self, node_name, parents):
-        """Add node_name to the pending node stack.
-
-        Names in this stack will get emitted into the output as they are popped
-        off the stack.
-        """
-        self._node_name_stack.append(node_name)
-        self._pending_parents_stack.append(list(parents))
-
-    def _pop_node(self):
-        """Pop the top node off the stack
-
-        The node is appended to the sorted output.
-        """
-        # we are returning from the flattened call frame:
-        # pop off the local variables
-        node_name = self._node_name_stack.pop()
-        self._pending_parents_stack.pop()
-
-        self._completed_node_names.add(node_name)
-        return node_name
+                    # recurse depth first into a single parent
+                    next_node_name = parents_to_visit.pop()
+
+                    if next_node_name in completed_node_names:
+                        # parent was already completed by a child, skip it.
+                        continue
+                    if next_node_name not in visitable:
+                        # parent is not a node in the original graph, skip it.
+                        continue
+
+                    # transfer it along with its parents from the source graph
+                    # into the top of the current depth first search stack.
+                    try:
+                        parents = graph.pop(next_node_name)
+                    except KeyError:
+                        # if the next node is not in the source graph it has
+                        # already been popped from it and placed into the
+                        # current search stack (but not completed or we would
+                        # have hit the continue 6 lines up). this indicates a
+                        # cycle.
+                        raise errors.GraphCycleError(pending_node_stack)
+                    pending_node_stack.append(next_node_name)
+                    pending_parents_stack.append(list(parents))
 
 
 def merge_sort(graph, branch_tip, mainline_revisions=None, generate_revno=False):
@@ -414,7 +409,8 @@
 
         # seed the search with the tip of the branch
         if (branch_tip is not None and
-            branch_tip != _mod_revision.NULL_REVISION):
+            branch_tip != _mod_revision.NULL_REVISION and
+            branch_tip != (_mod_revision.NULL_REVISION,)):
             parents = self._graph.pop(branch_tip)
             self._push_node(branch_tip, 0, parents)
 
@@ -571,7 +567,11 @@
                     # current search stack (but not completed or we would
                     # have hit the continue 4 lines up.
                     # this indicates a cycle.
-                    raise errors.GraphCycleError(node_name_stack)
+                    if next_node_name in self._original_graph:
+                        raise errors.GraphCycleError(node_name_stack)
+                    else:
+                        # This is just a ghost parent, ignore it
+                        continue
                 next_merge_depth = 0
                 if is_left_subtree:
                     # a new child branch from name_stack[-1]
@@ -673,11 +673,12 @@
             else:
                 # no parents, use the root sequence
-                root_count = self._revno_to_branch_count.get(0, 0)
+                root_count = self._revno_to_branch_count.get(0, -1)
+                root_count += 1
                 if root_count:
                     revno = (0, root_count, 1)
                 else:
                     revno = (1,)
-                root_count += 1
                 self._revno_to_branch_count[0] = root_count
 
         # store the revno for this node for future reference
 
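The rewritten iter_topo_order above inlines the old _push_node/_pop_node helpers into local variables, trading method calls for plain list operations. A self-contained sketch of the resulting iterative depth-first algorithm (it raises ValueError where the real code raises errors.GraphCycleError, and otherwise follows the diff's logic):

```python
def iter_topo_order(graph):
    """Yield nodes of a parent graph so parents come before children.

    graph: mapping of node -> list of parents. Parents that are not
    nodes themselves (ghosts) are skipped; a cycle raises ValueError.
    """
    graph = dict(graph)
    visitable = set(graph)
    pending_node_stack = []       # depth first search stack of nodes
    pending_parents_stack = []    # unvisited parents per stacked node
    completed = set()             # nodes already emitted
    while graph:
        # pick an arbitrary node and push it onto the search stack
        node_name, parents = graph.popitem()
        pending_node_stack.append(node_name)
        pending_parents_stack.append(list(parents))
        while pending_node_stack:
            parents_to_visit = pending_parents_stack[-1]
            if not parents_to_visit:
                # all parents emitted; the node itself is done
                popped = pending_node_stack.pop()
                pending_parents_stack.pop()
                completed.add(popped)
                yield popped
            else:
                next_node = parents_to_visit.pop()
                if next_node in completed or next_node not in visitable:
                    continue
                try:
                    next_parents = graph.pop(next_node)
                except KeyError:
                    # already on the stack but not completed: a cycle
                    raise ValueError("cycle involving %r" % pending_node_stack)
                pending_node_stack.append(next_node)
                pending_parents_stack.append(list(next_parents))
```

Keeping the two stacks as plain locals rather than instance attributes is what lets the inner loop run without attribute lookups on every iteration.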
=== modified file 'bzrlib/versionedfile.py'
--- bzrlib/versionedfile.py 2009-08-07 05:56:29 +0000
+++ bzrlib/versionedfile.py 2009-08-19 16:35:14 +0000
@@ -32,6 +32,7 @@
 from bzrlib import (
     annotate,
     errors,
+    graph as _mod_graph,
     groupcompress,
     index,
     knit,
@@ -941,6 +942,20 @@
             if '\n' in line[:-1]:
                 raise errors.BzrBadParameterContainsNewline("lines")
 
+    def get_known_graph_ancestry(self, keys):
+        """Get a KnownGraph instance with the ancestry of keys."""
+        # most basic implementation is a loop around get_parent_map
+        pending = set(keys)
+        parent_map = {}
+        while pending:
+            this_parent_map = self.get_parent_map(pending)
+            parent_map.update(this_parent_map)
+            pending = set()
+            map(pending.update, this_parent_map.itervalues())
+            pending = pending.difference(parent_map)
+        kg = _mod_graph.KnownGraph(parent_map)
+        return kg
+
     def get_parent_map(self, keys):
         """Get a map of the parents of keys.
 
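The fallback in get_known_graph_ancestry above builds the complete ancestry by looping get_parent_map until no unresolved keys remain, then hands the result to KnownGraph. The same walk as a standalone sketch (collect_ancestry and the get_parent_map callable parameter are illustrative names, not part of the diff):

```python
def collect_ancestry(get_parent_map, keys):
    """Transitively expand keys into a full parent_map.

    get_parent_map(keys) must return {key: parents} for the keys that
    exist; missing keys (ghosts) are simply absent from the result, so
    the walk stops chasing them.
    """
    pending = set(keys)
    parent_map = {}
    while pending:
        this_parent_map = get_parent_map(pending)
        parent_map.update(this_parent_map)
        # gather every parent mentioned this round...
        pending = set()
        for parents in this_parent_map.values():
            pending.update(parents)
        # ...but only chase the ones not already resolved
        pending = pending.difference(parent_map)
    return parent_map
```

Each round issues one batched get_parent_map call for all newly discovered parents, so the number of round trips is bounded by the longest ancestry chain rather than the number of keys.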