Comment 2 for bug 1384463

Joshua Harlow (harlowja) wrote :

I believe all lines are treated as UTF-8 (or converted to it) and then measured with the basic len() operation.

>>> x = u'known exploit in the wild, for example – the time between advance notification'
>>> x
u'known exploit in the wild, for example \u2013 the time between advance notification'
>>> len(x)
78
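
One caveat worth checking: the 78 above is for a unicode object, so len() is counting characters. If the check instead ran on the UTF-8 encoded bytes, the en dash would count as three bytes and the result would differ. A minimal sketch of the difference, assuming the same example line (the variable names are only illustrative and not taken from the actual checking code):

```python
# -*- coding: utf-8 -*-
# Sketch only: contrasts the character count with the UTF-8 byte count.
# The names below are hypothetical, not from the real line-length check.
line = u'known exploit in the wild, for example \u2013 the time between advance notification'

char_count = len(line)                  # code points in the unicode string
byte_count = len(line.encode('utf-8'))  # bytes after encoding to UTF-8

print(char_count)  # 78
print(byte_count)  # 80 (U+2013 encodes to three bytes)
```

So a line that passes a limit when counted as unicode could fail it when counted as raw bytes, depending on which form the check sees.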