I believe every line is treated as UTF-8 (or converted to it) and then measured with the basic len() operation, so a non-ASCII character such as the en dash below counts as one character:
>>> x = u'known exploit in the wild, for example – the time between advance notification'
>>> x
u'known exploit in the wild, for example \u2013 the time between advance notification'
>>> len(x)
78
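To show why the decoding step matters, here is a minimal sketch comparing the character count with the UTF-8 byte count for the same line. The helper names char_len and byte_len are made up for illustration, and I am assuming the length check works on the decoded text rather than on raw bytes.

```python
# -*- coding: utf-8 -*-
# Sketch only: char_len and byte_len are hypothetical helpers, not part of any tool.

# Same sample line as above, with the en dash written as an escape.
line = u'known exploit in the wild, for example \u2013 the time between advance notification'


def char_len(text):
    """Length in Unicode code points, i.e. what len() gives on decoded text."""
    return len(text)


def byte_len(text):
    """Length in bytes once the text is encoded as UTF-8."""
    return len(text.encode('utf-8'))


print(char_len(line))
print(byte_len(line))
```

The en dash is one code point but three bytes in UTF-8, so the two counts differ by two. If the check ran on undecoded bytes instead, this line would measure 80 rather than 78.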