Comment 7 for bug 139855

Revision history for this message
Michael Bienia (geser) wrote : Re: [Bug 139855] Re: Display stats about PPA usage

On 2009-08-22 14:48:58 -0000, Fabien Tassin wrote:
> In the meantime, what about giving the PPA owners access to their raw logs,
> properly anonymized, for ex by md5-ing IP addresses? The privacy risk will be
> the same as with popcon (i.e. if there's just 1 user for a given package, it's
> safe to assume it's the PPA maintainer, making him a target), but given a md5,
> finding the IP to exploit is, well, you know..

MD5-ing (or any other hash) the IP address doesn't anonymize it properly
and it can be easily undone. You need just a big table with md5sum to IP
address. And that table isn't even that big as there are only 2^32 IPv4
addresses at all and you need only 128bit for the md5sum and 32bit for
the IP address. That's 20 bytes per record and only 80 GB for the whole
table. And that's only the naive approach for the needed space (more a
upper bound).