So there's a good difference with the bigger dataset, almost half a second longer:
detailed=True:
$ time ./scripts/active_edit -c CVE-2024-NNN1 -p python-django
real 0m0.848s
user 0m0.752s
sys 0m0.096s
detailed=False:
$ time ./scripts/active_edit -c CVE-2024-NNN1 -p python-django
real 0m0.440s
user 0m0.383s
sys 0m0.056s
I think we're better off keeping a non-detailed dataset, and if we think other tools could benefit from this, we can certainly generate a detailed dataset too. There's no reason why we can't generate a few different pickles there, depending on usage.
So there's a good difference with the bigger dataset, almost half a second longer:
detailed=True: active_ edit -c CVE-2024-NNN1 -p python-django
$ time ./scripts/
real 0m0.848s
user 0m0.752s
sys 0m0.096s
detailed=False: active_ edit -c CVE-2024-NNN1 -p python-django
$ time ./scripts/
real 0m0.440s
user 0m0.383s
sys 0m0.056s
I think we're better off keeping a non-detailed dataset, and if we think other tools could benefit from this, we can certainly generate a detailed dataset too. There's no reason why we can't generate a few different pickles there, depending on usage.