Dne 19. 11. 22 v 12:37 Miro Hrončok napsal(a):
On 18. 11. 22 15:09, Miroslav Suchý wrote:
> |Fun fact: the script with the checks runs on my notebook for 32 hours.|
That sounds pretty bad. What is the biggest bottleneck? Do you clone the repositories
from dist-git, or use
https://pkgs.fedoraproject.org/repo/rpm-specs-latest.tar.xz ?
It is not so bad. I am not sitting in front of the computer and waiting till the script
finish. :) I just wanted
emphasis that I cannot gather the data daily and the heuristics is already complicated.
It is mostly quick'n'dirty design. Like gathering the data from one step. Then
feeding the data from first step to next
script which process second step. Then feeding the data to do third one...
I already started concating the pipes so it will run in paralel next time.
But yeah, biggest time consumer is git checkout of packages (that does not mention spdx in
%changelog). I am not aware
of other method how to retrieve git log.
Miroslav