This directory contains the software used to assemble the Software Heritage License Blob Dataset (also available from Zenodo).
The main components used are:
- 01-select-blobs.sql, 02-to-sha1.sql, 03-clean.sh
- licenseblobs.stats, available under python/
- licenseblobs.scancode, available under python/
- 04-swhid-to-origin.sh, querying the swh-graph API
- java/
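
As the script names suggest, the pipeline works with blob hashes and SWHIDs. For readers unfamiliar with these identifiers, the following is a minimal sketch of how a SWHID for a file's content (a blob) can be derived from its bytes, assuming the standard `swh:1:cnt:` scheme based on the Git blob hash. The function name `blob_swhid` is hypothetical and is not part of this directory; the scripts listed above may obtain identifiers differently.

```python
import hashlib


def blob_swhid(content: bytes) -> str:
    """Return the SWHID of a content object (hypothetical helper).

    Assumption: the identifier is the Git blob hash, i.e. the SHA-1 of
    the header "blob <length>\\0" followed by the raw bytes, prefixed
    with "swh:1:cnt:".
    """
    header = b"blob %d\x00" % len(content)
    digest = hashlib.sha1(header + content).hexdigest()
    return f"swh:1:cnt:{digest}"


if __name__ == "__main__":
    # The empty blob has a well-known Git hash.
    print(blob_swhid(b""))
    # → swh:1:cnt:e69de29bb2d1d6434b8b29ae775ad8c2e48c5391
```

Note that this hash is not the plain SHA-1 of the file contents, because of the prepended header; the two values differ for the same blob.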