To download and decompress the data sets on-the-fly, make sure you have curl
and zstd
installed, then run:
export DATASET_URL=...
curl --silent --fail ${DATASET_URL} | tar -xv --use-compress-program=unzstd
For multi-file data sets, first download them. Then, to recombine and decompress, run:
cat <data-set-filename>.tar.zst* | tar -xv --use-compress-program=unzstd
This command works on both standalone files (.tar.zst
) and chunked ones (.tar.zst.XXX
).
Use the initial snapshots from the BI data sets, see https://ldbcouncil.org/data-sets-surf-repository/snb-business-intelligence.html.
See https://ldbcouncil.org/data-sets-surf-repository/snb-interactive-v2-updates.html.
See https://ldbcouncil.org/data-sets-surf-repository/snb-factor-tables.html.