You can create a release to package software, along with release notes and links to binary files, for other people to use. Learn more about releases in our docs.
From there, you can measure different aspects of different datasets by running run_data_measurements.py with different options. The options specify the HF Dataset, the Dataset config, the Dataset ...