richardyoung's picture
Upload reproduce/README.md with huggingface_hub
ed40df8 verified
|
Raw
History Blame Contribute Delete
4.09 kB

Reproduction guide

This directory contains the necessary information and assets to reproduce the results obtained during this Heretic run.

Models

Datasets

Selected trial

  • Trial number: 104
  • KL divergence: 0.006577
  • Refusals: 53/100

System

  • Python: 3.10.12 (CPython, GCC 11.4.0) [Virtualenv/Venv]
  • Operating system: Linux-6.8.0-107-generic-x86_64-with-glibc2.35 (x86_64)
  • CPU: AMD EPYC 9355 32-Core Processor

Accelerators

  • CUDA: Detected 1 device(s) (94.97 GB total VRAM)
    • CUDA Version: 12.8
    • Driver Version: 580.126.20
  • Devices:
    • CUDA 0: NVIDIA RTX PRO 6000 Blackwell Server Edition (94.97 GB)

Environment

  • Heretic: v1.4.0 (Origin: PyPI)
  • PyTorch: 2.11.0+cu128
  • Other dependencies: See requirements.txt.

Contents of this directory

How to reproduce

You can automate this process, including all verification steps, by downloading the reproduce.json file and running heretic --reproduce reproduce.json.

  1. Ensure your system matches the specifications in the System section above. Exact reproducibility is only guaranteed if all aspects of your system are identical to the one the model was originally generated on.
  2. Install the exact version of Heretic indicated in the Environment section above, from its original source.
  3. Install the packages listed in requirements.txt: pip install -r requirements.txt
  4. Install the correct version of PyTorch: pip install torch==2.11.0+cu128 --index-url https://download.pytorch.org/whl/cu128
  5. Place the provided config.toml in your working directory.
  6. Run Heretic without any additional arguments: heretic
  7. Wait for the run to finish, then select trial 104 and export the model.
  8. Verify that the weight files have been exactly reproduced by comparing their SHA-256 hashes against those in SHA256SUMS: sha256sum -c SHA256SUMS (or look at the hashes online if you uploaded to Hugging Face)

To use the included Optuna study journal empero-ai--Qwythos-9B-Claude-Mythos-5-1M.jsonl, place it in the checkpoints directory (usually checkpoints/) before running Heretic.

This allows you to export other models from the Pareto front, or to run additional trials without having to re-run the stored trials.