Skip to content

Installation

Placeholder — fill in as code-release plan firms up

This page should cover environment setup once the code-release path is decided.

Hardware requirements

  • GPU: 1× A100 80GB or H100 (FFT runs require ~38 GB VRAM for Whisper-large-v3 at batch_size=2, grad_accum=16)
  • Disk: ~6 GB per checkpoint × ~300 ckpts = ~2 TB for the full matrix
  • RAM: ≥64 GB recommended (goldfish corpora can be 10 GB)

Python environment

# Python 3.10+
python -m venv .venv && source .venv/bin/activate

# Core deps
pip install \
    torch \
    transformers \
    datasets \
    tokenizers \
    jiwer \
    librosa \
    soundfile \
    regex \
    wandb

External data

Repository checkout

(To fill in once the code is released publicly.)

See also