Installation
Requirements
- Python 3.13 or newer
- uv for environment and dependency management
- Access credentials for judge and synthetic generation APIs (for example, OPENAI_API_KEY)
- Quarto CLI (quarto check) if you plan to build these docs locally
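The requirements above can be checked with a short preflight script. This is a minimal sketch and not part of the harness: the helper names version_ge and check_tool are hypothetical, and the version comparison assumes a sort that supports -V (GNU coreutils or a recent BSD sort). Because uv can provision its own interpreter, the python3 check is informational only.

```shell
#!/usr/bin/env bash
# Preflight sketch (hypothetical helpers, not shipped with the harness).

# version_ge A B: succeed when dotted version A is >= B.
# Relies on `sort -V` (version sort), which is an assumption about your platform.
version_ge() {
  [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# check_tool NAME: report whether NAME is available on PATH.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "ok: $1"
  else
    echo "missing: $1"
  fi
}

check_tool uv
check_tool quarto   # only needed when building the docs locally

# Report the system interpreter version, if one is present.
if command -v python3 >/dev/null 2>&1; then
  py_version="$(python3 -c 'import platform; print(platform.python_version())')"
  if version_ge "$py_version" "3.13"; then
    echo "ok: python $py_version"
  else
    echo "too old: python $py_version (need 3.13 or newer)"
  fi
else
  echo "missing: python3"
fi
```

The script only prints a report and never exits with an error, so it is safe to run before anything is installed.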
Set Up Environment
Clone the repository:
git clone https://github.com/RANDCorporation/judge-reliability-harness.git
cd judge-reliability-harness
Create and sync the project environment with uv (the --native-tls flag may not be required, depending on your security policies):
uv sync --extra dev --native-tls
The command installs the runtime and development dependencies defined in pyproject.toml.
Activate the uv-managed virtual environment:
source .venv/bin/activate
On Windows, use .venv\Scripts\activate in Command Prompt or .\.venv\Scripts\Activate.ps1 in PowerShell.
Configure Credentials
Create a .env file in the project root with API keys and organization IDs required by your judge providers:
cat <<'EOF' > .env
OPENAI_API_KEY=replace_me
OPENAI_ORG_ID=replace_me
EOF
If your environment enforces a private certificate authority, configure trust settings to ensure HTTP clients recognize the SSL chain before running the harness.
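Before running the harness, it can help to confirm that the .env file no longer contains the replace_me placeholders. The function below is a minimal sketch under stated assumptions: check_env_file is a hypothetical helper, not something the harness provides, and it assumes simple KEY=value lines without quoting or export prefixes.

```shell
# check_env_file FILE (hypothetical helper): print "unset: KEY" for every key
# whose value is empty or still the "replace_me" placeholder.
# Returns 0 when all keys look filled in, 1 otherwise.
check_env_file() {
  local file="$1" status=0 key value
  # The `|| [ -n "$key" ]` clause handles a final line with no trailing newline.
  while IFS='=' read -r key value || [ -n "$key" ]; do
    # Skip blank lines and comments.
    case "$key" in
      ''|\#*) continue ;;
    esac
    if [ -z "$value" ] || [ "$value" = "replace_me" ]; then
      echo "unset: $key"
      status=1
    fi
  done < "$file"
  return "$status"
}
```

After sourcing the function, run check_env_file .env from the project root. A non-zero exit status means at least one key still needs a real value.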