Adding Harness Support

This guide is for developers building AI clients, IDEs, agent frameworks, or other tools who want to support the Agent Harnesses standard.

What clients must implement

A compliant client must support the four-phase loading model:

Load — at session start, inject the full HARNESS.md body into the agent’s context
Discovery — expose tools so the agent can read routing files and content descriptions to find what is relevant to a task
Activation — expose tools so the agent can read the full content of a skill, document, or routing file when it matches the task
Execution — expose tools so the agent can run scripts bundled within skills

Clients may implement additional features (caching, search, UI affordances) but these four behaviors are the baseline.

Loading model in detail

Load

At session start, read HARNESS.md frontmatter to get name and description, then inject the full body into the agent’s context:

harness = parse_frontmatter_and_body("HARNESS.md")
present_to_user(harness.name, harness.description)
inject_into_context(harness.body)

HARNESS.md is the agent’s map of the harness — it establishes the agent’s role and tells it what top-level directories are available.

Discovery

When a task arrives, the agent reads routing files to decide which directories are relevant. Expose a tool so the agent can request content on demand rather than having the client inject everything upfront.

Activation

When the agent determines a directory or file is relevant, your client reads the requested content and returns it. A single path-based tool handles the full directory hierarchy:

load_content(path: str) -> str
    If path points to a grouping directory, returns its routing file.
    If path points to a leaf directory, returns its primary file (per .leaf-detectors)
      and lists available scripts.
    If path points to a file, returns the full file content.

Using a single path-based interface keeps the API uniform across flat and deeply nested harnesses and handles arbitrary directory structures without client changes.

Traversal and termination

A harness may contain any number of top-level subdirectories. Each uses a routing file named after the top-level directory in all-caps — TOOLS.md for tools/, DATA.md for data/, and so on. This convention propagates throughout each subtree. The agent navigates progressively: reads HARNESS.md to learn about top-level directories, loads a routing file to learn what is in a branch, then loads individual files only when a task requires them. Two mechanisms signal that a directory should not be traversed further:

.harnessleaf — a file that explicitly marks a directory as a leaf
.leaf-detectors — a file at the harness root declaring keyword patterns; any directory containing the named file is treated as a leaf of the declared type

Your load_content tool should respect these boundaries: return the leaf directory’s primary file or listing rather than recursing into subdirectories.

Reference file types

Files in a harness may be any type — markdown, images, code, data files, or anything else. Your load_content tool should return the raw file content and let the agent handle interpretation. When indexing files for presentation or search, only markdown files (.md) may carry a description frontmatter field. Non-markdown files have no structured metadata.

Script execution

When an agent invokes a script bundled inside a skill, your client is responsible for executing it. The expected interface:

Scripts receive input via command-line arguments or stdin
Scripts write results to stdout
Non-zero exit code indicates failure; stderr contains the error message

Clients should sandbox script execution appropriately for their environment and expose a run_script(path: str, script: str, args: list[str]) -> str tool to the agent.

Validation

Use the harnesses-ref CLI to validate a harness before loading it:

pip install harnesses-ref
harnesses-ref validate ./my-harness

The validator checks structural correctness including HARNESS.md frontmatter, detected skill leaf validity, and the presence of routing files in grouping subdirectories. Clients may run this check at install time and surface errors to the user.

Minimal example

A minimal Python implementation of harness loading:

from pathlib import Path


def load_harness(harness_path: Path) -> dict:
    harness_md = (harness_path / "HARNESS.md").read_text()
    frontmatter, body = parse_frontmatter_and_body(harness_md)
    return {
        "name": frontmatter["name"],
        "description": frontmatter["description"],
        "body": body,
    }


def _load_leaf_detectors(harness_path: Path) -> dict[str, str]:
    detectors: dict[str, str] = {}
    leaf_file = harness_path / ".leaf-detectors"
    if leaf_file.exists():
        for line in leaf_file.read_text().splitlines():
            line = line.strip()
            if not line or line.startswith("#"):
                continue
            if "=" in line:
                leaf_type, rel_path = line.split("=", 1)
                detectors[leaf_type.strip()] = rel_path.strip()
    return detectors


def _routing_file_name(top_level_dir: Path) -> str:
    return top_level_dir.name.upper() + ".md"


def _leaf_type(directory: Path, detectors: dict[str, str]) -> str | None:
    if (directory / ".harnessleaf").exists():
        return "leaf"
    for ltype, rel_path in detectors.items():
        if (directory / rel_path).exists():
            return ltype
    return None


def load_content(harness_path: Path, rel_path: str) -> str:
    target = harness_path / rel_path
    detectors = _load_leaf_detectors(harness_path)

    if target.is_dir():
        ltype = _leaf_type(target, detectors)
        if ltype is not None:
            primary_rel = detectors.get(ltype)
            if primary_rel and (target / primary_rel).exists():
                result = (target / primary_rel).read_text()
                scripts = list((target / "scripts").glob("*")) if (target / "scripts").exists() else []
                if scripts:
                    result += "\n\nAvailable scripts: " + ", ".join(s.name for s in scripts)
                return result
            return "\n".join(f.name for f in sorted(target.iterdir()))

        # Find the top-level ancestor to derive the routing file name
        parts = Path(rel_path).parts
        routing_name = parts[0].upper() + ".md" if parts else "HARNESS.md"
        routing_file = target / routing_name
        if routing_file.exists():
            return routing_file.read_text()

        # Fall back to listing the directory
        return "\n".join(f.name for f in sorted(target.iterdir()))

    return target.read_bytes().decode(errors="replace")

For harness creators

For client implementors

Adding Harness Support

Adding Harness Support

What clients must implement

Loading model in detail

Load

Discovery

Activation

Traversal and termination

Reference file types

Script execution

Validation

Minimal example

​Adding Harness Support

​What clients must implement

​Loading model in detail

​Load

​Discovery

​Activation

​Traversal and termination

​Reference file types

​Script execution

​Validation

​Minimal example

Adding Harness Support

What clients must implement

Loading model in detail

Load

Discovery

Activation

Traversal and termination

Reference file types

Script execution

Validation

Minimal example