halter

halter is a simple and configurable agent harness and SDK for building and operating thoroughbred agents.

Caution

halter is still a heavy work in progress. Proceed at your own risk.

Design Goals

Cache friendliness
Obsessive token optimization
Best in class multi model support
Best in class tool calling and hook support
Simple and expressive API

What halter gives you

At a high level, halter combines:

a typed protocol for sessions, turns, events, resources, and tool calls
a configurable runtime for prompt assembly, context management, provider execution, hooks, and subagents
a built-in tool harness for reading, editing, writing, shell execution, process control, and delegated work
resource loading for repo-local skills and plugins
policy enforcement around filesystem writes, shell usage, tool output size, and subagent fanout
session persistence with memory and SQLite backends

Quickstart

Halter is designed to be plug and play in existing Rust code and services. The goal of the halter SDK is to abstract away the details of a harness, however there is still some small boilerplate:

loading config
compiling resources
injecting custom tools or hooks
selecting persistence strategy
consuming session events programmatically

Basic example with config file

use futures::StreamExt;
use halter::prelude::*;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let harness = Halter::from_config_file("halter.toml").await?;
    let session = harness.new_session(SessionInit::default()).await?;

    let mut events = session
        .submit_turn(Turn::user("Summarize the session persistence design"))
        .await?;

    while let Some(event) = events.next().await {
        let event = event?;
        println!("{:?}", event.payload);
    }

    Ok(())
}

This code does all of the following:

loads and validates config
compiles resources
builds providers, tools, hooks, policy, and session storage
creates a runtime
creates a session
executes one turn and streams the resulting events

Detailed events

use futures::StreamExt;
use halter::prelude::*;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let harness = Halter::from_config_file("halter.toml").await?;

    let session = harness
        .new_session(SessionInit {
            working_dir: std::env::current_dir()?,
            ..SessionInit::default()
        })
        .await?;

    let mut stream = session.submit_turn(Turn::user("List the major crates in this repo")).await?;
    while let Some(event) = stream.next().await {
        let event = event?;
        match event.payload {
            SessionEventPayload::DeltaItem { delta } => print!("{}", delta.text),
            SessionEventPayload::TurnCompleted { usage, .. } => {
                println!("\nusage: in={} out={}", usage.input_tokens, usage.output_tokens);
            }
            SessionEventPayload::TurnFailed { error, .. } => {
                eprintln!("turn failed: {error}");
            }
            _ => {}
        }
    }

    Ok(())
}

Simple programmatic config with snapshot

use halter::prelude::*;
use halter_config::{
    ConfiguredProvider, ModelConfig, ModelsConfig, ProviderConfig, ProvidersConfig,
};
use halter_protocol::ReasoningEffort;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let config = HarnessConfig {
        providers: ProvidersConfig {
            openai: Some(ProviderConfig {
                api_key: Some(std::env::var("OPENAI_API_KEY")?),
                ..ProviderConfig::default()
            }),
            ..ProvidersConfig::default()
        },
        models: ModelsConfig {
            default: Some(ModelConfig {
                provider: ConfiguredProvider::OpenAi,
                model: "gpt-5".to_owned(),
                max_input_tokens: Some(200_000),
                max_output_tokens: Some(8_192),
                reasoning: Some(ReasoningEffort::Medium),
                tokens_per_minute: Some(500_000),
            }),
            ..ModelsConfig::default()
        },
        ..HarnessConfig::default()
    };
    let snapshot = ResourceSnapshot::empty();

    let harness = Halter::from_config(config, snapshot).await?;
    let _session = harness.new_session(SessionInit::default()).await?;
    Ok(())
}

Build from config + compiled resources

use halter::{HalterBuilder, ResourceCompiler};
use halter_config::load_path;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let config = load_path("halter.toml").await?;
    let resources = ResourceCompiler::from_config(&config).compile().await?;

    let harness = HalterBuilder::new()
        .with_config(config)
        .with_compiled_resources(resources)
        .build()
        .await?;

    println!("default model = {:?}", harness.config().default_model()?.model);
    Ok(())
}

Add a custom tool

use std::sync::Arc;

use async_trait::async_trait;
use halter::HalterBuilder;
use halter_config::load_path;
use halter_protocol::{ToolCapabilities, ToolConcurrency, ToolName, ToolResult, ToolSpec};
use halter_tools::{Tool, ToolContext};
use serde_json::{json, Value};

#[derive(Debug)]
struct EchoTool;

#[async_trait]
impl Tool for EchoTool {
    fn spec(&self) -> ToolSpec {
        ToolSpec {
            name: ToolName::from("echo_json"),
            description: "Return the input JSON unchanged".to_owned(),
            input_schema: json!({
                "type": "object",
                "additionalProperties": true
            }),
            concurrency: ToolConcurrency::ParallelSafe,
            capabilities: ToolCapabilities::default(),
            provider_aliases: Default::default(),
        }
    }

    async fn execute(&self, _context: ToolContext, input: Value) -> anyhow::Result<ToolResult> {
        Ok(ToolResult::Json { value: input })
    }
}

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let config = load_path("halter.toml").await?;
    let resources = halter::ResourceCompiler::from_config(&config).compile().await?;

    let _halter = HalterBuilder::new()
        .with_config(config)
        .with_compiled_resources(resources)
        .with_tool(Arc::new(EchoTool))
        .build()
        .await?;

    Ok(())
}

Provide your own session store

use std::sync::Arc;

use halter::{HalterBuilder, ResourceCompiler};
use halter_config::load_path;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let config = load_path("halter.toml").await?;
    let resources = ResourceCompiler::from_config(&config).compile().await?;
    let store = Arc::new(MyFancySessionStore::default());

    let _halter = HalterBuilder::new()
        .with_config(config)
        .with_compiled_resources(resources)
        .with_session_store(store)
        .build()
        .await?;

    Ok(())
}

Default prompts and the built-in coding agent

halter ships three built-in prompts, all available to SDK users:

a general-purpose system prompt (the default for every session),
a batteries-included coding-agent prompt — a quick on-ramp for a working coding agent, and
the conversation-compaction instructions used by the default context manager.

The two system prompts are assembled from a shared behavioral core plus a small role-specific intro, so common guidance (acting autonomously in headless runs, preferring the provided tools, honesty, safety) is defined in exactly one place.

Read or compose them through the halter::prompts module, and seed a session with one via SessionInit::with_system_prompt. Use SessionInit::append_system_prompt to add stable house rules without restating the base prompt:

use halter::prelude::*;
use halter::prompts;

// Spin up a coding agent in one line:
let session = harness
    .new_session(
        SessionInit::default()
            .with_system_prompt(prompts::default_coding_agent_prompt())
            .append_system_prompt("## House rules\n- Never touch files under vendor/."),
    )
    .await?;

Or select a built-in from config — prompts.preset picks the built-in prompt, prompts.system_prompt overrides it entirely, and prompts.append_system_prompt adds static instructions after whichever base wins:

[prompts]
preset = "coding"            # "general" (default) | "coding"
# system_prompt = "..."      # full override; wins over preset
append_system_prompt = """
## House rules
- Conventional Commits for every commit.
- Never touch files under vendor/.
"""

Base system-prompt precedence, most specific first: an explicit per-session prompt (SessionInit::with_system_prompt) > prompts.system_prompt > prompts.preset > the built-in general-purpose default. Appends are additive: config append applies after the resolved base, then per-session append_system_prompt calls apply after that. Appended text stays in the static, prefix-cacheable System block before skills.

Crates

halter — high-level SDK and builder
halter-config — config schema, loading, overrides, validation
halter-protocol — shared types and wire-format vocabulary
halter-runtime — session engine, prompt assembly, event bus, compaction, subagents
halter-providers — provider adapters and model registry
halter-tools — tool runtime, built-in tools, policy, subagent control tools
halter-hooks — event-driven hook and policy interception layer
halter-session — session persistence and replay

halter-config

halter-config defines the schema for:

providers
model roles (default, small, subagent), with default and subagent optionally backed by a "model_judge" panel via [models.model_judge]
resource roots
prompts (a built-in preset plus a full system_prompt override)
context compaction settings
tool enablement
policy
network access
session persistence
runtime settings

It also handles:

file loading
environment overrides
layered merges
JSON Schema export
starter config generation

halter provides convenience functionality for consuming .toml config files where that makes sense operationally.

Note

.toml config file usage is a thin serialization veneer over the programmatic config. For full customization programmatic configuration should be used, and probably preferred in headless, automated, or dynamic environments.

Credentials and overrides

When using config files, halter resolves the effective config first, then resolves provider credentials.

Config value hierarchy:

Built-in defaults.
Layered config files, when using load_layered: user config, then project config, then the explicit config path. Later files replace earlier values, except skill and plugin root arrays append and dedupe.
Supported HALTER_* environment overrides, such as HALTER_TOOLS_ENABLED or HALTER_POLICY_SHELL_ALLOW. These are applied after file loading, so they win over file values.

Provider credential hierarchy:

[providers.<name>].api_key in the effective config wins when it is present and non-empty.
For OpenAI only, [providers.openai].oauth may be configured instead of api_key. It must include client_id, access_token, id_token, and refresh_token; the provider sends access_token as the bearer token. OAuth traffic for /v1/responses, every path below that prefix, and /chat/completions rewrites to https://chatgpt.com/backend-api/codex/responses, so base_url is ignored for those paths. OAuth requests send the assembled system prompt as top-level instructions, omit the developer/system input item, and set store to false.
If no explicit credential is configured, halter reads the provider-specific process environment variable: OPENAI_API_KEY, ANTHROPIC_API_KEY, or OPENROUTER_API_KEY.
If no credential source is available for a selected provider, config loading fails.

OpenAI api_key and oauth config are mutually exclusive. If both are present in [providers.openai], validation fails.

OAuth config uses this shape:

[providers.openai.oauth]
client_id = "..."
access_token = "..."
id_token = "..."
refresh_token = "..."

Halter reads process environment variables. It does not parse .env files directly; if you use a .env file, load it into the process environment before starting the CLI or SDK process.

Disk resource parsing

ResourceCompiler::from_config(&config).compile() loads disk resources from [resources.skills].roots and [resources.plugins].roots. Halter::from_config_file(...) uses this path automatically.

Resource root paths expand a leading ~/ through $HOME. Other shell-style substitutions, such as $VAR, ${VAR}, and ~user, are not expanded. Missing resource roots are skipped.

Standalone skills are discovered recursively under each skill root:

Each directory is scanned for child directories.
A child directory containing SKILL.md is loaded as one skill.
If a child directory does not contain SKILL.md, scanning continues inside it.

SKILL.md frontmatter is intentionally small. If the file starts with ---, halter reads key: value lines until the next ---. The loader currently uses name and description; missing name falls back to the directory name, and missing description becomes an empty string. The compiled snapshot stores the skill id, name, description, and full SKILL.md body. Files under a skill's immediate scripts/ directory are recorded on the loaded skill, but arbitrary supporting files are not loaded into the compiled snapshot today.

Note

Halter attempts to replicate Codex and Claude Code behavior when parsing plugins for all supported functionality. Right now this is a moving target, so minor bugs and differences may manifest. File an issue or PR to fix any inconsistencies.

Plugins are discovered one level below each plugin root. Each child directory is treated as a plugin only if it contains a manifest at one of these paths, checked in order:

.codex-plugin/plugin.json
.claude-plugin/plugin.json
.agent-plugin/plugin.json
plugin.json

The manifest must include non-empty string fields name and version. Optional manifest fields include skills, agents, hooks, mcpServers, lspServers, allowedHttpHosts, and allowedEnvVars. skills and agents may be a string path or an array of string paths. Skill and agent entries must be paths relative to the plugin root and must start with ./, unless they use a plugin alias. Supported aliases are ${CLAUDE_PLUGIN_ROOT}, ${PLUGIN_ROOT}, ${CLAUDE_PLUGIN_DATA}, and ${PLUGIN_DATA}. ${PLUGIN_ROOT} resolves to the enclosing plugin root (or the skill's own root for standalone skills), and ${PLUGIN_DATA} resolves to <root>/.data. These aliases are also rendered inside skill (SKILL.md) bodies and agent prompt files at load time. Parent-directory traversal is rejected, and resolved paths must stay inside the plugin root.

Plugin skills entries can point at a single skill directory containing SKILL.md or at a directory tree that should be searched recursively for skills. Plugin agents entries can point at one prompt file or a directory of prompt files; each file becomes an agent named after its file stem. Hooks are loaded from the manifest's hooks path, or from hooks/hooks.json when the manifest omits hooks. Hook parse failures are retained as hook warnings instead of aborting the whole resource compile.

Compiled resources use stable identifiers. Skill ids are based on the canonical skill path, plugin ids are based on plugin name, version, and canonical plugin path, and the final resource snapshot revision is derived from loaded skill revisions, rendered agent prompt revisions, plugin name/version pairs, and hook revisions. Skill bodies and agent prompts are rendered before their revisions are computed, so changing an alias expansion (e.g., loading the same plugin through a different absolute root) produces a different snapshot revision.

Bundled agent plugins

The repository ships a bundled plugin set under halter-agent-plugins/. Today it contains one plugin, halter-rust (0.1.0), described as "Rust engineering guidance for Halter." That plugin carries two skills: basic-rust and workflows-rust.

The two bundled skills cover different layers of using the halter SDK:

basic-rust — use when building, explaining, testing, or modifying Rust agent harnesses with the Halter library. It covers config-driven runners, the SDK builder, custom tools, hooks, policy, persistence, providers, subagents, and workspace Rust changes.
workflows-rust — use when building scripted, deterministic multi-agent workflows in Rust on top of the Halter SDK, such as fan-out/fan-in, staged pipelines, adversarial verification, judge panels, loop-until-budget loops, structured agent output, concurrency control, and resumable orchestration. It pairs with basic-rust, which covers harness construction.

To load the bundled plugin, add its plugin directory to your plugin roots. For example, with a config file:

[resources.plugins]
roots = ["./halter-agent-plugins/plugins"]

Remote GitHub plugins

The halter-config crate has an opt-in remote-plugins feature for SDKs that want to fetch GitHub-hosted plugins without installing them to a local cache. These plugins are downloaded and made available to the harness dynamically and purely in memory. At the moment only skills and agents are supported, commands and hooks are on the roadmap but are not currently planned. It returns the same LoadedPlugin values that ResourceCompiler::with_loaded_plugins(...) already accepts:

use halter::HalterBuilder;
use halter_config::github::{GithubPlugins, load_plugins};

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let plugins = load_plugins([
        GithubPlugins::plugin("https://github.com/acme/rust-tools")?,
        GithubPlugins::marketplace("https://github.com/acme/agent-marketplace")?
            .only(["pairs-review", "sql-guard"]),
    ])
    .await?;

    let harness = HalterBuilder::new()
        .with_loaded_plugins(plugins)
        .build()
        .await?;

    Ok(())
}

Remote marketplace loading checks .agents/plugins/marketplace.json first and falls back to .claude-plugin/marketplace.json. Remote plugin directories check .codex-plugin/plugin.json first and falls back to .claude-plugin/plugin.json. If neither manifest exists, loading fails.

Config File Example (non-exhaustive)

version = 1

[models.default]
provider = "openai"
model = "gpt-5"
reasoning = "medium"

[models.small]
provider = "openai"
model = "gpt-5-mini"
reasoning = "low"

[models.subagent]
provider = "openai"
model = "gpt-5-mini"
reasoning = "medium"

[prompts]
preset = "general"   # "general" (default) | "coding"; system_prompt overrides either
# append_system_prompt = "stable extra instructions appended after the base prompt"

[resources.skills]
roots = ["./.agent/skills"]

[resources.plugins]
roots = ["./.agent/plugins"]

[tools]
enabled = [
  "read",
  "glob",
  "grep",
  "write",
  "edit",
  "shell",
  "process",
  "task",
  "wait_agent",
  "spawn_agent",
  "send_input",
  "close_agent",
]

[context]
compaction_threshold = 200_000
pre_compaction_target = 150_000
prune_signal_threshold = "low"

[policy]
allowed_write_roots = ["./", "/tmp/halter"]
max_read_bytes = 1048576
max_subagent_depth = 3
max_concurrent_subagents = 8

[policy.network]
enabled = false
allowed_hosts = []

[policy.shell]
enabled = true
allow = ["git", "cargo", "rg", "ls", "find", "true", "cd", "python", "pwd", "cwd", "echo"]
timeout_secs = 30

[sessions]
backend = "memory"

[runtime]
# Optional. When set, halter writes a `<session_id>.txt` JSONL trace per session
# into this directory: one header line followed by every committed SessionEvent.
# Useful for offline debugging and replay tooling.
# traces_dir = "/tmp/halter/traces"

# Optional. Keep off unless the caller wants the parent turn stream to include
# raw events from subagents spawned under that parent.
# subagent_event_forwarding = "off"
# subagent_event_forwarding = "all"
# subagent_event_forwarding_cap = 100_000 # 0 = unbounded

Model Judge

The models.default and models.subagent slots accept either an inline model (the form above) or the string "model_judge", which references a shared [models.model_judge] block. A model-judge slot deliberates over a panel of models, judges their work with a synthesis model, and hands the result to the default model as advisory guidance — the default model always owns the visible answer and all real tool execution.

The mode field picks how deeply each panelist deliberates:

mode = "one_shot" (default). Each panelist answers with a single message (one inference, no tools), and the whole cycle runs on every model call within a turn:

The context and user message are multiplexed to every model in panel in parallel (each receives the full context and the tool specs). A constant framing prefix is prepended as the first user message so each panelist answers as one advisory voice on a judged panel — comparable prose about what it would do and why, rather than executing or replying. The prefix is added only on the panel path, so it never reaches the synthesis or default model.
The synthesis model is given the panel responses and a rank_responses tool. It first stack-ranks the responses (emitted as structured tracing telemetry), then writes a synthesis that judges, not merges, them (strengths, weaknesses, pros, cons, overall).
The synthesis is handed to the default model as internal guidance for the same provider call; the guidance is not persisted to the transcript.

Cheap and fast — a fresh second opinion at each step. Expect N + 2 model calls per iteration.

mode = "full_turn". Each panelist runs a complete agentic turn (inference + tool loop to completion) on the user's message, once per turn. The synthesis model judges the outcomes of those turns (using the same rank_responses + judge flow), and the synthesis is injected as advisory guidance on the default model's opening inference of the turn. The default model then runs its own full turn, owning the real, user-visible execution. Heavier, but each panelist actually investigates the task rather than only describing what it would do. Panelists are advisory — their work is scratch exploration that feeds the judge, not the shipped result.

panel_isolation controls how FullTurn panelist sub-sessions are sandboxed (ignored under one_shot):

read_only (default) — panelists share the working directory but get a tool set with every mutating tool (write/edit/shell/process/task) filtered out. Safe under concurrency.
shared_full — panelists share the working directory with the full tool set. Maximum fidelity, but concurrent panelists can clobber each other's writes.
worktree — each panelist runs in its own git worktree with the full tool set, so it can mutate freely without colliding. Requires a git repository; falls back to shared_full (with a warning) otherwise.

[models]
default = "model_judge"
subagent = "model_judge"   # optional; subagents can fan out too

[models.model_judge]
mode = "one_shot"            # "one_shot" (default) | "full_turn"
panel_isolation = "read_only"  # full_turn only: "read_only" (default) | "shared_full" | "worktree"

[models.model_judge.default]
provider = "anthropic"
model = "claude-opus-4-8"
reasoning = "high"

[models.model_judge.synthesis]
provider = "openai"
model = "gpt-5"
reasoning = "high"

[[models.model_judge.panel]]
provider = "openai"
model = "gpt-5"

[[models.model_judge.panel]]
provider = "openrouter"
model = "z-ai/glm-5.1"

[[models.model_judge.panel]]
provider = "anthropic"
model = "claude-sonnet-4-6"

Rankings, panel responses/outcomes, and the synthesis message are all logged via tracing (target halter::model_judge); set RUST_LOG=halter::model_judge=info to capture the quality telemetry.

Programmatic Example (non-exhaustive)

use std::path::PathBuf;
use std::sync::Arc;

use halter::session::InMemorySessionStore;
use halter::{HalterBuilder, LoadedSkill};
use halter_config::{
    ConfiguredProvider, ContextConfig, HarnessConfig, ModelConfig, ModelsConfig,
    NetworkPolicyConfig, PolicyConfig, PromptsConfig, ProviderConfig, ProvidersConfig,
    ResourcesConfig, RuntimeConfig, SearchRoots, SessionBackend, SessionsConfig,
    ShellPolicyConfig, ToolsConfig,
};
use halter_protocol::{PruneSignalThreshold, ReasoningEffort, SkillId, Turn};
use halter_runtime::SessionInit;

const SYSTEM_PROMPT: &str =
    "You are a careful local coding agent. Prefer concrete, verifiable answers.";
const REPO_REVIEW_SKILL: &str = r#"When asked to review a codebase:
1. Start with correctness risks.
2. Then call out maintainability issues.
3. End with the smallest high-leverage next steps.
"#;

fn build_config() -> anyhow::Result<HarnessConfig> {
    let working_dir = std::env::current_dir()?;
    let temp_write_root = std::env::temp_dir().join("halter");

    Ok(HarnessConfig {
        version: 1,
        providers: ProvidersConfig {
            openai: Some(ProviderConfig {
                base_url: Some("https://api.openai.com".to_owned()),
                api_key: Some(std::env::var("OPENAI_API_KEY")?),
                ..ProviderConfig::default()
            }),
            anthropic: None,
            openrouter: None,
        },
        models: ModelsConfig {
            default: Some(ModelConfig {
                provider: ConfiguredProvider::OpenAi,
                model: "gpt-5".to_owned(),
                max_input_tokens: Some(200_000),
                max_output_tokens: Some(8_192),
                reasoning: Some(ReasoningEffort::High),
                tokens_per_minute: Some(500_000),
            }),
            small: Some(ModelConfig {
                provider: ConfiguredProvider::OpenAi,
                model: "gpt-5-mini".to_owned(),
                max_input_tokens: Some(200_000),
                max_output_tokens: Some(4_096),
                reasoning: Some(ReasoningEffort::Low),
                tokens_per_minute: Some(1_000_000),
            }),
            subagent: Some(ModelConfig {
                provider: ConfiguredProvider::OpenAi,
                model: "gpt-5-mini".to_owned(),
                max_input_tokens: Some(200_000),
                max_output_tokens: Some(4_096),
                reasoning: Some(ReasoningEffort::Medium),
                tokens_per_minute: Some(750_000),
            }),
        },
        resources: ResourcesConfig {
            skills: SearchRoots { roots: Vec::new() },
            plugins: SearchRoots { roots: Vec::new() },
        },
        prompts: PromptsConfig {
            system_prompt: Some(SYSTEM_PROMPT.to_owned()),
            ..PromptsConfig::default()
        },
        context: ContextConfig {
            compaction_threshold: 200_000,
            pre_compaction_target: 150_000,
            prune_signal_threshold: PruneSignalThreshold::Low,
        },
        tools: ToolsConfig {
            enabled: vec![
                "read".to_owned(),
                "glob".to_owned(),
                "grep".to_owned(),
                "write".to_owned(),
                "edit".to_owned(),
                "shell".to_owned(),
                "process".to_owned(),
                "spawn_agent".to_owned(),
                "send_input".to_owned(),
                "wait_agent".to_owned(),
                "close_agent".to_owned(),
            ],
        },
        policy: PolicyConfig {
            allowed_write_roots: vec![working_dir.clone(), temp_write_root],
            max_read_bytes: 1_048_576,
            max_subagent_depth: 3,
            max_concurrent_subagents: 8,
            shell: ShellPolicyConfig {
                enabled: true,
                allow: vec![
                    "git".to_owned(),
                    "cargo".to_owned(),
                    "rg".to_owned(),
                    "ls".to_owned(),
                    "find".to_owned(),
                    "python".to_owned(),
                    "pwd".to_owned(),
                    "echo".to_owned(),
                ],
                timeout_secs: 30,
            },
            network: NetworkPolicyConfig {
                enabled: false,
                allowed_hosts: Vec::new(),
                allowed_loopback: Vec::new(),
            },
        },
        sessions: SessionsConfig {
            backend: SessionBackend::Memory,
            sqlite_path: None,
        },
        runtime: RuntimeConfig {
            working_dir: Some(working_dir),
            ..RuntimeConfig::default()
        },
    })
}

fn inline_skills() -> Vec<LoadedSkill> {
    vec![LoadedSkill {
        id: SkillId::from("repo-review"),
        name: "repo-review".to_owned(),
        description: "Review a repository for correctness, maintainability, and next steps."
            .to_owned(),
        root: PathBuf::from("inline-skills/repo-review"),
        body: REPO_REVIEW_SKILL.to_owned(),
        supporting_files: Vec::new(),
        scripts: Vec::new(),
        revision: "repo-review-v1".to_owned(),
    }]
}

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let harness = HalterBuilder::new()
        .with_config(build_config()?)
        .with_loaded_skills(inline_skills())
        .with_session_store(Arc::new(InMemorySessionStore::default()))
        .build()
        .await?;

    let session = harness.new_session(SessionInit::default()).await?;
    let _events = session
        .submit_turn(Turn::user("Describe the active runtime and available skills"))
        .await?;

    Ok(())
}

To use OpenAI OAuth in programmatic config, set oauth instead of api_key:

use halter_config::{OpenAiOAuthConfig, ProviderConfig};

ProviderConfig {
    oauth: Some(OpenAiOAuthConfig {
        client_id,
        access_token,
        id_token,
        refresh_token,
    }),
    ..ProviderConfig::default()
}

halter-protocol

halter-protocol defines the shared vocabulary used by the rest of the workspace.

That includes types for:

turns
messages
session events
tool calls and tool results
resources and compiled artifacts
provider-facing request/response chunks

If you are building integrations or parsing structured output, this crate matters a lot.

halter-providers

halter-providers adapts concrete model backends into halter's normalized provider interface.

Built-in providers include:

OpenAI
Anthropic
OpenRouter
Fake/test provider
Unsupported placeholder for builds where a transport is not wired in

Important operational differences:

OpenAI supports streaming, prompt caching, and dedicated Responses compaction
OpenRouter supports streaming, prompt caching, and inline Responses compaction
Anthropic supports streaming, prompt caching, interleaved thinking, and inline Messages compaction
capability differences are explicit and should be handled intentionally

halter-tools

Note

Many of the original ideas (and code) in this crate are taken from other FOSS projects, namely pi-mono and oh-my-pi's native Rust tool.

Built-in tools include:

read
glob
grep
write
edit
shell
process
task (in-memory todo list scoped to the session)

Optional feature-gated tools include:

pty
ast_grep
image
browser
profile

Subagent tools include:

spawn_agent
send_input
wait_agent
close_agent

This crate also enforces policy boundaries such as:

shell allowlisting
write-root restrictions
network host and loopback restrictions
read/output size limits
subagent depth and concurrency limits

halter-hooks

halter-hooks lets you observe and influence runtime behavior by reacting to lifecycle events.

Hooks can:

approve or block actions
request or deny permissions
add system messages
attach additional context
rewrite inputs and outputs
suppress output visibility
stop execution

Example

use halter::HalterBuilder;
use halter_hooks::{Hook, HookEventName, HookResponse, RegisteredHookPriority};
use halter_protocol::PluginId;

let hook = Hook::callback(HookEventName::PreToolUse, |input| async move {
    if input.tool_name() == Some("shell") {
        Ok(HookResponse::passthrough().with_system_message(
            "Shell usage was requested; verify the command is minimal and necessary.",
        ))
    } else {
        Ok(HookResponse::passthrough())
    }
});

let _builder = HalterBuilder::new()
    .with_plugin_hook_priority(
        PluginId::from("example-tool-id"),
        RegisteredHookPriority::BeforePlugins,
        hook,
    );

halter-session

halter-session provides persistence and replay.

Built-in backends:

InMemorySessionStore
SqliteSessionStore (behind the sqlite feature)

Use memory for:

tests
ephemeral local runs
simplest setup

Use SQLite for:

resumable local agents
durable transcripts
replay after process restart

halter-runtime

halter-runtime executes sessions.

It owns:

session lifecycle
prompt assembly
context management and compaction
event publication
hook dispatch
tool execution orchestration
subagent lineage and coordination
session replay/resume

The public session handle is SessionHandle; HalterSession remains a backwards-compatible alias.

Note

halter implements its own compaction strategy. This can be less token efficient than managed compaction from inference providers or frontier harnesses. The goal is a higher-quality context window, which can reduce overall token use throughout the turn and gives halter a consistent baseline across providers and models.

Security model

Halter does its best to operate in a sane and safe way, but does not provide hard security boundaries. Run sensitive workloads in fully sandboxed environments with defense in depth beyond process-level safeguards.

Tool boundaries

Enforced mechanically, best effort, by tool policy:

where writes may occur
which shell programs may run through the shell parser and allowlist
which network hosts may be reached; loopback requires a separate allowlist
how much can be read or emitted
how many subagents may be active
how deep delegation may go

Semantic/runtime boundaries

The following can all be implemented in custom hooks:

approvals
denials
stop conditions
input/output rewriting
extra context or warnings
audit annotations

Features

The halter crate keeps optional capabilities out of the default build. No feature is enabled by default. Enable the feature at compile time, then make sure the corresponding tool or session backend is enabled by config and policy.

Feature	What it enables	Extra dependencies	Runtime notes
`advanced-tools`	Enables the advanced `grep` execution path: parallel content searches when possible and `memmap2`-backed reads for larger regular files.	`rayon`, `memmap2`	Applies to the existing `grep` tool. It does not register a new tool name.
`ast-tools`	Adds the syntax-aware `ast_grep` built-in tool for code search and rewrites.	`ast-grep-core`, `ast-grep-language`	Tool name: `ast_grep`. Actions: `find`, `replace`.
`browser-tools`	Adds the `browser` built-in tool for remote browser automation over Chrome DevTools Protocol (CDP).	`playwright-rs`, `reqwest`	Tool name: `browser`. Requires provider configuration, currently `BROWSERBASE_API_KEY` and `BROWSERBASE_PROJECT_ID`, plus Playwright runtime setup. Network policy still applies.
`image-tools`	Adds the `image` built-in tool for local image inspection and transforms.	`image`	Tool name: `image`. Actions: `info`, `resize`, `convert`. File reads and writes remain subject to tool policy.
`pty`	Adds the `pty` built-in tool for bounded interactive terminal sessions.	`portable-pty`	Tool name: `pty`. Actions: `start`, `write`, `resize`, `kill`. Use this when a plain `shell` command is not enough.
`profiling`	Adds the `profile` built-in tool for profiling and instrumentation workflows.	`inferno`	Tool name exposed to the model: `profile`.
`full`	Convenience rollup for the optional built-in tool families.	Same extra dependencies as `advanced-tools`, `ast-tools`, `browser-tools`, `image-tools`, `pty`, and `profiling`.	Does not include `sqlite`; enable `sqlite` separately when persistent session storage is needed.
`sqlite`	Enables SQLite-backed session persistence and the matching config schema.	`rusqlite`	Allows `sessions.backend = "sqlite"` and exposes `halter::session::SqliteSessionStore`. The default backend remains memory unless config selects SQLite.

Environment overrides

The config crate supports a focused set of environment overrides, including:

HALTER_SESSION_BACKEND
HALTER_POLICY_SHELL_ENABLED
HALTER_POLICY_NETWORK_ENABLED
HALTER_SKILL_ROOTS
HALTER_PLUGIN_ROOTS
HALTER_POLICY_SHELL_ALLOW
HALTER_POLICY_ALLOWED_HOSTS
HALTER_TOOLS_ENABLED

These are useful for CI, local overrides, or environment-specific deployment adjustments without duplicating full config files.

Name		Name	Last commit message	Last commit date
Latest commit History 203 Commits
.agents/plugins		.agents/plugins
.claude-plugin		.claude-plugin
.github/workflows		.github/workflows
bin		bin
crates		crates
examples/software-factory		examples/software-factory
halter-agent-plugins/plugins/halter-rust		halter-agent-plugins/plugins/halter-rust
scripts		scripts
vendor		vendor
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
SOUL.md		SOUL.md
clippy.toml		clippy.toml

Folders and files

Latest commit

History

Repository files navigation

halter

Design Goals

What halter gives you

Quickstart

Basic example with config file

Detailed events

Simple programmatic config with snapshot

Build from config + compiled resources

Add a custom tool

Provide your own session store

Default prompts and the built-in coding agent

Crates

halter-config

Credentials and overrides

Disk resource parsing

Bundled agent plugins

Remote GitHub plugins

Config File Example (non-exhaustive)

Model Judge

Programmatic Example (non-exhaustive)

halter-protocol

halter-providers

halter-tools

halter-hooks

Example

halter-session

halter-runtime

Security model

Tool boundaries

Semantic/runtime boundaries

Features

Environment overrides

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages