An AI agent orchestrator written in Go. Multi-provider, tool-equipped, memory-backed agents that you fully own and understand.
Yantra is a system that lets LLMs (GPT-4, Claude, Gemini) do things — not just chat. It gives them tools to read files, write code, run commands, and fetch data. Then it runs them in a loop: the LLM thinks, calls tools, reads results, thinks again, until the task is done.
Think of it as building your own Claude Code / Cursor agent from scratch.
```
┌─────────────────────────────────────────────┐
│ CLI │
│ cmd/yantra/main.go │
│ yantra init | run | version │
├─────────────────────────────────────────────┤
│ Runtime │
│ agent turn loop + session │
│ stream → think → act → observe → loop │
├──────────────┬──────────────┬───────────────┤
│ Provider │ Tools │ Memory │
│ Layer │ System │ SQLite + │
│ │ │ Hybrid Search│
│ OpenAI │ read_file │ store/recall │
│ Anthropic │ write_file │ vector + FTS │
│ Gemini │ list_files │ embeddings │
│ │ shell_exec │ summaries │
│ │ web_fetch │ sessions │
├──────────────┴──────────────┴───────────────┤
│ Types │
│ interfaces, contracts, config │
└─────────────────────────────────────────────┘
```
```bash
# Build
go build ./...

# Generate default config
go run ./cmd/yantra init

# Edit yantra.toml — set your API key
$EDITOR yantra.toml

# Set your provider API key
export OPENAI_API_KEY=sk-...
# Or for Anthropic:
export ANTHROPIC_API_KEY=sk-ant-...

# Run the agent
go run ./cmd/yantra run "What is 2+2? Answer briefly."

# Run with a custom system prompt and workspace
go run ./cmd/yantra run --system "You are a Go expert" --workspace ./myproject "add tests for main.go"
```

```
cmd/yantra/              CLI entry point (init, run, version, start, serve, tui)
internal/
  types/                 Shared interfaces and data types
    config.go            Configuration structs + defaults
    config_loader.go     TOML + env var config loading
    tool.go              Tool interface, SafetyTier, FunctionDecl
    provider.go          Provider interface, ProviderType
    message.go           Message, ToolCall, streaming types
    errors.go            Typed errors (ToolError, ProviderError, etc.)
    memory.go            Memory interface, embedding, retrieval
    session.go           Session store interface
    channel.go           WebSocket frame protocol
    delegation.go        Multi-agent delegation types
  provider/              LLM provider implementations
    provider.go          Factory (Build, BuildFromConfig)
    openai.go            OpenAI Chat Completions
    anthropic.go         Anthropic Messages API
    gemini.go            Google Gemini GenerateContent
    reliable.go          Retry wrapper with exponential backoff
  runtime/               Agent turn loop
    session.go           In-memory conversation buffer + compaction
    runtime.go           AgentRuntime, Run(), stream accumulation, tool dispatch
  tool/                  Tool system
    schema.go            JSON Schema builder helpers
    security.go          SecurityPolicy + WorkspacePolicy
    registry.go          ToolRegistry (register, lookup, execute)
    read_file.go         read_file tool
    write_file.go        write_file tool
    list_files.go        list_files tool
    shell_exec.go        shell_exec tool
    web_fetch.go         web_fetch tool
    memory_save.go       memory_save tool
    memory_search.go     memory_search tool
    builtin.go           RegisterBuiltins() convenience
  memory/                Persistent memory system
    sqlite.go            SQLite database wrapper + schema migrations
    store.go             Memory store (CRUD + hybrid retrieval)
    retrieval.go         Vector search, FTS search, RRF fusion
    session_store.go     Session lifecycle management
    embedding.go         Embedding backend factory
    embedding_openai.go  OpenAI embedding integration
```
Yantra uses layered configuration (highest priority wins):
- Built-in defaults
- Config file (`yantra.toml`, auto-discovered)
- Environment variables (`YANTRA__` prefix)

Run `yantra init` to generate a starter `yantra.toml`.
| Provider | SDK | Default context |
|---|---|---|
| OpenAI | `openai-go/v3` | 128,000 |
| Anthropic | `anthropic-sdk-go` | 200,000 |
| Gemini | `google.golang.org/genai` | 1,000,000 |
All providers implement the same Provider interface. Swap between them with one config change. The ReliableProvider wrapper adds automatic retries with exponential backoff for transient failures.
| Tool | Safety tier | Timeout | What it does |
|---|---|---|---|
| `read_file` | ReadOnly | 10s | Read file with line numbers, offset, limit |
| `write_file` | SideEffecting | 10s | Write/append to file, auto-creates dirs |
| `list_files` | ReadOnly | 10s | List directory, optional recursive |
| `shell_exec` | Privileged | 60s | Run shell command, capture stdout/stderr |
| `web_fetch` | SideEffecting | 30s | HTTP GET/POST, return status + body |
| `memory_save` | SideEffecting | 15s | Save knowledge to persistent memory |
| `memory_search` | ReadOnly | 15s | Search memory with hybrid retrieval |
All tool execution goes through a SecurityPolicy:
- Path containment: file tools can only access files inside the workspace directory
- Shell allowlist: ~40 common commands permitted by default (git, go, node, python, curl, etc.)
- Shell denylist: dangerous commands blocked (sudo, rm, mkfs, shutdown, etc.)
- Operator blocking: `|`, `&&`, `||`, `;`, `>` blocked by default (configurable)
- Deny always overrides allow
The runtime is the core agent loop that ties providers and tools together:
- User message is added to an in-memory session
- Session context (system prompt + messages + tool schemas) is streamed to the provider
- Response is accumulated, including fragmented tool call deltas
- If the LLM returns tool calls, they're dispatched respecting safety tiers:
- ReadOnly tools in a contiguous block run in parallel
- SideEffecting/Privileged tools run sequentially at their original position
- Model-provided tool call order is preserved (e.g., `write_file` before `read_file`)
- Tool results are appended to the session, and the loop repeats
- When the LLM responds with text only (no tool calls), the loop ends
The turn timeout covers both provider streaming and tool execution as a single budget. Ctrl-C (SIGINT/SIGTERM) propagates cleanly into the runtime via context cancellation.
When the conversation approaches the context limit, the runtime triggers rolling summarization — it compacts older messages into a summary and keeps recent turns, so the agent can maintain context across long sessions.
Persistent memory backed by SQLite (pure Go, no CGO, via `modernc.org/sqlite`).
Hybrid retrieval combines two search strategies:
- Vector search — cosine similarity over OpenAI embeddings (semantic meaning)
- Full-text search — SQLite FTS5 with BM25 ranking (exact keywords)
- Results are merged using Reciprocal Rank Fusion with configurable weights (default 0.7 vector / 0.3 FTS)
Graceful degradation — if no OPENAI_API_KEY is set, the system falls back to FTS-only search. Memory is optional; the agent works without it.
What gets persisted:
- Memory chunks (knowledge stored by the agent or user)
- Conversation events (every message in the turn loop)
- Rolling summaries (context compaction across sessions)
- Session scratchpad (key-value state per session)
```bash
go test ./... -race -count=1
```

See docs/ for detailed architecture documentation.
Oxydra — the same architecture in Rust. Yantra is a Go rewrite for simplicity and learning.
MIT