Make the agents you ship faster, more accurate, and safer.
NeMo Platform brings NVIDIA NeMo libraries together under one CLI, Python SDK, and web UI. Hardening, evaluation, and tuning for the agents you put in production.
Prerequisites: Python 3.11-3.13, uv, and an API key for an inference provider (NVIDIA Build, OpenAI, Anthropic, Google Gemini, or a local Ollama instance). Node.js 22.18.x with pnpm only if you want the web UI.
git clone https://github.com/NVIDIA-NeMo/nemo-platform.git
cd nemo-platform
make bootstrap
source .venv/bin/activate
nemo setupnemo setup starts local services, registers your LLM provider, discovers available models, installs agent skills, and deploys a sample agent (see more below).
See SETUP.md for the full setup playbook (local data dir, DB reset, manual service start, troubleshooting). Coding agents pick the same playbook up automatically via .agents/skills/nemo-setup/SKILL.md.
Verify:
nemo services statusTo permanently reset the database state: rm -rf ~/.local/share/nemo.
Useful CLI commands once setup completes
nemo --help # All commands
nemo models list # Available models
nemo chat <model-name> # Chat directly with a model
nemo services status # Platform health
nemo skills list # Skills installed on the platformEvery capability is also available via REST API. Model inference uses the model IDs returned from nemo models list and is available at:
http://localhost:8080/apis/inference-gateway/v2/workspaces/default/openai/-/v1/chat/completions
To run platform services in the foreground in a separate terminal (instead of the background process nemo setup starts):
nemo services runStudio (web UI) bootstrap troubleshooting
If make bootstrap reports that Studio asset bootstrap did not complete, the API still runs but the web UI is unavailable until the bundle is built. Install Node 22.18.x with pnpm env use --global 22.18.0, then run make bootstrap-studio from the repo root.
Non-interactive setup (for agents, CI, or scripts)
export NVIDIA_API_KEY=nvapi...
export NEMO_DEFAULT_MODEL=nvidia-nemotron-3-super-120b-a12b
nemo setup --auto --start-services --install-skills --deploy-agentAfter installation, launch your coding agent (Claude Code, Codex, Cursor, OpenCode, etc) from inside the nemo-platform directory. This is the primary way of interacting with the NeMo Platform.
Things you can ask it to do, once the platform is running:
- "Scaffold an agent from this spec and deploy it."
- "Run an evaluation against my agent."
- "Add content-safety guardrails to my agent."
- "Help me optimize my agent."
- "Show me what's running on the platform."
- "Shut down NeMo cleanly."
- Secure agents. Guardrails (content safety, jailbreak detection, PII redaction), Auditor (red-teaming via garak), Anonymizer (PII handling for training data).
- Evaluate agents. LLM-as-judge, deterministic, agentic, and RAG benchmarks. Harbor-backed eval suites for regression testing.
- Tune agents. Skill optimization, prompt and hyperparameter tuning, Switchyard model routing.
- Build agents. NVIDIA NeMo Agent Toolkit (NAT) for LangGraph-based agents. Shared infrastructure: Inference Gateway, Secrets, Files, Entity Store, Jobs.
- Generate synthetic data. Generate synthetic data for training or evaluation purposes using Data Designer.
- NeMo Studio (alpha). Installed automatically with the platform. Browser UI for chat, monitoring, and reviewing optimization suggestions. Studio's agent-focused features are still a work in progress; the CLI is the primary surface today.
- Fine-tuning
- Safe Synthesizer (synthetic data with privacy guarantees)
- Broader agent framework support. Today NeMo Platform optimizes LangGraph agents wrapped in NAT. If your agent is in another framework, you need to write the NAT wrapper.
nemo setup detects Claude Code, Cursor, Codex, and OpenCode and installs NeMo skills into your agent of choice, either into the local directory or globally. Platform-level skills live under packages/nemo_platform_ext/src/nemo_platform_ext/skills/ and ship with the nemo-platform package; plugin-owned skills live under plugins/<plugin>/src/<plugin>/skills/.
To install or refresh skills:
nemo skills install --agent claude
nemo skills install --agent claude --skill nemo-build-agent --skill nemo-statusnemo setup --deploy-agent deploys a demo calculator agent you can use to
explore the platform's evaluate / optimize loop.
nemo agents invoke --agent calculator-agent --input "what is 12 * 8?"The calculator-agent package is installed automatically (plugins/nemo-agents/examples/calculator-agent/).
Deploy it manually
```bash nemo agents create --name calculator-agent \ --agent-config plugins/nemo-agents/examples/calculator-agent/src/calculator_agent/calculator-agent.yml nemo agents deploy --agent calculator-agent nemo agents deployments wait --agent calculator-agent ```Evaluate the agent
```bash nemo agents evaluate run \ --eval-config plugins/nemo-agents/examples/calculator-agent/src/calculator_agent/calculator-eval.yml \ --agent calculator-agent ```Optimize the agent
```bash nemo agents optimize run \ --optimize-config plugins/nemo-agents/examples/calculator-agent/src/calculator_agent/calculator-optimize.yml \ --agent calculator-agent ```The demo agent uses ${NEMO_DEFAULT_MODEL} for both execution and the judge LLM. To select different models for either/both, update the yaml config files.
Full documentation: NeMo Platform docs.
- Setup: installation, providers, SDK.
- CLI reference: all commands.
- API reference: REST endpoints.
See CONTRIBUTING.md for development workflow. See TESTING.md for testing strategy.
NeMo Platform is licensed under the Apache License 2.0. Third-party open-source dependencies have their own licenses; review them before use.