Context
Surfaced while landing the multi-tenant rate-limiter changes in cpex-rate-limiter 0.0.4. This issue tracks the follow-up work that emerged during implementation, code review, and end-to-end validation. Items are prioritized by impact on production confidence vs. operator UX vs. nice-to-have.
P0 — Required for production confidence
P1 — Operator UX improvements
P2 — Correctness hardening
P3 — Performance / features
P4 — Exploratory
Context
Surfaced while landing the multi-tenant rate-limiter changes in
cpex-rate-limiter0.0.4. This issue tracks the follow-up work that emerged during implementation, code review, and end-to-end validation. Items are prioritized by impact on production confidence vs. operator UX vs. nice-to-have.P0 — Required for production confidence
rate_limiter_decisions_total{result, dim}, backend-latency histograms, span attributes for active traces. Scope:cpex-plugins.P1 — Operator UX improvements
cpex-plugins+ gateway admin.cpex-plugins.plugins/config.yamland rely on restart/reload. Expose rate configs through the runtime-management API (same pattern as the existing mode-toggle endpoint). Scope: main repo +cpex-plugins."60/hour"vs"60/h") and typo'd algorithm names are silently accepted. JSON-schema validation at config-load time. Scope:cpex-plugins.P2 — Correctness hardening
cpex-plugins.sliding_window/token_bucketRedis paths — Integration tests currently exercise onlyfixed_window; other algorithms are unit-tested only. Scope: main repo integration suite.P3 — Performance / features
MGET/ Lua script for 2-3× throughput on multi-dim configs. Scope:cpex-plugins.cpex-plugins.RateLimiterPluginto support aby_tokensdimension keyed off (a) tool-result payload size, (b)prompts/getcontent, (c) A2A payload. Composes additively with existingby_user/by_tenant/by_toolrequest-based dimensions. Useful for deployments where MCP traffic feeds LLM contexts and the operator wants to cap aggregate token throughput per tenant. Not a replacement for request-based limits — adds a complementary cost-dimension that matches LLM-shaped traffic. Scope:cpex-plugins.cpex-plugins+ pipeline integration.P4 — Exploratory
cpex-plugins.