feat!: balatrobot v2 by S1M0N38 · Pull Request #155 · coder/balatrobot

S1M0N38 · 2026-02-24T11:10:19Z

Copilot

Pull request overview

This PR introduces version 2 of the balatrobot API with breaking changes (!). The main focus is on restructuring how tags are represented in the game state and improving error messages across all endpoints to be more actionable and helpful.

Changes:

Restructured tag representation from flat tag_name/tag_effect fields to nested tag objects with key, name, and effect fields
Added tags array to gamestate for tracking accumulated player-owned tags
Enhanced error messages across all endpoints with actionable guidance (e.g., suggesting reroll, sell, etc.)
Added support for selling jokers when Buffoon packs are open (SMODS_BOOSTER_OPENED state)
Implemented voucher effect extraction using game's localize function
Added comprehensive Tag enum definitions and test coverage

Reviewed changes

Copilot reviewed 18 out of 18 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
src/lua/utils/types.lua	Updated Blind type to use nested Tag object instead of flat tag_name/tag_effect fields; added Tag class definition
src/lua/utils/openrpc.json	Updated OpenRPC schema to reflect Tag object structure and enhanced sell endpoint description
src/lua/utils/gamestate.lua	Implemented voucher effect extraction, tag ownership tracking, and updated blind tag structure
src/lua/utils/enums.lua	Added comprehensive Tag.Key enum definitions for all Balatro tag types
src/lua/endpoints/sell.lua	Added support for SMODS_BOOSTER_OPENED state with Buffoon pack validation
src/lua/endpoints/skip.lua	Enhanced error message with actionable guidance
src/lua/endpoints/buy.lua	Enhanced error messages with actionable guidance
src/lua/endpoints/add.lua	Updated to support pack additions and refactored voucher handling to use dedicated SMODS function
src/lua/endpoints/use.lua	Enhanced error messages with actionable guidance
src/lua/endpoints/play.lua	Enhanced error message with actionable guidance
src/lua/endpoints/discard.lua	Enhanced error messages with actionable guidance
src/lua/endpoints/pack.lua	Enhanced error messages with actionable guidance
tests/lua/endpoints/test_skip.py	Added tests for tag accumulation after skipping blinds
tests/lua/endpoints/test_pack.py	Added tests for selling jokers during Buffoon pack selection
tests/lua/endpoints/test_gamestate.py	Added comprehensive test coverage for voucher effects and tag structure
tests/lua/endpoints/test_buy.py	Updated error message expectations
tests/lua/endpoints/test_add.py	Updated error message expectations
docs/api.md	Updated documentation to reflect new Tag structure and enhanced endpoint descriptions

Comments suppressed due to low confidence (4)

src/lua/endpoints/add.lua:409

The comment says "For jokers and consumables" but this else branch will also execute for vouchers and packs, creating unnecessary params that won't be used. Consider adding an explicit check: elseif card_type == "joker" or card_type == "consumable" then to match the comment and avoid creating unused params for vouchers and packs.

    else
      -- For jokers and consumables - just pass the key
      params = {
        key = args.key,
        skip_materialize = true,
        stickers = {},
        force_stickers = true,
      }

      -- Add edition if provided
      if edition_value then
        params.edition = edition_value
      end

      -- Add eternal if provided (jokers only - validation already done)
      if args.eternal then
        params.stickers[#params.stickers + 1] = "eternal"
      end

      -- Add perishable if provided (jokers only - validation already done)
      if args.perishable then
        params.stickers[#params.stickers + 1] = "perishable"
      end

      -- Add rental if provided (jokers only - validation already done)
      if args.rental then
        params.stickers[#params.stickers + 1] = "rental"
      end
    end

tests/lua/endpoints/test_skip.py:43

Grammar issue in comment: "because it used immediately" should be "because it is used immediately"

        assert "tag_investment" not in gamestate["tags"]  # because it used immediately

tests/lua/endpoints/test_skip.py:53

Grammar issue in comment: "because it used immediately" should be "because it is used immediately"

        assert "tag_investment" not in gamestate["tags"]  # because it used immediately

src/lua/utils/types.lua:58

Typo: "bilnd" should be "blind"

---@field status Blind.Status Status of the bilnd

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-25T12:16:40Z

+        assert gamestate["tags"][0]["key"] == "tag_polychrome"
+        assert "tag_investment" not in gamestate["tags"]  # because it used immediately


This test file has a bug that will cause test_skip_big_boss to fail. The test at line 54-58 (not shown in diff) expects the error message "Cannot skip Boss blind" but skip.lua line 39 now returns "Cannot skip Boss blind. Use select to select and play the boss blind." The expected error message in the test needs to be updated to match the new implementation.

Copilot · 2026-02-25T12:16:40Z

        assert gamestate["blinds"]["big"]["status"] == "SKIPPED"
        assert gamestate["blinds"]["boss"]["status"] == "SELECT"
+        assert gamestate["tags"][0]["key"] == "tag_polychrome"
+        assert "tag_investment" not in gamestate["tags"]  # because it used immediately


This assertion is checking if the string "tag_investment" is in a list of tag objects. Since gamestate["tags"] is a list of objects (each with "key", "name", "effect" fields), the in operator will never find a string match. This should likely be checking if any tag in the list has key == "tag_investment", such as: assert not any(tag["key"] == "tag_investment" for tag in gamestate["tags"])

Copilot · 2026-02-25T12:16:41Z

        assert gamestate["state"] == "BLIND_SELECT"
        assert gamestate["blinds"]["boss"]["status"] == "SELECT"
+        assert gamestate["tags"][0]["key"] == "tag_polychrome"
+        assert "tag_investment" not in gamestate["tags"]  # because it used immediately


This assertion is checking if the string "tag_investment" is in a list of tag objects. Since gamestate["tags"] is a list of objects (each with "key", "name", "effect" fields), the in operator will never find a string match. This should likely be checking if any tag in the list has key == "tag_investment", such as: assert not any(tag["key"] == "tag_investment" for tag in gamestate["tags"])

Previously, used_vouchers extracted descriptions from static voucher_data.description which was unreliable. Now uses get_voucher_effect() that fetches effect text via the game's localize() function with proper loc_vars for each voucher type. Also adds strip_color_codes() helper and comprehensive parametrized tests covering all 32 voucher types. Closes #154.

Improve error messages across 6 endpoint files by adding actionable guidance to help bots self-heal from failed tool calls. Changes: - buy.lua: Add endpoint suggestions for empty shop/slot errors - use.lua: Add card parameter guidance for consumable errors - discard.lua/play.lua: Add card limit suggestions - pack.lua: Add pack buying and target selection hints - skip.lua: Add boss blind selection suggestion - Update test_buy.py to match new error messages Closes #148.

…chers

Closes #143.

Closes #156.

- Remove .claude/ directory (settings.json, skills/balatrobot/SKILL.md) - Remove CLAUDE.md in favor of AGENTS.md - Remove .mux/ directory (init, mcp.jsonc, tool_env, tool_post) - Remove .mdformat.toml (flags moved to Makefile) - Add AGENTS.md with project structure and rules - Add CONTEXT.md with glossary of domain terms - Add .agents/skills/balatrobot/SKILL.md for pi skill

Replace verbose boilerplate with minimal, curated entries covering macOS, Python, Lua, and project-specific ignores.

Inline --number and --exclude flags since the config file was removed.

Remove integration marker from pyproject.toml markers config.

The integration marker is no longer used. Remove auto-marking hooks from conftest files and the @pytest.mark.integration decorator.

Rename BalatroInstance module to match its primary export. Update import paths in tests.

Introduce BalatroPool with start/stop lifecycle, automatic port allocation, fail-fast cleanup, and async context-manager support. Includes InstanceInfo frozen dataclass for connection metadata.

StateFile wraps BalatroPool with a JSON state file (Jupyter pattern). Atomic write on pool start, delete on stop. Supports PID-based liveness checks, stale-file cleanup, and resolve-by-host:port or index. Add platformdirs dependency for cross-platform state directory.

Replace single BalatroInstance with pool-based serve. Adds -n / --num-instances flag for launching multiple instances. State file is written on start and cleaned up on exit.

New `balatrobot list` command reads the state file and displays running instances. Supports --json for machine-readable output. Register in CLI app.

Host/port are now optional — when omitted, the api command resolves the target from the state file. Supports --index flag to select an instance (default: 0). Falls back gracefully when no state file exists.

Add check_alive() that delegates to each instance's check_alive(). Remove __aenter__/__aexit__ — lifecycle is now managed by the caller (Server) rather than the pool itself.

Replace instance-based context manager with static write()/delete() methods. Remove __init__, __aenter__/__aexit__, _write_state, _delete_state, and all instance properties (path, instances, is_started). Lifecycle ownership moves to the new Server class.

Add Server async context manager that owns pool start/stop, state file write/delete, and a run() supervision loop (SIGTERM + check_alive). Rewrite _serve() and serve() to use Server, replacing the old StateFile context manager. Add InstanceDiedError and StateFileBusy exception handling in serve().

- test_instance: add check_alive tests (healthy, dead, not-started) - test_pool: replace context manager tests with check_alive tests - test_state: replace context manager tests with write/delete tests - test_server: new file with 5 tests for Server lifecycle and run loop

Remove BalatroPool, InstanceInfo, and StateFile from __all__. These are CLI internals, not part of the public bot-author API. Keep APIError, BalatroClient, BalatroInstance, Config, __version__.

Add sys.platform != 'win32' guard around add_signal_handler/ remove_signal_handler. Wrap the supervision loop in try/finally to ensure the signal handler is always removed, even on InstanceDiedError.

Replace 'async context-manager protocol' with check_alive(). The pool no longer has __aenter__/__aexit__.

Verify that add_signal_handler is never called on win32 and that the supervision loop still exits cleanly.

Sends SIGTERM to the server PID from the state file, polls for process death (100ms intervals, 5s timeout). Idempotent — calling twice or on an already-dead process is safe.

Verify that Server.run() registers a SIGTERM signal handler on non-Windows platforms and that the full SIGTERM flow triggers clean shutdown (state file deleted, pool stopped).

- Move InstanceInfo from pool.py to instance.py, change log_path type to Path | None (stringify only at JSON boundary) - Rename _default_state_path to default_state_path (public API) - Fix Server.__aexit__ ordering: stop pool before deleting state file - Document BalatroPool as non-restartable after stop() - Fix stale docstring in test_instance.py - Drop redundant host/port args in api.py else branch

Reusable skill for generating conventional commits with auto-staging and logical grouping.

Promote routine operational messages (endpoint actions, state transitions, mode activations) from sendDebugMessage to their appropriate levels: sendInfoMessage for normal flow, sendWarnMessage for recoverable issues, sendErrorMessage for failures. Shorten endpoint lifecycle messages to concise labels: "Init foo()" → "foo()", "Return foo()" → "foo() → ok". Rename utils/logger.lua to utils/format.lua (BB_LOGGER → BB_FORMAT) to reflect that the module provides formatting/serialization helpers, not logging — actual logging uses SMODS send*Message functions.

When BALATROBOT_LOGS_PATH is set, the Lua server appends each request and response as a JSONL line to <port>.req.jsonl and <port>.res.jsonl respectively. The Python launcher now exposes this env var automatically so sessions are recorded out of the box. These trace files feed the new CLI replay mode introduced in the next commit.

Add --requests and --responses flags that replay a JSONL trace file against a live server. With --responses, the command verifies each reply matches the expected result and reports divergences. tqdm shows progress when installed. Also move tqdm from test to main dependencies since it now ships to end users.

Update balatrobot skill to cover the automatic req.jsonl/res.jsonl recording and the new --requests/--responses flags on the api command.

When a Charm Tag (or similar) opens a free booster pack from BLIND_SELECT, skipping or selecting all cards must return the game to BLIND_SELECT — not SHOP. The polling loop now checks for both states so it doesn't hang waiting for SHOP after a tag-triggered pack. This issue was first noted in PR #190. Co-authored-by: icebear <icebear0828@users.noreply.github.com>

Add a boss parameter to the set endpoint that overrides which Boss Blind appears during the blind selection screen. This enables testing boss-specific bugs without playing through multiple antes. Implementation uses the game's own perscribed_bosses mechanism with G.FUNCS.reroll_boss, bypassing the charge via G.from_boss_tag. The response awaits the controller lock release before returning the updated gamestate. Validation ensures: - Only callable in BLIND_SELECT state - Boss blind state must be Upcoming - Key must exist in G.P_BLINDS and have .boss = true - Mutually exclusive with the shop parameter Also adds key field to the Blind type in gamestate output, Blind.Key enum alias, OpenRPC spec update, and docs with boss blind key reference table. Tests: 7 unit tests + 1 integration test (set -> skip -> select -> play -> verify boss name).

Add inline # comments describing each blind's effect, matching the style used by other enums in the file. Showdown bosses are grouped at the end and tagged with (Showdown).

…raction The API now returns a `key` field on each blind (`bl_small`, `bl_big`, `bl_manacle`). Update the expected dict to include these.

… card The Cerulean Bell boss blind (bl_final_bell) forces one random card to always be selected via card.ability.forced_selection. Both endpoints called unhighlight_all() which preserves forced cards, causing the forced card to silently leak into the played/discarded hand alongside the user's requested cards. Add validation that checks for forced_selection on any card in hand and rejects with BAD_REQUEST if the forced card is not included in the request. Replace unhighlight_all() with targeted logic that only clears non-forced highlights and only clicks cards not already highlighted, avoiding reliance on toggle no-op behavior. This issue was first notice in PR #190 Co-authored-by: icebear <icebear0828@users.noreply.github.com>

…card Add state-SELECTING_HAND--blinds.boss.key-bl_final_bell fixture under both play and discard sections in fixtures.json. Setup uses set boss bl_final_bell, skip small and big blinds, then select boss.

Add test_cerulean_bell_forced_card_not_included_in_play and test_cerulean_bell_forced_card_not_included_in_discard. Both load the bl_final_bell fixture, find the forced-highlighted card, then attempt to play/discard a different card and assert a BAD_REQUEST error containing 'forced-selected by the boss blind'.

Remove trailing whitespace in docs/api.md set_value and boss-blind tables. Split the long string-concatenation line in discard.lua error message for readability.

S1M0N38 changed the title ~~BalatroBot v2~~ feat!: balatrobot v2 Feb 24, 2026

S1M0N38 marked this pull request as ready for review February 25, 2026 12:09

Copilot AI review requested due to automatic review settings February 25, 2026 12:09

Copilot started reviewing on behalf of S1M0N38 February 25, 2026 12:10 View session

Copilot AI reviewed Feb 25, 2026

View reviewed changes

S1M0N38 force-pushed the dev branch from 73b6148 to b0b366e Compare May 27, 2026 15:24

S1M0N38 added 24 commits June 9, 2026 13:08

test(lua.endpoints): fix test for vouchers effect

0e83d85

refactor(lua.endpoints): us the SMODS.add_voucher_to_shop for add vou…

554ee45

…chers

feat: add support for Tags

043f6dd

Closes #143.

test(lua.endpoints): add test for tags support

9ac6acc

docs(lua.utils): fix the description of the enums tags

98a6623

docs(api): add documentation for tags

6ae1217

fix: allow to sell jokers when a Buffoon pack is open

aa306d3

Closes #156.

test(lua.endpoints): fix the assertion for the tags tests

cd6c643

chore(repo): slim down .gitignore to project-relevant entries

a4e0a01

Replace verbose boilerplate with minimal, curated entries covering macOS, Python, Lua, and project-specific ignores.

chore(build): move mdformat flags from .mdformat.toml to Makefile

bcb96d6

Inline --number and --exclude flags since the config file was removed.

chore(build): bump ruff to 0.15.14 and ty to 0.0.40

00f15cf

Remove integration marker from pyproject.toml markers config.

chore(tests): remove integration marker plumbing

0ccf775

The integration marker is no longer used. Remove auto-marking hooks from conftest files and the @pytest.mark.integration decorator.

chore(docs): exclude adr/ directory from mkdocs site

918740d

fix(tests): use ty: ignore directive for type suppression

27effea

ci: align mdformat flags with Makefile

7d83bd4

refactor(cli): rename manager.py to instance.py

87f29d5

Rename BalatroInstance module to match its primary export. Update import paths in tests.

feat(pool): add BalatroPool for managing N BalatroInstance instances

be9aefa

Introduce BalatroPool with start/stop lifecycle, automatic port allocation, fail-fast cleanup, and async context-manager support. Includes InstanceInfo frozen dataclass for connection metadata.

feat(serve): use BalatroPool and StateFile in serve command

e7f8428

Replace single BalatroInstance with pool-based serve. Adds -n / --num-instances flag for launching multiple instances. State file is written on start and cleaned up on exit.

feat(cli): add list command to show running instances

643f96e

New `balatrobot list` command reads the state file and displays running instances. Supports --json for machine-readable output. Register in CLI app.

feat(api): add instance discovery to api command

a09fb06

Host/port are now optional — when omitted, the api command resolves the target from the state file. Supports --index flag to select an instance (default: 0). Falls back gracefully when no state file exists.

S1M0N38 added 19 commits June 9, 2026 13:09

refactor(pool): add check_alive, remove context manager

72fecf1

Add check_alive() that delegates to each instance's check_alive(). Remove __aenter__/__aexit__ — lifecycle is now managed by the caller (Server) rather than the pool itself.

refactor: trim __all__ to public API surface

e5ce462

Remove BalatroPool, InstanceInfo, and StateFile from __all__. These are CLI internals, not part of the public bot-author API. Keep APIError, BalatroClient, BalatroInstance, Config, __version__.

fix(serve): guard signal handler for Windows and clean up on exit

63cc443

Add sys.platform != 'win32' guard around add_signal_handler/ remove_signal_handler. Wrap the supervision loop in try/finally to ensure the signal handler is always removed, even on InstanceDiedError.

docs(pool): update docstring to remove context-manager mention

fbbb8d8

Replace 'async context-manager protocol' with check_alive(). The pool no longer has __aenter__/__aexit__.

test(server): add Windows signal-handler guard test

d3a2991

Verify that add_signal_handler is never called on win32 and that the supervision loop still exits cleanly.

style: use parenthesized with-statements

fbf7452

feat(cli): add stop command to gracefully shut down server

c155a3d

Sends SIGTERM to the server PID from the state file, polls for process death (100ms intervals, 5s timeout). Idempotent — calling twice or on an already-dead process is safe.

test(server): add SIGTERM handler and clean shutdown tests

711d8c0

Verify that Server.run() registers a SIGTERM signal handler on non-Windows platforms and that the full SIGTERM flow triggers clean shutdown (state file deleted, pool stopped).

chore(gitignore): add antigravitycli and review skill to gitignore

2740c92

style: format long lines and add class spacing

d2d8fd9

docs(skill): add stop command to balatrobot skill

4b784fe

chore(skills): add git-commit skill

3efd844

Reusable skill for generating conventional commits with auto-staging and logical grouping.

S1M0N38 force-pushed the dev branch from c3b1c2a to cc32129 Compare June 9, 2026 11:09

S1M0N38 and others added 8 commits June 9, 2026 14:28

docs(skill): document JSONL traces and replay mode

fb4b2e3

Update balatrobot skill to cover the automatic req.jsonl/res.jsonl recording and the new --requests/--responses flags on the api command.

docs(lua): add descriptions to Blind.Key enum values

fc0b0ce

Add inline # comments describing each blind's effect, matching the style used by other enums in the file. Showdown bosses are grouped at the end and tagged with (Showdown).

test: add key field to expected blinds in test_blinds_structure_ext…

a63d5df

…raction The API now returns a `key` field on each blind (`bl_small`, `bl_big`, `bl_manacle`). Update the expected dict to include these.

test(fixtures): add Cerulean Bell boss blind fixture for play and dis…

cfebcf8

…card Add state-SELECTING_HAND--blinds.boss.key-bl_final_bell fixture under both play and discard sections in fixtures.json. Setup uses set boss bl_final_bell, skip small and big blinds, then select boss.

S1M0N38 force-pushed the dev branch from 0f4937c to 932e5cd Compare June 9, 2026 19:14

style: trim trailing whitespace in tables, wrap long line

f6d3f85

Remove trailing whitespace in docs/api.md set_value and boss-blind tables. Split the long string-concatenation line in discard.lua error message for readability.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat!: balatrobot v2#155

feat!: balatrobot v2#155
S1M0N38 wants to merge 63 commits into
mainfrom
dev

S1M0N38 commented Feb 24, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 25, 2026

Uh oh!

Copilot AI Feb 25, 2026

Uh oh!

Copilot AI Feb 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		assert gamestate["tags"][0]["key"] == "tag_polychrome"
		assert "tag_investment" not in gamestate["tags"] # because it used immediately

Conversation

S1M0N38 commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

S1M0N38 commented Feb 24, 2026 •

edited

Loading