Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). View this failed invocation of the CLA check for more information. For the most up-to-date status, view the checks section at the bottom of the pull request.
Autotune will not be run on CPU. I would suggest removing this endpoint in cpu_server.
I'd say let's keep cpu_server in case the TPU server is not available for customers, or for infra verification if they need it.
The current implementation will not fall back to the CPU backend: the default backend in autotune_kernel is TPU, and when it is used in the AutotuneRunner, no backend is specified, so it will still only use TPU. Good guidance for this kind of situation is go/tott/737.
OK, I added CPU fallback logic so that when there are no TPU resources, users can still use the CPU for infra verification.
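The fallback described above could look something like the following sketch. All names here (`tpu_available`, `select_backend`, `run_autotune`) are illustrative assumptions, not the actual APIs in this repository:

```python
# Hypothetical sketch of "prefer TPU, fall back to CPU" backend
# selection. Function and backend names are illustrative only.

def tpu_available() -> bool:
    """Stand-in for a real TPU availability probe."""
    return False  # assume no TPU resources in this example


def select_backend(preferred: str = "tpu") -> str:
    # Prefer TPU when resources exist; otherwise fall back to CPU so
    # infra verification still works without TPU.
    if preferred == "tpu" and tpu_available():
        return "tpu"
    return "cpu"


def run_autotune(kernel_name: str) -> str:
    backend = select_backend()
    # The real runner would dispatch to the chosen backend here.
    return f"autotuning {kernel_name} on {backend}"
```

With `tpu_available` returning `False`, `run_autotune("matmul")` would report the CPU backend, which matches the infra-verification use case mentioned above.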
I would suggest letting the eval_server handle the backend-selection logic if we want to use both. See examples in accelerator-agents/MaxKernel/hitl_agent/subagents/profiling/kernel_profile.py and accelerator-agents/MaxKernel/hitl_agent/subagents/kernel_writing/kernel_compilation.py.
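The suggestion above, centralizing backend selection in the eval server rather than in each tool, could be sketched as follows. The class and method names here are hypothetical, chosen only to illustrate the design; the real eval_server API may differ:

```python
# Hypothetical sketch: the eval server owns the backend policy, and
# callers (e.g. an autotune runner) just ask it which backend to use.

class EvalServer:
    def __init__(self, available_backends):
        # e.g. ["tpu", "cpu"] when TPU resources exist, ["cpu"] otherwise.
        self.available_backends = list(available_backends)

    def pick_backend(self, preferred: str = "tpu") -> str:
        # Single place where preference and fallback logic live.
        if preferred in self.available_backends:
            return preferred
        # Fall back to whatever is available (CPU in the degraded case).
        return self.available_backends[0]
```

The benefit is that tools like the autotune runner stay backend-agnostic: if the fallback policy changes, only the server needs updating.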
Add autotune agent
Add autotune tool
Integrate with the system