Skip to content

Add tileiras optional dependency for bundled compiler support#90

Open
hannahli-nv wants to merge 1 commit intomainfrom
hanli/add-tileiras-extra
Open

Add tileiras optional dependency for bundled compiler support#90
hannahli-nv wants to merge 1 commit intomainfrom
hanli/add-tileiras-extra

Conversation

@hannahli-nv
Copy link
Copy Markdown
Collaborator

@hannahli-nv hannahli-nv commented Mar 31, 2026

Summary

Addresses #79: users without a system CUDA Toolkit installation need cuda-tile[tileiras] to compile TileGym kernels at runtime.

  • Add tileiras extras_require to setup.py, enabling pip install .[tileiras]
  • Restructure README installation section with Option A (bundled tileiras via pip) and Option B (system tileiras)
  • Add comment in requirements.txt about the tileiras option

Closes #79

Details

TileGym kernels are compiled at runtime by the tileiras compiler from cuda-tile. Users can either:

  1. Install the compiler via pip: pip install .[tileiras] (bundles nvidia-cuda-tileiras, nvidia-cuda-nvcc, nvidia-nvvm into the Python env)
  2. Use a system CUDA Toolkit 13.1+ which includes tileiras

Previously, the README only documented pip install . which installs cuda-tile without the compiler binaries, causing failures for users without a system CUDA Toolkit.

Test plan

  • Verify pip install .[tileiras] installs nvidia-cuda-tileiras and bundled compiler
  • Verify pip install . still works on systems with CUDA Toolkit 13.1+
  • Verify benchmarks run successfully with bundled tileiras

CI Configuration

config:
  build: true
  test: ["ops", "benchmark"]

Addresses GitHub issue #79: users without a system CUDA Toolkit installation
need cuda-tile[tileiras] to compile TileGym kernels at runtime.

- Add 'tileiras' extras_require to setup.py, enabling pip install .[tileiras]
- Update README installation section with Option A (bundled tileiras) and
  Option B (system tileiras)
- Add comment in requirements.txt about the tileiras option
@copy-pr-bot
Copy link
Copy Markdown

copy-pr-bot bot commented Mar 31, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Install instructions/requirements.txt need to add cuda-tile[tileiras]

1 participant