Bugfix - Add SM80/90 architecture to CUTLASS builds for CUDA 12.8+ for A100 #785

Open
polarG wants to merge 1 commit into main from fix/cutlass-add-sm80-arch

Conversation

@polarG
Contributor

@polarG polarG commented Mar 11, 2026

Add sm_80 to the CUTLASS architecture lists so that builds targeting CUDA 12.8+ include Ampere (A100) support.

Add sm_80 to the CUTLASS architecture lists to ensure Ampere GPU
support is included in builds targeting CUDA 12.8+ and 12.9+.
@polarG polarG requested a review from a team as a code owner March 11, 2026 23:00
Copilot AI review requested due to automatic review settings March 11, 2026 23:00
Copilot AI left a comment

Pull request overview

Updates the CUTLASS build configuration to ensure Ampere (A100 / SM80) is included in CUTLASS builds when using CUDA 12.8+.

Changes:

  • Add sm_80 to the CUTLASS ARCHS list for CUDA >= 12.8.
  • Expand the CUDA >= 12.9 CUTLASS ARCHS list to include 80 and 90 in addition to 100/103.
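The selection logic described in the two bullets above can be sketched roughly as follows. This is an illustrative Python sketch, not the project's actual build rule: the function names `cutlass_archs` and `cmake_arch_flag` are hypothetical, the CUDA 12.8 list beyond `80` is an assumption (the PR only states that `sm_80` is added to it), and passing the list through CUTLASS's `CUTLASS_NVCC_ARCHS` CMake variable is assumed rather than confirmed by this PR.

```python
from typing import List


def cutlass_archs(cuda_version: str) -> List[str]:
    """Return the CUTLASS architecture list for a CUDA toolkit version.

    Hypothetical helper for illustration only.
    """
    major, minor = (int(part) for part in cuda_version.split(".")[:2])
    if (major, minor) >= (12, 9):
        # Stated in the PR overview: 80 and 90 in addition to 100/103.
        return ["80", "90", "100", "103"]
    if (major, minor) >= (12, 8):
        # Only "80" is confirmed by the PR; "90" and "100" are assumptions.
        return ["80", "90", "100"]
    # Older toolkits are outside the scope of this sketch.
    raise ValueError("CUDA %s is not covered by this sketch" % cuda_version)


def cmake_arch_flag(cuda_version: str) -> str:
    """Format the list as a semicolon-separated CMake definition (assumed usage)."""
    return "-DCUTLASS_NVCC_ARCHS=" + ";".join(cutlass_archs(cuda_version))
```

Calling `cmake_arch_flag("12.9")` would yield a flag covering all four architectures, so a single build serves A100, H100, and Blackwell parts.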

@@ -42,11 +42,11 @@ sb_micro_path:
# for cuda 12.9 and later Build from commit v3.9 (3.9 release commit) for blackwell support
Copilot AI Mar 11, 2026

The comment above cuda_cutlass says CUDA>=12.9 builds are from CUTLASS v3.9, but the rule actually clones --branch v4.1.0. Please update the comment to match the version/branch being used (and optionally clarify why v4.1.0 is needed for Blackwell).

Suggested change
- # for cuda 12.9 and later Build from commit v3.9 (3.9 release commit) for blackwell support
+ # For CUDA 12.9 and later, build from CUTLASS v4.1.0 (required for Blackwell support).

@codecov

codecov bot commented Mar 11, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 85.70%. Comparing base (6b8e810) to head (e0bb811).

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #785   +/-   ##
=======================================
  Coverage   85.70%   85.70%           
=======================================
  Files         102      102           
  Lines        7703     7703           
=======================================
  Hits         6602     6602           
  Misses       1101     1101           
Flag                       Coverage Δ
cpu-python3.10-unit-test   70.96% <ø> (ø)
cpu-python3.12-unit-test   70.96% <ø> (ø)
cpu-python3.7-unit-test    70.43% <ø> (ø)
cuda-unit-test             83.59% <ø> (ø)

Flags with carried forward coverage won't be shown.

@polarG polarG changed the title from "Bugfix: Add SM80/90 architecture to CUTLASS builds for CUDA 12.8+ for A100" to "Bugfix - Add SM80/90 architecture to CUTLASS builds for CUDA 12.8+ for A100" Mar 14, 2026
2 participants