Skip to content

{2025.06}[rocm-compilers/19.0.0-ROCm-6.4.1] HIP-6.4.1, RCCL-2.22.3#1526

Open
zerefwayne wants to merge 1 commit into
EESSI:mainfrom
zerefwayne:add-hip-641
Open

{2025.06}[rocm-compilers/19.0.0-ROCm-6.4.1] HIP-6.4.1, RCCL-2.22.3#1526
zerefwayne wants to merge 1 commit into
EESSI:mainfrom
zerefwayne:add-hip-641

Conversation

@zerefwayne

Copy link
Copy Markdown
Contributor

No description provided.

@zerefwayne zerefwayne changed the title Add HIP-6.4.1 and RCCL-2.22.3 {2025.06}[rocm-compilers/19.0.0-ROCm-6.4.1] HIP-6.4.1, RCCL-2.22.3 Jun 17, 2026
@zerefwayne

zerefwayne commented Jun 17, 2026

Copy link
Copy Markdown
Contributor Author

Test

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=amd/gfx90a

EDIT: rocm-compilers is not available yet

@eessi-bot-aws

eessi-bot-aws Bot commented Jun 17, 2026

Copy link
Copy Markdown

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator amd/gfx90a
Job dir: /project/def-users/SHARED/jobs/2026.06/pr_1526/167291

date job status comment
Jun 17 12:08:59 UTC 2026 submitted job id 167291 awaits release by job manager
Jun 17 12:09:49 UTC 2026 released job awaits launch by Slurm scheduler
Jun 17 12:15:02 UTC 2026 running job 167291 is running
Jun 17 12:21:28 UTC 2026 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-167291.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-amd-gfx90a-17816987630.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a
no other files in tarball
Jun 17 12:21:28 UTC 2026 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] (1/5) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/22Jul2025-foss-2024a-kokkos %scale=1_node /ade8cad7 @BotBuildTests:x86-64-zen2+default
P: perf: 440.76 timesteps/s (r:0, l:None, u:None)
[ OK ] (2/5) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /e4bf9965 @BotBuildTests:x86-64-zen2+default
P: latency: 1.41 us (r:0, l:None, u:None)
[ OK ] (3/5) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /3da4890b @BotBuildTests:x86-64-zen2+default
P: latency: 2.06 us (r:0, l:None, u:None)
[ OK ] (4/5) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /3255009a @BotBuildTests:x86-64-zen2+default
P: latency: 0.18 us (r:0, l:None, u:None)
[ OK ] (5/5) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /59f4b331 @BotBuildTests:x86-64-zen2+default
P: bandwidth: 7613.27 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 5/5 test case(s) from 5 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-167291.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@zerefwayne

Copy link
Copy Markdown
Contributor Author

Test 2

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=amd/gfx90a

@eessi-bot-aws

eessi-bot-aws Bot commented Jun 17, 2026

Copy link
Copy Markdown

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator amd/gfx90a
Job dir: /project/def-users/SHARED/jobs/2026.06/pr_1526/167339

date job status comment
Jun 17 13:59:53 UTC 2026 submitted job id 167339 awaits release by job manager
Jun 17 14:00:34 UTC 2026 released job awaits launch by Slurm scheduler
Jun 17 14:05:39 UTC 2026 running job 167339 is running
Jun 17 14:15:12 UTC 2026 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-167339.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-amd-gfx90a-17817054370.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a
no other files in tarball
Jun 17 14:15:12 UTC 2026 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] (1/5) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/22Jul2025-foss-2024a-kokkos %scale=1_node /ade8cad7 @BotBuildTests:x86-64-zen2+default
P: perf: 442.26 timesteps/s (r:0, l:None, u:None)
[ OK ] (2/5) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /e4bf9965 @BotBuildTests:x86-64-zen2+default
P: latency: 1.35 us (r:0, l:None, u:None)
[ OK ] (3/5) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /3da4890b @BotBuildTests:x86-64-zen2+default
P: latency: 3.44 us (r:0, l:None, u:None)
[ OK ] (4/5) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /3255009a @BotBuildTests:x86-64-zen2+default
P: latency: 0.18 us (r:0, l:None, u:None)
[ OK ] (5/5) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /59f4b331 @BotBuildTests:x86-64-zen2+default
P: bandwidth: 8036.91 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 5/5 test case(s) from 5 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-167339.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@zerefwayne

Copy link
Copy Markdown
Contributor Author
ERROR: Toolchain {'name': 'rocm-compilers', 'version': '19.0.0-ROCm-6.4.1'} (required by rocm-cmake/0.14.0-rocm-compilers-19.0.0-ROCm-6.4.1) is not supported in EESSI/2025.06

@boegel

boegel commented Jun 17, 2026

Copy link
Copy Markdown
Contributor
ERROR: Toolchain {'name': 'rocm-compilers', 'version': '19.0.0-ROCm-6.4.1'} (required by rocm-cmake/0.14.0-rocm-compilers-19.0.0-ROCm-6.4.1) is not supported in EESSI/2025.06
== No toolchain hierarchy found for {'name': 'rfoss', 'version': '2025a'},
ignoring! (Could not find easyconfig for rfoss toolchain version 2025a)

It's missing the easyconfigs for rfoss, essentially.
I think you'll need to explicitly list the rocm-compilers (sub)toolchain in the hooks, for now...

@zerefwayne

Copy link
Copy Markdown
Contributor Author

Test 3

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=amd/gfx90a

@eessi-bot-aws

eessi-bot-aws Bot commented Jun 17, 2026

Copy link
Copy Markdown

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator amd/gfx90a
Job dir: /project/def-users/SHARED/jobs/2026.06/pr_1526/167731

date job status comment
Jun 17 20:02:55 UTC 2026 submitted job id 167731 awaits release by job manager
Jun 17 20:03:24 UTC 2026 released job awaits launch by Slurm scheduler
Jun 17 20:09:29 UTC 2026 running job 167731 is running
Jun 17 20:14:37 UTC 2026 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-167731.out
✅ no message matching FATAL:
❌ found message matching ERROR:
✅ no message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-amd-gfx90a-17817270070.tar.zstsize: 0 MiB (22 bytes)
entries: 0
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen2/accel/amd/gfx90a
no other files in tarball
Jun 17 20:14:37 UTC 2026 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] (1/5) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/22Jul2025-foss-2024a-kokkos %scale=1_node /ade8cad7 @BotBuildTests:x86-64-zen2+default
P: perf: 442.198 timesteps/s (r:0, l:None, u:None)
[ OK ] (2/5) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /e4bf9965 @BotBuildTests:x86-64-zen2+default
P: latency: 1.35 us (r:0, l:None, u:None)
[ OK ] (3/5) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /3da4890b @BotBuildTests:x86-64-zen2+default
P: latency: 2.07 us (r:0, l:None, u:None)
[ OK ] (4/5) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /3255009a @BotBuildTests:x86-64-zen2+default
P: latency: 0.18 us (r:0, l:None, u:None)
[ OK ] (5/5) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /59f4b331 @BotBuildTests:x86-64-zen2+default
P: bandwidth: 8088.14 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 5/5 test case(s) from 5 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-167731.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@zerefwayne

Copy link
Copy Markdown
Contributor Author

@boegel

== No toolchain hierarchy found for {'name': 'rocm-compilers', 'version':
'19.0.0'}, ignoring! (Could not find easyconfig for rocm-compilers toolchain
version 19.0.0)


ERROR: Toolchain {'name': 'rocm-compilers', 'version': '19.0.0-ROCm-6.4.1'} (required by rocm-cmake/0.14.0-rocm-compilers-19.0.0-ROCm-6.4.1) is not supported in EESSI/2025.06
```

@boegel

boegel commented Jun 17, 2026

Copy link
Copy Markdown
Contributor

@boegel

== No toolchain hierarchy found for {'name': 'rocm-compilers', 'version':
'19.0.0'}, ignoring! (Could not find easyconfig for rocm-compilers toolchain
version 19.0.0)


ERROR: Toolchain {'name': 'rocm-compilers', 'version': '19.0.0-ROCm-6.4.1'} (required by rocm-cmake/0.14.0-rocm-compilers-19.0.0-ROCm-6.4.1) is not supported in EESSI/2025.06

Ah snap, looks like we should keep the versionsuffix part after all, that's my bad... Sorry

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants