Skip to content

Dependency conflicts: Verl requires tensordict<=0.6.2 but Torch 2.7.0 (CUDA 12.8, B200 GPUs) requires tensordict>=0.7.0 #67

@jatinS-dev

Description

@jatinS-dev

Description

I am trying to run Agent-R1 on HiPerGator using the B200 GPU partition (hpg-b200). The new B200 GPUs require PyTorch ≥2.7.0 with CUDA 12.8 for sm_100 architecture support. However, Verl and vLLM dependencies were originally pinned for Torch 2.3.0, which creates conflicts.

Steps to Reproduce

  1. Create a clean conda env with Python 3.10.
  2. Install Verl (0.2.0.dev0) as per project instructions.
  3. Install PyTorch stack for B200:
    pip install torch==2.7.0+cu128 torchvision==0.22.0+cu128 torchaudio==2.7.0+cu128 --index-url https://download.pytorch.org/whl/cu128
  4. Try installing tensordict, xformers, and vllm.

Actual Results

  • verl 0.2.0.dev0 requires tensordict<=0.6.2.
  • torch==2.7.0 requires tensordict>=0.7.0.
  • vllm 0.4.x (used in Agent-R1) only works with torch==2.3.0.
  • vllm >=0.10.x works with Torch 2.7.0, but introduces breaking API changes (model_hf_config removed).

Example error

ERROR: pip's dependency resolver does not currently take into account all the packages that are installed.
verl 0.2.0.dev0 requires tensordict<=0.6.2, which is not installed.
torchaudio 2.7.0+cu128 requires torch==2.7.0, but you have torch 2.7.1 which is incompatible.
torchvision 0.22.0+cu128 requires torch==2.7.0, but you have torch 2.7.1 which is incompatible.

Expected Results

Ability to install and run Agent-R1/Verl with PyTorch 2.7.0 + CUDA 12.8 (for B200 GPUs) without dependency conflicts.

CC

@lyumengxian

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions