Yuting Yang† Haichao Jiang† Tianming Liang Quan Zhang Jian-Fang Hu*
Sun Yat-sen University
When the user query lacks the details needed to uniquely identify the intended target, existing methods tend to guess the user's preference arbitrarily, while our IC-Seg proactively interacts with the user to clarify their real intention.
IC-Seg resolves ambiguities via multi-turn dialogues with an MLLM-based User Simulator. Our Hi-GRPO algorithm empowers the agent through a hierarchical reward chain: supervising final localization accuracy at the trajectory level, inquiry quality at the turn level, and fine-grained reasoning steps using expert-diagnosed signals.
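The three reward levels described above can be illustrated with a minimal sketch. This is not the Hi-GRPO formulation from the paper: the `Trajectory` fields, the weights `w_traj`, `w_turn`, `w_step`, and the weighted-sum combination are all hypothetical choices made for illustration. Only the group-relative normalization (reward minus group mean, divided by group standard deviation) follows the standard GRPO recipe.

```python
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import List


@dataclass
class Trajectory:
    """One multi-turn rollout (all field names are hypothetical)."""
    final_iou: float          # trajectory level: final localization accuracy
    turn_scores: List[float]  # turn level: quality of each clarifying question
    step_scores: List[float]  # step level: expert-diagnosed reasoning signals


def hierarchical_reward(t: Trajectory,
                        w_traj: float = 1.0,
                        w_turn: float = 0.5,
                        w_step: float = 0.25) -> float:
    """Collapse the three levels into one scalar via an assumed weighted sum."""
    turn = mean(t.turn_scores) if t.turn_scores else 0.0
    step = mean(t.step_scores) if t.step_scores else 0.0
    return w_traj * t.final_iou + w_turn * turn + w_step * step


def group_relative_advantages(rewards: List[float],
                              eps: float = 1e-6) -> List[float]:
    """Standard GRPO-style normalization within a group of rollouts."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Two rollouts sampled for the same ambiguous query (made-up scores).
group = [
    Trajectory(final_iou=0.9, turn_scores=[1.0, 0.8], step_scores=[1.0]),
    Trajectory(final_iou=0.2, turn_scores=[0.1], step_scores=[0.0]),
]
rewards = [hierarchical_reward(t) for t in group]
advantages = group_relative_advantages(rewards)
```

In this sketch the rollout that asks better questions and localizes the target more accurately receives the positive advantage. A hierarchical method may instead assign separate advantages per turn or per reasoning step; the scalar sum here is only the simplest way to show how the three signals could interact.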
Our work is built upon Seg-ReSearch and SDPO. We sincerely appreciate these excellent works.
If you find our work helpful for your research, please consider citing our paper.
@misc{yang2026dontguessjustask,
title={Don't Guess, Just Ask: Resolving Ambiguity in Referring Segmentation via Multi-turn Clarification},
author={Yuting Yang and Haichao Jiang and Tianming Liang and Quan Zhang and Jian-Fang Hu},
year={2026},
eprint={2605.17531},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2605.17531},
}