feat: support Risk-Aligned Cache under Classifier-Free Guidance. by yiming-l21 · Pull Request #1273 · jd-opensource/xllm

yiming-l21 · 2026-04-14T03:42:59Z

No description provided.

gemini-code-assist

Code Review

This pull request implements the RACFGCache policy, a joint decision-making caching mechanism for DiT models that incorporates accumulated risk and propagation-aware reweighting. The changes include the addition of hardcoded calibration tables, updates to global flags and configuration structures, and the integration of runtime context handling within the caching framework. Feedback identifies several style guide violations, specifically the need to use fixed-width integers (int64_t) instead of int and the requirement to use CHECK instead of TORCH_CHECK for assertions. Additionally, a recommendation was made to avoid changing default values in base configuration structs to prevent unintended side effects on other caching policies.

gemini-code-assist · 2026-04-14T03:46:31Z

+inline float NaN() { return std::numeric_limits<float>::quiet_NaN(); }
+
+// Build a square rho table with shape [steps, steps].
+torch::Tensor make_square_table(int steps, const std::vector<float>& values) {


The style guide mandates the use of fixed-width integers (e.g., int32_t, int64_t) instead of int. For consistency with other step-related variables in the codebase, int64_t is preferred.

Suggested change

torch::Tensor make_square_table(int steps, const std::vector<float>& values) {

torch::Tensor make_square_table(int64_t steps, const std::vector<float>& values) {

References

Use fixed-width integers (int32_t, int64_t) instead of plain int, unless the API you are calling explicitly requires int. ^(link)

gemini-code-assist · 2026-04-14T03:46:31Z

+  TORCH_CHECK(steps > 0, "steps must be positive, got ", steps);
+  TORCH_CHECK(values.size() == static_cast<size_t>(steps * steps),
+              "rho table size mismatch, expected ",
+              steps * steps,
+              ", got ",
+              values.size());


The style guide requires using CHECK from glog for assertions, not TORCH_CHECK.

CHECK(steps > 0) << "steps must be positive, got " << steps; CHECK(values.size() == static_cast<size_t>(steps * steps)) << "rho table size mismatch, expected " << steps * steps << ", got " << values.size();

References

Use CHECK (glog) instead of TORCH_CHECK for assertions. ^(link)

gemini-code-assist · 2026-04-14T03:46:31Z

+struct RhoTableSpec {
+  std::string model_name;
+  float cfg_scale = 0.0f;
+  int infer_steps = 0;


The style guide requires using fixed-width integers like int64_t instead of int. Other parts of the codebase use int64_t for step counts.

Suggested change

int infer_steps = 0;

int64_t infer_steps = 0;

References

Use fixed-width integers (int32_t, int64_t) instead of plain int, unless the API you are calling explicitly requires int. ^(link)

gemini-code-assist · 2026-04-14T03:46:31Z

 struct DiTBaseCacheOptions {
  // the number of warmup steps.
-  int warmup_steps = 0;
+  int warmup_steps = 5;


Changing the default warmup_steps in the base DiTBaseCacheOptions from 0 to 5 will affect all caching policies, not just the new RACFGCache. This could introduce unintended behavior changes for existing policies. It would be safer to revert this change and set the warmup steps specifically for RACFGCache when parsing its flags.

Suggested change

int warmup_steps = 5;

int warmup_steps = 0;

yiming-l21 requested review from shan-chen-feng, xiao-yu-chen and z-jun03 April 14, 2026 03:42

yiming-l21 requested review from DongheJin, JimHsiung, RobbieLeung, XuZhang99, liutongxuan, walsonyang and yq33victor as code owners April 14, 2026 03:43

gemini-code-assist bot reviewed Apr 14, 2026

View reviewed changes

yiming-l21 force-pushed the main branch from c69d28f to b2a4322 Compare April 14, 2026 04:35

feat: support Risk-Aligned Cache under Classifier-Free Guidance.

2d7838e

yiming-l21 force-pushed the main branch from b2a4322 to 2d7838e Compare April 14, 2026 04:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support Risk-Aligned Cache under Classifier-Free Guidance.#1273

feat: support Risk-Aligned Cache under Classifier-Free Guidance.#1273
yiming-l21 wants to merge 1 commit intojd-opensource:mainfrom
yiming-l21:main

yiming-l21 commented Apr 14, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Apr 14, 2026

Uh oh!

yiming-l21 Apr 14, 2026

Uh oh!

gemini-code-assist bot Apr 14, 2026

Uh oh!

yiming-l21 Apr 14, 2026

Uh oh!

gemini-code-assist bot Apr 14, 2026

Uh oh!

yiming-l21 Apr 14, 2026

Uh oh!

gemini-code-assist bot Apr 14, 2026

Uh oh!

yiming-l21 Apr 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	torch::Tensor make_square_table(int steps, const std::vector<float>& values) {
	torch::Tensor make_square_table(int64_t steps, const std::vector<float>& values) {

Conversation

yiming-l21 commented Apr 14, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

yiming-l21 Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

yiming-l21 Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

yiming-l21 Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

yiming-l21 Apr 14, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant