support DynamicFilterPhysicalExpr by adriangb · Pull Request #4961 · vortex-data/vortex

adriangb · 2025-10-15T21:54:00Z

adriangb · 2025-10-15T21:56:32Z

vortex-datafusion/src/persistent/opener.rs

+                            if is_dynamic_physical_expr(&e) {
+                                e.snapshot().ok().flatten()
+                            } else {
+                                Some(Arc::clone(e))
+                            }


Not sure if it's possible or worth it but you could do this more lazily, or wrap the DataFusion dynamic filter in a Vortex dynamic filter and call DynamicFilterPhysicalExpr::current() or PhysicalExpr::snapshot() on each RecordBatch (that would be more overhead but also will kick in sooner on scans that touch very large files).

I roughly tried to do that in some old PR, but I don't remember why I stopped pushing it. This seems like a good starting point, and we can experiment with it in the future to see which one is better.

adriangb · 2025-10-15T21:58:03Z

vortex-datafusion/src/persistent/mod.rs

+            .ok_or_else(|| {
+                vortex_err!("Plan should have 2 DataSourceExec, the second is the probe side")
+            })?;
+        assert!(data_source_line.contains("DynamicFilterPhysicalExpr [ a@0 >= 1 AND a@0 <= 3 ]"));


I'm not sure what other assertions we could make. I don't think Vortex gives any metrics on rows pruned, etc. And the Vortex expression isn't saved anywhere / is not in the plan.

I put in a print statement and can see that an appropriate Vortex predicate is being created, but can't verify that it's doing what's expected

I'm not sure I have a better idea here right now, I need to figure out how to improve the overall testability

codecov · 2025-10-15T22:40:23Z

Codecov Report

❌ Patch coverage is 95.00000% with 6 lines in your changes missing coverage. Please review.
✅ Project coverage is 88.03%. Comparing base (839b7dd) to head (a9e0abb).
⚠️ Report is 671 commits behind head on develop.

Files with missing lines	Patch %	Lines
vortex-datafusion/src/persistent/mod.rs	92.77%	6 Missing ⚠️

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

joseph-isaacs · 2025-10-17T21:13:48Z

We cannot currently run pre commit benchmark on fork so I created this (#4986)

AdamGS · 2025-10-17T21:50:24Z

@adriangb haven't forgotten this PR, just really busy week

AdamGS · 2025-10-17T22:09:46Z

I can belive some of the perf impact, but I suspect AWS tonight is just extremely noisy, gotta love the 40% on clickbecnh q0.

adriangb · 2025-10-18T06:19:15Z

I can belive some of the perf impact, but I suspect AWS tonight is just extremely noisy, gotta love the 40% on clickbecnh q0.

Is there a benchmark or something I'm not seeing? If it's Polar Signal I don't seem to have access.

AdamGS · 2025-10-18T10:58:46Z

They ran on this PR, seems like there's an issue to trigger benchmarks on this branch (@joseph-isaacs got something I can look at? Would love to be able to do that)

adriangb · 2025-10-21T13:08:03Z

@AdamGS what's your read on the state of this PR / what can I do to help? It's not clear to me if there are issues with benchmarks, if @joseph-isaacs is planning on taking it over in another PR, if I should rewrite the commits with the licensing info, etc.

AdamGS · 2025-10-21T13:09:18Z

You beat me to comment by a minute!
I've re-triggered the benchmarks over at #4986, if you can it'll be great if you can fix the lint failure (just run with nightly) and the DCO thing and we can merge this IMO.

AdamGS · 2025-10-21T13:10:02Z

Something is going on with the label check, I'll figure that one out.

AdamGS

This is awesome 🥳

AdamGS · 2025-10-21T13:15:08Z

Last benchmarks run - https://github.com/vortex-data/vortex/actions/runs/18684856431

joseph-isaacs

There are some big regressions on tpch which we need to address before merging

AdamGS · 2025-10-22T09:49:18Z

Completely missed that, I'll take a deeper look.

github-actions · 2026-01-17T02:13:09Z

This PR has been marked as stale because it has been open for 30 days with no activity. Please comment or remove the stale label if you wish to keep it active, otherwise it will be closed in 7 days

adriangb · 2026-01-17T02:16:35Z

I think you may have found some real performance issues. We are discussing similar findings on the DataFusion side:
apache/datafusion#19639
apache/datafusion#3463

TLDR is we think some of the expressions may be expensive to evaluate. Unclear if that means they're not worth having at all, or if the plan shape / parallelism needs to be tweaked, but it does seem real.

adriangb added 2 commits October 15, 2025 16:53

support DynamicFilterPhysicalExpr

531360f

revert debug dep changes

54de540

adriangb commented Oct 15, 2025

View reviewed changes

adriangb added 2 commits October 15, 2025 17:08

simplify

e126f45

simplify

a9e0abb

joseph-isaacs added the changelog/performance A performance improvement label Oct 17, 2025

github-actions bot deployed to polar-signals-cloud October 17, 2025 21:16 View deployment

github-actions bot deployed to polar-signals-cloud October 17, 2025 21:19 View deployment

github-actions bot deployed to polar-signals-cloud October 17, 2025 21:20 View deployment

github-actions bot deployed to polar-signals-cloud October 17, 2025 21:24 View deployment

github-actions bot deployed to polar-signals-cloud October 17, 2025 21:25 View deployment

github-actions bot deployed to polar-signals-cloud October 17, 2025 21:26 View deployment

github-actions bot deployed to polar-signals-cloud October 17, 2025 21:33 View deployment

github-actions bot deployed to polar-signals-cloud October 17, 2025 21:39 View deployment

github-actions bot deployed to polar-signals-cloud October 17, 2025 21:40 View deployment

github-actions bot deployed to polar-signals-cloud October 20, 2025 10:54 View deployment

github-actions bot deployed to polar-signals-cloud October 20, 2025 10:57 View deployment

github-actions bot deployed to polar-signals-cloud October 20, 2025 10:58 View deployment

github-actions bot deployed to polar-signals-cloud October 20, 2025 11:01 View deployment

github-actions bot deployed to polar-signals-cloud October 20, 2025 11:02 View deployment

github-actions bot deployed to polar-signals-cloud October 20, 2025 11:03 View deployment

github-actions bot deployed to polar-signals-cloud October 20, 2025 11:04 View deployment

github-actions bot deployed to polar-signals-cloud October 20, 2025 11:11 View deployment

github-actions bot deployed to polar-signals-cloud October 20, 2025 11:17 View deployment

AdamGS added changelog/performance A performance improvement and removed changelog/performance A performance improvement labels Oct 21, 2025

AdamGS added the changelog/feature A new feature label Oct 21, 2025

AdamGS approved these changes Oct 21, 2025

View reviewed changes

github-actions bot deployed to polar-signals-cloud October 21, 2025 13:12 View deployment

github-actions bot deployed to polar-signals-cloud October 21, 2025 13:13 View deployment

github-actions bot deployed to polar-signals-cloud October 21, 2025 13:16 View deployment

github-actions bot deployed to polar-signals-cloud October 21, 2025 13:18 View deployment

github-actions bot deployed to polar-signals-cloud October 21, 2025 13:20 View deployment

github-actions bot deployed to polar-signals-cloud October 21, 2025 13:25 View deployment

github-actions bot deployed to polar-signals-cloud October 21, 2025 13:31 View deployment

joseph-isaacs requested changes Oct 22, 2025

View reviewed changes

github-actions bot added the stale This PR is stale and will be auto-closed soon label Jan 17, 2026

github-actions bot removed the stale This PR is stale and will be auto-closed soon label Jan 20, 2026

AdamGS mentioned this pull request Jan 21, 2026

Support datafusion dynamic filter expressions #4034

Open

Conversation

adriangb commented Oct 15, 2025

Uh oh!

adriangb Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

AdamGS Oct 21, 2025

Choose a reason for hiding this comment

Uh oh!

adriangb Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

adriangb Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

AdamGS Oct 21, 2025

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

joseph-isaacs commented Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AdamGS commented Oct 17, 2025

Uh oh!

AdamGS commented Oct 17, 2025

Uh oh!

adriangb commented Oct 18, 2025

Uh oh!

AdamGS commented Oct 18, 2025

Uh oh!

adriangb commented Oct 21, 2025

Uh oh!

AdamGS commented Oct 21, 2025

Uh oh!

AdamGS commented Oct 21, 2025

Uh oh!

AdamGS left a comment

Choose a reason for hiding this comment

Uh oh!

AdamGS commented Oct 21, 2025

Uh oh!

joseph-isaacs left a comment

Choose a reason for hiding this comment

Uh oh!

AdamGS commented Oct 22, 2025

Uh oh!

github-actions bot commented Jan 17, 2026

Uh oh!

adriangb commented Jan 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Oct 15, 2025 •

edited

Loading

joseph-isaacs commented Oct 17, 2025 •

edited

Loading