feat: add SparkPow UDF returning Infinity for pow(0, negative) by Brijesh-Thakkar · Pull Request #22605 · apache/datafusion

Brijesh-Thakkar · 2026-05-28T20:42:10Z

Which issue does this PR close?

Rationale for this change

In Apache Spark, the pow(base, exp) function follows IEEE 754 semantics where raising 0 (or -0.0) to a negative exponent yields positive Infinity.

Currently, DataFusion's default core PowerFunc mimics PostgreSQL behavior, throwing an explicit error ("zero raised to a negative power is undefined"). To support standard Spark compatibility without breaking core DataFusion expectations, this PR introduces a specialized SparkPow UDF inside the datafusion-spark crate.

What changes are included in this PR?

This PR introduces the following changes within the datafusion-spark integration crate:

Added SparkPow UDF (datafusion/spark/src/function/math/pow.rs): Overrides the Float64 execution path to evaluate base == 0.0 && exp < 0.0 as f64::INFINITY (safely catching both 0.0 and -0.0 due to IEEE 754 equality rules).
Decimal Delegation: Preserves correctness by delegating non-float types (like decimals) back to the standard PowerFunc, as decimals cannot represent infinity.
Function Registration (datafusion/spark/src/function/math/mod.rs): Registers the new pow function and establishes power as a valid alias.
SQL Integration Tests (datafusion/sqllogictest/test_files/spark/math/pow.slt): Updates and adds test coverage ensuring pow(0, -1), power(0, -1), and pow(0.0, -1.0) successfully return Infinity.

Are these changes tested?

Yes, the changes are covered via both unit and integration tests:

Unit Tests: Added test_spark_pow_zero_negative_returns_infinity and test_spark_pow_normal_cases within pow.rs to validate the core scalar execution logic.
Integration Tests: Extended datafusion/sqllogictest/test_files/spark/math/pow.slt to verify the end-to-end SQL evaluation behavior.

Are there any user-facing changes?

Yes, but only for users utilizing the datafusion-spark compatibility features. When the Spark dialect/crate is active, evaluating pow(0, <negative>) will now return Infinity instead of throwing an evaluation error. Core DataFusion behavior remains completely unchanged.

Spark returns Infinity for pow(0, <negative>) following IEEE 754, while the DataFusion default (PowerFunc) raises an error to match PostgreSQL behavior. This adds SparkPow to the datafusion-spark crate which overrides the Float64 path to explicitly return +Infinity when base == 0.0 and exp < 0.0 (covers both 0.0 and -0.0), and delegates all decimal types to the existing PowerFunc. Both 'pow' and 'power' aliases are covered. Closes apache#22598

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds a Spark-compatible pow/power implementation and updates SLT expectations to match Spark’s 0 ^ negative = Infinity behavior.

Changes:

Introduces SparkPow UDF that overrides DataFusion’s default pow/power semantics for 0 ^ negative.
Registers the new function in the Spark math module and exposes it via expr_fn.
Updates SLT cases to assert Spark’s Infinity results for pow(0, -1) variants.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

File	Description
datafusion/sqllogictest/test_files/spark/math/pow.slt	Updates Spark SLT expectations for `pow` and adds `0 ^ negative` coverage expecting `Infinity`.
datafusion/spark/src/function/math/pow.rs	Adds `SparkPow` UDF wrapping `PowerFunc` with Spark-specific edge-case behavior and unit tests.
datafusion/spark/src/function/math/mod.rs	Registers `pow` UDF module, exports it, and adds to the function registry.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+        // Only Float64 needs the Spark override.
+        // Decimal / integer paths are delegated to the standard PowerFunc which
+        // already handles them correctly (decimal can't represent Infinity anyway).
+        if !matches!(args.args[0].data_type(), DataType::Float64) {
+            return self.inner.invoke_with_args(args);


+                (Some(b), Some(e)) => {
+                    if b == 0.0 && e < 0.0 {
+                        Some(f64::INFINITY)
+                    } else {
+                        Some(b.powf(e))
+                    }
+                }


+        // ── Array path ───────────────────────────────────────────────────────
+        let [base, exponent] = take_function_args(self.name(), &args.args)?;
+
+        let base_arr: ArrayRef = base.to_array(num_rows)?;
+        let exp_arr: ArrayRef = exponent.to_array(num_rows)?;


+query R
+SELECT pow(2::int, 3::int);
+----
+8


comphead

Thanks @Brijesh-Thakkar this a solid PR, please remove tests from pow.rs, those are repetitive.

for the pow.slt lets have more double edgecases, specifically, nulls, nans, -0, +0, -inf, +Inf

Spark returns Infinity for pow(0, <negative>) following IEEE 754, while the DataFusion default (PowerFunc) raises an error to match PostgreSQL behavior. This adds SparkPow to the datafusion-spark crate which overrides the Float64 path to return +Infinity when base == 0.0 and exp < 0.0 (covers both 0.0 and -0.0). All decimal types delegate to PowerFunc. Both 'pow' and 'power' aliases are covered. Adds sqllogictest edge cases for: nulls, NaN, signed zeros (-0/+0), and signed infinities (-Inf/+Inf) including array and mixed paths. Closes apache#22598

Brijesh-Thakkar · 2026-05-29T06:32:58Z

Thanks @Brijesh-Thakkar this a solid PR, please remove tests from pow.rs, those are repetitive.

for the pow.slt lets have more double edgecases, specifically, nulls, nans, -0, +0, -inf, +Inf

@comphead Removed tests from pow.rs file as you sugegested and also added more edge cases
Thank you

comphead · 2026-05-29T18:42:06Z

+        ] = args.args.as_slice()
+        {
+            // b and e are &Option<f64>; Option<f64> is Copy.
+            let result = (*b).zip(*e).map(|(b, e)| {


lets call it base, exp instead of b, e

comphead · 2026-05-29T18:43:08Z

+        let result: Float64Array = base_f64
+            .iter()
+            .zip(exp_f64.iter())
+            .map(|(b, e)| match (b, e) {


Brijesh-Thakkar · 2026-05-29T18:45:34Z

@comphead I will fix this and commit the changes
Thanks for review

Brijesh-Thakkar · 2026-05-29T18:51:19Z

@comphead I have done the changes as you suggested and commited them as well
Thanks

Brijesh-Thakkar · 2026-05-29T19:13:38Z

@comphead All requested changes have been addressed and checks are passing. Could you please approve the PR when you get a chance? Thanks!

comphead

Thanks @Brijesh-Thakkar lgtm!

Copilot AI review requested due to automatic review settings May 28, 2026 20:42

github-actions Bot added sqllogictest SQL Logic Tests (.slt) spark labels May 28, 2026

Merge branch 'main' into spark-pow-func

222d5b5

Copilot AI reviewed May 28, 2026

View reviewed changes

suggestions from copilot done

5710f0c

comphead reviewed May 28, 2026

View reviewed changes

Brijesh-Thakkar added 2 commits May 29, 2026 12:01

Merge branch 'main' into spark-pow-func

a3bfd42

Brijesh-Thakkar added 4 commits May 29, 2026 13:30

Merge branch 'main' into spark-pow-func

ced6b66

Merge branch 'main' into spark-pow-func

6cfcc48

Merge branch 'main' into spark-pow-func

559c11b

Merge branch 'main' into spark-pow-func

96a6102

comphead reviewed May 29, 2026

View reviewed changes

refactor(spark): use descriptive variable names in pow

07ddab9

comphead enabled auto-merge May 29, 2026 19:01

comphead approved these changes May 29, 2026

View reviewed changes

comphead added this pull request to the merge queue May 29, 2026

Merged via the queue into apache:main with commit d8c4588 May 29, 2026
35 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add SparkPow UDF returning Infinity for pow(0, negative)#22605

feat: add SparkPow UDF returning Infinity for pow(0, negative)#22605
comphead merged 10 commits into
apache:mainfrom
Brijesh-Thakkar:spark-pow-func

Brijesh-Thakkar commented May 28, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

comphead left a comment

Uh oh!

Brijesh-Thakkar commented May 29, 2026

Uh oh!

comphead May 29, 2026

Uh oh!

comphead May 29, 2026

Uh oh!

Brijesh-Thakkar commented May 29, 2026

Uh oh!

Brijesh-Thakkar commented May 29, 2026

Uh oh!

Brijesh-Thakkar commented May 29, 2026

Uh oh!

comphead left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Brijesh-Thakkar commented May 28, 2026

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

comphead left a comment

Choose a reason for hiding this comment

Uh oh!

Brijesh-Thakkar commented May 29, 2026

Uh oh!

comphead May 29, 2026

Choose a reason for hiding this comment

Uh oh!

comphead May 29, 2026

Choose a reason for hiding this comment

Uh oh!

Brijesh-Thakkar commented May 29, 2026

Uh oh!

Brijesh-Thakkar commented May 29, 2026

Uh oh!

Brijesh-Thakkar commented May 29, 2026

Uh oh!

comphead left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants