Feat: Enable celery workers and beat by maltesander · Pull Request #724 · stackabletech/superset-operator

maltesander · 2026-04-23T14:53:53Z

Description

Added two new roles: worker and beat (beat is limited to at most 1 replica).
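The replica limit on beat can be sketched as a small validation step. Celery beat is a scheduler, and running more than one instance for the same schedule would enqueue duplicate periodic tasks. The sketch below is illustrative only: `ReplicaError`, `validate_beat_replicas`, and the default of 1 for an unset value are assumptions, not names or behavior taken from the operator code.

```rust
/// Hypothetical error type for this sketch; the operator's real error
/// variants are not shown in this PR.
#[derive(Debug, PartialEq)]
pub enum ReplicaError {
    TooManyBeatReplicas { requested: u16 },
}

/// Accepts only 0 or 1 replicas for the beat role.
///
/// Assumption: an unset replica count falls back to 1.
pub fn validate_beat_replicas(requested: Option<u16>) -> Result<u16, ReplicaError> {
    let replicas = requested.unwrap_or(1);
    if replicas > 1 {
        // More than one scheduler would duplicate periodic tasks.
        return Err(ReplicaError::TooManyBeatReplicas { requested: replicas });
    }
    Ok(replicas)
}
```

Calling `validate_beat_replicas(Some(3))` returns the `TooManyBeatReplicas` error, while `Some(0)` and `Some(1)` pass through unchanged.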

New integration test works
Logging configurable (currently hardcoded info for worker & beat)
Docs are missing; examples etc. are not updated yet.

Currently, RESULTS_BACKEND only supports Redis, no S3 support.
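One way to picture the Redis-only restriction is a closed enum with a single variant, which leaves room for another backend (such as S3) later. This is a minimal sketch under stated assumptions: the type `ResultsBackend`, its field names, and the `connection_url` helper are hypothetical and do not reflect the CRD or how the operator renders the Superset configuration. Only the `redis://host:port/db` URL scheme is standard.

```rust
/// Hypothetical model of a results backend; field names are assumptions.
#[derive(Debug, Clone, PartialEq)]
pub enum ResultsBackend {
    /// The only backend kind described as supported in this PR.
    Redis {
        host: String,
        port: u16,
        database_id: u16,
    },
}

impl ResultsBackend {
    /// Renders a conventional Redis URL of the form `redis://host:port/db`.
    /// Credentials are deliberately left out of this sketch.
    pub fn connection_url(&self) -> String {
        match self {
            ResultsBackend::Redis {
                host,
                port,
                database_id,
            } => format!("redis://{host}:{port}/{database_id}"),
        }
    }
}
```

Because the `match` is exhaustive, adding a second variant later would force every call site to handle it explicitly.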

Definition of Done Checklist

Not all of these items are applicable to all PRs. The author should update this template to leave in only the boxes that are relevant.
Please make sure all these things are done and tick the boxes.

Author

Changes are OpenShift compatible
CRD changes approved
CRD documentation for all fields, following the style guide.
Helm chart can be installed and deployed operator works
Integration tests passed (for non trivial changes)
Changes need to be "offline" compatible
Links to generated (nightly) docs added
Release note snippet added

Reviewer

Code contains useful comments
Code contains useful logging statements
(Integration-)Test cases added
Documentation added or updated. Follows the style guide.
Changelog updated
Cargo.toml only contains references to git tags (not specific commits or branches)

Acceptance

Feature Tracker has been updated
Proper release label has been added
Links to generated (nightly) docs added
Release note snippet added
Add type/deprecation label & add to the deprecation schedule
Add type/experimental label & add to the experimental features tracker

adwk67

Looks good - mainly minor things.

adwk67 · 2026-04-28T08:37:51Z

+        host: superset-postgresql
+        database: superset
+        credentialsSecretName: superset-postgresql-credentials
+    celeryResultsBackend:


In Airflow, we put celeryResultsBackend and celeryBroker in the worker so that they are included where needed. Should we do the same here for consistency?

Sebastian mentioned the same thing. I think this is wrong in Airflow.
It is used by ALL roles, so it clearly belongs in the clusterConfig. I won't push it down to the workers just to avoid some consistency checks.

It's more to do with avoiding unintended misconfiguration: yes, it is used by all roles, but it is only relevant if the resource has a worker role defined in the first place. It's easier to overlook inconsistencies if the dependency is less obvious.

By that logic, why is the metadata database in clusterConfig (only the webserver needs it)? Or any authorization/authentication, in many cases? We might need it in the future, and this is clearly cluster-wide configuration.

As I said, I think it is just plain wrong in Airflow (or at least bad design). The clusterConfig is there exactly to avoid having to cross-reference things between "siblings". Cross-referencing is bad practice in programming and bad API design because it introduces opacity where none is needed.

Why is it bad design to have a section of config that is only relevant to an optional role defined as part of that role? Apart from OpenSearch (which is different in many ways), I can only think of Airflow (kubernetes vs. celery executors), Kafka (optional controller role) and Superset (optional worker/beat roles) where we have that in the platform. Do we have other operators where we include things under clusterConfig only for optional roles?

> By that logic, why is the metadata database clusterConfig (only the webserver needs it)

The difference here is that the webserver role is not optional.

It is a bad design because the assumption that it is only relevant to an optional role is wrong.
Once the role is defined, it is relevant to ALL roles.

A webserver role should not be concerned with anything in the worker role. If it needs anything from the worker, this is already a clear sign that the API design is bad :-)

> By that logic, why is the metadata database clusterConfig (only the webserver needs it)
>
> The difference here is that the webserver role is not optional.

Well, then that is all the more reason to put it into its own role...?
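The consistency check both sides refer to can be sketched as follows, assuming the Celery connections live in the cluster-wide config and are only meaningful once a worker role is defined. All names here (`ClusterConfig`, `Roles`, `CeleryCheck`, `check_celery`) are hypothetical and simplified for illustration; this is not the operator's validation code.

```rust
/// Hypothetical, simplified cluster-wide config: connections as plain strings.
#[derive(Debug, Default)]
pub struct ClusterConfig {
    pub celery_broker: Option<String>,
    pub celery_results_backend: Option<String>,
}

/// Hypothetical view of which optional roles are defined.
#[derive(Debug, Default)]
pub struct Roles {
    pub worker_defined: bool,
}

#[derive(Debug, PartialEq)]
pub enum CeleryCheck {
    /// Worker role defined and both connections present.
    Enabled,
    /// No worker role and no Celery config.
    Disabled,
    /// Celery config present but no worker role: the config is ignored.
    ConfigIgnored,
}

#[derive(Debug, PartialEq)]
pub enum CeleryConfigError {
    MissingBroker,
    MissingResultsBackend,
}

/// Cross-checks the cluster-wide Celery config against the defined roles.
pub fn check_celery(
    cluster: &ClusterConfig,
    roles: &Roles,
) -> Result<CeleryCheck, CeleryConfigError> {
    let any_config =
        cluster.celery_broker.is_some() || cluster.celery_results_backend.is_some();
    if !roles.worker_defined {
        // Without workers, Celery settings have no effect.
        return Ok(if any_config {
            CeleryCheck::ConfigIgnored
        } else {
            CeleryCheck::Disabled
        });
    }
    if cluster.celery_broker.is_none() {
        return Err(CeleryConfigError::MissingBroker);
    }
    if cluster.celery_results_backend.is_none() {
        return Err(CeleryConfigError::MissingResultsBackend);
    }
    Ok(CeleryCheck::Enabled)
}
```

Placing the connections under the worker role instead would make the `ConfigIgnored` case unrepresentable, at the cost of other roles having to read a sibling role's section. That is the trade-off under discussion in this thread.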

adwk67 · 2026-04-28T09:07:54Z

+        ///
+        /// Ignored otherwise.
+        #[serde(skip_serializing_if = "Option::is_none")]
+        pub celery_results_backend: Option<CeleryResultsBackendConnection>,


See the other comment: in Airflow this is part of the worker role.

Very against it :)

adwk67 · 2026-04-28T14:18:15Z

OpenShift tests 🟢 https://testing.stackable.tech/view/02%20Operator%20Tests%20(custom)/job/superset-operator-it-custom/45/

We will bring up the CR open question in the planning meeting.

maltesander added 15 commits February 26, 2026 15:18

wip - add roles & cleanup resource structure

33a9c3d

wip - cleanup

84fa755

wip - more refactoring

b4597cb

wip - added celery test

3846bd7

improve test

d2fc6c9

wip - improve testing

2e9892c

add logs for worker

66c8b03

Merge remote-tracking branch 'origin/main' into feat/enable-celery-wo…

93a1726

…rkers

fix: remaining compile errors after merge.

07622b4

fix: adapt celery worker test to breaking secret changes

b5c7bfe

fix: regenerated charts after merge

a3ce2fb

fix: adapt celery worker test to EXPERIMENTAL_FILE_FOOTER changes.

0969bfb

feat: add celery backend and broker database to crd

756f271

feat: wire backend and broker crd values

b4133f8

test: celery worker test successful with beat.

586c9cb

maltesander self-assigned this Apr 23, 2026

maltesander changed the title ~~Feat: Eenable celery workers and beat~~ Feat: Enable celery workers and beat Apr 23, 2026

fix: pre-commit

96e948e

maltesander added this to Stackable Engineering Apr 24, 2026

maltesander moved this to Development: In Progress in Stackable Engineering Apr 24, 2026

maltesander and others added 8 commits April 24, 2026 13:24

fix: set beat replicas to 1 or 0 only.

ec8dea7

docs: extend database connection for broker and results backend.

843d1b3

docs: fix missing footnotes.

e9d5f6c

docs: add celery async query docs.

69a7087

Merge branch 'main' into feat/enable-celery-workers

0ddf79a

docs: remove outdated code comments.

b13c2f6

fix: remove outdated comment

60e89b3

fix: remove generic from results backend; clean up unwraps.

daf0884

maltesander linked an issue Apr 25, 2026 that may be closed by this pull request

Support Async Queries via Celery #698

Open

fix: remove obsolete comment

3d0b5cc

maltesander added 5 commits April 25, 2026 15:49

fix: increase worker memory to 4GB for version 6.0.0

96cc92f

fix: replicas for beat

50430eb

fix: rename result_backend_* to results_backend_*

1a56aeb

clippy: fix host lint.

9fe6794

fix: enable proper logging for worker and beat.

00d603b

maltesander marked this pull request as ready for review April 27, 2026 09:28

maltesander moved this from Development: In Progress to Development: Waiting for Review in Stackable Engineering Apr 27, 2026

docs: adapt changelog.

ed93462

adwk67 self-requested a review April 27, 2026 14:54

adwk67 moved this from Development: Waiting for Review to Development: In Review in Stackable Engineering Apr 27, 2026

adwk67 reviewed Apr 27, 2026

Comment thread docs/modules/superset/pages/usage-guide/celery-async-queries.adoc

fix: use database id for results backend.

6660043

adwk67 requested changes Apr 28, 2026

maltesander added 5 commits April 28, 2026 11:34

fix: error variants in statefulset & deployment

f6bcf04

fix: optional role definition in validated config.

6ec3054

docs: improve beat replica documentation

f9b0f86

docs: fix linter

08a377d

fix: remove obsolete logging TODOs.

fc62924

adwk67 reviewed Apr 28, 2026

Comment thread rust/operator-binary/src/superset_controller.rs Outdated

maltesander added 2 commits April 28, 2026 12:53

fix: remove c&p code comment

e9f8b45

fix: remove "missing" from celery results backend connection details.

b2f9e60


2 participants
