uts: stop the base-station feed from silently dying by haoruizhou · Pull Request #74 · Western-Formula-Racing/data-acquisition

haoruizhou · 2026-06-07T17:38:28Z

Two fixes for the "telemetry stops after a few random minutes" failure on
the MacBook base station.

websocket_bridge.redis_listener: wrap the pub/sub loop in a reconnect loop
with health_check_interval. Previously a single Redis connection blip (idle
timeout, transient Docker-bridge hiccup, Redis restart) made the listener
coroutine return for good while the WebSocket server kept running — PECAN
stayed connected but never received another frame, with no error surfaced.
ws_relay already reconnects this way; redis_listener now matches.
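
The reconnect pattern described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `listen_forever`, `open_stream`, `handle`, and `retry_delay` are hypothetical names, and the stream factory stands in for whatever opens the Redis pub/sub subscription. With redis-py, the client would presumably be created with `health_check_interval` set, and the except clause would catch the library's own connection exceptions.

```python
import asyncio
import logging
from typing import AsyncIterator, Awaitable, Callable

log = logging.getLogger("redis_listener")


async def listen_forever(
    open_stream: Callable[[], AsyncIterator[bytes]],
    handle: Callable[[bytes], Awaitable[None]],
    retry_delay: float = 1.0,
) -> None:
    """Forward messages to `handle`, reopening the stream whenever it fails or ends."""
    while True:
        try:
            # A fresh stream (subscription) is opened on every pass of the outer loop.
            async for message in open_stream():
                await handle(message)
            log.warning("stream ended; reconnecting in %.1fs", retry_delay)
        except (ConnectionError, OSError, asyncio.TimeoutError) as exc:
            # Assumption: a real Redis client raises its own exception types,
            # which would be listed here instead of (or next to) the builtins.
            log.warning("connection lost (%s); reconnecting in %.1fs", exc, retry_delay)
        # asyncio.CancelledError is not caught, so task cancellation still stops the loop.
        await asyncio.sleep(retry_delay)
```

The key property is that the coroutine never returns on its own: a dropped connection costs one `retry_delay` pause rather than ending the feed while the WebSocket server keeps running.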

main.py: the child-process monitor only logged "Process X died!" once per
second forever and never recovered. Because the parent stayed alive, neither
Docker's restart: unless-stopped nor systemd's Restart=always ever saw the
failure. Now a dead child tears down the surviving children and exits non-zero
so the supervisor restarts the whole stack cleanly.
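
A fail-fast monitor along these lines might look like the sketch below. It assumes the children expose the `multiprocessing.Process` interface (`is_alive()`, `terminate()`, `join()`, `name`, `exitcode`); `monitor_children`, `poll_interval`, and `join_timeout` are hypothetical names, not taken from main.py.

```python
import sys
import time
from multiprocessing import Process
from typing import Sequence


def monitor_children(
    processes: Sequence[Process],
    poll_interval: float = 1.0,
    join_timeout: float = 5.0,
) -> int:
    """Block until any child dies, stop the survivors, and return a non-zero exit code."""
    while True:
        dead = [p for p in processes if not p.is_alive()]
        if dead:
            for p in dead:
                print(f"Process {p.name} died (exitcode={p.exitcode})", file=sys.stderr)
            # Tear down the surviving children so nothing keeps running half-dead.
            survivors = [p for p in processes if p.is_alive()]
            for p in survivors:
                p.terminate()
            for p in survivors:
                p.join(timeout=join_timeout)
            return 1
        time.sleep(poll_interval)
```

The parent would then end with something like `sys.exit(monitor_children(processes))`, so that Docker or systemd observes the non-zero exit and restarts the stack.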

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 60fd029649

chatgpt-codex-connector · 2026-06-07T17:41:44Z

+            # silently keeps running half-dead. Fail fast instead: tear down the
+            # surviving children and exit non-zero so the supervisor restarts the
+            # whole stack cleanly.
+            dead = [p for p in processes if not p.is_alive()]


Avoid restarting telemetry for optional video failures

Because this check treats any child death as fatal, an optional media process can now take down the core telemetry stack. main.py starts Video whenever ENABLE_VIDEO is true, the car systemd unit sets ENABLE_VIDEO=true, and run_video() returns when GStreamer reports an error. With a faulty camera or a misconfigured GStreamer pipeline, the parent exits non-zero and systemd restarts the whole service repeatedly, so CAN telemetry never stays up even though only the optional video feed failed. Consider limiting fail-fast to critical children or making optional child failures non-fatal.
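
One way to limit fail-fast to critical children is sketched below. The names in `CRITICAL_CHILDREN` and the helper `should_restart` are hypothetical; the real set of critical processes is not shown in this thread. The idea is simply that only a dead critical child triggers a restart, while a dead optional child is logged once and then ignored.

```python
import sys
from typing import FrozenSet, Iterable, Set

# Hypothetical names: which children count as critical is an assumption.
CRITICAL_CHILDREN: FrozenSet[str] = frozenset({"CAN", "WebSocket"})


def should_restart(
    processes: Iterable,
    reported: Set[str],
    critical: FrozenSet[str] = CRITICAL_CHILDREN,
) -> bool:
    """Return True when at least one critical child has died.

    Dead optional children are logged once (tracked in `reported`) and do not
    trigger a restart.
    """
    restart = False
    for p in processes:
        if p.is_alive():
            continue
        if p.name in critical:
            restart = True
        elif p.name not in reported:
            reported.add(p.name)
            print(f"Optional process {p.name} died; continuing without it", file=sys.stderr)
    return restart
```

Called once per poll with a persistent `reported` set, this would keep CAN telemetry up when only the video process has failed.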

haoruizhou added 2 commits June 6, 2026 17:06

Install slicks from PyPI; treat critical processes

62b9a60

chatgpt-codex-connector Bot reviewed Jun 7, 2026

haoruizhou merged commit b7fbc39 into main Jun 7, 2026
40 checks passed

haoruizhou deleted the uts-telemetry-audit branch June 7, 2026 18:01

haoruizhou merged 2 commits into main from uts-telemetry-audit
