Signal loop by belliottsmith · Pull Request #4788 · apache/cassandra

belliottsmith · 2026-05-07T14:58:48Z

Thanks for sending a pull request! Here are some tips if you're new here:

Ensure you have added or run the appropriate tests for your PR.
Be sure to keep the PR description updated to reflect all changes.
Write your PR title to summarize what this PR proposes.
If possible, provide a concise example to reproduce the issue for a faster review.
Read our contributor guidelines
If you're making a documentation change, see our guide to documentation contribution

Commit messages should follow the following format:

<One sentence description, usually Jira title or CHANGES.txt summary>

<Optional lengthier description (context on patch)>

patch by <Authors>; reviewed by <Reviewers> for CASSANDRA-#####

Co-authored-by: Name1 <email1>
Co-authored-by: Name2 <email2>

The Cassandra Jira

- Clean Shutdown/Restart - Rebootstrap to allow nodes to rejoin consensus if they are out of sync, or did not shutdown cleanly Improve: - Improve efficiency of BTreeReducingRangeMap - DurableBefore backed by BTreeReducingRangeMap - Soft reject new transactions when a replica has a backlog of work - system_accord_debug.command_store_ops supports starting journal replay Fix: - Ensure SequentialExecutor.owner is unset only by the owner - Detect and avoid taking two AccordExecutor locks simultaneously - MaxConflicts serializer - tryToExecuteListening after replay, to handle invalidated dependencies - TxnNamedRead does not close a RowIterator, leading to native memory leaks - GetLatestDepsNack serialization - journal.readLast() did not read last - DefaultRemoteListeners not correctly synchronised - RangeTxnScanner cancellation assumes was already started - Start cfk compaction before replay - txn_graph should avoid visiting parents twice (possible if two different dependency chains connecting them in the graph) - Topology.cloneEquivalentWithEpoch should also share nodeLookup and ranges, especially to accelerate computeWaitForEpoch/computeScope - Do not invoke slowReplicaDelay or slowCoordinatorDelay during restore - Do not fail if slowReplicaDelay or slowCoordinatorDelay are invoked without knowing the transaction

- TransactionStatement token aware routing - Mitigate convoy effect: - ExecuteTxnBacklog v1 can execute single key transaction backlogs and disseminate the result - Direct local execution possible for single key transactions - Use shardAppliedBefore instead of gcBefore to cleanup CFK faster Also Improve: - Introduce distributed tracing - Combine RejectBefore and MaxConflicts - Self-addressed messages are delivered directly - Disable AccordSyncPropagator by default, as incompatible with large clusters - FastPathStrategy.SIMPLE should not modify fast path contents; FastPathStrategy.UP introduced to adopt this behaviour - Configurable queue prioritisation - ProtocolModifiers mostly final - Remove unnecessary condition check when serializing TxnUpdate Also Fix: - SequentialExecutor ownership bug - BTree.XUpdater.reset()

- consumer and producer threads wait signal without acquiring the lock first - lock owners may prepare more work than they need, moving some dynamically-adjusted portion of the work from the prioritised lock-managed structures onto a non-blocking queue, so that other threads may consume work from there; the portion is continually micro-adjusted to target some available work whenever the lock is acquired. - adopts/supports some features of the SEPExecutor: - threads may auto-adjust the number of running threads based on how much time is collectively spent waiting, to minimise time spent signalling - consumers may (timed-sleep) poll rather than await a signal, reducing the number of kernel interactions needed; relying on the auto-adjustment to bound the time the scheduler spends waking threads with no work to do

aweisberg · 2026-05-07T16:34:24Z

+1

belliottsmith and others added 4 commits May 7, 2026 15:20

CASSANDRA-17024: Artificial Latency Injection

86fae9b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Signal loop#4788

Signal loop#4788
belliottsmith wants to merge 4 commits intoapache:trunkfrom
belliottsmith:signal-loop

belliottsmith commented May 7, 2026

Uh oh!

aweisberg commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

belliottsmith commented May 7, 2026

Uh oh!

aweisberg commented May 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants