feat: timeouts #457
Conversation
Force-pushed from 7182a85 to 111e53f.
This is a problem; if this is something we are going to use for the benchmarking study, we need it to work with Singularity, because CHTC only supports Singularity/Apptainer.

I originally assumed this was a more esoteric PR to test the error-handling workflow. However, based on the meeting just now, I'll look into a nice way to get this working with Singularity.
Here is my first-round review. I like the empty files when an error occurs, and it is good to have an associated log explaining why.
(Adding this comment again here) My only issue with this is the definition of "error." If a parameter combination fails the heuristics check, I do not want the output to be empty. I want the output to reflect what was actually produced, so I do not have to rerun that combination even though it "failed" the heuristics. I should be able to freely update the heuristics and have that output counted if it now passes, without rerunning combinations that previously produced empty output. In short, a parameter combination failing the heuristics should not be classified as an error as defined in this PR.
This PR not working with Singularity is a very big problem because of the CHTC integration.
"ML requires at least one pathway, and failing pathways can break ML-work. How do we want to handle downstream analysis when errors occur." This seems like a separate problem that I will fix internally in the ML code (I want to still make figures if we only have one pathway or a set of empty pathways).
This perplexes me, but from my tests we do not need `--keep-going`. I do not know my original intent here.
Co-Authored-By: Neha Talluri <78840540+ntalluri@users.noreply.github.com>
The small part here that is adaptable is that errors, in this PR, are only made in the …
This works with Apptainer now, but it now touches an untested part of profiling, so that part needs a review from @jhiemstrawisc.
ntalluri left a comment:

Here is half of a review on this PR.
> By default, whenever SPRAS runs into a container error (i.e. an internal
> algorithm error), the full workflow will fail. However, there are
> certain designated errors where we don't want this to be the case (at
Can we make this a separate section that explains each error, instead of putting it in parentheses (even though right now it is only timeouts)? We can expand it in the future.
> timeout: 1d

> The timeout string parsing is delegated to `pytimeparse
> <https://pypi.org/project/pytimeparse/>`__, which allows for more
Suggested change:
- <https://pypi.org/project/pytimeparse/>`__, which allows for more
+ <https://pypi.org/project/pytimeparse/>`__, (examples linked here). This allows for more
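For reference, a quick sketch of the duration formats pytimeparse accepts (values per the pytimeparse README; this snippet is illustrative and not part of the PR):

```python
# pytimeparse.parse converts a human-readable duration string into
# seconds, returning None when the string cannot be parsed.
from pytimeparse import parse

parse("1d")          # 86400
parse("1h32m")       # 5520
parse("5.6 days")    # 483840
parse("not a time")  # None
```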
> Timeouts
> ##########

> SPRAS allows for per-algorithm timeouts, specified under the global
Suggested change:
- SPRAS allows for per-algorithm timeouts, specified under the global
+ SPRAS allows for optional per-algorithm timeouts, specified under the global
Can you also add a sentence on what happens when timeout is not included?
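For what it's worth, one way the missing-key case could behave (a hypothetical sketch, not SPRAS's actual internals; `get_timeout_seconds` is an invented helper): treat an absent `timeout` as "no limit".

```python
# Hypothetical helper (not SPRAS's actual code): read an optional
# per-algorithm timeout from a params dict. A missing "timeout" key
# means no limit; an unparseable value is a configuration error.
from pytimeparse import parse

def get_timeout_seconds(params: dict) -> int | None:
    raw = params.get("timeout")
    if raw is None:
        return None  # no timeout configured; run without a time limit
    seconds = parse(str(raw))
    if seconds is None:
        raise ValueError(f"Unparseable timeout value: {raw!r}")
    return int(seconds)

get_timeout_seconds({"timeout": "1d"})  # 86400
get_timeout_seconds({})                 # None
```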
> def __init__(self, timeout: int, *args):
>     self.timeout = timeout
>     self.message = f"Timed out after {timeout}s."
Could we convert this back into other time scales? (If I put in 5.6 days, I don't want to have to read that as 483840 seconds.)
Suggested change:
- self.message = f"Timed out after {timeout}s."
+ self.message = f"Timed out after {timeout} seconds."
f"Timed out after {timeout} seconds; {hours} hours; {days} days ."
Something like this?
(Only do this if it is easy to do.)
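If it is easy, here is a minimal sketch of that conversion (the class name and base are placeholders; only the divmod arithmetic is the point):

```python
# Placeholder exception class demonstrating the suggested breakdown of
# seconds into days/hours/minutes; divmod peels off each unit in turn.
class ContainerTimeoutError(Exception):
    def __init__(self, timeout: int, *args):
        self.timeout = timeout
        days, rem = divmod(timeout, 86400)   # 86400 seconds per day
        hours, rem = divmod(rem, 3600)       # 3600 seconds per hour
        minutes, seconds = divmod(rem, 60)
        human = f"{days}d {hours}h {minutes}m {seconds}s"
        self.message = f"Timed out after {timeout} seconds ({human})."
        super().__init__(self.message, *args)

# ContainerTimeoutError(483840).message
# -> "Timed out after 483840 seconds (5d 14h 24m 0s)."
```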
Adds `timeout` to algorithms as a demonstration of passing through errors. Closes #316.

Caveat: ML requires at least one pathway, and failing pathways can break the ML work. How do we want to handle downstream analysis when errors occur (including, in the future, heuristic errors)?