Add basic element filter by ElliottKasoar · Pull Request #559 · ddmms/ml-peg

ElliottKasoar · 2026-05-19T11:41:48Z

Pre-review checklist for PR author

PR author must check the checkboxes below when creating the PR.

I've confirmed the contribution guidelines.

Summary

This is a (somewhat) simplified version of #512, in that each test will only filter based on the superset of elements it contains, and so we will not partially update scores.

However, this still requires most of the mechanics of the full filter:

Introduces a 'mock' calculator to ensure all test outputs are generated and saved.
Introduces a (formally unused) 'error' calculator, to check that calculations complete when non-runtime errors are raised.
Updates all calculations to pass when run with the error calculator, i.e. they should be robust to errors.
Adds info and elements attributes to apps, which store the information required for element filtering.
Adds a stores property to all apps, to make computed/raw/weights/thresholds globally accessible.
Stores the set of original data for all tables.
Adds a filter_table function to apps, which by default just returns the same data, but is the mechanism for applying the filter.
Separates handling of None and NaN, and allows None as a valid score option.
Combines all updates to summary (category and overall) tables, so that race conditions do not lead to incorrect calculations when multiple triggers occur.
Adds a callback to trigger table recalculation on store updates.
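
As a rough illustration of why None and NaN need separate handling in Python: None must be checked by identity, while NaN compares unequal to itself and `math.isnan(None)` raises a `TypeError`. The sketch below is not the code in this PR; `classify_score` and `mean_of_valid` are hypothetical helpers, and skipping both kinds of value when averaging is an assumption made only for the example.

```python
import math


def classify_score(value):
    """Label a score as 'missing' (None), 'nan' (NaN) or 'ok'."""
    # None has to be tested first: math.isnan(None) raises TypeError.
    if value is None:
        return "missing"
    if isinstance(value, float) and math.isnan(value):
        return "nan"
    return "ok"


def mean_of_valid(scores):
    """Average only the usable scores; return None if there are none.

    Assumption for this sketch: both None and NaN are skipped.
    """
    valid = [s for s in scores if classify_score(s) == "ok"]
    return sum(valid) / len(valid) if valid else None
```

Returning None when nothing usable remains keeps "no score" distinguishable from a numeric failure further up the summary tables.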

How this works more generally is:

During analysis, we must save the elements that are involved in the calculation to info.json or similar
- Ideally, this would be done in a way that can be used for the full filter later, but this is not necessary
- This is still to be done for most tests
During the app, we define a way to read the elements from info.json, and a filter_table function that uses this information, and a list of elements to be filtered, to modify the rows of the tables
- This is still to be done for most tests, but for the simple form, this should generally be almost identical to the current examples
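
The analysis and app steps above could look roughly like the following. This is a minimal sketch, not the PR's implementation: the `{"elements": [...]}` layout of info.json and the helper names `save_info` and `load_elements` are assumptions; only the warning message is taken from the diff under review.

```python
import json
import tempfile
import warnings
from pathlib import Path


def save_info(path, elements):
    """Analysis side: write the elements a test involves (hypothetical schema)."""
    path.write_text(json.dumps({"elements": sorted(elements)}))


def load_elements(path):
    """App side: read the info file back as a set of element symbols.

    Warns and returns an empty set if the file is missing or malformed.
    """
    try:
        info = json.loads(path.read_text())
        return set(info["elements"])
    except (OSError, ValueError, KeyError, TypeError):
        warnings.warn("Unable to read elements lists.", stacklevel=2)
        return set()


# Round trip through a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    info_path = Path(tmp) / "info.json"
    save_info(info_path, {"O", "H"})
    loaded = load_elements(info_path)
```

Falling back to an empty set means a test without element information is simply never filtered, rather than breaking the app.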

This is all that benchmark contributors should need to worry about. From a backend perspective, the main change is that while previously, only one table would be updated at a time, and these tables would always be on the current page, now all tables will update simultaneously, and most of these will not be rendered.

This means that we cannot rely on table.data, so must use globally accessible stored values for tables. We then also need to make sure updates to summaries are robust to multiple triggers, rather than one at a time. The current form is relatively inefficient, as it will trigger many, many times, but I think this is an ok starting point.
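
One way to picture "robust to multiple triggers" is a summary that is always recomputed from scratch from the stored table data, so repeated or out-of-order triggers give the same result. The sketch below is framework-free and purely illustrative: the `stores` layout, the `model` and `score` keys, and the per-table weights are assumptions, not the structures used in this PR.

```python
def summarise(stores, weights):
    """Recompute a weighted summary score per model from all stored tables.

    stores:  {table_id: [{"model": str, "score": float or None}, ...]}
    weights: {table_id: float}, defaulting to 1.0 for unlisted tables
    """
    totals = {}
    for table_id in sorted(stores):
        weight = weights.get(table_id, 1.0)
        for row in stores[table_id]:
            score = row["score"]
            if score is None:  # filtered-out rows do not contribute
                continue
            total, weight_sum = totals.get(row["model"], (0.0, 0.0))
            totals[row["model"]] = (total + weight * score, weight_sum + weight)
    return {model: t / w for model, (t, w) in totals.items()}


stores = {
    "a": [{"model": "m1", "score": 0.5}],
    "b": [{"model": "m1", "score": 1.0}, {"model": "m2", "score": None}],
}
summary = summarise(stores, {"a": 1.0, "b": 3.0})
```

Because `summarise` reads only the stores and mutates nothing, calling it once per trigger is wasteful but cannot produce an inconsistent summary.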

There are a few things left to do:

Update all analysis to store info.json data
Update all apps to convert the dict from info.json to a set of elements
Add filter_table to all apps to apply the filter to a table
Test the robustness of the callbacks
Try to improve callback efficiency e.g. a single global sync, which isn't trivial to set up robustly
Add store for filter inputs
Add @joehart2001's element filter input via periodic table
Add quick-select options for models

Linked issue

Resolves #255

Testing

joehart2001 · 2026-05-19T14:15:28Z

maybe best to call this file filters.py

Yes, good point: filter is a built-in, so a bit dangerous

joehart2001 · 2026-05-19T15:08:40Z

+            warnings.warn("Unable to read elements lists.", stacklevel=2)
+
+    @staticmethod
+    def filter_data(


We probably need to generalise this in some way. I'm unsure we can fully generalise it, but we could at least have different helper functions for different types of data.

Yes definitely! For now, I think we'll have two main general types: one where we only have one list, e.g. Li diffusion, and one where the elements are a list of lists. The filter should, for now, be basically the same for all apps: if there's an overlap with the superset, set all metrics to None, which should be entirely general. As we expand this, the filters will differentiate based on the data.
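
A minimal sketch of the two cases described in this reply, assuming rows are dicts and the metric columns are known by name. `element_superset` and `apply_basic_filter` are hypothetical names; the `filter_data` method in the diff is not shown in full here and may differ.

```python
def element_superset(elements):
    """Flatten either a list of symbols or a list of lists into one set."""
    superset = set()
    for item in elements:
        if isinstance(item, str):
            superset.add(item)      # single list, e.g. ["Li", "P", "S"]
        else:
            superset.update(item)   # list of lists, one per structure
    return superset


def apply_basic_filter(rows, elements, excluded, metric_keys):
    """Set every metric to None if the test involves any excluded element."""
    if element_superset(elements).isdisjoint(excluded):
        return rows  # no overlap: the table is returned unchanged
    return [
        {key: (None if key in metric_keys else value) for key, value in row.items()}
        for row in rows
    ]


rows = [{"MLIP": "model-a", "MAE": 0.1}]
filtered = apply_basic_filter(rows, [["H", "O"], ["C", "H"]], {"O"}, {"MAE"})
```

Working on the superset is what makes this form general: the whole test is either kept or blanked, so no per-structure rescoring is needed.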

joehart2001 · 2026-05-19T15:22:55Z

 from ml_peg.app import APP_ROOT
 from ml_peg.app.base_app import BaseApp
 from ml_peg.app.utils.build_callbacks import (
+    filter_table,


I think this is in base_app.py, not build_callbacks?

Ah yes the DMC_ICE/X23 examples are less up to date, sorry

joehart2001 · 2026-05-19T15:26:57Z

+        )
+
+    states = []
+    for entry in app_entries:


Here we use unsorted app entries, but we use sorted entries above. I think this will assign the incorrect weights, as we don't assign to a specific key.

Ah yes, I think this should be sorted too, good catch
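
The kind of mismatch raised here can be reproduced in isolation. The sketch below does not use the PR's actual data structures; `app_entries`, the `id` and `weight` keys, and the positional pairing are assumptions made for illustration.

```python
# Hypothetical entries, deliberately not in sorted order.
app_entries = [
    {"id": "x23", "weight": 2.0},
    {"id": "dmc_ice", "weight": 1.0},
]
sorted_ids = sorted(entry["id"] for entry in app_entries)

# Problem: pairing sorted ids with unsorted entries by position swaps weights.
mismatched = {
    table_id: entry["weight"] for table_id, entry in zip(sorted_ids, app_entries)
}

# Fix 1: sort the entries the same way before pairing by position.
sorted_entries = sorted(app_entries, key=lambda entry: entry["id"])
by_position = {
    table_id: entry["weight"] for table_id, entry in zip(sorted_ids, sorted_entries)
}

# Fix 2: avoid positional pairing and assign by key directly.
by_key = {entry["id"]: entry["weight"] for entry in app_entries}
```

Assigning by key removes the dependence on ordering altogether, which may be the more robust option if the entry order can change.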

ElliottKasoar requested a review from joehart2001 May 19, 2026 11:41

ElliottKasoar added the enhancement (New feature or request) and breaking (API breaking changes) labels May 19, 2026

joehart2001 reviewed May 19, 2026

Comment thread ml_peg/calcs/conftest.py

joehart2001 reviewed May 19, 2026

Comment thread ml_peg/app/molecular_crystal/DMC_ICE13/app_DMC_ICE13.py

joehart2001 reviewed May 19, 2026

Comment thread ml_peg/app/utils/register_callbacks.py

joehart2001 reviewed May 19, 2026

ElliottKasoar added 20 commits May 19, 2026 17:43

Allow calculations to fail

4581535

Add mock models

6d79ba3

Add new CLI option to run mock model

aea409c

Add option to only run mock

a02a87f

Refactor getting info during analysis

b16c602

Load info and add element dropdown

baa067b

Refactor analysis for reuse in app

c4220fc

Update app

fd9cf3f

Write mock structs during analysis

57ed1aa

Make element filter deselective

7a354b0

Update apps for filter

fdcb20c

Update analysis to save structures

00884eb

Fix mock model for precision as kwarg

cb2a0ec

Add filter callback

974687d

Fix filter list from apps

1bfeac4

Simply inputs

f0930f5

Reorder parameters

1b16652

Warn if no mock directory

05489f5

Print missing mock dir in warning

f16f4ed

Update Li diffusion for filtering

b6567e1

ElliottKasoar added 15 commits May 19, 2026 17:43

Allow NaNs in metrics

1acf02f

Warn for missing info file

c804771

Allow null filter

789aeee

Fix missing models during analysis

5326e8b

Add utility functions for filtering

47e2e6b

Move mock parameters

68842ca

Fix unhandled errors

26fb122

Fix writing structures with energy keys

698d3dd

Change thresholds from optional

d34e26a

Return booleans from app filter

d550029

Temp update callbacks

9ac5721

Change application of filter

7de1705

Store filter booleans

b84be54

Revert "Temp update callbacks"

8d7d9c8

This reverts commit d3c994a.

Update filter callbacks and app builder

ba17009

ElliottKasoar force-pushed the add-element-filter-basic branch from f68f79d to ba17009 Compare May 19, 2026 16:43

ElliottKasoar mentioned this pull request May 19, 2026

Add mock calculators #560

Merged

1 task

Add basic element filter #559
ElliottKasoar wants to merge 35 commits into ddmms:main from ElliottKasoar:add-element-filter-basic
