Skip to content

Fix evalbench core robustness#248

Open
aseering wants to merge 11 commits intoGoogleCloudPlatform:mainfrom
aseering:fix-evalbench-core-robustness
Open

Fix evalbench core robustness#248
aseering wants to merge 11 commits intoGoogleCloudPlatform:mainfrom
aseering:fix-evalbench-core-robustness

Conversation

@aseering
Copy link
Contributor

  • evalbench.py: Support dict-based dataset_config extraction.
  • postgres.py, mysql.py: Update to support local databases
  • postgres.py: Enable UNIX domain socket connections for local auth.
  • oneshotorchestrator.py: Add database name mapping/overrides for multi-engine evaluations.
  • analyzer.py: Fix KeyError and ZeroDivisionError during reporting.

@google-cla
Copy link

google-cla bot commented Feb 21, 2026

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

@aseering aseering force-pushed the fix-evalbench-core-robustness branch from 5b65c62 to 0e2bf12 Compare February 21, 2026 03:55
@IsmailMehdi
Copy link
Collaborator

/gcbrun

IsmailMehdi
IsmailMehdi previously approved these changes Feb 21, 2026
@aseering
Copy link
Contributor Author

Oops -- I believe I've now removed the extra commits

@IsmailMehdi
Copy link
Collaborator

/gcbrun

aseering added 6 commits March 9, 2026 08:28
- evalbench.py: Support dict-based dataset_config extraction.
- postgres.py: Enable UNIX domain socket connections for local auth.
- oneshotorchestrator.py: Add database name mapping/overrides for multi-engine evaluations.
- analyzer.py: Fix KeyError and ZeroDivisionError during reporting.
- databases/__init__.py: Register SpannerDB and MongoDB in factory.
- mysql.py: Improve connection pooling and Cloud SQL handling.
- sqlite.py: Implement copy-on-write for temporary databases to support file-based datasets.
- databases/__init__.py: Register SpannerDB and MongoDB in factory.
- mysql.py: Improve connection pooling and Cloud SQL handling.
- sqlite.py: Implement copy-on-write for temporary databases to support file-based datasets.
@aseering aseering force-pushed the fix-evalbench-core-robustness branch from 5448356 to a81d1d5 Compare March 9, 2026 08:28
- Updated `evalbench/test/mongodb_test.py` to match the expected data format for `insert_data`.
- Fixed `batch_execute` in `evalbench/databases/spanner.py` to correctly use `self.database.update_ddl` instead of just executing standard queries, as Spanner snapshots do not support DDL commands.
- Updated `evalbench/test/spanner_test.py` to use `batch_execute` for DDL statements (CREATE TABLE, DROP TABLE).
- Modified `.pycodestyle` to ignore `W504` (line break after binary operator), which conflicts with `W503` (line break before binary operator).
@aseering aseering force-pushed the fix-evalbench-core-robustness branch from 5ff04a8 to f21a310 Compare March 9, 2026 16:58
@aseering aseering force-pushed the fix-evalbench-core-robustness branch from b54e758 to 4b2682b Compare March 9, 2026 17:36
- Used autopep8 and sed to clean up all W291 and W293 errors in the python source.
@aseering aseering force-pushed the fix-evalbench-core-robustness branch from 4b2682b to 3121072 Compare March 9, 2026 17:44
@IsmailMehdi
Copy link
Collaborator

/gcbrun

@IsmailMehdi
Copy link
Collaborator

/gcbrun

@IsmailMehdi
Copy link
Collaborator

/gcbrun

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants