Merged
Conversation
added 30 commits
August 5, 2025 00:25
# Conflicts: # pyproject.toml
…to use async context manager for file watching lifecycle. Enhance WebSocket connection handling in the frontend with exponential backoff for reconnection attempts.
…tions in Agent class for tool results and tool call execution.
…ocalFSDatasetLoggerAdapter to use current JSONL file path. Enhance read method to aggregate rows from all JSONL files in the dataset directory.
added 25 commits
August 5, 2025 17:23
…eplacing table with div elements
…d simplify message display
…on Result and Ground Truth, improving code readability and maintainability.
…ty and positioning, enhancing user experience for chat window adjustments.
…ermine if the evaluation passed based on the threshold of success.
…to gray-600 for improved visual consistency.
…lexibility for displaying connection and evaluation statuses. Update App and Row components to utilize the new StatusIndicator implementation.
…tor, simplifying initialization. Update _call_model method to return a Message object with structured response data. Modify default_agent_rollout_processor to utilize the new constructor and streamline dataset population. Enhance evaluation_test to initialize eval_metadata for each row before running rollouts, ensuring consistent metadata handling.
…alization. Ensure eval_metadata is set for each row before rollouts, and enhance exception management to log errors appropriately while maintaining pytest behavior.
…le. This update ensures that unnecessary files are ignored during version control, streamlining project management.
…r Stdio MCP Config usage in decorator
…r chat window resizing from 80% to 66% of the available width, improving layout responsiveness.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
progress so far: