Fix agentic cli dataloading #1416

Open
wang2yn84 wants to merge 1 commit into main from lance-gsm8k

Conversation

@wang2yn84
Collaborator

This PR fixes the broken CLI data loading for gsm8k. It also refactors the standard and agentic GRPO paths into a single function, removing a lot of duplicated boilerplate code.
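
For readers without the diff open, here is a rough sketch of what folding two entry points into one function can look like. It is not the code in this PR: `GrpoConfig`, `load_prompts`, `standard_rollout`, `agentic_rollout`, and `run_grpo` are all hypothetical names made up for illustration, and the loader just returns toy strings.

```python
from dataclasses import dataclass


@dataclass
class GrpoConfig:
    # Hypothetical config object; field names are illustrative only.
    dataset_name: str
    agentic: bool = False


def load_prompts(dataset_name: str) -> list[str]:
    # Stand-in for a real dataset loader; returns three toy prompts.
    return [f"{dataset_name} question {i}" for i in range(3)]


def standard_rollout(prompt: str) -> str:
    # Placeholder for the single-turn (standard) path.
    return f"answer({prompt})"


def agentic_rollout(prompt: str) -> str:
    # Placeholder for the multi-step (agentic) path.
    return f"tool_call -> answer({prompt})"


def run_grpo(config: GrpoConfig) -> list[str]:
    # Shared setup (data loading) happens once here instead of being
    # copied into two near-identical entry points.
    prompts = load_prompts(config.dataset_name)
    # Only the rollout strategy differs between the two modes.
    rollout = agentic_rollout if config.agentic else standard_rollout
    return [rollout(p) for p in prompts]
```

With this shape, both modes go through the same data loading call, so a loader bug only needs fixing in one place.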

Reference

Colab Notebook

Checklist

  • I have added all the necessary unit tests for my change.
  • I have verified that my change does not break existing code and all unit tests pass.
  • I have added all appropriate doc-strings/documentation.
  • My PR is based on the latest changes of the main branch (if unsure, rebase the code).
  • I have signed the Contributor License Agreement.
  • I have followed Contribution Guidelines.

2 participants