mfcsorg
Popular repositories Loading
-
-
mfcs-bench
mfcs-bench PublicMFCS-Bench is a benchmark suite for evaluating large language models (LLMs) on function calling tasks based on the MFCS protocol. It standardizes the evaluation of how well different LLMs handle st…
Python 2
-
-
Repositories
Showing 6 of 6 repositories
- mfcs-bench Public
MFCS-Bench is a benchmark suite for evaluating large language models (LLMs) on function calling tasks based on the MFCS protocol. It standardizes the evaluation of how well different LLMs handle structured function calls, offering robust metrics and visualization tools to compare model performance across various tasks.
mfcsorg/mfcs-bench’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…