Skip to content

Add 8 new models from v1.2.1 benchmark results#21

Merged
olearycrew merged 1 commit intomainfrom
add-v1.2.1-models
Apr 8, 2026
Merged

Add 8 new models from v1.2.1 benchmark results#21
olearycrew merged 1 commit intomainfrom
add-v1.2.1-models

Conversation

@ScuttleBot
Copy link
Copy Markdown

New models added from PinchBench v1.2.1 benchmarks

Model Best Avg
bytedance-seed/seed-2.0-lite 85.2% 82.0%
google/gemma-4-26b-a4b-it 83.9% 78.0%
x-ai/grok-4.20 79.9% 67.8%
openai/gpt-5.4-nano 78.5% 72.6%
mistralai/mistral-small-2603 76.7% 73.2%
google/gemma-4-31b-it 76.4% 69.9%
openai/gpt-5.4-mini 76.2% 72.0%
z-ai/glm-5v-turbo 66.6% 62.7%

All results from official PinchBench v1.2.1 runs.

New models added:
- bytedance-seed/seed-2.0-lite (85.2% best, 82% avg)
- google/gemma-4-26b-a4b-it (83.9% best, 78% avg)
- google/gemma-4-31b-it (76.4% best, 69.9% avg)
- x-ai/grok-4.20 (79.9% best, 67.8% avg)
- openai/gpt-5.4-mini (76.2% best, 72% avg)
- openai/gpt-5.4-nano (78.5% best, 72.6% avg)
- mistralai/mistral-small-2603 (76.7% best, 73.2% avg)
- z-ai/glm-5v-turbo (66.6% best, 62.7% avg)

All models benchmarked on PinchBench v1.2.1
@olearycrew olearycrew merged commit e597f5f into main Apr 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants