10 changes: 10 additions & 0 deletions providers/together-ai/Qwen/Qwen3.6-Plus.yaml
@@ -0,0 +1,10 @@
costs:
- input_cost_per_token: 5e-7
  output_cost_per_token: 3e-6
  region: "*"
limits:
  context_window: 1000000
mode: unknown
model: Qwen/Qwen3.6-Plus
supportedModes:
- chat
11 changes: 11 additions & 0 deletions providers/together-ai/deepseek-ai/DeepSeek-V4-Pro.yaml
@@ -0,0 +1,11 @@
costs:
- cache_read_input_token_cost: 2e-7
  input_cost_per_token: 2.1e-6
  output_cost_per_token: 4.4e-6
  region: "*"
limits:
  context_window: 512000
mode: unknown
model: deepseek-ai/DeepSeek-V4-Pro
supportedModes:
- chat
8 changes: 8 additions & 0 deletions providers/together-ai/mistralai/Voxtral-Mini-3B-2507.yaml
@@ -0,0 +1,8 @@
costs:
- input_cost_per_token: 2.7e-7
  output_cost_per_token: 8.5e-7

Audio transcription model uses per-token instead of per-second pricing

High Severity

Voxtral-Mini-3B-2507.yaml is declared as an audio_transcription model but uses input_cost_per_token and output_cost_per_token. Every other audio transcription model in this provider that has costs (openai/whisper-large-v3, nvidia/parakeet-tdt-0.6b-v3) uses input_cost_per_second instead. Using per-token pricing for an audio input model will produce incorrect cost calculations.
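The mismatch described above could be caught mechanically. The following is a minimal sketch, not part of this repository: the function name `find_pricing_mismatches` is hypothetical, and it assumes each model file has already been parsed into a plain dict with the `costs` and `supportedModes` keys shown in this diff. It reports any cost key ending in `_cost_per_token` on a model that lists `audio_transcription` as a supported mode.

```python
def find_pricing_mismatches(model_config):
    """Return per-token cost keys found on an audio transcription model.

    Hypothetical helper: assumes `model_config` is the parsed content of one
    provider YAML file (a dict with `costs` and `supportedModes`).
    """
    modes = model_config.get("supportedModes", [])
    if "audio_transcription" not in modes:
        # Per-token pricing is expected for non-audio modes such as chat.
        return []

    mismatches = []
    for entry in model_config.get("costs", []):
        for key in entry:
            # Flag keys like input_cost_per_token / output_cost_per_token.
            if key.endswith("_cost_per_token"):
                mismatches.append(key)
    return mismatches


# Values copied from Voxtral-Mini-3B-2507.yaml in this diff.
voxtral = {
    "costs": [
        {
            "input_cost_per_token": 2.7e-7,
            "output_cost_per_token": 8.5e-7,
            "region": "*",
        }
    ],
    "mode": "unknown",
    "model": "mistralai/Voxtral-Mini-3B-2507",
    "supportedModes": ["audio_transcription"],
}

print(find_pricing_mismatches(voxtral))
```

A check like this would only flag the file. Choosing the correct `input_cost_per_second` value still requires the provider's published pricing, which this diff does not include.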

Reviewed by Cursor Bugbot for commit a96a8ec.

  region: "*"
mode: unknown
model: mistralai/Voxtral-Mini-3B-2507
supportedModes:
- audio_transcription
@@ -0,0 +1,10 @@
costs:
- input_cost_per_token: 0.0
  output_cost_per_token: 0.0
  region: "*"
limits:
  context_window: 131072
mode: unknown
model: nvidia/nemotron-3-nano-omni-30b-a3b-reasoning-fp8
supportedModes:
- chat