What happened?
running: cce savings
always reports the same savings constantly.
Bee · 12 queries
⛁ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ 88% tokens saved
Input savings 1.9M tokens $27.78
Output savings 4.8k tokens $0.36
──────────────────────────────────────────
Total saved 1.9M tokens $28.15
~154.8k tokens / query ~$2.35 / query
Breakdown:
retrieval 84% ▰▰▰▰▰▰▰▰▰▰ 1.8M $26.76 · 12 calls
chunk compression 3% ▰▱▱▱▱▱▱▱▱▱ 68.5k $1.03 · 12 calls
output compression* <1% ▰▱▱▱▱▱▱▱▱▱ 4.8k $0.36 · 12 calls
- estimated. output compression assumes a 500-token avg reply; progressive disclosure compares against full payload dump.
Output compression levels seen: max=12
Cost estimate based on Opus pricing (input $15.0/1M, output $75.0/1M)
after a agent chat
Bee · 12 queries
⛁ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ 88% tokens saved
Input savings 1.9M tokens $27.78
Output savings 4.8k tokens $0.36
──────────────────────────────────────────
Total saved 1.9M tokens $28.15
~154.8k tokens / query ~$2.35 / query
Breakdown:
retrieval 84% ▰▰▰▰▰▰▰▰▰▰ 1.8M $26.76 · 12 calls
chunk compression 3% ▰▱▱▱▱▱▱▱▱▱ 68.5k $1.03 · 12 calls
output compression* <1% ▰▱▱▱▱▱▱▱▱▱ 4.8k $0.36 · 12 calls
- estimated. output compression assumes a 500-token avg reply; progressive disclosure compares against full payload dump.
Output compression levels seen: max=12
Cost estimate based on Opus pricing (input $15.0/1M, output $75.0/1M)
What did you expect?
savings to update
Steps to reproduce
cce savings after chats
Relevant logs or error output
Python version
na
OS
na
CCE version
latest
What happened?
running: cce savings
always reports the same savings constantly.
Bee · 12 queries
⛁ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ 88% tokens saved
Input savings 1.9M tokens $27.78
Output savings 4.8k tokens $0.36
──────────────────────────────────────────
Total saved 1.9M tokens $28.15
~154.8k tokens / query ~$2.35 / query
Breakdown:
retrieval 84% ▰▰▰▰▰▰▰▰▰▰ 1.8M $26.76 · 12 calls
chunk compression 3% ▰▱▱▱▱▱▱▱▱▱ 68.5k $1.03 · 12 calls
output compression* <1% ▰▱▱▱▱▱▱▱▱▱ 4.8k $0.36 · 12 calls
Output compression levels seen: max=12
Cost estimate based on Opus pricing (input $15.0/1M, output $75.0/1M)
after a agent chat
Bee · 12 queries
⛁ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ ⛶ 88% tokens saved
Input savings 1.9M tokens $27.78
Output savings 4.8k tokens $0.36
──────────────────────────────────────────
Total saved 1.9M tokens $28.15
~154.8k tokens / query ~$2.35 / query
Breakdown:
retrieval 84% ▰▰▰▰▰▰▰▰▰▰ 1.8M $26.76 · 12 calls
chunk compression 3% ▰▱▱▱▱▱▱▱▱▱ 68.5k $1.03 · 12 calls
output compression* <1% ▰▱▱▱▱▱▱▱▱▱ 4.8k $0.36 · 12 calls
Output compression levels seen: max=12
Cost estimate based on Opus pricing (input $15.0/1M, output $75.0/1M)
What did you expect?
savings to update
Steps to reproduce
cce savings after chats
Relevant logs or error output
Python version
na
OS
na
CCE version
latest