Handle missing tiktoken encoding data in api token counting #266

Merged
bsbodden merged 10 commits into redis:main from bhavana-giri:fix/237-tiktoken-fallback
Apr 6, 2026

Conversation

@bhavana-giri
Contributor

@bhavana-giri bhavana-giri commented Apr 3, 2026

Summary

  • make working-memory token counting resilient when tiktoken cannot load cl100k_base
  • cache the encoding after the first successful load and fall back to a character-based estimate when unavailable
  • add regression coverage for both direct token counting and the GET /v1/working-memory/{session_id}?model_name=... API path

Testing

  • git commit pre-commit hooks (ruff, ruff format, typos, trailing whitespace, EOF checks)
  • python3 -m py_compile agent_memory_server/api.py tests/test_issue_237.py

Closes #237.


Note

Low risk: changes are limited to token-counting/truncation paths and add a conservative fallback plus regression tests to prevent 500s when tiktoken can’t load.

Overview
Prevents GET /v1/working-memory/{session_id} (and summarization/truncation logic) from failing when tiktoken.get_encoding("cl100k_base") cannot load by introducing a cached encoder with a 5-minute backoff and a character-based token estimate fallback.

Refactors token counting to use _count_text_tokens throughout and adds regression tests covering both direct token counting and the API path, including retry/backoff behavior in _get_tiktoken_encoding.

Reviewed by Cursor Bugbot for commit d1bddbc.

@jit-ci

jit-ci bot commented Apr 3, 2026

Hi, I’m Jit, a friendly security platform designed to help developers build secure applications from day zero with an MVS (Minimal viable security) mindset.

In case there are security findings, they will be communicated to you as a comment inside the PR.

Hope you’ll enjoy using Jit.

Questions? Comments? Want to learn more? Get in touch with us.

Contributor

@nkanu17 nkanu17 left a comment


LGTM, but it's worth checking the Copilot review comments.

Contributor

Copilot AI left a comment


Pull request overview

This PR makes working-memory token counting resilient when tiktoken can’t load the cl100k_base encoding (e.g., in air-gapped / restricted-egress environments), preventing API requests and summarization/truncation logic from failing due to missing tokenizer data.

Changes:

  • Add a cached tiktoken encoding loader with a safe character-based fallback token estimator.
  • Route working-memory token counting and summarization token calculations through a shared _count_text_tokens() helper.
  • Add regression tests covering both direct token counting and the GET /v1/working-memory/{session_id}?model_name=... path when tiktoken.get_encoding() raises.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

  • agent_memory_server/api.py: Adds cached encoding load + fallback estimation and updates working-memory token counting/summarization to use it.
  • tests/test_issue_237.py: Adds regression coverage ensuring token counting + working-memory GET do not 500 when tiktoken encoding load fails.



@cursor cursor bot left a comment


Cursor Bugbot has reviewed your changes and found 1 potential issue.

Contributor

Copilot AI left a comment


Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.



Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@bsbodden bsbodden self-assigned this Apr 6, 2026
@bsbodden bsbodden self-requested a review April 6, 2026 15:04
Collaborator

@bsbodden bsbodden left a comment


LGTM! Thanks!

@bsbodden bsbodden merged commit 8bcd4c8 into redis:main Apr 6, 2026
22 checks passed


Development

Successfully merging this pull request may close these issues.

_calculate_messages_token_count crashes when tiktoken encodings are unavailable