Fix _split_cells to handle non-unit width characters correctly by nisha-muthurajan · Pull Request #4155 · Textualize/rich

nisha-muthurajan · 2026-06-04T04:21:54Z

The previous implementation used a proportional heuristic to estimate the starting character position, which overshot for multi-cell characters like emoji. Replace with a linear scan that accumulates real cell widths.

Type of changes

-✅ Bug fix

New feature
Documentation / docstrings
✅ Tests
Other

AI?

AI was used to generate this PR

AI generated PRs may be accepted, but only if @willmcgugan has responded on an issue or discussion.

Checklist

I've run the latest black with default args on new code.
I've updated CHANGELOG.md and CONTRIBUTORS.md where appropriate (see note about typos above).
✅ I've added tests for new code.
✅ I accept that @willmcgugan may be pedantic in the code review.

Description

Fixes #3299

Segment._split_cells used a proportional heuristic to guess the starting
character position:

pos = int((cut / cell_length) * len(text))

This overshot for multi-cell characters (emoji, CJK) because it assumed all
characters have equal width. The fallback loop then couldn't recover correctly,
producing wrong splits like ('🦊🦊 ', ' abcdef') instead of ('🦊 ', ' abcdef').

Fixed by replacing the heuristic + loop with a simple linear scan that
accumulates real cell widths character by character, stopping precisely at
the cut point.

Added a regression test test_split_cells_emoji covering the two examples
from the issue plus an exact-boundary case.

Fixes Textualize#3299 The previous implementation used a proportional heuristic to estimate the starting character position, which overshot for multi-cell characters like emoji. Replace with a linear scan that accumulates real cell widths.

nisha-muthurajan added 2 commits June 4, 2026 09:35

Apply black formatting and update CONTRIBUTORS.md

dad7eff

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix _split_cells to handle non-unit width characters correctly#4155

Fix _split_cells to handle non-unit width characters correctly#4155
nisha-muthurajan wants to merge 2 commits into
Textualize:masterfrom
nisha-muthurajan:fix/split-cells-non-unit-chars

nisha-muthurajan commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

nisha-muthurajan commented Jun 4, 2026

Type of changes

AI?

Checklist

Description

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant