perf: cache grapheme widths in Window.print word wrap #286

dyxushuai · 2025-12-28T14:08:57Z

Problem

Window.print(.word) computed gwidth twice per word: once for the word width check and again per-grapheme while writing cells. This duplicates grapheme width work and adds overhead in word-wrapped text.

Fix

Cache grapheme slices + widths while computing the word width, then reuse that cache when writing cells. If the fixed buffer fills up, reuse the cached prefix and fall back to the original per-grapheme path for the remainder.

Note: Caching uses a fixed 4KB stack buffer to avoid heap allocation for the per-word cache
(ArrayListUnmanaged append).

Bench (local, zig build bench, iterations=200, 80x24)

Baseline = print + extra per-word gwidth pass to mirror the old double-work
Cached = current print implementation

Case	Baseline ns/frame	Cached ns/frame	Improvement	Speedup
Small	81,281	67,043	-17.5%	1.21x
Medium	318,911	264,232	-17.1%	1.21x
Large	632,170	526,237	-16.8%	1.20x
Overflow	4,600,554	3,894,809	-15.3%	1.18x

Improvement: Small -17.5% (1.21x); Medium -17.1% (1.21x); Large -16.8% (1.20x); Overflow -15.3% (1.18x).

Tests

zig build test
zig build bench

Copilot

Pull request overview

This PR optimizes word-wrapping performance in Window.print by caching grapheme slice positions and widths during the initial width calculation pass, eliminating redundant gwidth() calls when rendering. The optimization uses a fixed 4KB stack buffer to avoid heap allocations and falls back to the original per-grapheme iteration if the buffer overflows.

Key Changes:

Introduced a caching mechanism that stores grapheme boundaries and widths in a fixed buffer during width calculation
Refactored the word-rendering path into two branches: a cached path that reuses computed widths, and a fallback path for cache overflow scenarios
Added comprehensive benchmarks demonstrating ~18-22% performance improvement across different text sizes

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File	Description
src/Window.zig	Implements grapheme width caching with `WordPiece` struct and `FixedBufferAllocator`, adds cached and fallback rendering paths
bench/bench.zig	Adds benchmark infrastructure and test cases (small/medium/large) to measure the caching optimization, includes helper iterators for baseline comparison

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

perf: cache grapheme widths in word wrap

f410925

Copilot AI review requested due to automatic review settings December 28, 2025 14:08

Copilot started reviewing on behalf of dyxushuai December 28, 2025 14:09 View session

dyxushuai marked this pull request as draft December 28, 2025 14:11

Copilot AI reviewed Dec 28, 2025

View reviewed changes

perf: reuse cached prefix when word cache overflows

45c8f5d

dyxushuai marked this pull request as ready for review December 28, 2025 14:17

dyxushuai mentioned this pull request Dec 28, 2025

Perf tracking: cumulative impact of optimization PRs #289

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: cache grapheme widths in Window.print word wrap #286

perf: cache grapheme widths in Window.print word wrap #286

Uh oh!

dyxushuai commented Dec 28, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

perf: cache grapheme widths in Window.print word wrap #286

Are you sure you want to change the base?

perf: cache grapheme widths in Window.print word wrap #286

Uh oh!

Conversation

dyxushuai commented Dec 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Fix

Bench (local, zig build bench, iterations=200, 80x24)

Tests

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

dyxushuai commented Dec 28, 2025 •

edited

Loading