ref(spans): Write payloads outside of eval #88453

jan-auer · 2025-04-01T16:37:52Z

Based on our timings and monitoring, a lot of the CPU load in Redis comes from the large pipeline that runs the `add-buffer.lua` script. It combines three bad aspects: it sends large payloads, runs many commands, and executes in a long, batched pipeline.

To understand and isolate its impact better, this PR splits the large script into two phases:

1. Push payloads into Redis using `SADD` under the top-most parent key determined by the current in-memory batch.
2. Run the pipelined script to restructure the partial trees.
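The first phase can be illustrated with a minimal sketch. It is not the code from this PR: the `Span` type, the `span-buf:s:` key prefix, and both helper functions are hypothetical names chosen for illustration. The sketch assumes that "top-most parent" means following `parent_span_id` links for as long as the parent is part of the same in-memory batch, and it only plans the `SADD` commands instead of sending them.

```python
from collections import defaultdict
from typing import NamedTuple, Optional


class Span(NamedTuple):
    # Hypothetical, simplified span record for illustration only.
    span_id: str
    parent_span_id: Optional[str]
    payload: bytes


def topmost_parent(span_id, parents):
    """Follow parent links while the current span is known to the batch."""
    seen = {span_id}
    current = span_id
    while True:
        parent = parents.get(current)
        # Stop at a root, at a span the batch knows nothing about, or on a cycle.
        if parent is None or parent in seen:
            return current
        seen.add(parent)
        current = parent


def plan_payload_writes(spans):
    """Group payloads by top-most parent and return one SADD per key."""
    parents = {s.span_id: s.parent_span_id for s in spans}
    groups = defaultdict(list)
    for s in spans:
        root = topmost_parent(s.span_id, parents)
        # The key format is an assumption, not the real key layout.
        groups[f"span-buf:s:{root}"].append(s.payload)
    return [("SADD", key, *payloads) for key, payloads in groups.items()]


batch = [
    Span("a", None, b"A"),
    Span("b", "a", b"B"),
    Span("c", "b", b"C"),
    Span("d", "x", b"D"),  # parent "x" is not part of this batch
]
commands = plan_payload_writes(batch)
```

Under these assumptions, spans `a`, `b`, and `c` end up in one `SADD` under the key for `a`, while `d` is written under the key for its unseen parent `x`. The restructuring script then only has to move or merge sets and never receives the payloads themselves.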

Additionally, this PR contains a few optimizations:

- Avoid redundant set merges (`SUNIONSTORE`) when there's no subsegment to merge
- Use `UNLINK` to defer large blocking deletes
- Use `EVALSHA` to speed up script invocation
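To make the last item concrete, here is a rough sketch of the `EVALSHA` pattern. `EVALSHA` addresses a cached script by the SHA1 hex digest of its source, so the script body does not have to be transmitted on every call; when the server does not have the script cached it replies with a `NOSCRIPT` error and the client can fall back to `EVAL`. The `FakeScriptServer` class below is a hypothetical in-memory stand-in, not a Redis client, and `run_script` is an illustrative helper rather than anything from this PR.

```python
import hashlib


class NoScriptError(Exception):
    """Stands in for the NOSCRIPT error reply."""


class FakeScriptServer:
    """Hypothetical in-memory stand-in for the server-side script cache."""

    def __init__(self):
        self.cache = {}
        self.bytes_received = 0

    def eval(self, script, *args):
        # EVAL sends the full script source; the server caches it by SHA1.
        self.bytes_received += len(script)
        self.cache[hashlib.sha1(script.encode()).hexdigest()] = script
        return len(args)

    def evalsha(self, sha, *args):
        # EVALSHA sends only the 40-character digest.
        self.bytes_received += len(sha)
        if sha not in self.cache:
            raise NoScriptError(sha)
        return len(args)


def run_script(server, script, *args):
    """Try EVALSHA first and fall back to EVAL when the script is not cached."""
    sha = hashlib.sha1(script.encode()).hexdigest()
    try:
        return server.evalsha(sha, *args)
    except NoScriptError:
        return server.eval(script, *args)


server = FakeScriptServer()
script = "return 1" * 100  # placeholder body, 800 characters long
run_script(server, script, "key")  # cache miss: digest, then full script
run_script(server, script, "key")  # cache hit: digest only
```

In this sketch only the first call pays for the full script body; every later call sends the 40-character digest instead, which matters when the same script is invoked many times in a pipeline.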

This prepares for batched calls to the script, which should result in even fewer writes.

codecov · 2025-04-02T12:56:12Z

Codecov Report

Attention: Patch coverage is 96.66667% with 2 lines in your changes missing coverage. Please review.

✅ All tests successful. No failed tests found.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| src/sentry/spans/buffer.py | 96.66% | 2 Missing ⚠️ |

Additional details and impacted files

@@             Coverage Diff             @@
##           master   #88453       +/-   ##
===========================================
+ Coverage   33.08%   87.74%   +54.65%     
===========================================
  Files        8464    10015     +1551     
  Lines      472987   567419    +94432     
  Branches    22294    22247       -47     
===========================================
+ Hits       156477   497860   +341383     
+ Misses     316090    69141   -246949     
+ Partials      420      418        -2

* master: (96 commits) fix(autofix): fix highlight popup behavior (#88552) 🔧 chore: introduce workflow engine ui links ff (#88569) feat(pipeline): Add CoveragePageWrapper component and tests for it (#88519) feat(taskworker):Make sdk tasks taskworker compatible (#88488) chore(flamegraph): Updating wording to trace (#88516) feat(shared-views): Create `GET` `group-search-view/starred` endpoint (#88398) DI-612: print flaky pytest errors to log (v2) (#88512) chore(nav): Update banner copy (#88566) feat(sentryapps): add RPC method to get all installation component contents (#88179) chore(issue summary): Add limit to query (#88563) fix(dashboards): Add limit suggestion to validation (#88436) feat(checkout): show starting PAYG prices (#88510) ✨ feat(aci): add workflow_id when creating an ephemeral rule in noa (#88520) fix(billing): hide pay now button for self serve partners (#88504) chore(HC): Re-adds logging with a low sample rate for cache hits/misses on options (#88464) fix(logs): Upgrade sentry log integration to fix dogfooding issues (#88561) chore(issue summary): Remove dividers from AI summary alert (#88554) feat(insights): Enable bubbles in full-screen mode (#88445) fix(explore): Update search bar query on filter change (#88473) chore(dependencies): Upgrade drf-spectacular (#88459) ...

getsentry-bot · 2025-04-03T06:40:36Z

PR reverted: de9a709

This reverts commit 89c00d5. Co-authored-by: jan-auer <[email protected]>

Based on our timings and monitoring, we notice a lot of the CPU load in Redis comes from the large pipeline that runs the `add-buffer.lua` script. It combines three bad aspects: Sends large payloads, runs many commands, and executes in a long, batched pipeline.

To reduce the time Redis is blocked on slow operations, this PR splits the large script into two phases:

1. Push payloads into Redis using `SADD` under the top-most parent key determined by the current in-memory batch.
2. Run the pipelined script to restructure the partial trees

Additionally, this PR contains a few optimizations:

- Avoid redundant set merges (`SUNIONSTORE`) when there's no subsegment to merge
- Use `UNLINK` to defer large blocking deletes
- Use `EVALSHA` to speed up script invocation

This prepares for batched calls to the script, which should result in even fewer writes.

Second attempt of #88453

---------

Co-authored-by: Markus Unterwaditzer <[email protected]>


ref(spans): Write payloads outside of eval

188215a

github-actions bot added the Scope: Backend label (automatically applied to PRs that change backend components) Apr 1, 2025

vercel bot deployed to Preview April 1, 2025 16:42

ref: Batch in-memory to reduce redirects

7e38a61

vercel bot deployed to Preview April 2, 2025 09:23

ref: Avoid redundant sunionstore

57b44d7

vercel bot deployed to Preview April 2, 2025 10:55

fix: Properly merge redirected subsegments

08e00c7

vercel bot deployed to Preview April 2, 2025 11:05

fix: Use evalsha

4e15c11

vercel bot deployed to Preview April 2, 2025 12:28

jan-auer assigned untitaker Apr 2, 2025

jan-auer marked this pull request as ready for review April 2, 2025 13:47

jan-auer requested a review from a team as a code owner April 2, 2025 13:47

jan-auer mentioned this pull request Apr 2, 2025

ref(span-buffer): Reduce zadd/zrem calls to Redis #88463

Merged

vercel bot deployed to Preview April 2, 2025 17:42

jan-auer added 2 commits April 2, 2025 19:48

fix: Typing

aa4fa61

fix: Pipelines

4d85401

vercel bot deployed to Preview April 2, 2025 17:57

untitaker approved these changes Apr 2, 2025


untitaker merged commit 89c00d5 into master Apr 2, 2025
47 checks passed

untitaker deleted the ref/spans-separate-payload branch April 2, 2025 18:37

jan-auer added the Trigger: Revert label (add to a merged PR to revert it; skips CI) Apr 3, 2025

getsentry-bot added a commit that referenced this pull request Apr 3, 2025

Revert "ref(spans): Write payloads outside of eval (#88453)"

de9a709

This reverts commit 89c00d5. Co-authored-by: jan-auer <[email protected]>

jan-auer restored the ref/spans-separate-payload branch April 3, 2025 07:32

jan-auer mentioned this pull request Apr 3, 2025

ref(spans): Optimize the span buffer redis script #88661

Merged

adrian-codecov pushed a commit that referenced this pull request Apr 3, 2025

Revert "ref(spans): Write payloads outside of eval (#88453)"

dd94a4d

This reverts commit 89c00d5. Co-authored-by: jan-auer <[email protected]>

andrewshie-sentry pushed a commit that referenced this pull request Apr 8, 2025

Revert "ref(spans): Write payloads outside of eval (#88453)"

49f4433

This reverts commit 89c00d5. Co-authored-by: jan-auer <[email protected]>
