feat(alerts): Add issue summary to slack issue alerts #88033
Conversation
lgtm, defer to alerts team to review
src/sentry/features/temporary.py
# Enables automatically triggering issue summary on alerts
manager.add("projects:trigger-issue-summary-on-alerts", ProjectFeature, FeatureHandlerStrategy.FLAGPOLE, api_expose=True)
Using a project-level flag so I can just try it out on Seer before releasing to the internal org.
do we need this to be exposed in the API to the frontend?
nope, set it to false
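For reference, a sketch of what the registration above would look like after that change (same flag as in the diff, just with api_expose flipped):

# Enables automatically triggering issue summary on alerts
manager.add("projects:trigger-issue-summary-on-alerts", ProjectFeature, FeatureHandlerStrategy.FLAGPOLE, api_expose=False)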
can we also please get a screenshot of how the notification looks?
https://develop.sentry.dev/integrations/slack/ - docs on how to set it up
try:
    with concurrent.futures.ThreadPoolExecutor() as executor:
        future = executor.submit(get_issue_summary, self.group)
        summary_result, status_code = future.result(timeout=5)
can we refactor the timeout magic number into a constant and perhaps add metrics/span here so we can monitor how long this is taking?
Could we instead define a new sentry option that allows us to modify this timeout on the fly if we need to adjust it? It'll be faster than waiting for a full rollout if something goes wrong.
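A rough sketch of that suggestion, assuming the usual register/options.get helpers and a hypothetical option name (alerts.issue-summary-timeout):

# src/sentry/options/defaults.py (sketch) -- option name is hypothetical
register(
    "alerts.issue-summary-timeout",
    default=5,
    flags=FLAG_AUTOMATOR_MODIFIABLE,
)

# at the call site, read the option instead of hard-coding 5
from sentry import options

timeout = options.get("alerts.issue-summary-timeout")
summary_result, status_code = future.result(timeout=timeout)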
        return summary_result
    return None
except (concurrent.futures.TimeoutError, Exception) as e:
    logger.exception("Error generating issue summary: %s", e)
Do we want metrics to track these failures?
Is a span and the Sentry issue generated from this sufficient? I'm not too knowledgeable about other ways to track metrics here.
That should be fine for now. If we want more granular tracking of things like duration over time or success/failure counts, we tend to wrap calls like these in metrics helpers like the one below, so we can graph them in Datadog, assign them to SLOs, etc.:
sentry/src/sentry/tasks/groupowner.py
Line 57 in c284a68
with metrics.timer("sentry.tasks.process_suspect_commits.process_loop"):
This is totally optional though.
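For illustration, a sketch of how the summary fetch could be wrapped (the metric names here are made up, and ISSUE_SUMMARY_TIMEOUT is the hypothetical constant mentioned earlier, not code from the PR):

import concurrent.futures
from sentry.utils import metrics

ISSUE_SUMMARY_TIMEOUT = 5  # hypothetical constant replacing the magic number

# time the whole fetch so duration can be graphed / attached to an SLO
with metrics.timer("alerts.issue_summary.fetch"):
    try:
        with concurrent.futures.ThreadPoolExecutor() as executor:
            future = executor.submit(get_issue_summary, self.group)
            summary_result, status_code = future.result(timeout=ISSUE_SUMMARY_TIMEOUT)
    except concurrent.futures.TimeoutError:
        # count timeouts separately so they can be alerted on
        metrics.incr("alerts.issue_summary.timeout")
        raise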
    return None

try:
    with concurrent.futures.ThreadPoolExecutor() as executor:
is this pattern something Seer uses?
I'm not sure what you mean. We use threading a lot in Seer, but it doesn't seem relevant here. I just thought this would be a simple way to set a timeout, since we said we wanted one.
What's the benefit of starting an async thread here and synchronously waiting for it vs. passing the timeout as a parameter to the get_issue_summary util function instead? Seems like we're just using a request under the hood, which can already handle this concern:
sentry/src/sentry/seer/issue_summary.py
Line 99 in a05d27f
response = requests.post(
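For context, the alternative being suggested is to thread a timeout down to that request, since requests.post already accepts one (a sketch; seer_url, payload, and timeout_seconds are placeholders, not the actual code in issue_summary.py):

# inside get_issue_summary (sketch): bound just the HTTP call to Seer
response = requests.post(
    seer_url,
    json=payload,
    timeout=timeout_seconds,  # raises requests.exceptions.Timeout if Seer is slow
)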
My concern with the timeout is not just the Seer call, but also the queries we're running to get trace-connected issues on the Sentry backend. In my experience those are often the main culprit for slow summary generation. Is there a better way to wrap both of those in a timeout?
Ahh that's fair. I'm not super familiar with eventstore, but this approach makes sense if we're trying to cover those under the same 5s timeout.
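For reference, a rough sketch of the pattern being discussed, with the executor shut down without waiting so the caller isn't blocked past the budget (the constant and helper names are illustrative, not taken from the PR):

import concurrent.futures

SUMMARY_TIMEOUT_SECONDS = 5  # illustrative name for the 5s budget

def get_summary_with_timeout(group):
    # Run get_issue_summary (the eventstore queries for trace-connected
    # issues plus the Seer HTTP call) in a worker thread so both share a
    # single wall-clock budget.
    executor = concurrent.futures.ThreadPoolExecutor(max_workers=1)
    try:
        future = executor.submit(get_issue_summary, group)
        summary_result, status_code = future.result(timeout=SUMMARY_TIMEOUT_SECONDS)
    except concurrent.futures.TimeoutError:
        return None
    finally:
        # wait=False: don't block on a slow worker; it finishes in the background
        executor.shutdown(wait=False)
    if status_code == 200:
        return summary_result
    return None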
Codecov Report
Attention: Patch coverage is
✅ All tests successful. No failed tests found.
Additional details and impacted files

@@            Coverage Diff             @@
##           master   #88033      +/-   ##
==========================================
- Coverage   87.71%   87.71%   -0.01%
==========================================
  Files        9977     9978       +1
  Lines      564630   564726      +96
  Branches    22232    22232
==========================================
+ Hits       495292   495375      +83
- Misses      68922    68935      +13
  Partials      416      416
This PR should:
- for any issue alert in Slack, only for error issues, only if behind the gen-ai-features FF and behind the new project-level summary x alerts FF...
- fetch the summary for the issue (will hit the cache if it already exists, or generate a new one otherwise)
- time out at 5 seconds
- replace the title and body of the alert with the summary content, only if we got a summary successfully

---------

Co-authored-by: getsantry[bot] <66042841+getsantry[bot]@users.noreply.github.com>
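As a rough sketch of how those gating conditions compose (the helper name is illustrative, and the exact gen-ai flag string is assumed rather than copied from the diff):

from sentry import features
from sentry.issues.grouptype import GroupCategory

def should_use_issue_summary(group) -> bool:
    # Error issues only, and only when both feature flags are enabled.
    return (
        group.issue_category == GroupCategory.ERROR
        and features.has("organizations:gen-ai-features", group.organization)
        and features.has("projects:trigger-issue-summary-on-alerts", group.project)
    )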
Suspect Issues
This pull request was deployed and Sentry observed the following issues: