ref(grouping): Add training mode for similarity model rollout #102623
base: master
Conversation
Introduce training_mode parameter to send dual embeddings during model upgrades. Centralize model version config and add should_send_new_model_embeddings() to track which groups need new embeddings. Rename feature flag to be version-agnostic.
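The gating logic described above could be sketched roughly as follows. This is a hedged illustration, not the PR's implementation: the version constants, the simplified signature (a boolean flag in place of the project/feature-flag lookup), and the comparison against the stored model version are all assumptions.

```python
from typing import Optional

# Assumed centralized model-version config (actual names/values may differ).
CURRENT_MODEL_VERSION = "v1"  # model serving live grouping
NEW_MODEL_VERSION = "v2"      # model being rolled out


def should_send_new_model_embeddings(
    upgrade_flag_enabled: bool, grouphash_seer_model: Optional[str]
) -> bool:
    """Track whether a group still needs embeddings from the new model.

    Simplified: the real function presumably takes the project and checks
    the "projects:similarity-grouping-model-upgrade" feature flag itself.
    """
    if not upgrade_flag_enabled:
        return False
    # Groups whose grouphash already recorded the new model version are done.
    return grouphash_seer_model != NEW_MODEL_VERSION
```

Under this sketch, a grouphash with no recorded model (or one recorded against the old model) triggers a dual-embedding send while the flag is on; once the new version is stored, it does not.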
Codecov Report ✅ All modified and coverable lines are covered by tests. Additional details and impacted files:

    @@           Coverage Diff            @@
    ##           master   #102623   +/-   ##
    ============================================
    + Coverage   70.58%    80.80%   +10.21%
    ============================================
      Files        9200      8936      -264
      Lines      392841    391345     -1496
      Branches    25009     24848      -161
    ============================================
    + Hits       277304    316234    +38930
    + Misses     115111     74743    -40368
    + Partials      426       368       -58
Force-pushed from 51dd1a9 to 261abb2.
    # Enable v2 similarity grouping model (part of v2 grouping rollout)
    manager.add("projects:similarity-grouping-v2-model", ProjectFeature, FeatureHandlerStrategy.FLAGPOLE, api_expose=False)
    # Enable new similarity grouping model upgrade (version-agnostic rollout)
    manager.add("projects:similarity-grouping-model-upgrade", ProjectFeature, FeatureHandlerStrategy.FLAGPOLE, api_expose=False)
I wasn't sure here whether I should keep the old feature, or whether it's safe to delete a feature and its usage in the same PR. This is basically just a rename of the flag.
    result = "found_secondary"
    maybe_send_seer_for_new_model_training(
        event, secondary.existing_grouphash, secondary.variants
    )
Bug: Inconsistent Updates for Seer-Matched Groups Embeddings
Missing call to maybe_send_seer_for_new_model_training when a Seer match is found. The function is called when an existing grouphash is found via primary grouping (line 1293) and secondary grouping (lines 1305-1307), but it's not called when an existing grouphash is found via Seer matching (line 1316). This means that existing groups found through Seer similarity matching won't get their embeddings updated for the new model version, which is inconsistent with the behavior for groups found through other grouping methods.
this is wrong, they get updated in the regular flow, it's fine
    gh_metadata = existing_grouphash.metadata
    grouphash_seer_model = gh_metadata.seer_model if gh_metadata else None

    if should_send_new_model_embeddings(event.project, grouphash_seer_model):
nit: could use guard clause / early return here
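To illustrate the suggestion, the nested check could be flattened with an early return. A self-contained sketch with stub types; the real function operates on Sentry's event/grouphash objects, checks the project feature flag, and runs rate-limit and circuit-breaker checks, all of which are elided or stubbed here:

```python
from dataclasses import dataclass
from typing import Optional

NEW_MODEL_VERSION = "v2"  # assumed rollout target


@dataclass
class Metadata:
    seer_model: Optional[str]


@dataclass
class GroupHash:
    metadata: Optional[Metadata]


def needs_new_embeddings(seer_model: Optional[str]) -> bool:
    # Stand-in for should_send_new_model_embeddings (project checks omitted).
    return seer_model != NEW_MODEL_VERSION


def maybe_send_for_training(grouphash: GroupHash) -> bool:
    """Return True if a training-mode request would be sent."""
    gh_metadata = grouphash.metadata
    seer_model = gh_metadata.seer_model if gh_metadata else None

    # Guard clause / early return, per the review suggestion:
    # bail out first instead of nesting the happy path under an `if`.
    if not needs_new_embeddings(seer_model):
        return False

    # ... rate-limit / circuit-breaker checks and the actual Seer call go here ...
    return True
```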
    if should_send_new_model_embeddings(event.project, grouphash_seer_model):
        had_metadata = gh_metadata is not None
        # Send training mode request (honor all checks like rate limits, circuit breaker, etc.)
        if should_call_seer_for_grouping(event, variants, existing_grouphash):
same here
        had_metadata = gh_metadata is not None
        # Send training mode request (honor all checks like rate limits, circuit breaker, etc.)
        if should_call_seer_for_grouping(event, variants, existing_grouphash):
            record_did_call_seer_metric(event, call_made=True, blocker="none")
should we add some tag to this metric to say that we're calling seer from new_model_training?
hmm yea agreed
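One way this could look, sketched with a plain dict in place of the real `record_did_call_seer_metric` tag plumbing. The `origin` tag name and its values are assumptions, chosen only to show the shape of the change:

```python
def did_call_seer_tags(call_made: bool, blocker: str, origin: str = "ingest") -> dict:
    """Build the tag set for the did-call-seer metric, including where the
    call came from, so training-mode calls are distinguishable from the
    regular grouping path when querying the metric."""
    return {"call_made": call_made, "blocker": blocker, "origin": origin}


# Regular grouping call vs. a call made from new-model training mode:
regular = did_call_seer_tags(call_made=True, blocker="none")
training = did_call_seer_tags(call_made=True, blocker="none", origin="new_model_training")
```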
    get_seer_similar_issues(event, existing_grouphash, variants, training_mode=True)

    # Record metrics for new model embedding requests
    metrics.incr(
seems like we have a bunch of telemetry in get_seer_similar_issues already, should we just consolidate there with a training_mode tag?
oh yea for sure, this doesn't make sense, I'll consolidate