ore: ensure Sentry transport never shuts down #34050

teskje · 2025-11-07T11:25:06Z

Previously, the mz_ore::tracing::configure would initialize the Sentry transport and then return a TracingGuard that needed to be kept alive to prevent the Sentry transport from shutting down. Callers would usually keep the guard until the end of the current function scope. This caused us to miss reporting panics when a fatal error was bubbled up to the process' main before becoming a panic, and the guard was dropped in the process.

The fix taken here is to immediately forget the guard, to make sure the Sentry transport remains intact for the process lifetime. If there is ever a need to shut down the Sentry transport again we'll need to reconsider, but we can keep things simple for now.

Motivation

This PR fixes a previously unreported bug.

Panics caused by fatal errors that bubble up to main are not reported to Sentry.

Slack thread.

Tips for reviewer

Checklist

This PR has adequate test coverage / QA involvement has been duly considered. (trigger-ci for additional test/nightly runs)
This PR has an associated up-to-date design doc, is a design doc (template), or is sufficiently small to not require a design.
If this PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.
If this PR will require changes to cloud orchestration or tests, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label (example).
If this PR includes major user-facing behavior changes, I have pinged the relevant PM to schedule a changelog post.

def-

Is there a good way to write a test for this?

teskje · 2025-11-07T11:35:52Z

Is there a good way to write a test for this?

I think we'd need some way to run Materialize together with Sentry in tests. Then we could force a panic and check if it gets reported. The same thing would also have been useful with other Sentry reporting issues we had previously.

Previously, the `mz_ore::tracing::configure` would initialize the Sentry transport and then return a `TracingGuard` that needed to be kept alive to prevent the Sentry transport from shutting down. Callers would usually keep the guard until the end of the current function scope. This caused us to miss reporting panics when a fatal error was bubbled up to the process' `main` before becoming a panic, and the guard was dropped in the process. The fix taken here is to immediately forget the guard, to make sure the Sentry transport remains intact for the process lifetime. If there is ever a need to shut down the Sentry transport again we'll need to reconsider, but we can keep things simple for now.

def- · 2025-11-07T11:40:22Z

We have test/tracing/mzcompose.py, which reports to a special sentry DSN, which we can then check via the API to see if the expected log has made it to it?

teskje · 2025-11-07T11:47:38Z

Ah, is that connected to our production Sentry instance? Yes, that would work. Though DSN authentication doesn't work to list issues in a project, we'd also need to supply a Sentry auth token.

def- · 2025-11-07T11:55:51Z

Should be the database-backend-testing project: https://materializeinc.sentry.io/insights/projects/database-backend-testing/?project=4506542270906368

ggevay

Thank you!

teskje · 2025-11-12T17:23:53Z

I don't have time to look at testing the Sentry integration now, and I don't think we should block this fix on that. So I'm going ahead with merging. I've opened an issue to track adding tests: https://github.com/MaterializeInc/database-issues/issues/9897

teskje · 2025-11-12T17:24:01Z

TFTR!

teskje marked this pull request as ready for review November 7, 2025 11:26

teskje requested review from a team and ggevay as code owners November 7, 2025 11:27

def- reviewed Nov 7, 2025

View reviewed changes

teskje force-pushed the sentry-prevent-shutdown branch from cf205f0 to 908943b Compare November 7, 2025 11:39

ggevay approved these changes Nov 12, 2025

View reviewed changes

teskje merged commit c1a55c7 into MaterializeInc:main Nov 12, 2025
129 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ore: ensure Sentry transport never shuts down #34050

ore: ensure Sentry transport never shuts down #34050

Uh oh!

teskje commented Nov 7, 2025 •

edited

Loading

Uh oh!

def- left a comment

Uh oh!

teskje commented Nov 7, 2025

Uh oh!

def- commented Nov 7, 2025

Uh oh!

teskje commented Nov 7, 2025

Uh oh!

def- commented Nov 7, 2025

Uh oh!

ggevay left a comment

Uh oh!

teskje commented Nov 12, 2025

Uh oh!

teskje commented Nov 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ore: ensure Sentry transport never shuts down #34050

ore: ensure Sentry transport never shuts down #34050

Uh oh!

Conversation

teskje commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Tips for reviewer

Checklist

Uh oh!

def- left a comment

Choose a reason for hiding this comment

Uh oh!

teskje commented Nov 7, 2025

Uh oh!

def- commented Nov 7, 2025

Uh oh!

teskje commented Nov 7, 2025

Uh oh!

def- commented Nov 7, 2025

Uh oh!

ggevay left a comment

Choose a reason for hiding this comment

Uh oh!

teskje commented Nov 12, 2025

Uh oh!

teskje commented Nov 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

teskje commented Nov 7, 2025 •

edited

Loading