Fix: tighten email regex to reduce false positives in email detection #20

DivInstance · 2025-10-06T05:27:54Z

Summary

This PR reduces false positives in email detection used across Slash by replacing the very loose \S+@\S+ with a stricter, centralized regex.

Changes

Add EMAIL_RE in api/extract.py
- Pattern: \b[a-zA-Z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}\b
Update extract.mail()

Rationale

The previous pattern matched tokens with trailing punctuation or random strings containing @, causing misclassification (e.g., treating the input as an email when it’s not). The new pattern aligns with common email formats and reduces noise.

Testing

Positive: [email protected], [email protected] → detected as emails
Negative: hello@world, username@, @domain.com, not-an-email. → not detected

Impact

Improves accuracy for flows that depend on [extract.just.mail()]

Checklist

Centralized email regex
Updated both email extraction methods
Manual smoke tests for positives and negatives

Resolves #2

fix(extract): tighten email regex to reduce false positives

8280790

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix: tighten email regex to reduce false positives in email detection #20

Fix: tighten email regex to reduce false positives in email detection #20

Uh oh!

DivInstance commented Oct 6, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Fix: tighten email regex to reduce false positives in email detection #20

Are you sure you want to change the base?

Fix: tighten email regex to reduce false positives in email detection #20

Uh oh!

Conversation

DivInstance commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Rationale

Testing

Impact

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

DivInstance commented Oct 6, 2025 •

edited

Loading