@DAlperin DAlperin commented Nov 5, 2025

The Iceberg sink is a multistage dataflow composed of the following operators:

  • Batch description minting: The batch minter picks time bounds that extend a fixed width `x` into the future. All subsequent operators use these bounds.
  • Iceberg writer: Writes data files (Parquet) whose update times are bounded by the batch descriptions.
  • Iceberg committer: Coalesces the files from all writers and commits them to the catalog.
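
To make the division of labor between the three stages concrete, here is a minimal sketch of the flow described above. It is not the code in this PR: `BatchDescription`, `DataFile`, `mint_batches`, `write_files`, and `commit` are hypothetical names, plain `u64` values stand in for timestamps, an in-memory struct stands in for a Parquet file, and a half-open `[lower, upper)` interval is assumed for the time bounds.

```rust
use std::collections::BTreeMap;

/// Hypothetical batch description: a half-open time interval [lower, upper).
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord)]
struct BatchDescription {
    lower: u64,
    upper: u64,
}

/// Stage 1 (sketch): mint `count` consecutive batch descriptions, each
/// `width` wide, starting at `frontier` and extending into the future.
fn mint_batches(frontier: u64, width: u64, count: u64) -> Vec<BatchDescription> {
    (0..count)
        .map(|i| BatchDescription {
            lower: frontier + i * width,
            upper: frontier + (i + 1) * width,
        })
        .collect()
}

/// Stand-in for a Parquet data file produced by one writer for one batch.
#[derive(Debug, Clone, PartialEq, Eq)]
struct DataFile {
    batch: BatchDescription,
    writer_id: usize,
    rows: Vec<(u64, String)>,
}

/// Stage 2 (sketch): a single writer groups its (time, payload) updates by
/// batch description and emits one file per non-empty batch.
fn write_files(
    writer_id: usize,
    updates: &[(u64, String)],
    batches: &[BatchDescription],
) -> Vec<DataFile> {
    batches
        .iter()
        .filter_map(|batch| {
            let rows: Vec<(u64, String)> = updates
                .iter()
                .filter(|(time, _)| batch.lower <= *time && *time < batch.upper)
                .cloned()
                .collect();
            if rows.is_empty() {
                None
            } else {
                Some(DataFile { batch: *batch, writer_id, rows })
            }
        })
        .collect()
}

/// Stage 3 (sketch): coalesce the files from all writers by batch. Each map
/// entry stands in for one commit to the catalog.
fn commit(files: Vec<DataFile>) -> BTreeMap<BatchDescription, Vec<DataFile>> {
    let mut commits: BTreeMap<BatchDescription, Vec<DataFile>> = BTreeMap::new();
    for file in files {
        commits.entry(file.batch).or_default().push(file);
    }
    commits
}
```

In this sketch the batch description is the only thing the writers and the committer need to agree on: every writer buckets its updates against the same minted bounds, so the committer can group files by batch without inspecting their contents.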

Motivation

Tips for reviewer

Checklist

  • This PR has adequate test coverage / QA involvement has been duly considered. (trigger-ci for additional test/nightly runs)
  • This PR has an associated up-to-date design doc, is a design doc (template), or is sufficiently small to not require a design.
  • If this PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.
  • If this PR will require changes to cloud orchestration or tests, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label (example).
  • If this PR includes major user-facing behavior changes, I have pinged the relevant PM to schedule a changelog post.

@DAlperin DAlperin force-pushed the dov/iceberg-sink-render branch from 7325acf to 212847c Compare November 13, 2025 18:11
The Iceberg sink is a multistage dataflow composed of the following
operators:

- Batch description minting:
  The batch minter picks time bounds that extend a fixed width `x` into the
  future. All subsequent operators use these bounds.
- Iceberg writer:
  Writes data files (Parquet) whose update times are bounded by the batch
  descriptions.
- Iceberg committer:
  Coalesces the files from all writers and commits them to the catalog.
@DAlperin DAlperin force-pushed the dov/iceberg-sink-render branch from 212847c to 2ddb360 Compare November 16, 2025 03:25