Partitioning: Use more consistent language regarding partitions and buckets #853

RasmusRendal · 2025-08-01T09:02:59Z

In the current state, a partition consists of partitions, which is quite confusing. This PR changes that, such that a partition consists of buckets.

…uckets In the current state, a partition consists of partitions, which is quite confusing. This PR changes that, such that a partition consists of buckets.

bruth · 2025-08-01T11:27:32Z

nats-concepts/subject_mapping.md

 Deterministic token partitioning allows you to use subject-based addressing to deterministically divide (partition) a flow of messages where one or more of the subject tokens is mapped into a partition key. Deterministically means, the same tokens are always mapped into the same key. The mapping will appear random and may not be `fair` for a small number of subjects.

-For example: new customer orders are published on `neworders.<customer id>`, you can partition those messages over 3 partition numbers (buckets), using the `partition(number of partitions, wildcard token positions...)` function which returns a partition number (between 0 and number of partitions-1) by using the following mapping `"neworders.*" : "neworders.{{wildcard(1)}}.{{partition(3,1)}}"`.
+For example: new customer orders are published on `neworders.<customer id>`, you can partition those messages over 3 partition numbers (buckets), using the `partition(number of buckets, wildcard token positions...)` function which returns a partition number (between 0 and number of partitions-1) by using the following mapping `"neworders.*" : "neworders.{{wildcard(1)}}.{{partition(3,1)}}"`.


Hi @RasmusRendal, thanks for the contribution. Now that you pointed this discrepancy out, I think removing the term bucket generally would make more sense and stick with partition since it is redundant.

For example:

Suggested change

For example: new customer orders are published on `neworders.<customer id>`, you can partition those messages over 3 partition numbers (buckets), using the `partition(number of buckets, wildcard token positions...)` function which returns a partition number (between 0 and number of partitions-1) by using the following mapping `"neworders.*" : "neworders.{{wildcard(1)}}.{{partition(3,1)}}"`.

For example: new customer orders are published on `neworders.<customer id>`, you can spread those messages over 3 partition, using the `partition(number of partitions, wildcard token positions...)` function which returns a partition number (between 0 and number of partitions-1) by using the following mapping `"neworders.*" : "neworders.{{wildcard(1)}}.{{partition(3,1)}}"`.

Does this make sense?

I think calling the things the partition consists of something different than "partition" is still important, if not for making the documentation understandable, to make it easier to talk about NATS. I want to be able to tell a colleague that "We create a partition of our subject, and each bucket/part is handled by a separate consumer".

This is also how people talk about partitions in other contexts: https://en.wikipedia.org/wiki/Partition_of_a_set#Definition_and_notation

The sets in $P$ are called the blocks, parts, or cells, of the partition.

Partitioning: Use more consistent language regarding partitions and b…

e436b4b

…uckets In the current state, a partition consists of partitions, which is quite confusing. This PR changes that, such that a partition consists of buckets.

bruth reviewed Aug 1, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Partitioning: Use more consistent language regarding partitions and buckets #853

Partitioning: Use more consistent language regarding partitions and buckets #853

Uh oh!

RasmusRendal commented Aug 1, 2025

Uh oh!

bruth Aug 1, 2025

Uh oh!

RasmusRendal Aug 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	For example: new customer orders are published on `neworders.<customer id>`, you can partition those messages over 3 partition numbers (buckets), using the `partition(number of buckets, wildcard token positions...)` function which returns a partition number (between 0 and number of partitions-1) by using the following mapping `"neworders.*" : "neworders.{{wildcard(1)}}.{{partition(3,1)}}"`.
	For example: new customer orders are published on `neworders.<customer id>`, you can spread those messages over 3 partition, using the `partition(number of partitions, wildcard token positions...)` function which returns a partition number (between 0 and number of partitions-1) by using the following mapping `"neworders.*" : "neworders.{{wildcard(1)}}.{{partition(3,1)}}"`.

Partitioning: Use more consistent language regarding partitions and buckets #853

Are you sure you want to change the base?

Partitioning: Use more consistent language regarding partitions and buckets #853

Uh oh!

Conversation

RasmusRendal commented Aug 1, 2025

Uh oh!

bruth Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

RasmusRendal Aug 1, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants