Skip to content

Commit f2649ba

Browse files
committed
Cleanup
1 parent 28a9e3d commit f2649ba

File tree

2 files changed

+5
-4
lines changed

2 files changed

+5
-4
lines changed

docs/cli_options.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -280,12 +280,12 @@ The standard deviation of the number of tokens in each output.
280280

281281
#### `--prompt-prefix-pool-size`, `--prefix-prompt-pool-size`, `--num-prefix-prompts` `<int>`
282282

283-
The total size of the prefix prompt pool to select prefixes from. If this value is not zero, these are prompts that are prepended to input prompts. This is useful for benchmarking models that use a K-V cache.
283+
The total size of the prefix prompt pool to select prefixes from. If this value is not zero, these are prompts that are prepended to input prompts. This is useful for benchmarking models that use a K-V cache. This field cannot be used with --prefix-reuse-rate.
284284
<br>_Default: `0`_
285285

286286
#### `--prompt-prefix-length`, `--prefix-prompt-length` `<int>`
287287

288-
The number of tokens in each prefix prompt. This is only used if "num" is greater than zero. Note that due to the prefix and user prompts being concatenated, the number of tokens in the final prompt may be off by one. This field is ignored when --prefix-reuse-rate is used.
288+
The number of tokens in each prefix prompt. This is only used if "num" is greater than zero. Note that due to the prefix and user prompts being concatenated, the number of tokens in the final prompt may be off by one. This field cannot be used with --prefix-reuse-rate.
289289
<br>_Default: `0`_
290290

291291
#### `--prefix-reuse-rate` `<float>`

src/aiperf/common/config/prompt_config.py

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -148,7 +148,8 @@ def validate_prefix_reuse_rate_conflicts(self) -> Self:
148148
description=(
149149
"The total size of the prefix prompt pool to select prefixes from.\n"
150150
"If this value is not zero, these are prompts that are prepended to input prompts.\n"
151-
"This is useful for benchmarking models that use a K-V cache."
151+
"This is useful for benchmarking models that use a K-V cache. "
152+
"This field cannot be used with --prefix-reuse-rate."
152153
),
153154
),
154155
CLIParameter(
@@ -170,7 +171,7 @@ def validate_prefix_reuse_rate_conflicts(self) -> Self:
170171
'This is only used if "num" is greater than zero.\n'
171172
"Note that due to the prefix and user prompts being concatenated,\n"
172173
"the number of tokens in the final prompt may be off by one.\n"
173-
"This field is ignored when --prefix-reuse-rate is used."
174+
"This field cannot be used with --prefix-reuse-rate."
174175
),
175176
),
176177
CLIParameter(

0 commit comments

Comments
 (0)