Skip to content

Commit 4cc6253

Browse files
Merge pull request #666 from codinglover222/deepseek-doc-fix
fix an args description.
2 parents 57d7bd4 + 4a65fd9 commit 4cc6253

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

inference/kernel.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -87,7 +87,7 @@ def weight_dequant(x: torch.Tensor, s: torch.Tensor, block_size: int = 128) -> t
8787
8888
Args:
8989
x (torch.Tensor): The quantized weight tensor of shape (M, N).
90-
s (torch.Tensor): The scale tensor of shape (M, N).
90+
s (torch.Tensor): The scale tensor of shape (M//block_size, N//block_size).
9191
block_size (int, optional): The block size to use for dequantization. Defaults to 128.
9292
9393
Returns:

0 commit comments

Comments
 (0)