-
Notifications
You must be signed in to change notification settings - Fork 4.5k
Pull requests: karpathy/nanochat
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
remove unnecessary check to make the logic in CausalSelfAttention.forward() clearer
refactor
suggest/merge
#310
opened Nov 19, 2025 by
ericsilberstein1
Loading…
make mid_train script work even with a tiny number of iterations
improvement
#309
opened Nov 19, 2025 by
ericsilberstein1
Loading…
rename checkpoint_dir to checkpoints_dir
refactor
suggest/merge
#308
opened Nov 19, 2025 by
ericsilberstein1
Loading…
fixing two typos in comments
docs
Improvements or additions to documentation
#307
opened Nov 19, 2025 by
ericsilberstein1
Loading…
change test/train split approach to fix bug in spelling bee task
potential_bug
Needs investigation/confirmation whether or not it's a bug
#306
opened Nov 19, 2025 by
ericsilberstein1
Loading…
stats: include model_tag and model_step in /stats
feature
New feature or request
#296
opened Nov 16, 2025 by
intently
Loading…
Save checkpoint before possible OOM in CORE metric/Inference
improvement
#295
opened Nov 16, 2025 by
nitishpandey04
Loading…
feat: Add TorchScript/ONNX export support for cross-language inference
feature
New feature or request
#275
opened Nov 10, 2025 by
dhruvsoni365
Loading…
dataset downloader: add progress bar, essential info to stdout, and debug logs to New feature or request
logs/ (no functional change)
feature
#258
opened Nov 7, 2025 by
h3nock
Loading…
fix(common): avoid destroying non-initialized DDP process group
potential_bug
Needs investigation/confirmation whether or not it's a bug
#256
opened Nov 6, 2025 by
dipeshbabu
Loading…
Add centralized wandb initialization utility to reduce code duplication
refactor
#253
opened Nov 5, 2025 by
SermetPekin
Loading…
feat(engine.py): Sample unique initial tokens for each sequence in a batch
improvement
#201
opened Oct 29, 2025 by
azekowka
Loading…
Faster Regex pattern parsing in C
feature
New feature or request
suggest/feedback
#161
opened Oct 23, 2025 by
MadMax129
Loading…
Improve configurator: add testable parse_args() and ConfigManager class
refactor
#159
opened Oct 23, 2025 by
SermetPekin
Loading…
Multi platform CI - Github Workflow tests v3
tests
todo
#151
opened Oct 22, 2025 by
SermetPekin
Loading…
Fix mfu statically keyed to h100 max tflops
improvement
#147
opened Oct 22, 2025 by
Qubitium
Loading…
Enabled markdown rendering for response by nanochat in ui.html
todo
#141
opened Oct 21, 2025 by
Goderr
Loading…
add optional chunked cross-entropy for memory-efficient training
feature
New feature or request
#128
opened Oct 20, 2025 by
mnehete32
Loading…
Enable Proxy/Gateway Compatibility via Uvicorn's --root-path
improvement
#93
opened Oct 17, 2025 by
wpybtw
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-10-19.