Csatv2 contribution #2627

rwightman · 2025-12-09T22:45:23Z

Continuation of work in #2624 by @gusdlf93

…s where possible

HuggingFaceDocBuilderDev · 2025-12-09T22:49:40Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

rwightman · 2025-12-09T22:50:17Z

@gusdlf93 hey, I used claude to make some additional changes to fit timm norms a bit better, it did require remapping checkpoints though. I verified 80.024% accuracy remains.

Unfortunately the diff of the model got messed up (can't see what was changed) because your commit was a mix of CRLF and LF and it got cleaned to LF only which touched every line.

An interesting model for higher resolution.

rwightman · 2025-12-09T22:51:08Z

I may add a few more small things like grad checkpointing, and then I guess I'll push a remapped checkpoint to the timm org that references the original

…inal norm is 2d so we can disable pooling if desired. Still inconsistent line endings

…odules

gusdlf93 · 2025-12-10T10:14:21Z

Thanks a lot for taking over and polishing the implementation.
Let me know if you need any additional details about the training setup or checkpoints.

For reproducibility and detailed training recipes, I’ve documented everything in the Hugging Face model card:
Link : https://huggingface.co/Hyunil/CSATv2

rwightman · 2025-12-10T17:05:13Z

@gusdlf93 okay thanks, I'm probably not going to get a chance to merge this for a few more days, I feel it's in a good state but I have a few days off and wanted to check a few more small things.

…dynamic for other network shapes, allow drop path option for transformer blocks.

gusdlf93 and others added 9 commits December 9, 2025 14:45

Upload CSATv2

78d6b19

Update csatv2.py

c584304

Update csatv2.py

5dc2e6d

Add files via upload

f1de327

Add files via upload

24a884b

Delete csatv2.py

6462c73

Add files via upload

9e40d5a

Pass the Test

8d5e51e

Some rework of csatv2 to better fit timm norms, re-use existing layer…

dad1ca1

…s where possible

rwightman force-pushed the csatv2-gusdlf93 branch from b73a79c to dad1ca1 Compare December 9, 2025 22:46

rwightman added 4 commits December 9, 2025 15:34

Add grad_checkpointing. Remap head to use NormMlpClassifierHead and f…

b0cb744

…inal norm is 2d so we can disable pooling if desired. Still inconsistent line endings

Weird mix of camel/snake conv naming -> snake case

1c6b0f5

Consistent use of device/dtype factory kwargs through csatv2 and subm…

4706c8e

…odules

Make default out_indices use stage outputs and skip the dct out

6cec2f0

Another round of consistency changes for csatv2, make stage building …

b6eb61a

…dynamic for other network shapes, allow drop path option for transformer blocks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Csatv2 contribution #2627

Csatv2 contribution #2627

rwightman commented Dec 9, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 9, 2025

Uh oh!

rwightman commented Dec 9, 2025

Uh oh!

rwightman commented Dec 9, 2025 •

edited

Loading

Uh oh!

gusdlf93 commented Dec 10, 2025

Uh oh!

rwightman commented Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Csatv2 contribution #2627

Are you sure you want to change the base?

Csatv2 contribution #2627

Conversation

rwightman commented Dec 9, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 9, 2025

Uh oh!

rwightman commented Dec 9, 2025

Uh oh!

rwightman commented Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gusdlf93 commented Dec 10, 2025

Uh oh!

rwightman commented Dec 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rwightman commented Dec 9, 2025 •

edited

Loading