System Info
The default number of training steps for SmolVLA is 20k, but sometimes we want to train longer, e.g. on bigger datasets. However, the learning rate curve decays at the same rate regardless of the number of steps chosen, and the model stops learning after 30k steps.
Information
Reproduction
I trained SmolVLA with the steps parameter set to 120000. Here is my learning rate curve:

Expected behavior
I can't say exactly what the expected behavior should be. Maybe the learning rate decay should scale with the number of training steps?
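To illustrate the idea, here is a minimal sketch of a warmup plus cosine decay schedule written in plain Python. It is not the actual SmolVLA or LeRobot scheduler code. The function names, the peak and final learning rates, the warmup length, and the fixed 30k decay length are all assumptions chosen only to mimic the behavior described above. It compares a decay length that is fixed with one that is tied to the total number of training steps.

```python
import math


def cosine_decay_with_warmup(step, peak_lr, final_lr, warmup_steps, decay_steps):
    """Learning rate at `step`: linear warmup, then cosine decay to `final_lr`.

    Hypothetical helper for illustration; not a LeRobot API.
    """
    if step < warmup_steps:
        # Linear ramp from peak_lr / warmup_steps up to peak_lr.
        return peak_lr * (step + 1) / warmup_steps
    # Clamp so the rate stays flat at final_lr once decay_steps is reached.
    clamped = min(step, decay_steps)
    progress = (clamped - warmup_steps) / max(1, decay_steps - warmup_steps)
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))
    return final_lr + (peak_lr - final_lr) * cosine


# Illustrative values only (assumptions, not taken from any config).
TOTAL_STEPS = 120_000
PEAK_LR, FINAL_LR, WARMUP = 1e-4, 2.5e-6, 1_000


def fixed_schedule(step):
    # Decay length is independent of TOTAL_STEPS (assumed to be 30k here).
    return cosine_decay_with_warmup(step, PEAK_LR, FINAL_LR, WARMUP, decay_steps=30_000)


def scaled_schedule(step):
    # Decay length follows the requested number of training steps.
    return cosine_decay_with_warmup(step, PEAK_LR, FINAL_LR, WARMUP, decay_steps=TOTAL_STEPS)


if __name__ == "__main__":
    for step in (1_000, 30_000, 60_000, 120_000):
        print(f"step={step:>7}  fixed={fixed_schedule(step):.3e}  scaled={scaled_schedule(step):.3e}")
```

With the fixed decay length, the rate reaches its floor at step 30k and stays there for the remaining 90k steps. With the scaled variant, it reaches the floor only at the last step.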
@jadechoghari