>>107065138
If you use a higher alpha, of course you should decrease the learning rate proportionally. The LoRA update is scaled by alpha/r, so alpha multiplies the effective step size, and you can't change just alpha. Picrel from the QLoRA paper (https://arxiv.org/pdf/2305.14314).
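
Rough sketch of what I mean, as a rule of thumb rather than anything exact: keep alpha * lr constant when you bump alpha. The function names and the numbers here (r=64, alpha=16, lr=2e-4) are made up for illustration, not taken from the paper or from any library.

```python
def lora_scale(alpha: float, r: int) -> float:
    """Multiplier applied to the low-rank update: delta_W = (alpha / r) * B @ A."""
    return alpha / r


def rescale_lr(base_lr: float, base_alpha: float, new_alpha: float) -> float:
    """Hypothetical helper: shrink the learning rate by the same factor
    that alpha grew, so alpha * lr stays constant."""
    return base_lr * base_alpha / new_alpha


# Illustrative values only (assumptions, not recommendations).
r = 64
base_alpha, base_lr = 16, 2e-4
new_alpha = 32

new_lr = rescale_lr(base_lr, base_alpha, new_alpha)

print("old scale:", lora_scale(base_alpha, r), "old lr:", base_lr)
print("new scale:", lora_scale(new_alpha, r), "new lr:", new_lr)
```

Doubling alpha doubles the alpha/r multiplier, so the helper halves the learning rate. With adaptive optimizers like Adam the compensation is only approximate, so treat it as a starting point and still sweep the lr a bit.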