How to set the learning rate using WSD? As mentioned in MiniCPM
How to set the learning rate using WSD? As mentioned in MiniCPM