Popular repositories Loading
-
Distributed_QLoRA
Distributed_QLoRA Public用于微调预训练语言模型并评估其性能。它支持使用LoRA或QLoRA技术进行高效的微调,并提供评估模型性能的工具。
Python 2
-
S2R
S2R PublicForked from NineAbyss/S2R
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
