OPEN SOURCE
Gpt-oss Reinforcement Learning - Fastest inference now in Unsloth! (<15GB VRAM)
"Hey guys we've got lots of updates for Reinforcement Learning (RL)! We're excited to introduce gpt-oss, Vision, and even better RL in Unsloth. Our new gpt-oss RL inference also achieves the fastest token/s vs. any other implementation. Our GitHub: [https://github.com/unslothai/unsloth](https://githu..."
Reddit Discussion: 46 comments
BUZZING
Fine-tuning LLMs • Open-source AI models • Code generation use case
"You would need to construct how you're going to qualify success and the rewards."
• "Before RL, look into how to train a LoRA, and try that."