Introducing `AutoRL` 📈 The world's simplest way to train a task-specific LLM with RL. *Just write a SENTENCE describing the model you want.* A chain of AI systems will generate data + rubrics and train a model for you. Powered by ART, it's open source. Link in thread:
@theRohitDas For this run, I spent $0 on the GPU, and 40 cents on OpenRouter credits for prompt generation, RULER ranking, etc.
140,3K