Show HN: RULER – Easily apply RL to any agent Hey HN, Kyle here, one of the co-founders of OpenPipe. Reinforcement learning is one of the be...
No comments:
Post a Comment