Latest news & insights

Introducing the Arena Client: Reinforcement learning at scale, from your terminal

How we built a robust and scalable async-RL system that beats TRL and ART by 7x

How to pick an RL algorithm and reward system for multi-turn LLM training

Company updates

Bringing RL to the Enterprise: AgileRL Raises $7.5M to date

January 7, 2026

Combining GRPO with evolutionary HPO to squeeze the most out of small models

November 6, 2025

Breaking the Reinforcement Learning HPO Bottleneck

August 25, 2025

See Arena on your data, in your environment

Bring a sample environment or dataset and we will walk you through training, tuning, and deployment in a live session tailored to your use case.