Arena unifies training, tuning, and deployment, allowing teams to ship production agents faster.
Trusted by leading teams in research, gaming, finance, robotics, and logistics.

Train and deploy reinforcement learning agents faster than ever with Arena's SOTA tooling.


Evolutionary hyperparameter optimization and distributed training on any single- or multi-agent task.

Fine-tune LLMs using evolutionary HPO and one-click deployment. Train custom models on your data without infrastructure hassle.
Validate your dataset or environment before training to ensure everything runs as intended.

RL stacks sprawled across notebooks, scripts, and bespoke infrastructure
Hyperparameter search is slow and ad hoc
Environment mismatches derail runs late in the pipeline
Shipping trained policies into production is brittle and manual

A single platform to train, evaluate, and deploy
Built-in evolutionary tuning to converge faster with better results
Pre-flight environment validation before you spend compute
Hosted, one-click deployment for live inference at scale

Upload your LLM dataset or Python environment, then validate it before training begins so runs don't fail hours in.
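To make the first step concrete, here is a minimal sketch of the kind of custom Python environment you might upload, written against the standard Gymnasium interface. The GridNav class and its reward logic are hypothetical placeholders, not a required Arena schema.

```python
import gymnasium as gym
import numpy as np
from gymnasium import spaces


class GridNav(gym.Env):
    """Hypothetical single-agent task: move a point toward a goal on a 1D line."""

    def __init__(self, size: float = 10.0):
        super().__init__()
        self.size = size
        # Observation: current position and goal position.
        self.observation_space = spaces.Box(low=0.0, high=size, shape=(2,), dtype=np.float32)
        # Action: step left (0) or right (1).
        self.action_space = spaces.Discrete(2)

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        self.pos = self.np_random.uniform(0.0, self.size)
        self.goal = self.np_random.uniform(0.0, self.size)
        return np.array([self.pos, self.goal], dtype=np.float32), {}

    def step(self, action):
        self.pos = np.clip(self.pos + (1.0 if action == 1 else -1.0), 0.0, self.size)
        distance = abs(self.pos - self.goal)
        terminated = bool(distance < 0.5)       # reached the goal
        reward = 1.0 if terminated else -0.01   # small step penalty encourages short paths
        return np.array([self.pos, self.goal], dtype=np.float32), reward, terminated, False, {}
```

Running gymnasium.utils.env_checker.check_env(GridNav()) locally catches interface mistakes early, in the same spirit as Arena's pre-flight validation.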

Select algorithms, rewards, constraints, and objectives; enable evolutionary tuning to explore promising configurations automatically.
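For intuition, evolutionary tuning works roughly like the sketch below: train a population of configurations briefly, keep the elite performers, and mutate them into the next generation. The search space, mutation scheme, and function names here are illustrative assumptions, not Arena's actual implementation.

```python
import random

# Hypothetical search space; Arena exposes its own configuration options.
SEARCH_SPACE = {"lr": (1e-5, 1e-2), "gamma": (0.9, 0.999), "batch_size": [32, 64, 128, 256]}


def sample_config():
    return {
        "lr": 10 ** random.uniform(-5, -2),                 # log-uniform learning rate
        "gamma": random.uniform(*SEARCH_SPACE["gamma"]),
        "batch_size": random.choice(SEARCH_SPACE["batch_size"]),
    }


def mutate(config):
    child = dict(config)
    key = random.choice(list(child))
    if key == "batch_size":
        child[key] = random.choice(SEARCH_SPACE["batch_size"])
    else:
        lo, hi = SEARCH_SPACE[key]
        # Perturb continuous values and clamp to the search space bounds.
        child[key] = min(max(child[key] * random.uniform(0.8, 1.25), lo), hi)
    return child


def evolve(evaluate, population_size=8, elite=2, generations=5):
    """evaluate(config) -> fitness, e.g. mean episodic return after a short training run."""
    population = [sample_config() for _ in range(population_size)]
    for _ in range(generations):
        scored = sorted(population, key=evaluate, reverse=True)
        elites = scored[:elite]                             # keep the best configs
        population = elites + [mutate(random.choice(elites))
                               for _ in range(population_size - elite)]
    return max(population, key=evaluate)
```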

Distribute across available GPUs/instances; monitor metrics, sample efficiency, and checkpoints in real time.
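As a rough picture of what distributing across GPUs involves if done by hand, here is a plain PyTorch DDP loop with a stand-in network and loss. Arena manages this orchestration for you; nothing below reflects its internal code.

```python
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

# Launch with: torchrun --nproc_per_node=<num_gpus> train.py
def main():
    dist.init_process_group("nccl")
    rank = dist.get_rank()
    device = torch.device(f"cuda:{rank % torch.cuda.device_count()}")

    model = DDP(torch.nn.Linear(64, 4).to(device), device_ids=[device.index])  # stand-in policy network
    optimizer = torch.optim.Adam(model.parameters(), lr=3e-4)

    for step in range(1000):
        obs = torch.randn(32, 64, device=device)    # stand-in batch of observations
        loss = model(obs).pow(2).mean()             # stand-in RL loss
        optimizer.zero_grad()
        loss.backward()                             # gradients are all-reduced across GPUs
        optimizer.step()
        if rank == 0 and step % 100 == 0:
            print(f"step={step} loss={loss.item():.4f}")  # report metrics from rank 0

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```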

One-click promote to production; track performance, roll back, or iterate with easy-to-buy training credits.
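After promotion, a deployed policy is typically queried over HTTP. A minimal sketch follows; the endpoint URL, payload shape, and auth header are hypothetical placeholders, so consult Arena's documentation for the real inference API.

```python
import requests

# Hypothetical endpoint and schema, for illustration only.
ENDPOINT = "https://example.com/v1/agents/my-agent/act"
headers = {"Authorization": "Bearer <YOUR_API_KEY>"}

observation = [0.12, -0.4, 0.88, 0.0]  # current environment state
response = requests.post(ENDPOINT, json={"observation": observation}, headers=headers, timeout=10)
response.raise_for_status()
print(response.json())  # e.g. {"action": 2}
```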
Enabling breakthrough results in training AI agents for complex aerial interception missions with RTDynamics
READ CASE STUDY

Substantially cutting compute expenses and boosting training speed for RL workflows with Warburg AI
READ CASE STUDY

Dramatically increasing utilisation and reducing training time for complex bin-packing with Decision Lab
READ CASE STUDY
Python-first, compatible with custom environments

Single- and multi-agent support across on-policy, off-policy, offline RL, and bandits

Works with your cloud compute and scales to multi-GPU

Open-source framework with docs, examples, and community support

Used by leading research labs and institutions
Downloads from the community
Top up training credits anytime; no plan change required. Get credits.

Built-in evolutionary HPO for smarter, faster training.

One-click deployment from experiment to production.

Distributed training at scale with multi-GPU support.

Reinforcement fine-tuning for LLMs.
Bring a sample environment or dataset, and we will walk you through training, tuning, and deployment in a live session tailored to your use case.
Book a demo