Blog

GRPO and evolutionary HPO

Combining GRPO with evolutionary hyperparameter optimization to squeeze the most out of small LLMs