AgileRL lands a £6m seed round led by Fusion Fund to speed up reinforcement learning for training AI models

AgileRL, a startup accelerating the development of Reinforcement Learning for training AI models, today announces their seed, led by Fusion Fund, along with Flying Fish, Octopus Ventures, Entrepreneur First, and Counterview Capital, bringing their total funding to £6 million.

Its platform reduces the time and cost of RL by 10x with performant end-to-end tooling. AgileRL plans to use the funding to open a San Francisco office and hire more than a dozen roles across engineering and go-to-market.

Building an RL program in-house means effectively creating a small AI research lab. You need teams of expensive PhDs, months of trial runs, and big compute budgets. Companies must assemble everything from scratch each time, including simulators, reward design, data collection, hyperparameter search, distributed training, evaluation suites, monitoring, safety guardrails, and deployment pipelines. Every new use case breaks the old setup and starts the work over. It's slow, expensive, and brittle, and all but the largest technology companies can afford to effectively do it at scale.

AgileRL offers both a free open-source RL platform and Arena, a managed full-stack RLOps platform that handles all the difficult engineering work. Its approach enables training on-policy, off-policy, offline, multi-agent, contextual multi-armed bandits and large language models, alongside evolutionary hyperparameter optimisation, distributed training with multi-GPU support, environment validation, and one-click deployment. The result is streamlined development, a 10x improvement in training speed, and superior AI model performance compared to standard approaches, supported by a community shaped by academic citations and more than 300,000 downloads.

The company's framework is already being used by labs at institutions including MIT, Roblox, Carnegie Mellon and University of Waterloo, for applications that span defence, robotics, finance, and others.

Having built a reinforcement learning system from scratch at my last company, I saw firsthand how costly and complex it is. No company in 2026 should need a full AI research lab just to use RL. Our goal is to make reinforcement learning a standard tool in every company's tech stack.
Param Kumar, Co-founder & CEO

Reinforcement learning remains the gold standard of AI training, yet very few companies actually have the resources to implement it in house. As hundreds of thousands of companies continue to invest in AI, AgileRL will undoubtedly play a key role in powering this revolution.
Lu Zhang, Founding Partner at Fusion Fund

Arena has significantly streamlined our RL development workflow, making training and deploying agents a breeze. The platform's hyperparameter tuning capabilities have dramatically accelerated our experimentation and improved the performance of our models.
Andrew Nestor, Machine Learning Engineer at Decision Lab

AgileRL lands a £6m seed round led by Fusion Fund to speed up reinforcement learning for training AI models

Similar articles

Undo raises £28m in funding from Elsewhere Partners to help AI agents find the root cause of bugs in complex codebases

Dessn raises £4.5m led by Connect Ventures for product design and prototyping inside live codebases

ManaMind secures a £1.1m pre-seed round led by SVV to develop autonomous QA agents for gaming