verl: Volcano Engine Reinforcement Learning for LLMs - GitHub. verl is a flexible, efficient, and production-ready RL training library for large language models (LLMs). It is the open-source version of the framework described in the paper "HybridFlow: A Flexible and Efficient RLHF Framework".
Welcome to verl’s documentation! - verl documentation. verl is a flexible, efficient, and production-ready RL training framework designed for post-training large language models (LLMs). It is an open-source implementation of the HybridFlow paper.