verl: Volcano Engine Reinforcement Learning for LLMs - GitHub. verl is a flexible, efficient, and production-ready RL training library for large language models (LLMs). It is the open-source version of the paper "HybridFlow: A Flexible and Efficient RLHF Framework".
Welcome to verl's documentation! - verl documentation. verl is a flexible, efficient, and production-ready RL training framework designed for post-training of large language models (LLMs). It is an open-source implementation of the HybridFlow paper.
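Verl: ByteDance's open-source RL training tool for LLMs, with efficient support for RLHF and multi-algorithm deployment. Verl is an open-source reinforcement learning (RL) training library initiated by the ByteDance Seed team and maintained jointly with the community, designed for large language models (LLMs). The project's core strengths are flexibility and ease of use, high performance, and production readiness. It integrates mainstream RL algorithms such as PPO, GRPO, and DAPO, works with training and inference frameworks such as FSDP, vLLM, and Hugging Face Transformers, and supports multimodal interaction, tool calling, long...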
Installation - verl documentation. The installation steps below are recommended configurations for the latest version of verl. If you are customizing your own environment, you can ignore the strict version constraints.