Skip to content
@OpenRLHF

OpenRLHF

Open-sourced Reinforcment Learning from Human Feedback

Pinned Loading

  1. OpenRLHF OpenRLHF Public

    An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

    Python 8.8k 858

  2. OpenRLHF-M OpenRLHF-M Public

    An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

    Python 154 8

  3. OpenRLHF-Docs OpenRLHF-Docs Public

    3 4

Repositories

Showing 3 of 3 repositories
  • OpenRLHF Public

    An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

    OpenRLHF/OpenRLHF’s past year of commit activity
    Python 8,842 Apache-2.0 858 289 24 Updated Jan 21, 2026
  • OpenRLHF-M Public

    An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

    OpenRLHF/OpenRLHF-M’s past year of commit activity
    Python 154 Apache-2.0 8 7 1 Updated Jan 5, 2026
  • OpenRLHF-Docs Public
    OpenRLHF/OpenRLHF-Docs’s past year of commit activity
    3 4 0 0 Updated Jan 5, 2026

Top languages

Loading…

Most used topics

Loading…