ThinkRL Documentation

A modular, high-performance, reasoning-centric library for Reinforcement Learning from Human and AI Feedback (RLHF & RLAIF).

Made with love from India 🇮🇳 for the world

State-of-the-art algorithms for RLHF training

Step-by-step guides for every training scenario

PRM, STaR, LoRA, vLLM, and more

Deep dives into advanced functionality

Contribute, report issues, or star the repo

Quick Install

Install ThinkRL from source into a fresh virtual environment

# Clone the repository and enter it

git clone https://github.com/ellanorai/ThinkRL.git

cd ThinkRL

# Create and activate a virtual environment, then install in editable mode

python -m venv .venv

source .venv/bin/activate

pip install -e .

# Or use the make shortcut

make install

# Or install with all optional extras

pip install -e ".[all]"
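
After installing, it can help to confirm that the virtual environment is active and that the package is visible to the interpreter. The following is a minimal sketch using only the Python standard library. The import name `thinkrl` is an assumption (it is not stated in this page), and `in_virtualenv` and `is_importable` are hypothetical helper names introduced for illustration.

```python
import importlib.util
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter runs inside a venv-style environment."""
    # In a venv, sys.prefix points at the environment and sys.base_prefix
    # points at the interpreter the environment was created from.
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


def is_importable(module_name: str) -> bool:
    """Return True if a top-level module can be located, without importing it."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # "thinkrl" is an assumed import name; adjust it if the package differs.
    print("virtualenv active:", in_virtualenv())
    print("thinkrl importable:", is_importable("thinkrl"))
```

Run it with the same interpreter used for `pip install -e .`. If the second line reports `False`, the install most likely went into a different environment than the one currently active.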