castform
docs
learn
blogs
careers
contact us
try playground
docs
learn
blogs
careers
learn
guides on reinforcement learning, fine-tuning, and the ideas behind modern, custom ai systems.
the state of continual learning
rl
agents
memory
Apr 10, 2026
reward hacking: when your ai aces the wrong test
rl
Apr 10, 2026
what is an rl environment?
rl
Apr 10, 2026
grpo explained: group relative policy optimization for llm finetuning
rl
Apr 9, 2026