A reinforcement learning framework that learns adaptive retrieval strategies for healthcare prior authorization decisions. Instead of fixed top-K retrieval, an RL agent sequentially decides which ...
[2026-03-07] Added AgileX (PiPER/PiPER-X) support for real-world RL. [2026-02-26] First SO101 real-world RL baseline and reproducible CLI workflow are released.