Causal Evaluation of Planning Strategies in Large Language Models Through Interpretable Quality Prediction and Counterfactual Reinforcement Learning. JAAIR. 2026;5(1). Accessed July 29, 2026. https://www.jaair.org/index.php/home/article/view/32