Causal Evaluation of Planning Strategies in Large Language Models Through Interpretable Quality Prediction and Counterfactual Reinforcement Learning