Localizing Reasoning Training-Induced Changes in Large Language Models

Published in Mechanistic Interpretability Workshop at Neural Information Processing Systems (NeurIPS), 2025