Where Computation Lives Inside TabPFN: Causal Localisation of Attention Head Function

June 11, 2026 · Grace Period · 🏛 ICML 2026

Authors Atharva Gupta, Dhruv Kumar, Murari Mandal, Saurabh Deshpande arXiv ID 2606.12917 Category cs.LG: Machine Learning Citations 0 Venue ICML 2026

Abstract

We present the first causal mechanistic analysis of a tabular foundation model, investigating how TabPFN 2.5's feature wise attention heads distribute computation across layers. Using activation patching, ablation, and attention entropy across two synthetic regression datasets, we find clear temporal specialisation: one head's causal necessity dominates that of the others by 2 to 5 times at peak layer, with its dominant layer shifting across tasks of different complexity, while the remaining heads exhibit symmetric late layer profiles. Attention entropy and patching provide convergent evidence for the computationally active layers of the dominant head. We additionally investigate inference time steerability via contrastive activation steering, which fails to transfer across samples. We attribute this result to TabPFN's in context learning mechanism, which encodes task structure through context dependent attention rather than the stable parametric directions that make steering tractable in language models.